Official
Bact
100%
ML Ensemble
Bact
98%
ID c9a24d96... c9a24d96-622e-44a5-9c49-8581c0dfaa48
Primary Structure
Secondary Structure
Tertiary Structure
Domains
Evolutionary
Genomic Context
Sequence Preview (hover for details)
MSTKDFNLDLVSVSKKDSGASPRITSISLCTPGCKTGALMGCNMKTATCNCSIHVSK
Type:
Class:
MW:
Charge:
In sequence:
Isoelectric Point
8.99
pH
Isoelectric Point (pI)
The pH at which the protein carries no net electrical charge. At this pH, the protein is least soluble and most likely to precipitate. Important for purification techniques like isoelectric focusing.
Click to flip back
GRAVY
-0.011
Grand Average of Hydropathy
GRAVY Score
Measures overall hydrophobicity. Negative values indicate hydrophilic (water-loving) proteins, typically soluble. Positive values suggest hydrophobic proteins, often membrane-associated.
Click to flip back
Instability Index
28.89
Stable
Instability Index
Predicts protein stability in vitro. Values below 40 suggest a stable protein with a longer half-life. Values above 40 indicate potential instability and faster degradation.
Click to flip back
Aliphatic Index
68.42
Thermostability indicator
Aliphatic Index
Measures the relative volume occupied by aliphatic side chains (Ala, Val, Ile, Leu). Higher values indicate greater thermostability, as aliphatic residues stabilize the hydrophobic core.
Click to flip back
Hydrophobicity Profile Kyte-Doolittle Scale
Amino Acid Composition
Physiochemical Ratios
Secondary Structure Fractions
α-Helix Content
0.0%
0 residues
Alpha Helices
0 helix segments found. Average length: 0.0 residues. Longest helix: 0 aa.
Click to flip back
β-Sheet Content
36.8%
21 residues
Beta Sheets
3 sheet segments found. Average length: 7.0 residues. Longest strand: 8 aa.
Click to flip back
Coil Content
63.2%
36 residues
Coil/Loop Regions
Unstructured regions connecting secondary structure elements. Often involved in protein flexibility and binding interactions.
Click to flip back
Prediction Confidence
57.9%
High confidence
Confidence Metrics
Helix: 0% avg | Sheet: 70% avg | Coil: 77% avg
Click to flip back
Secondary Structure Prediction s4pred
C C C C C C C C E E E E E E C C C C C C C C E E E E E E E E C C C C C C C C C C C C C C C C C C E E E E E E E C C
H = α-Helix E = β-Sheet C = Coil/Loop Opacity = Confidence
Prediction Confidence Profile
α-Helix Segments (0)
β-Sheet Segments (3)
E1 8-13 6 aa 57%
DLVSVS
E2 22-29 8 aa 80%
RITSISLC
E3 48-54 7 aa 70%
CNCSIHV
Structural Motifs
0 β-Hairpins
0 Helix-Turn-Helix
0 Helix-Hinge-Helix
0 Proline Kinks
Amphipathic Helix Analysis
0 Amphipathic Helices
0.0% Sequence Coverage
0.00 Max Hydrophobic Moment
Structure Transitions
Total Transitions: 6
Transition Density: 10.53 per 100 aa
Largest Contiguous: 18 residues
HE 0
HC 0
EH 0
EC 3
CH 0
CE 3
Average pLDDT
46.0
Model confidence score
pLDDT Score
Predicted Local Distance Difference Test (pLDDT) measures AlphaFold's per-residue confidence. Scores >90 indicate high confidence, 70-90 moderate, 50-70 low, <50 very low.
Click to flip back
pTM Score
0.128
Template modeling score
pTM Score
Predicted Template Modeling score estimates overall model quality. Values >0.5 suggest similar topology to the true structure, >0.8 indicates high accuracy.
Click to flip back
High Confidence
0.0%
0 of 57 residues
High Confidence Regions
Percentage of residues with pLDDT >70. These regions are predicted with high accuracy and are suitable for structural analysis and downstream applications.
Click to flip back
Low Confidence
84.2%
48 of 57 residues
Low Confidence Regions
Percentage of residues with pLDDT <50. Often correspond to intrinsically disordered regions, flexible loops, or areas lacking homologous structures in training data.
Click to flip back
Radius of Gyration
15.7 Å
Structural compactness
Radius of Gyration
Measures the compactness of the protein structure. Lower values indicate a more compact, globular structure while higher values suggest extended or elongated conformations.
Click to flip back
Contact Density
0.1065
340 contacts
Contact Density
Number of residue-residue contacts normalized by sequence length. Higher values indicate well-packed structures with extensive internal contacts and stable cores.
Click to flip back
Mean PAE
18.5 Å
Position alignment error
Predicted Aligned Error
Mean Predicted Aligned Error estimates the accuracy of relative domain positions. Lower values indicate higher confidence in the relative positions of residue pairs.
Click to flip back
Predicted Domains
2
From PAE analysis
Domain Count
Number of structural domains predicted from PAE matrix clustering. Domains show lower PAE within regions than between them, indicating independent folding units.
Click to flip back
Per-Residue pLDDT Profile
AlphaFold Model Comparison
Confidence Category Distribution
High Confidence Regions (pLDDT >70)
Region Start End Length
Low Confidence Regions (pLDDT <50)
Region Start End Length
Region 1 1 2 2
Region 2 7 7 1
Region 3 9 10 2
Region 4 12 12 1
Region 5 14 21 8
Region 6 23 43 21
Region 7 45 57 13
Best Model
model_3_ptm_pred_0
pLDDT: 46.5
Best Model
The top-ranked AlphaFold model based on average pLDDT score. This model is used for all downstream structural analysis and visualization.
Click to flip back
Disulfide Bonds
2
Predicted S-S bonds
Disulfide Bonds
Number of predicted disulfide bonds based on cysteine proximity (<2.5Å between sulfur atoms). Important for protein stability and tertiary structure.
Click to flip back
Relaxation
Successful
Max violation: 0.00 Å
Structure Relaxation
Indicates whether AMBER relaxation was successful. Relaxation refines the structure to minimize steric clashes and improve local geometry.
Click to flip back
Domain Coverage
89.5%
51 of 57 residues
Coverage Analysis
N-terminal: 100.0% | Core: 100.0% | C-terminal: 45.5%
Click to flip back
Domains Found
5
from 3 databases
Database Sources
Pfam, NCBIFAM, PRINTS
Click to flip back
Best E-value
8e-18
TIGR03731
Statistical Significance
1 highly significant (E<1e-20), 1 moderate, 0 weak
Click to flip back
InterPro Annotations
1
functional annotations
InterPro IDs
IPR006079
Click to flip back
Domain Architecture (click domains to view details)
Pfam NCBIFAM PRINTS
Domain Annotations (5)
PRINTS 5-22 18 aa
PR00324
-
IPR006079: Lantibiotic, Type A, Bacillales-type
PRINTS 26-37 12 aa
PR00324
-
IPR006079: Lantibiotic, Type A, Bacillales-type
PRINTS 38-51 14 aa
PR00324
-
IPR006079: Lantibiotic, Type A, Bacillales-type
Pfam 1-43 43 aa E=1.2E-7
PF02052
Gallidermin
IPR006079: Lantibiotic, Type A, Bacillales-type
NCBIFAM 1-50 50 aa E=7.8E-18
TIGR03731
gallidermin/nisin family lantibiotic
IPR006079: Lantibiotic, Type A, Bacillales-type
Coverage Analysis
Covered: 51 aa Uncovered: 6 aa
N-terminal (1-11) 100.0%
Core (11-45) 100.0%
C-terminal (45-57) 45.5%
Unannotated Regions
23-25 3 aa
Domain Size Statistics
Longest: 50 aa
Shortest: 12 aa
Average: 27 aa
Overlapping: 7 domains
BLAST Hits
11
homologs found
What This Means
11 similar proteins found in SwissProt. More homologs = well-characterized family. Few hits may mean a novel or highly divergent protein.
Click to flip back
Best E-value
3e-57
most significant hit
What Is E-value?
Estimates how many matches you'd find by chance. Lower = better. Values near zero mean the match is real, not random.
Click to flip back
Max Identity
100.0%
sequence identity
What Is Identity?
Percentage of identical amino acids between aligned proteins. Above 30% = shared origin. Above 90% = very close relatives.
Click to flip back
Best Bitscore
183.0
alignment quality
What Is Bitscore?
Normalized alignment quality score, independent of database size. Higher = better match. Unlike E-value, bitscores are directly comparable across searches.
Click to flip back
Closest Homolog
P29559.1 100.0% identity
RecName: Full=Lantibiotic nisin-Z; Flags: Precursor [Lactococcus lactis subsp. lactis]
E-value: 2.83e-57 Bitscore: 183.0
Most Distant Homolog
Q2JTJ1.1 56.0% identity
RecName: Full=Holliday junction branch migration complex subunit RuvB 2 [Synechococcus sp. JA-3-3Ab]
E-value: 5.60 Bitscore: 29.9
BLAST Hit Distribution
BLAST Hits (11) (click accession to view in UniProt)
Accession Identity E-value Bitscore Coverage Description
P29559.1 100.0% 2.83e-57 183.0 1-57 RecName: Full=Lantibiotic nisin-Z; Flags: Precursor [Lactococcus lactis subsp. l...
P13068.1 98.2% 6.01e-56 180.0 1-57 RecName: Full=Lantibiotic nisin-A; Flags: Precursor [Lactococcus lactis subsp. l...
Q2QBT0.1 71.2% 5.39e-27 105.0 1-51 RecName: Full=Lantibiotic nisin-U; Flags: Precursor [Streptococcus uberis]
P10946.1 57.1% 2.53e-14 71.5 1-57 RecName: Full=Lantibiotic subtilin; Flags: Precursor [Bacillus subtilis]
P21838.2 63.9% 6.14e-07 50.3 6-37 RecName: Full=Lantibiotic gallidermin; Flags: Precursor [Staphylococcus gallinar...
P08136.1 63.9% 1.20e-06 49.4 6-37 RecName: Full=Lantibiotic epidermin; Flags: Precursor [Staphylococcus epidermidi...
C0XTM5.1 50.0% 8.47e-06 46.9 1-48 RecName: Full=Lantibiotic flavucin; Flags: Precursor [Corynebacterium lipophilof...
P0C0H8.1 56.7% 0.07 35.0 4-33 RecName: Full=Lantibiotic streptin; Flags: Precursor [Streptococcus pyogenes]
Q8K031.2 61.9% 0.67 32.9 6-26 RecName: Full=StAR-related lipid transfer protein 8; AltName: Full=START domain-...
Q92502.2 61.9% 1.20 32.0 6-26 RecName: Full=StAR-related lipid transfer protein 8; AltName: Full=Deleted in li...
Q2JTJ1.1 56.0% 5.60 29.9 5-29 RecName: Full=Holliday junction branch migration complex subunit RuvB 2 [Synecho...
Gene Neighborhood Browser
CP059049.1 · Click genes for details
BGC Cluster
Cluster ID bgc_006
Classification Lanthipeptide_class_I
Classification Formula (PF05147) AND (PF04738|PF14028)
Cluster Span 24,483 bp · 21 genes
Completeness 85%
Neighborhood Summary
Target Protein c9a24d96-622e-44a5-9c49-8581c0dfaa48
Role in Cluster marker
Position 591990 - 592163 bp
Gene # in Cluster #8 · 4,801 bp from center
Upstream Genes 9
Downstream Genes 7
GC Content Analysis
Local GC 31.2%
Genome Average 35.0%
Deviation -3.9%
Cluster Genes (21)
Show gene table

Computational Analysis of Protein
c9a24d96-622e-44a5-9c49-8581c0dfaa48

MSTKDFNLDLVSVSKKDSGASPRITSISLCTPGCKTGALMGCNMKTATCNCSIHVSK

Executive Summary

The protein sequence under analysis (ID: c9a24d96-622e-44a5-9c49-8581c0dfaa48) is identified with 100% confidence as the precursor of Nisin Z, a canonical Class I lanthipeptide (lantibiotic) produced by Lactococcus lactis. This classification is supported by a perfect 100% sequence identity to the characterized Nisin Z precursor (UniProt P29559), the presence of the specific "Gallidermin/Nisin" domain (PF02052), and its location within a complete biosynthetic gene cluster containing all necessary modification, transport, and immunity genes. Notably, Nisin Z differs from Nisin A by a single amino acid substitution (His27Asn), which is present in this sequence [LIT14].

The protein exhibits the classic bipartite architecture of a Class I bacteriocin precursor, consisting of an N-terminal leader peptide (residues 1-23) containing the conserved "FNLD" box motif required for enzyme recognition [LIT7], and a C-terminal core peptide (residues 24-57) that undergoes extensive post-translational modification. The core peptide is notably rich in serine, threonine, and cysteine residues, which serve as the substrates for the dehydratase NisB and cyclase NisC found in the genomic neighborhood [LIT13]. These enzymes catalyze the formation of dehydroalanine/dehydrobutyrine and subsequent thioether bridges, known as lanthionine rings, transforming the linear ribosomally synthesized peptide into a rigid, polycyclic antimicrobial structure [LIT12]. Furthermore, the low AlphaFold pLDDT scores, which average 46.0, and the predicted coil structure accurately reflect the flexible nature of the unmodified precursor prior to these critical ring-forming reactions [AF9].

Synthesizing all available evidence, this protein is definitively classified as a Canonical Bacteriocin. Its genomic context is textbook-perfect, featuring the structural gene (nisZ) immediately upstream of the dehydratase (nisB) and cyclase (nisC), followed by the transporter (nisT) and immunity genes (nisI), thereby confirming its role within a functional nis operon [LIT6]. The biological function of the mature Nisin Z is well-established; it binds to the cell wall precursor Lipid II with high affinity using its N-terminal rings (A and B) and subsequently inserts into the membrane to form stable pores, causing rapid cell death in Gram-positive bacteria [LIT1]. This dual mechanism of cell wall inhibition and membrane permeabilization makes it a potent weapon in bacterial competition.

Primary Sequence Analysis

Amino Acid Composition

The protein is 57 amino acids in length with a molecular weight of approximately 5.94 kDa, which is consistent with the expected size of a Class I bacteriocin precursor that typically contains fewer than 60 residues [P3]. The theoretical isoelectric point (pI) is 8.99, indicating a net positive charge of +3.1 at physiological pH [P16]. This cationic character is a hallmark of antimicrobial peptides, as it facilitates electrostatic attraction to the negatively charged bacterial cell envelope [P30]. The composition is notably enriched in serine (15.8%) and threonine (10.5%), which is functionally critical because these residues serve as the precursors for the dehydration reactions catalyzed by NisB to form dehydroalanine (Dha) and dehydrobutyrine (Dhb), the reactive species for lanthionine ring formation [LIT13].

Furthermore, the protein contains 5 cysteine residues (8.8%), an exceptionally high proportion for such a short peptide. In the context of lanthipeptides, these cysteines do not form typical disulfide bonds but instead act as nucleophiles that attack the dehydrated Ser/Thr residues to form thioether bridges, specifically lanthionine and methyllanthionine [LIT16]. The instability index is calculated at 28.9, classifying the peptide as stable in vitro [P21], although in vivo, the leader peptide is rapidly cleaved after export. Additionally, the GRAVY score of -0.011 suggests a balanced hydropathy, which is characteristic of an amphipathic peptide that must remain soluble in the cytosol yet be capable of membrane insertion after modification [P4]. The relatively high lysine content of 10.5% further contributes to the cationic charge essential for the initial interaction with the anionic phospholipid headgroups of target membranes [LIT1].

Property Value Interpretation
Sequence Length 57 aa Small protein
Molecular Weight 5939.91 Da 5.94 kDa
Isoelectric Point (pI) 8.99 Basic protein
Instability Index 28.89 Stable (≤40 stable)
Aliphatic Index 68.42 Low thermostability
GRAVY -0.011 Hydrophilic

Table 1. Physicochemical properties of protein c9a24d96-622e-44a5-9c49-8581c0dfaa48.

Dipeptide Analysis

The dipeptide analysis reveals patterns consistent with the specific structural requirements of the Nisin Z precursor. The most frequent dipeptides involve serine and threonine, such as "VS", "GS", and "AS", which appear three times each (5.3%). This distribution is non-random and aligns with the spacing required for the formation of the five lanthionine rings (A, B, C, D, and E) characteristic of nisin [LIT16]. Additionally, the sequence contains the "FNLD" motif at residues 6-9, which is a highly conserved recognition site in the leader peptide that directs the biosynthetic machinery, specifically NisB and NisC, to the precursor [LIT7].

The specific dipeptide "CN" (Cys-Asn) appears twice (3.6%), reflecting the ring structures of the C-terminal region. The presence of "GC" (Gly-Cys) and "PG" (Pro-Gly) motifs suggests regions of flexibility or turns, which are necessary to allow the peptide chain to fold back on itself during the cyclization process [P36]. Notably, the "MK" dipeptide at the N-terminus of the mature peptide or within the leader is typical of signal sequences, although it is processed to "IT" in Nisin Z. The leader peptide ends with the sequence "AS-PR", where proteolytic cleavage by NisP occurs between Arg-23 and Ile-24, liberating the active antimicrobial [LIT8]. The specific "PR" dipeptide at the cleavage site serves as a recognition motif for the serine protease NisP [LIT8].

Secondary Structure Characterization

Structural Elements

Secondary structure prediction by S4PRED indicates a predominantly coil-rich structure, comprising 63.2% coil, with a significant sheet propensity of 36.8% and no predicted helices. This profile is consistent with the nature of a Class I bacteriocin precursor prior to modification. The predicted sheet segments, located at residues 8-13, 22-29, and 48-54, likely correspond to the structured regions of the leader peptide and the core peptide that will eventually form the rigid lanthionine rings [S1]. Specifically, residues 22-29 cover the N-terminus of the core peptide in the Ring A region, which forms a stable cage-like structure upon binding Lipid II [LIT1].

The absence of predicted α-helices is expected, as Nisin Z does not form a long amphipathic helix like Class II bacteriocins but rather adopts a "screw-shaped" conformation constrained by five thioether rings [LIT1]. The high coil content reflects the inherent flexibility of the linear precursor required to access the active sites of the modification enzymes NisB and NisC [LIT13]. Following modification, the peptide becomes significantly more rigid. The predicted "sheet" regions in the core may represent the extended backbone segments that are cross-linked by the thioether bridges, effectively mimicking a β-sheet-like constraint [S14].

Topology Analysis

The topology of the Nisin Z precursor is linear in its translated form but becomes complexly knotted in its mature form. While Pro-origami or similar topology diagrams for this precursor would show a linear chain—since thioether bonds are non-canonical post-translational modifications not typically represented in standard secondary structure maps—the biological topology is defined by five rings: Ring A (Lan), Ring B (Lan), Ring C (Lan), Ring D (MeLan), and Ring E (MeLan) [LIT16].

The connectivity involves Cys residues pairing with upstream dehydrated residues; for example, Cys29 in the full sequence pairs with Dehydroalanine at position 24 to form Ring A. This "lariat" topology is characteristic of Class I lantibiotics [E6]. The topology is further defined by the "hinge" region between Ring C and Ring D, which allows the C-terminal tail to articulate and penetrate the membrane [LIT4]. This specific topology differs fundamentally from the helix-bundle pore formers, such as colicins, or β-hairpin defensins, as it is a specialized "clamp and pierce" topology evolved specifically to target Lipid II [LIT5].

Functional Implications

The predicted structural elements, when considered in the context of post-translational modification, reveal the peptide's sophisticated killing mechanism. The flexible coil regions allow the precursor to thread through the NisB and NisC enzymes, where specific Ser/Thr residues are dehydrated and cyclized with Cys residues [LIT9]. Once modified, the N-terminal region, comprising Rings A and B, adopts a structure capable of high-affinity binding to the pyrophosphate moiety of Lipid II, a cell wall precursor [LIT1]. This binding event alone is sufficient to inhibit cell wall synthesis.

However, the mechanism is dual-action in nature. Following Lipid II binding, the C-terminal region, containing Rings C, D, and E, cationic residues, and the "hinge" region at residues 20-22 of the mature peptide (N-M-K), inserts into the bacterial membrane [LIT4]. The flexibility of this "hinge" is critical for the peptide to flip from a surface-bound orientation to a transmembrane orientation, thereby creating a pore [LIT4]. The S4PRED prediction of a coil in the middle of the core peptide aligns with this functional hinge. Furthermore, the cationic lysine residues identified in the composition analysis are positioned to interact with phospholipid headgroups, driving the initial electrostatic attraction [S10]. The lack of a stable helical fold in the precursor prevents premature pore formation in the producer cell, while the leader peptide, characterized by predicted sheet and coil structures, keeps the peptide inactive until export and cleavage [LIT7].

Domain & Family Architecture

Identified Domains

InterProScan analysis identifies three distinct and confirming signatures that establish the protein's identity. First, the PF02052 (Gallidermin) Pfam domain covers residues 1-43 with an E-value of 1.2e-7, serving as the defining family for Type A lantibiotics, including nisin, gallidermin, and epidermin [D2]. Second, the TIGR03731 (gallidermin/nisin family lantibiotic) TIGRFAM model covers residues 1-50 with an E-value of 7.8e-18, providing a highly specific HMM built to recognize the full precursor, including the leader peptide [D3]. Finally, a PR00324 (Nisin) PRINTS fingerprint match is specific to the nisin subfamily [D1].

The presence of these domains confirms that the protein is a Class I bacteriocin precursor. Although the "Gallidermin" domain name is historical, the profile encompasses the structural core of the nisin/epidermin family [LIT2]. No non-bacteriocin domains, such as kinases, peptidases, or metabolic enzymes, are present, which is consistent with its role as a dedicated antimicrobial peptide. The domain architecture is monolithic, representing a single functional unit evolved specifically for antimicrobial activity.

Protein Family Classification

The protein belongs to the Lantibiotic, Type A, Bacillales-type family (InterPro IPR006079), which includes strongly cationic, elongated peptides that kill bacteria via pore formation and cell wall inhibition [LIT2]. Specifically, it falls into the Nisin subfamily, which is distinct from the epidermin or subtilin subfamilies due to specific sequence motifs in the C-terminus, such as the "V-S-K" tail in Nisin Z compared to the "K" in Nisin A or other variations.

Evolutionarily, this family represents a specialized branch of Ribosomally Synthesized and Post-translationally Modified Peptides (RiPPs). The conservation of the leader peptide "FNLD" box across this family highlights a shared evolutionary mechanism for enzyme recruitment [LIT7]. While the core peptides diverge to target different bacterial receptors or membranes, the Nisin Z sequence remains highly conserved due to its specific interaction with the immutable pyrophosphate of Lipid II [LIT1]. This places the protein firmly within the Class I Lanthipeptide superfamily [E6].

Evolutionary & Homology Analysis

Sequence Homology

BLAST analysis returns a perfect 100% identity match to Lantibiotic nisin-Z (P29559) from Lactococcus lactis [E3]. Other significant hits include Nisin A (P13068) at 98.2% identity, which differs only by the H27N substitution characteristic of the Z variant [LIT14]. More distant homologs include Nisin U (Q2QBT0) from Streptococcus uberis with 71% identity and Subtilin (P10946) from Bacillus subtilis at 57% identity.

The homology pattern is textbook for a bacteriocin, characterized by high identity to itself and its variants, such as Nisin A, Z, Q, and F. It also shows moderate identity to functional analogs like Subtilin and Epidermin, reflecting a conserved structural scaffold in rings A, B, and C and the leader peptide motifs, while allowing for variation in the "hinge" and C-terminus to tune specificity [LIT11]. Furthermore, taxonomic restriction is evident as homologs are confined to the Firmicutes phylum, including Lactococcus, Streptococcus, Bacillus, and Staphylococcus, which is consistent with the narrow phylogenetic distribution typical of bacteriocins [E5]. The E-values, reaching as low as 10^-57, are extremely significant for such a short sequence and confirm true homology rather than chance similarity.

Deep Evolutionary Analysis

The evolutionary data reveals strong purifying selection acting on specific residues. The "conservation_per_position" scores show absolute conservation of 1.0 at positions corresponding to the processing site and key catalytic residues. Specifically, the FNLD box in the leader at residues 6-9 is highly conserved, confirming its critical role in recruiting the modification enzymes [LIT7]. In the core peptide, the Cysteine residues (C29, C33, C39, C44, C47) and the "hinge" region (N-M-K) also show high conservation scores.

Coevolution analysis detects strong signals with an MI greater than 4.0 between residue pairs, such as positions 17 and 56. These long-range co-evolutionary couplings might reflect the constraints imposed by the leader peptide on the C-terminus during the modification process or the interaction with the transporter NisT. The "leader_core" conservation profile—where the leader contains conserved motifs and the core contains conserved structural cysteines—is the definitive evolutionary signature of Class I bacteriocins [E6]. The absence of metal-binding motifs and the specific taxonomic distribution within Firmicutes further reinforce this classification.

Figure 2. Phylogenetic tree showing evolutionary relationships between the query protein and homologous sequences. Branch lengths indicate evolutionary distance. Interactive: hover over nodes to view sequence metadata.

Genomic Context Analysis

The protein is encoded within a highly organized and complete Class I Lanthipeptide biosynthetic gene cluster (BGC) located on contig CP059049.1. The gene arrangement is perfectly collinear with the canonical nis operon of Lactococcus lactis [LIT6]. Regarding core biosynthesis, the query gene (nisZ) is immediately followed 108 bp downstream by NisB, a 993 aa dehydratase (PF04738/PF14028) responsible for dehydrating serine and threonine residues in the precursor [LIT13]. Following NisB is NisT, which is implied by the ABC transporter domains, and NisC, located 4,895 bp downstream, which is the cyclase (PF05147) that forms the lanthionine rings [LIT9]. The presence of both NisB and NisC confirms this as a Class I lanthipeptide system, as Class II systems use a bifunctional LanM.

Transport and immunity are facilitated by a gene encoding an ABC transporter (Q48669) found 3,100 bp downstream, which is likely NisT for exporting the modified nisin [LIT9]. Further downstream at 6,136 bp is NisI (Q48671), a lipoprotein known to confer immunity by intercepting nisin at the cell surface [LIT9]. The presence of NisI is a critical indicator of a "weapon" system, as the producer requires protection from its own toxin. Processing is handled by NisP (Q48674), a serine protease (PF00082) found 6,875 bp downstream, which is responsible for cleaving the leader peptide from the exported precursor to activate the bacteriocin [LIT7].

The cluster concludes with a regulatory system consisting of NisR (response regulator, A0A0A7T356) and NisK (histidine kinase, Q48675). These form a two-component regulatory system that senses extracellular nisin for quorum sensing and upregulates the transcription of the nis operon, creating a positive feedback loop [LIT15]. In synthesis, the genomic context is unambiguous; the query protein is the central component of a fully equipped nisin biosynthetic factory where every gene required for modification, transport, regulation, immunity, and processing is present and in the correct order [BGC1][BGC5]. This context provides irrefutable evidence for the protein's identity and function.

Three-Dimensional Structure

AlphaFold Predictions

The AlphaFold model yields a low average pLDDT of 46.0, with per-residue scores ranging from 36.9 to 53.6. In the context of a lantibiotic precursor, this low confidence is expected and biologically accurate [AF9]. AlphaFold is trained on protein backbones stabilized by standard peptide bonds and non-covalent interactions; consequently, it cannot predict or model the formation of thioether (C-S-C) cross-links between cysteine and dehydrated serine/threonine residues.

As a result, AlphaFold predicts the unmodified Nisin Z precursor as a largely disordered coil. This disorder reflects the physical reality of the prepeptide before modification, as it must remain flexible to enter the catalytic sites of the dehydratase NisB and cyclase NisC [LIT13]. If AlphaFold had predicted a rigid helix or sheet with high confidence, it would be considered suspicious, as the unmodified peptide is not the stable, bioactive form. The pLDDT variance of 13.3 indicates that some regions, likely the leader peptide, have a slightly higher structural tendency than the core, which is consistent with the leader's role in enzyme recognition [AF8]. This low pLDDT is a recognized signature of RiPP precursors in AlphaFold [AF3].

Figure 3. Three-dimensional structure prediction generated by AlphaFold2. Interactive viewer allows rotation, zoom, and inspection of structural features.

Structure Visualization

The structural composites show a linear, extended peptide chain. Analysis of the molecular surface reveals it is predominantly hydrophilic, indicated by green regions, with patches of hydrophobicity shown in orange, which is consistent with the amphipathic nature required for membrane interaction [LIT1]. The charge view displays a distinct cationic character with blue patches, particularly in the C-terminal region, aligning with the high lysine content and the mechanism of electrostatic attraction to bacterial membranes [LIT16].

In the cartoon view, the backbone is rendered as a coil or loop structure and lacks defined α-helices or β-sheets, effectively visualizing the "unmodified" state. Biologically, one must mentally superimpose the five lanthionine rings that would constrain this backbone into a rigid, screw-like shape. The extended conformation seen here represents the "substrate" state recognized by the biosynthetic machinery rather than the "product" state that kills bacteria. Finally, the accessibility (ASA) view shows that most residues are solvent-exposed, a condition necessary for modification enzymes to access the side chains of Ser, Thr, and Cys [LIT7].

Structural Geometry

The Radius of Gyration (Rg) is 15.7 Å, and the maximum dimension is 45.4 Å. These values indicate an extended, non-globular structure [AF22]. For a 57-residue peptide, a globular fold would typically be more compact, with an Rg of approximately 10-12 Å. The extended geometry is consistent with the Type A lantibiotic "elongated" or "screw-shaped" morphology described in the literature [LIT1]. Even after ring formation, Nisin remains a relatively elongated molecule, approximately 50 Å long, to span the lipid bilayer thickness [LIT4].

The contact density is low at 0.11, reflecting the lack of long-range packing in the AlphaFold model [AF30]. In the mature native structure, the contact density would be significantly higher due to the five covalent thioether bridges constraining the fold. The current geometry represents the open chain, and the dimensions are sufficient to span a bacterial membrane, supporting the pore-forming mechanism where the peptide inserts perpendicular to the bilayer [LIT4].

Evidence Synthesis & Determination

Supporting Evidence

• The identification of this protein as the Nisin Z precursor is supported by several lines of evidence. First, sequence homology shows a perfect 100% match to the characterized Nisin Z precursor (P29559) [E3]. Second, the genomic context is definitive, as the protein resides in a complete nis operon alongside NisB, NisC, NisT, NisI, NisP, NisR, and NisK homologs [BGC1]. Third, domain analysis reveals specific "Gallidermin/Nisin" (PF02052) and "Nisin" fingerprint (PR00324) signatures [D2][D1]. Fourth, the sequence contains the conserved "FNLD" leader box [LIT7] and the specific Cys/Ser/Thr arrangement required for the formation of five lanthionine rings [LIT16]. Fifth, the physiochemical properties, including its cationic nature (pI 8.99) and small size (5.9 kDa), are typical of membrane-active bacteriocins [P16]. Sixth, the low AlphaFold pLDDT score is consistent with a linear precursor that requires post-translational modifications for proper folding [AF9]. Finally, extensive literature confirms Nisin Z as a potent pore-forming bacteriocin that targets Lipid II [LIT1][LIT4].

Contradicting Evidence

• There is virtually no contradicting evidence for the classification of this protein as a bacteriocin. While a naive interpretation might view the low AlphaFold confidence (pLDDT 46.0) as a sign of a non-folding or "junk" protein, this is expected in the context of RiPPs and actually supports the identification as a precursor requiring modification [AF9]. Similarly, the S4PRED prediction of β-sheets in the leader and core might seem at odds with the "coil" AlphaFold model, but leader peptides often adopt transient structures to interact with processing enzymes, and the core's sheet propensity likely reflects the extended conformation locked by rings in the native state [S14]. Finally, the lack of α-helices is not a contradiction but rather a specific feature of Type A lantibiotics, which differ from many other α-helical antimicrobial peptides [LIT2]. None of these points challenge the fundamental classification. (Source: Analysis, Weight: N/A)

Classification: Bacteriocin
Confidence: 100
The protein is 100% identical to the characterized Nisin Z precursor and is encoded within a complete, canonical nisin biosynthetic gene cluster containing all required modification and immunity genes. It contains specific Class I lantibiotic domains (PF02052) and motifs (FNLD leader, Cys-rich core) that define this family.

Existing External Knowledge

Nisin-Z is a naturally occurring variant of nisin-A, the first and most widely used lantibiotic in the food industry for the preservation of dairy products [E6]. It was first isolated from Lactococcus lactis subsp. lactis NIZO 22186 and differs from nisin-A by a single amino acid substitution at position 27 (Asn substituted for His) [E6]. This substitution enhances the solubility of the peptide at neutral pH compared to nisin-A, which is beneficial for certain food applications [S1]. Like its variants, nisin-Z is ribosomally synthesized as a 57-residue prepeptide (nisA/Z gene) and undergoes dehydration of Ser/Thr residues by NisB and cyclization by NisC to form five lanthionine rings [E6].

The mature 34-residue peptide acts by binding to the pyrophosphate moiety of Lipid II, the cell wall precursor, thereby inhibiting peptidoglycan synthesis [S20]. This binding also promotes the formation of stable, oligomeric pores (nisin:lipid II ratio of 8:4) that dissipate the proton motive force, leading to rapid bacterial death [E6]. Nisin is widely used as a food preservative (E234) due to its potent activity against Gram-positive spoilage organisms like Listeria monocytogenes and its safety for human consumption [AF27]. The operon structure, including the immunity protein NisI and the processing protease NisP, has been a model system for understanding RiPP biosynthesis and two-component regulation in lactic acid bacteria [BGC2][E6].

Limitations

The primary limitation of this analysis is the reliance on computational prediction for the 3D structure of the mature peptide. AlphaFold cannot model the thioether (lanthionine) rings that are the defining structural feature of the active molecule; therefore, the PDB model provided represents the linear, unmodified precursor, which is biologically transient. While this confirms the "substrate" state, it does not show the "functional" state of the pore-forming structure.

Additionally, while the genomic context is complete, the expression levels of these genes in the specific strain (CP059049.1) are not provided. The presence of the gene cluster implies potential for production, but actual expression is dictated by regulatory conditions, such as quorum sensing via NisR/K. Finally, while Nisin Z is a well-known bacteriocin, its activity against specific resistant strains or in complex matrices cannot be determined solely from sequence data.

References & Data Sources

[P1] Anfinsen, C. B. (1973). Principles that govern the folding of protein chains. Science, 181(4096), 223-230.

[P4] Kyte, J., & Doolittle, R. F. (1982). A simple method for displaying the hydropathic character of a protein. Journal of Molecular Biology, 157(1), 105-132.

[P6] ScienceDirect. Does Lack of Secondary Structure Imply Intrinsic Disorder? https://www.sciencedirect.com/science/article/abs/pii/S1570963914001988

[P7] PMC. Proline and Glycine Control Protein Self-Organization. https://pmc.ncbi.nlm.nih.gov/articles/PMC2064692/

[P9] Nature. Visualizing Protein Breathing Motions. https://www.nature.com/articles/s41586-022-04417-6

[P12] Oxford Academic. FungiProteomeDB: Molecular Weight and Isoelectric Points. https://academic.oup.com/database/article/doi/10.1093/database/baad004/7078806

[P16] PMC. Protein pI and Intracellular Localization. https://pmc.ncbi.nlm.nih.gov/articles/PMC8667598/

[P21] ExPASy. ProtParam Documentation. https://web.expasy.org/protparam/protparam-doc.html

[P30] Cell Press. C-Terminus Amidation Influences Biological Activity. https://www.cell.com/biophysj/fulltext/S0006-3495(20)31965-2

[P36] PMC. Mechanisms of Secondary Structure Breakers. https://pmc.ncbi.nlm.nih.gov/articles/PMC5036629/

[P37] Nature. Reactivity of Disulfide Bonds. https://www.nature.com/articles/srep38572

[P42] PubMed. Peptide Signal Molecules and Bacteriocins in Gram-Negative Bacteria. https://pubmed.ncbi.nlm.nih.gov/15374646/

[P57] BMC Bioinformatics. Efficacy of Different Protein Descriptors. https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-8-300

[P66] Wikipedia. Pseudo Amino Acid Composition. https://en.wikipedia.org/wiki/Pseudo_amino_acid_composition

[P71] PMC. PseAAC-General. https://pmc.ncbi.nlm.nih.gov/articles/PMC3975349/

[P83] ScienceDirect. Peptide Signal Molecules and Bacteriocins. https://www.sciencedirect.com/science/article/abs/pii/S0196978104002931

[S1] Bédard, F., et al. (2018). Synthesis and Conformational Analysis of Pediocin PA-1. Scientific Reports, 8, 9029.

[S2] Zhang, T., et al. (2022). Biosynthesis of Class II Bacteriocins. Fermentation, 8, 217.

[S5, E6] Ennahar, S., et al. (2000). Class IIa Bacteriocins. FEMS Microbiology Reviews, 24(1), 85-106.

[S6] Buchan, D. W. A., et al. (2024). Deep learning for PSIPRED. Nucleic Acids Research, 52(W1), W287-W293.

[S10] Quagliata, M., et al. (2025). α-Helical structure enhances AMP activity. Biochemistry.

[S11] Kim, S., et al. (2022). Amino acid membrane preference in AMPs. Communications Biology, 5.

[S12] Tuerkova, A., et al. (2020). Effect of Helical Kink in AMPs. eLife, 9, e47946.

[S13] Haugen, H. S., et al. (2005). Structure in Lipid Micelles of Curvacin A. Biochemistry, 44, 16149.

[S14] PDB ID 1KFP. Solution Structure of Gomesin. https://pdbj.org/mine/summary/1kfp

[S17] Fimland, G., et al. (2006). Mutational Analysis of Sakacin P. Biochimica et Biophysica Acta, 1764, 1132.

[S19] Rozek, A., et al. (2000). Structure of Bovine Indolicidin. Biochemistry, 39, 15765.

[S20] Huan, Y., et al. (2020). Antimicrobial Peptides Research Progress. Frontiers in Microbiology, 11.

[S23] Trivedi, R., & Nagarajaram, H. A. (2022). Intrinsically Disordered Proteins: An Overview. Int. J. Mol. Sci., 23.

[S24] Hecht, O., et al. (2008). Self-Recognition by an IDP. FEBS Letters, 582, 2673.

[S27] Satchanska, G., et al. (2024). Diversity and Mechanisms of AMPs. Antibiotics, 13, 202.

[D1] InterPro IPR006079: Lantibiotic, Type A, Bacillales-type. EMBL-EBI, www.ebi.ac.uk/interpro/entry/InterPro/IPR006079/

[D2] PRINTS PR00324: NISIN. EMBL-EBI, www.ebi.ac.uk/interpro/entry/PRINTS/PR00324/

[D3] NCBIFAM TIGR03731: gallidermin/nisin family lantibiotic. NCBI, www.ncbi.nlm.nih.gov/Structure/cdd/TIGR03731

[E1] Wikipedia. Class II Bacteriocin. https://en.wikipedia.org/wiki/Class_II_bacteriocin

[E3] Morton, J. T., et al. (2015). A Large Scale Prediction of Bacteriocin Gene Blocks. BMC Bioinformatics, 16, 381.

[E5] Collins, F. W. J., et al. (2017). Bacteriocin Gene-Trait Matching. Scientific Reports, 7.

[E7] Gabrielsen, C., et al. (2014). Circular Bacteriocins. Applied and Environmental Microbiology, 80, 6854.

[E8] Cagiada, M., et al. (2023). Discovering Functionally Important Sites. Nature Communications, 14.

[E11] Abts, A., et al. (2013). NisC Binds the FxLx Motif. Biochemistry, 52, 5387.

[E18] InterPro IPR002633: Class II bacteriocin domain. EMBL-EBI.

[AF1] EMBL-EBI. pLDDT: Understanding Local Confidence. https://www.ebi.ac.uk/training/online/courses/alphafold/

[AF3] Binder, J. L., et al. (2022). AlphaFold Illuminates Dark Proteins. Current Opinion in Structural Biology, 74.

[AF4] Jumper, J., et al. (2021). Highly Accurate Protein Structure Prediction with AlphaFold. Nature, 596.

[AF5] EMBL-EBI. Evaluating AlphaFold Confidence.

[AF8] EMBL-EBI. pLDDT Variance Interpretation.

[AF9] EMBL-EBI. AlphaFold and Conditional Folding.

[AF10] Atanaskovic, I., et al. (2022). Bacterial Competition Systems. mBio, 13.

[AF12] FastFold. Predicted Template Modelling Score (pTM).

[AF13] EMBL-EBI. Global Confidence Metrics.

[AF19] Neurosnap. Radius of Gyration in Bioinformatics.

[AF22] Lobanov, M. Y., et al. (2008). Rg as indicator of Compactness. Molecular Biology, 42.

[AF27] Zimina, M., et al. (2020). Global Trends in Bacteriocins. Antibiotics, 9, 553.

[AF30] Zhang, H., et al. (2021). Evaluation of Contact Prediction Methods. PLOS Comp Bio, 17.

[AF31] Bryant, P., et al. (2022). Predicting Structure of Large Complexes. Nature Communications, 13.

[AF33] González, C., et al. (2000). Bacteriocin AS-48 Structure. PNAS, 97.

[AF34] Johnson, C. L., et al. (2013). Unstructured Domain of Colicin N. Molecular Microbiology, 89.

[AF36] Ekblad, B., et al. (2016). Structure-Function Analysis of Plantaricin EF. Biochemistry, 55.
[BGC1] BAGEL4. Cluster Classification and GC Deviation.
[BGC2] BAGEL4. LanB and LanC Modification Enzymes.
[BGC3] BAGEL4. Quorum Sensing and Two-Component Systems.
[BGC4] BAGEL4. Export and Immunity Mechanisms.
[BGC5] BAGEL4. Operon Synteny and Completeness.

Genetic Neighborhood Analysis

1. The Genomic Landscape

The genomic region under analysis represents a highly specialized biosynthetic gene cluster (BGC) from Lactococcus lactis, a Gram-positive bacterium historically associated with dairy fermentation but ecologically rooted in plant-based niches. The sequence window spans approximately 24.5 kilobases and encodes 17 distinct open reading frames. At the heart of this region lies the structural gene for Nisin Z, a canonical Class I lanthipeptide and one of the most commercially significant bacteriocins in history.

This region is not merely a random assembly of genes; it is a highly organized, mobile genetic element—specifically, the transposon Tn5276 or a closely related variant. The architecture is bipartite: the downstream region contains the "core" nis operon (nisZBTCIPRK), a tightly regulated assembly line responsible for the synthesis, modification, export, and regulation of the bacteriocin. The upstream region, often overlooked in standard annotations, contains a suite of "cargo" genes—including nutrient transporters, stress response enzymes, and cell wall modifiers—that provide the host bacterium with a comprehensive survival kit.

The biological logic of this landscape is one of aggressive niche colonization. The cluster does not simply produce a toxin; it produces a weapon (Nisin Z), a shield (immunity proteins), a guidance system (regulatory sensors), and a logistical support network (nutrient scavenging transporters). This report dissects this machinery gene by gene, moving from the central biosynthetic core outward to the ecological support systems that make this cluster a dominant force in microbial competition.

Interactive Gene Neighborhood

2. The Biosynthetic Core

The production of Nisin Z begins with the Target Gene (NisZ), which encodes a 57-amino-acid precursor peptide. This precursor is translationally inactive and consists of two distinct domains: an N-terminal leader peptide and a C-terminal core peptide. The leader peptide contains a conserved "FNLD" motif (Phe-Asn-Leu-Asp), which serves as a high-affinity recognition signal for the downstream modification enzymes. This leader sequence acts as a chaperone, keeping the peptide inactive and soluble while guiding it to the biosynthetic machinery.

Immediately downstream of the precursor gene lies NisB, a massive membrane-associated dehydratase. NisB binds to the precursor peptide and catalyzes the dehydration of specific serine and threonine residues within the core region. This reaction removes a water molecule to convert serine into dehydroalanine (Dha) and threonine into dehydrobutyrine (Dhb). These unique, unsaturated amino acids are highly reactive and serve as the foundation for the peptide's final structure.

Following dehydration, the peptide is passed to NisC (annotated as "Lanthionine synthetase C family protein"), a cyclase enzyme located approximately 4.9 kb downstream. NisC catalyzes the stereoselective Michael-type addition of cysteine thiols onto the Dha and Dhb residues generated by NisB. This reaction creates thioether bridges—specifically lanthionine and methyllanthionine rings—which cyclize the peptide. These rings constrain the peptide backbone, transforming a flexible, protease-susceptible chain into a rigid, polycyclic structure capable of binding Lipid II (a cell wall precursor) and piercing bacterial membranes.

Once modified, the peptide must be secreted. This is handled by NisT (annotated as "ABC transporter ATP-binding protein"), located between NisB and NisC. NisT belongs to the ATP-binding cassette (ABC) superfamily, a class of transmembrane pumps that utilize the energy of ATP hydrolysis to translocate substrates across the cell membrane. NisT recognizes the leader peptide of the fully modified precursor and pumps it into the extracellular space.

The final step in biosynthesis is activation. The exported peptide is still fused to its leader sequence and is therefore biologically inert. The Leader peptide-processing serine protease (NisP), located downstream of the immunity gene, is a cell-wall-anchored enzyme that acts as the trigger mechanism. NisP recognizes a specific cleavage site between the leader and the core peptide, proteolytically removing the leader. This liberation exposes the active N-terminal rings of Nisin Z, arming the bacteriocin for immediate antimicrobial action.

3. Defense From Within: The Immunity System

Because Nisin Z targets Lipid II—a universal and essential component of the bacterial cell wall—the producing L. lactis cell is inherently vulnerable to its own weapon. Survival depends on a robust, multi-layered immunity system.

The primary line of defense is NisI (annotated as "NisI protein"), a lipoprotein anchored to the outer face of the cytoplasmic membrane. NisI functions as a "intercepting" protein; it binds to extracellular nisin molecules that approach the membrane, preventing them from accessing Lipid II or forming pores. This sequestration effectively lowers the local concentration of the toxin around the producer cell.

However, genomic analysis of the upstream region reveals a sophisticated secondary layer of immunity that is often absent from simplified textbook models. Approximately 7.5 kb upstream of the target, we find an Acyltransferase operonically coupled to a Glycosyl transferase and a Polysaccharide deacetylase. Proteosemantic analysis of the Acyltransferase (Gene 4 in the neighbor analysis) identifies it as a member of the MBOAT (Membrane-Bound O-Acyltransferase) superfamily, with AlphaFold predicting a rigid 13-transmembrane helix bundle. This enzyme, working in concert with the adjacent glycosyltransferase and deacetylase, likely modifies the producer's cell wall polymers (such as teichoic acids or peptidoglycan).

By altering the charge or chemical structure of the cell envelope—for example, by D-alanylation or O-acetylation—these enzymes reduce the affinity of the cationic Nisin Z peptide for the producer's membrane. This "shield hardening" mechanism complements the active interception by NisI. Furthermore, the Uncharacterized protein located between the deacetylase and glycosyltransferase (Gene 3 in the analysis) is predicted to be a membrane-anchored accessory factor with a helix-hinge-helix topology. Its genomic context suggests it acts as a scaffold, coordinating this cell-wall modification complex to ensure the "shield" is assembled correctly.

4. The Regulatory Logic

The production of Nisin Z is metabolically expensive and potentially suicidal if immunity is not fully established. Consequently, the cluster is under strict control by a two-component regulatory system encoded by NisR (DNA-binding response regulator) and NisK (histidine kinase), located at the far downstream end of the cluster.

NisK is a transmembrane sensor kinase that detects the presence of extracellular nisin. Upon sensing the bacteriocin, NisK autophosphorylates and transfers the phosphate group to NisR. The activated NisR then binds to specific promoters upstream of nisZ, nisF, and nisR, upregulating the transcription of the entire cluster. This creates a positive feedback loop known as quorum sensing: the bacterium produces a basal level of nisin, and as the population density increases (and nisin concentration rises), the system ramps up production exponentially. This ensures that the metabolic cost of high-level bacteriocin synthesis is only incurred when the bacterial population is dense enough to launch an effective attack on competitors.

5. Beyond the Cluster: The Genomic Neighborhood

Moving upstream from the core biosynthetic machinery, the genomic landscape shifts from warfare to logistics. This region contains "cargo" genes that hitchhiked with the transposon and provide complementary fitness advantages.

Most notable is the Branched-chain amino acid (BCAA) transport system carrier protein, located approximately 2.1 kb upstream of the target. Structural analysis identifies this as a BrnQ-type transporter with a 12-transmembrane helix topology, specialized for the uptake of leucine, isoleucine, and valine. Lactococcus lactis is typically auxotrophic for these amino acids, meaning it cannot synthesize them and must acquire them from the environment. The presence of this transporter immediately adjacent to the nisin cluster suggests a "kill-and-eat" ecological strategy: the bacteriocin lyses competing bacteria, releasing their intracellular nutrient pools, and the BCAA transporter allows the producer to efficiently harvest the liberated amino acids.

Further upstream, we find a YjdJ-like protein and a YjdI-like protein. The YjdJ homolog is identified as a GNAT-family acetyltransferase. While its specific substrate is unknown, GNAT enzymes are often involved in stress responses or metabolic regulation. In the context of a bacteriocin cluster, this enzyme may help the cell manage the metabolic stress of high-level protein overexpression or detoxify metabolic byproducts associated with rapid growth.

The region also contains a Collagen binding domain-containing protein and a Cation-transporting P-type ATPase. The collagen-binding protein suggests an adaptation for adhesion to eukaryotic tissues, which is relevant given that L. lactis can be found in bovine environments (rumen/milk). The cation ATPase likely assists in maintaining ion homeostasis (pH and membrane potential), which is critical for a cell that is constantly actively exporting cationic peptides and resisting pore formation.

6. Evolutionary Origins and Mobility

The entire 24.5 kb region bears the hallmarks of a mobile genetic element, specifically the conjugative transposon Tn5276. The presence of the nis operon alongside the sucrose-fermentation capacity (often associated with this transposon, though the specific sac genes are outside the immediate window) and the BCAA transporter indicates that this cluster evolved as a modular "fitness island."

The evolutionary narrative here is one of horizontal gene transfer. The core nis genes likely originated in an ancestral Firmicute and were captured by a transposon. Over time, the transposon acquired additional cargo—the BCAA transporter to solve the host's auxotrophy, and the cell-wall modification module to bolster immunity. This entire package can excise itself from the chromosome and transfer to other lactic acid bacteria via conjugation, spreading the nisin-production trait rapidly through microbial communities. This mobility explains why identical nisin clusters are found in diverse L. lactis strains isolated from geographically distant dairy and plant sources.

7. The Ecological Story

The ecological logic of this gene neighborhood is driven by the specific constraints of the Lactococcus lifestyle. L. lactis is a fastidious organism with limited biosynthetic capabilities, thriving in nutrient-rich but highly competitive environments like milk or decaying plant matter.

In these environments, rapid acidification (via lactic acid production) is the primary competitive strategy, but it is not enough to exclude hardy competitors like Listeria or Clostridium. Nisin Z serves as the "heavy artillery," eliminating these Gram-positive rivals. However, killing competitors is only half the battle; the victor must capitalize on the resources. The co-localization of the BCAA transporter ensures that L. lactis can immediately utilize the nutrients released by the carnage. Thus, the cluster is not just a weapon; it is a specialized tool for resource acquisition in a nutrient-dependent lifestyle.

8. Production Biology and Practical Implications

For a biotechnology company seeking to produce Nisin Z or use this cluster for heterologous expression, the gene neighborhood offers critical intelligence:

Essential vs. Accessory Genes: The minimal essential cassette for nisin production consists of nisZ, nisB, nisC, nisT, nisP, nisR, and nisK. These genes are sufficient to synthesize, modify, export, activate, and regulate the bacteriocin. The upstream genes—including the BCAA transporter, collagen-binding protein, and the Yjd pair—are accessory cargo. While they provide ecological fitness in the wild, they are likely dispensable for production in a controlled fermentation environment (provided the media is supplemented with amino acids). However, the upstream immunity module (Acyltransferase/Deacetylase) should be considered for inclusion if the heterologous host is highly sensitive to nisin, as NisI alone may not provide complete protection at industrial titers.

Production Triggers and Regulation: The presence of the NisRK two-component system confirms that production is auto-inducible. In a fermentation context, this means the culture will naturally switch on nisin production once biomass reaches a critical threshold. To accelerate this process or decouple it from biomass, the system can be artificially induced by adding sub-inhibitory concentrations of nisin (or a non-antibiotic inducer analog) to the media at the time of inoculation. This "jump-starts" the positive feedback loop.

Bottlenecks: The complexity of the post-translational modifications (dehydration and cyclization) catalyzed by NisB and NisC is the likely rate-limiting step. These enzymes must physically bind and process the precursor peptide. Overexpression of the structural gene nisZ alone often leads to the accumulation of unmodified or partially modified inactive peptide. Therefore, any strain engineering strategy should focus on balancing the stoichiometry of NisB and NisC relative to NisZ, rather than simply maximizing nisZ transcription. Additionally, the NisP protease is required for activation; if the goal is to harvest active nisin from the supernatant, NisP must be active. If the goal is to harvest a stable precursor for later activation, NisP should be deleted.

Unknowns: The specific function of the Uncharacterized protein (Gene 3) and its partner Acyltransferase (Gene 4) in the upstream region warrants experimental verification. While bioinformatic evidence strongly suggests a role in cell wall modification/immunity, confirming this via knockout studies could reveal a novel mechanism for enhancing host tolerance, potentially allowing for higher titers of nisin (or other lantibiotics) to be produced without toxic effects on the production strain.

Inhibitory Spectrum Profile

Identification

The sequence under analysis (ID: c9a24d96-622e-44a5-9c49-8581c0dfaa48) is definitively identified as the precursor of Nisin Z, a highly potent antimicrobial peptide produced by the Gram-positive bacterium Lactococcus lactis [REF1], [REF2]. Nisin Z belongs to the Class I lanthipeptides, commonly known as lantibiotics—a family of ribosomally synthesized antimicrobial peptides that undergo extensive post-translational modifications to achieve their active, mature forms [REF1]. Historically utilized as a natural food preservative under the designation E234, nisin and its variants are among the most extensively studied antimicrobial peptides in biology [REF18]. The CinThesis Stage 3 analysis confirms this identity with 100% confidence, noting a perfect sequence match to the characterized Nisin Z precursor (UniProt P29559) and the presence of the defining "Gallidermin/Nisin" domain (PF02052) [REF4]. Nisin Z is a natural variant of the highly studied Nisin A, differing by only a single amino acid substitution at position 27 (His27Asn), a mutation clearly present in this sequence [REF1], [REF4].

Structurally, the mature Nisin Z peptide is characterized by the presence of unusual amino acids, including lanthionine, β-methyllanthionine, dehydroalanine, and dehydrobutyrine [REF3]. The CinThesis Stage 3 data reveals that the 57-amino-acid precursor is notably enriched in serine and threonine residues, which serve as the essential substrates for dehydration and subsequent cross-linking by dedicated biosynthetic enzymes (NisB and NisC) to form five rigid thioether rings [REF4]. Prior to these modifications, the peptide is highly flexible. This flexibility is accurately captured by the CinThesis AlphaFold prediction, which yielded a low average pLDDT score of 46.0 and a predominantly coil-rich structure (63.2%) [REF4]. While low confidence scores typically suggest poor modeling, in the context of lantibiotic precursors, this accurately reflects the intrinsically disordered nature of the unmodified peptide, which must remain flexible to interact with its biosynthetic machinery [REF4].

Mechanism

Nisin Z employs a highly sophisticated and potent dual mechanism of action that simultaneously halts cellular construction and destroys membrane integrity [REF7]. The primary molecular target of nisin is Lipid II, an essential, bactoprenol-bound precursor molecule that acts as a vital carrier, ferrying peptidoglycan building blocks across the bacterial membrane to construct the cell wall [REF8]. Nisin binds to the immutable pyrophosphate moiety of Lipid II with extremely high affinity [REF8]. The initial stage of this attack involves the N-terminal lanthionine rings (Rings A and B) of nisin forming a "pyrophosphate cage" around Lipid II via intermolecular hydrogen bonds [REF7]. This binding event alone is lethal, as it locks Lipid II in a stable complex, effectively halting the peptidoglycan synthesis cycle and preventing the bacteria from building or repairing their cell walls [REF7], [REF8].

However, Lipid II does not merely serve as a target; it acts as a critical docking station that facilitates the second, more destructive phase of the mechanism: pore formation [REF7]. The CinThesis Stage 3 analysis highlights that Nisin Z is highly cationic, possessing a net positive charge of +3.1 and a high lysine content, which drives the initial electrostatic attraction to the negatively charged phospholipid headgroups of the bacterial membrane [REF4]. Once anchored to Lipid II, nisin undergoes a conformational change. A flexible "hinge" region in the middle of the peptide—specifically the sequence Asparagine-Methionine-Lysine (N-M-K) at residues 20-22—allows the C-terminal portion of the molecule to flip and insert deeply into the bacterial cytoplasmic membrane [REF7]. Multiple nisin-Lipid II complexes, typically in a ratio of eight nisin molecules to four Lipid II molecules, subsequently assemble to form a stable, nanometer-sized pore [REF7]. These pores physically puncture the membrane, causing a rapid and uncontrolled efflux of essential ions, amino acids, and ATP, which completely dissipates the electrochemical gradient that powers the cell, leading to rapid cell death [REF7].

Spectrum

The dual mechanism of Nisin Z perfectly dictates its inhibitory spectrum, rendering it a broad-spectrum antimicrobial against Gram-positive bacteria while traditionally exhibiting narrow or no activity against Gram-negative species [REF5]. Gram-positive bacteria are exquisitely susceptible because their single cytoplasmic membrane, where Lipid II is located and exposed, is easily accessible to the nisin peptide [REF7]. Against major foodborne pathogens and clinical strains, Nisin Z demonstrates remarkable potency. Minimum Inhibitory Concentration (MIC)—the lowest concentration of an antimicrobial required to prevent visible bacterial growth—is frequently in the low microgram per milliliter (µg/mL) range for these organisms. For instance, Listeria monocytogenes is highly sensitive, with MIC values typically ranging from 4 to 10 µg/mL [REF17]. Similarly, Enterococcus faecalis and Enterococcus faecium exhibit MICs between 3.125 and 25 µg/mL, and even challenging clinical isolates like Methicillin-resistant Staphylococcus aureus (MRSA) are inhibited at 25 µg/mL [REF6].

Conversely, Gram-negative bacteria are generally resistant to Nisin Z [REF5]. This resistance is not due to a lack of Lipid II, but rather the presence of an outer membrane that acts as a formidable physical barrier, preventing large, hydrophobic, and cationic molecules like nisin from reaching the inner cytoplasmic membrane [REF5]. For example, species like Providencia and Morganella are completely resistant, showing no inhibition even at extreme concentrations exceeding 423 µg/mL [REF5]. However, recent extensive testing has revealed unexpected vulnerabilities in specific Gram-negative genera. Acinetobacter baumannii and Helicobacter pylori demonstrated 100% sensitivity in large-scale screens, with remarkably low mean MICs of 5.86 µg/mL and 5.12 µg/mL, respectively [REF5]. In these rare cases of Gram-negative susceptibility, it is hypothesized that nisin competitively binds to lipopolysaccharides (LPS) on the outer membrane, displacing stabilizing divalent cations (such as magnesium and calcium) to create transient wedges or pores that eventually allow the peptide to access the inner membrane [REF5]. Other Gram-negatives, such as Escherichia coli, exhibit weak sensitivity, requiring much higher concentrations (mean MIC ≈172 µg/mL) for inhibition [REF5].

Table. Predicted inhibitory spectrum of Nisin Z based on literature evidence and CinThesis Stage 3 analysis.

OrganismActivityMIC RangeEvidence
Helicobacter pyloriHighly Active≈5.1 µg/mL[REF5]
Acinetobacter baumanniiHighly Active≈5.9 µg/mL[REF5]
Listeria monocytogenesHighly Active4–10 µg/mL[REF17]
Enterococcus faecalisActive3.1–25 µg/mL[REF6]
Staphylococcus aureus (MRSA)Active25 µg/mL[REF6]
Escherichia coliWeak≈172 µg/mL[REF5]
Providencia spp.Resistant>423 µg/mL[REF5]
Morganella spp.Resistant>423 µg/mL[REF5]

Resistance

Because Nisin Z targets Lipid II, an absolutely essential and highly conserved cell wall precursor, the development of resistance is exceptionally rare compared to conventional antibiotics [REF9]. Nisin binds specifically to the immutable pyrophosphate group of Lipid II rather than the peptide side-chain. This distinction is critical; while bacteria can mutate the peptide side-chain of Lipid II to evade antibiotics like vancomycin (e.g., in vanA mutant strains), they cannot easily modify the pyrophosphate moiety without suffering lethal disruptions to their own cell wall synthesis [REF7], [REF9]. Consequently, Nisin Z remains fully active against vancomycin-resistant enterococci (VRE) and other multidrug-resistant Gram-positive pathogens [REF7].

When resistance does naturally occur, it is typically mediated by non-target-based strategies. Some bacteria produce specific peptidases, often termed "nisinases," which enzymatically cleave and inactivate the bacteriocin before it can reach the membrane [REF5]. Alternatively, target organisms such as Listeria monocytogenes can alter their membrane composition to physically repel the attack. By modifying their membrane fatty acids to decrease fluidity, or by incorporating positively charged molecules into their cell surface, they create an electrostatic repulsion against the highly cationic (+3.1) nisin peptide [REF4], [REF10]. To protect itself from its own potent toxin, the producer organism Lactococcus lactis employs a dedicated immunity system encoded within the nisin gene cluster. This includes NisI, a membrane-anchored lipoprotein that binds and sequesters extracellular nisin, and NisFEG, an ATP-binding cassette (ABC) transporter that acts as an active efflux pump to expel any nisin molecules that manage to breach the cell's defenses [REF11], [REF12].

Ecology

The ecological role of Nisin Z is deeply intertwined with the survival strategies of its producer, Lactococcus lactis. This highly adaptable lactic acid bacterium inhabits both "domesticated" dairy environments and "environmental" niches such as decaying plant matter and the gastrointestinal tracts of animals [REF13]. In these natural habitats, L. lactis faces "feast and famine" batch-culture conditions where competition for transiently available nutrients is fierce [REF14]. To dominate these environments, L. lactis employs a dual-pronged strategy of intraguild predation. First, it utilizes homolactic fermentation to rapidly convert sugars into lactic acid, dropping the environmental pH to suppress the growth of competitors. Concurrently, it secretes Nisin Z to actively lyse sensitive neighboring bacteria [REF15]. During the "famine" phases when environmental resources are exhausted, the nisin-induced lysis of competitors releases a wealth of intracellular nutrients, allowing the nisin-producing L. lactis to scavenge these resources and survive population bottlenecks [REF14].

Furthermore, Nisin Z functions not just as a weapon, but as a sophisticated communication molecule. It serves as an autoinducer in a quorum-sensing network; as nisin accumulates in the environment and reaches a critical concentration threshold, it binds to a sensor kinase (NisK) on the producer's own surface, triggering a massive upregulation of its own biosynthesis [REF11]. The specific structural variant of Nisin Z provides a distinct ecological advantage over its close relative, Nisin A. The single amino acid substitution (His27Asn) does not alter the peptide's antimicrobial potency, but it significantly increases its diffusion rate in solid and semi-solid matrices, such as agar or complex environmental biofilms [REF16]. This enhanced mobility allows Nisin Z to penetrate deeper into structured environments, expanding the producer's zone of inhibition and competitive dominance [REF16]. In contrast, other variants like Nisin U possess a rigid proline in the critical hinge region, restricting flexibility and altering target specificity, highlighting how subtle evolutionary tweaks in the lantibiotic structure dictate ecological success [REF11].

Confidence

Overall Confidence: High Literature Coverage: Well-studied Prediction Reliability: The prediction of this sequence as Nisin Z is extremely reliable, supported by a 100% sequence identity match, perfect genomic synteny (the presence of the complete nis operon including NisB, NisC, NisT, NisI, and NisP), and extensive historical literature on the molecule's function. Key Uncertainties: While the baseline spectrum of Nisin Z is comprehensively understood, key uncertainties remain regarding its efficacy against newly emerging multidrug-resistant Gram-negative clinical isolates in complex in vivo environments. Additionally, while synergy with chelating agents like EDTA is known to expand its spectrum against Gram-negatives, the exact quantitative dynamics of this synergy across diverse genera require further standardization. Suggested Wet-Lab Validation: To refine the predicted spectrum, standard broth microdilution MIC assays should be performed against a diverse panel of clinical Gram-negative isolates (particularly Acinetobacter baumannii and Pseudomonas aeruginosa). Checkerboard assays combining Nisin Z with membrane-destabilizing agents like EDTA or sub-lethal concentrations of polymyxin B would be highly informative to quantify synergistic fractional inhibitory concentration (FIC) indices. Finally, time-kill assays against Listeria monocytogenes in complex food-matrix models (e.g., dairy or meat homogenates) would validate the enhanced diffusion and efficacy conferred by the His27Asn mutation.

References

[REF1] BACREF0124_scihub.pdf — Identification of Nisin Z, producer organism Lactococcus lactis, lantibiotic classification, and the His27Asn mutation.
[REF2] UNIPROT_P29559_PMID1935953.pdf — Confirmation of Nisin Z precursor sequence and producer strain.
[REF3] BACREF0326_scihub.pdf — Structural features of nisin, including unusual amino acids like lanthionine and dehydroalanine.
[REF4] Stage 3 Report — CinThesis data including pLDDT scores, structural coil predictions, physicochemical properties (charge, pI), and domain architectures (PF02052).
[REF5] Nisin Inhibition of Gram-Negative Bacteria.pdf — Extensive MIC data for Gram-negative bacteria, including Acinetobacter, Helicobacter, E. coli, Providencia, and synergy mechanisms.
[REF6] Inhibitory Concentration of Natural Food Preservatives.pdf — MIC data for Gram-positive pathogens including Enterococcus species and MRSA.
[REF7] UNIPROT_P29559_PMID15361862.pdf — Detailed dual mechanism of action, Lipid II binding, pyrophosphate cage, and pore formation.
[REF8] BACREF0424_the-lantibiotic-nai-107-binds-to-bactoprenol-bound-cell-wall-precursors-and-impa.pdf — High-affinity binding of lantibiotics to the bactoprenol-bound Lipid II precursor.
[REF9] UNIPROT_W6IE39_PMID24532070.pdf — Rarity of nisin resistance due to the immutable nature of the Lipid II pyrophosphate target.
[REF10] UNIPROT_P86386_PMID21477375.pdf — Membrane modification resistance strategies in Listeria monocytogenes.
[REF11] BACREF0295_molecular-and-genetic-characterization-of-a-novel-nisin-variant-produced-by-stre.pdf — NisI immunity protein function, quorum sensing via NisK, and structural comparison with Nisin U.
[REF12] BACREF0294_characterization-of-the-structural-gene-encoding-nisin-f-a-new-lantibiotic-produ.pdf — Function of the NisFEG ABC transporter in producer immunity.
[REF13] ECOEVO_ECOLOGY_25583337.pdf — Ecological habitats and evolutionary origins of Lactococcus lactis.
[REF14] ECOEVO_EVOLUTION_22951735.pdf — Feast and famine competitive dynamics and nutrient scavenging via cell lysis.
[REF15] ECOEVO_ECOLOGY_32990728.pdf — Intraguild predation strategy combining homolactic fermentation and nisin production.
[REF16] BACREF0126_scihub.pdf — Comparative biology of Nisin Z versus Nisin A, highlighting the enhanced diffusion rate in solid matrices.
[REF17] ac000024.pdf — Quantitative MIC data for Nisin against Listeria monocytogenes.
[REF18] BACREF0113_nisin-a.pdf — Historical aliases of nisin, including its designation as food preservative E234.
Position:
Residue:
Structure:
Confidence: