Phylogenetic and amino acid conservation analyses of bacterial l-aspartate-α-decarboxylase and of its zymogen-maturation protein reveal a putative interaction domain
© Stuecker et al. 2015
Received: 22 April 2015
Accepted: 3 August 2015
Published: 15 August 2015
All organisms must synthesize the enzymatic cofactor coenzyme A (CoA) from the precursor pantothenate. Most bacteria can synthesize pantothenate de novo by the condensation of pantoate and β-alanine. The synthesis of β-alanine is catalyzed by l-aspartate-α-decarboxylase (PanD), a pyruvoyl enzyme that is initially synthesized as a zymogen (pro-PanD). Active PanD is generated by self-cleavage of pro-PanD at Gly24-Ser25 creating the active-site pyruvoyl moiety. In Salmonella enterica, this cleavage requires PanM, an acetyl-CoA sensor related to the Gcn5-like N-acetyltransferases. PanM does not acetylate pro-PanD, but the recent publication of the three-dimensional crystal structure of the PanM homologue PanZ in complex with the PanD zymogen of Escherichia coli provides validation to our predictions and provides a framework in which to further examine the cleavage mechanism. In contrast, PanD from bacteria lacking PanM efficiently cleaved in the absence of PanM in vivo.
Using phylogenetic analyses combined with in vivo phenotypic investigations, we showed that two classes of bacterial l-aspartate-α-decarboxylases exist. This classification is based on their posttranslational activation by self-cleavage of its zymogen. Class I l-aspartate-α-decarboxylase zymogens require the acetyl-CoA sensor PanM to be cleaved into active PanD. This class is found exclusively in the Gammaproteobacteria. Class II l-aspartate-α-decarboxylase zymogens self cleave efficiently in the absence of PanM, and are found in a wide number of bacterial phyla. Several members of the Euryarchaeota and Crenarchaeota also contain Class II l-aspartate-α-decarboxylases. Phylogenetic and amino acid conservation analyses of PanM revealed a conserved region of PanM distinct from conserved regions found in related Gcn5-related acetyltransferase enzymes (Pfam00583). This conserved region represents a putative domain for interactions with l-aspartate-α-decarboxylase zymogens. This work may inform future biochemical and structural studies of pro-PanD-PanM interactions.
Experimental results indicate that S. enterica and C. glutamicum l-aspartate-α-decarboxylases represent two different classes of homologues of these enzymes. Class I homologues require PanM for activation, while Class II self cleave in the absence of PanM. Computer modeling of conserved amino acids using structure coordinates of PanM and l-aspartate-α-decarboxylase available in the protein data bank (RCSB PDB) revealed a putative site of interactions, which may help generate models to help understand the molecular details of the self-cleavage mechanism of l-aspartate-α-decarboxylases.
Coenzyme A (CoA) is essential to all forms of life. Prokaryotes use CoA for diverse purposes. In some cases CoA is used as a carrier of naturally occurring or xenobiotic, weak organic acids of varied lengths, while in others CoA is critical to the maintenance of thiol-based redox homeostasis [1–5]. CoA is synthesized from the precursor pantothenate, which can be generated de novo by bacteria, plants and fungi . In bacteria, pantothenate synthesis is a branched pathway in which the intermediates β-alanine and pantoate are generated and then condensed to form pantothenate . l-aspartate-α-decarboxylase (PanD; EC 188.8.131.52) catalyzes the conversion of l-aspartate to β-alanine . l-aspartate-α-decarboxylase is a pyruvoyl enzyme that is synthesized as a 14-kDa zymogen, which undergoes self-catalyzed proteolysis to yield active enzyme . The cleavage reaction yields an 11-kDa α-subunit and 3-kDa β-subunit with the concomitant formation of a pyruvoyl moiety at the N-terminus of the α-subunit; the pyruvoyl moiety is necessary for l-aspartate decarboxylation .
The Salmonella enterica l-aspartate-α-decarboxylase zymogen contains all the determinants needed for maturation. However, at 37 °C, the optimal growth temperature of this bacterium, the ancillary protein PanM (formerly YhhK) is required in vitro and in vivo for cleavage of the l-aspartate-α-decarboxylase zymogen [10, 11]. At present, the mechanism by which PanM stimulates l-aspartate-α-decarboxylase cleavage is not known. What is known is that although PanM is a homologue of Gcn5-like N-acetyltransferases (GNATs), it lacks acetyltransferase activity . Interestingly, PanM activity is stimulated by acetyl-CoA, a result that led us to hypothesize that PanM functions as an acetyl-CoA sensor to regulate l-aspartate-α-decarboxylase zymogen maturation .
Previously, we showed that the l-aspartate-α-decarboxylase zymogen from Corynebacterium glutamicum did not require PanM to process its own maturation . This suggested that two classes of bacterial l-aspartate-α-decarboxylases might exist in nature, one class that would require PanM for processing, and a second class that could mature in the absence of PanM. To determine the distribution of these two forms of PanD in the prokaryotes, we performed a phylogenetic analysis of PanM and l-aspartate-α-decarboxylase. By comparing PanM sequences amongst all homologues, we identified a putative domain of interactions between the l-aspartate-α-decarboxylase zymogen and PanM.
Results and discussion
Phylogenetic distribution of prokaryotic l-aspartate-α-decarboxylase and PanM proteins
We determined the distribution of PanM and l-aspartate-α-decarboxylase proteins by searching the National Center for Biotechnology Information (NCBI) database of completed prokaryotic genomes for homologues of the S. enterica PanM and l-aspartate-α-decarboxylase proteins. Homologues of the latter were found in 829 genomes (~52 %) (Additional file 1: Dataset S1). Notably, seven l-aspartate-α-decarboxylase homologues were found in members of the Archaea, a finding that differs from those by Genschel who did not find archaeal l-aspartate-α-decarboxylase homologues . Most likely this difference is due to the increase in sequenced genomes since 2004. In contrast, PanM homologues were far less abundant, with only 128 genomes (~8 %) containing homologues of this protein (Additional file 2: Dataset S2). Importantly, all the PanM-containing genomes belonged to the domain Bacteria.
We predict that these l-aspartate-α-decarboxylase homologues require PanM for maturation. Notably, ~77 % of genomes encoding l-aspartate-α-decarboxylase lacked a PanM homologue, suggestive of l-aspartate-α-decarboxylase proteins that self-cleave in the absence of PanM.
On the basis of these phylogenetic data we predicted the existence of two classes of l-aspartate-α-decarboxylase enzymes, one class that required PanM for maturation, (Class I), and a second class that did not require PanM (Class II).
In vivo validation
All panD homologues restored growth of the S. enterica ΔpanD panM + strain in the absence of pantothenate, indicating that all PanD proteins had l-aspartate-α-decarboxylase activity in vivo (Fig. 3). When expressed in the S. enterica ΔpanM strain, all panD homologues from bacteria that also contained a panM gene (S. enterica, Klebsiella pneumoniae, Pseudomonas aeruginosa) failed to restore growth on minimal medium (Fig. 3, row 1). These data supported the phylogenetic analysis in assigning these l-aspartate-α-decarboxylase homologues to Class I, which we predicted would require PanM for maturation. In contrast, panD genes from bacteria that lacked panM (e.g., Bacillus halodurans, Bordetella pertussis, C. glutamicum, Helicobacter pylori, Legionella pneumophila, Magnetospirillum magneticum, Moorella thermoacetica, Neisseria gonorrhoeae and Ralstonia solanacearum) restored growth of an S. enterica strain carrying a ΔpanM deletion in the absence of β-alanine or pantothenate (Fig. 3, rows 2–4). These in vivo results combined with the phylogenetic data supported the existence of two classes of l-aspartate-α-decarboxylase enzymes. Class I l-aspartate-α-decarboxylases that require PanM for activation were present only in the Gammaproteobacteria. Class II l-aspartate-α-decarboxylases that did not require PanM to be active were found in a number of bacterial phyla along with a handful of archaeal species.
Conserved regions of PanM form a domain where putative interactions with l-aspartate-α-decarboxylases may interact
During the review of the work reported herein, Monteiro et al. published the structure of E. coli PanD in complex with PanM (PanZ in E. coli). This elegant and thorough set of studies revealed, among other things, the interaction domain between the PanD zymogen and PanM . Their analysis established that PanM binds to the C-terminus of PanD. In our study, we found several possible PanM binding regions with higher amino acid conservation among organisms that also contained a PanM homologue when compared with those lacking PanM (Fig. 5). Interestingly, the C-terminal portion of PanD identified by Monteiro et al.  was one of the possible PanM binding regions identified in our study. Specifically, variations in the motif Ala118 to Ala126 identified by the bar in Fig. 5, may be used to predict which PanD zymogens require PanM to expedite self processing.
For PanM (E.c. PanZ), Monteiro et al. also demonstrated that many of the conserved residues found in our study (Fig. 6) were important for PanD binding. Specifically, they found the loop formed by residues Leu66-Gly76 to be stabilized upon Ac-CoA binding, and conserved residue Asn45 on PanM to be critical for PanD binding. Both the Leu66-Gly76 loop and Asn45 were predicted to be involved in PanD binding in our study (Fig. 6). This shows the power of using conservation analysis for the prediction of protein–protein interactions.
There are many intriguing questions regarding the evolution of PanM in pro-PanD maturation. For example, what was the selective pressure that led to the evolution of PanM? This question is interesting in light of the existence of PanM-independent lineages of self-processing pro-PanD proteins.
In E. coli, it is known that the PanD zymogen can self-cleave in the absence of PanM. However, maturation, can only occur at high temperatures, optimally at 50 °C . This dependence on high-temperature is not physiological for neither E. coli nor S. enterica, since neither bacterium can grow at such a temperature. In E. coli and S. enterica, PanD zymogen maturation in the absence of PanM does not occur, or if it does, the amount of mature PanD generated is insufficient to support growth as indicated by the clear pantothenate phenotype of a S. enterica panM or PanZ mutant strain [10, 18].
The availability of the three-dimensional structure of the PanZ/PanM:pro-PanD complex will serve as a critical framework within which we can analyze results from experiments aimed at furthering our understanding of the mechanism of self-cleavage of l-aspartate α-decarboxylases.
In nature there are two classes of self-cleaving zymogens of l-aspartate-α-decarboxylase (PanD) enzymes. In both classes of zymogens, the cleaving event yields an N-terminal pyruvoyl moiety that is critical for substrate binding and catalysis. The putative l-aspartate-α-decarboxylase zymogen:PanM interaction region generated by amino acid conservation analysis using available structural models should facilitate the testing and analysis of the mechanism of l-aspartate-α-decarboxylase zymogen self-cleavage in vivo and in vitro. Structures of l-aspartate-α-decarboxylase zymogens that do not require PanM for maturation would be valuable to better our understanding of the differences between both zymogens and to understand what PanM does to the structure of the zymogen to trigger cleavage.
The protein sequences for l-aspartate-α-decarboxylase and PanM were obtained from the complete genome sequence of Salmonella enterica serovar Typhimurium LT2 (Accession: NC_003197, accessed: 04/01/2012). Specifically, this corresponded to the GenBank Protein IDs 16763570 (PanD) and 16766851 (PanM). We obtained all protein sequences associated with the complete prokaryotic genome collection found in GenBank (ftp://ncbi.nih.gov/genomes/Bacteria/all.faa.tar.gz, accessed: 04/01/2012) and used the stand alone version of BLAST  to format these sequences into a searchable BLAST database. This iteration of the complete prokaryotic genome collection contained a total of 1,606 genome sequences. BLAST was then used to query the l-aspartate-α-decarboxylase and PanM proteins against the complete prokaryotic proteome database and only those alignments with an e value <1e−03 were retained.
Phylogenetic trees were then generated as follows. The alignment program MUSCLE  was first used to generate an amino acid alignment of all l-aspartate-α-decarboxylase homologues obtained from the complete prokaryotic proteome. This alignment was then imported into the phylogenetic analysis program MEGA , and a maximum likelihood (ML) tree was generated. Visualization of this tree was performed using the Interactive Tree of Life web server . For each protein, the originating prokaryotic genus and species was retained throughout the phylogenetic tree construction process. A second tree was also generated using this approach for all homologues of PanM. In both cases, a phylogenetic tree using maximum parsimony was also constructed, but showed no difference in overall topology from the ML tree.
panD genes were amplified from genomic DNA listed in Additional file 3: Table S1 using GeneAmp High Fidelity PCR kit (Applied Biosystems) and cloned into the pBAD24  expression plasmid using EcoRI and HindIII restriction sites. Insert sequences were verified using BigDye® sequencing (Applied Biosystems) at the University of Wisconsin Biotechnology Center (Madison, WI, USA). The resulting plasmids are listed in Additional file 4: Table S2.
Bacterial growth conditions
The ΔpanD (JE13233) and ΔpanM (JE12555) S. enterica strains  were transformed with pBAD24 plasmids expressing panD genes (Additional file 4: Table S2). Strains were grown overnight on lysogeny broth (LB) [24, 25] containing 100 μg ml−1 ampicillin, then sub-cultured at 0.5 % (v/v) into no-carbon E medium (NCE)  containing 20 mM glycerol as the sole source of carbon and energy, and ampicillin (100 μg ml−1). Growth was monitored as an increase in optical density at 650 nm using an ELx808 plate reader (BioTek). All growth curves were performed in triplicate and data presented are averaged from at least two independent experiments. Data were graphed using Prism v4.0 (GraphPad).
Determination of conserved regions in PanD and PanM
PanM protein sequences from bacteria listed in Dataset S2 were aligned using ClustalW2 (http://www.ebi.ac.uk/Tools/msa/clustalW2). For species where multiple strains were listed, only one representative strain was used in the alignment. Conservation of PanM residues was determined from the alignment using ConSurf (consurf.tau.ac.il) . All residues with a ConSurf conservation score above 7 were highlighted on the NMR solution structure of Escherichia coli PanM (Cort, J. R., Yee, A., Arrowsmith, C. H. and Kennedy, M. A.; PDB 2K5T) using Pymol . To predict residues of l-aspartate-α-decarboxylase that may participate in interactions with PanM, two alignments of l-aspartate-α-decarboxylase homologues were created. One alignment contained l-aspartate-α-decarboxylase homologues from Gammaproteobacteria that also contained a panM gene (Class I). The second alignment contained l-aspartate-α-decarboxylase homologues from Gammaproteobacteria that lacked panM (Class II). Both alignments were generated and conserved residues determined as described above. Alignments with conservation scores were manually compared and residues with higher ConSurf scores in Class I compared to Class II l-aspartate-α-decarboxylases were highlighted on the crystal structure of E. coli l-aspartate-α-decarboxylase zymogen (PDB 1PPY) .
Availability of supporting data
All data associated with the phylogenetic trees generated in this study are publicly available in the Dryad Digital Repository at http://0-dx.doi.org.brum.beds.ac.uk/10.5061/dryad.j9d6q.
TNS, designed and performed the in vivo and in vitro molecular genetics experiments, performed the conservation analysis of pro-PanD and PanM and generated the models; wrote the paper; SB, performed the bioinformatics and phylogenetics analyses; KMH-H, participated in the performance of the in vivo and in vitro molecular genetics experiments; GS, designed the bioinformatics and phylogenetics analyses, helped write the paper; and JCE-S, designed the in vivo experiments; wrote the paper. The funding agencies supporting this work did not contribute to the conception or performance of this work. All authors read and approved the final manuscript.
This work was supported by USPHS Grant R01 GM062203 to JCE-S. TNS was supported in part by NIH Molecular Biosciences Training Grant T32-GM07215. This work was also supported by a DOE BER Early Career Research Program Award DE–SC0008104 to GS. We thank Hazel Holden, Robert Maier, Peter Greenberg, Gary Roberts, Joseph Dillard, Caitlin Allen, Arash Komeili and Michele Swanson for the gift of genomic DNA.
Members of the Escalante-Semerena research group (TNS, KMH-H, JCE-S) are cell physiologists that were first to describe the protein-mediated maturation of the l-aspartate-α-decarboxylase zymogen in any organism. Their report identified PanM (formerly YhhK) as the protein required for this process. The work reported here are the initial steps taken towards the elucidation of the molecular details of the interactions between PanM and the zymogen that lead to enzyme maturation. Members of the She Suen research group (SB, GS) are expert bioinformaticists interested in broad analysis of distribution of proteins of interest throughout the prokaryotic domains of life.
Compliance with ethical guidelines
Competing interests The authors declare that they have no competing interests.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Abiko Y (1975) Metabolism of coenzyme A. In: Greenburg DM (ed) Metabolic pathways: metabolism of sulfur compounds. Academic, New York, pp 1–25Google Scholar
- Lee C-Y, Chen A (1982) Immobilized coenzymes and derivatives. In: You E (ed) The pyridine nucleotide coenzymes. Academic, New York, pp 189–224Google Scholar
- Wallace BD, Edwards JS, Wallen JR, Moolman WJ, van der Westhuyzen R, Strauss E et al (2012) Turnover-dependent covalent inactivation of Staphylococcus aureus coenzyme A-disulfide reductase by coenzyme A-mimetics: mechanistic and structural insights. Biochemistry 51(39):7699–7711PubMed CentralView ArticlePubMedGoogle Scholar
- Darnell M, Weidolf L (2013) Metabolism of xenobiotic carboxylic acids: focus on coenzyme A conjugation, reactivity, and interference with lipid metabolism. Chem Res Toxicol 26(8):1139–1155View ArticlePubMedGoogle Scholar
- Hunt MC, Tillander V, Alexson SE (2014) Regulation of peroxisomal lipid metabolism: the role of acyl-CoA and coenzyme A metabolizing enzymes. Biochimie 98:45–55View ArticlePubMedGoogle Scholar
- Smith CM, Song WO (1996) Comparative nutrition of pantothenic acid. J Nutrition Biochem 7:312–321View ArticleGoogle Scholar
- Cronan JE Jr, Littel KJ, Jackowski S (1982) Genetic and biochemical analyses of pantothenate biosynthesis in Escherichia coli and Salmonella typhimurium. J Bacteriol 149:916–922PubMed CentralPubMedGoogle Scholar
- Cronan J Jr (1980) Beta-alanine synthesis in Escherichia coli. J Bacteriol 141:1291–1297PubMed CentralPubMedGoogle Scholar
- Ramjee MK, Genschel U, Abell C, Smith AG (1997) Escherichia coli l-aspartate-alpha-decarboxylase: preprotein processing and observation of reaction intermediates by electrospray mass spectrometry. Biochem J 323:661–669PubMed CentralView ArticlePubMedGoogle Scholar
- Stuecker TN, Hodge KM, Escalante-Semerena JC (2012) The missing link in coenzyme A biosynthesis: PanM (formerly YhhK), a yeast GCN5 acetyltransferase homologue triggers aspartate decarboxylase (PanD) maturation in Salmonella enterica. Mol Microbiol 84:608–619PubMed CentralView ArticlePubMedGoogle Scholar
- Stuecker TN, Tucker AC, Escalante-Semerena JC (2012) PanM, an acetyl-coenzyme A sensor required for maturation of l-aspartate decarboxylase (PanD). MBio 3:e00158–e00112Google Scholar
- Genschel U (2004) Coenzyme A biosynthesis: reconstruction of the pathway in archaea and an evolutionary scenario based on comparative genomics. Mol Biol Evol 21:1242–1251View ArticlePubMedGoogle Scholar
- Ashkenazy H, Erez E, Martz E, Pupko T, Ben-Tal N (2010) ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids. Nucleic Acids Res 38(Web Server issue):W529–W533Google Scholar
- Schmitzberger F, Kilkenny ML, Lobley CM, Webb ME, Vinkovic M, Matak-Vinkovic D et al (2003) Structural constraints on protein self-processing in l-aspartate-alpha-decarboxylase. EMBO J 22:6193–6204PubMed CentralView ArticlePubMedGoogle Scholar
- Nozaki S, Webb ME, Niki H (2012) An activator for pyruvoyl-dependent l-aspartate alpha-decarboxylase is conserved in a small group of the gamma-proteobacteria including Escherichia coli. MicrobiologyOpen 1:298–310PubMed CentralView ArticlePubMedGoogle Scholar
- Vetting MW, Carvalho LPSD, Yu M, Hegde SS, Magnet S, Roderick SL et al (2005) Structure and functions of the GNAT superfamily of acetyltransferases. Arch Biochem Biophys 433:212–226View ArticlePubMedGoogle Scholar
- Monteiro DC, Patel V, Bartlett CP, Nozaki S, Grant TD, Gowdy JA et al (2015) The structure of the PanD/PanZ protein complex reveals negative feedback regulation of pantothenate biosynthesis by coenzyme A. Chem Biol 22:492–503PubMed CentralView ArticlePubMedGoogle Scholar
- Adams MD, Wagner LM, Graddis TJ, Landick R, Antonucci TK, Gibson AL et al (1990) Nucleotide sequence and genetic characterization reveal six essential genes for the LIV-I and LS transport systems of Escherichia coli. J Biol Chem 265:11436–11443PubMedGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW (1990) Basic local alignment search tool. J Mol Biol 215:403–410View ArticlePubMedGoogle Scholar
- Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797PubMed CentralView ArticlePubMedGoogle Scholar
- Kumar S, Nei M, Dudley J, Tamura K (2008) MEGA: a biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief Bioinform 9:299–306PubMed CentralView ArticlePubMedGoogle Scholar
- Letunic I, Bork P (2007) Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics 23:127–128View ArticlePubMedGoogle Scholar
- Guzman LM, Belin D, Carson MJ, Beckwith J (1995) Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter. J Bacteriol 177:4121–4130PubMed CentralPubMedGoogle Scholar
- Bertani G (1951) Studies on lysogenesis. I. The mode of phage liberation by lysogenic Escherichia coli. J Bacteriol 62:293–300PubMed CentralPubMedGoogle Scholar
- Bertani G (2004) Lysogeny at mid-twentieth century: P1, P2, and other experimental systems. J Bacteriol 186:595–600PubMed CentralView ArticlePubMedGoogle Scholar
- Berkowitz D, Hushon JM, Whitfield HJ Jr, Roth J, Ames BN (1968) Procedure for identifying nonsense mutations. J Bacteriol 96:215–220PubMed CentralPubMedGoogle Scholar
- DeLano WL (2002) The pymol molecular graphics system. Schrodinger, LLCGoogle Scholar