Evolution of the Kdo2-lipid A biosynthesis in bacteria
BMC Evolutionary Biology volume 10, Article number: 362 (2010)
Lipid A is the highly immunoreactive endotoxic center of lipopolysaccharide (LPS). It anchors the LPS into the outer membrane of most Gram-negative bacteria. Lipid A can be recognized by animal cells, triggers defense-related responses, and causes Gram-negative sepsis. The biosynthesis of Kdo2-lipid A, the LPS substructure, involves with nine enzymatic steps.
In order to elucidate the evolutionary pathway of Kdo2-lipid A biosynthesis, we examined the distribution of genes encoding the nine enzymes across bacteria. We found that not all Gram-negative bacteria have all nine enzymes. Some Gram-negative bacteria have no genes encoding these enzymes and others have genes only for the first four enzymes (LpxA, LpxC, LpxD, and LpxB). Among the nine enzymes, five appeared to have arisen from three independent gene duplication events. Two of such events happened within the Proteobacteria lineage, followed by functional specialization of the duplicated genes and pathway optimization in these bacteria.
The nine-enzyme pathway, which was established based on the studies mainly in Escherichia coli K12, appears to be the most derived and optimized form. It is found only in E. coli and related Proteobacteria. Simpler and probably less efficient pathways are found in other bacterial groups, with Kdo2-lipid A variants as the likely end products. The Kdo2-lipid A biosynthetic pathway exemplifies extremely plastic evolution of bacterial genomes, especially those of Proteobacteria, and how these mainly pathogenic bacteria have adapted to their environment.
Kdo2-lipid A is a complex glycolipid consisting of glucosamine, 3-OH fatty acids, and unusual sugar 3-deoxy-D-manno-octulonsonic acid (Kdo) (Figure 1). Kdo2-lipid A is the principle and essential component of the outer leaflet of the outer cell wall of Gram-negative bacteria . It is the membrane anchor for a wide and variable range of polysaccharide repeating units extending beyond the cell wall. Together with Kdo2-lipid A the polysaccharide repeating units constitute lipopolysaccharide (LPS), a class of molecules believed to occur solely in Gram-negative bacteria and Cyanobacteria and essential for their viability . Lipid A is a potent immunoreactive factor responsible for triggering a macrophage mediated and frequently overwhelming immune response resulting in endotoxic shock, a potentially lethal condition. Indeed, as far as is known, all vertebrates and an assortment of invertebrates exhibit a robust and highly specific anti-LPS response.
For decades lipid A/LPS has been essentially synonymous with Gram-negative organisms, yet, despite the relative diversity of the Gram-negative bacteria, the origin and evolutionary pathway of this important molecule is virtually unknown. Presently the complex, nine-enzyme biosynthetic process of Kdo2-lipid A is best known from Escherichia coli (Figure 1). The genetic sequences for the enzymes have been derived from the E. coli genome (Additional file 1). This pathway and the concomitant genes in E. coli probably represent the most highly evolved Kdo2-lipid A biosynthetic pathway given the highly adapted association of E. coli with vertebrate enteric habitats, which depends heavily on the structure of LPS. Thus, the central question becomes: can the evolutionary radiation of LPS be described by understanding the comparative genomics of Kdo2-lipid A biosynthetic pathway?
Traditional classification of bacteria: Gram-positive vs. Gram-negative, is based on the Gram-staining procedure . Gram-positive bacteria retain the crystal-violet stain, whereas Gram-negative bacteria are decolorized with alcohol or acetone and stained red with safranin or basic fuchsin. Different responses to the staining are caused by the chemical properties of the cell walls. Gram-positive bacteria have an impermeable cell wall that is made of thick peptidoglycan and is not affected by decolorization. Gram-negative bacteria, on the other hand, have a thin peptidoglycan layer and an outer membrane containing LPS, which can be disrupted by decolorization.
Phylogenetic analysis of biomolecular sequences is now considered to be more reliable for bacterial classification. Molecular phylogenies do not support the traditional simple grouping of eubacteria: Gram-negative vs. Gram-positive [e.g., [4–6]]. The most important and often used phylogenetic marker for bacteria is 16S ribosomal RNA (rRNA) due to their highly conserved nature of sequences. The second edition of Bergey's Manual of Systematic Bacteriology , the gold standard of bacterial classification, is mostly based on 16S rRNA phylogenies. Gram-positive bacteria are now grouped into two paraphyletic groups: the phylum Firmicutes (low G+C content Gram-positive) and the phylum Actinobacteria (high G+C content Gram-positive). Gram-negative bacteria are composed of more than 20 of highly diverged phyla. Firmicutes, a Gram-positive phylum, forms a sister cluster with a Gram-negative phylum Cyanobacteria. The phylum Proteobacteria, the largest Gram-negative bacteria group that includes E. coli and many pathogens, is the most derived group of bacteria.
Understanding the distribution and diversity of the Kdo2-lipid A biosynthetic pathway among bacteria is important for a multitude of reasons. Notwithstanding its limitation, Gram-staining is still widely used in clinical practice. It is often the first diagnostic test, which is crucial for the initial diagnostic and treatments. Kdo2-lipid A is the highly immunoreactive endotoxic center of LPS. The endotoxicity of LPS is dependent on and mediated by the Kdo2-lipid A component. Furthermore, the Kdo2-lipid A pathway is being considered as a target for new antibiotic development [e.g., [8, 9]]. Kdo2-lipid A is required for growth of E. coli and most other Gram-negative bacteria. Inhibitors of the Kdo2-lipid A biosynthesis, therefore, can become good antibiotics against these bacteria.
In order to elucidate how the Kdo2-lipid A biosynthetic pathway has evolved in the bacterial kingdom, we examined the distribution of the nine enzymes involved in this pathway across 61 bacterial genomes. With Kdo2-lipid A as the final product, the entire pathway was expected to be highly conserved among Gram-negative bacteria. On the other hand, Gram-positive bacteria would lack some or all of the enzymes required for the Kdo2-lipid A biosynthesis. On the contrary, we identified a widely varied level of conservation in this pathway among Gram-negative bacteria. We showed that the currently known, considered to be "canonical", nine-enzyme pathway, which has been characterized mainly in E. coli and related bacteria, does not represent nor should be considered as ancestral to all Gram-negative bacteria. Rather, the nine-enzyme pathway represents the product of genomic plasticity, evolved in highly-derived Proteobacteria, especially in those closely related to E. coli.
Results and discussion
Distribution of Kdo2-lipid A biosynthetic enzymes across bacterial genomes
Gram-negative bacteria, by definition, should have LPS-containing outer membranes; hence all these bacteria are expected to possess all genes encoding Kdo2-lipid A biosynthetic enzymes. These genes, on the other hand, are likely to be missing from Gram-positive bacteria unless they are used in alternative functions in these bacteria. As we expected, none of the seven Gram-positive bacteria we examined had the genes encoding these enzymes. On the other hand, we found surprisingly a wide range of presence/absence patterns with these genes among 54 Gram-negative bacteria we studied (Additional file 1). Only one group of Gammaproteobacteria including Escherichia, Vibrio, and Shewanella species had all nine genes required for Kdo2-lipid A biosynthesis. We call them Group II Gammaproteobacteria (see Figure 2A). Remaining Gammaproteobacteria (Group I; e.g., Pseudomonas, Xylella fastidiosa, Coxiella burnetii) as well as Betaproteobacteria (e.g., Bordetella parapertussis and Dechloromonas aromatica) had all genes except lpxM. All other Gram-negative bacteria are missing lpxH as well as lpxM genes. This implies that the Kdo2-lipid A biosynthetic pathway consisting of the nine enzymes is not ancestral, but rather a specialized, derived form found only in E. coli and closely related Group II Gammaproteobacteria. The LpxM protein, found only in this group, shares a sequence similarity of 47% with the LpxL protein, and appeared to be a product of gene duplication. We also found two exceptions among Proteobacteria. Nitrosomonas europaea (Betaproteobacteria) had only one (lpxK) and two species of Walbachia (Alphaproteobacteria) we examined had none of the nine genes. Seven other bacteria also lacked all of the nine genes although they are classified as Gram-negative (Figure 2B and Additional file 1).
Gene duplication and functional specialization characterize the evolution of the Kdo2-lipid A biosynthetic pathway
We identified multiple gene-duplication events during the evolution of the Kdo2-lipid A biosynthetic pathway. lpxA and lpxD are duplicated genes and share 45% sequence similarity at the protein level. LpxA and LpxD proteins have trimer structures with different quaternary assembly and active sites [10, 11]. All bacteria we examined either possess or lack both enzymes. Therefore, generation of this pair of enzymes must have been an early event in Gram-negative bacteria. Further duplications of either of the genes have happened in some lineages independently (e.g., lpxA duplication in Geobacter sulfurreducens and lpxD duplication in Gloeobacter violaceus) (see Additional file 1 for details). lpxH gene, which encodes pyrophosphatase , appeared to have arisen from a duplication of lpxH2 gene within the Proteobacteria lineage but before Beta- and Gammaproteobacteria divergence (Figure 2). Furthermore, lpxL gene seems to be prone to duplicate. We identified several such duplication events including one that generated lpxM in the Group II Gammaproteobacteria (Figure 2).
lpxH/lpxH2duplication within Proteobacteria
LpxH and LpxH2 proteins share 42% sequence similarity . LpxH2 candidates were found mainly in the phylum Proteobacteria, but also in some other Gram-negative bacteria (Additional file 1, Figure 2). The duplication event that created the lpxH gene happened within the phylum Proteobacteria before the divergence of Beta- and Gammaproteobacteria (Figures 2A and 3). Interestingly, after the duplication event, lpxH2 gene was lost from the Group II Gammaproteobacteria except for Shewanella sp. MR-4 (Figure 2A). It implies that lpxH2 gene is dispensable when lpxH gene exists. Since some of the Gram-negative bacteria that have seven enzymes in the pathway have only lpxH2 gene or neither lpxH nor lpxH2 genes, it is tempting to speculate that functions of LpxH and LpxH2 proteins are interchangeable or can be replaced by other non-specific phosphatases (see below for LpxI discovery). Babinski et al.  showed that lpxH gene from Pseudomonas aeruginosa compensated for the loss of E. coli K12 lpxH, but lpxH2 of P. aeruginosa did not. For those Proteobacteria that have both of LpxH and LpxH2 proteins, the function of each protein may have been optimized for their specific roles. Such specialized proteins may not be as flexible as the ancestral form.
LpxH and LpxH2 proteins belong to the metallophosphoesterase superfamily (PF00149; the Pfam database ). The LpxH and LpxH2 protein groups form two distinct clusters among similar bacterial proteins identified (see the inset of Figure 3). All these proteins contain a five-block motif: D-Xn-GD-Xn-GNH(E/D)-Xn-H-Xn-GHXH, a signature of the metallophosphoesterase superfamily. Mutational and structural analyses showed that these residues are involved in metal-ion binding and catalysis [15–18]. The five-block residues are completely conserved among all proteins we examined including LpxH and LpxH2 (shown in the inset of Figure 3) except for the third block, GNH(E/D), where the histidine (H) is substituted to arginine (R) only in LpxH (Additional file 2). After the LpxH2/H ancestral protein was generated by a duplication of a gene encoding a protein similar to calcineurin phosphohydrolase (the closest relative of LpxH2/H), another duplication produced the two protein families, LpxH2 and LpxH (Figure 3). Although the function of LpxH2 has been unknown, comparisons of structural models of LpxH and LpxH2 proteins indicate that the five-block regions form similar cavities in these proteins (see Methods and Additional file 2). The histidine-to-arginine substitution in the third block in LpxH may have increased the LpxH activity significantly in Beta- and Gammaproteobacteria.
Recently, Metzger and Raetz  identified a gene located between lpxA and lpxB in Caulobacter crescentus (Alphaproteobacteria). This gene, named lpxI, which does not share similarity with lpxH/lpxH2 genes, was found to catalyze UDP-2,3-diacylglucosamine hydrolysis by a mechanism different from lpxH. It also rescued lpxH-deficient E. coli. lpxI orthologues were found in many, but not all of, bacteria that lack lpxH  (see also Additional file 1). Metzger and Raetz  reported that lpxI was found even from Gram-positive bacteria (e.g., Firmicutes). It suggests that replacement of lpxH gene function appears to be not very difficult, and multiple gene recruitments may have happened during the evolution of Kdo2-lipid A biosynthetic pathway.
Multiple lpxL duplicationswithin Gram-negative bacteria
Figure 4 shows the LpxL phylogeny for the entire set of bacteria we examined. Among the Gram-negative bacteria, we identified three or more independent duplication as well as loss events that involved with lpxL. The first duplication appears to have happened within the phylum Proteobacteria. After Coxiella burnetti has diverged from other Beta- and Gammabacteria, the duplication event generated two lpxL genes (we call them Types 1 and 2 in Figure 4). One of the duplicated lpxL genes, Type 2, was lost before the divergence of the Group-I Gammaproteobacteria. Interestingly this event coincided with the second gene duplication that created the lpxM gene in this group (Figure 4). The third duplication event is identified only in the lineage leading to closely related enterobacteria (E. coli, Salmonella enterica, and Yersinia pestis). The duplication product, LpxP, shares 74% sequence similarity with the LpxL protein. The original lpxL gene (Type 1) was subsequently lost from the Y. pestis genome (Figure 4). In addition to these major duplication events, we also identified at least two species-specific lpxL duplications in Gammaproteobacteria: one for Type 1 (in Alcanivorax borkumensis SK2; order Oceanospirillales) and another for Type 2 (in Francisella novicida; order Thiotrichales).
Functional specialization in lpxLand its paralogues
Although it is not clear if Types-1 and -2 LpxL function differently in all Beta/Gammabacteria, in some bacteria duplicated LpxL copies have apparently evolved to have slightly different functions. Type-1 LpxL (LpxL1) of Bordetella pertutssis adds secondary 2-hydroxy laurate at the position 2 of Kdo2-lipid A, and Type-2 LpxL (LpxL2) adds myristate at the position 2' of Kdo2-lipid A . Francisella novicida has species-specific duplicated copies of Type-1 LpxL, LpxL1 and LpxL2, and they function as Kdo-dependent and -independent acyltransferases, respectively. LpxL2 of F. novicida can add laurate at the position 2' of lipid IVA without Kdo addition by WaaA, which probably accounts for a large amount of 'free lipid A' (lipid A not linked to Kdo, core sugars, and O-antigen) .
Acyltransferases LpxL and LpxM share 47% sequence similarity. In E. coli, LpxL and LpxM add laurate at the position 2' and myristate at the position 3' of Kdo2-lipid A, respectively , and have conserved catalytic dyad motifs HX 4 D and HX 4 E, respectively . LpxM is not required for growth of E. coli K12 . While E. coli K12 mutants for lpxL and lpxM genes can grow on minimum medium and at all temperatures, they do not grow on rich media at temperatures above 32°C [24–26]. In Haemorphilus influenzae, LpxL and LpxM are both myristoyl transferases . Therefore, four types of related enzymes appear to have evolved after duplications: "generalist" LpxL enzyme that catalyzes the transfer of both laurate and myristate, more "specialized" LpxL enzyme that acts only either on laurate or myristate, and LpxM newly "specialized" as myristoyl transferase. LpxM is especially optimized for Group II Gammaproteobacteria, many of which live in warm body temperatures of the hosts.
Cold-temperature adaptation with lpxP
Another lpxL duplicate, lpxP, is found in E. coli and other closely related enterobacteria. While E. coli and S. enterica have all three paralogous genes (lpxL, lpxM, and lpxP), Y. pestis has only two derived copies (lpxM and lpxP). lpxP encodes the palmitoleoyl transferase, which is induced upon cold shock (12°C) . In E. coli, palmitoleate is incorporated to Kdo2-lipid IVA by LpxP at the position where normally LpxL incorporates laurate (See Figure 1) . Palmitoleoyl residue changes the properties (e.g., fluidity) of the outer membrane and makes bacteria adaptable to low growth temperatures. While survival outside of an animal host is necessary in bacteria such as E. coli, H. influenzae, for example, is transmitted from animal to animal without being exposed in the colder environment. These Gram-negative bacteria do not have lpxP gene.
Another example of lipid-A associated adaptation is found in Y. pestis. Y. pestis changes its host from flea to mouse, cold to warm temperature environment. Due to the absence of the lpxL gene, Y. pestis produces only tetra-acylated Kdo2-lipid IVA (see Figure 1) but not hexa-acylated Kdo2-lipid A at 37°C (mammalian host temperature) contributing bacteria's poor Toll-like receptor 4 (TLR4) stimulating activity . With lpxP activated, however, at 21°C (flea temperature) hexa-acylated Kdo2-lipid A with palmitoleate is synthesized in Y. pestis.
Four-enzyme pathway in Cyanobacteria and Dictyoglomi: the primordial form?
Cyanobacteria (Cn in Figure 2) and Dictyoglomi (Di) have only four of the nine genes: lpxA, lpxC, lpxD, and lpxB (Figure 2). These genes encode the first four enzymes of Kdo2-lipid A biosynthesis up to the point of producing the lipid A disaccharide (DS-1-P in Figure 1), implying that this may be the primordial form of lipid A. LPS from Cyanobacteria is simpler than that of LPS of enteric bacteria and lacks Kdo, heptose, and phosphate [30–33]. It is reported that LPS from E. coli is more toxic than LPS from Cyanobacteria .
Kdo2-lipid A biosynthetic pathway gene clusters
In prokaryotic genomes, many functionally related gene sets exist as gene clusters [e.g., [35–37]]. As shown in Additional file 3, genes encoding some or all of the first four enzymes (lpxA , lpxC, lpxD, and lpxB) are often found in a gene cluster. Dictyoglomi and Cyanobacteria have only these four genes and all of them exist in a conserved gene cluster, lpxD-lpxC-fabZ-lpxA-lpxB. Note that the majority of these clusters include fabZ, which encodes (3R)-hydroxymyristoyl acyl carrier protein (ACP) dehydratase . (3R)-hydroxymyristoyl-ACP serves as an important biosynthetic branch point. It is transferred to UDP-GlcNAc to initiate lipid A biosynthesis (Figure 1). Or it is elongated by FabZ and other enzymes of fatty acid synthesis to palmitate, which is a major component of the membrane glycerophospholipids. Therefore, this gene cluster is important for regulating the proportion of LPS and phospolipids in bacterial membranes. In some bacteria (e.g., Bacteriodetes, Chorolobi, and Verrucomicrobia), the lpxC-fabZ part of the cluster has been fused and exists as a single gene (denoted as lpxC/fabZ in Additional file 3). Although lpxH is not found as part of these gene clusters, as mentioned before, its functional equivalent lpxI is found as part of the cluster (existing between lpxA and B) in Alphaproteobacteria and some other bacteria (see Additional file 3). Another often found gene cluster is waaA-lpxK. Again these genes encode enzymes that catalyze the two consecutive steps (see Figure 1) indicating the importance of their functional association in Kdo2-lipid A biosynthesis.
Bacterial phylogeny based on Kdo2-lipid A biosynthetic enzymes
There have been a number of studies to reconstruct bacterial phylogenies using various molecular data [e.g., [6, 39–41]]. Our phylogenies based on Kdo2-lipid A biosynthesis enzymes, analyzed concatenated (Figure 2A) or individually (Figures 3 and 4), are largely consistent with previous studies. The monophyletic cluster including both Gamma- and Betaproteobacteria is strongly supported by high bootstrap values in both Lpx protein and 16S rRNA phylogenies (both 100% in Figure 2) as well as by the shared events with lpxH2/lpxH and lpxL duplications. Group II Gammaproteobacteria, which includes the orders Enterobacteriales (E. coli, S. enterica, Y. pestis), Pasteurellales (H. influenzae, Pasteurella multocida), Vibrionales (V. cholerae), and Alteromonadales (Shewanella), also form a highly-supported cluster (100% bootstrap values in Figure 2) and all share the newly emerged lpxM gene. This Group II Gammaproteobacteria clustering is also supported by the phylogenetic study by Gao et al.  based on 36 protein data as well as the existence of the unique indel within the RNA polymerase β-subunit (RpoB).
Using the Dictyoglomi (Di) as the outgroup (see Method), phylogenies based on both of the Kdo2-lipid A biosynthetic enzymes and the 16S rRNA (Figure 2) showed that most bacteria that have no or only four enzymes (both Gram-positive and -negative bacteria) are located at basal relative to those that have seven or more of the enzymes. Exceptions include Brachyspira hyodysenteriae (phylum Spirochaetes) and Thermodesulfovibrio yellowstonii (phylum Nitrospirae). Fusobacterium nucleatum (phylum Fusobacteria) was also located outside of Cyanobacteria cluster in Kdo2-lipid A enzyme phylogeney (Figure 2A). Note, however, that none of their phylogenetic positions was supported with high bootstrap values in 16S rRNA phylogeny (Figure 2B).
Gram-negative bacteria with no Kdo2-lipid A biosynthetic enzyme
Some Gram-negative bacteria (shown with stars in Figure 2B; described also in ) as well as Gram-positive bacteria (shown with orange letters and lines in Figure 2B) have none of the nine enzymes. As mentioned above, since both Dictyoglomi (used as the outgroup; Di) and Cyanobacteria (Cn) have the first four enzymes of the pathway, having the four-enzyme pathway appears to be the ancestral form. It implies that these enzymes must have been lost later in some groups of bacteria. Many of the Gram-negative bacteria with no Kdo2-lipid A biosynthetic enzyme have specialized life-styles: endosymbiomes (e.g., Wolbachia and Borrelia burgdorferi), obligate chemolithoautotroph (e.g., Nitrosomonas europaea), or hyperthermophiles (e.g., Thermosipho africanus and Petrotoga mobilis).
Wolbachia is either a parasite in arthropod hosts or a mutualist in nematode hosts. It belongs to the family Anaplasmataceae of the class Alphaproteobacteria. Bacteria in this family include Ehrlichia chaffeensis, Anaplasma phagocytophilum, and Neorickettsia sennetsu; all are obligatory intracellular and infect mononuclear cells and granulocytes. They have been found to lack Kdo2-lipid A biosynthetic genes [43, 44]. Therefore, the loss of these genes must have happened in the ancestral lineage after the divergence from other Alphaproteobacteria group (see Figure 2). Lipid A is directly recognized by hosts to trigger innate-immune responses [45, 46]. It is likely that the endosymbiotic bacteria have adapted to their symbiosis conditions by losing these enzymes and not producing lipid A to avoid the induction of defense response in the hosts.
B. burgdorferi is also an endosymbiotic bacterium that lives in ticks and transmits Lyme disease. T. denticola is the cause of periodontal disease. They belong to the family Spirochaetaceae (phylum Spirochaetes; order Spirochaetales). While there have been conflicting reports whether or not these bacteria have LPS [47–50], genomic analyses showed that these bacteria as well as T. pallidum lack Kdo2-lipid A biosynthetic enzymes [51–53] (see also Additional file 1). The genes encoding these enzymes have been identified from two other Spirochaetes families (Leptospiraceae and Brachyspiraceae), which diverged earlier than the family Spirochaetaceae (Figure 2A; also see [54, 55]). Therefore, the gene loss event is specific to the family Spirochaetaceae.
N. europaea is a obligate chemolithoautotroph that derives all its energy and reductant for growth from the oxidation of ammonia to nitrite [56, 57]. Only lpxK was detected from this Betaproteobacteria species. From the closely related Nitrosospira multiformis genome, only lpxA gene instead has been identified . These ammonia-oxidizing bacteria have the smallest genomes (~3 Mb) among the Betaproteobacteria . Such genome reduction is consistent with their limited lifestyles as we also described earlier for obligate endosymbionts and pathogens. Large numbers of insertion sequence elements and pseudogenes have been also found in these genomes, implying that the genomes are still undergoing reductive evolution.
The origin of lpxgenes found in plants
Although Kdo2-lipid A biosynthesis is generally considered to be a prokaryote-specific function, at least six of the nine lpx genes have been identified in Arabidopsis thaliana, Oryza sativa, and other plant genomes (data not shown; see also [2, 60]). Furthermore, Armstrong et al.  showed that green algae and chloroplasts of garden pea were stained with affinity reagents for lipid A, indicating that these plants synthesize lipid A-like molecules. Since Cyanobacteria possess only the first four genes, all or some of the lpx genes in plants must have been transferred from bacteria other than Cyanobacteria.
Despite the presence of lipid A genes in plants, attempts to isolate canonical lipid A from plants using standard methods have been unsuccessful (Pardy, unpublished data); no structural data for putative lipid A in higher plants has been reported. However, as mentioned earlier, using a novel lipid A preparation technique, Snyder et al.  recently provided chemical composition and structural data for a simple lipid A from strains of a marine cyanobacteria, Synechoccoccus. The composition and structure of this lipid A differs significantly from that of canonical lipid A from enteric bacteria in ways hypothesized to be adaptive to marine Synechoccoccus. Although not only the first four genes exist in plants, lipid A in plants likely has a structure different from canonical lipid A. Further investigation is required to elucidate the origin and functions of eukaryotic lpx genes.
Bacterial genomes are extremely plastic. Although the Kdo2-lipid A biosynthesis is one of the most fundamental and most conserved pathways among Gram-negative bacteria, this study showed that gene duplications as well as partial or complete losses of the genes encoding these enzymes have happened multiple times independently during bacterial evolution. Each group of bacteria took advantage of such evolutionary events to optimize the pathway and adapted to their specialized life style. The most optimized form of the pathway is found in the Proteobacteria lineage, especially among Gammaproteobacteria. The nine-enzyme pathway currently known for the Kdo2-lipid A biosynthesis, which is mainly studied in E. coli and related bacteria, is the most optimized, derived form of this pathway.
Bacterial genomes used
All bacterial genomes were downloaded from National Center for Biotechnology Information website  and other sources (see Additional file 1). Complete genomes of 61 bacterial species were sampled from 17 phyla including seven species of Gram-positive bacteria as controls. All species we examined were listed in Additional file 1. Accession numbers and sequences used in this study are available upon request.
Searching for Kdo2-lipid A biosynthetic enzymes
BLAST similarity search
We started our similarity searches using the nine enzyme sequences obtained from the E. coli K12 genome as queries against the other 60 bacterial genomes (Additional file 1). For LpxI, the protein sequence from Caulobacter crescentus (Accession # NP_420717) was used as the query. blastp and tblastn  were used with a constant sample size (database length = 6,400,000; obtained as the average length of the query, 320 amino acids, multiplied by a constant genome size, 20,000). A cut-off E-value of 0.01 was used to define when there was no hit. In order to identify the orthologues, 'reciprocal' searches were performed against all genomes again using the top hit from each genome as the query.
Note that for Brachyspira hyodysenteriae (phylum Spirochaetes), although Bellgard et al.  mentioned that seven of the nine genes including lpxM was identified from the genome, we found only one copy of the LpxL protein and no other protein similar to LpxL.
Profile hidden Markov models (profile HMMs)
Each bacterial genome was further searched using profile HMMs. The protein sequences found by BLAST similarity searches were used to build the profile HMM for each enzyme using the w0.5 script of the Sequence Alignment and Modeling Software System (SAM, version 3.5; ). Since LpxH and LpxM were found only from a limited number of Proteobacteria, additional protein sequences of these enzymes (29 for LpxH, 42 for LpxM, and 47 for LpxI) were added from other bacterial species to build profile HMMs. A cut-off E-value of 0.01 and a constant sample size of 20,000 were used for profile HMM searches. To identify LpxH2 from the profile HMM search results, all sequences with the metallophosphoesterase superfamily signature D-Xn-GD-Xn-GNH(E/D)-Xn-H-Xn-GHXH were selected. Phylogenetic analysis (described next) was performed on the selected sequences to discriminate LpxH2 from other metallophosphoesterase. All sequences found in the cluster (78% bootstrap support) that includes proteins identified as the "UDP-2,3-diacylglucosamine hydrolase family" by InterPro (IPR010138; ) were selected as LpxH2 as well as LpxH (see Figure 3). Based on the phylogenetic locations, sequences belonging to LpxH and LpxH2 were decided.
Multiple alignment and phylogenetic tree inferences
Multiple alignments of protein sequences were generated using MAFFT with the FFT-NS-i algorithm (version 6.24; ). The multiple alignment of 16S rRNA sequences was reconstructed using cmalign (Infernal package, version 1.02; ). Phylogenetic trees were reconstructed by a maximum likelihood method RAxML (version 7.0.4; ). The WAG and the general time reversible substitution model both with Gamma distribution for rate heterogeneity among sites were used for protein and 16S rRNA sequences, respectively. Bootstrap analysis was done with 1,000 replications for all phylogenetic reconstructions.
Many previous studies based on various molecular data showed that among the phyla included in this study, Thermotogae, Dictyoglomi, Deinococcus-Thermus, and Chloroflex have diverged earlier than other groups. Thermotogae and Dictyoglomi, for example, were shown as outmost groups based on studies from 16S and 23S rRNAs, ribosomal and other proteins, as well as based on the gene order [e.g., [6, 41, 69, 70]]. Firmicutes as an outmost group has been also supported by some other studies [40, 71, 72]. Our phylogenetic analysis showed paraphyly among Firmicutes. Among bacterial group that have Kdo2-lipid A biosynthetic enzymes, we chose Dictyoglomi as the outgroup.
For the concatenated protein phylogeny for Figure 2A, six proteins are included: LpxA, LpxC, LpxD, LpxB, WaaA, and LpxL. Each set of protein sequences were aligned individually, then concatenated as one alignment. For LpxA and LpxD, one copy was included when there was more than one duplicated copy. For LpxL, Type 1 was included when there was more than one copy.
Structural analyses of LpxH and LpxH2 proteins
Figure S1A (Additional file 2) shows the five conserved blocks found in multiple alignments of the calcineurin-like phosphoesterase (PF00149), LpxH, and LpxH2 families, illustrated by sequence logos [73–75]. For PF00149, the Pfam seed alignment including 330 sequences was used. For LpxH and LpxH2, the alignments included 17 and 38 sequences, respectively, after removing sequences from closely related Pseudomonas species except for those from P. aeruginosa as representatives.
Structural modeling of LpxH and LpxH2 proteins
Structural modeling is performed using SWISS-MODEL Web server [76, 77]. Figure S1B (Additional file 2) shows the models for P. aeruginosa LpxH (Pa LpxH; the positions 2- 232 out of 240 amino acids were used for modeling), P. aeruginosa LpxH2 (Pa LpxH2; pos. 16- 253 out of 270 aa), Y. pestis LpxH (Yp LpxH; pos. 3-238 out of 240 aa), and Sinorhizobium meliloti LpxH2 (Sm LpxH2; pos. 15-252 out of 281 aa). These four proteins were chosen to represent following different modes of LpxH/LpxH2 proteins: "LpxH2 only" mode before LpxH/LpxH2 duplication (Sm LpxH2), "dual" mode after the duplication (Pa LpxH and Pa LpxH2), and "LpxH only mode" after losing LpxH2 (Yp LpxH; modeling was not possible with the E. coli LpxH) (see Figure 2 for the evolution of LpxH/LpxH2 proteins). The template structure selected by the server was a potential phosphoesterase, aq_1956 of Aquifex aeolicus vf5 (PDB ID: 2YVT, A chain) . The sequence similarities (E-values) of the four proteins against the template are as follows: 2 × 10-06 (Pa LpxH), 1.7 × 10-26 (Pa LpxH2), 3.70 × 10-5 (Yp LpxH), and 9.20 × 10-26 (Sm LpxH2). All LpxH and LpxH2 models have the root mean square deviations (RMSD) against the template structure of less than 1 Å for the colored regions and they are considered to be usable for modeling. Structural mining and graphic representation were done by PyMol .
Structural comparisons between LpxH and LpxH2
The first step of the LpxH/H2 reaction is to fix the UDP moiety (the substrate) to the enzyme. All four models show that blocks 3 and 5 form a narrow gate in the middle of the long cleft formed by blocks 1, 2, 3, and 4. The block 3 potentially plays a key role in this UDP-binding. An arginine (R), included in the block 3 of LpxH, has been found to bind directly to a uridine in a deoxyuridine triphosphate pyrophosphatase (dUTPase) [80, 81]. The guanidine moiety in the arginine interacts with the uridine in UDP. Németh-Pongrácz et al.  showed that removal of the guanidine moiety almost compromised the dUTPase activity. Therefore, it is plausible that the replacement of His253 (the position number is based on the alignment shown in the sequence logo) in LpxH2 to Arg253 in LpxH has brought significant increase in the enzyme activity in LpxH. Based on the studies on the dUTPase  as well as on metal-dependent phosphatases [e.g., [15, 17]], the roles of the conserved residues in LpxH and LpxH2 proteins can be inferred as follows. Negatively charged aspartic acids (D) in blocks 1, 2, and 3 form metal-binding sites (e.g., for magnesium). The block 4 provides an attacking-water. Finally the block 5 holds the negatively charged sugar moiety of UDP-2,3-diacylglucosamine.
LpxH2 does not have the key arginine in the block 3. However, our modeling showed that the block 4 of Pa LpxH2 is located inside of the molecule (not visible in Figure S1B, Additional file 2) and the red-colored area is occupied by the amino acids "NRW" providing an arginine residue. Both in LpxH2 from S. meliloti (Sm LpxH2) and in LpxH from Y. pestis (Yp LpxH), the block 4's are also inside of the molecules (not visible), and other amino acids ("GDW" and "DTD", respectively) occupy the corresponding surface areas (red colored; these amino acid positions are also marked with red open boxes under the sequence logos for LpxH and LpxH2). Although the block 4 is highly conserved among LpxH and LpxH2 proteins, only the first histidine (H) is conserved among the metalophosphoesterase family (see the PF00149 sequence logo in Figure S1A, Additional file 2). Its exact spatial position may not significantly affect the enzyme function. Alternatively, a neighboring histidine may be used for the same function. After the duplication, acquiring an arginine (R) in the block 3 must have improved the enzymatic activity of LpxH (Pa LpxH and Yp LpxH). Such change did not happen in the duplicated counterpart protein, Pa LpxH2. However, a mutation to gain another arginine happened in the area structurally equivalent to the block 4. This could also increase the enzymatic activity of LpxH2 in P. aeruginosa.
Raetz CRH: Biochemistry of endotoxins. Annual Review of Biochemistry. 1990, 59: 129-170. 10.1146/annurev.bi.59.070190.001021.
Raetz CRH, Whitfield C: Lipopolysaccharide endotoxins. Annual Review of Biochemistry. 2002, 71: 635-700. 10.1146/annurev.biochem.71.110601.135414.
Bartholomew JW, Mittwer T: The Gram stain. Bacteriological reviews. 1952, 16: 1-29.
Woese CR, Fox GE: Phylogenetic structure of the prokaryotic domain: the primary kingdoms. Proceedings of the National Academy of Sciences, USA. 1977, 74: 5088-5090. 10.1073/pnas.74.11.5088.
Ludwig W, Klenk HP: Overview: a phylogenetic backbone and taxonomic framework for prokaryotic systematics. Bergey's Manual of Systematic Bacteriology. Edited by: Boon DR, Castenholz RW. 2001, New York, NY: Springer, 1: 49-65. 2
Rappé MS, Giovannoni SJ: The uncultured microbial majority. Annu Rev Microbiol. 2003, 57: 369-394. 10.1146/annurev.micro.57.030502.090759.
Garrity GM, Brenner DJ, Krieg NR, Staley JR, (eds.): Bergey's Manual of Systematic Bacteriology. The Proteobacteria. 2005, New York, NY: Springer, 2
Barb AW, McClerren AL, Snehelatha K, Reynolds CM, Zhou P, Raetz CR: Inhibition of lipid A biosynthesis as the primary mechanism of CHIR-090 antibiotic activity in Escherichia coli. Biochemistry. 2007, 46: 3793-3802. 10.1021/bi6025165.
Onishi HR, Pelak BA, Gerckens LS, Silver LL, Kahan FM, Chen MH, Patchett AA, Galloway SM, Hyland SA, Anderson MS, et al: Antibacterial agents that inhibit lipid A biosynthesis. Science. 1996, 274: 980-982. 10.1126/science.274.5289.980.
Williams AH, Raetz CRH: Structural basis for the acyl chain selectivity and mechanism of UDP-N-acetylglucosamine acyltransferase. Proceedings of the National Academy of Sciences, USA. 2007, 104: 13543-13550. 10.1073/pnas.0705833104.
Bartling CM, Raetz CRH: Crystal structure and acyl chain selectivity of Escherichia coli LpxD, the N-acyltransferase of lipid A biosynthesis. Biochemistry. 2009, 48: 8672-8683. 10.1021/bi901025v.
Babinski KJ, Ribeiro AA, Raetz CRH: The Escherichia coli gene encoding the UDP-2,3-diacylglucosamine pyrophosphatase of lipid A biosynthesis. Journal of Biological Chemistry. 2002, 277: 25937-25946. 10.1074/jbc.M204067200.
Babinski KJ, Kanjilal SJ, Raetz CRH: Accumulation of the lipid A precursor UDP-2,3-diacylglucosamine in an Escherichia coli mutant lacking the lpxH gene. Journal of Biological Chemistry. 2002, 277: 25947-25956. 10.1074/jbc.M204068200.
Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, et al: The Pfam protein families database. Nucleic Acids Res. 2009
Zhuo S, Clemens JC, Stone RL, Dixon JE: Mutational analysis of a Ser/Thr phosphatase. Journal of Biological Chemistry. 1994, 269: 26234-26238.
Tyagi R, Shenoy AR, Visweswariah SS: Characterization of an evolutionarily conserved metallophosphoesterase that is expressed in the fetal brain and associated with the WAGR syndrome. Journal of Biological Chemistry. 2009, 284: 5217-5228. 10.1074/jbc.M805996200.
Miller DJ, Shuvalova L, Evdokimova E, Savchenko A, Yakunin AF, Anderson WF: Structural and biochemical characterization of a novel Mn2+-dependent phosphodiesterase encoded by the yfcE gene. Protein Science. 2007, 16: 1338-1348. 10.1110/ps.072764907.
Shenoy AR, Capuder M, Draškovič P, Lamba D, Visweswariah SS, Podobnik M: Structural and biochemical analysis of the Rv0805 cyclic nucleotide phosphodiesterase from Mycobacterium tuberculosis. Journal of Molecular Biology. 2007, 365: 211-225. 10.1016/j.jmb.2006.10.005.
Metzger LE, Raetz CR: An alternative route for UDP-diacylglucosamine hydrolysis in bacterial lipid A biosynthesis. Biochemistry. 2010, 49: 6715-6726. 10.1021/bi1008744.
Geurtsen J, Angevaare E, Janssen M, Hamstra HJ, ten Hove J, de Haan A, Kuipers B, Tommassen J, van der Ley P: A novel secondary acyl chain in the lipopolysaccharide of Bordetella pertussis required for efficient infection of human macrophages. J Biol Chem. 2007, 282: 37875-37884. 10.1074/jbc.M706391200.
Raetz CR, Guan Z, Ingram BO, Six DA, Song F, Wang X, Zhao J: Discovery of new biosynthetic pathways: the lipid A story. J Lipid Res. 2009, 50 (Suppl): S103-108. 10.1194/jlr.R800060-JLR200.
Clementz T, Zhou Z, Raetz CRH: Function of the Escherichia coli msbB gene, a multicopy suppressor of htrB knockouts, in the acylation of lipid A. Journal of Biological Chemistry. 1997, 272: 13353-11360.
Six DA, Carty SM, Guan Z, Raetz CR: Purification and mutagenesis of LpxL, the lauroyltransferase of Escherichia coli lipid A biosynthesis. Biochemistry. 2008, 47: 8623-8637. 10.1021/bi800873n.
Karow M, Georgopoulos C: Isolation and characterization of the Escherichia coli msbB gene, a multicopy suppressor of null mutations in the high-temperature requirement gene htrB. Journal of Bacteriology. 1992, 174: 702-710.
Karow M, Fayet O, Cegielska A, Ziegelhoffer T, Georgopoulos C: Isolation and characterization of the Escherichia coli htrB gene, whose product is essential for bacterial viability above 33 degrees C in rich media. J Bacteriol. 1991, 173: 741-750.
Vorachek-Warren MK, Ramirez S, Cotter RJ, Raetz CR: A triple mutant of Escherichia coli lacking secondary acyl chains on lipid A. J Biol Chem. 2002, 277: 14194-14205. 10.1074/jbc.M200409200.
Lee NG, Sunshine MG, Engstrom JJ, Gibson BW, Apicella MA: Mutation of the htrB locus of Haemophilus influenzae nontypable strain 2019 is associated with modifications of lipid A and phosphorylation of the lipo-oligosaccharide. Journal of Biological Chemistry. 1995, 270: 27151-27159. 10.1074/jbc.270.45.27151.
Carty SM, Sreekumar KR, Raetz CR: Effect of cold shock on lipid A biosynthesis in Escherichia coli. Induction At 12 degrees C of an acyltransferase specific for palmitoleoyl-acyl carrier protein. J Biol Chem. 1999, 274: 9677-9685. 10.1074/jbc.274.14.9677.
Montminy SW, Khan N, McGrath S, Walkowicz MJ, Sharp F, Conlon JE, Fukase K, Kusumoto S, Sweet C, Miyake K, et al: Virulence factors of Yersinia pestis are overcome by a strong lipopolysaccharide response. Nat Immunol. 2006, 7: 1066-1073. 10.1038/ni1386.
Keleti G, Sykora JL: Production and properties of cyanobacterial endotoxins. Applied and Environmental Microbiology. 1982, 43: 104-109.
Weckesser J, Jurgens UJ: Cell walls and external layers. Methods in Enzymology. 1988, 67: 173-188. full_text.
Trent SM: Biosynthesis, transport, and modification of lipid A. Biochemistry and Cell Biology. 2004, 82: 71-86. 10.1139/o03-070.
Snyder SD, Brahamsha B, Azadi P, Palenki B: Structure of compositionally simple lipopolysaccharide from marine Synechococcus. Journal of Bacteriology. 2009, 191: 5499-5509. 10.1128/JB.00121-09.
Stewart I, Schluter PJ, Shaw GR: Cyanobacterial lipopolysaccharides and human health - a review. Environmental Health. 2006, 5: 7-10.1186/1476-069X-5-7.
Demerec M, Ozeki H: Tests for Allelism among Auxotrophs of Salmonella Typhimurium. Genetics. 1959, 44: 269-278.
Itoh T, Takemoto K, Mori H, Gojobori T: Evolutionary instability of operon structures disclosed by sequence comparisons of complete microbial genomes. Mol Biol Evol. 1999, 16: 332-346.
Overbeek R, Fonstein M, D'Souza M, Pusch GD, Maltsev N: The use of gene clusters to infer functional coupling. Proc Natl Acad Sci USA. 1999, 96: 2896-2901. 10.1073/pnas.96.6.2896.
Mohan S, Kelly TM, Eveland SS, Raetz CR, Anderson MS: An Escherichia coli gene (FabZ) encoding (3R)-hydroxymyristoyl acyl carrier protein dehydrase. Relation to fabA and suppression of mutations in lipid A biosynthesis. J Biol Chem. 1994, 269: 32896-32903.
Brochier C, Bapteste E, Moreira D, Philippe H: Eubacterial phylogeny based on translational apparatus proteins. Trends in Genetics. 2002, 18: 1-5. 10.1016/S0168-9525(01)02522-7.
Ciccarelli FD, Doerks T, Mering Cv, Creevey CJ, Snel B, Bork P: Toward automatic reconstruction of a highly resolved tree of life. Science. 2006, 311: 1283-1287. 10.1126/science.1123061.
Battistuzzi FU, Hedges SB: A major clade of prokaryotes with ancient adaptations to life on land. Molecular Biology and Evolution. 2009, 26: 335-343. 10.1093/molbev/msn247.
Gao B, Mohan R, Gupta RS: Phylogenomics and protein signatures elucidating the evolutionary relationships among the Gammaproteobacteria. Int J Syst Evol Microbiol. 2009, 59: 234-247. 10.1099/ijs.0.002741-0.
Lin M, Rikihisa Y: Ehrlichia chaffeensis and Anaplasma phagocytophilum lack genes for lipid A biosynthesis and incorporate cholesterol for their survival. Infect Immun. 2003, 71: 5324-5331. 10.1128/IAI.71.9.5324-5331.2003.
Wu M, Sun LV, Vamathevan J, Riegler M, Deboy R, Brownlie JC, McGraw EA, Martin W, Esser C, Ahmadinejad N, et al: Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements. PLoS Biol. 2004, 2: E69-10.1371/journal.pbio.0020069.
Aderem A, Ulevitch RJ: Toll-like receptors in the induction of the innate immune response. Nature. 2000, 406: 782-787. 10.1038/35021228.
Janssens S, Beyaert R: Role of Toll-like receptors in pathogen recognition. Clin Microbiol Rev. 2003, 16: 637-646. 10.1128/CMR.16.4.637-646.2003.
Beck G, Habicht GS, Benach JL, Coleman JL: Chemical and biologic characterization of a lipopolysaccharide extracted from the Lyme disease spirochete (Borrelia burgdorferi). Journal of Infectious Diseases. 1985, 152: 108-117.
Takayama K, Rothenberg RJ, Barbour AG: Absence of lipopolysaccharide in the Lyme disease spirochete, Borrelia burgdorferi. Infection and Immunity. 1987, 55: 2311-2313.
Dahle UR, Tronstad L, Olsen I: 3-hydroxy fatty acids in a lipopolysaccharide-like material from Treponema denticola strain FM. Endodontics and Dental Traumatology. 1996, 12: 205-209. 10.1111/j.1600-9657.1996.tb00515.x.
Schultz CP, Wolf V, Lange R, Mertens E, Wecke J, Naumann D, Zähringer U: Evidence for a new yype of outer membrane lipid in oral spirochete Treponema denticola. Functioning permeation barrier without lipopolysaccharides. Journal of Biological Chemistry. 1998, 273: 15661-15666. 10.1074/jbc.273.25.15661.
Fraser CM, Casjens S, Huang WM, Sutton GG, Clayton R, Lathigra R, White O, Ketchum KA, Dodson R, Hickey EK, et al: Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi. Nature. 1997, 390: 580-586. 10.1038/37551.
Fraser CM, Norris SJ, Weinstock GM, White O, Sutton GG, Dodson R, Gwinn M, Hickey EK, Clayton R, Ketchum KA, et al: Complete genome sequence of Treponema pallidum, the syphilis spirochete. Science. 1998, 281: 375-388. 10.1126/science.281.5375.375.
Seshadri R, Myers GS, Tettelin H, Eisen JA, Heidelberg JF, Dodson RJ, Davidsen TM, DeBoy RT, Fouts DE, Haft DH, et al: Comparison of the genome of the oral pathogen Treponema denticola with other spirochete genomes. Proc Natl Acad Sci USA. 2004, 101: 5646-5651. 10.1073/pnas.0307639101.
Bellgard MI, Wanchanthuek P, La T, Ryan K, Moolhuijzen P, Albertyn Z, Shaban B, Motro Y, Dunn DS, Schibeci D, et al: Genome sequence of the pathogenic intestinal spirochete Brachyspira hyodysenteriae reveals adaptations to its lifestyle in the porcine large intestine. PLoS One. 2009, 4: e4641-10.1371/journal.pone.0004641.
Ren SX, Fu G, Jiang XG, Zeng R, Miao YG, Xu H, Zhang YX, Xiong H, Lu G, Lu LF, et al: Unique physiological and pathogenic features of Leptospira interrogans revealed by whole-genome sequencing. Nature. 2003, 422: 888-893. 10.1038/nature01597.
Arp DJ, Stein LY: Metabolism of inorganic N compounds by ammonia-oxidizing bacteria. Critical Reviews in Biochemistry and Molecular Biology. 2003, 38: 471-495. 10.1080/10409230390267446.
Chain P, Lamerdin J, Larimer F, Regala W, Lao V, Land M, Hauser L, Hooper A, Klotz M, Norton J, et al: Complete genome sequence of the ammonia-oxidizing bacterium and obligate chemolithoautotroph Nitrosomonas europaea. J Bacteriol. 2003, 185: 2759-2773. 10.1128/JB.185.9.2759-2773.2003.
Norton JM, Klotz MG, Stein LY, Arp DJ, Bottomley PJ, Chain PS, Hauser LJ, Land ML, Larimer FW, Shin MW, et al: Complete genome sequence of Nitrosospira multiformis, an ammonia-oxidizing bacterium from the soil environment. Appl Environ Microbiol. 2008, 74: 3559-3572. 10.1128/AEM.02722-07.
Arp DJ, Chain PS, Klotz MG: The impact of genome analyses on our understanding of ammonia-oxidizing bacteria. Annu Rev Microbiol. 2007, 61: 503-528. 10.1146/annurev.micro.61.080706.093449.
Raetz CR, Reynolds CM, Trent MS, Bishop RE: Lipid A modification systems in gram-negative bacteria. Annu Rev Biochem. 2007, 76: 295-329. 10.1146/annurev.biochem.76.010307.145803.
Armstrong MT, Theg SM, Braun N, Wainwright N, Pardy RL, Armstrong PB: Histochemical evidence for lipid A (endotoxin) in eukaryote chloroplasts. FASEB J. 2006, 20 (12): 2145-2146. 10.1096/fj.05-5484fje.
National Center for Biotechnology Information. [http://www.ncbi.nlm.nih.gov/]
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.
Hughey R, Krogh A: Hidden Markov models for sequence analysis: extension and analysis of the basic method. Computer Applications in the Biosciences. 1996, 12: 95-107.
Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Das U, Daugherty L, Duquenne L, et al: InterPro: the integrative protein signature database. Nucleic Acids Res. 2009, D211-215. 10.1093/nar/gkn785. 37 Database
Katoh K, Toh H: Recent developments in the MAFFT multiple sequence alignment program. Briefings in Bioinformatics. 2008, 9: 286-298. 10.1093/bib/bbn013.
Eddy SR: Computational analysis of RNAs. Cold Spring Harbor Symposia on Quantitative Biology. 2006, 71: 117-128. 10.1101/sqb.2006.71.003.
Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006, 22: 2688-2690. 10.1093/bioinformatics/btl446.
Baldauf SL, Palmer JD, Doolittle FW: The root of the universal tree and the origin of eukaryotes based on elongation factor phylogeny. Proceedings of the National Academy of Sciences, USA. 1996, 93: 7749-7754. 10.1073/pnas.93.15.7749.
Fitz-Gibbon ST, House CH: Whole genome-based phylogenetic analysis of free-living microorganisms. Nucleic Acids Research. 1999, 27: 4218-4222. 10.1093/nar/27.21.4218.
Gupta RS: The phylogeny of proteobacteria: relationships to other eubacterial phyla and eukaryotes. FEMS Microbiology Reviews. 2000, 24: 367-402. 10.1111/j.1574-6976.2000.tb00547.x.
Lake JA, Skophammer RG, Herbold CW, Servin JA: Genome beginnings: rooting the tree of life. Philos Trans R Soc Lond B Biol Sci. 2009, 364: 2177-2185. 10.1098/rstb.2009.0035.
Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14: 1188-1190. 10.1101/gr.849004.
Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990, 18: 6097-6100. 10.1093/nar/18.20.6097.
WebLogo Web server. [http://weblogo.berkeley.edu/]
Arnold K, Bordoli L, Kopp J, Schwede T: The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics. 2006, 22: 195-201. 10.1093/bioinformatics/bti770.
SWISS-MODEL Web server. [http://swissmodel.expasy.org/]
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Research. 2000, 28: 235-242. 10.1093/nar/28.1.235.
DeLano WL: The PyMOL Molecular Graphics System. 2002, San Carlos, CA: DeLano Scientific
Tarbouriech N, Buisson M, Seigneurin JM, Cusack S, Burmeister WP: The monomeric dUTPase from Epstein-Barr virus mimics trimeric dUTPases. Structure. 2005, 13: 1299-1310. 10.1016/j.str.2005.06.009.
Homma K, Moriyama H: Crystallization and crystal-packing studies of Chlorella virus deoxyuridine triphosphatase. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2009, 65: 1030-1034. 10.1107/S1744309109034459.
Németh-Pongrácz V, Barabas O, Fuxreiter M, Simon I, Pichova I, Rumlova M, Zabranska H, Svergun D, Petoukhov M, Harmat V, et al: Flexible segments modulate co-folding of dUTPase and nucleocapsid proteins. Nucleic Acids Res. 2007, 35: 495-505. 10.1093/nar/gkl1074.
We thank anonymous reviewers for their constructive comments. This work was supported in part by Nebraska Tobacco Settlement Biomedical Research Development Funds to HM.
SOO carried out bioinformatics analysis and drafted the manuscript. HM carried out structural analysis and helped with interpretation of the data and drafting of the manuscript. RLP and ENM conceived of the study and helped with interpretation of the data and drafting of the manuscript. All authors read and approved of the manuscript.
About this article
Cite this article
Opiyo, S.O., Pardy, R.L., Moriyama, H. et al. Evolution of the Kdo2-lipid A biosynthesis in bacteria. BMC Evol Biol 10, 362 (2010). https://doi.org/10.1186/1471-2148-10-362
- Duplication Event
- Bacterial Genome
- Biosynthetic Enzyme
- Sequence Logo