Skip to main content

A phylogenetic mosaic plastid proteome and unusual plastid-targeting signals in the green-colored dinoflagellate Lepidodinium chlorophorum



Plastid replacements through secondary endosymbioses include massive transfer of genes from the endosymbiont to the host nucleus and require a new targeting system to enable transport of the plastid-targeted proteins across 3-4 plastid membranes. The dinoflagellates are the only eukaryotic lineage that has been shown to have undergone several plastid replacement events, and this group is thus highly relevant for studying the processes involved in plastid evolution. In this study, we analyzed the phylogenetic origin and N-terminal extensions of plastid-targeted proteins from Lepidodinium chlorophorum, a member of the only dinoflagellate genus that harbors a green secondary plastid rather than the red algal-derived, peridinin-containing plastid usually found in photosynthetic dinoflagellates.


We sequenced 4,746 randomly picked clones from a L. chlorophorum cDNA library. 22 of the assembled genes were identified as genes encoding proteins functioning in plastids. Some of these were of green algal origin. This confirms that genes have been transferred from the plastid to the host nucleus of L. chlorophorum and indicates that the plastid is fully integrated as an organelle in the host. Other nuclear-encoded plastid-targeted protein genes, however, are clearly not of green algal origin, but have been derived from a number of different algal groups, including dinoflagellates, streptophytes, heterokonts, and red algae. The characteristics of N-terminal plastid-targeting peptides of all of these genes are substantially different from those found in peridinin-containing dinoflagellates and green algae.


L. chlorophorum expresses plastid-targeted proteins with a range of different origins, which probably arose through endosymbiotic gene transfer (EGT) and horizontal gene transfer (HGT). The N-terminal extension of the genes is different from the extensions found in green alga and other dinoflagellates (peridinin- and haptophyte plastids). These modifications have likely enabled the mosaic proteome of L. chlorophorum.


Establishment of the plastid by endosymbiosis of a cyanobacterium was a key event in the evolutionary history of eukaryotes. An ancient primary endosymbiosis gave rise to photosynthethic organelles in members of the Viridiplantae, rhodophytes (red algae) and glaucophytes [15]. Such primary plastids (bound by two membranes) were subsequently spread to other protist lineages through a series of eukaryote-eukaryote secondary and tertiary endosymbiotic events, resulting in a large diversity of photosynthetic lineages [6, 7].

Both red and green algae have been involved in secondary endosymbioses. The exact number of secondary endosymbiotic events involving red algae remains controversial [812], with theories ranging from a single uptake in the common ancestor of all algal lineages harboring secondary plastids derived from a red alga (the chromalveolate hypothesis) to serial independent uptakes of red algal plastids or transfer of red alga-derived plastids between algal groups [4, 8, 13, 14]. Green plastids of secondary origin are known to be found in three distinct algal lineages: the chlorarachniophytes, the photosynthetic euglenids and in the dinoflagellate genus Lepidodinium. The chlorarachniophytes are marine phototrophic amoeboflagellates belonging to the lineage Cercozoa within the eukaryote 'supergroup' Rhizaria [1517], while the euglenids are the only photosynthetic lineage in the 'supergroup' Excavata [1820]. The dinoflagellate genus Lepidodinium contains a secondary chlorophyll a and b containing plastid derived from a green alga [2124]. The distant evolutionary relationships between the chlorarachniophyte, euglenid and dinoflagellate host lineages indicate that their plastids originate from three distinct endoymbiotic events [7, 18, 25, 26].

One of the main processes during plastid establishment is massive transfer of genes from the endosymbiont to the host nucleus, and the invention of protein trafficking machineries that involve protein synthesis in the host cytoplasm and direction of proteins to the plastids [27, 28]. Consequently, the plastids themselves encode only a minor fraction of the genes necessary for proper plastid function. In phototrophic organisms harboring primary plastids, these proteins are targeted back to the plastid by a transit peptide that directs the proteins across the double membrane [29]. In algae with secondary and tertiary plastids, proteins need (i) to have the N-terminal bipartite presequences that code for a signal peptide that directs the protein to the host endomembrane system and a transit peptide to lead the protein to the plastid, and (ii) to be transported across 3-4 membrane layers resulting from the establishment of the organelle [5, 30].

Intriguingly, several algal lineages, such as the red algae Cyanidioschyzon merolae, the land plant Arabidopsis thaliana, various chromalveolates and the chlorarachniophyte Bigelowiella natans, seem to have nucleus-encoded plastid-targeted protein genes originating from phylogenetically divergent algae [15, 3136]. The plastid proteomes of such organisms are therefore not acquired only from their current plastids, but may also include proteins originating from other sources.

Dinoflagellates are unique among eukaryotic lineages in having undergone several plastid replacement events [3741], and are therefore a key source of insights into the process of transformation of an endosymbiotic alga into an organelle. The most canonical dinoflagellate plastid type contains chlorophyll c and the pigment peridinin and is likely the ancestral plastid of the group [9, 42]. However, in several dinoflagellate lineages the peridinin-containing plastid has been lost, giving rise to a range of heterotrophic species, or has been replaced by plastids acquired from other algal groups (i.e. haptophytes, heterokonts, cryptomonads, and green algae). The level of plastid integration varies among dinoflagellates, ranging from fully integrated permanent plastid replacements such as the tertiary haptophyte-derived plastids found in Karenia and Karlodinium species [31] to less integrated plastids (i.e. permanent plastids that have not been degenerated) or temporary associations [39, 41]. Lepidodinium is the only dinoflagellate genus that has replaced the red alga-derived peridinin-containing plastid by a green plastid [21, 37], and thus represents a model for studying events occurring in the process of plastid replacement. In this context, an important issue is to understand how protein-targeting signals of pre-existing proteins from the original plastid have adapted to the green plastid environment. Furthermore, another key question concerns the evolutionary origins of plastid-targeted protein genes in the host genome in Lepidodinium. In fact, the plastid-targeted GAPDH gene in L. chlorophorum was recently shown to be of haptophyte origin [24], demonstrating that plastid-targeted genes in this species may originate from sources other than the original peridinin-containing plastid or the current green algal plastid.

We addressed these questions by investigating the nuclear-encoded plastid proteome of L. chlorophorum. We present 22 plastid-targeted genes identified in a cDNA library. The complete N-terminal extensions were obtained by 5'- RACE experiments of 20 of the 22 plastid-targeted genes. Phylogenetic analyses of the plastid-targeted protein genes reveal a mosaic plastid proteome in L. chlorophorum; many of the L. chlorophorum sequences showed apparent affinities to homologs from green algae, haptophytes, heterokonts, and peridinin-containing dinoflagellates. The N-terminal presequences (likely associated with plastid targeting) of L. chlorophorum differ from those of other dinoflagellates in having relatively low levels of serine and threonine residues, a high level of acidic residues and possibly an abnormally long hydrophilic region near the N-terminal end.


Plastid-targeted proteins in L. chlorophorum

4,746 clones from the L. chlorophorum cDNA-library were sequenced from the 5'-end, and 22 plastid-targeted protein genes were identified (Table 1). These correspond to a wide range of plastid-associated processes, and include components of photosystem II (PsbO, PsbP, PsbR), the gamma subunit of ATP synthase, ferredoxin B, ferredoxin NADP reductase, the small subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO), RuBisCO activase, fructose-1,6-bisphosphatase, phosphoribulokinase and several isoforms of light harvesting proteins. In addition, three genes encoding enzymes involved in the non-mevalonate isopentyl disphosphate (isoprenoid biosynethesis) pathway were identified.

Table 1 Plastid-targeted genes detected in the Lepidodinium chlorophorum cDNA library

The plastid-targeted genes have several evolutionary origins

Phylogenies of all identified plastid-targeted proteins were inferred including sequences from members of as many major photosynthetic lineages as possible. Both Maximum likelihood, Bayesian analyses and distance methods were applied, resulting in qualitatively similar results. In five phylogenies - PsbO, PsbP, PsbR, RuBisCO activase and plastid lipid associated protein kinase (PAP kinase) - the L. chlorophorum homologs were unambiguously related to those of green algae and land plants (see Fig. 1 and Table 1). In the PsbO phylogeny (Fig. 1A), the L. chlorophorum sequence clustered within green algae and landplants, and was differentiated from red algal sequences with the maximum bootstrap support. PsbP, PsbR, RuBisCO activase and PAP kinase encode plastid proteins found exclusively (or almost exclusively) in green algae and land plants: (Fig. 1B-E) [43]. We also found genes possibly originating from the ancient peridinin plastid exemplified by the phylogenies of ferredoxin NADP reductase (Fig. 2B) and 1-deoxy-D-xylolose 5-phosphate reductoisomerase (Fig. 2A) where L. chlorophorum formed strongly supported clades with the peridinin-containing dinoflagellates (95% and 100% bootstrap supports). In the phylogeny of 4-disphophocytidyl 2C methyl-D erythriol kinase, L. chlorophorum branched with K. veneficum with high support (95%) (Fig. 2C and Table 1). In this tree, the two dinoflagellates grouped with heterokonts to the exclusion of haptophytes, even though the plastid in K. veneficum is of haptophyte origin indicating that both K. veneficum and L. chlorophorum either still utilize the homolog from the peridinin-containing plastids or that they both recruited this gene from the same or very closely related organisms.

Figure 1
figure 1

Genes of green algal origin. Maximum likelihood trees inferring a green-algal origin of L. chlorophorum of PsbO, RuBisCO activase, PsbP, PsbR and PAP protein kinase. The phylogenies were inferred using RAxML. Bootstrap values >50%are indicated on the branches. Green and red lineages are indicated by color. Secondary green algae are in bold. Filled dots indicate 100% bootstrap support. Bayesian posterior probability values are indicated for some of the most important splits.

Figure 2
figure 2

Genes of peridinin- and heterokont origin. Maximum likelihood trees of 1-deoxy-D-xylolose-5 phosphate reductoisomerase (Dvr1), ferredoxin NADP oxireductase, 4-disphophocytidyl-2C-methyl-D-erythriol kinase demonstrating that these genes are of peridinin dinoflagellate origin. Csp41 clusters with heterokonts. All trees were inferred using RAxML. Bootstrap values >50%are indicated on the branches. Green and red lineages are indicated by color and secondary green algae are in bold. Filled dots indicate 100% bootstrap support. Bayesian posterior probability values are indicated for some of the most important splits.

Some of the other phylogenies strongly indicated that L. chlorophorum also harbors plastid-targeted protein genes that originate from sources other than green algae and the peridinin-containing dinoflagellate plastid. The phylogeny of RuBisCO small subunit (RbcS, Fig. 3A) shows that the L. chlorophorum homolog clusters with streptophytes with 81% bootstrap support; this origin is further supported by the presence of a streptophyte-specific insertion in the L. chlorophorum gene (Fig. 3C). In phylogenies of phosphoribulokinase (PRK, Fig. 3B) and csp41 (Fig. 2D), the L. chlorophorum homolog clustered with those of heterokonts. For csp41, this phylogenetic affinity was supported by 100% bootstrap support. The position of L. chlorophorum in the PRK phylogeny, however, is without significant statistical support, but the L. chlorophorum homolog bears a heterokont-specific insertion strongly indicating affinity to the homologs of heterokonts (Fig. 3D). A sequence identical to that of the previously reported plastid-targeted GAPDH gene was identified. This gene has previously been shown to be of haptophyte origin [24]. In four cases (fructose 1,6 bisphosphate, chloroplast stability factor hcf 136, transketolase and ATP synthase gamma, see Additional file 1 Figure S1 and Table 1), the L. chlorophorum homologs were excluded from those of green algae with high support, but their precise positions among red-algal derived homologs are unclear. In four of the phylogenies (Sedoheptulose bisphosphatase, 3,8 divinyl-protochlorophyllide, DnaJ and Ferredoxin B), the position of the L. chlorphorum gene-homolog remained unresolved due to insufficient taxon sampling and/or phylogenetic information (Additional file 2 Figure S2 and Table 1). Several isoforms of Chlorophyll a/b binding protein and Chlorophyll a/c binding proteins were also identified (Table 1).

Figure 3
figure 3

Phylogenies supported by shared sequence characteristics. A: Maximum likelihood trees inferring a heterokont origin of RuBisCO small subunit and phosphoribulokinase in L. chlorophorum. B: Alignment of the genes demonstrating shared sequence characteristics between L. chlorophorum and streptophytes and heterokonts, respectively. Filled dots indicate 100% bootstrap support. Bayesian posterior probability values are indicated for some of the most important splits.

The N-terminal extensions (presequences) of L. chlorophorumare different from those found in other dinoflagellates and green algae

Based on the results of 5' RACE, the putative N-terminal presequences of 20 of the 22 plastid-targeted protein genes were identified. The dinoflagellate-specific spliced leader sequence [44] was detected in the 5'-end of all sequences, confirming that the entire 5'-end had been retrieved. High S-score values of neural networks implemented in the program SignalP 3.0 (SignalP-NN) [45] were obtained in all of the N-terminal extensions, indicating the presence of a signal peptide and therefore of a secretory pathway (Additional file 3 Figure S3, hydrophobicity plots). The transit peptide cleavage sites were also estimated using SignalP. Only three of the sequences (csp41, dvr1 and psbO) gave the definite cleavage sites of signal peptides when estimating with both SignalP-NN and hidden Markov models, also implemented in SignalP 3.0 (SignalP-HMM). In the remaining sequences, the two programs (SignalP-NN and SignalP-HMM) often inferred different positions of the cleavage site and the statistical significance of the estimated sites was rather low in several cases. This may be caused by an abnormally long hydrophilic region at the beginning of the N-terminal extension. Such a hydrophilic region has not previously been detected in dinoflagellates, but a similar feature has been reported from the early diverging alveolate Perkinsus [46]. After removing the hydrophilic region of N-terminal extensions of plastid-targeted proteins other than csp41, dvr1 and psbO, the probability for the cleavage sites was increased with SignalP (Fig. S3), but still remained relatively low. Although the precise cleavage positions of the 17 plastid-targeted proteins could not be conclusively determined in this study, it is likely that the signal peptides are cut off after targeting plastid ER, and we consider the putative transit peptide cleavage sites of these proteins as reasonable estimates for investigating the transit peptide characteristics. A logoplot based on the putative cleavage sites estimated using SignalP-HMM is shown in Additional file 4 Figure S4 - illustrating that no conserved sequence motif was found. Additional file 5 Figure S5 displays the distribution of amino acids in the estimated transit peptides and compares the distribution in transit peptides of various eukaryotic lineages [47]. The putative L. chlorophorum transit peptides have lower levels of serine and threonine residues compared to those of green algae, while the proportion of acidic residues is increased compared to most other transit peptides. Conserved cleavage site sequences and stop membrane anchors (STMAs) were not detected in the estimated transit peptides.


L. chlorophorumcontains a true plastid with a phylogenetically hybrid plastid proteome

The genus Lepidodinium is the only known dinoflagellate lineage that possesses a secondary plastid of green algal origin rather than the red algal derived peridinin-containing plastid found in most photosynthetic dinoflagellates [2224, 37]. The green L. chlorophorum plastid is maintained for years in laboratory cultures. This, together with the identified plastid-targeted genes of probable green algal origin encoded by the nucleus of L. chlorophorum presented in this study, strongly indicates that the green algal plastid is a permanent and fully integrated organelle. Among the transferred plastid-targeted genes are genes characteristic of the green lineages, such as RuBisCO activase and components of the photosynthetic pathway (psbR, psbP), which have never previously been detected in dinoflagellates. All L. chlorophorum plastid-targeted proteins have been extended by a signal peptide to transport the protein across the additional membranes resulting from the secondary endosymbiosis, and are thus likely functional and transported to the organelle, demonstrating that the plastid is indeed stable and dependent on the host. This means that full plastid replacement events have occurred on at least two occasions among dinoflagellates (Lepidodinium and Karlodinium/Karenia) [9, 23, 24, 38].

In addition to genes originating from the current green plastid, we also identified putative plastid-targeted genes (with N-terminal extensions) that clustered with genes from the peridinin plastid in phylogenetic trees, implying that proteins from the ancient endosymbiont are still functional and now targeted to the new plastid. This shows that nuclear-encoded plastid-targeted protein genes can be retained even though the plastid itself is replaced, providing further support for the view that a plastid replacement event does not necessarily lead to loss of all characters from the previous plastid as earlier predicted [31, 48, 49]. The retention of "old" genes in the new plastid proteome mirrors the results of studies of the transcriptome of K. veneficum and K. brevis, which harbor tertiary haptophyte-derived plastids [31, 33]. Together, these results show that combining genes transferred via endosymbiotic gene transfer (EGT) from the new plastid and recycled genes from the ancient endosymbiont is a general trait among dinoflagellate lineages that have undergone repeated uptake and loss of plastids. This supports that the ancient plastid plays a role in the replacement process and likely facilitates integration of the new plastid [4].

It is unclear whether the original peridinin plastid of L. chlorophorum was replaced while still photosynthetically functional, or if the new plastid was established after the lineage had lost its ability to photosynthesize or lost its plastids altogether. In K. veneficum, the apparent lack of peridinin-plastid derived photosynthesis genes suggests that its ancestor was heterotrophic when the new haptophyte plastid was acquired, and that the relic plastid was retained at that time for anabolic purposes [31]. In contrast, identification of the ancient dinoflagellate form of ferredoxin NADP reductase (an iron-sulfur protein involved in several photosynthetic reactions) in L. chlorophorum implies that the green plastid was engulfed either while the host was still photosynthetic, or very soon after its ability to photosynthesize was lost.

Horizontal or endosymbiotic gene transfer?

The retention of nuclear-encoded genes from the peridinin-containing plastid in the L. chlorophorum plastid proteome is analogous with the situation in K. veneficum and K. brevis [31, 33]. However, the L. chlorophorum nucleus also contains plastid-targeted genes of other origins, including streptophytes, heterokonts and haptophytes, indicating that this lineage may be substantially affected by horizontal gene transfer (HGT). While HGT is well known in prokaryotes and have been identified in other dinoflagellate lineages [50], the significant impact of gene transfers in eukaryotic evolutionary history has only recently been recognized [51, 52]. Another plastid proteome that seems to be heavily affected by HGT is the chlorarachniophyte B. natans, in which about 20% of the plastid-targeted genes originate from sources other than its current green algal plastid [15]. The high degree of 'foreign' genes in B. natans has been suggested to be a result of horizontal transfer associated with the mixotrophic mode of nutrition of this organism [15]. The analogous pattern of horizontally transferred genes in dinoflagellates might also be due to this type of life style [53]. Recently, a distinct footprint of a green algal lineage (the prasinophytes) was revealed in major chromalveolate lineages, suggesting that a cryptic endosymbiont was present in the their ancestor [34]. If this theory is correct, at least 3 plastids (green, red, green) have been involved in shaping the plastid proteome of L. chlorophorum. Additionally, the high amount of foreign genes observed in L. chlorophorum and B. natans may reflect an ever more complex evolutionary history where genes were acquired via gene transfers from a series of cryptic endosymbionts. However, whether the L. chlorophorum genes originating from other sources than the known plastids were supplied by additional endosymbionts (EGT) or from prey (HGT) is difficult to infer without a larger dataset, and remains unclear.

Import systems in mixed chloroplast proteomes

The green plastid in L. chlorophorum has changed environment from being a primary plastid with two surrounding membranes in green alga to becoming a secondary plastid in dinoflagellates with four enveloping membranes. Accordingly, we found that the presequences of the plastid-targeted proteins in L. chlorophorum are different from those usually observed in green algae (see Additional file 5 Figure S5).

Green algal targeting-transit peptides have elevated levels of hydroxylated Ser and Thr residues, a net positive charge due to depletion of acidic residues, and elevated levels of Ala [47]. However, in L. chlorophorum, the transit peptides of plastid-targeted proteins are depleted of Ser and Thr residues relative to those of green algae. The N-terminal extension of plastid-targeted proteins from peridinin-containing dinoflagellates comprises a FVA/SP-motif and two classes of sequences are recognized, of which class I contains an additional hydrophobic region that functions as a stop transfer membrane anchor (STMA) downstream of the transit peptide region (probably involved in a transportation pathway through the Golgi apparatus), while class II is deprived of a transmembrane region [54]. We did not observe the conserved FVAP motif common in dinoflagellates with peridinin-containing plastids (this is, however, somewhat uncertain due to the difficulties in estimating the cleavage sites). STMAs were not observed and the levels of acidic residues were higher than those found in other transit peptides (except for K. veneficum). The putative change of transit peptide characteristics is also seen in K. veneficum, whose transit signals are different from those of peridinin-containing plastids and those of haptophyte plastids [31]. It is interesting to note that both dinoflagellate lineages with replaced plastids most likely use a new type of transit peptide rather than recycling the transit peptide from the ancestral condition or the endosymbiont. These modifications may have been driven by the co-existence of two divergent plastids after the uptake of a new plastid, which required a way to discriminate between the plastid genes [31]. Accordingly, it is intriguing to speculate that modification of N-terminal presequences is related to the mosaic plastid proteomes which have co-evolved and adapted to the trafficking machinery of the host [55].


In this study, we investigated the evolutionary origin of the plastid-targeted proteins of L. chlorophorum and analyzed their N-terminal presequences that are likely associated with plastid targeting. The plastid proteome of L. chlorophorum is a mixture of proteins with different phylogenetic origins. This hybrid proteome is seemingly shaped by the current plastid, the previous peridinin-type plastid and by horizontally transferred genes from various algal lineages. However, additional cryptic endosymbioses involving additional algal groups (streptophytes, heterokonts, haptophytes) cannot be ruled as the causes of phylogentic proteome mosaics. The L. chlorophorum transit peptides differ from the transit peptides found in green algae or other dinoflagellates. The characteristics of the altered transit peptides found in L. chlorophorum as well as Karenia/Karlodinium are likely to have played an important role for shaping the mosaic plastid proteome of these species.


Culturing, library construction, cDNA-sequencing and 5' RACE

The monoclonal L. chlorophorum strain RCC1488 (Roscoff Culture Collection: was cultured in K/2(-Tris,-Si) medium [56] at 17°C with illumination provided by daylight neon tubes at an intensity of 150 μE.m2.s1 and a photoperiod of 14L:10 D. Pre-cultures were treated for 24 hours with a range of concentrations of Provasoli antibiotic mixture (Sigma Aldrich) and sub-cultured regularly over a 3-week period in order to minimize bacterial contamination. Two 10 liter cultures were harvested in mid to late exponential growth phase by centrifugation (5 min at 6,000 rpm) in sterile 1 liter polycarbonate centrifuge flasks (Nalgene). Cell concentrates were immediately flash frozen in liquid nitrogen and stored at -80°C pending analysis.

RNA was isolated and a non-normalized, directional, cDNA library was constructed in the plasmid vector pAGEN-1 by Agencourt Bioscience Corp. (Beverly, MA, USA). 4,746 randomly picked clones were 5'-end sequenced, and subsequently quality checked and assembled to contigs using a Phred/Phrap pipeline at the open-access Bioportal service at University of Oslo BLASTx analyses and gene annotation of L. chlorophorum singletons and contigs were performed using Blast2GO [57].

The 5'-ends of the cDNA sequences were amplified by performing rapid amplification of cDNA ends (RACE), using the GeneRacer kit with SuperScript III RT (Invitrogen, Carlsbad, USA) using specific primers and a nested PCR approach. The products were subsequently cloned using the TOPO-TA cloning kit for sequencing (Invitrogen, Carlsbad, USA).

Identification of plastid targeted sequences

Plastid targeted sequences among singletons and contigs were identified according to their phylogenetic relationship to other plastid homologs, participation in plastid-located processes, and/or possession of plastid-targeted sequence comprising a signal peptide and a transit peptide [15]. Searches for possible signal peptides and their cleavage sites were estimated using SignalP ([45]. Logoplots were constructed using WebLogo, while transmembrane helices were identified using TMHMM server v.2.0 and the hydrophobicity profile of Kyte and Doolittle [58].

Alignment construction and phylogenetic analyses

Amino-acid sequence alignments were created by downloading homologous sequences from the nonredundant (NCBInr) and EST (NCBIest) databases in GenBank for each putative plastid-targeted gene detected in L. chlorophorum. We aimed to include taxa from all main photosynthetic eukaryotic lineages in each alignment. The sequences were initially aligned using MAFFT [59], and subsequently edited manually in MacClade [60]. Ambiguously aligned characters were detected by eye and removed manually. Maximum likelihood trees were inferred using RAxML [61]. Trees were identified with 100 separate heuristic searches from random starting trees, while bootstrap analyses involved 100 pseudoreplicates and one heuristic search for each replication with the same substitution model as the initial search. Bayesian analyses were performed using MrBayes v.3.1.2 [62, 63] Trees were generated from two runs with one heated and three cold chains in the MCMC, using a random starting tree and 2,000,000 generations. Tree sampling were done every 100 generations. Burn-in trees were set according to the assessment of the likelihood plots and the convergence diagnostics implemented in MrBayes, and the consensus of the sampled trees were used to calculate the posterior probabilities. The best fitting evolutionary model for each alignment according to the AIC (for ML) and BIC (for Bayesian) was applied (i.e. WAG, RtRev or CtRev;. estimated by Prottest v.2.4 [64]).

Accession numbers for the sequences used in the alignments are shown in Additional file 6 Table S1.


  1. 1.

    Gray MW, Spencer DF: Organellar evolution. Evolution of Microbial Life. Edited by: Roberts DM, Sharp P, Alderson G. 1996, Collins MA Cambridge: Cambridge University Press, 54: 109-126.

    Google Scholar 

  2. 2.

    Rodriguez-Ezpeleta N, Brinkmann H, Burey SC, Roure B, Burger G, Loffelhardt W, Bohnert HJ, Philippe H, Lang BF: Monophyly of primary photosynthetic eukaryotes: Green plants, red algae, and glaucophytes. Curr Biol. 2005, 15 (14): 1325-1330. 10.1016/j.cub.2005.06.040.

    Article  CAS  PubMed  Google Scholar 

  3. 3.

    Moreira D, Le Guyader H, Philippe H: The origin of red algae and the evolution of chloroplasts. Nature. 2000, 405 (6782): 69-72. 10.1038/35011054.

    Article  CAS  PubMed  Google Scholar 

  4. 4.

    Cavalier-Smith T: Principles of protein and lipid targeting in secondary symbiogenesis: Euglenoid, dinoflagellate, and sporozoan plastid origins and the eukaryote family tree. J Eukaryot Microbiol. 1999, 46 (4): 347-366. 10.1111/j.1550-7408.1999.tb04614.x.

    Article  CAS  PubMed  Google Scholar 

  5. 5.

    Reyes-Prieto A, Weber APM, Bhattacharya D: The origin and establishment of the plastid in algae and plants. Annu Rev Genet. 2007, 41: 147-168. 10.1146/annurev.genet.41.110306.130134.

    Article  CAS  PubMed  Google Scholar 

  6. 6.

    Archibald JM: The puzzle of plastid evolution. Curr Biol. 2009, 19 (2): R81-R88. 10.1016/j.cub.2008.11.067.

    Article  CAS  PubMed  Google Scholar 

  7. 7.

    Keeling PJ: Diversity and evolutionary history of plastids and their hosts. Am J Bot. 2004, 91 (10): 1481-1493. 10.3732/ajb.91.10.1481.

    Article  PubMed  Google Scholar 

  8. 8.

    Bodyl A, Stiller JW, Mackiewicz P: Chromalveolate plastids: direct descent or multiple endosymbioses. Trends Ecol Evol. 2009, 24 (3): 119-121. 10.1016/j.tree.2008.11.003.

    Article  PubMed  Google Scholar 

  9. 9.

    Shalchian-Tabrizi K, Skanseng M, Ronquist F, Klaveness D, Bachvaroff TR, Delwiche CF, Botnen A, Tengs T, Jakobsen KS: Heterotachy processes in rhodophyte-derived secondhand plastid genes: Implications for addressing the origin and evolution of dinoflagellate plastids. Mol Biology Evol. 2006, 23 (8): 1504-1515. 10.1093/molbev/msl011.

    Article  CAS  Google Scholar 

  10. 10.

    Falkowski PG, Katz ME, Knoll AH, Quigg A, Raven JA, Schofield O, Taylor FJR: The evolution of modern eukaryotic phytoplankton. Science. 2004, 305 (5682): 354-360. 10.1126/science.1095964.

    Article  CAS  PubMed  Google Scholar 

  11. 11.

    Burki F, Inagaki Y, Bråte J, Archibald JM, Keeling P, Cavalier-Smith T, Sakaguchi M, Hashimoto T, Horak A, Kumar S, et al: Large-scale phylogenomic analyses reveal that two enigmatic protist lineages, Telonemia and Centroheliozoa, are related to photosynthetic chromalveolates. Gen Biol Evol. 2009, 1: 231-238. 10.1093/gbe/evp022.

    Article  Google Scholar 

  12. 12.

    Burki F, Shalchian-Tabrizi K, Minge MA, Skjæveland Å, Nikolaev SI, Jakobsen KS, Pawlowski J: Phylogenomics reshuffles the eukaryotic supergroups. PloS ONE. 2007, 2 (8): e790-10.1371/journal.pone.0000790.

    PubMed Central  Article  PubMed  Google Scholar 

  13. 13.

    Patron NJ, Rogers MB, Keeling PJ: Gene replacement of fructose-1,6-bisphosphate aldolase supports the hypothesis of a single photosynthetic ancestor of chromalveolates. Eukaryot Cell. 2004, 3 (5): 1169-1175. 10.1128/EC.3.5.1169-1175.2004.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  14. 14.

    Bachvaroff TR, Sanchez Puerta MV, Delwiche CF: Chlorophyll-c containing plastid relationships based on analyses of a multi-gene dataset with all four chromalveolate lineages. Mol Biol Evol. 2005, 22 (9): 1772-1782. 10.1093/molbev/msi172.

    Article  CAS  PubMed  Google Scholar 

  15. 15.

    Archibald JM, Rogers MB, Toop M, Ishida K, Keeling PJ: Lateral gene transfer and the evolution of plastid-targeted proteins in the secondary plastid-containing alga Bigelowiella natans. Proc Natl Acad Sci USA. 2003, 100 (13): 7678-7683. 10.1073/pnas.1230951100.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  16. 16.

    Archibald JM, Keeling PJ: Recycled plastids: a 'green movement' in eukaryotic evolution. Trends Genet. 2002, 18 (11): 577-584. 10.1016/S0168-9525(02)02777-4.

    Article  CAS  PubMed  Google Scholar 

  17. 17.

    Cavalier-Smith T, Chao EEY: Phylogeny and classification of phylum Cercozoa (Protozoa). Protist. 2003, 154 (3-4): 341-358. 10.1078/143446103322454112.

    Article  PubMed  Google Scholar 

  18. 18.

    Rogers MB, Gilson PR, Su V, McFadden GI, Keeling PJ: The complete chloroplast genome of the chlorarachniophyte Bigelowiella natans: Evidence for independent origins of chlorarachniophyte and euglenid secondary endosymbionts. Mol Biol Evol. 2007, 24 (1): 54-62. 10.1093/molbev/msl129.

    Article  CAS  PubMed  Google Scholar 

  19. 19.

    Gibbs SP: The chloroplasts of Euglena may have evolved from symbiotic green algae. Can J Bot. 1978, 56: 2883-2889. 10.1139/b78-345.

    Article  Google Scholar 

  20. 20.

    Hallick RB, Hong L, Drager RB, Favreau MR, Monfort A, Orsat B, Spielmann A, Stutz E: Complete sequence of Euglena gracilis chloroplast DNA. Nucleic Acids Res. 1993, 21 (15): 3537-3544. 10.1093/nar/21.15.3537.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  21. 21.

    Watanabe MM, Takeda Y, Sasa T, Inouye I, Suda S, Sawaguchi T, Chihara M: A green dinoflagellate with chlorophylls A and B: morphology, fine strucure of the chloroplast and chlorophyll composition. J Phycol. 1987, 23: 382-389. 10.1111/j.1529-8817.1987.tb04148.x.

    Article  CAS  Google Scholar 

  22. 22.

    Watanabe MM, Suda S, Inouye I, Sawaguchi T, Chihara M: Lepidodinium viride gen. et sp. nov (Gymnodiniales, Dinophyta), a green dinoflagellate with a chlorophyll A- and B-containing endosymbiont. J Phycol. 1990, 26: 741-751. 10.1111/j.0022-3646.1990.00741.x.

    Article  Google Scholar 

  23. 23.

    Elbrächter M, Schnepf E: Gymnodinium chlorophorum, a new, green, bloom-forming dinoflagellate (Gymnodiniales, Dinophyceae) with a vestigial prasinophyte endosymbiont. Phycologia. 1996, 35 (5): 381-393.

    Article  Google Scholar 

  24. 24.

    Takishita K, Kawachi M, Noel MH, Matsumoto T, Kakizoe N, Watanabe MM, Inouye I, Ishida KI, Hashimoto T, Inagaki Y: Origins of plastids and glyceraldehyde-3-phosphate dehydrogenase genes in the green-colored dinoflagellate Lepidodinium chlorophorum. Gene. 2008, 410 (1): 26-36. 10.1016/j.gene.2007.11.008.

    Article  CAS  PubMed  Google Scholar 

  25. 25.

    Delwiche CF, Palmer JD: The origin of plastids and their spread via secondary symbiosis. Origins of Algae and Their Plastids. Edited by: Bhattacharya D. 1997, New York: Springer, 53-86.

    Chapter  Google Scholar 

  26. 26.

    Turmel M, Gagnon M-C, O'Kelly CJO, Otis C, Lemieux C: The chloroplast genomes of the green algae Pyramimonas, Monomastix, and Pycnococcus shed new light on the evolutionary history of prasinophytes and the origin of the secondary chloroplasts of euglenids. Mol Biol Evol. 2009, 26 (3): 631-648. 10.1093/molbev/msn285.

    Article  CAS  PubMed  Google Scholar 

  27. 27.

    Martin W, Rujan T, Richly E, Hansen A, Cornelsen S, Lins T, Leister D, Stoebe B, Hasegawa M, Penny D: Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus. Proc Natl Acad Sci USA. 2002, 99 (19): 12246-12251. 10.1073/pnas.182432999.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  28. 28.

    Martin W, Herrmann RG: Gene transfer from organelles to the nucleus: How much, what happens, and why?. Plant Physiol. 1998, 118 (1): 9-17. 10.1104/pp.118.1.9.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  29. 29.

    Soll J, Schleiff E: Protein import into chloroplasts. Nat Rev Mol Cell Bio. 2004, 5: 198-208. 10.1038/nrm1333.

    Article  CAS  Google Scholar 

  30. 30.

    McFadden GI: Plastids and protein targeting. J Eukaryot Microbiol. 1999, 46 (4): 339-346. 10.1111/j.1550-7408.1999.tb04613.x.

    Article  CAS  PubMed  Google Scholar 

  31. 31.

    Patron NJ, Waller RF, Keeling PJ: A tertiary plastid uses genes from two endosymbionts. J Mol Biol. 2006, 357 (5): 1373-1382. 10.1016/j.jmb.2006.01.084.

    Article  CAS  PubMed  Google Scholar 

  32. 32.

    Nosenko T, Bhattacharya D: Horizontal gene transfer in chromalveolates. BMC Evol Biol. 2007, 7 (173): 1-18.

    Google Scholar 

  33. 33.

    Nosenko T, Lidie KL, Van Dolah FM, Lindquist E, Cheng JF, Bhattacharya D: Chimeric plastid proteome in the florida "red tide" dinoflagellate Karenia brevis. Mol Biol Evol. 2006, 23 (11): 2026-2038. 10.1093/molbev/msl074.

    Article  CAS  PubMed  Google Scholar 

  34. 34.

    Moustafa A, Beszteri B, Maier UG, Bowler C, Valentin K, Bhattacharya D: Genomic footprints of a cryptic plastid endosymbiosis in diatoms. Science. 2009, 324: 1724-1726. 10.1126/science.1172983.

    Article  CAS  PubMed  Google Scholar 

  35. 35.

    Huang J, Gogarten JP: Did an ancient chlamydial endosymbiosis facilitate the establishment of primary plastids?. Genome Biol. 2007, 8 (6): R99-10.1186/gb-2007-8-6-r99.

    PubMed Central  Article  PubMed  Google Scholar 

  36. 36.

    Suzuki K, Miyagishima SY: Eukaryotic and eubacterial contributions to the establishment of plastid proteome estimated by large-scale phylogenetic analyses. Mol Biol Evol. 2010, 27 (3): 581-590. 10.1093/molbev/msp273.

    Article  CAS  PubMed  Google Scholar 

  37. 37.

    Shalchian-Tabrizi K, Minge MA, Cavalier-Smith T, Nedreklepp JM, Klaveness D, Jakobsen KS: Combined heat shock protein 90 and ribosomal RNA sequence phylogeny supports multiple replacements of dinoflagellate plastids. J Eukaryot Microbiol 2006. 2006, 53 (3): 217-224. 10.1111/j.1550-7408.2006.00098.x.

    Article  CAS  Google Scholar 

  38. 38.

    Tengs T, Dahlberg OJ, Shalchian-Tabrizi K, Klaveness D, Rudi K, Delwiche CF, Jakobsen KS: Phylogenetic analyses indicate that the 19 ' hexanoyloxy-fucoxanthin-containing dinoflagellates have tertiary plastids of haptophyte origin. Mol Biol Evol. 2000, 17 (5): 718-729.

    Article  CAS  PubMed  Google Scholar 

  39. 39.

    McEwan ML, Keeling PJ: HSP90, tubulin and actin are retained in the tertiary endosymbiont genome of Kryptoperidinium foliaceum. Journal of J Eukaryot Microbiol. 2004, 51 (6): 651-659. 10.1111/j.1550-7408.2004.tb00604.x.

    Article  CAS  Google Scholar 

  40. 40.

    Cavalier-Smith T: Genomic reduction and evolution of novel genetic membranes and protein-targeting machinery in eukaryote-eukaryote chimaeras (meta-algae). Philos T R Soc B. 2003, 358 (1429): 109-133. 10.1098/rstb.2002.1194.

    Article  CAS  Google Scholar 

  41. 41.

    Takishita K, Koike K, Maruyama T, Ogata T: Molecular evidence for plastid robbery (Kleptoplastidy) in Dinophysis, a dinoflagellate causing diarrhetic shellfish poisoning. Protist. 2002, 153 (3): 293-302. 10.1078/1434-4610-00106.

    Article  CAS  PubMed  Google Scholar 

  42. 42.

    Saldarriaga JF, Taylor FJ, Keeling PJ, Cavalier-Smith T: Dinoflagellate nuclear SSU rRNA phylogeny suggests multiple plastid losses and replacements. J Mol Evol. 2001, 53 (3): 204-213. 10.1007/s002390010210.

    Article  CAS  PubMed  Google Scholar 

  43. 43.

    De Las Rivas J, Heredia P, Roman A: Oxygen-evolving extrinsic proteins (PsbO, P, Q, R): Bioinformatic and functional analysis. BBA-Bioenergetics. 2007, 1767 (6): 575-582. 10.1016/j.bbabio.2007.01.018.

    Article  CAS  PubMed  Google Scholar 

  44. 44.

    Zhang H, Hou Y, Miranda L, Campbell DA, Sturm NR, Gaasterland T, Lin S: Spliced leader RNA trans-splicing in dinoflagellates. Proc Natl Acad Sci USA. 2007, 104 (11): 4618-4623. 10.1073/pnas.0700258104.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  45. 45.

    Bendtsen JD, Nielsen H, von Heijne G, Brunak S: Improved prediction of signal peptides: SignalP 3.0. J Mol Biol. 2004, 340 (4): 783-795. 10.1016/j.jmb.2004.05.028.

    Article  PubMed  Google Scholar 

  46. 46.

    Matsuzaki M, Kuroiwa H, Kuroiwa T, Kiyoshi K, Nozaki H: A cryptic algal group unveiled: A plastid biosynthesis pathway in the oyster parasite Perkinsus marinus. Mol Biol Evol. 2008, 25 (6): 1167-1179. 10.1093/molbev/msn064.

    Article  CAS  PubMed  Google Scholar 

  47. 47.

    Patron NJ, Waller RE: Transit peptide diversity and divergence: a global analysis of plastid targeting signals. Bioessays. 2007, 29: 1048-1058. 10.1002/bies.20638.

    Article  CAS  PubMed  Google Scholar 

  48. 48.

    Yoon HS, Hackett JD, Van Dolah FM, Nosenko T, Lidie L, Bhattacharya D: Tertiary endosymbiosis driven genome evolution in dinoflagellate algae. Mol Biol Evol. 2005, 22 (5): 1299-1308. 10.1093/molbev/msi118.

    Article  CAS  PubMed  Google Scholar 

  49. 49.

    Ishida K, Green BR: Second- and third-hand chloroplasts in dinoflagellates: Phylogeny of oxygen-evolving enhancer 1 (PsbO) protein reveals replacement of a nuclear-encoded plastid gene by that of a haptophyte tertiary endosymbiont. Proc Natl Acad Sci USA. 2002, 99 (14): 9294-9299. 10.1073/pnas.142091799.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  50. 50.

    Hackett JD, Yoon HS, Soares MB, Bonaldo MF, Casavant TL, Scheetz TE, Nosenko T, Bhattacharya D: Migration of the plastid genome to the nucleus in a peridinin dinoflagellate. Curr Biol. 2004, 14 (3): 213-218.

    Article  CAS  PubMed  Google Scholar 

  51. 51.

    Keeling PJ, Palmer JD: Horizontal gene transfer in eukaryotic evolution. Nat Rev Genet. 2008, 9 (8): 605-618. 10.1038/nrg2386.

    Article  CAS  PubMed  Google Scholar 

  52. 52.

    Andersson JO: Horizontal gene transfer between microbial eukaryotes. Methods Mol Biol. 2009, 532: 473-487. full_text.

    Article  CAS  PubMed  Google Scholar 

  53. 53.

    Archibald JM: Jumping genes and shrinking genomes - Probing the evolution of eukaryotic photosynthesis with genomics. IUBMB Life. 2005, 57 (8): 539-547. 10.1080/15216540500167732.

    Article  CAS  PubMed  Google Scholar 

  54. 54.

    Patron NJ, Waller RE, Archibald JM, Keeling PJ: Complex protein targeting to dinoflagellate plastids. J Mol Biol. 2005, 348: 1015-1024. 10.1016/j.jmb.2005.03.030.

    Article  CAS  PubMed  Google Scholar 

  55. 55.

    Keeling PJ: Role of horizontal gene transfer in the evolution of photosynthetic eukaryotes and their plastids. Methods Mol Biol. 2009, 501-515. full_text. 532:

  56. 56.

    Keller M, Selvin RC, Wolfgang C, Guillard RRL: Media for the culture of oceanic ultraphytoplankton. J Phycol. 1987, 23 (4): 633-638.

    Article  Google Scholar 

  57. 57.

    Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, Robles M: Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21 (18): 3674-3676. 10.1093/bioinformatics/bti610.

    Article  CAS  PubMed  Google Scholar 

  58. 58.

    Kyte J, Doolittle RF: A simple method for displaying the hydropathic character of a protein. J Mol Biol. 1982, 157 (1): 105-132. 10.1016/0022-2836(82)90515-0.

    Article  CAS  PubMed  Google Scholar 

  59. 59.

    Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30 (14): 3059-3066. 10.1093/nar/gkf436.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  60. 60.

    Maddison WP, Maddison DR: MacClade: Analysis of phylogeny and character evolution. Version 3. 1992, Sinauer Associates, Sunderland, Massachusetts, USA

    Google Scholar 

  61. 61.

    Stamatakis A, Ludwig T, Meier H: RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics. 2005, 21 (4): 456-463. 10.1093/bioinformatics/bti191.

    Article  CAS  PubMed  Google Scholar 

  62. 62.

    Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogeny. Bioinformatics. 2001, 17: 754-755. 10.1093/bioinformatics/17.8.754.

    Article  CAS  PubMed  Google Scholar 

  63. 63.

    Ronquist F, Huelsenbeck JP: MRBAYES 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.

    Article  CAS  PubMed  Google Scholar 

  64. 64.

    Abascal F, Zardoya R, Posada D: ProtTest: Selection of best-fit models of protein evolution. Bioinformatics. 2005, 21 (9): 2104-2105. 10.1093/bioinformatics/bti263.

    Article  CAS  PubMed  Google Scholar 

Download references


We thank the Bioportal at University of Oslo (UiO) for computer time and use of phylogenetic applications. The research project was supported by a grant from The Norwegian Research Council to KSJ. KST thanks the UiO for starting grants. IP was supported by the EU FP7 project ASSEMBLE (RI-227799).

Author information



Corresponding author

Correspondence to Kjetill S Jakobsen.

Additional information

Authors' contributions

MAM screened the EST library for plastid-associated genes, carried out the contig assembly, alignment construction and phylogenetic analyses, helped with the 5'-RACE experiments and wrote the manuscript. KST helped to design the study and contributed to the manuscript. OKT and KT performed 5'-RACE experiments and analyzed the signaling sequences. IP carried out the algal culturing. OKT, YI, KT and IP contributed to the manuscript. DK was involved in scientific design and technical discussions. KSJ conceived the study, was the project leader and contributed to the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material


Additional file 1: Supplementary Figure S1: Genes of red algal origin. Maximum likelihood trees inferring a red-algal origin of 4 plastid-associated genes. All trees were inferred using RAxML. Bootstrap values >50%are indicated on the branches. Green and red lineages are indicated by color, secondary green algae are in bold. Filled dots indicate 100% bootstrap support. Bayesian posterior probability values are indicated for some of the most important splits. (PDF 467 KB)

Supplementary Figure S2: Unresolved phylogenies

Additional file 2: . Maximum likelihood trees demonstrating an unresolved position for L. chlorophorum. Filled dots indicate 100% bootstrap support. Bayesian posterior probability values are indicated for some of the most important splits. (PDF 436 KB)


Additional file 3: Supplementary Figure S3: Kyte-Doolittle hydrophobicity plots of the N-terminal extensions (presequences). (PDF 661 KB)


Additional file 4: Supplementary Figure S4:Signal peptide sequence. Weblogo plot of the putative signal peptides sequence estimated using SignalP-HMM. (PNG 15 KB)

Supplementary Figure S5: Amino acid composition of transit peptides

Additional file 5: . Percentage bars of transit peptides of chlorophytes, peridinin-containing dinoflagellates, K. veneficum (all derived from Patron & Waller 2007), L. chlorophorum and the entire L. chlorophorum mature peptide. (PDF 26 KB)

Additional file 6: Supplementary Table S1: Accession numbers of sequences used in phylogenetic analyses. (DOC 354 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Minge, M.A., Shalchian-Tabrizi, K., Tørresen, O.K. et al. A phylogenetic mosaic plastid proteome and unusual plastid-targeting signals in the green-colored dinoflagellate Lepidodinium chlorophorum. BMC Evol Biol 10, 191 (2010).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Transit Peptide
  • RuBisCO Activase
  • Secondary Plastid
  • Primary Plastid
  • Endosymbiotic Gene Transfer