- Research article
- Open Access
Potential causes and consequences of rapid mitochondrial genome evolution in thermoacidophilic Galdieria (Rhodophyta)
BMC Evolutionary Biology volume 20, Article number: 112 (2020)
- The Correction to this article has been published in BMC Evolutionary Biology 2020 20:132
The Cyanidiophyceae is an early-diverged red algal class that thrives in extreme conditions around acidic hot springs. Although this lineage has been highlighted as a model for understanding the biology of extremophilic eukaryotes, little is known about the molecular evolution of their mitochondrial genomes (mitogenomes).
To fill this knowledge gap, we sequenced five mitogenomes from representative clades of Cyanidiophyceae and identified two major groups, here referred to as Galdieria-type (G-type) and Cyanidium-type (C-type). G-type mitogenomes exhibit the following three features: (i) reduction in genome size and gene inventory, (ii) evolution of unique protein properties including charge, hydropathy, stability, amino acid composition, and protein size, and (iii) distinctive GC-content and skewness of nucleotides. Based on GC-skew-associated characteristics, we postulate that unidirectional DNA replication may have resulted in the rapid evolution of G-type mitogenomes.
The high divergence of G-type mitogenomes was likely driven by natural selection in the multiple extreme environments that Galdieria species inhabit combined with their highly flexible heterotrophic metabolism. We speculate that the interplay between mitogenome divergence and adaptation may help explain the dominance of Galdieria species in diverse extreme habitats.
Acidic hot springs environments can place severe stresses on unicellular organisms due to the high temperature, low pH, and elevated heavy-metal concentrations . Most hot springs biodiversity is comprised of prokaryotes, with only a few eukaryotes able to compete successfully in these extreme habitats [2, 3]. The unicellular red algal class, Cyanidiophyceae, is a well-known group of extremophilic eukaryotes that thrives in acidic (pH 0.5–3.0) and high temperature (50–55 °C) habitats [4, 5]. Cyanidiophyceae have traditionally been reported from volcanic regions around the world [6,7,8], whereas some mesophilic species (e.g., Cyanidium chilense) are found in moderately acidic caves (pH 5–7) around volcanic regions in Chile, Italy, France, Israel, and Turkey [4, 9,10,11,12]. Cyanidiophyceae is the earliest diverging red algae lineage, having split from other Rhodophyta about 1.5 billion years ago [5, 13]. Due to limited ultrastructural differences and the “simple” morphology of cyanidiophycean cells, it is difficult to discriminate between species using microscopy. This has led to the hypothesis of cryptic diversity based on molecular phylogenetic studies [4, 14, 15]. Moreover, incongruent topologies or unresolved relationships of Cyanidiophyceae are reported in most molecular phylogenetic studies using single (e.g., rbcL, SSU rRNA) or a few genes (e.g., rbcL + psaA + psbA) [4, 10, 14, 16,17,18].
Most Cyanidiophyceae are photoautotrophs and inhabit niches exposed to sulfur fumes, however the genus Galdieria also occupies endolithic and interlithic habitats [4, 16], and is therefore more exposed to microenvironmental fluctuations [e.g., diverse microhabitats, desiccation, a broad range of acidity (pH 0.5–6.0), temperature (18–55 °C), and salinity (4–10%)] [4, 8, 10, 19,20,21,22,23,24,25] . Members of this genus are also grow mixotrophically, able to utilize > 50 carbon sources . From a genomic point of view, previous studies have demonstrated that Galdieria species contain a variety of horizontally transferred genes that confer adaptation for specific habitats (e.g., endolithic) [27,28,29]. Although Cyanidiophyceae genomes are significantly reduced in size (< 20 Mbp), Galdieria species retain a nearly intact spliceosomal machinery and have a relatively large number of introns that likely play a role in regulating the stress response .
Galdieria species have greatly reduced mitochondrial genomes and these genes have the highest substitution rate among all red algae likely due to the polyextremophilic lifestyle . However, only two published (Cyanidioschyzon merolae 10D and Galdieria sulphuraria 074 W) and one unpublished (Cyanidiophyceae sp. MX-AZ01) mitochondrial genomes are available to date [30, 31]. To better understand cyanidiophycean mitochondrial genome evolution, we generated five new complete mitogenomes that represent all of the major clades, including the three genera of Cyanidiophyceae. Among them, three Galdieria mitogenomes are not only highly reduced in size, but also substantially differ in protein characteristics compared to Cyanidiophyceae and other red algae. Based on GC-skewness and other mitogenomic features, we postulate that a unique replication system may exist in Galdieria species.
Results and discussion
To confirm the phylogenetic position of the eight strains under study, we used the rbcL phylogeny to identify the major groups of Cyanidiophyceae (Fig. S1). The taxon-rich (269 taxa) rbcL tree showed five major cyanidiophycean groups: 1) Cyanidium chilense assemblage (Cd. chilense; known as mesophilic Cyanidium sp.) , 2) Cyanidium caldarium (Cd. caldarium), 3) Galdieria sulphuraria assemblage (G. sulphuraria), 4) Cyanidioschyzon merolae (Cz. merolae), and 5) Cyanidiococcus yangmingshanensis assemblage (Cc. yangmingshanensis; known as Galdieria maxima) , which is consistent with previous studies [14, 18]. The eight strains used in this study adequately represent the diversity of the five major groups of Cyanidiophyceae (see in Fig. S1).
The general characteristics of the eight mitogenomes were compared including five new and three published datasets (Table 1) [30, 31]. These mitogenomes are clearly divided into two types as Cyanidium-type and Galdieria-type based on mitogenome features (e.g., genome size, genome structure, number of genes, skewness of nucleotides). Cyanidium-type (C-type) is comprised of five taxa (mesophilic Cd. chilense Sybil Cave, Cd. caldarium, Cz. merolae, Cc. yangmingshanensis, and Cyanidiophyceae sp. MX-AZ01), and the Galdieria-type (G-type) is comprised of three taxa (G. phlegrea, G. sulphuraria 108.79 E11 and 074 W).
General features of Cyanidiophyceae mitogenomes
The mitogenome size of Cyanidium-type (C-type) is roughly 10 kbp larger than in Galdieria-type (G-type) (see Table 1). This size difference between the two types is explained by a larger number of tRNA encoding genes and CDSs in the C-type (33–35 CDS, 23–25 tRNA genes) than in the G-type (18 CDS, 6–7 tRNA genes). SSU (rrs) and LSU (rrl) rRNAs are present in all species, yet 5S rRNA sequences were identified only in the C-type mitogenomes, except for Cyanidium caldarium. This finding is congruent with former results that showed the 5S rRNA gene in red algal mitogenomes to be pseudogenized or lost outright . Portions of non-coding regions in C-type (5.13–10.64%) are smaller than those of G-type (15.90–17.64%), mostly due to the existence of unknown sequence insertions (2–3 kbp length, ymf39-cox1 region).
Before comparing the genome structures, we aligned all genomes starting from the start codon of the cox2 gene. For comparative genome structure analysis, Mauve Genome Alignment v2.2.0  with default settings and UniMoG using the DCJ model  were used. Comparison of genome structure reveals that Cyanidiophyceae mitogenomes are divided into the four synteny blocks shown in Figs. 1, 2; cox2-cox1 region (red block), nad3-yejR region (purple block), rDNA region including adjacent genes (green block), and cob region (blue block). These synteny blocks are conserved between the Cyanidium-type and Galdieria-type, however, there is a unique 2–3 kbp sequence insertion (yellow block) found between atp4 and cox1 genes (red block) only in the Galdieria-type. An open reading frame (i.e., orf181 gene in G. sulphuraria 074 W) is present in this inserted region, but none of these sequences have significant hits in the NCBI or UniProt databases. These four, shared synteny blocks are identical in each type, however, two inversions and one deletion are found between the types; 1) the Cyanidium-type has a reversed nad3-yejR coding region (purple block) order, which is in the forward direction in Galdieria-type. 2) cob gene (blue block) is a reverse-complemented relative to Cyanidium-type (Figs. 1, 2). There is significant gene loss in the green synteny block of Galdieria-type. The 16–17 kbp of the rDNA region of Cyanidium-type includes 18–20 coding genes, two rDNAs, and two tRNAs, whereas the Galdieria-type (5–6 kbp length) is composed of five genes, two rDNAs, and 18 tRNAs.
Based on our observations and previous work, C-type and G-type are recognized not only using mitogenome characteristics, but also on the basis of morphological characteristics, cellular features, and ecological habitats (Fig. 1a) [4, 6,7,8, 10, 14, 16, 21,22,23, 36, 37]. The general morphology of G-type differs from C-type with regard to a larger cell size (G-type: 5–10 μm, C-type: 1–4 μm), presence of a vacuole, organelle shape (e.g., branched mitochondria, multi-lobed plastid), and a simple spherical morphology, whereas C-type species show more diverse cell shapes (e.g., spherical, oval, club-shaped) (Fig. 1a) [4, 8, 10, 19,20,21,22, 36, 38]. Particularly for mitochondrial features, G-type Galdieria sulphuraria contains several mitochondria per cell that have a net-like structure, whereas C-type Cd. caldarium and Cz. merolae contain a single mitochondrion in a cell . In our transmission electron microscopic observation, multiple mitochondria are identified in G-type G. sulphuraria 108.79 E11, whereas a single spheroid mitochondrion is found in C-type Cc. yangmingshanensis 8.1.23 F7 (Figs. 1a). The ecophysiological niche of G-type species is more diverse than C-type species and includes hydrothermal regions, acid mine drainage sites, endolithic environments, and surfaces of burning coal [4, 10, 16,17,18, 24]. In contrast, the habitats of C-type species are more ecologically “protected” such as in ditches and streams around hot springs that exhibit lower temperature fluctuations [4, 10, 16,17,18, 21, 24]. These habitat differences suggest that G-type species have radiated in more diverse environments that C-type species.
Cyanidium-type and Galdieria-type mitogenomes resolved using phylogenetic analysis
Phylogenetic analysis also supports the recognition of the C-type and G-type mitogenomes. The concatenated protein ML phylogeny using 36 mitochondrial genes resolves the C-type and G-type with full bootstrap support (see Fig. 1b). Within the C-type, the mesophilic Cd. chilense Sybil Cave diverged first, followed by Cd. caldarium ACUF 019. Cz. merolae 10D are grouped together with the monophyletic clade of the Cc. yangmingshanensis 8.1.23 F7 + Cyanidiophyceae sp. MX-AZ01. Our current mitogenome data resolve the internal relationships within the Cyanidiophyceae, in particular, the positions of the mesophilic Cd. chilense Sybil Cave and Cd. caldarium ACUF 019 that have been poorly resolved until now (Fig. S2). In this mitogenome data analysis, however, we observed an extraordinarily long internal branch of the G-type (Fig. 1b), which implies high divergence when compared to C-type species or unidentified/extinct genetic diversity in the G-type lineages. In addition, we tested individual gene phylogenies to see the consistency compared to concatenated gene tree in Supplementary Information 1. Applying variable mitochondrial gene datasets, we were able to resolve the phylogenetic relationships among major cyanidiophycean clades, in particular the unsettled position of the mesophilic Cyanidium clade (Fig. S2).
Different CDS content between Cyanidium-type and Galdieria-type mitogenomes: gene loss and transfer
After the recognition of two different groups in Cyanidiophyceae, we focused on mitogenomes gene gains and losses. Comparison of CDS content between the two different types revealed that one-half of mitochondrion-encoded genes in C-type mitogenomes are missing in the G-type, and changed in synteny regarding the green block (Fig. 2a) [i.e., losses of all ribosomal protein genes (rps, rpl) and a few core genes (ccmA,B and sdhB,D)]. To examine endosymbiotic gene transfer (EGT) from mitochondrial to the nuclear genome for these missing genes in G-type, we searched 18 homologous genes in the two available nuclear genomes of Cz. merolae 10D and G. sulphuraria 074 W [28, 39]. Ten genes were identified from the nuclear genome of G. sulphuraria 074 W, but we were not able to identify eight mitochondrial genes (‘N/D’ in Fig. 2a) that may indicate outright gene losses in the mitogenome, although it is not possible to rule out issues related to low-quality genome data. Other explanations for missing genes is either high diversification after gene transfer to the nucleus or degeneration of genes from the mitochondria . Out of twelve ribosomal proteins, only rps12 and rpl20 were found in the G. sulphuraria 074 W nuclear genome. These nuclear-encoded rps12 and rpl20 genes were not grouped together but rather located in two different scaffolds, 57 and 29, respectively (Fig. 2b), whereas most ribosomal protein encoding genes are located in a single syntenic block in C-type mitogenomes (ccmA-nad6; green block in Fig. 2a). We could not detect the remaining ribosomal protein encoding genes, instead, we found six nuclear-encoded homologs of ribosomal protein (rpl6, rpl14, rpl16, rps4, rps11, rps19) in the genome (see question marks in Fig. 2a). With the ccmF gene (see Supplementary Information 2 for details), the origins of six homologous genes of ribosomal protein were unclear based on phylogenetic analyses due to low bootstrap support values.
It is unlikely that mitochondrial translation would function properly without a complete set of ribosomal subunit proteins, therefore, the nuclear-encoded homologs could “compensate” for gene losses (e.g., 16–19 tRNAs loss in G-type mitogenomes). Meanwhile, the homologs of rps13 and rps19 were not detected from the Cz. merolae 10D but were found in the early branched, mesophilic Cd. chilense, suggesting independent gene losses in both C-type and G-type mitogenomes. It was possible to detect a plastidial-copy, nuclear-copy (host-derived), or other (e.g., unknown sources) ribosomal proteins from homologous searches, which implies the possibility to translocate ribosomal subunits from various origins (e.g., nuclear, plastid, other bacteria) into mitochondria as suggested in previous studies [41,42,43].
Changes in amino acid composition in Galdieria-type mitochondrial genes
Enzymes from extremophilic organisms have high thermostability, more charged amino acid composition, and reduced hydrophobic surfaces to withstand extreme temperatures and pH [44, 45]. Furthermore, proteins of thermophilic species are shorter in length than those in their mesophilic counterparts . To investigate protein characteristics, we compared protein charge, hydropathy, and stability in 16 genes (genes in Fig. 3) that are retained in all red algal mitogenomes (i.e., G-type, C-type, non-cyanidiophycean red algae) used in this study.
Amino acid composition of these 16 conserved mitochondrial genes shows that 13 out of 20 amino acids are significantly different (asterisks in Fig. 3a) between G-type and C-type and this resulted in a difference in the structure of amino acids that effectively modified the protein properties of the genes. Whereas there is a similar amino acid composition between the non-cyanidiophycean red algae (outgroup) and C-type. G-type mitogenomes have a distinct amino acid composition (Fig. 3a). G-type genomes show a higher proportion of positively charged amino acids than those of C-type (G-type: 10.77–10.82%, C-type: 6.84–7.44%; see in Fig. S6). A lower negative charge amino acid composition was found in G-type genomes when compared to C-type (G-type: 2.26–2.47%, C-type: 4.06–4.21%; see in Fig. S6). Likewise, the influence of amino acid changes altered protein charge, hydrophilicity, and hydrophobicity (Fig. 3b, c).
Hydrophilicity, which was measured for 16 conserved mitochondrial proteins (Fig. 3b), representing the G-type species showed relatively higher hydrophilic amino acids in mitochondrial proteins (50.56–50.97%) than those in other red algal species (C-type and outgroup species: 41.64–43.49%). Because some genes showed dramatic difference in amino acid composition or in protein size, we examined individual gene hydropathy (a scale of hydrophobicity and hydrophilicity) to avoid a biased assessment [47, 48]. G-type proteins are clearly less hydropathic than other groups (Fig. 3c) and 11/16 mitochondrial genes in G-type tend to have reduced protein length (Fig. S7A) when compared to other species. These dramatic differences in amino acids (e.g., charge, length) of mitochondrial genes are critical to protein structure that can affect solubility, stability, and their functions . We applied in silico analysis (e.g., instability index, aliphatic index) to calculate the stability of conserved mitochondrial proteins and also found G-type and C-type have a few significant differences (Fig. S7B, C) in their mitochondrial proteins. The instability index estimates in vivo instability of proteins based on a comparison of dipeptides features between unstable proteins and stable proteins, whereas the aliphatic index is calculated using the relative volume of aliphatic side chains to estimated thermostability [50, 51]. Conserved mitochondrial genes in Cyanidiophyceae are mostly membrane-bound proteins or in mitochondria they form protein complexes (e.g., mitochondrial respiratory complexes). In these cases, protein folding, which is a key factor to understand protein stability and their activity, can be highly dependent on lipid composition of the mitochondrial membrane or a protein-protein interaction with other supermatrix-forming proteins [52, 53]. Therefore, these protein interactions with mitochondrial membrane lipids need further study.
Extreme GC-skew in Galdieria-type mitogenomes and its associated characteristics
On the basis of mitogenomes comparison, G-type mitogenomes have distinctive characteristics, such as high gene divergence and asymmetric nucleotide substitution. The difference in GC-contents between the C-type and G-type is pervasive across genomes (Fig. S8) showing that C-type species have lower GC-contents (25.0–27.1%) than that of G-type species (41.4–44.0%) excluding mesophilic Cd. chilense (44.5%). However, GC-skew (G-C/G + C) and AT-skew (A-T/A + T) are clearly different in C-type and G-type (see Table 1): C-type composed symmetric AT and GC composition balances (AT-skew: 0.01–0.03; GC-skew: 0.01–0.06). In contrast, G-type showed unbalanced AT composition (AT-skew: 0.25–0.29) and extremely asymmetric composition of GC nucleotides (GC-skew: 0.66–0.74). Whereas in a member of the C-type species, mesophilic Cd. chilense, the GC-content (44.5%) is close to those of G-type, but GC-skew (0.01) or AT-skew (0.03) of mesophilic Cd. chilense are more similar to other C-type mitogenomes. In other words, mesophilic Cd. chilense may be regarded as an intermediate state between C-type and G-type based on its genomic features and phylogenetic position.
All genes, including 17 CDSs, seven tRNAs, and two rRNAs, are located in a single strand of G-type G. sulphuraria 074 W mitogenome except for the anticlockwise cob gene, whereas genes in C-type Cz. merolae 10D mitogenome are distributed in both strands as usual (Fig. 4a). According to the directional distribution of genes and extreme GC-skew, genes in G-type mitogenome appear to be substantially strand-biased. The cob gene, which is located in an antisense orientation of G-type species, has lower GC-skew than average GC-skew of G-type mitogenomes (cob gene region GC-skew: 0.43–0.48, mitogenome GC-skew: 0.66–0.74) and shows a higher TIGER value compared to other genes meaning that cob gene contains lower variable sites (TIGER value of cob gene: 0.766, average TIGER value: 0.630; see Fig. S4, Table S2). Based on these observations, we examined the potential impact of extreme GC-skew on G-type mitogenomes.
One of the key approaches to distinguish leading and lagging strands is analysis of the GC-skew; a positive GC-skew reflects the leading strand, whereas a negative GC-skew represents the lagging strand in most cases [54, 55]. GC-skew analysis of the G-type shows all positive values and its cumulative GC-skew is gradually increased without any decreasing points unlike other red algal species, including C-type (Fig. 4a; Fig. S9). Mitogenomes with a positive GC-skew in a single strand have been well studied for their replication system, particularly in human mitochondria. Although DNA replication mechanisms in mitochondria are still under debate due to the absence of strong evidence for replication mode, unidirectional mitochondrial replication systems were reported in several mammalian mitochondria [56,57,58,59,60,61,62,63,64,65]. Furthermore, asymmetric DNA replication mechanism was reported in vertebrates based on the inversion of “control region (mtDNA non-coding fragment)”, which has strand-specific compositional bias [55, 66].
Unlike C-type species and other red algae, GC-skew analysis of G-type mitogenomes suggest that a leading strand and a lagging strand are separated into the individual strands (Fig. S9). Mitogenomes of G-type species have a guanine-rich leading strand (H-strand) and a cytosine-rich lagging strand (L-strand), thus, likely to have an asymmetric replication mechanism (Fig. 4). After the separation of L-strand from H-strand, without a bidirectional replication fork, a daughter lagging strand is continuously synthesized by a nascent leading strand and a nascent lagging strand synthesizes a daughter leading strand (Fig. 4b). The synthesis of lagging strand is considered to be more accurate than the leading strand due to lagging strand’s discontinuous replication (short Okazaki fragments) , which have been proposed in some mitochondria (e.g., yeast, fish) or bacterial species with experimental validation [67,68,69,70,71]. In addition, different gene substitution rates have been reported between two DNA strands of mitochondrial genome (e.g., higher in lagging strand of fish) [68, 72].
The mitogenome variants of Cyanidiococcus yangmingshanensis 8.1.23 F7 strain (C-type representative) and Galdieria sulphuraria 108.79 E11 strain (G-type representative), derived from a single cell, were verified to test replication-associated mutational stress. Illumina short-read data were used to identify SNPs in mitogenomes, and mapping coverages were 7931x for Cc. yangmingshanensis mitogenome and 7896x for G. sulphuraria mitogenome. Using GATK4 and Picard to call variants, we identified a single SNP from non-coding region of Cc. yangmingshanensis 8.1.23 F7 and four SNPs from non-coding region of G. sulphuraria 108.79 E11 (Table S3). Although the size of mitogenome is ~ 1.5x larger in Cc. yangmingshanensis 8.1.23 F7 than G. sulphuraria 108.79 E11, G. sulphuraria 108.79 E11 mitogenome (four SNPs) has more SNPs than Cc. yangmingshanensis 8.1.23 F7 mitogenome (a single SNP). More frequent mitogenome variants of G-type (0.031 SNPs/kbp) compared to C-type (0.185 SNPs/kbp) may explain the effect of replication-associated mutational stress . Taken together, G-type species have higher mutation rates that are likely to be accelerated by unidirectional replication (Fig. 4b), which have also been proposed in the case of strand-the asymmetric model from inverted mitochondrial replication of vertebrate mitochondria .
On the basis of these findings, we propose unidirectional replication in G-type mitochondria that may have led to higher divergence than in the C-type and other non-cyanidiophycean red algae, which have bidirectional mitogenome replication. Because of mitochondrial replication system divergence, G-type mitogenomes may have different uses of the DNA polymerases described in Jain et al. , resulting in sequence variation including SNPs in G-type mitochondrial DNA. Such accelerated mutation rates of the two DNA strands may potentially contribute to G-type mitogenomes having a higher fitness in rapidly changing environments and play a role in adaptation . We speculate that extreme GC-skew in the G-type resulted in the use of unidirectional replication that led to changes in protein properties, affecting the fitness of mitochondrial proteins in harsh environments.
The rapid evolution we report here in Galdieria-type mitogenomes (i.e., long branches of G-type in the molecular phylogenies) may be explained by the fact that Cyanidiophyceae species inhabit extreme conditions unlike other mesophilic red algae. Among Cyanidiophyceae, Cyanidium-type (C-type) species inhabit ecologically more protected niches (e.g., aquatic habitats such as hot springs and ditches) within extreme environments than do G-type species. Therefore, they are likely to be less prone to environmental pressures such as temperature or pH fluctuation than Galdieria species [4, 10]. Reactive oxygen species in the internal environment of mitochondria, mitochondria replication, and absence or degeneration of mitochondrial DNA repair systems can elevate mitochondrial mutation . In cyanidiophycean mitochondria, high temperature and an acidic environment are potential oxidative stress inducers, which are detrimental to cellular components like DNA or their repair system. Along with these common factors, another mutagenic factor could be the unidirectional replication mode of G-type mitogenomes. Strand-based compositional asymmetry of the two separate replicated strands can be introduced through replication-associated mutational stress . However, in comparison to other eukaryotic mitogenomes that also have unidirectional replication, cyanidiophycean species have been exposed to extreme habitats and diverged into the G-type and C-type more than 800 Mya . Since then, the unidirectional mitochondrial DNA replication system in G-type species has facilitated mutagenesis in stressful habitats over a long evolutionary time span, resulting in exceptionally divergent G-type mitochondria that is not observed in unidirectional replication systems of other red algae or even in nuclear and plastid of G-type species. Specifically, we assume that dominant unidirectional DNA replication was established in the ancestral population of G-type mitochondria. Unlike the nuclear genome, mitochondria are subject to rapid genetic drift due to predominantly uniparental inheritance and lack of sexual recombination . Thus, in asexual species, the reduced effective population size of mitochondria, which weakens the power of natural selection (e.g., “double rachet” model) , may have allowed the initial survival of mutations that proved adaptive in the longer run, such as G-type mitochondrial DNA that confers thermostability of DNA and RNA (Supplementary Information 3).
However, because of their long evolutionary history, it is difficult, if not impossible, to assign a cause-and-effect relationship between a large mutagenesis event and unique trait evolution (e.g., protein properties). Interestingly, EGT-derived genes that were moved to the nucleus in the early stages of eukaryote evolution accumulated fewer mutations than mitochondrial-encoded genes (Supplementary Information 4). In addition, mutations of G-type mitochondrial genes are found to be under purifying selection, similar to mammalian mitochondrial genes , based on analysis of the nonsynonymous and synonymous mutation (Ka/Ks) ratio (Fig. S11). This suggests that G-type mitogenomes acquired mutations faster than other C-type mitogenomes or non-cyanidiophycean red algal mitogenomes during the same time period. The reason why these changes are restricted to the G-type mitochondrial genome and are not evident in plastid or nuclear DNA remains unclear. In summary, we speculate that a combination of factors (e.g., unidirectional replication system, polyextreme habitats, heterotrophic metabolism, reduced effective population size of mitochondria) have driven G-type mitogenome evolution. These results lay the foundation for future studies to better understand the biology of the intriguing Cyanidiophyceae.
Cyanidium caldarium samples were obtained from the predecessor strain (Cd. caldarium ACUF 019; Siena, Italy) established by Claudia Ciniglia. Due to strain contamination issues in some isolated strains, single cells from two established strains of Cyanidiococcus yangmingshanensis (Cc. yangmingshanensis 8.1.23; Kula Manisa, Turkey) and Galdieria sulphuraria (G. sulphuraria SAG 108.79; Yellow Stone National Park, USA) were isolated using fluorescence-activated cell sorting (FACS) to establish single cell derived culture strains. After FACS isolation, and initial cultivation in a liquid medium (5x Allen medium), the cells were spread on a sucrose-agar medium, and an individual colony from each species (108.79: isolate E11, 8.1.23: isolate F7) was transferred to liquid media to promote growth. Blue-green biofilms were obtained from the shaded side of the Sybil Cave tuff wall (Cume, Italy) to collect mesophilic Cyanidium chilense (Cd. chilense). Collected samples were mixed with 15 mL phosphate-buffered saline (PBS) and centrifuged briefly at 550 rpm, repeatedly, to harvest cells from environmental samples.
Transmission electron microscopy
Samples of G. sulphuraria 108.79 E11 and Cc. yangmingshanensis 8.1.23 F7 were initially fixed in a solution of 1.5% glutaraldehyde + 8% sucrose in 0.1 M phosphate buffer (pH 7.4) and incubated overnight at 4 °C. Fixed samples were rinsed for 10 min and centrifuged at 12,000 rpm for 5 min. After repeating the rinse three times, 1% osmium tetroxide in 0.1 M phosphate buffer was treated to the samples for 1.5 h. We repeated this step three times by rinsing the samples for 10 min and centrifuging at 12,000 rpm for 5 min. Pellets of fixed cyanidiophycean cells were dislodged and solidified in 1% agarose to divide them into the appropriate size for the section. The dehydration step in ethanol was done after the sectioning step and the pellet segments were embedded in Spurr’s resin. Samples were cut into 70 nm thickness and stained with uranyl acetate and lead citrate. The samples were observed using a Bio-HVEM System (JEM-1400 Plus at 100 kV and JEM-1000BEF at 1000 kV [JEOL, Japan]) at the Korea Basic Science Institute (Ochang, Korea).
DNA extraction and rbcL sequencing
The genomic DNA of three culture strains (Cd. caldarium ACUF 019, Cc. yangmingshanensis 8.1.23 F7, G. sulphuraria 108.79 E11) was extracted by DNeasy Plant Mini Kit (Qiagen, Hilden, Germany) and purified using the LaboPass™ DNA Isolation Kit (Cosmo Genetech, Seoul, Korea). Genomic DNA of environmental samples from Sybil Cave was extracted using the FastDNA® SPIN Kit for Soil (MP Bio, Santa Ana, USA). To identify species, rbcL gene was amplified using DNA KOD FX Neo (Toyobo, Osaka, Japan) polymerase from each strain. A different combination of rbcL primer sets were used for PCR; rbcL_rc_214F: 5′-GTTGTWTGGACWGATTTATTAAC-3′ (23 mers), rbcL_rc_1234R: 5′-GCTTGWATWCCATCTGGATC-3′ (20 mers), rbcL_90F: 5′-CCATATGCYAAAATGGGATATTGG-3′ (24 mers), rbcL_R: 5′-ACATTTGCTGTTGGAGTCTC-3′ (20 mers) . Amplified DNAs were purified by LaboPass™ PCR Purification Kit (COSMO Genetech, Seoul, Korea) and products were sent to a sequencing company (Macrogen, Seoul, Korea). We manually removed low quality sequences and merged forward and reverse strand sequences into a single aligned sequence.
Whole genome sequencing and mitogenome analysis
The raw data of Galdieria phlegrea DBV 009 (Naples, Italy), sequenced in a previous study , was used here to complete the mitogenome. Other whole genome data were generated using the Illumina HiSeq 2500 (Illumina, San Diego, USA) with 2 × 100 bp paired-end sequencing at the DNA Link (Seoul, Korea). Reads were assembled into contigs using SPAdes assembler v3.10.1  and mitogenomes were constructed following the established procedure described in Cho et al., 2018  using two published mitogenomes (NC_000887, NC_024666) as reference. Using the mitochondrial proteins of Cz. merolae 10D , assembled contigs that include mitochondrial genes were selected by tBLASTn. Next, the sorted contigs were re-assembled to get a scaffold sequence of mitogenomes using Geneious Prime 2019.0.3 (Biomatters, Auckland, New Zealand). After the re-assembly step, a complete circular mitogenome or partial linear mitogenome contigs were obtained from each species. The completeness of these mitogenome sequences were tested by LASTZ  and by comparison with two published mitogenomes (NC_000887: Cz. merolae 10D, NC_024666: G. sulphuraria 074 W). The genome sequences were mapped with raw reads to correct sequencing errors by mapping coverage or filling gaps for a linear mitogenome.
Protein coding genes (CDS) were manually annotated based on a homologous region of existing mitogenomes (NC_000887, NC_024666, KJ569774). The “standard genetic code 1” has been generally used in cyanidiophycean mitogenomes [30, 31] without clear justification, although there is a report of using “protozoan mitochondrial genetic code 4” in the Gigartinales (red algae) . To test which genetic code is better suited to our data, we translated the genes using both genetic codes, “standard genetic code 1” and “protozoan mitochondrial genetic code 4”. Because the “protozoan mitochondrial genetic code 4” resulted a 5′-extension that resulted in a gene annotation error (see Fig. S12), we chose to use the “standard genetic code 1” in this study. The ribosomal RNA sequences were acquired by BLASTn searching against other mitochondrial rDNA sequences of Cyanidiophyceae. tRNAs, and other small RNAs were searched by using tRNAscan-SE 2.0  and ARAGORN . For comparative analysis using mitochondrial genes, several genes and ambiguous sequences from the published mitogenomes were re-annotated (Table S5) in this study . GC-content was calculated with a sliding window scale of 48 bp, and the GC-skew of each species was calculated (window size: 1 kbp, step: 20 bp) using a Python script. Potential G-quadruplex forming sequences were predicted by G4Hunter using a default option .
To identify the phylogenetic relationship of the newly sequenced strains among the taxon-rich rbcL phylogeny, all available rbcL sequences of Cyanidiophyceae were collected from the NCBI nucleotide database (ntDB). After discarding the duplicated data, a total of 269 rbcL sequences including the five generated here was used for phylogenetic analysis with 10 other red algal species (i.e., Rhodophytina) as outgroups. For phylogenetic analysis of mitochondrial genes, eight Cyanidiophyceae taxa were selected, along with four other red algae as outgroups. The outgroup taxa included two species from Florideophyceae (Chondrus crispus and Hildenbrandia rubra), one species from Bangiophyceae (Porphyra purpurea), and one species from Compsopogonophyceae (Compsopogon caeruleus). A total of 32 mitochondrial proteins presents in at least five species out of eight cyanidiophycean species were concatenated for the ML analysis. Gene alignments were produced using MAFFT v7.310  with the default options. Maximum likelihood (ML) phylogenetic analysis with automated model selection was done using IQ-TREE v1.6.8  with 1000 ultrafast bootstrap replications. The model test option in IQ-TREE was applied to each protein in this alignment.
Comparison of mitochondrial genes
Because most of datasets used for statistical tests did not show a normal distribution, the ‘Independent-Samples Kruskal-Wallis Test’ was used to evaluate hypotheses (Table S6). The null hypothesis was that the distribution of factor (e.g., amino acid composition) is identical among taxa from two ingroup clades (i.e., Cyanidium-type and Galdieria-type) and outgroup taxa. If the null hypothesis is rejected, pairwise comparisons among three groups were applied (Table S7). Sixteen conserved mitochondrial genes (sdhC, atp4, atp6, atp8, atp9, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, nad6) that all encoded in mitogenomes of 13 species were chosen to test amino acid properties.
The alignment of each protein and their ML trees were used for pairwise comparisons and all results were summarized in Table S8. Only conserved sites excluding any gaps were selected from the protein alignment of conserved genes for analysis (3845/4825 sites). Average proportions of amino acids were recorded in each group and the values were visualized with standard errors.
The GRAVY value, which is determined by the total hydropathic values of all amino acids in the protein divided by its length, is used to represent its hydrophobic or hydrophilic properties for analysis. To estimate protein stability, two different types of indices were used in this study. The aliphatic index indicating thermostability of the globular protein is calculated by the relative volume of aliphatic side chains in the protein , and the instability index provides an estimate of in vivo protein stability based on protein dipeptide differences . To compare protein properties for 16 conserved mitochondrial genes, GRAVY (grand average of hydropathy), aliphatic index, and instability index were calculated using ProtParam in ExPASy , and the results were summarized in Table S9. The mean values of each feature were illustrated with bar plots with standard error bars.
For the EGT-derived gene detection and transit peptide prediction, nuclear genomes of G. sulphuraria 074 W and Cz. merolae 10D were used as references to find mitochondrion-derived nuclear-encoded genes [28, 39, 89]. EGT genes were further verified based on homologous search by MMSeqs2 . Transit peptide sequences in EGT candidates were predicted by TargetP .
Mitogenome variant calling
Due to sequences of mixed species (Cyanidium caldarium) from the published genome data of Galdieria phlegrea DBV 009 , we only chose two FACS-derived strains (Cyanidiococcus yangmingshanensis 8.1.23 F7 and Galdieria sulphuraria 108.79 E11) to identify single nucleotide polymorphisms (SNPs). Bowtie2 v.188.8.131.52  was used to map the sequencing data to each mitogenome (Table S3). Based on these mapping information, we used Picard v1.4.2 (see http://broadinstitute.github.io/picard/) and GATK4 v184.108.40.206  to identify SNPs.
Estimation of evolutionary rate and analysis of selection
Concatenated datasets of mitochondrial proteins were divided into 64 separate partitions (‘-b 100’: allocate the sites in 1 to 100 based on the evolutionary rates) based on TIGER values (Tables S1, S2), which calculate the evolutionary rates for each site on the basis of tree-independent approaches to differentially weighted characters . Non-synonymous substitutions per non-synonymous sites (Ka) and synonymous substitutions per synonymous sites (Ks) have been widely used to determine evolutionary selection on genes . A total of 16 conserved mitochondrial genes were selected for alignments to calculate ratios of non-synonymous substitutions per non-synonymous sites and synonymous substitutions per synonymous sites (Ka/Ks), which have been widely used to determine the evolutionary selection on genes . ParaAT  was used with MAFFT version 7  and KaKs_Calculator  to do this analysis.
Availability of data and materials
The mitogenome sequences have been deposited at GenBank under the accession numbers MT270115-MT270119. All sequence alignments and data are available from Dryad Digital Repository (https://doi.org/10.5061/dryad.nvx0k6dp7).
Reeb V, Bhattacharya D. The thermo-acidophilic cyanidiophyceae (Cyanidiales). In: Seckbach J, Chapman D, editors. Red Algae in the Genomic Age: Springer; 2010. p. 409–26. https://doi.org/10.1007/978-90-481-3795-4.
Hough DW, Danson MJ. Extremozymes. Curr Opin Chem Biol. 1999;3(1):39–46.
Rothschild LJ, Mancinelli RL. Life in extreme environments. Nature. 2001;409(6823):1092.
Ciniglia C, Yoon HS, Pollio A, Pinto G, Bhattacharya D. Hidden biodiversity of the extremophilic Cyanidiales red algae. Mol Ecol. 2004;13(7):1827–38.
Yoon HS, Müller KM, Sheath RG, Ott FD, Bhattacharya D. Defining the major lineages of red algae (Rhodophyta). J Phycol. 2006;42(2):482–92.
Seckbach J. Systematic problems with Cyanidium caldarium and Galdieria sulphuraria and their implications for molecular studies. J Phycol. 1991;27(6):794–6.
Ott FD, Seckbach J. New classification for the genus Cyanidium Geitler 1933. In: Evolutionary pathways and enigmatic algae: Cyanidium caldarium (Rhodophyta) and related cells: Springer; 1994. p. 145–52. https://doi.org/10.1007/978-94-011-0882-9_9.
Pinto G, Albertano P, Ciniglia C, Cozzolino S, Pollio A, Yoon HS, Bhattacharya D. Comparative approaches to the taxonomy of the genus Galdieria Merola (Cyanidiales, Rhodophyta). Cryptogamie Algol. 2003;24(1):13–32.
Schwabe G. Uber einige Blaualgen aus dem mittleren und sudlichen Chile. Verh Dt Wiss Verein Santiago NF. 1936;3:113–74.
Yoon HS, Ciniglia C, Wu M, Comeron JM, Pinto G, Pollio A, Bhattacharya D. Establishment of endolithic populations of extremophilic Cyanidiales (Rhodophyta). BMC Evol Biol. 2006;6(1):78.
Iovinella M, Eren A, Pinto G, Pollio A, Davis SJ, Cennamo P, Ciniglia C. Cryptic dispersal in non-acidic environments from Turkey of Cyanidiophytina (Rhodophyta). Extremophiles. 2018.
Ciniglia C, Cennamo P, De Natale A, De Stefano M, Sirakov M, Iovinella M, Yoon HS, Pollio A. Cyanidium chilense (Cyanidiophyceae, Rhodophyta) from tuff rocks of the archeological site of Cuma, Italy. Phycological Res. 2019;67(4):311–9.
Yang EC, Boo SM, Bhattacharya D, Saunders GW, Knoll AH, Fredericq S, Graf L, Yoon HS. Divergence time estimates and the evolution of major lineages in the florideophyte red algae. Sci Rep. 2016;6:21361.
Hsieh CJ, Zhan SH, Lin Y, Tang SL, Liu SL. Analysis of rbcL sequences reveals the global biodiversity, community structure, and biogeographical pattern of thermoacidophilic red algae (Cyanidiales). J Phycol. 2015;51(4):682–94.
Yoon HS, Nelson W, Lindstrom SC, Boo SM, Pueschel C, Qiu H, Bhattacharya D. Rhodophyta. In: Archibald JM, AGB S, Slamovits CH, editors. Handbook of the Protists. Cham: Springer International Publishing; 2017. p. 89–133.
Toplin JA, Norris TB, Lehr CR, McDermott TR, Castenholz RW. Biogeographic and phylogenetic diversity of thermoacidophilic Cyanidiales in Yellowstone National Park, Japan, and New Zealand. Appl Environ Microbiol. 2008;74(9):2822–33.
Skorupa DJ, Reeb V, Castenholz RW, Bhattacharya D, McDermott TR. Cyanidiales diversity in Yellowstone national park. Lett Appl Microbiol. 2013;57(5):459–66.
Ciniglia C, Yang EC, Pollio A, Pinto G, Iovinella M, Vitale L, Yoon HS. Cyanidiophyceae in Iceland: plastid rbcL gene elucidates origin and dispersal of extremophilic Galdieria sulphuraria and G. maxima (Galdieriaceae, Rhodophyta). Phycologia. 2014;53(6):542–51.
Gross W, Küver J, Tischendorf G, Bouchaala N, Büsch W. Cryptoendolithic growth of the red alga Galdieria sulphuraria in volcanic areas. Eur J Phycol. 1998;33(1):25–31.
Gross W. Revision of comparative traits for the acido-and thermophilic red algae Cyanidium and Galdieria. In: Enigmatic Microorganisms and life in Extreme Environments: Springer; 1999. p. 437–46. https://doi.org/10.1007/978-94-011-4838-2_34.
Albertano P, Ciniglia C, Pinto G, Pollio A. The taxonomic position of Cyanidium, Cyanidioschyzon and Galdieria: an update. Hydrobiologia. 2000;433(1–3):137–43.
Gross W, Oesterhelt C, Tischendorf G, Lederer F. Characterization of a non-thermophilic strain of the red algal genus Galdieria isolated from Soos (Czech Republic). Eur J Phycol. 2002;37(3):477–83.
Ciniglia C, Pinto G, Pollio A. Cyanidium from caves: a reinstatement of Cyanidium chilense Schwabe (Cyanidiophytina, Rhodophyta). Phytotaxa. 2017;295(1):86–8.
Barcytė D, Nedbalova L, Culka A, Košek F, Jehlička J. Burning coal spoil heaps as a new habitat for the extremophilic red alga Galdieria sulphuraria. Fottea. 2018;18(1):19–29.
Qiu H, Rossoni AW, Weber APM, Yoon HS, Bhattacharya D. Unexpected conservation of the RNA splicing apparatus in the highly streamlined genome of Galdieria sulphuraria. BMC Evol Biol. 2018;18(1):41.
Gross W, Schnarrenberger C. Heterotrophic growth of two strains of the acido-thermophilic red alga Galdieria sulphuraria. Plant Cell Physiol. 1995;36(4):633–8.
Qiu H, Price DC, Weber APM, Reeb V, Chan Yang E, Lee JM, Kim SY, Yoon HS, Bhattacharya D. Adaptation through horizontal gene transfer in the cryptoendolithic red alga Galdieria phleagrea. Curr Biol. 2013;23(19):R865–6.
Schönknecht G, Chen W-H, Ternes CM, Barbier GG, Shrestha RP, Stanke M, Bräutigam A, Baker BJ, Banfield JF, Garavito RM. Gene transfer from bacteria and archaea facilitated evolution of an extremophilic eukaryote. Science. 2013;339(6124):1207–10.
Rossoni A, Price D, Seger M, Lyska D, Lammers P, Bhattacharya D, Weber A. The genomes of polyextremophilic Cyanidiales contain 1% horizontally transferred genes with diverse adaptive functions. eLife. 2019;8.
Jain K, Krause K, Grewe F, Nelson GF, Weber APM, Christensen AC, Mower JP. Extreme features of the Galdieria sulphuraria organellar genomes: a consequence of polyextremophily? Genome Biol Evol. 2015;7(1):367–80.
Ohta N, Sato N, Kuroiwa T. Structure and organization of the mitochondrial genome of the unicellular red alga Cyanidioschyzon merolae deduced from the complete nucleotide sequence. Nucleic Acids Res. 1998;26(22):5190–8.
Liu S-L, Chiang Y-R, Yoon HS, Fu H-Y: Comparative genome analysis reveals Cyanidiococcus gen. nov., a new extremophilic red algal genus sister to Cyanidioschyzon (Cyanidioschyzonaceae, Rhodophyta). J Phycol 2020, n/a(n/a).
Leblanc C, Richard O, Kloareg B, Viehmann S, Zetsche K, Boyen C. Origin and evolution of mitochondria: what have we learnt from red algae? Curr Genet. 1997;31(3):193–207.
Darling AE, Mau B, Perna NT. ProgressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One. 2010;5(6):e11147.
Hilker R, Sickinger C, Pedersen CN, Stoye J. UniMoG—a unifying framework for genomic distance calculation and sorting based on DCJ. Bioinformatics. 2012;28(19):2509–11.
Merola A, Castaldo R, Luca PD, Gambardella R, Musacchio A, Taddei R. Revision of Cyanidium caldarium. Three species of acidophilic algae. Plant Biosyst. 1981;115(4–5):189–95.
Sentsova UJ. On the diversity of acido-thermophilic unicellular algae of the genus Galdieria (Rhodophyta, Cyanidiophyceae). Botaničeskij Žurnal. 1991;76(1):69–78.
Suzuki K, Kawano S, Kuroiwa T. Single mitochondrion in acidic hot-spring alga: behaviour of mitochondria in Cyanidium caldarium and Galdieria sulphuraria (Rhodophyta, Cyanidiophyceae). Phycologia. 1994;33(4):298–300.
Matsuzaki M, Misumi O, Shin-i T, Maruyama S, Takahara M, Miyagishima S-Y, Mori T, Nishida K, Yagisawa F, Nishida K, et al. Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D. Nature. 2004;428:653.
Timmis JN, Ayliffe MA, Huang CY, Martin W. Endosymbiotic gene transfer: organelle genomes forge eukaryotic chromosomes. Nat Rev Genet. 2004;5(2):123.
Adams KL, Daley DO, Whelan J, Palmer JD. Genes for two mitochondrial ribosomal proteins in flowering plants are derived from their chloroplast or cytosolic counterparts. Plant Cell. 2002;14(4):931–43.
Smits P, Smeitink JAM, van den Heuvel LP, Huynen MA, Ettema TJG. Reconstructing the evolution of the mitochondrial ribosomal proteome. Nucleic Acids Res. 2007;35(14):4686–703.
Desmond E, Brochier-Armanet C, Forterre P, Gribaldo S. On the last common ancestor and early evolution of eukaryotes: reconstructing the history of mitochondrial ribosomes. Res Microbiol. 2011;162(1):53–70.
Elleuche S, Schröder C, Sahm K, Antranikian G. Extremozymes—biocatalysts with unique properties from extremophilic microorganisms. Curr Opin Biotechnol. 2014;29:116–23.
Jaenicke R. Stability and stabilization of globular proteins in solution. J Biotechnol. 2000;79(3):193–203.
Das R, Gerstein M. The stability of thermophilic proteins: a study based on comprehensive genome comparison. Funct Integr Genomics. 2000;1(1):76–88.
Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. J Mol Biol. 1982;157(1):105–32.
Gasteiger E, Hoogland C, Gattiker A, Wilkins MR, Appel RD, Bairoch A. Protein identification and analysis tools on the ExPASy server. In: Walker JM, editor. The Proteomics Protocols Handbook: Springer; 2005. p. 571–607. https://doi.org/10.1385/1-59259-890-0:571.
DePristo MA, Weinreich DM, Hartl DL. Missense meanderings in sequence space: a biophysical view of protein evolution. Nat Rev Genet. 2005;6(9):678–87.
Ikai A. Thermostability and aliphatic index of globular proteins. J Biochem. 1980;88(6):1895–8.
Guruprasad K, Reddy BB, Pandit MW. Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng Des Sel. 1990;4(2):155–61.
Hong H, Joh NH, Bowie JU, Tamm LK. Methods for measuring the thermodynamic stability of membrane proteins. In: Johnson ML, Holt JM, Ackers GK, editors. Methods in Enzymology, vol. 455: Academic Press; 2009. p. 213–36. https://doi.org/10.1016/S0076-6879(08)04208-0.
Tamm LK, Hong H, Liang B. Folding and assembly of β-barrel membrane proteins. Biochim Biophys Acta Biomembr. 2004;1666(1–2):250–63.
Grigoriev A. Analyzing genomes with cumulative skew diagrams. Nucleic Acids Res. 1998;26(10):2286–90.
Necşulea A, Lobry JR. A new method for assessing the effect of replication on DNA base composition asymmetry. Mol Biol Evol. 2007;24(10):2169–79.
Yasukawa T, Yang M-Y, Jacobs HT, Holt IJ. A bidirectional origin of replication maps to the major noncoding region of human mitochondrial DNA. Mol Cell. 2005;18(6):651–62.
Yasukawa T, Reyes A, Cluett TJ, Yang M-Y, Bowmaker M, Jacobs HT, Holt IJ. Replication of vertebrate mitochondrial DNA entails transient ribonucleotide incorporation throughout the lagging strand. EMBO J. 2006;25(22):5358–71.
Pohjoismäki JLO, Holmes JB, Wood SR, Yang M-Y, Yasukawa T, Reyes A, Bailey LJ, Cluett TJ, Goffart S, Willcox S, et al. Mammalian mitochondrial DNA replication intermediates are essentially duplex but contain extensive tracts of RNA/DNA hybrid. J Mol Biol. 2010;397(5):1144–55.
Pohjoismäki JL, Goffart S. Of circles, forks and humanity: topological organisation and replication of mammalian mitochondrial DNA. Bioessays. 2011;33(4):290–9.
Holt IJ, Reyes A. Human mitochondrial DNA replication. Cold Spring Harb Perspect Biol. 2012;4(12):a012971.
Lujan SA, Williams JS, Pursell ZF, Abdulovic-Cui AA, Clark AB, McElhinny SAN, Kunkel TA. Mismatch repair balances leading and lagging strand DNA replication fidelity. PLoS Genet. 2012;8(10):e1003016.
Reyes A, Kazak L, Wood SR, Yasukawa T, Jacobs HT, Holt IJ. Mitochondrial DNA replication proceeds via a 'bootlace' mechanism involving the incorporation of processed transcripts. Nucleic Acids Res. 2013;41(11):5837–50.
Miralles Fusté J, Shi Y, Wanrooij S, Zhu X, Jemt E, Persson Ö, Sabouri N, Gustafsson CM, Falkenberg M. In vivo occupancy of mitochondrial single-stranded DNA binding protein supports the strand displacement mode of DNA replication. PLoS Genet. 2014;10(12):e1004832.
Yasukawa T, Kang D. An overview of mammalian mitochondrial DNA replication mechanisms. J Biochem. 2018;164(3):183–93.
Herbers E, Kekäläinen NJ, Hangas A, Pohjoismäki JL, Goffart S. Tissue specific differences in mitochondrial DNA maintenance and expression. Mitochondrion. 2019;44:85–92.
Fonseca MM, Harris DJ, Posada D. The inversion of the control region in three mitogenomes provides further evidence for an asymmetric model of vertebrate mtDNA replication. PLoS One. 2014;9(9):e106654.
Fijalkowska IJ, Jonczyk P, Tkaczyk MM, Bialoskorska M, Schaaper RM. Unequal fidelity of leading strand and lagging strand DNA replication on the Escherichia coli chromosome. Proc Natl Acad Sci U S A. 1998;95(17):10020.
Bielawski JP, Gold JR. Mutation patterns of mitochondrial H- and L-strand DNA in closely related cyprinid fishes. Genetics. 2002;161(4):1589–97.
Pavlov YI, Mian IM, Kunkel TA. Evidence for preferential mismatch repair of lagging strand DNA replication errors in yeast. Curr Biol. 2003;13(9):744–8.
Kunkel TA, Burgers PM. Dividing the workload at a eukaryotic replication fork. Trends Cell Biol. 2008;18(11):521–7.
Kono N, Tomita M, Arakawa K. Accelerated laboratory evolution reveals the influence of replication on the GC skew in Escherichia coli. Genome Biol Evol. 2018;10(11):3110–7.
Reyes A, Gissi C, Pesole G, Saccone C. Asymmetrical directional mutation pressure in the mitochondrial genome of mammals. Mol Biol Evol. 1998;15(8):957–66.
Lin Q, Cui P, Ding F, Hu S, Yu J. Replication-associated mutational pressure (RMP) governs strand-biased compositional asymmetry (SCA) and gene organization in animal mitochondrial genomes. Curr Genomics. 2012;13(1):28–36.
Szczepanik D, Mackiewicz P, Kowalczuk M, Gierlik A, Nowicka A, Dudek MR, Cebrat S. Evolution rates of genes on leading and lagging DNA strands. J Mol Evol. 2001;52(5):426–33.
Lynch M, Koskella B, Schaack S. Mutation pressure and the evolution of organelle genomic architecture. Science. 2006;311(5768):1727.
Bergstrom CT, Pritchard J. Germline bottlenecks and the evolutionary maintenance of mitochondrial genomes. Genetics. 1998;149(4):2135–46.
Stewart JB, Freyer C, Elson JL, Wredenberg A, Cansu Z, Trifunovic A, Larsson N-G. Strong purifying selection in transmission of mammalian mitochondrial DNA. PLoS Biol. 2008;6(1):e10.
Yoon HS, Hackett JD, Bhattacharya D. A single origin of the peridinin-and fucoxanthin-containing plastids in dinoflagellates through tertiary endosymbiosis. Proc Natl Acad Sci U S A. 2002;99(18):11724–9.
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19(5):455–77.
Cho CH, Choi JW, Lam DW, Kim KM, Yoon HS. Plastid genome analysis of three Nemaliophycidae red algal species suggests environmental adaptation for iron limited habitats. PLoS One. 2018;13(5):e0196995.
Harris RS. Improved pairwise alignmnet of genomic DNA. Dissertation: Pennsylvania State University; 2007.
Boyen C, Leblanc C, Bonnard G, Grienenberger JM, Kloareg B. Nucleotide sequence of the cox3 gene from Chondrus crispus: evidence that UGA encodes tryptophan and evolutionary implications. Nucleic Acids Res. 1994;22(8):1400–3.
Lowe TM, Chan PP. tRNAscan-SE on-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 2016;44(W1):W54–7.
Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 2004;32(1):11–6.
Salomaki ED, Lane CE. Red algal mitochondrial genomes are more complete than previously reported. Genome Biol Evol. 2017;9(1):48–63.
Bedrat A, Lacroix L, Mergny J-L. Re-evaluation of G-quadruplex propensity with G4Hunter. Nucleic Acids Res. 2016;44(4):1746–59.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.
Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2014;32(1):268–74.
Nozaki H, Takano H, Misumi O, Terasawa K, Matsuzaki M, Maruyama S, Nishida K, Yagisawa F, Yoshida Y, Fujiwara T, et al. A 100%-complete sequence reveals unusually simple genomic features in the hot-spring red alga Cyanidioschyzon merolae. BMC Biol. 2007;5(1):28.
Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol. 2017;35(11):1026.
Emanuelsson O, Brunak S, Von Heijne G, Nielsen H. Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protoc. 2007;2(4):953.
Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nat Methods. 2012;9(4):357–9.
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43(5):491–8.
Cummins CA, McInerney JO. A method for inferring the rate of evolution of homologous characters that can potentially improve phylogenetic inference, resolve deep divergence and correct systematic biases. Syst Biol. 2011;60(6):833–44.
Kryazhimskiy S, Plotkin JB. The population genetics of dN/dS. PLoS Genet. 2008;4(12):e1000304.
Zhang Z, Xiao J, Wu J, Zhang H, Liu G, Wang X, Dai L. ParaAT: a parallel tool for constructing multiple protein-coding DNA alignments. Biochem Biophys Res Commun. 2012;419(4):779–81.
Zhang Z, Li J, Zhao X-Q, Wang J, Wong GK-S, Yu J. KaKs_Calculator: calculating Ka and Ks through model selection and model averaging. Genom Proteomics Bioinform. 2006;4(4):259–63.
The authors thank for the assistance and constructive comments from Soyeong Park, Dongseok Kim, Seung Su Han, and Sunghoon Jang. We would also like to thank Minseok Jeong, and Woongghi Shin for taking the microscopic images of cyanidiophycean cells, and Eunyoung Moon, and the Korea Basic Science Institute (KBSI) for providing electron microscopy facilities.
This study was supported by the Collaborative Genome Program of the KIMST funded by the MOF , the NRF of Korea [NRF-2017R1A2B3001923], the Next-generation BioGreen21 Program [PJ01389003] from the RDA, Korea. D.B. was supported by a NIFA-USDA Hatch grant [NJ01170]. The funding bodies played no role in the design of the study and sample collection, analysis, and interpretation and in writing the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The authors identified an error in Fig. 1 and Fig. 4. The correct figures are given below: Fig. 1. Fig. 4. The correct figures and captions have been included in this correction, and the original article has been corrected.
Supplementary materials (pdf format), contain Supplementary Information 1–4, Supplementary Figures. S1-S12, and Supplementary References (references that were used in Supplementary Information 1–4). Supplementary Information 1–4: additional information describing the results of this study but not the necessary information to be provided in the main text. Supplementary Figures. S1-S13: additional data with legends to support the manuscript.
Supplementary materials (xlsx format), contain Supplementary Tables S1-S9. Supplementary Tables S1-S9: additional data with legends to support the manuscript.
About this article
Cite this article
Cho, C.H., Park, S., Ciniglia, C. et al. Potential causes and consequences of rapid mitochondrial genome evolution in thermoacidophilic Galdieria (Rhodophyta). BMC Evol Biol 20, 112 (2020). https://doi.org/10.1186/s12862-020-01677-6
- Mitogenome evolution
- Protein divergence
- Mitochondrial DNA replication