Inteins, introns, and homing endonucleases: recent revelations about the life cycle of parasitic genetic elements
BMC Evolutionary Biology volume 6, Article number: 94 (2006)
Self splicing introns and inteins that rely on a homing endonuclease for propagation are parasitic genetic elements. Their life-cycle and evolutionary fate has been described through the homing cycle. According to this model the homing endonuclease is selected for function only during the spreading phase of the parasite. This phase ends when the parasitic element is fixed in the population. Upon fixation the homing endonuclease is no longer under selection, and its activity is lost through random processes. Recent analyses of these parasitic elements with functional homing endonucleases suggest that this model in its most simple form is not always applicable. Apparently, functioning homing endonuclease can persist over long evolutionary times in populations and species that are thought to be asexual or nearly asexual. Here we review these recent findings and discuss their implications. Reasons for the long-term persistence of a functional homing endonuclease include: More recombination (sexual and as a result of gene transfer) than previously assumed for these organisms; complex population structures that prevent the element from being fixed; a balance between active spreading of the homing endonuclease and a decrease in fitness caused by the parasite in the host organism; or a function of the homing endonuclease that increases the fitness of the host organism and results in purifying selection for the homing endonuclease activity, even after fixation in a local population. In the future, more detailed studies of the population dynamics of the activity and regulation of homing endonucleases are needed to decide between these possibilities, and to determine their relative contributions to the long term survival of parasitic genes within a population. Two outstanding publications on the amoeba Naegleria group I intron (Wikmark et al. BMC Evol Biol 2006, 6: 39) and the PRP8 inteins in ascomycetes (Butler et al.BMC Evol Biol 2006, 6: 42) provide important stepping stones towards integrated studies on how these parasitic elements evolve through time together with, or despite, their hosts.
In the organism centered view of evolution, the individual is considered the unit of selection. By being fit, the individual's genes have a greater chance to become fixed in the population. Challenging this view, Dawkins introduced a gene centered view of evolution  where the individual organism is just a vessel created by selfish genes. Cooperative genes rely on the individual's success – they increase their frequency in the population through increasing the fitness of the organism; in contrast, the perpetuation of parasitic genes through generations is independent of the host's fitness. The molecular parasites have gained their own individuality, and to understand their evolution, one has to consider their life cycle. Frequently, these parasitic elements are labeled as selfish genes; however, according to Dawkins  all genes are selfish, thus we prefer the label "parasitic" to reflect the fact that these genes are not cooperating to the benefit of the host organism.
Self splicing introns and inteins are remarkable examples of parasitic elements [2–7]. These molecular parasites can excise themselves of the host molecule (mRNA, rRNA or tRNA for introns; protein for inteins) and ligate the ends of the host molecule without perturbing its biological function. Some of these molecular parasites are equipped with homing endonucleases (HE) [8–10], which produce a single double-strand cut in the genomic DNA, usually in the intein- or intron-free allele of the infected gene. During DNA repair, the parasitic element is copied into the previously empty allele. In sexual populations this invasion of uninfected alleles leads to super Mendelian inheritance of the molecular parasites. It is curious to note that the spread of these parasites in the population relies on the different alleles being brought together through sex or gene transfer; however, many mobile elements with HE activity are found in organisms and organelles thought to be nearly clonal (e.g., mitochondria and prokaryotes) or relying on asexual genetic exchange, e.g., fungi  and amoeba  without known sexual recombination.
The fitness of the molecular parasite and the fitness of the organismal host walk separate roads [8, 13]. For the parasite, fitness is reflected in its ability to spread in the population. Because of its splicing activity the parasite does not impact on the host protein function. The parasite actively colonizes all individuals until the entire population contains the intron/intein. Selection acts upon the splicing properties at all times, but for the HE activity selection only operates during invasion. When the parasitic element is fixed in the population, the HE function decays and is eventually lost. Goddard and Burt  first formulated the homing cycle (Fig. 1) for introns with HE activity; a modified model was applied to inteins [3, 14].
Conceptually, the homing cycle should be formulated for interbreeding populations , i.e., the units in which the parasitic genetic element can be fixed; however, often species, not populations, were considered as units in which the HE containing element was fixed, and the presence of a gene with active HE was interpreted as reflecting gene exchange across species boundaries. These assumptions were frequently confirmed by more detailed analysis of species, host protein, and HE phylogenies [13–16]. The presence of HE containing introns and inteins thus becomes an indicator for DNA exchange events within or across the species boundaries, for example, transfers between mitochondrial genomes in plants and fungi [13–16], or from eukaryotes to extreme thermophilic bacteria .
Recently discovered limitations of the homing cycle model
Several recent findings challenge the general applicability of the homing cycle: in Naegleria, an apparently asexual amoeba, an intron with HE was reported to be of ancient origin, frequently lost in different lineages, but persisting as a functional enzyme in others . And in three different orders of euascomycetes (Pezizomycota) inteins were discovered in the PRP8 gene , all inserted at the same location within the gene. The PRP8 intein has been horizontally transferred between euascomycetes and the basidiomycete Cryptococcus  but no evidence for transfer between the different lineages of euascomycetes was detected. Some of the PRP8 inteins in euascomycetes contain a functional HE under purifying selection, as judged by the ratio of synonymous to non-synonymous substitutions . According to a recent attempt to date fungal evolution , the groups of euascomycetes that contain a PRP8 intein already diverged in the Late Proterozoic (i.e., before 540 million years BP). Could a functioning HE survive within a species over several hundred million years without interspecies transfer?
We discuss several possible explanations, not all complementary, for these findings:
(A) Sex and genetic exchange occur more frequently than assumed. Rare events of genetic exchange that bring HE containing alleles together with intron/intein free alleles might be sufficient to maintain the HE under purifying selection.
(B) The HE might have a function beneficial to the host. In case of Bacillus phages, introns with HE were reported to increase the fitness of their respective host by destroying the genomes of competing phages, containing a different HE, that co-infected the same host . However, selection at the gene and the population level are interwoven tightly in this case and an alternative explanation considering only the gene's selfishness and competition of different HEs for target sites was proposed . Many self splicing introns encode maturases that assist in the splicing reaction catalyzed by the intron , and some of these evolved from homing endonucleases . Again selection at the different levels is interwoven, the intron now relies on the HE not only for spreading within the population, it also requires the HE for splicing. Without the maturase activity, the intron splices less effectively and thus the absence of the maturase/HE is detrimental to the host. The maturase is under purifying selection, as long as the intron is present. This sequence of events illustrates a neutral pathway to complexity : The organism harboring the self-splicing intron is no better off than without the intron, but it now requires a more complex machinery to catalyze the splicing reaction, and the parts required for splicing now are under purifying selection. However, as long as the maturase/HE only acts on the encoding intron, the homing cycle could continue with the simultaneous deletion of the intron and the encoded maturase/HE. In Saccharomyces cerevisiae an endonuclease that triggers recombination events leading to mating type changes is a homolog of the HE containing intein in the vacuolar ATPase catalytic subunit [3, 26, 27]; and a genome rearrangement function was also suggested in Thermococcus kodakaraensis . The acquisition of a new function that allowed the HE to exit from the homing cycle also occurred in plants and in soft corals where a mitochondrial mutS homolog fused twice independently with different HE types .
(C) The homing endonuclease might be maintained by balancing selection. Butler et al  suggested that the intein might decrease the fitness of the host organism. The long term survival of the functioning HE might result from the active invasion due to conversion of intein free alleles, balanced by the decreased propagation of the infected hosts. This balance could be achieved without the homing cycle operating in the population.
(D) The homing cycle might operate in subpopulations only. A low homing frequency might be sufficient to provide purifying selection for the HE function, but insufficient to fix the parasite in the whole population simultaneously. The process of fixation might be further delayed by a complex population structure. Decay and loss of the parasites' fitness already might occur in one subpopulation, whereas other subpopulations might be only at the beginning of the invasion phase. The resulting situation is comparable to waves in an excitable medium that run in circles, as in a fibrillating heart-muscle (re-entry ventricular fibrillation[30, 31]). In the spatial model the homing cycle operates on smaller sections of the population; the population or the species as a whole would be out of synchrony. Reinvasion therefore can occur from within the species or population.
Hypothesis A is compatible with the other three scenarios, and the presence of a HE containing parasitic gene already suggests a low level of gene flow occurring in these presumably clonal organisms. It will be interesting to learn how the homing process in these organisms begins – in the best studied example, the vma-1 intein in Saccharomyces cerevisiae [32–34], homing only occurs during meiosis [35, 36]. Hypothesis B might explain the persistence of a class I intron, if the encoded HE had acquired a maturase function  acting in trans on other self-splicing introns; however, under this assumption it remains puzzling that most Naegleria isolates lost the intron. The same argument applies to the PRP8 intein: The acquisition of a new function is at odds with the presence of several PRP8 genes in euascomycetes that lost either the HE domain or the whole intein altogether. If the HE acquired a function beneficial to the host, why was it lost repeatedly during evolution?
Hypothesis C is unlikely to be a sufficient explanation on its own. It would require an exact balance between forces that increase and those that decrease the HE frequency. A slight shift away from the equilibrium would mean either fixation or extinction for the HE containing parasite. However, an approximate balance between decreased fitness of the host and active spreading of the parasitic element, combined with a complex population structure could lead to long persistence times without the homing cycle operating, especially when the selective disadvantage to the carrier and the mobility of the element were not the same in different subpopulations. This hypothesis on the surface appears similar to hypothesis D; however, in the absence of the homing cycle the empty target sites would not result from decay and loss of the parasitic element, but from the faster growth rate of the organisms that never possessed the element.
The asynchronous homing cycle (hypothesis D) is a likely contributor to the long term persistence of functional HE in a single species. To corroborate this hypothesis, and to differentiate it from hypothesis C, more detailed population studies need to be performed, including sampling of isolated subpopulations with limited migration between them, and thorough sampling of well mixed isolated populations. In addition, a sampling of related species is also necessary since the detection of interspecies transfers depends on a sufficient sampling of taxa. At present it cannot be excluded that some interspecies transfer of the HE went undetected.
The unexpected complexity in the life cycle of inteins reported by Butler and colleagues reinforces the need for detailed epidemiological studies of HE containing parasitic genes. These studies should be combined with attempts to better understand the biochemical and physiological regulation of the parasitic genes, with measurements of their effect on the fitness of their host, with determination of the transmission efficiency of the parasite, and with phylogenetic analyses that might reveal the frequency of HE loss form the parasitic gene. In case of introns the study of the epidemiological dynamics can be further complicated by several factors (see  for a recent review): HE genes and self-splicing introns can be considered independent parasites and a HE can associate with different introns that provide integration sites that are selectively neutral; self-splicing introns are frequently found in ribosomal RNA encoding genes that are present in multiple copies per genome; in addition to using HEs, introns can also be mobile through reverse splicing; and the HE function might also assist the splicing reaction. The epidemiological dynamics of the HE containing parasitic genes are complex, and the determination of the relevant parameters will require collaborative efforts by molecular biologists, microbial ecologists, and epidemiologists. However, these studies will be worthwhile, because they provide a chance to untangle the interactions between the selective forces acting at the molecular and the organismal levels, and they also will allow detection of low rates of gene flow between populations and possibly between species. These studies thus will constitute important steps towards a more detailed genome 'ecology' .
Dawkins R: The Selfish Gene. 1976, Oxford University Press
Liu XQ: Protein-splicing intein: Genetic mobility, origin, and evolution. Annu Rev Genet. 2000, 34: 61-76. 10.1146/annurev.genet.34.1.61.
Gogarten JP, Senejani AG, Zhaxybayeva O, Olendzenski L, Hilario E: Inteins: structure, function, and evolution. Annu Rev Microbiol. 2002, 56: 263-287. 10.1146/annurev.micro.56.012302.160741.
Pietrokovski S: Intein spread and extinction in evolution. Trends Genet. 2001, 17 (8): 465-472. 10.1016/S0168-9525(01)02365-4.
Derbyshire V, Belfort M: Lightning strikes twice: intron-intein coincidence [comment]. Proc Natl Acad Sci U S A. 1998, 95 (4): 1356-1357. 10.1073/pnas.95.4.1356.
Anraku Y, Mizutani R, Satow Y: Protein splicing: its discovery and structural insight into novel chemical mechanisms. IUBMB Life. 2005, 57 (8): 563-574.
Perler FB: InBase: the Intein Database. Nucleic Acids Res. 2002, 30 (1): 383-384. 10.1093/nar/30.1.383.
Gimble FS: Invasion of a multitude of genetic niches by mobile endonuclease genes. FEMS Microbiol Lett. 2000, 185 (2): 99-107. 10.1111/j.1574-6968.2000.tb09046.x.
Stoddard BL: Homing endonuclease structure and function. Q Rev Biophys. 2005, 38 (1): 49-95. 10.1017/S0033583505004063.
Jurica MS, Stoddard BL: Homing endonucleases: structure, function and evolution. Cell Mol Life Sci. 1999, 55 (10): 1304-1326. 10.1007/s000180050372.
Taylor J, Jacobson D, Fisher M: THE EVOLUTION OF ASEXUAL FUNGI: Reproduction, Speciation and Classification. Annu Rev Phytopathol. 1999, 37: 197-246. 10.1146/annurev.phyto.37.1.197.
De Jonckheere JF: A century of research on the amoebaflagellate genus Naegleria. Acta Protozool. 2002, 41: 309 -3342.
Goddard MR, Burt A: Recurrent invasion and extinction of a selfish gene. Proc Natl Acad Sci U S A. 1999, 96 (24): 13880-13885. 10.1073/pnas.96.24.13880.
Koufopanou V, Goddard MR, Burt A: Adaptation for Horizontal Transfer in a Homing Endonuclease. Mol Biol Evol. 2002, 19 (3): 239-246.
Palmer JD, Adams KL, Cho Y, Parkinson CL, Qiu YL, Song K: Dynamic evolution of plant mitochondrial genomes: mobile genes and introns and highly variable mutation rates. Proc Natl Acad Sci U S A. 2000, 97 (13): 6960-6966. 10.1073/pnas.97.13.6960.
Cho Y, Palmer JD: Multiple acquisitions via horizontal transfer of a group I intron in the mitochondrial cox1 gene during evolution of the Araceae family. Mol Biol Evol. 1999, 16 (9): 1155-1165.
Nesbo CL, Doolittle WF: Active self-splicing group I introns in 23S rRNA genes of hyperthermophilic bacteria, derived from introns in eukaryotic organelles. Proc Natl Acad Sci U S A. 2003, 100 (19): 10806-10811. 10.1073/pnas.1434268100.
Wikmark OG, Einvik C, De Jonckheere JF, Johansen SD: Short-term sequence evolution and vertical inheritance of the Naegleria twin-ribozyme group I intron. BMC Evol Biol. 2006, 6: 39-10.1186/1471-2148-6-39.
Butler MI, Gray J, Goodwin TJ, Poulter RT: The distribution and evolutionary history of the PRP8 intein. BMC Evol Biol. 2006, 6 (1): 42-10.1186/1471-2148-6-42.
Padovan AC, Sanson GF, Brunstein A, Briones MR: Fungi evolution revisited: application of the penalized likelihood method to a Bayesian fungal phylogeny provides a new perspective on phylogenetic relationships and divergence dates of Ascomycota groups. J Mol Evol. 2005, 60 (6): 726-735. 10.1007/s00239-004-0164-y.
Goodrich-Blair H, Shub DA: Beyond homing: competition between intron endonucleases confers a selective advantage on flanking genetic markers. Cell. 1996, 84 (2): 211-221. 10.1016/S0092-8674(00)80976-9.
Edgell DR, Fast NM, Doolittle WF: Selfish DNA: the best defense is a good offense. Curr Biol. 1996, 6 (4): 385-388. 10.1016/S0960-9822(02)00502-X.
Lambowitz AM, Belfort M: Introns as mobile genetic elements. Annu Rev Biochem. 1993, 62: 587-622. 10.1146/annurev.bi.62.070193.003103.
Belfort M: Two for the price of one: a bifunctional intron-encoded DNA endonuclease-RNA maturase. Genes Dev. 2003, 17 (23): 2860-2863. 10.1101/gad.1162503.
Stoltzfus A: On the possibility of constructive neutral evolution. J Mol Evol. 1999, 49 (2): 169-181. 10.1007/PL00006540.
Kostriken R, Strathern JN, Klar AJ, Hicks JB, Heffron F: A site-specific endonuclease essential for mating-type switching in Saccharomyces cerevisiae. Cell. 1983, 35 (1): 167-174. 10.1016/0092-8674(83)90219-2.
Perler FB, Olsen GJ, Adam E: Compilation and analysis of intein sequences. Nucleic Acids Res. 1997, 25 (6): 1087-1093. 10.1093/nar/25.6.1087.
Nishioka M, Fujiwara S, Takagi M, Imanaka T: Characterization of two intein homing endonucleases encoded in the DNA polymerase gene of Pyrococcus kodakaraensis strain KOD1. Nucleic Acids Res. 1998, 26 (19): 4409-4412. 10.1093/nar/26.19.4409.
Abdelnoor RV, Christensen AC, Mohammed S, Munoz-Castillo B, Moriyama H, Mackenzie SA: Mitochondrial genome dynamics in plants and animals: convergent gene fusions of a MutS homologue. J Mol Evol. 2006, 63 (2): 165-173. 10.1007/s00239-005-0226-9.
Xie F, Qu Z, Yang J, Baher A, Weiss JN, Garfinkel A: A simulation study of the effects of cardiac anatomy in ventricular fibrillation. J Clin Invest. 2004, 113 (5): 686-693. 10.1172/JCI200417341.
Mines GR: On dynamic equilibrium in the heart. J Physiol. 1913, 46: 349–383 -
Hirata R, Ohsumk Y, Nakano A, Kawasaki H, Suzuki K, Anraku Y: Molecular structure of a gene, VMA1, encoding the catalytic subunit of H(+)-translocating adenosine triphosphatase from vacuolar membranes of Saccharomyces cerevisiae. J Biol Chem. 1990, 265 (12): 6726-6733.
Moure CM, Gimble FS, Quiocho FA: Crystal structure of the intein homing endonuclease PI-SceI bound to its recognition sequence. Nat Struct Biol. 2002, 9 (10): 764-770. 10.1038/nsb840.
Posey KL, Koufopanou V, Burt A, Gimble FS: Evolution of divergent DNA recognition specificities in VDE homing endonucleases from two yeast species. Nucleic Acids Res. 2004, 32 (13): 3947-3956. 10.1093/nar/gkh734.
Gimble FS, Thorner J: Homing of a DNA endonuclease gene by meiotic gene conversion in Saccharomyces cerevisiae. Nature. 1992, 357: 301-305. 10.1038/357301a0.
Gimble FS: Degeneration of a homing endonuclease and its target sequence in a wild yeast strain. Nucleic Acids Res. 2001, 29 (20): 4215-4223. 10.1093/nar/29.20.4215.
Haugen P, Simon DM, Bhattacharya D: The natural history of group I introns. Trends Genet. 2005, 21: 111 -1119. 10.1016/j.tig.2004.12.007.
Avise JC: Evolving genomic metaphors: a new look at the language of DNA. Science. 2001, 294 (5540): 86-87. 10.1126/science.294.5540.86.
Derr LK, Strathern JN: A role for reverse transcripts in gene conversion. Nature. 1993, 361 (6408): 170-173. 10.1038/361170a0.
Jeffares DC, Mourier T, Penny D: The biology of intron gain and loss. Trends Genet. 2006, 22 (1): 16-22. 10.1016/j.tig.2005.10.006.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Gogarten, J.P., Hilario, E. Inteins, introns, and homing endonucleases: recent revelations about the life cycle of parasitic genetic elements. BMC Evol Biol 6, 94 (2006). https://doi.org/10.1186/1471-2148-6-94