Skip to main content
  • Research article
  • Open access
  • Published:

A high density of ancient spliceosomal introns in oxymonad excavates



Certain eukaryotic genomes, such as those of the amitochondriate parasites Giardia and Trichomonas, have very low intron densities, so low that canonical spliceosomal introns have only recently been discovered through genome sequencing. These organisms were formerly thought to be ancient eukaryotes that diverged before introns originated, or at least became common. Now however, they are thought to be members of a supergroup known as excavates, whose members generally appear to have low densities of canonical introns. Here we have used environmental expressed sequence tag (EST) sequencing to identify 17 genes from the uncultivable oxymonad Streblomastix strix, to survey intron densities in this most poorly studied excavate group.


We find that Streblomastix genes contain an unexpectedly high intron density of about 1.1 introns per gene. Moreover, over 50% of these are at positions shared between a broad spectrum of eukaryotes, suggesting theyare very ancient introns, potentially present in the last common ancestor of eukaryotes.


The Streblomastix data show that the genome of the ancestor of excavates likely contained many introns and the subsequent evolution of introns has proceeded very differently in different excavate lineages: in Streblomastix there has been much stasis while in Trichomonas and Giardia most introns have been lost.


One of the prominent features that distinguishes eukaryotic genomes from those of prokaryotes is the presence of spliceosomal introns. Introns are intervening sequences that are removed from expressed RNAs, in the case of spliceosomal introns through a series of transesterfications mediated by a large riboprotein complex called the spliceosome [1]. Spliceosomal introns are only known from eukaryotic nuclear genomes, and were the subject of intense controversy over their potential role in early gene origins and evolution, the so-called introns early versus late debate [24]. One of the interesting features of intron evolution that came to light during this debate was the large range in intron density. At one extreme, introns appeared to be lacking in several protist lineages that were, at the time, thought to be the earliest-branching eukaryotes. These lineages included diplomonads (e.g., Giardia) and parabasalia (e.g. Trichomonas).

The early-branching status of these organisms has since been undermined by a variety of data, and now diplomonads and parabasalia are thought to be part of a large assemblage of protists called excavates, which also includes trypanosomes, euglenids, and a number of parasitic and free living flagellate or amoeboflagellate lineages [5]. However, despite the accumulation of a considerable quantity of molecular data from both Giardia and Trichomonas, as well as the identification of proteins involving splicing in Trichomonas [6], evidence for introns in their genomes remained intriguingly elusive. Indeed, only recently were introns finally characterized in these organisms [79], and remain extremely rare. Only three introns have been found in G. intestinalis among thousands of known genes [8, 9] and forty-one introns were identified in the T. vaginalis genome after exhaustive searches [7]. Information from excavates other than Trichomonas and Giardia is scarce, but overall there seems to be a generally low density of introns (with the possible exception of Jakobid flagellates based on one family of genes[10]). Moreover, other instances of non-canonical introns and splicing are known in excavates [1113], as are systems where splicing machinery is put to a slightly different use such as trans-splicing [1416].

One of the excavate groups about which we know very little are the oxymonads. Oxymonads are anaerobic flagellates found almost exclusively in association with animals, many in the guts of termites and wood-eating roaches [17]. This is the only group of amitochondriates for which secondary loss of mitochondria has not been yet demonstrated, but they are closely related to the flagellate Trimastix, which has a vestigial organelle, so a primary lack of mitochondria in oxymonads is unlikely. Mostoxymonads are not available in culture because they live in complex communities with other protists and prokaryotes. As a result, there are few molecular data available from any oxymonad, and no introns have been identified [18]. The oxymonad Streblomastix strix is asymbiont of the dampwood termite Zootermopsis angusticollis from North American Pacific coastal region. This species has a number of unusual morphological characters, including a peculiar long slender cell shape with deep longitudinal vanes which is apparently maintained by intimate association with epibiotic bacteria [19], So far, many copies of four genes (alpha-tubulin, beta-tubulin, HSP90, and elongation factor-1 alpha) have been characterized from S. strix [18], and the complete absence of introns from all sequences (a total of 19,888 bp) suggests the oxymonads might share low intron densities apparently common to excavates. Here, we have used the recent documentation of a rare non-canonical genetic code in Streblomastix [18] to identify 17 oxymonad genes from an environmental expressed sequence tag (EST) pool from the hindgut of Zootermopsis. The genomic DNA sequence for each mRNA was determined and we found that, in contrast to other amitochondriate protists and the limited data previously available for Streblomastix, a relatively high density of canonical spliceosomal introns. Moreover, a large proportion of these introns are shared in position with other distantly related eukaryotes, suggesting that they are ancient intron positions retained in oxymonads but lost in other excavates such as Giardia and Trichomonas.

Results and discussion

Identification of oxymonad sequences from ESTs

A total of 5,337 ESTs from a Z. angusticollis termite hindgut cDNAlibrary were sequenced and found to form 2,595 clusters of unique sequences. Overall, the sample was dominated by sequences of parabasalian origin (transcripts encoding parabasalian actin and actin-related proteins alone represented 32% of all ESTs). Moreover, there are few oxymonad sequences known outside this sample, so Streblomastix cDNAs could not be identified based on similarity to known genes (only 2 ESTs, corresponding to known Streblomastix alpha- and beta-tubulin sequences, were identified by BLASTX searches). Accordingly, we used the presence of a rare non-canonical genetic code in Streblomastix as a filter to identify at least those genes where non-canonical codons were sampled. In Streblomastix, TAA and TAG encode glutamine (Q) rather than stop as in the universal code [18], so all clusters were compared to public databases using BLASTX and examined individually for in frame stop codons, in particular at positions normally encoding glutamine. No other protist known to exist in Z. angusticollis has been shown to possess a non-canonical genetic code. The other prominent protists in this insect are parabasalia, which are not known to deviate from the universal genetic code and whose sequences are also easy to identify with BLASTX searches given their high similarity with T. vaginalis genomic sequences.

Using the non-canonical code as a filter, we were able to identify 17 protein-coding genes (Table 1), representing a major increase in the available sequence data from oxymonads. Formerly partial sequences of 4 protein coding genes were known from Streblomastix, and a handful of cDNAs were known from other species [18, 20, 21]. From this sample we recovered 8 complete protein-coding genes, an additional 5 genes missing only 1 to 30 codons at the N-terminus, and another three lacking from 100 to 160 codons at the N-terminus. In addition, a short fragment encoding 258 codons of the large protein UPF1 was severely truncated, but we failed to obtain more sequence. Complete or near-complete sequences included five ribosomal proteins (RPS7 and 9, RPL4, 18 and 21), alpha- and beta-tubulin, the nuclear transporter Ntf2, cyclophilin, a peptidyl-isomerase involved in assisting protein folding, and NAD-dependent glutamate dehydrogenase. Also, two versions of the cystein-protease Cathepsin B were obtained. Although related, these sequences exhibited several differences at the amino acid level, so they are likely to represent multiples copies of the gene. We also identified two copies of the carbon metabolism enzyme pyruvate phosphate dikinase (PPDK), the functional and evolutionary significance of which are discussed elsewhere [22]. One conserved hypothetical protein was also found to use the Streblomastix genetic code. This protein has homologues in diverse eukaryotes (e.g. Arabidopsis thaliana AAM67532), buthas no assigned function. UPF1 is a key member of nonsense-mediated decay (NMD). This protein may be of interest in Streblomastix because it is involved in a mechanism of mRNA surveillance devoted to eliminating defective transcripts, such as those carrying premature stop codons [23]. NMD has been described and studied in animals and yeasts, but not yet found in protists [24]. The presence of UPF1 in Streblomastix suggests NMD is used by oxymonads, and in organisms where stop codons are reassigned to encode amino acids. Finally, UAP56 is a member of the DEAD box family of RNA helicases that is associated with the spliceosome and intervenes in early steps of pre-mRNA splicing in mammals and yeasts, but is also linked to mRNA export [25, 26]. Even in the absence of introns, the presence of UAP56 indicates the likely presence of the spliceosome in oxymonads, and therefore by extension introns as well.

Table 1 S. strix genes identified in this study. Streblomastix genes recovered from the Z. angusticollis hindgut RNA sample. For incomplete sequences, the number of missing amino acids were estimated from homologues from Giardia and/or Trichomonas. UPF1 shows extensive size variation among eukaryotic lineages (between 800 and 1600 amino acids, approximately), so it is difficult to determine how much sequence this fragment is lacking. ND: not determined.

Introns in Streblomastix genes

Genomic DNA sequences were obtained for all Streblomastix coding regions identified from the cDNA library. Despite the fact that many alleles and loci representing four proteins were previously found to contain no introns, we found that most of the genes encoding these transcripts were interrupted by introns. In total, we found 21 introns in our sample of 17 genes with genes having as many as 5 introns (Table 1). Including previously known intronless EF-1 alpha and HSP90 genes (alpha and beta tubulin are included in our sample) [18], the overall density is 1.1 introns per gene. However this is likely to be an underestimation since some of our sequences are truncated and could contain further introns, and there is a bias favouring genes that are more often intronless (e.g. HSP90). This density is less than that observed in the relatively intron-rich mammals and plants, but comparable to many other eukaryotic genomes, and certainly much higher than Giardia and Trichomonas where only 3 and 41 introns have been detected despite very large quantities of genomic data [7, 9].

Overall, the Streblomastix introns were found to exhibit characteristics typical of eukaryotic spliceosomal introns. Introns ranged from 46 to 229 bases (Table 2), but most were between 60 and 100 bases long, and the AT content was markedly higher than that of the coding sequence (Table 2). Spliceosomal introns are flanked by GT and AG dinucleotides in the vast majority of known introns, while about 0.1% are U12 AT-AC introns[27] and a very small proportion of known introns use other non-canonical splice boundaries. Interestingly, however, the first of only three introns from G. intestinalis to be discovered has CU-AG boundaries [8]. Of the twenty-one introns from Streblomastix, 20 featured canonical GT-AG boundaries, but one intron in rps9 was flanked by AC-AG splice sites. However, the Streblomastix intron is located very close to the start of the transcript, so we cannot exclude the possibility that this intron sequence is incomplete and a canonical boundary lies upstream.

Table 2 Basic features of the S. strix introns. Characteristics of 21 introns found in 17 Streblomastix genes analysed. Size and base composition are shown. GC% mRNA shows base composition of the coding sequence (excluding introns).

We also inspected intron sequences to look for conserved features that may correspond to functional motifs. Although signals important for intron recognition and removal are not very well understood, some have been studied in certain detail, especially in mammals and yeasts. The branch-point is a sequence element required for lariat formation during splicing [28]. The mammalian branch point consensus sequence has been determined to be CURAY, where the A corresponds to the actual branching point. In yeast, the branch point sequence is more strictly defined as UACUAAC [29]. The plant branch point appears to be similar to that of mammals [30]. In all cases, the branch point is located near the 3' splice site, but the exact location varies. In contrast, the putative branch point found in the three introns of Giardia (ACURAC) is located directly adjacent to the 3' splice site [9]. Likewise, the potential branch points in Trichomonas are invariably ACUAAC and are also adjacent to the 3' splice site [7]. The apparently strict requirement for proximity between the branch point and 3' splice site is rare in metazoa and yeast, but common to Trichomonas and Giardia. This led to the suggestion that the branch point and 3' splice site recognition could be combined in these species [7]. Aligning the regions around the 5' splice site of all Streblomastix introns (Figure 1) reveals highly conserved A, U and G residues at positions +3 to +5, respectively. This is in good agreement with the first 5 positions of the yeast 5' splice site (typically GUAUGU), suggesting that interaction with U1 snRNA is conserved. At the 3' splice site no branch point motifs like those of Giardia or Trichomonas were observed, although the -1 position (adjacent to the AG dinucleotide) was invariably a pyrimidine and the region is T-rich. Overall, branch point specification in Streblomastix introns is probably different from that of Giardia or Trichomonas. Under the assumption that these lineages are related it is possible that the peculiar features observed in Giardia and Trichomonas may be a consequence of secondary implification in their spliceosomal apparatus.

Figure 1
figure 1

Examples of conserved intron positions between Streblomastix and other eukaryotes. In each case a section of the gene is shown aligned at the amino acid level, and the position of the intron found in all aligned sequences is indicated above by a triangle with a number indicating the phase (0, 1, or 2). Aligned sequences are from three unikont groups, animals (H. sapiens and P. troglodytes), fungi (S. pombe, U. maydis and A. fumigatus), and slime molds (D. discoideum), from one chromalveolate group, the ciliate (P. tetraurelia), and from three plantae groups, land plants (A. thaliana), green algae (Bigelowiella natans nucleomorph), and red algae (Guillardia thet a nucleomorph).

Conservation of intron positions in oxymonad genes

Streblomastix intron-containing genes were compared to homologues from other eukaryotes, and surprisingly more than half of the Streblomastix intron positions were shared with members of at least two different eukaryotic supergroups, unikonts and plants (where the best sampling of intron-containing genes exists), and in one case also with a chromalveolate (for six examples, see Figure 2). This suggests these are relatively ancient introns and perhaps date back to the last common ancestor of all eukaryotes. This degree of conservation is high, taking into account data such as those of Rogozin et al., who calculated that approximately 20% of the introns in Plasmodium are shared by at least one of the other genomes analysed (Human, Anopheles, Drosophila, Caenorhabditis, Arabidopsis, Schizosaccharomyces and Saccharomyces), and that 25% of the human introns are shared by Arabidopsis [31]. It is also possible that shared intron positions are due to independent gains, but it is very unlikely that the observed level of shared positions (about 50%) resulted from parallel gains, in particular in the many cases where the intron is found in several of the major lineages of eukaryotes. Whether intron gains or losses predominate in eukaryotic evolution is still a subject of controversy. Recently, several studies using different analytic approaches and datasets addressed this question with varied results, but in all cases, they show that ancestral conservation accounts for the large majority of shared positions [3236]. The degree of conservation observed in Streblomastix intron positions suggests two things. First, it suggests that the ancestor of excavates was relatively intron rich and retained a large number of ancient introns, many of which were subsequently lost in the genomes where we have the most information, such as trypanosomes, Giardia and Trichomonas. This assumes the relationship between oxymonads and other hypothesized excavates is correct, but this is not certain and oxymonads lack the morphological trait used to define excavates (the ventral groove). However, other ultrastructural characters [37] as well as molecular phylogenies have shown a close affiliation between oxymonads and Trimastix [38], a free living flagellate that does have excavate characteristics [5, 39]. Multi-gene phylogenies also lend additional support for a common origin of the lineages leading to oxymonads, diplomonads and parabasalia [21]. The second implication of this data is that intron gain and loss have taken place very slowly in the lineage leading up to Streblomastix: if intron turnover were rapid, then we would expect a low proportion of ancient introns to remain unless ancient intron positions were under some selection to be retained. While this is probably true in a few individual cases where introns have acquired some function in the control of gene expression, there is presently no evidence either for or against this as a common feature of ancient introns. None of these shared introns are known from either Giardia or Trichomonas, so any potential function is clearly dispensable, although it is interesting to note that the rps9 intron 1 has been retained by the G. theta nucleomorph, which is very intron poor, having kept a total of only 17 introns [40].

Figure 2
figure 2

Sequence logos showing conservation at intron borders. Top: 5’ splice site (position 1) and surrounding sequence. Bottom: 3’ splice site (-1) and surrounding sequence. Logos were made using Weblogo (


The present sampling of protein-coding gene sequences from Streblomastix suggests that oxymonad genomes contain a relatively large number of canonical splicesomal introns, many of which are at ancient conserved positions. This is in contrast to the better studied excavate genomes such as those of kinetoplastids, Giardia and Trichomonas where canonical spliceosomal introns are either rare or have been co-opted in specific ways, such as the spliced leaders in euglenozoa. The fact that many Streblomastix introns are ancient shows that the genome of the ancestor of these organisms, and indeed probably all extant eukaryotes, contained many introns and that the intron-poor state found in Giardia and Trichomonas is more likely independently derived.


cDNA library construction and EST sequencing

Termites were collected from a rotten log in Point Grey, Vancouver, Canada. The whole hindgut content of about 60 individuals of Zootermopsis angusticollis from a single colony was collected and total RNA was extracted using TRIZOL (Invitrogen). A directionally cloned cDNA library was constructed (Amplicon Express) and 5,337 clones were sequenced from the 5' end. ESTs were trimmed for vector and quality, and assembled into clusters by PEPdb

Identification and genomic characterisation of Streblomastix genes

Streblomastix sequences were recovered from EST data by identifying protein coding sequences containing in-frame TAA and TAG stop codons. Putatively stop-coding containing mRNAs were re-sequenced in both strands. In cases where cDNA clones were truncated, the sequences were extended by means of 3' and 5' RACE (Ambion) using total termite hindgut RNA. The genomic sequence for each mRNA was amplified using specific primers corresponding to the ends of each complete or partial cDNA and PCR-amplified using genomic DNA purified from the termite hindgut content. All PCR products were cloned using TOPO and sequenced both strands. Accession numbers for new sequences are [genbankDQ363664, genbankDQ363665, genbankDQ363666, genbankDQ363667, genbankDQ363668, genbankDQ363669, genbankDQ363670, genbankDQ363671, genbankDQ363672, genbankDQ363673, genbankDQ363674, genbankDQ363675, genbankDQ363676, genbankDQ363677, genbankDQ363678, genbankDQ363679].


  1. Maniatis T, Reed R: The role of small nuclear ribonucleoprotein particles in pre-mRNA splicing. Nature. 1987, 325 (6106): 673-678. 10.1038/325673a0.

    Article  CAS  PubMed  Google Scholar 

  2. Gilbert W: Why genes in pieces?. Nature. 1978, 271 (5645): 501-10.1038/271501a0.

    Article  CAS  PubMed  Google Scholar 

  3. Logsdon JM: The recent origins of spliceosomal introns revisited. Curr Opin Genet Dev. 1998, 8 (6): 637-648. 10.1016/S0959-437X(98)80031-2.

    Article  CAS  PubMed  Google Scholar 

  4. Zhaxybayeva O, Gogarten JP: Spliceosomal introns: new insights into their evolution. Curr Biol. 2003, 13 (19): R764-766. 10.1016/j.cub.2003.09.017.

    Article  CAS  PubMed  Google Scholar 

  5. Simpson AG: Cytoskeletal organization, phylogenetic affinities and systematics in the contentious taxon Excavata (Eukaryota). Int J Syst Evol Microbiol. 2003, 53 (Pt 6): 1759-1777. 10.1099/ijs.0.02578-0.

    Article  PubMed  Google Scholar 

  6. Fast NM, Doolittle WF: Trichomonas vaginalis possesses a gene encoding the essential spliceosomal component, PRP8. Mol Biochem Parasitol. 1999, 99 (2): 275-278. 10.1016/S0166-6851(99)00017-1.

    Article  CAS  PubMed  Google Scholar 

  7. Vanacova S, Yan W, Carlton JM, Johnson PJ: Spliceosomal introns in the deep-branching eukaryote Trichomonas vaginalis. Proc Natl Acad Sci U S A. 2005, 102 (12): 4430-4435. 10.1073/pnas.0407500102.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  8. Nixon JE, Wang A, Morrison HG, McArthur AG, Sogin ML, Loftus BJ, Samuelson J: A spliceosomal intron in Giardia lamblia. Proc Natl Acad Sci U S A. 2002, 99 (6): 3701-3705. 10.1073/pnas.042700299.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  9. Russell AG, Shutt TE, Watkins RF, Gray MW: An ancient spliceosomal intron in the ribosomal protein L7a gene (Rpl7a) of Giardia lamblia. BMC Evol Biol. 2005, 5: 45-10.1186/1471-2148-5-45.

    Article  PubMed Central  PubMed  Google Scholar 

  10. Archibald JM, O'Kelly CJ, Doolittle WF: The chaperonin genes ofjakobid and jakobid-like flagellates: implications for eukaryotic evolution. Mol Biol Evol. 2002, 19 (4): 422-431.

    Article  CAS  PubMed  Google Scholar 

  11. Muchhal US, Schwartzbach SD: Characterization of the unique intron-exon junctions of Euglena gene(s) encoding the polyprotein precursorto the light-harvesting chlorophyll a/b binding protein of photosystem II. Nucleic Acids Res. 1994, 22 (25): 5737-5744.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  12. Breckenridge DG, Watanabe Y, Greenwood SJ, Gray MW, Schnare MN: U1 small nuclear RNA and spliceosomal introns in Euglena gracilis. Proc Natl Acad Sci U S A. 1999, 96 (3): 852-856. 10.1073/pnas.96.3.852.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  13. Canaday J, Tessier LH, Imbault P, Paulus F: Analysis of Euglenagracilis alpha-, beta- and gamma-tubulin genes: introns and pre-mRNA maturation. Mol Genet Genomics. 2001, 265 (1): 153-160. 10.1007/s004380000403.

    Article  CAS  PubMed  Google Scholar 

  14. Muchhal US, Schwartzbach SD: Characterization of a Euglena geneencoding a polyprotein precursor to the light-harvesting chlorophyll a/b-binding protein of photosystem II. Plant Mol Biol. 1992, 18 (2): 287-299. 10.1007/BF00034956.

    Article  CAS  PubMed  Google Scholar 

  15. Tessier LH, Chan RL, Keller M, Weil JH, Imbault P: The Euglena gracilis rbcS gene contains introns with unusual borders. FEBS Lett. 1992, 304 (2–3): 252-255. 10.1016/0014-5793(92)80631-P.

    Article  CAS  PubMed  Google Scholar 

  16. Tessier LH, Paulus F, Keller M, Vial C, Imbault P: Structure and expression of Euglena gracilis nuclear rbcS genes encoding the small subunits of the ribulose 1, 5-bisphosphate carboxylase/oxygenase: a novel splicing process for unusual intervening sequences?. J Mol Biol. 1995, 245 (1): 22-33.

    Article  CAS  PubMed  Google Scholar 

  17. Brugerolle G, Lee JJ: Order Oxymonadida. The illustrated guide to the protozoa. Edited by: Lee JJ, Leedale GF, Bradbury P. 2000, Lawrence, KA: Society of Protozoologists, 1186-1195. 2

    Google Scholar 

  18. Keeling PJ, Leander BS: Characterisation of a non-canonical genetic code in the oxymonad Streblomastix strix. J Mol Biol. 2003, 326 (5): 1337-1349. 10.1016/S0022-2836(03)00057-3.

    Article  CAS  PubMed  Google Scholar 

  19. Leander BS, Keeling PJ: Symbiotic innovation in the oxymonad Streblomastix strix. J Eukaryot Microbiol. 2004, 51 (3): 291-300. 10.1111/j.1550-7408.2004.tb00569.x.

    Article  PubMed  Google Scholar 

  20. Moriya S, Tanaka K, Ohkuma M, Sugano S, Kudo T: Diversificationof the microtubule system in the early stage of eukaryote evolution: elongation factor 1 alpha and alpha-tubulin protein phylogeny of termite symbiotic oxymonad and hypermastigote protists. J Mol Evol. 2001, 52 (1): 6-16.

    Article  CAS  PubMed  Google Scholar 

  21. Hampl V, Horner DS, Dyal P, Kulda J, Flegr J, Foster PG, Embley TM: Inference of the phylogenetic position of oxymonads based on nine genes: support for metamonada and excavata. Mol Biol Evol. 2005, 22 (12): 2508-2518. 10.1093/molbev/msi245.

    Article  CAS  PubMed  Google Scholar 

  22. Slamovits CH, Keeling PJ: Pyruvate-phosphate dikinase of oxymonads and parabasalia and the evolution of pyrophosphate-dependent glycolysis in anaerobic eukaryotes. Eukaryot Cell. 2006, 5 (1): 148-154. 10.1128/EC.5.1.148-154.2006.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  23. Baker KE, Parker R: Nonsense-mediated mRNA decay: terminating erroneous gene expression. Curr Opin Cell Biol. 2004, 16 (3): 293-299. 10.1016/

    Article  CAS  PubMed  Google Scholar 

  24. Culbertson MR, Leeds PF: Looking at mRNA decay pathways throughthe window of molecular evolution. Curr Opin Genet Dev. 2003, 13 (2): 207-214. 10.1016/S0959-437X(03)00014-5.

    Article  CAS  PubMed  Google Scholar 

  25. Jensen TH, Boulay J, Rosbash M, Libri D: The DECD box putative ATPase Sub2p is an early mRNA export factor. Curr Biol. 2001, 11 (21): 1711-1715. 10.1016/S0960-9822(01)00529-2.

    Article  CAS  PubMed  Google Scholar 

  26. Linder P, Stutz F: mRNA export: travelling with DEAD box proteins. Curr Biol. 2001, 11 (23): R961-963. 10.1016/S0960-9822(01)00574-7.

    Article  CAS  PubMed  Google Scholar 

  27. Patel AA, Steitz JA: Splicing double: insights from the second spliceosome. Nat Rev Mol Cell Biol. 2003, 4 (12): 960-970. 10.1038/nrm1259.

    Article  CAS  PubMed  Google Scholar 

  28. Reed R, Maniatis T: The role of the mammalian branchpoint sequence in pre-mRNA splicing. Genes Dev. 1988, 2 (10): 1268-1276.

    Article  CAS  PubMed  Google Scholar 

  29. Lin RJ, Newman AJ, Cheng SC, Abelson J: Yeast mRNA splicing in vitro. J Biol Chem. 1985, 260 (27): 14780-14792.

    CAS  PubMed  Google Scholar 

  30. Lorkovic ZJ, Wieczorek Kirk DA, Lambermon MH, Filipowicz W: Pre-mRNA splicing in higher plants. Trends Plant Sci. 2000, 5 (4): 160-167. 10.1016/S1360-1385(00)01595-8.

    Article  CAS  PubMed  Google Scholar 

  31. Rogozin IB, Wolf YI, Sorokin AV, Mirkin BG, Koonin EV: Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution. Curr Biol. 2003, 13 (17): 1512-1517. 10.1016/S0960-9822(03)00558-X.

    Article  CAS  PubMed  Google Scholar 

  32. Yoshihama M, Nakao A, Nguyen HD, Kenmochi N: Analysis of Ribosomal Protein Gene Structures: Implications for Intron Evolution. PLoS Genet. 2006, 2 (3): e25-10.1371/journal.pgen.0020025.

    Article  PubMed Central  PubMed  Google Scholar 

  33. Sverdlov AV, Rogozin IB, Babenko VN, Koonin EV: Conservation versus parallel gains in intron evolution. Nucleic Acids Res. 2005, 33 (6): 1741-1748. 10.1093/nar/gki316.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  34. Roy SW, Gilbert W: Rates of intron loss and gain: implications for early eukaryotic evolution. Proc Natl Acad Sci U S A. 2005, 102 (16): 5773-5778. 10.1073/pnas.0500383102.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  35. Rogozin IB, Sverdlov AV, Babenko VN, Koonin EV: Analysis of evolution of exon-intron structure of eukaryotic genes. Brief Bioinform. 2005, 6 (2): 118-134. 10.1093/bib/6.2.118.

    Article  CAS  PubMed  Google Scholar 

  36. Nguyen HD, Yoshihama M, Kenmochi N: New maximum likelihood estimators for eukaryotic intron evolution. PLoS Comput Biol. 2005, 1 (7): e79-10.1371/journal.pcbi.0010079.

    Article  PubMed Central  PubMed  Google Scholar 

  37. Simpson AG, Radek R, Dacks JB, O'Kelly CJ: How oxymonads lost their groove: an ultrastructural comparison of Monocercomonoides and excavate taxa. J Eukaryot Microbiol. 2002, 49 (3): 239-248. 10.1111/j.1550-7408.2002.tb00529.x.

    Article  PubMed  Google Scholar 

  38. Dacks JB, Silberman JD, Simpson AG, Moriya S, Kudo T, Ohkuma M, Redfield RJ: Oxymonads are closely related to the excavate taxon Trimastix. Mol Biol Evol. 2001, 18 (6): 1034-1044.

    Article  CAS  PubMed  Google Scholar 

  39. Cavalier-Smith T: The excavate protozoan phyla Metamonada Grasse emend. (Anaeromonadea, Parabasalia, Carpediemonas, Eopharyngia) and Loukozoa emend. (Jakobea, Malawimonas): their evolutionary affinities and new higher taxa. Int J Syst Evol Microbiol. 2003, 53 (Pt 6): 1741-1758. 10.1099/ijs.0.02548-0.

    Article  CAS  PubMed  Google Scholar 

  40. Douglas S, Zauner S, Fraunholz M, Beaton M, Penny S, Deng LT, Wu X, Reith M, Cavalier-Smith T, Maier UG: The highly reduced genome of an enslaved algal nucleus. Nature. 2001, 410 (6832): 1091-1096. 10.1038/35074092.

    Article  CAS  PubMed  Google Scholar 

Download references


This work was supported by a grant from the Natural Sciences and Engineering Research Council of Canada, and EST sequencing was supported by the Protist EST Program through Genome Canada/Genome Atlantic. We thank A. de Koning for help isolating termite gut RNA, and N. Fast for critical reading of the manuscript. PJK is a Fellow of the Canadian Institute for Advanced Research and a New Investigator of the Canadian Institutes for Health Research and the Michael Smith Foundation for Health Research.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Patrick J Keeling.

Additional information

Authors' contributions

CHS analysed the EST data, performed PCR and sequencing, and examined conservation of intron positions in other organisms. PJK collected the termites and purified RNA for library construction. Both authors participated in the writing and editing of the manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Slamovits, C.H., Keeling, P.J. A high density of ancient spliceosomal introns in oxymonad excavates. BMC Evol Biol 6, 34 (2006).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: