Skip to main content

Horizontal gene transfer of microbial cellulases into nematode genomes is associated with functional assimilation and gene turnover



Natural acquisition of novel genes from other organisms by horizontal or lateral gene transfer is well established for microorganisms. There is now growing evidence that horizontal gene transfer also plays important roles in the evolution of eukaryotes. Genome-sequencing and EST projects of plant and animal associated nematodes such as Brugia, Meloidogyne, Bursaphelenchus and Pristionchus indicate horizontal gene transfer as a key adaptation towards parasitism and pathogenicity. However, little is known about the functional activity and evolutionary longevity of genes acquired by horizontal gene transfer and the mechanisms favoring such processes.


We examine the transfer of cellulase genes to the free-living and beetle-associated nematode Pristionchus pacificus, for which detailed phylogenetic knowledge is available, to address predictions by evolutionary theory for successful gene transfer. We used transcriptomics in seven Pristionchus species and three other related diplogastrid nematodes with a well-defined phylogenetic framework to study the evolution of ancestral cellulase genes acquired by horizontal gene transfer. We performed intra-species, inter-species and inter-genic analysis by comparing the transcriptomes of these ten species and tested for cellulase activity in each species. Species with cellulase genes in their transcriptome always exhibited cellulase activity indicating functional integration into the host's genome and biology. The phylogenetic profile of cellulase genes was congruent with the species phylogeny demonstrating gene longevity. Cellulase genes show notable turnover with elevated birth and death rates. Comparison by sequencing of three selected cellulase genes in 24 natural isolates of Pristionchus pacificus suggests these high evolutionary dynamics to be associated with copy number variations and positive selection.


We could demonstrate functional integration of acquired cellulase genes into the nematode's biology as predicted by theory. Thus, functional assimilation, remarkable gene turnover and selection might represent key features of horizontal gene transfer events in nematodes.


Horizontal gene transfer (HGT), the movement of genes from one organism to another by mechanisms other than direct descent, is widespread in prokaryotes, but is thought to be rare among eukaryotes with sexual reproduction [1, 2]. However, recent genome and expressed sequence tag (EST) projects in nematodes provide strong evidence for HGT from bacteria, fungi, amoebozoa or from endosymbionts in the plant parasites Meloidogyne incognita and M. hapla, fungivorous and plant-parasitic nematodes of the Bursaphelenchus group, the necromenic nematode Pristionchus pacificus and the filarial nematode Brugia malayi [37]. In particular, cell wall degrading enzymes of glycosyl hydrolase families (endo-1,4-beta-glucanases; cellulases), pectate lyases and xylanases were acquired by several of these nematodes. Cellulases (EC can be found in 14 different glycosyl hydrolase families, of which only GHF5 and GHF45 were found in nematodes so far [8].

Surprisingly, phylogenetic reconstruction suggests that cellulase genes have been acquired multiple times independently in nematodes from distinct microbial donors [9]. Characterized cellulases from plant parasitic Tylenchida (plant sedentary endoparasitic nematodes such as Meloidogyne, Heterodera and Globodera) and migratory parasitic species (Pratylenchus penetrans) are from glycoside hydrolase family 5 (GHF5). They appear to have acquired a whole ancestral GHF5 gene cassette consisting of the catalytic domain and the carbohydrate-binding module 2 (CBM2) [10]. The intronless ancestral gene from putative bacterial donors later gained introns followed by intron shifts or losses in single gene copies. The pine wood nematode Bursaphelenchus xylophilus, which is part of the same clade as the cyst/root-knot nematodes above, but from a distinct grouping, acquired a different family of cellulases (GHF45) from fungi [3]. Genes obtained by HGT that are central to parasitic interaction, like pectin lyase or cellulase genes have undergone expansion by gene duplication after HGT in Meloidogyne hapla, M. incognita or P. pacificus [57, 10].

Evolutionary theory makes a number of predictions for the successful acquisition of genes by HGT. Specifically, evolutionary theory predicts that the successful integration of HGT-acquired genes into the biology of the host requires gene activity and longevity [11], resulting in a number of testable, but yet unanswered questions, i.e. are genes acquired by HGT functional? How stable are these genes over evolutionary time scales and do they undergo normal or unusual turnover rates? Finally, are such genes under positive selection?

To study the evolutionary history, activity and longevity of genes acquired by HGT requires a well-established phylogenetic framework at the species and family level and a number of natural isolates for intra-species comparisons. Nematodes of the beetle-associated genus Pristionchus fulfill these requirements as more than 160 strains of worldwide origin are available for the model species P. pacificus and 25 species of the Pristionchus genus have been phylogenetically characterized [12, 13]. Furthermore, molecular analysis places the genus Pristionchus well within the Diplogastridae family [1416]. Building on this phylogenetic framework, we used, first, transcriptomics of seven Pristionchus species and three additional diplogastrid species to study the evolution of cellulase genes and, second, compared individual genes in 24 natural isolates of P. pacificus.

Here we show that species with cellulase genes in their transcriptome always exhibited cellulase activity. Cellulase genes show high turnover with significant birth and death rates. The comparison of 24 natural isolates of P. pacificus strains from around the world suggests these high evolutionary dynamics to be associated with copy number variations and positive selection. We conclude that functional assimilation, high gene turnover and selection might represent key features of HGT events in nematodes.


Cellulase genes in diplogastrid transcriptomes

EST libraries of P. pacificus, six additional Pristionchus species, and three other diplogastrids (Acrostichus sp. RS5083, Diplogasteroides magnus RS5445 and Koerneria sudhausi SB413) were sequenced using the Roche 454-sequence technology platform (Figure 1A, see Materials and Methods for details) [17]. The datasets contained 16,000 to 28,000 assembled EST contigs and 42,000 to 148,000 singletons, each. Expressed gene contigs encompassing the carbohydrate-binding module (CBM) present at the C terminus of cellulases, were identified in all seven Pristionchus species and a near full-length cellulase transcript was found in Koerneria sudhausi (Figure 1A) [18]. In contrast, cellulase/CBM transcripts were not found in D. magnus and Acrostichus sp. RS5083 (Figure 1A). The CBM of the Pristionchus nematodes belonged to the CBM49 (pfam09478) family [19] and differed from that of plant-parasitic nematodes, which have similarity to the bacterial carbohydrate-binding module of family 2 (CBM2; pfam00553) [18]. The CBM49 domains of different Pristionchus species were all highly similar to each other (Figure 1B; additional file 1). In contrast, in the K. sudhausi cellulase a CBM domain could not be detected and the cellulase transcripts did not show sufficient sequence similarity to the Pristionchus cellulases to be included in the same multiple sequence alignments.

Figure 1
figure 1

Correlation between presence of cellulase genes and cellulase activity. (A) Phylogenetic relationships and results of diplogastrid species tested for cellulase transcripts and activity. These 10 species were used in Roche 454 sequencing. The tree was reconstructed from 4,043 bp of concatenated ribosomal protein genes by heuristic search using the maximum likelihood criterion in the PAUP*4.0b10 software as described in Mayer et al. (2009) [11, 12]. Numbers at nodes indicate bootstrap values. The presence of cellulase transcripts and observed cellulase activity in worm supernatants is indicated. The tree was rooted by Koerneria sudhausi. (B) Neighbour-joining tree of 246 bp of aligned EST contigs from Roche 454 transcriptomes encoding CBM49 domains of cellulases (see additional file 1). The best-fit substitution model (TrNef+G) has been selected by Modeltest 3.7 (Posada) corresponding to equal base frequencies, Nst = 6, substitution rates of A-C = 1, A-G = 1.1039, A-T = 1, C-G = 1, C-T = 1.7501, G-T = 1.0000; proportion of invariable sites I = 0; gamma distribution shape parameter α = 0.9033. Numbers at nodes indicate bootstrap values of 1,000 replications.

BLASTX searches of GenBank, Wormbase and UniRef90 databases with Pristionchus and Koerneria cellulase transcript sequences encompassing the full-length cellulase domain identified them as members of the glycosyl hydrolase family 5 (GHF5; pfam00150) (Figure 2) [6, 20, 21]. Different sets of closest matching sequences were found. The Pristionchus cellulase genes Ppa-cel-2, Ppa-cel-3 and Ppa-cel-1, the latter not detected in the sequenced transcriptome, were most closely related to a paralogous set of duplicated genes of the slime molds Dictyostelium discoideum and Polysphondylium pallidum (score: 264 bits, E-value: 4e-68), whereas K. sudhausi genes revealed high similarity to cellulase genes of the plant-parasitic nematode Aphelenchus avenae (score: 306 bits, E-value: 1e-81; Figure 2). These results of the similarity searches could be affected by skewed or insufficient sampling because matching homologues from related organisms are not yet in the databases, as has been argued by Mitreva et al. (2009) for GHF5 cellulase genes in plant-parasitic nematodes [22]. This explanation could hold for the Koerneria GHF5 cellulase gene, which appears to be in a monophyletic clade with plant-parasitic nematodes of the phylogenetic clade 12 [16]. However, the clade 9 nematode Koerneria is the only nematode in this clade - which encompasses Diplogastridae (Pristionchus and others) and Rhabditidae (Caenorhabditis and others) - to possess this GHF5 subclass, and no genes similar to GHF5 cellulases were detected in available EST datasets at GenBank from non-plant parasitic nematode clades or other species at intermediate phylogenetic positions (e. g., Rhabditida, 748,443 ESTs; Ascarida, 63,743 ESTs; Spirurida, 59,125 ESTs; Trichocephalida, 52,763 ESTs; Dorylaimida, 9,351 ESTs; Araelaimida, 2,591 ESTs).

Figure 2
figure 2

Phylogenies of diplogastrid GHF5 cellulase proteins and the best non-diplogastrid matches. The best protein matches from the GenBank database were identified and retrieved using BLAST Explorer and a phylogram was reconstructed using the server by the PhyML (v3.0) algorithm. Paralogous genes were omitted from the analysis. The GenInfo Identifier (gi) numbers are given after the taxon names. Numbers in black print at nodes indicate approximate Likelihood-Ratio Test (aLRT) values, numbers in blue print indicate bootstrap support values. Red arrows indicate the query taxa. (A) Phylogenetic relationships of P. pacificus cellulases Ppa-cel-1, Ppa-cel-2, and Ppa-cel-3 to 63 best-matching proteins from GenBank using 109 amino acid positions after curation by the Gblocks algorithm. The initial tree was obtained by the BIONJ algorithm, the amino acid substitution model was set to WAG. The discrete gamma model was applied with 4 categories, a gamma shape parameter of 1.119, and proportion of invariant sites of 0.030. (B) Phylogenetic relationships of the K. sudhausi cellulase EST contig to 51 best-matching proteins from GenBank using 141 amino acid positions after curation by the Gblocks algorithm. The initial tree was obtained by the BIONJ algorithm, the amino acid substitution model was set to WAG. The discrete gamma model was applied with 4 categories, a gamma shape parameter of 1.294, and proportion of invariant sites of 0.097.

In contrast, the Pristionchus cellulases belong to a GHF5 subclass that is distinct from those in plant-parasitic nematodes. Tracing the GHF5 genes of clade 9 and clade 12 nematodes and amoebozoans back to a common ancestor would require maintenance and sequence conservation of gene copies in genomes over extended evolutionary timescales and massive parallel losses of genes in most investigated nematodes and other phylogenetically related lineages of organisms during evolution. Hence, two independent HGT events, one from an amoebozoan or related microorganism to an ancestral Pristionchus species and a second from an unknown donor to Koerneria and Aphelenchus, appear to be the most likely scenario.

Correlation of cellulase genes and cellulase activity

Is the presence of cellulase genes in the EST sequence data correlated with cellulase activity? We tested the supernatant of mixed stage cultures of all ten species in a Congo red-polysaccharide interaction assay and found complete correlation with the transcriptome data [3, 23]. All eight tested Pristionchus and Koerneria species that contain cellulase genes in their EST libraries had detectable cellulase activity in the assay (Figure 1A; Figure 3). In contrast, D. magnus and Acrostichus sp. RS5083 did neither show cellulase activity nor could we detect expression of cellulase genes in their transcriptomes (Figure 1A; Figure 3). Thus, species with cellulase genes in their transcriptome always exhibited cellulase activity, suggesting that functional assimilation of HGT acquired genes represents an important force during HGT events.

Figure 3
figure 3

Cellulase activity in diplogastrid nematodes. The cellulase activity assays were performed on carboxymethylcellulose agar plates [3, 23]. Clear halos were detected in the supernatants of P. uniformis, P. entomophagus, P. aerivorus, P. pseudaerivorus, P. sp. 11, P. sp. 15, and K. sudhausi mixed-stage cultures and in the positive control (Aspergillus). No halos were observed in the supernatants of mixed-stage cultures of Acrostichus sp. and D. magnus and the negative control (Escherichia coli).

Evolutionary dynamics of Pristionchuscellulase genes

To study the evolutionary dynamics and turnover rates of cellulase genes, we reconstructed the phylogenetic relationship of the 12 Pristionchus ESTs encoding CBM49 domains by the maximum likelihood criterion using the alignment shown in additional file 1.The resulting phylogram revealed two important findings (Figure 1B). First, the cellulase CBM49 gene phylogeny is highly congruent to the species phylogeny of Pristionchus (Figure 1A,B) [13]. This result indicates an ancestral acquisition of cellulase genes followed by maintenance of the gene through several speciation events and subsequent divergence by point mutations during the evolution of species. The only discrepancy is presented by the P. uniformis Contig22434.1, which appears to be a cellulase gene that was maintained only in P. uniformis but was lost in other Pristionchus species. Homologous genes in the P. pacificus genome with highest similarity to P. uniformis Contig22434.1 are Ppa-cel-2 and Ppa-cel-3. Second, the cellulase genes evolve dynamically by gene duplications as indicated by species-specific duplicates in P. pacificus, P. sp. 11 and P. sp. 15, respectively. In addition to lineage-specific gene duplications, putative gene deletions have occurred as exemplified by an ancestral CBM49 domain that is detectable in only one extant species, P. uniformis. These results suggest conspicuous evolutionary dynamics of cellulase genes in the genus Pristionchus with high gene birth and death rates.

Copy number variation among populations is well known from human evolution [24]. We wanted to know if the evolutionary dynamics of genes acquired by HGT also results in patterns of copy number variations. We selected three paralogous cellulase genes from the P. pacificus genome and compared them in 24 natural isolates of worldwide origin (Table 1). Two of the genes (Ppa-cel-2 and Ppa-cel-3) are expressed in the transcriptome and are detectable by mass spectroscopy in the proteome [6, 25]. For comparison, the third gene (Ppa-cel-1) has been selected because it could be found as a complete copy in the genome and its transcription has been demonstrated, but at too low levels to be found in the transcriptome and proteome analyses [6, 25]. The analyzed parts of the genes Ppa-cel-1, Ppa-cel-2 and Ppa-cel-3 encompass 3.0, 2.6 and 2.8 kb, respectively, with a total of 3.9 kb of exonic sequence (Table 2). While Ppa-cel-1 and Ppa-cel-2 were obtained from all 24 natural isolates, Ppa-cel-3 could only be detected in 16 out of the 24 P. pacificus strains (Table 1). The variability of Ppa-cel-3 among the strains is low, with ≤ 0.4% nucleotide differences in pair-wise comparisons of the whole gene and ≤ 0.3% nucleotide differences in exons. At the same time, this gene is highly similar to Ppa-cel-2 in transcript sequence and to a large part in intronic sequences with most of the splice sites being conserved between them. These results suggest that the Ppa-cel-3 arose by recent gene duplication from Ppa-cel-2 within the species P. pacificus. The time since its birth was too short for substantial sequence diversification and the gene itself has not yet spread through the whole population. Thus, the comparison of Pristionchus species and P. pacificus strains provides strong evidence for notable evolutionary turnover of cellulase genes by gene duplications and putative deletions.

Table 1 Pristionchus pacificus strains
Table 2 Pristionchus pacificu s cellulase gene data

Evidence for positive selection of Ppa-cel-2

In addition to copy number variations, the evolutionary diversification of genes should result in intra-specific polymorphisms, which are either the result of non-adaptive or selective processes [26]. The P. pacificus collection of wild isolates of worldwide origin enables the inference of site-specific positive Darwinian and purifying selection. Site-specific positive Darwinian should result in a ratio of non-synonymous (Ka) to synonymous (Ks) substitutions higher than one at individual codons. We calculated the Ka/Ks ratio at each site of the cellulase proteins Ppa-CEL-1, Ppa-CEL-2 and Ppa-CEL-3 with the help of the Selecton server by using an empirical Bayesian method [27]. Sites with a Ka/Ks ratio ≥1 in Ppa-CEL-2 were identified. A likelihood ratio test was performed between the two models, M8 and M8a, implemented on this platform. M8 considers sites under purifying selection vs. a category of sites experiencing positive or neutral selection (ωs ≥1) [28]. M8a, which is nested in the M8 model, is restricted to purifying and neutral selection only (ωs = 1) [29]. The test showed a significance level of 0.001 in support for the M8 model, which allows site-specific positive selection (Table 2).

In contrast, Ppa-CEL-1 and Ppa-CEL-3 did not show any evidence for positive selection, arguing that the evolutionary forces act on individual genes with high specificity. This is also observable when comparing the degree of polymorphisms among the three genes. The highly expressed Ppa-cel-2 gene displayed up to 2.1% nucleotide differences in pair-wise comparisons of the whole gene and 1.5% in exons. In contrast, the variation in Ppa-cel-1 was considerably higher with up to 5.7% and 6.0% nucleotide differences in the whole gene and exons, respectively. We speculate that neutral processes underlie this variability in Ppa-cel-1, but not in Ppa-cel-2. Taken together, the comparison of three cellulase genes in 24 natural isolates of P. pacificus indicates that different evolutionary forces act on these genes and provides evidence for positive selection acting on Ppa-cel-2.

Evidence for intron gain after HGT

After a gene has been acquired by HGT it will assimilate to the new genomic environment, which may lead to changes in the size and number of spliceosomal introns [30]. Consistent with this, we found that the putative donor cellulase genes in the slime molds D. discoideum and P. pallidum contain only one or two small introns of 71 to 275 bp, whereas the P. pacificus cellulase genes Ppa-cel-1, Ppa-cel-2, and Ppa-cel-3 possess 13 to 16 introns, ranging from 14 to 452 bp (Table 2). This includes several extremely small introns of a length below 50 bp, which are typical for P. pacificus and other nematodes [6]. Assuming that the slime mold genes represent the ancestral state, these cellulase genes have gained up to 15 small spliceosomal introns. However, we want to note that in theory, it is also possible that slime molds might have lost ancestral introns.


We have shown that the diplogastrid nematode genera Koerneria and Pristionchus independently acquired cellulase genes by HGT from different putative donors. A detailed functional and genomic investigation in the phylogenetic framework of Pristionchus reveals that all features of evolutionary significance are fulfilled, including functional activity, longevity and the action of positive selection. Longevity of the transferred genes is indicated by their divergence from the donor through accumulated mutations. In addition, the resulting cellulase CBM49 phylogeny following HGT into Pristionchus is congruent to the host taxon phylogeny, i.e. the phylogenetic pattern of the transferred genes is largely identical to the phylogeny of the genus Pristionchus (Figure 1A,B). Thus, the original insertion of the cellulase gene occurred in an ancestral taxon and has been maintained over evolutionary time scales. During these processes, the cellulase gene became functionally integrated into the nematode's biology. Five notions support this. First, the cellulase genes are transcribed. Second, peptides corresponding to transcripts were identified by mass spectrometry [25]. Third, the nematodes secrete cellulases whose enzymatic activity can be demonstrated in functional assays with supernatant of mixed stage cultures. In nature, microorganismal and plant cellulose are potential targets for nematode cellulase activity, providing new potential food sources and fitness advantages for the nematode. Fourth, the cellulase genes gained introns in the host species, which are in phase with the functional open reading frame. The finding that the original open reading frames have not been disrupted by intron gain provides additional evidence for the functional assimilation of the acquired genes and for a selective advantage of active cellulase proteins in P. pacificus. Fifth, the cellulase genes display high intra-genus and intra-species turnover dynamics by birth-and-death processes, accompanied by selection for certain functional genes. In conclusion, we demonstrated the functional integration of acquired cellulase genes into the nematode's biology as predicted by theory. Functional assimilation, remarkable gene turnover and selection might represent key features of horizontal gene transfer events in nematodes. We speculate that HGT represents a major source for evolutionary innovation in nematodes and might have helped nematodes to invade the diversity of ecosystems, in which they can be found today, such as parasites of plants, animals, including humans.


Predictions for successful horizontal gene transfer by evolutionary theory include, in particular, integration of the genes into the host's genome, functional activity, evolutionary longevity and positive selection of the genes. Supported by a robust phylogenetic framework we examined here these predictions for cellulase genes in seven species of the diplogastrid nematode Pristionchus and three other related diplogastrid genera. The following conclusions can be drawn. First, species with cellulase genes in their transcriptome always showed enzymatic cellulase activity demonstrating functional integration into the biology of the nematode. Second, different cellulase genes were acquired independently by the diplogastrid nematodes Koerneria and Pristionchus. Third, the considerable congruence of the phylogenetic profile of cellulase genes within the genus Pristionchus with the phylogeny of the species indicates an ancestral gene integration event into the genome and gene longevity. Forth, sequence comparisons of three selected cellulase genes in 24 natural isolates of the species Pristionchus pacificus show notable evolutionary dynamics with elevated birth and death rates, associated with copy number variations and putative positive selection for one of the cellulase genes. Thus, functional integration of acquired cellulase genes into the nematode's biology, as predicted by theory, could be demonstrated.


Expressed sequence tag (EST) library preparation and sequencing

Mixed stage cultures of inbred nematode strains derived from isogenic female lines were collected and washed with phosphate-buffered saline. The strains were: Pristionchus pacificus PS312; P. aerivorus RS5106; P. pseudaerivorus RS5139; P. entomophagus RS0144; P. uniformis RS0141; P. sp. 11 RS5228; P. sp. 15 RS5229; Diplogasteroides magnus RS5445; Acrostichus sp. RS5083; and Koerneria sudhausi SB413. Total RNA was prepared by the Triazol method (Invitrogen). Normalized cDNA libraries from the polyadenylated RNA fraction from nine of the mixed stage nematodes were constructed at the GSC at Washington University by standard procedures and sequenced on the Roche 454 FLX platform producing 320,000 to 540,000 reads. Median read length was ~ 250 bp for all libraries. The K. sudhausi library was sequenced on the Roche 454 Titanium platform resulting in 1.2 million reads. High quality Roche 454 sequencer reads were assembled with the EST version of PCAP.REP [31] to EST contigs and annotated by similarity searches using BLASTX against the non-redundant protein databases Uniref90 and GenBank. The complete ten species EST library dataset will be published elsewhere. Carbohydrate-binding modules (CBM) and cellulase families were identified and classified with the help of the NCBI conserved domain database (CDD) ( and the Carbohydrate-Active EnZymes database (CAZy) server [8, 32, 33]. All cellulase genes identified in Pristionchus and Koerneria are available at under

Multiple sequence alignments and phylogenetic tree reconstruction

Cellulase nucleotide sequences were aligned by Clustalx [34, 35] and adjusted manually with the help of the Seqpup (v0.6f) software [36]. For phylogenetic tree reconstruction the optimal substitution model was selected by the Modeltest 3.7 software using the Akaike information criterion [37, 38]. A phylogenetic tree was then obtained by the neighbour-joining algorithm with the help of the PAUP*4.0b10 software [39]. The distance measure was set to ML with likelihood settings corresponding to the model as determined by Modeltest. Trees were visualized using FigTree v.1.3.1 [40].

To find the closest homologs to the diplogastrid cellulase EST contigs the 100 best protein matches were identified by a similarity search in the non-redundant protein database (2010-05-18) at GenBank and retrieved using BLAST Explorer [41]. The dataset was pruned by removal of highly similar paralogous or allelic protein sequences. The minimum score of the sequences kept in the final dataset was 140 bits with an E-value of 9e-31 for Pristionchus cellulase EST contigs, and a score of 156 bits, E-value of 1e-36 for the Koerneria sudhausi EST contig. Phylogenetic analysis of the dataset was performed by the workflow of the platform [42] comprising sequence alignment by the MUSCLE software (v3.7) [43], removal of ambiguous regions by Gblocks (v0.91b) [44] using the following parameters: minimum length of a block after gap cleaning: 10; no gap positions allowed in the final alignment; all segments with contiguous nonconserved positions bigger than 8 were rejected; minimum number of sequences for a flank position: 85%. The phylogenetic tree was reconstructed from the protein sequences using the maximum likelihood method implemented in the PhyML program (v3.0 aLRT) on the same platform [42, 45]. WAG was used as substitution model with 4 substitution rate categories, number of invariant sites and gamma distribution parameter were estimated. The gamma shape parameter was estimated directly from the data (gamma = 1.119). Reliability of internal branches was assessed using the aLRT test (SH-Like) [46].

Amplification of Pristionchus pacificuscellulase genes by polymerase chain reaction (PCR)

Cellulase genes from genomic DNA from 24 P. pacificus strains were amplified in four or five overlapping amplicons by PCR. The reactions were performed in 20 μl of 1× PCR buffer (GE HealthCare, Germany) containing 0.2 mM of each deoxynucleoside triphosphate, 0.5 μM of each primer, 40 ng of genomic DNA, and 1 unit of Taq DNA polymerase (GE HealthCare). The reactions were started by initial denaturation at 94°C for 5 min in a GeneAmp PCR System 9700 thermocycler (PE Applied Biosystems, Darmstadt, Germany), followed by 35 cycles of denaturation at 94°C for 30 sec, primer annealing at 55°C for 30 sec, and extension at 72°C for 2 min. A final incubation step at 72°C for 7 min concluded the reaction. The products were gel purified and directly sequenced by the PCR primers (additional file 2). The sequences were assembled using the SeqMan Pro (v8.0.2) software in the Lasergene 8 package (DNASTAR) and aligned to P. pacificus PS312 genomic and cDNA with the help of Seqpup (v0.6f) to deduce the positions of exons [6, 36]. Protein sites under selection were identified in a codon-alignment of exon sequences using the Selecton server at[27].

Cellulase Activity Assay

About 50,000 mixed stage worms per milliliter M9 medium were incubated on a shaking platform for 2 days at 20°C. The culture supernatants were sterile filtered and 40 μl were applied to holes in agar plates containing 0.5% carboxymethylcellulose sodium salt. After incubation for 24 h the plates were stained with 0.1% Congo red for 15 min and destained with 1 M NaCl for another 15 min. The assay was repeated at least 3 times for each nematode strain.

Accession codes

The nucleotide sequences reported here have been deposited at GenBank with the accession numbers HQ632018-HQ632092.


Pristionchus web server: Selecton server: web service: Uniref90 database: Genbank database: Carbohydrate-Active enZYmes Database (CAZy):



Carbohydrate binding module


copy number variation


expressed sequence tag


glycosyl hydrolase family


horizontal gene transfer


polymerase chain reaction


  1. Andersson JO: Lateral gene transfer in eukaryotes. Cellular and Molecular Life Sciences. 2005, 62: 1182-1197. 10.1007/s00018-005-4539-z.

    Article  CAS  PubMed  Google Scholar 

  2. Gladyshev EA, Meselson M, Arkhipova IR: Massive horizontal gene transfer in bdelloid rotifers. Science. 2008, 320: 1210-1213. 10.1126/science.1156407.

    Article  CAS  PubMed  Google Scholar 

  3. Kikuchi T, Jones JT, Aikawa T, Kosaka H, Ogura NA: Family of glycosyl hydrolase family 45 cellulases from the pine wood nematode Bursaphelenchus xylophilus. FEBS Lett. 2004, 572: 201-205. 10.1016/j.febslet.2004.07.039.

    Article  CAS  PubMed  Google Scholar 

  4. Dunning Hotopp JC, Clark ME, Oliveira DCSG, Foster JM, Fischer P, et al: Widespread lateral gene transfer from intracellular bacteria to multicellular eukaryotes. Science. 2007, 317: 1753-1756. 10.1126/science.1142490.

    Article  CAS  PubMed  Google Scholar 

  5. Abad P, Gouzy J, Aury JM, Castagnone-Sereno P, Danchin EGJ, et al: Genome sequence of the metazoan plant-parasitic nematode Meloidogyne incognita. Nat Biotechnol. 2008, 26: 909-915. 10.1038/nbt.1482.

    Article  CAS  PubMed  Google Scholar 

  6. Dieterich C, Clifton SW, Schuster LN, Chinwalla A, Delehaunty K, et al: The Pristionchus pacificus genome provides a unique perspective on nematode lifestyle and parasitism. Nat Genet. 2008, 40: 1193-1198. 10.1038/ng.227.

    Article  CAS  PubMed  Google Scholar 

  7. Opperman CH, Bird DM, Williamson VM, Rokhsar DS, Burke M, et al: Sequence and genetic map of Meloidogyne hapla: A compact nematode genome for plant parasitism. Proc Natl Acad Sci USA. 2008, 105: 14802-14807. 10.1073/pnas.0805946105.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Carbohydrate-Active enZYmes Database (CAZY): []

  9. Dieterich C, Sommer RJ: How to become a parasite - lessons from the genomes of nematodes. Trends Genet. 2009, 25: 203-209. 10.1016/j.tig.2009.03.006.

    Article  CAS  PubMed  Google Scholar 

  10. Kyndt T, Haegeman A, Gheysen G: Evolution of GHF5 endoglucanase gene structure in plant-parasitic nematodes: no evidence for an early domain shuffling event. BMC Evol Biol. 2008, 8: 305-10.1186/1471-2148-8-305.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Blaxter M: Symbiont genes in host genomes: fragments with a future?. Cell Host Microbe. 2007, 2: 211-213. 10.1016/j.chom.2007.09.008.

    Article  CAS  PubMed  Google Scholar 

  12. Hong RL, Sommer RJ: Pristionchus pacificus: a well-rounded nematode. Bioessays. 2006, 28: 651-659. 10.1002/bies.20404.

    Article  CAS  PubMed  Google Scholar 

  13. Mayer WE, Herrmann M, Sommer RJ: Phylogeny of the nematode genus Pristionchus and implications for biodiversity, biogeography and the evolution of hermaphroditism. BMC Evol Biol. 2007, 7: 104-10.1186/1471-2148-7-104.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Mayer WE, Herrmann M, Sommer RJ: Molecular phylogeny of beetle associated diplogastrid nematodes suggests host switching rather than nematode-beetle coevolution. BMC Evol Biol. 2009, 9: 212-10.1186/1471-2148-9-212.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Kanzaki N, Giblin-Davis RM, Davies K, Ye W, Center BJ, et al: Teratodiplogaster fignewmani gen. nov., sp. nov. (Nematoda: Diplogastridae) from the syconia of Ficus racemose in Australia. Zoolog Sci. 2009, 26: 569-578. 10.2108/zsj.26.569.

    Article  CAS  PubMed  Google Scholar 

  16. van Megen H, van den Elsen S, Holterman M, Karssen G, Mooyman P, et al: A phylogenetic tree of nematodes based on about 1200 full-length small subunit ribosomal DNA sequences. Nematology. 2009, 11: 927-S27. 10.1163/156854109X456862.

    Article  CAS  Google Scholar 

  17. []

  18. Boraston AB, Bolam DN, Gilbert HJ, Davies GJ: Carbohydrate-binding modules: fine-tuning polysaccharide recognition. Biochem J. 2004, 382: 769-781. 10.1042/BJ20040892.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Urbanowicz BR, Catala C, Irwin D, Wilson DB, Ripoll DR, et al: A tomato endo-beta-1,4-glucanase, SlCel9C1, represents a distinct subclass with a new family of carbohydrate binding modules (CBM49). J Biol Chem. 2007, 282: 12066-12074. 10.1074/jbc.M607925200.

    Article  CAS  PubMed  Google Scholar 

  20. Henrissat B: A classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem J. 1991, 280: 309-316.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Henrissat B: A new cellulase family. Mol Microbiol. 1997, 23: 848-849. 10.1046/j.1365-2958.1997.2661629.x.

    Article  CAS  PubMed  Google Scholar 

  22. Mitreva M, Smant G, Helder J: Role of horizontal gene transfer in the evolution of plant parasitism among nematodes. Methods Mol Biol. 2009, 532: 517-535. full_text.

    Article  CAS  PubMed  Google Scholar 

  23. Mateos PF, Jimenez-Zurdo JI, Chen J, Squartini AS, Haak SK, et al: Cell-associated pectinolytic and cellulolytic enzymes in Rhizobium leguminosarum biovar trifolii. Appl Environ Microbiol. 1992, 58: 1816-1822.

    CAS  PubMed  PubMed Central  Google Scholar 

  24. Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, et al: Global variation in copy number in the human genome. Nature. 2006, 444: 444-454. 10.1038/nature05329.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Borchert N, Dieterich C, Krug K, Schutz W, Jung S, et al: Proteogenomics of Pristionchus pacificus reveals distinct proteome structure of nematode models. Genome Res. 2010, 20: 837-846. 10.1101/gr.103119.109.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Lynch M: The frailty of adaptive hypotheses for the origins of organismal complexity. Proc Natl Acad Sci USA. 2007, 104 (Suppl 1): 8597-8604. 10.1073/pnas.0702207104.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Stern A, Doron-Faigenboim A, Erez E, Martz E, Bacharach E, et al: Selecton 2007: advanced models for detecting positive and purifying selection using a Bayesian inference approach. Nucleic Acids Res. 2007, 35: W506-511. 10.1093/nar/gkm382.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Yang Z, Nielsen R, Goldman N, Pedersen AM: Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics. 2000, 155: 431-449.

    CAS  PubMed  PubMed Central  Google Scholar 

  29. Swanson WJ, Nielsen R, Yang Q: Pervasive adaptive evolution in mammalian fertilization proteins. Mol Biol Evol. 2003, 20: 18-20.

    Article  CAS  PubMed  Google Scholar 

  30. Moran NA, Jarvik T: Lateral transfer of genes from fungi underlies carotenoid production in aphids. Science. 2010, 328: 624-627. 10.1126/science.1187113.

    Article  CAS  PubMed  Google Scholar 

  31. Huang X, Yang SP, Chinwalla AT, Hillier LW, Minx P, et al: Application of a superword array in genome assembly. Nucleic Acids Res. 2006, 34: 201-205. 10.1093/nar/gkj419.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Marchler-Bauer A, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, et al: CDD: specific functional annotation with the Conserved Domain Database. Nucleic Acids Res. 2009, 37: D205-D210. 10.1093/nar/gkn845.

    Article  CAS  PubMed  Google Scholar 

  33. Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, et al: The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009, 37: D233-D238. 10.1093/nar/gkn663.

    Article  CAS  PubMed  Google Scholar 

  34. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research. 1997, 25: 4876-4882. 10.1093/nar/25.24.4876.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, et al: Clustal W and Clustal × version 2.0. Bioinformatics. 2007, 23: 2947-2948. 10.1093/bioinformatics/btm404.

    Article  CAS  PubMed  Google Scholar 

  36. Gilbert DG: SeqPup version 0.6f: a biosequence editor and analysis application. 1996, []

    Google Scholar 

  37. Akaike H: A new look at the statistical model identification. IEEE Trans Automatic Control. 1974, 19: 716-723. 10.1109/TAC.1974.1100705.

    Article  Google Scholar 

  38. Posada D, Crandall KA: Modeltest: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.

    Article  CAS  PubMed  Google Scholar 

  39. Swofford DL: PAUP: Phylogenetic Analysis Using Parsimony (and other methods). Version 4.0b10. 2002, Sunderland, Massachusetts: Sinauer Associates

    Google Scholar 

  40. Rambaut A: FigTree 1.3.1. 2010, []

    Google Scholar 

  41. Dereeper A, Audic S, Claverie JM, Blanc G: BLAST-EXPLORER helps you building datasets for phylogenetic analysis. BMC Evol Biol. 2010, 10: 8-10.1186/1471-2148-10-8.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Dereeper A, Guignon V, Blanc G, Audic S, Buffet S, et al: robust phylogenetic analysis for the non-specialist. Nucleic Acids Res. 2008, 36: W465-W469. 10.1093/nar/gkn180.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000, 17: 540-552.

    Article  CAS  PubMed  Google Scholar 

  45. Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52: 696-704. 10.1080/10635150390235520.

    Article  PubMed  Google Scholar 

  46. Anisimova M, Gascuel O: Approximate likelihood ratio test for branches: A fast, accurate and powerful alternative. Syst Biol. 2006, 55: 539-552. 10.1080/10635150600755453.

    Article  PubMed  Google Scholar 

Download references


We thank the GSC at Washington University for carrying out the sequencing work. We thank H. Haussmann for keeping worm stocks; M. Riebesell, Drs. R. Rae, D. Bumbarger, A. Ogawa, K. Morgan and A. McGaughran for critically reading the manuscript.

R.J.S. was supported by the Presidential Innovation Fund of the Max Planck Society.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Ralf J Sommer.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

WEM, CD and RJS conceived and designed the experiments. WEM, GB and LNS performed the experiments. WEM and CD analyzed the data. WEM, CD and RJS wrote the paper. RJS was the principal investigator and oversaw experimental design and analysis. All authors read and approved the final manuscript.

Electronic supplementary material


Additional file 1: Multiple sequence alignments of the Pristionchus CBM49 domains obtained from the transcriptomes. (A) Nucleotide alignment. (B) Amino acid alignment. The alignments were created by Clustal × 2.0.11 [35]. (PDF 96 KB)

Additional file 2: Table PCR primers. PCR primers used to amplify P. pacificus cellulase genes. (DOC 30 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Mayer, W.E., Schuster, L.N., Bartelmes, G. et al. Horizontal gene transfer of microbial cellulases into nematode genomes is associated with functional assimilation and gene turnover. BMC Evol Biol 11, 13 (2011).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: