- Research article
- Open Access
Contrasted evolutionary histories of two Toll-like receptors (Tlr4 and Tlr7) in wild rodents (MURINAE)
BMC Evolutionary Biology volume 13, Article number: 194 (2013)
In vertebrates, it has been repeatedly demonstrated that genes encoding proteins involved in pathogen-recognition by adaptive immunity (e.g. MHC) are subject to intensive diversifying selection. On the other hand, the role and the type of selection processes shaping the evolution of innate-immunity genes are currently far less clear. In this study we analysed the natural variation and the evolutionary processes acting on two genes involved in the innate-immunity recognition of Microbe-Associated Molecular Patterns (MAMPs).
We sequenced genes encoding Toll-like receptor 4 (Tlr4) and 7 (Tlr7), two of the key bacterial- and viral-sensing receptors of innate immunity, across 23 species within the subfamily Murinae. Although we have shown that the phylogeny of both Tlr genes is largely congruent with the phylogeny of rodents based on a comparably sized non-immune sequence dataset, we also identified several potentially important discrepancies. The sequence analyses revealed that major parts of both Tlrs are evolving under strong purifying selection, likely due to functional constraints. Yet, also several signatures of positive selection have been found in both genes, with more intense signal in the bacterial-sensing Tlr4 than in the viral-sensing Tlr7. 92% and 100% of sites evolving under positive selection in Tlr4 and Tlr7, respectively, were located in the extracellular domain. Directly in the Ligand-Binding Region (LBR) of TLR4 we identified two rapidly evolving amino acid residues and one site under positive selection, all three likely involved in species-specific recognition of lipopolysaccharide of gram-negative bacteria. In contrast, all putative sites of LBRTLR7 involved in the detection of viral nucleic acids were highly conserved across rodents. Interspecific differences in the predicted 3D-structure of the LBR of both Tlrs were not related to phylogenetic history, while analyses of protein charges clearly discriminated Rattini and Murini clades.
In consequence of the constraints given by the receptor protein function purifying selection has been a dominant force in evolution of Tlrs. Nevertheless, our results show that episodic diversifying parasite-mediated selection has shaped the present species-specific variability in rodent Tlrs. The intensity of diversifying selection was higher in Tlr4 than in Tlr7, presumably due to structural properties of their ligands.
An effective immune defence is dependent on well-timed activation of an appropriate immune response. Pathogen recognition by innate immunity Pattern Recognition Receptors (PRRs) is crucial in this process [1, 2]. The PRRs detect molecular structures named Microbe-Associated Molecular Patterns (MAMPs) that are conservatively present among individual microorganism taxa, because they are essential for their survival (such as, e.g., bacterial lipopolysaccharides, muramyl dipeptide, peptidoglycan, flagellin, mannose, bacterial, fungal, parasitic and viral nucleic acids) . Recent studies have associated polymorphism in genes encoding PRRs with variability in resistance or susceptibility to several infectious diseases in humans, laboratory mice and poultry e.g. [4–8]. However, in wildlife, molecular variation in PRR genes is still poorly documented [9–14].
Understanding the evolution of the immune system in general has been a challenge for evolutionary biologists and ecologists since JBS Haldane associated natural selection with infectious diseases . In vertebrates, the study of selection patterns was mostly oriented towards genes of acquired immunity which are now intensively studied even in wild populations. Among them, genes of the major histocompatibility complex (MHC) are the most explored and the role of balancing selection in their evolution is generally accepted and well understood [16–23]. The quite late discovery of genes involved in the second branch of vertebrate immunity, i.e. innate immunity, among which the most important PRRs are Toll-like receptors (hereafter abbreviated according to the mouse gene and protein nomenclature as Tlrs and TLRs, respectively) [24–27], has resulted in modest research of their evolution in wildlife populations .
Generally, two subclasses of TLRs are distinguished in vertebrates according to the ligands they target [3, 9, 29, 30]. The first subclass includes TLR1, TLR2, TLR4, TLR5, TLR6 and TLR10. These TLRs predominantly detect bacterial components (but also fungal and to lesser extent viral components) and are expressed on the outer cell membrane. Throughout this paper we term them “bacterial-sensing” TLRs. The second subclass includes TLR3, TLR7, TLR8 and TLR9 and targets mainly viral components (e.g. ssRNA, dsRNA, DNA containing unmethylated CpG), hereafter termed “viral-sensing” TLRs. These TLRs are expressed mostly within cells into the membranes of endosomal compartments. This current spectrum of genes for TLRs arose by multiple gene duplication and during the last 700 Mya diversified to recognize distinct MAMPs [29, 31–36].
TLRs of both subclasses are transmembrane proteins composed of three domains [34, 37]. The Extra-Cellular Domain (ECD) consists of a varying number of Leucin-Rich Repeat motifs (LRRs) that form a horseshoe-shaped tertiary structure of the ECD. This domain contains the Ligand Binding Region (LBR) which is directly responsible for physical interactions with the pathogen-derived structures and as such it is likely subject to intensive selection. The ECD is followed by a short Transmembrane Domain (TM), and an Intracellular domain (ICD) containing the Toll/Interleukin-1 Receptor (TIR) domain responsible for TLR signaling . As previously shown , non-synonymous SNPs located in LBR may affect the 3D structure of the protein and its surface charge. This may have important functional consequences, influencing receptor ability to bind pathogens [14, 36, 39], and may even lead to the evolution of species-specific ligand recognition [40, 41]. Appropriate binding of MAMPs by LBR is connected with changes in receptor dimerization [42–44] that induce signaling and release of cytokines triggering mainly Th1 and Th17 inflammation, fever and phagocytosis [45–47]. The TLR signaling ensures an immediate response to invading microorganisms that, in a second step, further directs the following adaptive immune response [48, 49].
Previous studies, mostly based on investigation in humans, primates and domestic or laboratory animals, provided information regarding some general patterns of TLR evolution and maintenance of their genetic polymorphism [2, 9, 50–52]. These studies revealed that the ECD is more frequently a target of positive selection than the TIR domain. Moreover, in general the viral-sensing TLRs seem to evolve under stronger purifying selection than the bacterial-sensing ones [53–56]. However, up to now, the evidence of TLR polymorphism and the type of selection that shapes this polymorphism in natural populations remain rare [10–14]. Besides, to our knowledge the precise investigation of the LBR variability and evolution is missing. Such information could nevertheless be important to better understand species-specific differences in the susceptibility to various pathogens .
In the present study we focused on the molecular variation of the genes encoding the bacterial-sensing TLR4 (binding mainly bacterial lipopolysaccharides, LPS, as a ligand)  and the viral-sensing TLR7 (binding viral ssRNA) [59, 60] in 23 species of the subfamily Murinae. Murine rodents are largely distributed over the world and several species (such as rats and mice) live in close proximity to humans. A recent review showed that 60% of the agents of emerging diseases in humans circulate in animals  and most of the natural reservoirs of a number of serious viral and bacterial emerging agents of zoonoses are rodents [62, 63]. Species-specific molecular variability in immune-related genes may be responsible for differences in the ability of rodent species to transmit these pathogens. Herein we aimed to document evolutionary histories of these two Tlrs during murine diversification. We implemented statistical approaches to infer Tlr phylogeny and to detect selection acting on DNA and amino acid (AA) sequences. We searched for deviations from “species” phylogeny based on a comparably sized non-immune sequence dataset by contrasting phylogenetic trees reconstructed from Tlr sequences with those reconstructed from “neutral” genes (both mitochondrial and nuclear). Deviations would indicate the occurrence of non-neutral patterns during the Tlr evolutionary history, e.g. adaptive selection [9, 64, 65]. Next we estimated putative functional changes in the LBR by examining variability in predicted tertiary 3D-structures of the proteins, and in biophysical properties of proteins (charge and structural characteristics) at polymorphic binding sites. Finally, we compared the evolutionary histories of the two TLRs to reveal potentially distinct evolutionary pressures shaping these proteins.
Amplification and sequencing were successful in 96 samples representing 23 rodent species for Tlr4 and in 96 samples representing 22 species for Tlr7 (Additional file 1: Table S1). Only samples from one species - Maxomys surifer could not be completely sequenced for Tlr7 - the first 180 bp were missing and we excluded this species from the Tlr7 analyses. No stop codons, indels nor recombination were detected in these data using SBP (Datamonkey).
For the whole Tlr4 coding sequence (CDS), the three different domains were predicted by SMART as follows: ECD from AA position 1 to 635, TM from position 636 to 658 and ICD from position 659 to 835 in which the TIR domain (from position 671 to 816) and ICD distal part (ICD-DP; from 817 to 835) may be identified (Additional file 1: Figure S1). For Tlr7, the predicted location of the three domains was the following: ECD from position 1 to 850, TM from position 851 to 873 and ICD from position 874 to 1050 (TIR from 894 to 1033 and ICD-DP from 1034 to 1050; Additional file 1: Figure S1). In general, Tlr4 was more diverse than Tlr7, and within each Tlr, the ECD domain was more variable than the TIR domain in both molecules (Table 1). Surprisingly, ICD-DP located on the C-terminal end of Tlr4 represented the most variable region of exon 3 (πICD-DP-Tlr4 = 0.102±0.015).
Phylogeny and co-divergence between the tree based on a comparably sized non-immune sequence dataset and TLR trees
Both phylogenetic approaches (MrBayes and RAxML) displayed similar trees for both Tlrs (Additional file 2: Figures S2 and S3). Minor differences between ML and Bayesian trees were found only at the intraspecific level. Tlr4 topology was well-supported with posterior probabilities (pp) ≥ 0.95 despite a lack of resolution within the black rat species complex (including Rattus rattus, R. tanezumi, R. sakeratensis, R. tiomanicus, R. argentiventer, R. andamanensis), between two Bandicota species (Bandicota savilei and B. indica did not form reciprocal monophyletic clades) and between two subspecies of the house mouse (Additional file 2: Figures S2a and S3a). Sequences of Tlr7 were also predominantly clustered according to species with strong supports (pp ≥ 0.95). Relationships between Asiatic mouse species were not fully resolved (monophyly of Mus caroli, M. cooki and M. cervicolor supported with a moderate pp value of 0.86 and Bootstrap values, Bp = 81) as well as those between Leopoldamys species (L. edwardsi appeared more closely related to L. neilli, rather than to L. sabanus but with a low pp of 0.6, Bp = 48). Similarly to Tlr4, branching orders within the genus Rattus were not resolved: Rattus exulans (clade I) was retrieved monophyletic without ambiguity (pp = 1, Bp = 100), R. norvegicus and R. nitidus were grouped together with the highest support (clade II, pp = 1, Bp = 100) and the remaining Rattus species formed a moderately supported group (clade III, pp = 0.7, Bp = 98, for more details see Additional file 2: Figures S2b and S3b).
At the first glance, Tlr phylogenies (based on MrBayes approach) of the black rat complex was congruent to the tree based on a comparably sized non-immune sequence dataset (Figure 1). The number of co-divergence events inferred using JANE 4 was significantly higher than expected by chance, meaning that the two phylogenies were similar (Additional file 1: Figure S4). However, the Shimodaire-Hasegawa test showed significant disagreement between the species tree and both Tlrs phylogenies (Δln L = 257, ddl = 1, p < 0.001 for Tlr4; Δln L = 76, ddl = 0.008, p < 0.05 for Tlr7), indicating that neither of the Tlr trees coincided precisely with the tree based on a comparably sized non-immune sequence dataset. The incongruence was mainly caused by recently diverged species of Rattus. However, we revealed several other differences, such as the misplacement of the genus Bandicota (occurring within Rattus in the Tlr4 tree) and the different positions of R. sakeratensis and R. exulans in species and Tlr7 trees (Figure 1).
Evidence of signatures of selection
The comparison of ω (dN/dS) revealed substantial differences between the two Tlrs, as well as between gene parts encoding different domains (for details see Table 1). The difference between gene parts was mainly due to variations in the number of non-synonymous substitutions (which was higher in ECDs than in the TIR), while they both had similar numbers of synonymous substitutions.
The highly conservative SLAC (Single Likelihood Ancestor Counting) analysis (Datamonkey) revealed two codon positions evolving under positive selection in Tlr4 and only one in Tlr7, all of them being located within the ECD domain (p < 0.05, Table 2, Figure 2). We found 26 and 10 negatively selected sites for Tlr4 and Tlr7 respectively (p < 0.05, Table 2, Figure 2), distributed evenly over the whole sequences.
The imprint of natural selection on protein coding gene is often difficult to reveal because selection is frequently episodic (i.e. it affects only a subset of lineages) . We therefore looked for evidence of episodic diversifying selection at individual sites along the evolutionary branches of the trees using the MEME algorithm. Thirteen codon positions were found to be affected by episodic selection for Tlr4 (1.7% of all analysed codons) while only 4 codon positions showed this signature for Tlr7 (0.38% of all analysed codons). In Tlr4, 12 of these sites were located directly in LBR, while in Tlr7 none of the sites evolving under positive selection were in LBR. Whatever the Tlr gene considered, all sites found to evolve under positive selection using the SLAC were identified also by the MEME algorithm.
The signs of positive selection were scattered over whole Tlr trees, affecting nearly all branches of the Tlr4 phylogeny, both basal and terminal, while they mostly concerned the terminal branches for the Tlr7 phylogeny (Figure 3). Interestingly, one site evolving under positive selection (p < 0.05) was located in the ICD-DP of Tlr4 gene (Table 2, Figure 2a). We found that this part (i.e. the last 57 bp of C-terminal end of the protein following the TIR domain) was highly variable (19 nucleic acid alleles and 16 AA variants) with a mean ω = 1.11.
Analysis of the ligand binding regions
In general, the Ligand Binding Region (LBR) was much more variable in Tlr4 than in Tlr7 genes. We detected 50 different AA variants of the LBR in the TLR4 dataset, while only eight different AA variants were detected in TLR7. Out of the 222 AA sites of LBRTLR4, 43% were polymorphic, while among the 103 AA sites of LBRTLR7, only 10% exhibited genetic variations. The Consurf analysis performed to estimate the degree of evolutionary conservation of each amino acid position in LBR revealed 10% of phylogenetically variable positions (i.e. 22 positions assigned to grade 1 and corresponding to the most variable and rapidly evolving amino acid positions out of 222 positions in total) in TLR4, but only 2% (2 positions with grade 1 out of 103) in TLR7 (Figure 4). Other positions were assigned as conservative (57% and 79% in TLR4 and TLR7, respectively) or had insufficient support (33% and 19%, respectively; Figure 4).
Ligand-binding positions in rodents were predicted by comparison with those identified in humans by Park et al. . In TLR4, two out of eight LPS-binding amino acid positions were identical to humans and strictly conserved among rodents (F438 and F461). Three other were conserved in terms of amino acid features (i.e. polarity, hydrophobicity) but distinct from human residue and variable among rodents (R263K, K360R and K434R). Interestingly, one LPS binding site that was uniform in human was found to be evolving under positive selection using the MEME algorithm. We found hydrophobic and hydrophilic residues, although this position, L442Y, is known to be involved in hydrophobic interactions. Finally, two remaining positions were found to be highly variable in rodents (339 and 386) (Additional file 1: Table S3). In TLR7, the nine ligand binding residues predicted following Wei et al.  were strictly conserved within rodents and seven of them were common to both rodents and human TLR7 (Additional file 1: Table S4).
The pairwise RMSD that allowed estimating the differences in 3D protein structure among variants varied from 0 to 1.5Å in TLR4 variants, and from 0.6 to 1.7Å in TLR7 variants (Additional file 1: Figure S5). Yet, in the phenetic diagram of TLR4, 3D-structures of Rattus sakeratensis and Rattus nitidus were distinct from each other and also from all other species. Similarly for TLR7, the 3D-structure of the protein of Rattus exulans was separated from other species (Additional file 1: Figure S5). To provide wider context we performed additional comparison between PDB structures (obtained from The RCSB Protein Data Bank http://www.rcsb.org/pdb/home/home.do) of human (HoSaTLR4-3fxi_A) and mouse (MuMuTLR4-3vq2_A) ECDTLR4 and between ECD of mouse TLR4 and TLR3 (MuMuTLR3-3ciy_A). The comparison between species of the same TLR was 1.7Å (HoSaTLR4-MuMuTLR4). Comparison between two TLRs from most distant TLR families of the same species was 4.6Å (MuMuTLR4-MuMuTLR3). The analysis of electric charge of LBR revealed higher variation in TLR4 (from −7.7 to 1.5) when compared with TLR7 (from −1.6 to 0.6). Detailed analyses of LBRTLR4 revealed that Mus and Rattus species were well differentiated from each other (Mus: from −7.7 to −3.7; Rattus and related genera: from −3 to 1.5, Additional file 1: Figure S6a). Similar pattern was found for LBRTLR7 (Mus: -1.6, Rattus and related genera: from −1.4 to 0.6, Additional file 1: Figure S6b).
In this study we analysed the variability of two important vertebrate immune genes involved in innate immunity across wild murine rodents and we looked for evidence of selection. Overall, we found that Tlr4 was much more variable than Tlr7 and that the evolution of both genes had been influenced mostly by purifying selection. However, comparison of both Tlrs revealed contrasting evolutionary patterns. Tlr7, which is involved in the recognition of viral nucleic acids, was highly conserved across rodents and its evolution seemed to be strongly shaped by purifying selection. Predicted ligand binding sites in LBRTLR7 were identical across all species and only few sites were detected to evolve under positive selection within the whole molecule. By contrast, Tlr4, which detects several different pathogen ligands, was more variable and was affected by numerous events of episodic selection. Positively selected sites mostly occurred in LBR, probably as a result of co-evolution with pathogens. Analyses of the LBR variability in surface charge revealed a potential for interspecific differences in ligand binding capacities of both Tlrs.
Differences in TLRs evolution - phylogenetic approach
We found that both Tlrs were conserved genes as their phylogeny almost correctly recapitulated species phylogeny. In spite of this conservatism we revealed some incongruence between gene and species topologies, especially in branches represented by the shallow genealogy of the black rat complex and Bandicota spp. (Figure 1a). These species have experienced recent and rapid radiation during the Early Pleistocene about 1 Mya [69, 70]. Discrepancies between a gene genealogy and the species phylogeny in recently diverged species often results from Incomplete Lineage Sorting (ILS) of ancestral polymorphism and/or episodic gene flow and hybridization [71, 72]. Indeed, R. tanezumi R2 and R. tanezumi R3 were recently proposed as conspecifics or were suspected to hybridize in Southeast Asia . In addition, hybridization with introgression occurred between the invasive populations of R. tanezumi and R. rattus in the United States . These phenomena could explain incongruence between Tlrs and species trees. However, directional selection could also be involved. Discrepancies in Tlr7 phylogeny represented by R. exulans and R. sakeratensis seem more likely to be caused by pathogen selective pressure (Figure 1b). ILS and hybridization are unlikely to result in such deeper changes, whereas the influence of directional selection (positive or negative) on non-neutrally evolving genes could be at more likely explanation . The rejection of co-divergence (concerning basal nodes) between Tlrs and species phylogenies could reflect the occurrence of pathogen-driven selection on Tlrs during the evolutionary history of the murine rodents [32, 76]. The former hypothesis should now be tested by a detailed analysis of spectrum of pathogens from rodents to determine if the species producing the incongruent topology displayed specific pathogens that could mediate this selection.
Tlr variability and signatures of selection
We found that 92% and 100% sites (respectively for Tlr4 and Tlr7) evolving under positive selection were located in the ECD, which is responsible for pathogen recognition. For Tlr4 92% of these positively selected sites found by MEME algorithm were located in the LBR. This is in concordance with several recent studies conducted on primates, birds and rodents, that have suggested a high accumulation of positively selected sites at LBR [9–11, 77, 78]. Surprisingly, none of the sites evolving under positive selection was identified directly in the LBR of Tlr7.
The TIR domain of both Tlrs was evolving under much stronger functional constraint than the ECD in both genes. We found only 11 amino acid variants of TIRTLR4 in 23 species and six different variants of TIRTLR7 in 22 species. Altogether our results support the observation that Tlr exodomains evolve more rapidly than the intracellular TIR domain [9, 56, 77, 78]. The requirement of sites within ECD, which would be involved in ligand recognition and able to recognize permanently fast-evolving pathogens, could explain this pattern. Besides, the high conservation of the TIR domain could be adapted to maintain a functional response of signal transduction see, e.g. [9, 33, 50, 56, 58, 79].
Both genes showed non-significant differences between ECD and TIR with respect to dS, supporting the hypothesis that there was no difference in mutation rate between ECD and TIR. The same result has been found in comparative studies of 10 vertebrate TLRs . The distal part of ICD in Tlr4 was surprisingly highly variable among rodent species. The reason for such a high level of variability is still unknown; however some authors suggest that this region at the carboxy-terminal end of Tlr4 could be responsible for interspecific differences in LPS sensitivity .
Positive selection we also detected using the MEME approach that individually considers each codon along the Tlrs phylogeny . We found that episodic positive selection affected most lineages in the phylogenetic tree of Tlr4, while the situation was quite different in Tlr7, where the sites evolving under positive selection were mostly distributed only along the terminal branches. Episodic diversifying selection could have affected Tlr4 throughout its evolution and this process could still be in operating, while in Tlr7 diversifying selection seemed to have appeared more recently and the gene history was mostly maintained by the stronger purifying selection (Figure 3).
Analysis of the Ligand binding region
In TLR4 variants we found 22 rapidly evolving positions distributed all over the LBR. While TLR4 is able to detect several ligands, the most studied one is LPS of Gram negative bacteria. TLR4 does not interact with LPS alone directly but forms stable heterodimers with MD-2 . Analysis of the crystallographic structure of mouse TLR4-MD-2-ligand complex has shown that the interactions between TLR4, -LPS and MD‒2 take place on the concave surface of TLR4 . We predicted that sites involved in the TLR4-MD-2 interaction should be highly conserved to maintain the receptor function in LPS binding and these sites were thus not identified in the present study. Among the eight known LPS-binding sites, identified by Park et al.  in humans, two residues (F438 and F461) were conserved between humans and rodents as well as among rodents. These key residues are jointly involved also in hydrophobic interactions between TLR4 and MD-2 [39, 81]. It is possible that negative selection might maintain an invariable combination at these sites to preserve MD-2 binding, which supports our hypothesis mentioned above. One exception was the controversial site L442Y which was suggested by Park et al.  to be also involved in hydrophobic interactions between TLR4 and MD-2, but Resman et al.  challenged the importance of its function. Among the studied rodents this codon was found to be polymorphic and has been shown to be affected by episodic positive selection during rodent evolution. A hydrophobic nonpolar residue (Leucine, L) was commonly shared between rodent species except for Maxomys surifer that harbored a hydrophobic and polar Tyrosine (Y). For three LPS-binding sites, R263K, K360R and K434R, the biochemical features of the residue were maintained between rodents (all were positively charged residues) but distinct amino acids were detected. The important role of these residues was supported also by Ohto et al.  and the potential functional importance of substitution R263K was beside confirmed by conservation analysis. Finally, we have identified in TLR4 two ligand binding positions, 339 and 386, with important amino acid substitutions that might be responsible for variability in LPS binding. No signature of positive selection was detected for these sites; however functional importance of position 386 was supported by the Consurf analysis. Intriguingly, both residues form charge interactions with the same lipid A phosphate of the LPS, which might indicate that the evolution of this position is associated with phosphate binding. However, this interpretation must be taken cautiously since Resman et al.  have questioned the role of the site 386 (in human K388) in LPS binding.
LBRTLR7 sequence was much shorter than LBRTLR4 one (103 vs. 222 codons, respectively), which could be explained by the smaller size of LBRTLR7 ligand, the viral ssRNA . LBRTLR7 was highly conserved at the interspecific level. Only two rapidly evolving positions (out of 103 analysed sites) were detected and neither of them corresponded to the predicted ligand binding residues . Generally the conserved sites (sites evolving under negative selection), have important evolutionary roles for example in protein-protein interactions (TIR domain) or in the preservation of protein structure (e.g. LRR forming horseshoe structure).
We found that structural variation between rodent LBR of both TLRs (TLR4 - 1.5Å and TLR7 - 1.7Å) was comparable with the variation observed between ECDTLR4 of human and mouse (1.7Å). The 3D-protein structure modeling revealed that LBRTLR4 differed between Rattus sakeratensis, R. nitidus and all other rodent species. The analysis of LBRTLR4 sequences did not reveal any specific or unique substitution that could be responsible for this clustering. The same analysis performed on LBRTLR7 revealed that Rattus exulans substantially differed from other species. This difference could be explained by substitutions found at position H516Y, one being specific of R. exulans (Y at position 516) while other Rattus and Mus species harbored an H amino acid at this position. These inter-specific differences in LBR 3D structure were not related to the phylogenetic distance between species. They could be better explained by similar pathogen exposition and thus similar pathogen-mediated selection.
The results of charge analyses might be more important as they revealed interspecific variation in LBRs of both receptors. Mus species had generally a more negative overall charge at LBR than Rattus species (Additional file 1: Figure S6). Differences in protein charges were previously shown to be associated with differences in protein-ligand interactions [41, 65]. Likewise, differences between these two groups were also found in LBRTLR4 at positions that directly bind to LPS. However, some caution is needed, since variation of TLR4 and TLR7 in sensitivity to LPS or ssRNA, respectively, between rats and mice has not been investigated.
Differences in evolution of bacterial-sensing and viral-sensing Tlrs
Our results showed that the bacterial-sensing Tlr4 was more variable than the viral-sensing Tlr7, and that Tlr4 evolution was more intensively shaped by positive selection than in Tlr7. Tlr4 had 1.7% of codons under positive selection, while in Tlr7 it was only 0.38%. These differences are likely to be explained by Tlrs’ specificity to different groups of MAMPs with which they co-evolved . Tlr4 detects more types of ligands (e.g. bacterial LPS, envelope viral components, fungal cell wall components – Mannan)  and it seems that these pathogen structures have exerted more diversifying selective pressures on Tlr4 than the viral ssRNA affecting Tlr7. Recent studies of parasites show that there is an important structural variability in MAMPs between bacterial species (e.g. flagellin and LPS) [44, 81, 83–87]. We propose that the ligand binding region of Tlr4 detecting these MAMPs should reflect higher ligand variability observed in our data.
Reduced genetic variability in important genes generally results from strong purifying selection acting against deleterious mutations in these genes . It can result in a smaller effective population size and a lower amount of incomplete lineage sorting [72, 89]. These two phenomena were found to be more pronounced when analysing Tlr7 phylogeny. Moreover the Tlr7 gene is located on the X chromosome in mammals, which can be advantageous during evolution (e.g. lower polymorphism is maintained by quicker fixation of beneficial mutations and elimination of deleterious ones by stronger selection and more intense genetic drift) . We suggest that the tension between diversifying and purifying selection, caused by adaptation to the variability of viral motifs detected by viral-sensing Tlr7 and maintenance of function together played an important role in the distribution of Tlr7 polymorphisms.
This study brings a unique insight into the natural variability and molecular history of two Toll-like receptors in free-living populations of 23 murine species. Purifying selection seems to be the dominant evolutionary force shaping Tlr4 and Tlr7 polymorphism. However, specific sites putatively evolving under diversifying selection were detected in both Tlrs. These sites accumulated within Tlr4 LBR, and detailed analyses revealed that several important amino-acid substitutions might alter LPS binding. These substitutions were often species-specific and differentiated between the Rattini and Murini tribes. Interspecific charge variability of LBR and to lesser extent the variability in 3D structure indicated the potential differences in protein-ligand interaction. By contrast, the evolution of Tlr7 was strongly shaped by purifying selection. All predicted ligand binding residues in this receptor were uniform across all studied mammals to date. The contrasting evolutionary histories of these two Tlrs are likely to result from different structural variability of ligands they target. Since the crystallography of certain ligands (e.g. biglycans, hyaluronans and heparin sulphates, ssRNA) [44, 68] remains unknown and the precise positions of corresponding binding sites are still missing, our data provide important avenues towards understanding which codons might be candidates for ligand binding residues.
Murine rodents from 23 species belonging to the Rattini and Murini (sensu Lecompte et al. ) tribes were sampled mainly in South-East Asia, and three synanthropic species (i.e. Rattus rattus, Mus m. muscululus and Mus m. domesticus) were also sampled in Europe and Africa. In our sampling area, Rattus tanezumi specimens corresponded to two divergent mitochondrial lineages although they could not be distinguished according to their nuclear pool . These samples were further referred to clades R. tanezumi R2 and R3 according to their mitotype. Rattus sakeratensis corresponds to the lineage previously referred to as R. losea and found in central, northern Thailand and Vientiane Plain of Lao PDR (Rattus losea-like by Pagès et al. ). This lineage was recently distinguished from the true R. losea, which is restricted to Cambodia, Vietnam, China and Taiwan .
Species identification was initially based on morphological criteria and thereafter confirmed using molecular barcoding for problematic lineages [69, 92]. We sequenced two to 10 individuals per species. In total 103 specimens were analysed (Additional file 1: Table S1).
Toll-like receptor sequencing and sequence alignments
We sequenced the complete exon 3 of Tlr4 (2.250 bp) and Tlr7 (3.150 bp) as it encompasses the LBR in both genes. Exon 3 corresponds to 89.7% and 99.0% of the total coding sequence for Tlr4 and Tlr7, respectively. Short exons 1 and 2 (241 bp encoding 5´- untranslated (UT) region and first 257 bp of ECD in Tlr4exon2 and 154 bp of 5´-UT regions and 3bp of ECD in Tlr7exon2) were not analysed in present study, because we were preferentially interested by functional regions (e.g. LBR and TIR). For all analyses and discussion the codon numbering follows the sequences of Rattus norvegicus available in GenBank [GenBank Acc. NP_062051.1 for Tlr4, and NP_001091051.1, for Tlr7].
Primers for Polymerase Chain Reaction (PCR) and sequencing were designed according to the sequences available in the Ensembl database for Mus musculus [Tlr4 ENSMUSE00000354724/MGI:96824, Tlr7 ENSMUSE00000405820/ MGI:2176882] and Rattus norvegicus [Tlr4 ENSRNOE00000099045/NP_062051, Tlr7 ENSRNOE00000039897/NP_001091051]. We used the software Primer3 to design primers (see their sequences in Additional file 1: Table S2 and positions in Additional file 1: Figure S1). Total DNA was extracted from rodent tissue (biopsy from ear or necropsy from liver) using the DNeasy Blood & Tissue Kit (Qiagen AB, Hilden, Germany). Amplifications were carried out in a final volume of 25 μl containing 12.5 μl of Multiplex Kit PCR master mix (Qiagen), 9.3 μl of H2O, 0.5 μM of each of primer pairs and 2 μl of DNA. Cycling conditions included an initial denaturation at 95°C for 15 min, followed by 10 cycles of denaturation at 95°C for 40 s, annealing with touchdown at 65°C to 55°C (-1°C/cycle) for 45 s and extension at 72°C for 90 s, followed by 30 cycles of denaturation at 95°C for 40 s, annealing at 55°C for 45 s and extension at 72°C for 90 s, with a final extension phase at 72°C for 10 min. The final extension was performed for 10 min at 72°C. The lengths of amplicons were checked on 1.5% agarose gels. Sequencing was carried out using an ABI3130 automated DNA sequencer (Applied Biosystems). DNA sequences were aligned and edited using SeqScape v.2.5 (Applied Biosystems) and BioEdit v.7.1.3 (Hall 1999). All sequences have been submitted to NCBI GenBank (Accession numbers are presented in Additional file 1: Table S1).
Diploid genotypes were resolved using the Bayesian PHASE platform  implemented in DnaSP ver. 5.10 . Calculations were carried out using 1000 iterations, 10 thinning intervals, and 1000 burn-in iterations. Sequences were collapsed into individual alleles by Fabox DNA collapser, an online FASTA sequence toolbox . The identification and visualization of main domains (ECD, TM and ICD with TIR domain and ICD-DP) was performed in SMART  based on Rattus norvegicus sequences provided in GenBank [NP_062051.1 for Tlr4 and NP_001091051.1 for Tlr7]. 3D structure was predicted in Phyre2  and then visualized using FirstGlance in Jmol v.1.9. Finally, we estimated nucleotide diversity (π), number of polymorphic sites (S) and total number of mutations (ϵ) with DnaSP, and the number of nucleotide alleles (hN) and amino acid variants (hA) using Fabox DNA collapser.
Phylogenetic reconstructions and congruence between the tree based on a comparably sized non-immune sequence dataset and Tlr trees
We first tested Tlr sequences for recombination using SBP, to avoid further false positive events of selection. This method (implemented in Datamonkey,[66, 99]) allowed the screening of Tlr sequences for recombination breakpoints. SBP identify non-recombinant regions and allowed each region to have its own phylogenetic reconstruction [100, 101].
Phylogenies were reconstructed independently for each gene using the alignment of complete exon 3 sequences. A phylogeny inferred from the combination of one nuclear (the first exon of the gene encoding the interphotoreceptor retinoid binding protein, Irbp) and two mitochondrial genes (the cytochrome b gene, Cytb, and the cytochrome c oxidase I gene, CoI), taken from Pagès et al. , was used for comparison of “neutral” evolution of the studied rodents with trees obtained from the immune gene alignments. Both Maximum likelihood (ML) and Bayesian (BA) methods were applied to infer phylogenetic relationships from each Tlr alignments. The best evolutionary model of nucleotide substitution was determined using jModelTest 0.1.1 . Phylogenies based on ML analyses were reconstructed using RAxML 7.2.6 . Analyses were run as the rapid bootstrap procedure (option –f a) with bootstraps defined by option –NautoMR. For both Tlrs we used nucleotide substitution model GTR + Γ (option –m GTRGAMMA) selected by jModelTest 0.1.1 as the most appropriate to our data. Bayesian analyses were performed using a parallel version of MrBayes v3.1  at the University of Oslo Bioportal  and CBGP HPC computational platform located at Centre de Biologie et Gestion des Populations, Montpellier. Two runs of 50,000,000 generations in each were adopted, applying the best fitted model of substitution (GTR+ Γ). A burn-in period of 10,000,000 generations was determined using Tracer 1.4 . Convergence was also evaluated using Tracer v1.4. After discarding samples from the burnin period, results were based on the pooled samples from the stationary phases of the two independent runs. Trees were edited using FigTree v1.3.1. .
We tested the congruence between the rodent phylogeny and the Tlrs phylogeny based on the MrBayes approach using reconciliation analyses. Reconciliation analyses explore all possible mappings of one tree onto another, assigning different costs to evolutionary events and find optimal (i.e. yielding minimal costs) solutions. These analyses were conducted using JANE 4 . This software was initially built to reconcile parasite and host trees, yet it can also be used for comparative analysis of species and gene trees. In the context of host-parasite relationships, five evolutionary events between parasites and host can be taken into account in JANE 4: co-speciation, host switches, duplication, failure to diverge and parasite loss. These events are analogous to co-divergence, convergence, duplication, purifying selection and gene loss (respectively) when considered in the context of species and gene tree reconciliation. For each of these events the specific costs can be set. The lowest cost is attributed to the event considered as most likely. In order to obtain reconciliations that maximize the number of co-divergences we set the cost of a co-divergence event to 0 while other costs were set to 1 (see Cruaud et al.  for similar approach). The cost of the best solution is then compared with costs found in reconciliations in which tip mappings are permuted at random. This generates a null distribution of the costs of reconciliation. If the cost of the best solution is lower than that expected by the chance it means that the two phylogenies are significantly congruent. The following parameters were used: the number of generations (iterations of the algorithm) was set to 100 and the “population” (number of samples per generation) was set to 100. Input phylogenies were those obtained by the Bayesian inference. The cost of the best solution was compared to distribution of the costs of 1000 randomizations.
Moreover, we tested the congruence between genes and tree based on a comparably sized non-immune sequence dataset using SH test  as implemented in PAUP. Alternative topologies required for ML SH test were reconstructed by ML approach in the software Garli v. 2.0 . Two different ML trees were estimated for each Tlr; a first one inferred under non-constrained conditions with default options and a second one constrained by the tree topology based on a comparably sized non-immune sequence dataset. Mouse species (genus Mus) were excluded from the analysis of co-divergence in order to match data with the study of Pagès et al.  where the mice are missing.
Search for signatures of selection on Tlr sequences
We estimated separately the number of synonymous (dS) and non-synonymous (dN) substitutions per site for the whole exon 3, ECD, LBR and the TIR domains, and for both Tlrs. Computations were made with 1000 bootstraps and Nei-Gojobori method (with Jukes-Cantor correction) in MEGA 5 . We then estimated the overall ratio dN/dS for each domain and for the whole exon 3 of both Tlrs by Single Likelihood Ancestor Counting (SLAC) implemented in Datamonkey. The p-value was 0.05. As the SLAC method tends to be a very conservative test, the actual rate of false positives (i.e. neutrally evolving sites incorrectly classified as selected) can be much lower than the significance level . In the next step we estimated selection at each codon by SLAC to find which codons of the exons 3 have been subject to positive and negative selection. As a default tree we used a NJ tree and appropriate substitution model proposed by automatic model selection tool in Datamonkey.
Finally, we used the Mixed Effects Model of Evolution (MEME) algorithm in the Hyphy package accessed on the website of Datamonkey interface  to detect codons evolved under positive selection along the branches of the phylogenies. This method is recently recommended as a replacement for the Fixed Effects Likelihood (FEL) and SLAC models . It allows the detection of signatures of episodic selection, even when the majority of lineages are subject to purifying selection. This test permits ω to vary from site to site and also from branch to branch in phylogeny . Tests of episodic diversifying selection were performed at significance level p < 0.05 and MrBayes trees were used as working topologies. Only events of positive selection with Empirical Bayes Factor (EBF) estimated by MEME near to 100 were mapped on to the phylogeny.
Functional analysis of ligand binding region
Positions of LBR in both TLRs have been previously described in humans [39, 68]. The corresponding LBR position in rodents was predicted based on the human-rodent alignment. The LBR was located between codons AA248 and AA469 in TLR4 and between codons AA495 and AA597 in TLR7.
We first explored the evolutionary conservation of each amino acid position in LBR using the Consurf algorithm . Consurf estimates the evolutionary rate of amino acid positions in a protein molecule, based on the phylogenetic relationships between homologous sequences. Conservation scale is defined from the most variable amino acid positions (grade 1, color represented by turquoise) which are considered as rapidly evolving to conservative positions (grade 9, color represented by maroon) which are considered as slowly evolving. We used the proposed substitution matrix and computation was based on the empirical Bayesian paradigm. MrBayes trees were used as the working topology. Protein tertiary structure was adopted from R. norvegicus [Gene Bank Acc. TLR4/KC811688 and TLR7/KC811786].
Because protein tertiary structure is essential for its biological function we finally explored the variability in the 3D structures of LBRs in the different AA variants. The prediction of 3D structures of the variants was performed by homology modeling using Phyre2 . Differences in 3D protein structure among variants were then evaluated using the root mean square deviations (RMSD) calculated by the DALI pairwise comparison tool . The RMSD-based distance matrices were analysed in STATISTICA v. 8.0 (StatSoft, Inc., Tulsa) by joining tree clustering using Unweighted Pair Group Method with Arithmetic Mean (UPGMA, ). We then analysed the variability of the charge of each LBR variant, which could be another key indicator of functional changes, because differences in protein charge could influence the ability to bind ligands [41, 65]. LBR charge of each variant was estimated at predefined neutral pH = 7 using LRRfinder.
Availability of supporting data section
All sequences have been submitted to NCBI GenBank under Accession numbers from KC811609 to KC811800 (Individual accession numbers are presented in Additional file 1: Table S1). Tlr phylogenies based on MrBayes (Tlr4_MrBayes_final.nex, Tlr7_MrBayes_final.nex) and RAxML (Tlr4_RAxML_final.nex, Tlr7_RAxML_final.nex) approach were added to the TreeBase database (http://treebase.org/treebase-web/home.html). Trees are available at URL: http://purl.org/phylo/treebase/phylows/study/TB2:S14659.
Zak DE, Aderem A: Systems biology of innate immunity. Immunol Rev. 2009, 227: 264-282. 10.1111/j.1600-065X.2008.00721.x.
Barreiro LB, Quintana-Murci L: From evolutionary genetics to human immunology: how selection shapes host defence genes. Nat Rev Genet. 2010, 11: 17-30. 10.1038/nrg2698.
Akira S, Uematsu S, Takeuchi O: Pathogen recognition and innate immunity. Cell. 2006, 124: 783-801. 10.1016/j.cell.2006.02.015.
Schröder NWJ, Schumann RR: Single nucleotide polymorphisms of Toll-like receptors and susceptibility to infectious disease. Lancet Infect Dis. 2005, 5: 156-164.
Pandey S, Agrawal DK: Immunobiology of Toll-like receptors: emerging trends. Immunol Cell Biol. 2006, 84: 333-341. 10.1111/j.1440-1711.2006.01444.x.
Bochud P-Y, Bochud M, Telenti A, Calandra T: Innate immunogenetics: a tool for exploring new frontiers of host defence. Lancet Infect Dis. 2007, 7: 531-542. 10.1016/S1473-3099(07)70185-8.
Loo Y-M, Gale M: Immune signaling by RIG-I-like receptors. Immunity. 2011, 34: 680-692. 10.1016/j.immuni.2011.05.003.
Netea MG, Wijmenga C, O’Neill LAJ: Genetic variation in Toll-like receptors and disease susceptibility. Nat Immunol. 2012, 13: 535-542. 10.1038/ni.2284.
Wlasiuk G, Nachman MW: Adaptation and constraint at Toll-like receptors in primates. Mol Biol Evol. 2010, 27: 2172-2186. 10.1093/molbev/msq104.
Alcaide M, Edwards SV: Molecular evolution of the Toll-like receptor multigene family in birds. Mol Biol Evol. 2011, 28: 1703-1715. 10.1093/molbev/msq351.
Tschirren B, Råberg L, Westerdahl H: Signatures of selection acting on the innate immunity gene Toll-like receptor 2 (TLR2) during the evolutionary history of rodents. J Evol Biol. 2011, 24: 1232-1240. 10.1111/j.1420-9101.2011.02254.x.
Grueber CE, Wallis GP, King TM, Jamieson IG: Variation at innate immunity Toll-like receptor genes in a bottlenecked population of a New Zealand robin. PLoS ONE. 2012, 7: e45011-10.1371/journal.pone.0045011.
Tschirren B, Andersson M, Scherman K, Westerdahl H, Råberg L: Contrasting patterns of diversity and population differentiation at the innate immunity gene Toll-like receptor 2 (TLR2) in two sympatric rodent species. Evolution. 2012, 66: 720-731. 10.1111/j.1558-5646.2011.01473.x.
Tschirren B, Andersson M, Scherman K, Westerdahl H, Mittl PRE, Råberg L: Polymorphisms at the innate immune receptor TLR2 are associated with Borrelia infection in a wild rodent population. Proc Biol Sci. 2013, 280: 20130364-10.1098/rspb.2013.0364.
Haldane JBS: Malaria: disease and evolution. Genetic and Evolutionary Aspects. 2006, Boston: Kluwer Academic Publishers, 175-187.
Apanius V, Penn D, Slev PR, Ruff LR, Potts WK: The nature of selection on the major histocompatibility complex. Crit Rev Immunol. 1997, 17: 179-224. 10.1615/CritRevImmunol.v17.i2.40.
Bernatchez L, Landry C: MHC studies in nonmodel vertebrates: what have we learned about natural selection in 15 years?. J Evol Biol. 2003, 16: 363-377. 10.1046/j.1420-9101.2003.00531.x.
Aguilar A, Roemer G, Debenham S, Binns M, Garcelon D, Wayne RK: High MHC diversity maintained by balancing selection in an otherwise genetically monomorphic mammal. Proc Natl Acad Sci USA. 2004, 101: 3490-3494. 10.1073/pnas.0306582101.
Bryja J, Galan M, Charbonnel N, Cosson JF: Duplication, balancing selection and trans-species evolution explain the high levels of polymorphism of the DQA MHC class II gene in voles (Arvicolinae). Immunogenetics. 2006, 58: 191-202. 10.1007/s00251-006-0085-6.
Piertney SB, Oliver MK: The evolutionary ecology of the major histocompatibility complex. Heredity (Edinb). 2006, 96: 7-21.
Spurgin LG, Richardson DS: How pathogens drive genetic diversity: MHC, mechanisms and misunderstandings. Proc Biol Sci. 2010, 277: 979-988. 10.1098/rspb.2009.2084.
Cížková D, Gouy de Bellocq J, Baird SJE, Piálek J, Bryja J: Genetic structure and contrasting selection pattern at two major histocompatibility complex genes in wild house mouse populations. Heredity (Edinb). 2011, 106: 727-740. 10.1038/hdy.2010.112.
Smith C, Ondračková M, Spence R, Adams S, Betts DS, Mallon E: Pathogen-mediated selection for MHC variability in wild zebrafish. Evol Ecol Res. 2011, 67: 217-218.
Medzhitov R, Preston-Hurlburt P, Janeway CA: A human homologue of the Drosophila Toll protein signals activation of adaptive immunity. Nature. 1997, 388: 394-397. 10.1038/41131.
Hedrick SM: The acquired immune system: a vantage from beneath. Immunity. 2004, 21: 607-615. 10.1016/j.immuni.2004.08.020.
O’Neill LAJ: TLRs: Professor Mechnikov, sit on your hat. Trends Immunol. 2004, 25: 687-693. 10.1016/j.it.2004.10.005.
Bassett EH, Rich T: Introduction. Toll and Toll-Like Receptors: An Immunologic Perspective. 2005, Boston, MA: Springer US, 1-17.
Acevedo-Whitehouse K, Cunningham AA: Is MHC enough for understanding wildlife immunogenetics?. Trends Ecol Evol (Amst). 2006, 21: 433-438. 10.1016/j.tree.2006.05.010.
Barreiro LB, Ben-Ali M, Quach H, Laval G, Patin E, Pickrell JK, Bouchier C, Tichit M, Neyrolles O, Gicquel B, Kidd JR, Kidd KK, Alcaïs A, Ragimbeau J, Pellegrini S, Abel L, Casanova J-L, Quintana-Murci L: Evolutionary dynamics of human Toll-like receptors and their different contributions to host defense. PLoS Genet. 2009, 5: e1000562-10.1371/journal.pgen.1000562.
Vinkler M, Albrecht T: The question waiting to be asked: innate immunity receptors in the perspective of zoological research. Folia Zool. 2009, 58: 15-28.
Janssens S, Beyaert R: Role of Toll-like receptors in pathogen recognition. Clin Microbiol Rev. 2003, 16: 637-646. 10.1128/CMR.16.4.637-646.2003.
Roach JC, Glusman G, Rowen L, Kaur A, Purcell MK, Smith KD, Hood LE, Aderem A: The evolution of vertebrate Toll-like receptors. Proc Natl Acad Sci USA. 2005, 102: 9577-9582. 10.1073/pnas.0502272102.
Hughes AL, Piontkivska H: Functional diversification of the toll-like receptor gene family. Immunogenetics. 2008, 60: 249-256. 10.1007/s00251-008-0283-5.
Leulier F, Lemaitre B: Toll-like receptors–taking an evolutionary approach. Nat Rev Genet. 2008, 9: 165-178.
Temperley ND, Berlin S, Paton IR, Griffin DK, Burt DW: Evolution of the chicken Toll-like receptor gene family: a story of gene gain and gene loss. BMC Genomics. 2008, 9: 62-10.1186/1471-2164-9-62.
Huang Y, Temperley ND, Ren L, Smith J, Li N, Burt DW: Molecular evolution of the vertebrate TLR1 gene family–a complex history of gene duplication, gene conversion, positive selection and co-evolution. BMC Evol Biol. 2011, 11: 149-10.1186/1471-2148-11-149.
Werling D, Jann OC, Offord V, Glass EJ, Coffey TJ: Variation matters: TLR structure and species-specific pathogen recognition. Trends Immunol. 2009, 30: 124-130. 10.1016/j.it.2008.12.001.
Burke DF, Worth CL, Priego E-M, Cheng T, Smink LJ, Todd JA, Blundell TL: Genome bioinformatic analysis of nonsynonymous SNPs. BMC Bioinforma. 2007, 8: 301-10.1186/1471-2105-8-301.
Park BS, Song DH, Kim HM, Choi B-S, Lee H, Lee J-O: The structural basis of lipopolysaccharide recognition by the TLR4-MD-2 complex. Nature. 2009, 458: 1191-1195. 10.1038/nature07830.
Keestra AM, van Putten JPM: Unique properties of the chicken TLR4/MD-2 complex: selective lipopolysaccharide activation of the MyD88-dependent pathway. J Immunol. 2008, 181: 4354-4362.
Walsh C, Gangloff M, Monie T, Smyth T, Wei B, McKinley TJ, Maskell D, Gay N, Bryant C: Elucidation of the MD-2/TLR4 interface required for signaling by lipid IVa. J Immunol. 2008, 181: 1245-1254.
Zhu J, Brownlie R, Liu Q, Babiuk LA, Potter A, Mutwiri GK: Characterization of bovine Toll-like receptor 8: ligand specificity, signaling essential sites and dimerization. Mol Immunol. 2009, 46: 978-990. 10.1016/j.molimm.2008.09.024.
Botos I, Segal DM, Davies DR: The structural biology of Toll-like receptors. Structure. 2011, 19: 447-459. 10.1016/j.str.2011.02.004.
Kang JY, Lee J-O: Structural biology of the Toll-like receptor family. Annu Rev Biochem. 2011, 80: 917-941. 10.1146/annurev-biochem-052909-141507.
Pasare C, Medzhitov R: Toll pathway-dependent blockade of CD4+CD25+ T cell-mediated suppression by dendritic cells. Science. 2003, 299: 1033-1036. 10.1126/science.1078231.
Parker LC, Prince LR, Sabroe I: Translational mini-review series on Toll-like receptors: networks regulated by Toll-like receptors mediate innate and adaptive immunity. Clin Exp Immunol. 2007, 147: 199-207. 10.1111/j.1365-2249.2006.03203.x.
Kawai T, Akira S: The role of pattern-recognition receptors in innate immunity: update on Toll-like receptors. Nat Immunol. 2010, 11: 373-384. 10.1038/ni.1863.
Pasare C, Medzhitov R: Toll-like receptors and acquired immunity. Semin Immunol. 2004, 16: 23-26. 10.1016/j.smim.2003.10.006.
Netea MG, Ferwerda G, de Jong DJ, Jansen T, Jacobs L, Kramer M, Naber THJ, Drenth JPH, Girardin SE, Kullberg BJ, Adema GJ, Van der Meer JWM: Nucleotide-binding oligomerization domain-2 modulates specific TLR pathways for the induction of cytokine release. J Immunol. 2005, 174: 6518-6523.
Smirnova I, Poltorak A, Chan EK, McBride C, Beutler B: Phylogenetic variation and polymorphism at the Toll-like receptor 4 locus (TLR4). Genome Biol. 2000, 1: 002.1-002.10.
Ferwerda B, McCall MBB, Alonso S, Giamarellos-Bourboulis EJ, Mouktaroudi M, Izagirre N, Syafruddin D, Kibiki G, Cristea T, Hijmans A, Hamann L, Israel S, ElGhazali G, Troye-Blomberg M, Kumpf O, Maiga B, Dolo A, Doumbo O, Hermsen CC, Stalenhoef AFH, van Crevel R, Brunner HG, Oh D-Y, Schumann RR, de la Rúa C, Sauerwein R, Kullberg B-J, van der Ven AJAM, van der Meer JWM, Netea MG: TLR4 polymorphisms, infectious diseases, and evolutionary pressure during migration of modern humans. Proc Natl Acad Sci USA. 2007, 104: 16645-16650. 10.1073/pnas.0704828104.
Vinkler M, Bryjová A, Albrecht T, Bryja J: Identification of the first Toll-like receptor gene in passerine birds: TLR4 orthologue in zebra finch (Taeniopygia guttata). Tissue Antigens. 2009, 74: 32-41. 10.1111/j.1399-0039.2009.01273.x.
Krieg AM, Vollmer J: Toll-like receptors 7, 8, and 9: linking innate immunity to autoimmunity. Immunol Rev. 2007, 220: 251-269. 10.1111/j.1600-065X.2007.00572.x.
Barrat FJ, Coffman RL: Development of TLR inhibitors for the treatment of autoimmune diseases. Immunol Rev. 2008, 223: 271-283. 10.1111/j.1600-065X.2008.00630.x.
Waldner H: The role of innate immune responses in autoimmune disease development. Autoimmun Rev. 2009, 8: 400-404. 10.1016/j.autrev.2008.12.019.
Mikami T, Miyashita H, Takatsuka S, Kuroki Y, Matsushima N: Molecular evolution of vertebrate Toll-like receptors: evolutionary rate difference between their leucine-rich repeats and their TIR domains. Gene. 2012, 503: 235-243. 10.1016/j.gene.2012.04.007.
Worobey M, Bjork A, Wertheim JO: Point, counterpoint: the evolution of pathogenic viruses and their human hosts. Annu Rev Ecol Evol Syst. 2007, 38: 515-540. 10.1146/annurev.ecolsys.38.091206.095722.
Poltorak A, He X, Smirnova I, Liu MY, Van Huffel C, Du X, Birdwell D, Alejos E, Silva M, Galanos C, Freudenberg M, Ricciardi-Castagnoli P, Layton B, Beutler B: Defective LPS signaling in C3H/HeJ and C57BL/10ScCr mice: mutations in Tlr4 gene. Science. 1998, 282: 2085-2088.
Diebold SS, Kaisho T, Hemmi H, Akira S: Reis e Sousa C: Innate antiviral responses by means of TLR7-mediated recognition of single-stranded RNA. Science. 2004, 303: 1529-1531. 10.1126/science.1093616.
Heil F, Hemmi H, Hochrein H, Ampenberger F, Kirschning C, Akira S, Lipford G, Wagner H, Bauer S: Species-specific recognition of single-stranded RNA via Toll-like receptor 7 and 8. Science. 2004, 303: 1526-1529. 10.1126/science.1093620.
Jones KE, Patel NG, Levy MA, Storeygard A, Balk D, Gittleman JL, Daszak P: Global trends in emerging infectious diseases. Nature. 2008, 451: 990-993. 10.1038/nature06536.
Mills JN: Biodiversity loss and emerging infectious disease: an example from the rodent-borne hemorrhagic fevers. Biodiversity. 2006, 7: 9-17. 10.1080/14888386.2006.9712789.
Luis AD, Hayman DTS, O’Shea TJ, Cryan PM, Gilbert AT, Pulliam JRC, Mills JN, Timonin ME, Willis CKR, Cunningham AA, Fooks AR, Rupprecht CE, Wood JLN, Webb CT: A comparison of bats and rodents as reservoirs of zoonotic viruses: are bats special?. Proc Biol Sci. 2013, 280: 20122753-10.1098/rspb.2012.2753.
Fornarino S, Laval G, Barreiro LB, Manry J, Vasseur E, Quintana-Murci L: Evolution of the TIR domain-containing adaptors in humans: swinging between constraint and adaptation. Mol Biol Evol. 2011, 28: 3087-3097. 10.1093/molbev/msr137.
Govindaraj RG, Manavalan B, Basith S, Choi S: Comparative analysis of species-specific ligand recognition in Toll-like receptor 8 signaling: a hypothesis. PLoS ONE. 2011, 6: e25118-10.1371/journal.pone.0025118.
Pond SLK, Frost SDW: Datamonkey: rapid detection of selective pressure on individual sites of codon alignments. Bioinformatics. 2005, 21: 2531-2533. 10.1093/bioinformatics/bti320.
Murrell B, Wertheim JO, Moola S, Weighill T, Scheffler K: Kosakovsky Pond SL: Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 2012, 8: 1002764-10.1371/journal.pgen.1002764.
Wei T, Gong J, Jamitzky F, Heckl WM, Stark RW, Rössle SC: Homology modeling of human Toll-like receptors TLR7, 8, and 9 ligand-binding domains. Protein Sci. 2009, 18: 1684-1691. 10.1002/pro.186.
Pagès M, Chaval Y, Herbreteau V, Waengsothorn S, Cosson J-F, Hugot J-P, Morand S, Michaux J: Revisiting the taxonomy of the Rattini tribe: a phylogeny-based delimitation of species boundaries. BMC Evol Biol. 2010, 10: 184-10.1186/1471-2148-10-184.
Aplin KP, Suzuki H, Chinen AA, Chesser RT, Ten Have J, Donnellan SC, Austin J, Frost A, Gonzalez JP, Herbreteau V, Catzeflis F, Soubrier J, Fang Y-P, Robins J, Matisoo-Smith E, Bastos ADS, Maryanto I, Sinaga MH, Denys C, Van Den Bussche RA, Conroy C, Rowe K, Cooper A: Multiple geographic origins of commensalism and complex dispersal history of Black Rats. PLoS ONE. 2011, 6: e26357-10.1371/journal.pone.0026357.
Moore WS: Inferring phylogenies from mtDNA variation: mitochondrial-gene trees versus nuclear-gene trees. Evolution. 1995, 49: 718-726. 10.2307/2410325.
Hobolth A, Dutheil JY, Hawks J, Schierup MH, Mailund T: Incomplete lineage sorting patterns among human, chimpanzee, and orangutan suggest recent orangutan speciation and widespread selection. Genome Res. 2011, 21: 349-356. 10.1101/gr.114751.110.
Pagès M, Bazin E, Galan M, Chaval Y, Claude J, Herbreteau V, Michaux J, Piry S, Morand S, Cosson J-F: Cytonuclear discordance among Southeast Asian black rats (Rattus rattus complex). Mol Ecol. 2013, 22: 1019-1034. 10.1111/mec.12149.
Lack JB, Greene DU, Conroy CJ, Hamilton MJ, Braun JK, Mares MA, Van Den Bussche RA: Invasion facilitates hybridization with introgression in the Rattus rattus species complex. Mol Ecol. 2012, 21: 3545-3561. 10.1111/j.1365-294X.2012.05620.x.
Nichols R: Gene trees and species trees are not the same. Trends Ecol Evol. 2001, 16: 358-364. 10.1016/S0169-5347(01)02203-0.
Edwards SV: Natural selection and phylogenetic analysis. PNAS. 2009, 106: 8799-8800. 10.1073/pnas.0904103106.
Areal H, Abrantes J, Esteves PJ: Signatures of positive selection in Toll-like receptor (TLR) genes in mammals. BMC Evol Biol. 2011, 11: 368-10.1186/1471-2148-11-368.
Smith SA, Jann OC, Haig D, Russell GC, Werling D, Glass EJ, Emes RD: Adaptive evolution of Toll-like receptor 5 in domesticated mammals. BMC Evol Biol. 2012, 12: 122-10.1186/1471-2148-12-122.
Downing T, Lloyd AT, O’Farrelly C, Bradley DG: The differential evolutionary dynamics of avian cytokine and TLR gene classes. J Immunol. 2010, 184: 6993-7000. 10.4049/jimmunol.0903092.
Kim HM, Park BS, Kim J-I, Kim SE, Lee J, Oh SC, Enkhbayar P, Matsushima N, Lee H, Yoo OJ, Lee J-O: Crystal structure of the TLR4-MD-2 complex with bound endotoxin antagonist Eritoran. Cell. 2007, 130: 906-917. 10.1016/j.cell.2007.08.002.
Resman N, Vasl J, Oblak A, Pristovsek P, Gioannini TL, Weiss JP, Jerala R: Essential roles of hydrophobic residues in both MD-2 and Toll-like receptor 4 in activation by endotoxin. J Biol Chem. 2009, 284: 15052-15060. 10.1074/jbc.M901429200.
Ohto U, Fukase K, Miyake K, Shimizu T: Structural basis of species-specific endotoxin sensing by innate immune receptor TLR4/MD-2. Proc Natl Acad Sci USA. 2012, 109: 7421-7426. 10.1073/pnas.1201193109.
Raetz CRH, Whitfield C: Lipopolysaccharide endotoxins. Annu Rev Biochem. 2002, 71: 635-700. 10.1146/annurev.biochem.71.110601.135414.
van der Woude MW, Bäumler AJ: Phase and antigenic variation in bacteria. Clin Microbiol Rev. 2004, 17: 581-611. 10.1128/CMR.17.3.581-611.2004.
Andersen-Nissen E, Smith KD, Strobe KL, Barrett SLR, Cookson BT, Logan SM, Aderem A: Evasion of Toll-like receptor 5 by flagellated bacteria. Proc Natl Acad Sci USA. 2005, 102: 9247-9252. 10.1073/pnas.0502040102.
Sun W, Dunning FM, Pfund C, Weingarten R, Bent AF: Within-species flagellin polymorphism in Xanthomonas campestris pv campestris and its impact on elicitation of Arabidopsis flagellin sensinG2-dependent defenses. Plant Cell. 2006, 18: 764-779. 10.1105/tpc.105.037648.
Maeshima N, Fernandez RC: Recognition of lipid A variants by the TLR4-MD-2 receptor complex. Front Cell Infect Microbiol. 2013, 3: doi:10.3389/fcimb.2013.00003
Zwickl DJ: The University of Texas at Austin. Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion. 2006, Ph.D. dissertation
Charlesworth B, Morgan MT, Charlesworth D: The effect of deleterious mutations on neutral molecular variation. Genetics. 1993, 134: 1289-1303.
Salcedo T, Geraldes A, Nachman MW: Nucleotide variation in wild and inbred mice. Genetics. 2007, 177: 2277-2291. 10.1534/genetics.107.079988.
Lecompte E, Aplin K, Denys C, Catzeflis F, Chades M, Chevret P: Phylogeny and biogeography of African Murinae based on mitochondrial and nuclear gene sequences, with a new tribal classification of the subfamily. BMC Evol Biol. 2008, 8: 199-10.1186/1471-2148-8-199.
Galan M, Pagès M, Cosson J-F: Next-generation sequencing for rodent barcoding: species identification from fresh, degraded and environmental samples. PLoS ONE. 2012, 7: e48374-10.1371/journal.pone.0048374.
Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 2000, 132: 365-386.
Stephens M, Donnelly P: A Comparison of bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet. 2003, 73: 1162-1169. 10.1086/379378.
Librado P, Rozas J: DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009, 25: 1451-1452. 10.1093/bioinformatics/btp187.
Villesen P: FaBox: an online toolbox for fasta sequences. Mol Ecol Notes. 2007, 7: 965-968. 10.1111/j.1471-8286.2007.01821.x.
Schultz J, Milpetz F, Bork P, Ponting CP: SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci USA. 1998, 95: 5857-5864. 10.1073/pnas.95.11.5857.
Kelley LA, Sternberg MJE: Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc. 2009, 4: 363-371.
Delport W, Poon AFY, Frost SDW, Kosakovsky Pond SL: Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics. 2010, 26: 2455-2457. 10.1093/bioinformatics/btq429.
Pond SLK, Posada D, Gravenor MB, Woelk CH, Frost SDW: Automated phylogenetic detection of recombination using a genetic algorithm. Mol Biol Evol. 2006, 23: 1891-1901. 10.1093/molbev/msl051.
Pond SLK, Posada D, Gravenor MB, Woelk CH, Frost SDW: GARD: a genetic algorithm for recombination detection. Bioinformatics. 2006, 22: 3096-3098. 10.1093/bioinformatics/btl474.
Posada D: jModelTest: phylogenetic model averaging. Mol Biol Evol. 2008, 25: 1253-1256. 10.1093/molbev/msn083.
Stamatakis A, Hoover P, Rougemont J: A rapid bootstrap algorithm for the RAxML web servers. Syst Biol. 2008, 57: 758-771. 10.1080/10635150802429642.
Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001, 17: 754-755. 10.1093/bioinformatics/17.8.754.
Kumar S, Skjaeveland A, Orr RJS, Enger P, Ruden T, Mevik B-H, Burki F, Botnen A, Shalchian-Tabrizi K: AIR: a batch-oriented web program package for construction of supermatrices ready for phylogenomic analyses. BMC Bioinforma. 2009, 10: 357-10.1186/1471-2105-10-357.
Rambaut A, Drummond AJ: Tracer v1.4. 2007, Available from http://beast.bio.ed.ac.uk/Tracer
Rambaut A: FigTree v1.3.1 2006–2009. 2009, Available with the program package at http://tree.bio.ed.ac.uk/software/figtree
Conow C, Fielder D, Ovadia Y, Libeskind-Hadas R: Jane: a new tool for the cophylogeny reconstruction problem. Algorithms Mol Biol. 2010, 5: 16-10.1186/1748-7188-5-16.
Cruaud A, Rønsted N, Chantarasuwan B, Chou LS, Clement WL, Couloux A, Cousins B, Genson G, Harrison RD, Hanson PE, Hossaert-McKey M, Jabbour-Zahab R, Jousselin E, Kerdelhué C, Kjellberg F, Lopez-Vaamonde C, Peebles J, Peng Y-Q, Pereira RAS, Schramm T, Ubaidillah R, van Noort S, Weiblen GD, Yang D-R, Yodpinyanee A, Libeskind-Hadas R, Cook JM, Rasplus J-Y, Savolainen V: An extreme case of plant-insect codiversification: figs and fig-pollinating wasps. Syst Biol. 2012, 61: 1029-1047. 10.1093/sysbio/sys068.
Shimodaira H, Hasegawa M: Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol Biol Evol. 1999, 16: 1114-1116. 10.1093/oxfordjournals.molbev.a026201.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Ashkenazy H, Erez E, Martz E, Pupko T, Ben-Tal N: ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids. Nucleic Acids Res. 2010, 38: W529-533. 10.1093/nar/gkq399.
Holm L, Kääriäinen S, Rosenström P, Schenkel A: Searching protein structure databases with DaliLite v.3. Bioinformatics. 2008, 24: 2780-2781. 10.1093/bioinformatics/btn507.
Kalinowski ST: How well do evolutionary trees describe genetic relationships among populations?. Heredity (Edinb). 2009, 102: 506-513. 10.1038/hdy.2008.136.
Offord V, Coffey TJ, Werling D: LRRfinder: a web application for the identification of leucine-rich repeats and an integrative Toll-like receptor database. Dev Comp Immunol. 2010, 34: 1035-1041. 10.1016/j.dci.2010.05.004.
This work was supported by the French National Agency for Research projects CERoPath (grant number 00121 0505, 07 BDIV 012) http://www.ceropath.org/ and BioDivHealthSEA (grant number ANR 11 CPEL 002), and the Czech Science Foundation (grant number 206/08/0640). Cooperation on this project was also partly supported by bilateral project BARRANDE (grant number MEB021130/24504WM). The thesis of A. Fornůsková was partly funded by a three year French government fellowship and the fellowship from Masaryk University. MP is currently funded by an FRS - FNRS fellowship (Belgian Fund for Scientific Research).We are grateful to Anna Bryjová, Yannick Chaval, Gael Kergoat, Marian Novotný, Sylvain Piry, Lucie Vlčková for their help during various stages of the manuscript preparation and to Jamie Caroline Winternitz for language corrections. We also thank to the CBGP HPC computational platform and to the Centre Méditerranéen Environnement Biodiversité.
The authors declare that they have no competing interests.
Conceived and designed the experiments: AF JFC JB NCH MV. Performed the sequencing: AF MG FC. Analysed the data: AF MV MP EJ. Contributed samples: SM JFC AF. Wrote the paper: AF MV JFC JB NCH MP EJ (sorted by the significance of contributions). All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Table S1: Summary of sampled specimens and identification of haplotypes. Table S2. Primer description. Table S3. Residues binding to LPS in TLR4 based on knowledge of 3D-crystalography in human predicted by Park et al. 2009. Table S4. Potential residues binding ssRNA predicted by Wei et al. 2009. Figure S1. Protein structure of TLR4 (a, c) and TLR7 (b, d) identified by SMART (http://smart.embl-heidelberg.de/) (a, b) and CONSURF (c, d). SMART (a, b) identified following types of domains: LRR - Leucine rich repeat; LRRCT - Leucine rich repeat C-terminal domain; TIR - TIR domain, Fulfilled blue box (TD) - transmembrane domain; LRRNT - Leucine rich repeat N-terminal domain. Red box - LBR (from AA248 to AA469 for TLR4 and from AA495 to AA597 for TLR7). ECD - extracellular domain is represented by solid black double arrow; ICD - intracellular domain is represented by dashed double arrow. Distal part of ICD (ICD -DP) is indicated by a simple solid arrow. Positions of forward and reverse primers used for amplification are shown by arrows. Arrows of same color indicates primer pairs. Description of crystallographic structure (c, d) LBR is represented by red polygon; TD is present between two dashed lines. To the right from TD is ICD, to the left is ECD. Figures S4. Test of congruence between the presumably neutral and Tlr phylogenies (Tlr4 (a), Tlr7 (b) following JANE 4). Number at X axis represents costs of co-divergence. The red dashed line represents the cost observed in our data. The blue columns represent the random distributions of costs. Lower cost than random observed in our data signified higher congruence between species and gene topologies. Figure S5. Superimposition of structures, tree clustering diagrams based on linkage distance, (a) LBRTLR4 and (b) LBRTLR7; individual LBR-variants often unify more species; description of LBR-variants labels is in the Table S1 under Hap_LBRTLR4 and Hap_LBRTLR7. Figure S6. Analysis of LBR amino acid sequence charge at pH 7 (LRRFinder) for (a) LBRTLR4 and (b) LBRTLR7, individual LBR-variants often unify more species; description of LBR-variants labels is in the Table S1 under Hap_LBRTLR4 and Hap_LBRTLR7. Mouse species are in red, Rattus spp. and related genera are in blue. (PDF 434 KB)
About this article
Cite this article
Fornůsková, A., Vinkler, M., Pagès, M. et al. Contrasted evolutionary histories of two Toll-like receptors (Tlr4 and Tlr7) in wild rodents (MURINAE). BMC Evol Biol 13, 194 (2013). https://doi.org/10.1186/1471-2148-13-194
- Arms race
- Host-pathogen interaction
- Pattern recognition receptors
- Adaptive evolution
- Pathogen-Associated Molecular Pattern (PAMP)