Evolutionary rate patterns of genes involved in the Drosophila Toll and Imd signaling pathway
BMC Evolutionary Biology volume 13, Article number: 245 (2013)
To survive in a hostile environment, insects have evolved an innate immune system to defend against infection. Studies have shown that natural selection may drive the evolution of immune system-related proteins. Yet, how network architecture influences protein sequence evolution remains unclear. Here, we analyzed the molecular evolutionary patterns of genes in the Toll and Imd innate immune signaling pathways across six Drosophila genomes within the context of a functional network.
Based on published literature, we identified 50 genes that are directly involved in the Drosophila Toll and Imd signaling pathways. Of those genes, only two (Sphinx1 and Dnr1) exhibited signals of positive selection. There existed a negative correlation between the strength of purifying selection and gene position within the pathway; the downstream genes were more conserved, indicating that they were subjected to stronger evolutionary constraints. Interestingly, there was also a significantly negative correlation between the rate of protein evolution and the number of regulatory microRNAs, implying that genes regulated by more miRNAs experience stronger functional constraints and therefore evolve more slowly.
Taken together, our results suggested that both network architecture and miRNA regulation affect protein sequence evolution. These findings improve our understanding of the evolutionary patterns of genes involved in Drosophila innate immune pathways.
Over the past few years, molecular evolution within a network architecture has been of great interest in population genetics [1–7]. Studies on major cellular pathways have demonstrated that network topology constrains the evolutionary pattern of genes. Many reports have demonstrated the existence of a positive correlation between gene pathway position and nucleotide substitution rate. For example, in the melanin synthesis pathway of silkworms , the anthocyanin biosynthetic pathway in plants [9–11], and the Drosophila Ras signal transduction pathway  the upstream genes are subjected to stronger evolutionary constraints. In contrast, an opposite effect was observed in other pathways, such as the animal Toll-like receptor (TLR)  and the yeast HOG  signaling pathways. Moreover, in the Caenorhabditis insulin/TOR signaling transduction pathway, the pattern of selective constraints is driven by expression level , whereas in the N-glycosylation metabolic pathways across primates, connectivity of each gene drives the strength of purifying selection .
Multiple factors may affect the evolution of genes within networks and pathways, including the gene position, gene expression level [14–16], protein length , codon bias [7, 18], connectivity [19, 20], the number of regulatory microRNAs (miRNAs) that target a gene , and the length of its 3′-untranslated region (3′-UTR) [22, 23]. Furthermore, studies have demonstrated that miRNAs participate in the regulation of innate immunity and inflammatory responses [24, 25]. Thus, miRNA regulation should be analyzed from an evolutionary perspective to improve our understanding of the impact of miRNAs on protein evolution.
The nuclear factor κB (NF-κB) pathway plays a central role in innate immunity by which invertebrates defend against pathogens [26, 27]. Drosophila possess two pathways (Toll and Imd innate immune response system; Figure 1) to activate NF-κB transcription factors [27, 28]. The Toll pathway is responsible for defense against Gram-positive bacteria or fungi when the cleaved ligand Spatzle binds to the Toll receptor, eventually leading to the activation of the NF-κB family members Dorsal and Dorsal-related immunity factor (Dif). This pathway also participates both in developmental processes [29–31] and immunity [32, 33]. In contrast, the Imd pathway controls resistance to Gram-negative bacterial infections [32, 34]. The JAK-STAT pathway also correlates with the Drosophila innate immune response, but unlike the Toll and Imd pathways, it remains poorly understood, and only four genes (UPD, DOME, JAK, STAT) have been reported within this pathway [34–36]. Studies have shown that natural selection may drive the evolution of immune system proteins [37–39]. Yet, the mechanism by which network architecture influences protein evolution within innate immune systems remains unclear. The complete genome sequences of Drosophila species and our current knowledge of the innate immune pathways in D. melanogaster together offer opportunity for a fine-scale evolutionary analysis of the Drosophila Toll and Imd pathways within the context of a network framework.
By surveying recent papers, we identified 50 immune-related genes that are directly involved in the Drosophila Toll and Imd signaling pathways, by transferring signals from receptor to transcription factor. We further investigated the evolutionary mechanism of Toll and Imd pathway genes across Drosophila species to understand 1) whether there exists a correlation between the strength of purifying selection and gene pathway position within the Toll and Imd innate immune signaling pathways; and 2) which of the topological parameters that characterize network evolution contributes most to the observed selective patterns.
Analysis of protein sequence evolution
Variation in selective constraints across immunity pathways was assessed with the help of the program PAML  and a comparison of alternative evolutionary models. For all 50 immune-related genes that we identified from the literature, the M0 (one dN/dS ratio) model calculated a single nonsynonymous/synonymous substitution rate (ω = dN/dS) ratio for all branches using a MUSCLE alignment . According to our results, the Drosophila immune genes have undergone strong functional constraints, with ω values ranging from 0.0001 (eff) to 0.4290 (Spz) (Additional file 1: Table S1). Additionally, by comparing the M1a (nearly neutral) and M2a (positive selection) models, the M7 (beta) and M8 (beta & ω) models, and also the M8 and M8a (beta & ω s = 1) models, positive selection was observed for Spz, Mstprox, Tl, Toll-4, cact, Dif, Sphinx1, Ect4, and Dnr1 (Additional file 1: Table S2). However, after the false discovery rate test (q = 0.05), only five genes (Dnr1, Sphinx1, Dif, Cact and Ect4) remained significantly positively selected (Additional file 1: Table S2). To improve the reliability of our analysis, we further used a PRANK [42, 43] alignment and detected positive selection for Dnr1, Sphinx1, Mst and Tl after the FDR test (q = 0.05) (Additional file 1: Table S3). The PAML results based on two different alignments (MUSCLE and PRANK) were only consistent for two genes, Sphinx1 and Dnr1. Therefore, these two genes seemed to contain a rather robust signal of positive selection, strongly suggesting that they were indeed subjected to positive selective constraints.
The strength of purifying selection within Drosophila Toll and Imd signaling pathways
Correlations between evolutionary parameters (dS, dN, and ω) and topological factors (i.e., gene position within the pathway, gene expression level, protein length, codon usage bias, connectivity of individual genes, the number of regulatory miRNAs, the length of 3′-UTR) were estimated applying Spearman’s rank correlation coefficients (ρ). For different factors that may affect protein sequence evolution, a significant negative correlation between ω values and gene position was observed (ρ = −0.370 [−0.529, −0.114], P = 0.020 after FDR correction; Figure 2 and Additional file 1: Table S4), indicating that the downstream genes were more conserved and therefore underwent stronger purifying selection. In addition, dN and dS values were significantly correlated with gene position (dN: ρ = −0.476 [−0.649, −0.243], P = 0.004 after FDR correction; dS: ρ = −0.467 [−0.645, −0.237], P = 0.004 after FDR correction; Figure 2 and Additional file 1: Table S4).
Given that selective constraints can be affected by various topological factors, the relationship between different variables was further analyzed. According to our results, connectivity was significantly correlated with the expression level of immune genes after infection by bacterial (Exp1) or fungal (Exp2) (Exp1: ρ = 0.450 [0.197, 0.671], P = 0.005 after FDR correction; Exp2: ρ = 0.565 [0.329, 0.737], P < 0.001 after FDR correction; Additional file 1: Table S4). When we estimated the expression levels after infection at each separate time point, similar results were observed (Additional file 1: Table S5). The highly significant correlation between connectivity and gene expression level indicated that the more connected a protein was, the higher gene expression level it had.
We also observed that the number of regulatory miRNAs (N miR , predicted using Miranda 3.3  with default parameters) and the length of the 3′-UTR (L 3′UTR ) both significantly correlated with dN, dS, and ω (Figure 3 and Table 1). To test the robustness of our predicted miRNA targets, we set the Miranda score to a higher level of 150.0, 160.0, and the correlation remained significant (Additional file 1: Table S4). These results confirmed that genes regulated by more miRNAs were likely to undergo stronger functional constraints and therefore evolve at slower rates [21, 45].
To investigate whether the observed correlations resulted from direct or indirect influences, two multivariate analysis techniques (partial analysis and path analysis) were performed. As shown in Additional file 1: Table S4, gene pathway position, the number of regulatory miRNAs, and the length of the 3′-UTR are correlated with each other and with the dN, dS, and ω values. Partial analysis revealed that, when controlling for N miR and L 3′UTR , the correlation between position and dN remained significant (ρ = −0.318 [−0.506, −0.168], P = 0.028) while those between position and dS and between position and ω disappeared (dS: ρ = −0.203 [−0.465, −0.076], P = 0.167; ω: ρ = −0.233 [−0.422, 0.010], P = 0.112). Similarly, when controlling for L 3′UTR , the significant correlations disappeared, except for those between dN and position (ρ = −0.333 [−0.504, −0.206], P = 0.019), between N miR and dN (ρ = −0.384 [−0.664, −0.044], P = 0.007), and between N miR and dS (ρ = −0.361 [−0.701, 0.156], P = 0.011), suggesting that the correlations between N miR and dN and between N miR and dS were not mediated by the length of the 3′-UTR. However, when controlling for N miR , only the correlation between dN and position remained significant (ρ = −0.304 [−0.486, −0.126], P = 0.034), while the correlations between L 3′UTR and dN, dS, and ω were no longer significant (dN: ρ = 0.072 [−0.372, 0.554], P = 0.624; dS: ρ = 0.270 [−0.373, 0.670], P = 0.060; ω: ρ = −0.248 [−0.439, 0.011], P = 0.085), indicating that the correlation between gene evolutionary rate and L 3′UTR was mediated by the number of regulatory miRNAs. Throughout our partial analysis, the significant correlation between pathway position and dN consequently indicated that gene position within the pathway is an important parameter that influences protein sequence evolution within a network framework.
Through path analysis, the regression coefficients can be decomposed into direct and indirect correlations under the user-defined causal model. We therefore applied path analysis to investigate which contributor primarily constrained protein evolution within the network framework. During our path analysis (Figure 4), dN and ω were considered endogenous variables while the other variables were considered exogenous. Consistent with the results of partial correlation analysis, we observed that dN was significantly affected by position (β = −0.306, P = 0.020) while ω was only affected by dN (β = 0.418, P = 0.008) even after removing the influence of other topological features. In addition, dN was significantly correlated with ENC (β = 0.405, P = 0.002). Thus, Figure 4 demonstrates that pathway position and ENC can affect ω by influencing dN. Similar results were observed when ENC was considered as an endogenous variable in addition to dN and ω. Together, these results suggest that among the parameters that describe the network, pathway position is a major contributor to the tendency of the selective constraints.
Positive selection acting on Toll and Imd pathway genes
There were 40 genes in common between the study by Sackton et al. on the dynamic evolution of the Drosophila innate immune system  and ours. Sackton and his colleagues detected signals of positive selection for Dnr1, ModSP, PGRP-LC, and Nec, whereas we detected signals of positive selection for Dnr1, Sphinx1, Dif, Cact, and Ect4. The discrepancy between these studies might result from the use of different aligners (Sackton et al.: T-Coffee, us: MUSCLE). To explore the hypothesis, we performed the similarity analysis again using PRANK. Our results showed positive selection for Dnr1, Sphinx1, Mst, and Tl after the FDR test (q = 0.05). Only two of the genes, Sphinx1 and Dnr1, were consistently identified in at least two of the three analyses based on different alignments. Therefore, these two genes seemed to exhibit a rather robust signal of positive selection, strongly indicating that they have indeed undergone such selective constraints. Overall, the discrepancy between different alignments confirmed that the choice of alignment algorithm has a strong impact on estimates of positive selection . As described in Markova-Raina et al., not all alignment errors are created equal, and even PRANK, which was reported to outperform other aligners in simulation , still has a high false positives rate; therefore, one must compare the results of different alignment programs to reduce the rate of false positives in estimates of positive selection.
For Sphinx1, which is located upstream in the innate immune pathway, the positive selection acting on it might make it more functional, better adapted to the changing extracellular environment, and subsequently, gain new functions through adaptive evolution [38, 47, 48]. Furthermore, a quantitative analysis by Obbard et al.  confirmed that adaptive evolution is a major factor driving molecular evolution within the Drosophila immune system. While we performed our network-level analysis from a phylogenetic perspective, Obbard et al. focused on population genetic data (for D. melanogaster and D. simulans) to quantify the effects of natural selection. Despite different data and methods in our study and that of Obbard et al., we both found that the peptidoglycan recognition protein and Gram-negative binding protein showed no signs of positive selection, perhaps because of their roles in binding with highly conserved microbial molecules .
Selective constraints and pathway architecture
Because upstream genes are more exposed to the hostile environment, to defend against pathogens, mutations in these genes are likely to have more pleiotropic effects than those in genes acting downstream. Studies have also demonstrated that immune system genes tend to exhibit higher rates of adaptive evolution, which have been attributed to their coevolution with pathogens [37, 39, 50]. Indeed, in our analysis, we detected a robust negative correlation between the rate of protein evolution and a gene’s position in the Drosophila Toll and Imd signaling pathways, indicating that upstream genes experienced more relaxed selective constraints. A similar distribution of purifying selection has also been observed along the insulin/Tor pathways in vertebrates and Drosophila[51, 52], the animal TLR signaling pathway , the N-glycosylation metabolic pathway in primates , and the HOG signal transduction pathway in yeast .
Throughout our multivariate analysis, the correlation between gene position and dN was significant, and given the results of path analysis, we see that pathway position can influence ω values through an effect on dN. Because dN/dS is the measure of selection/constraint, dN is actually the metric of selective pressure; these results overall demonstrated that gene position within the network was an important factor driving the polarity of selective constraints along Toll and Imd pathways. Although in the Caenorhabditis insulin/TOR signaling transduction pathway, the pattern of selective constraints was driven by expression level , and in the N-glycosylation metabolic pathway in primates, connectivity was the main contributor  a study of animal TLR signaling pathway  agrees with our results in finding a negative relationship between evolutionary constraint and gene position. Because the Toll and Imd signaling pathways are homologous to the mammalian TLR signaling pathway, which plays a vital role in animal innate immunity , the shared evolutionary pattern provides strong supporting evidence for our observations.
The negative correlation between the number of regulatory miRNAs and protein sequence evolution
In addition to the negative correlation between gene evolutionary rates and pathway position, we observed significant correlations between N miR and the dS, dN, and ω values, and between L 3′UTR and the dS, dN, and ω values. Given the significant correlation between N miR and L 3′UTR , to determine which was the main factor influencing evolutionary rates (or whether they both had an influence), partial analysis was performed. When controlling for L 3′UTR , the correlation between N miR and dN and between N miR and dS remained significant. However, when controlling for N miR , the significant correlation between L 3′UTR and dN, dS and ω all disappeared, indicating that the correlation between gene evolutionary rates and L 3′UTR was mediated by the number of regulatory miRNAs. One possible explanation is that genes regulated by more miRNAs tend to have more molecular functions in different biological processes . Consequently, these pleiotropic genes require more complex and precise regulation by miRNA .
In this study, we estimated different topological parameters that may influence the evolution of immune-related gene within a network framework. Because dN, dS, and ω are subjected to many evolutionary pressures, there might be other topological factors that we did not consider in this study. Notably, to estimate the evolutionary pattern of genes in the Drosophila immune pathways Toll and IMD, the expression data we analyzed were gene expression levels after infection rather than constitutive expression. Compared with previous studies, we added two more topological parameters to our analysis (the number of regulatory miRNAs and the length of the 3′-UTR) to improve our understanding of the impact of miRNAs on protein sequence evolution.
We found a polarity in the strength of purifying selection along the Drosophila Toll and Imd pathways, with the downstream genes being more conserved. Of all the immune genes investigated, two (Sphinx1 and Dnr1) exhibited signals of positive selection. Notably, we provided strong evidence to show that gene position within the pathway was an important parameter influencing protein sequence evolution within the Drosophila Toll and Imd innate immune response systems. Moreover, the negative correlation between protein sequence evolution and the number of regulatory miRNAs confirmed that genes regulated by more miRNAs are likely to undergo stronger functional constraints, and therefore exhibit slower gene evolutionary rates. Further studies investigating the patterns of molecular evolution within different pathways will undoubtedly improve our understanding of natural selection in pathways and networks.
By searching research articles and recent reviews [26, 32, 34–36, 54–57], we compiled a list of immune-related genes involved in the Toll and Imd pathways and depicted their relationships (Figure 1). We first downloaded the protein-coding DNA sequences (CDS) of all immune genes (Additional file 1: Table S6) in the D. melanogaster genome from FlyBase  and then identified orthologs of these genes in other Drosophila species (D. simulans, D. sechellia, D. yakuba, D. erecta, and D. ananassae) using the Database of Orthologous Groups (OrthoDB, http://cegg.unige.ch/orthodb6) . If one gene had several transcripts, the longest was chosen as the ortholog for later analysis.
We recovered genes that were not annotated in OrthoDB through two BLAST steps. We first used protein sequences from D. melanogaster as queries and conducted BlastP searches against the whole genome of interest at NCBI, with a BLAST score >150 and length >50. The resulting sequence was then BLAST screened against the D. melanogaster genome. If the resulting protein sequence was the same as the original we obtained from FlyBase, we considered the resulting sequence an ortholog of the D. melanogaster one.
Some sequences appeared to have long deletions when aligned to orthologous genes; many such cases were artifacts rather than the true cases of deletion. Such sequences often contain stop codons when aligned, and consequently could not be analyzed in PAML. We recovered these incomplete sequences through several steps, as follows: we first conducted a Blat search on the UCSC genome browser , and the resulting sequence was used as a query to be predicted in the GeneWise tool with default parameters . Predictions that did not contain frameshift mutations or internal stop codons were considered to be orthologs to D. melanogaster sequences. Alignments of these newly-annotated gene sequences are provided in Additional file 2.
Multiple sequence alignment and phylogenetic analysis
Multiple sequence alignment of the orthologous CDSs was conducted using MUSCLE  with default options. Using the resulting alignments, we reconstructed a Drosophila phylogeny with MrBayes , applying a mixed amino acid substitution model. Four chains with independent runs of 10,000,000 generations were performed to examine the parameter space. We sampled the trees every 1000 generations. The first 2,500 trees were discarded as burn-in. The analyses gave a well-supported topology that corresponded to the known relationships among Drosophila species . This topology was used in subsequent codon-based tests of selection.
Protein sequence evolution analysis
The strength of selective pressures was examined by calculating dS, dN, and ω using the CODEML program in the PAML4.4 package . Values of ω < 1, = 1 and > 1 indicate purifying selection, neutral evolution, and positive selection of the target gene, respectively. To avoid synonymous site saturation, which would prevent us from analyzing the more divergent CDS alignments , we limited this analysis to the six species in the melanogaster group (D. melanogaster, D. sechellia, D. simulans, D. ananassae, D. erecta, and D. yakuba). The F3 × 4 codon frequency model  was applied throughout our codon-based analyses. We first conducted analyses using different models to evaluate changes in selective pressure: M0 (one ratio), M1a (nearly neutral), M2a (positive selection), M7 (beta), M8 (beta & ω), and M8a (beta & ω s = 1). We further tested whether some codon positions have undergone positive selection using a likelihood ratio test (LRT)  to compare models, specifically the M1a and M2a models, the M7 and M8 models, and the M8 and M8a models. To avoid false signals of positive selection, we also conducted a false discovery rate (FDR) test  controlling the q value at 0.05. We also performed our analyses using alignments obtained from PRANK [42, 43], which was reported to outperform other aligners in simulation , to explore discrepancies between our results and those of Sackton et al. .
Network framework analysis
Because the evolution of molecules within a network can be affected by multiple topological parameters, including gene position within the pathway, connectivity, protein length (Lpro), codon bias (ENC), gene expression level (mRNA abundance), the number of regulatory miRNAs (N miR ) that target the gene, and the length of its 3′-UTR (L 3′UTR ), we conducted a bivariate correlation analysis between these variables and the selective constraints parameters (dS, dN, and ω) applying Spearman’s rank correlation coefficients (ρ).
All the immune genes identified were directly involved in the Toll and Imd innate immune response signaling pathways, transducing signals from signal receptors (i.e., PGRP-SA, PGRP-LC) to transcription factors (Dif, dl, Rel). We assigned a position to each gene according to its function within the pathway. Paralogous genes were assigned the same position. For instance, the nine Toll-related genes (Tl, 18w, Mstprox, Toll-4, Tehao, Toll-6, Toll-7, Tollo, Toll-9) were assigned the same position. Imd proteins accept signals from Gram-negative bacteria, whereas Toll receptors receive signals from Gram-positive bacteria and fungi. Both Toll and Imd receptors are located on cell membranes, so we assigned the Imd gene the same position as Toll. With signals from the activated Toll receptors, three molecules (MyD88, Tube, Pelle) are recruited to form a heterotrimeric complex, and they are closely connected with each other through two distinct death domains, thus their positions were considered the same as well. During microbe recognition, SPZ is cleaved by the activated enzyme SPE and then transduces signals to membrane surface receptors. Based on their close relationship, we attributed the same position to SPE and SPZ.
We also analyzed the Toll and Imd pathways separately; we observed similar results (Additional file 1: Table S7, Additional file 1: Table S8), which are not discussed further. The Toll and Imd pathways are both homologous to the mammalian TLR signaling pathway, which plays a vital role in animal innate immunity , genes in the Toll or Imd pathway are closely associated with each other (e.g., Traf6 in the Toll pathway regulates IκB kinase /NF-κB kinase, and Tak1 in the IMD pathway takes part in the formation of IκB kinase /NF-κB kinase). Additionally, expression profile analyses showed that both the Toll and IMD pathways can be up-regulated upon infection by the same pathogen [67, 68]. Therefore, we preferred to analyze the two pathways as a unit.
To better characterize the relationships among different topological parameters (gene position within the pathway, gene expression level, protein length, ENC, PPI, the number of regulatory miRNAs, the length of 3′-UTR), we conducted partial analysis and path analysis. Through path analysis, the regression coefficients can be decomposed into direct and indirect correlations under a user-defined causal model. We therefore used path analysis to identify the main contributor constraining protein evolution within the network framework. For path analysis, dN and ω were considered to be endogenous variables while the other variables were considered exogenous. To correct for the fact that the causal model in path analysis is user defined, the analysis was repeated with codon usage bias treated as an endogenous variable in addition to dN and ω.
All these analyses were conducted applying PASW statistical software. Codon usage bias values were calculated with DnaSP 5.10.01 . Connectivity data (PPIs) of D. melanogaster were obtained from the BioGRID database . Because the expression data of immune related genes upon infection was not available for all the Drosophila species, we assumed that the relative expression levels of immune-related genes were conserved across species. Expression data of D. melanogaster after bacterial (Exp1) or fungal (Exp2) infection were obtained from De Gregorio et al. . The number of regulatory miRNAs targeting the immune genes were predicted with Miranda 3.3  using default parameters. To test the robustness of our predicted miRNA targets, we improved the Miranda score to 150.0, 160.0 and repeated the prediction. The resulting numbers of regulatory miRNAs were defined as N150 and N160, respectively.
FM and MH designed the study and drafted the manuscript together. MH, SQ and XJS performed the research and analyzed the data. YFL, LMC and PJ analyzed the data. All authors have read and approved the final version of the manuscript.
Dorsal-related immunity factor
False discovery rate
Protein protein interaction
Effective number of codons
Protein coding DNA sequences.
Jovelin R, Phillips PC: Expression level drives the pattern of selective constraints along the insulin/Tor signal transduction pathway in Caenorhabditis. Genome Biol Evol. 2011, 3: 715-10.1093/gbe/evr071.
Montanucci L, Laayouni H, Dall’Olio GM, Bertranpetit J: Molecular evolution and network-level analysis of the N-glycosylation metabolic pathway across primates. Mol Biol Evol. 2011, 28 (1): 813-823. 10.1093/molbev/msq259.
Obbard DJ, Welch JJ, Kim K-W, Jiggins FM: Quantifying adaptive evolution in the Drosophila immune system. PLoS Genet. 2009, 5 (10): e1000698-10.1371/journal.pgen.1000698.
Ramsay H, Rieseberg LH, Ritland K: The correlation of evolutionary rate with pathway position in plant terpenoid biosynthesis. Mol Biol Evol. 2009, 26 (5): 1045-1053. 10.1093/molbev/msp021.
Song X, Jin P, Qin S, Chen L, Ma F: The evolution and origin of animal toll-like receptor signaling pathway revealed by network-level molecular evolutionary analyses. PLoS One. 2012, 7 (12): e51657-10.1371/journal.pone.0051657.
Stern DL, Orgogozo V: Is genetic evolution predictable?. Science. 2009, 323 (5915): 746-751. 10.1126/science.1158997.
Y-h Y, F-m Z, Ge S: Evolutionary rate patterns of the Gibberellin pathway genes. BMC Evol Biol. 2009, 9 (1): 206-10.1186/1471-2148-9-206.
Yu H-S, Shen Y-H, Yuan G-X, Hu Y-G, Xu H-E, Xiang Z-H, Zhang Z: Evidence of selection at melanin synthesis pathway loci during silkworm domestication. Mol Biol Evol. 2011, 28 (6): 1785-1799. 10.1093/molbev/msr002.
Lu Y, Rausher MD: Evolutionary rate variation in anthocyanin pathway genes. Mol Biol Evol. 2003, 20 (11): 1844-1853. 10.1093/molbev/msg197.
Rausher MD, Lu Y, Meyer K: Variation in constraint versus positive selection as an explanation for evolutionary rate variation among anthocyanin genes. J Mol Evol. 2008, 67 (2): 137-144. 10.1007/s00239-008-9105-5.
Rausher MD, Miller RE, Tiffin P: Patterns of evolutionary rate variation among genes of the anthocyanin biosynthetic pathway. Mol Biol Evol. 1999, 16 (2): 266-274. 10.1093/oxfordjournals.molbev.a026108.
Riley RM, Jin WI, Gibson G: Contrasting selection pressures on components of the Ras-mediated signal transduction pathway in Drosophila. Mol Ecol. 2003, 12 (5): 1315-1323. 10.1046/j.1365-294X.2003.01741.x.
Wu X, Chi X, Wang P, Zheng D, Ding R, Li Y: The evolutionary rate variation among genes of HOG-signaling pathway in yeast genomes. Biol Direct. 2010, 5 (1): 46-10.1186/1745-6150-5-46.
Duret L, Mouchiroud D: Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. Mol Biol Evol. 2000, 17 (1): 68-070. 10.1093/oxfordjournals.molbev.a026239.
Pál C, Papp B, Hurst LD: Highly expressed genes in yeast evolve slowly. Genetics. 2001, 158 (2): 927-931.
Drummond DA, Bloom JD, Adami C, Wilke CO, Arnold FH: Why highly expressed proteins evolve slowly. Proc Natl Acad Sci U S A. 2005, 102 (40): 14338-14343. 10.1073/pnas.0504070102.
Lemos B, Bettencourt BR, Meiklejohn CD, Hartl DL: Evolution of proteins and gene expression levels are coupled in Drosophila and are independently associated with mRNA abundance, protein length, and number of protein-protein interactions. Mol Biol Evol. 2005, 22 (5): 1345-1354. 10.1093/molbev/msi122.
Sharp PM: Determinants of DNA sequence divergence betweenEscherichia coli andSalmonella typhimurium: Codon usage, map position, and concerted evolution. J Mol Evol. 1991, 33 (1): 23-33. 10.1007/BF02100192.
Fraser HB, Hirsh AE: Evolutionary rate depends on number of protein-protein interactions independently of gene expression level. BMC Evol Biol. 2004, 4 (1): 13-10.1186/1471-2148-4-13.
Fraser HB, Hirsh AE, Steinmetz LM, Scharfe C, Feldman MW: Evolutionary rate in the protein interaction network. Science. 2002, 296 (5568): 750-752. 10.1126/science.1068696.
Cheng C, Bhardwaj N, Gerstein M: The relationship between the evolution of microRNA targets and the length of their UTRs. BMC Genomics. 2009, 10 (1): 431-10.1186/1471-2164-10-431.
Sandberg R, Neilson JR, Sarma A, Sharp PA, Burge CB: Proliferating cells express mRNAs with shortened 3′untranslated regions and fewer microRNA target sites. Science. 2008, 320 (5883): 1643-1647. 10.1126/science.1155390.
Stark A, Brennecke J, Bushati N, Russell RB, Cohen SM: Animal MicroRNAs confer robustness to gene expression and have a significant impact on 3′UTR evolution. Cell. 2005, 123 (6): 1133-1146. 10.1016/j.cell.2005.11.023.
Taganov KD, Boldin MP, Baltimore D: MicroRNAs and immunity: tiny players in a big field. Immunity. 2007, 26 (2): 133-137. 10.1016/j.immuni.2007.02.005.
Sheedy F, O’Neill L: Adding fuel to fire: microRNAs as a new class of mediators of inflammation. Ann Rheum Dis. 2008, 67 (Suppl 3): iii50-iii55. 10.1136/ard.2008.100289.
Tsakas S, Marmaras V: Insect immunity and its signaling: an overview. ISJ. 2010, 7: 228-238.
Hetru C, Hoffmann JA: NF-κB in the immune response of Drosophila. Csh Perspect Biol. 2009, 1 (6):
Aggarwal K, Silverman NS: Positive and negative regulation of the Drosophila immune response. BMB Rep. 2008, 41 (4): 267-277. 10.5483/BMBRep.2008.41.4.267.
Qiu P, Pan PC, Govind S: A role for the Drosophila toll/cactus pathway in larval hematopoiesis. Development. 1998, 125 (10): 1909-1920.
Halfon MS, Hashimoto C, Keshishian H: The Drosophila toll gene functions zygotically and is necessary for proper motoneuron and muscle development. BMC Dev Biol. 1995, 169 (1): 151-167. 10.1006/dbio.1995.1134.
Belvin MP, Anderson KV: A conserved signaling pathway: the Drosophila toll-dorsal pathway. Annu Rev Cell Dev Biol. 1996, 12 (1): 393-416. 10.1146/annurev.cellbio.12.1.393.
Valanne S, Wang JH, Rämet M: The Drosophila toll signaling pathway. J Immunol. 2011, 186 (2): 649-656. 10.4049/jimmunol.1002302.
Lemaitre B, Nicolas E, Michaut L, Reichhart JM, Hoffmann JA: The dorsoventral regulatory gene cassette spätzle/toll/cactus controls the potent antifungal response in Drosophila adults. Cell. 1996, 86 (6): 973-983. 10.1016/S0092-8674(00)80172-5.
Reumer A, Van Loy T, Schoofs L: The complexity of Drosophila innate immunity. Invertebr Surviv J. 2010, 7 (1): 32-44.
Kim T, Kim Y: Overview of innate immunity in Drosophila. J Biochem Mol Biol. 2005, 38 (2): 121-10.5483/BMBRep.2005.38.2.121.
Wang L, Ligoxygakis P: Pathogen recognition and signalling in the Drosophila innate immune response. Immunobiology. 2006, 211 (4): 251-261. 10.1016/j.imbio.2006.01.001.
Sackton TB, Lazzaro BP, Schlenke TA, Evans JD, Hultmark D, Clark AG: Dynamic evolution of the innate immune system in Drosophila. Nat Genet. 2007, 39 (12): 1461-1468. 10.1038/ng.2007.60.
Schlenke TA, Begun DJ: Natural selection drives Drosophila immune system evolution. Genetics. 2003, 164 (4): 1471-1480.
Obbard DJ, Jiggins FM, Halligan DL, Little TJ: Natural selection drives extremely rapid evolution in antiviral RNAi genes. Curr Biol. 2006, 16 (6): 580-585. 10.1016/j.cub.2006.01.065.
Yang Z: PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007, 24 (8): 1586-1591. 10.1093/molbev/msm088.
Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.
Löytynoja A, Goldman N: An algorithm for progressive multiple alignment of sequences with insertions. Proc Natl Acad Sci U S A. 2005, 102 (30): 10557-10562. 10.1073/pnas.0409137102.
Löytynoja A, Goldman N: Phylogeny-aware gap placement prevents errors in sequence alignment and evolutionary analysis. Science. 2008, 320 (5883): 1632-1635. 10.1126/science.1158395.
Betel D, Wilson M, Gabow A, Marks DS, Sander C: The microRNA. org resource: targets and expression. Nucleic Acids Res. 2008, 36 (suppl 1): D149-D153.
Chen SC-C, Chuang T-J, Li W-H: The relationships among microRNA regulation, intrinsically disordered regions, and other indicators of protein evolutionary rate. Mol Biol Evol. 2011, 28 (9): 2513-2520. 10.1093/molbev/msr068.
Markova-Raina P, Petrov D: High sensitivity to aligner and high rate of false positives in the estimates of positive selection in the 12 Drosophila genomes. Genome Res. 2011, 21 (6): 863-874. 10.1101/gr.115949.110.
Castoe TA, Jiang ZJ, Gu W, Wang ZO, Pollock DD: Adaptive evolution and functional redesign of core metabolic proteins in snakes. PLoS One. 2008, 3 (5): e2201-10.1371/journal.pone.0002201.
Flowers J, Sezgin E, Kumagai S, Duvernell D, Matzkin L, Schmidt P, Eanes W: Adaptive evolution of metabolic pathways in Drosophila. Mol Biol Evol. 2007, 24 (6): 1347-1354. 10.1093/molbev/msm057.
Jiggins FM, Kim K-W: Contrasting evolutionary patterns in Drosophila immune receptors. J Mol Evol. 2006, 63 (6): 769-780. 10.1007/s00239-006-0005-2.
Wang H-Y, Tang H, Shen C-KJ WC-I: Rapidly evolving genes in human. The glycophorins and their possible role in evading malaria parasites. Mol Biol Evol. 2003, 20 (11): 1795-1804. 10.1093/molbev/msg185.
Alvarez-Ponce D, Aguadé M, Rozas J: Network-level molecular evolutionary analysis of the insulin/TOR signal transduction pathway across 12 Drosophila genomes. Genome Res. 2009, 19 (2): 234-242.
Alvarez-Ponce D, Aguadé M, Rozas J: Comparative genomics of the vertebrate insulin/TOR signal transduction pathway: a network-level analysis of selective pressures. Genome Biol Evol. 2011, 3: 87-10.1093/gbe/evq084.
He X, Zhang J: Toward a molecular understanding of pleiotropy. Genetics. 2006, 173 (4): 1885-1891. 10.1534/genetics.106.060269.
Silverman N, Paquette N, Aggarwal K: Specificity and signaling in the Drosophila immune response. Invert Surviv J. 2009, 6 (2): 163-
Silverman N, Maniatis T: NF-κB signaling pathways in mammalian and insect innate immunity. Genes Dev. 2001, 15 (18): 2321-2342. 10.1101/gad.909001.
Marcu O, Lera MP, Sanchez ME, Levic E, Higgins LA, Shmygelska A, Fahlen TF, Nichol H, Bhattacharya S: Innate immune responses of Drosophila Melanogaster are altered by Spaceflight. PLoS One. 2011, 6 (1): e15361-10.1371/journal.pone.0015361.
Huang HR, Chen ZJ, Kunes S, Chang GD, Maniatis T: Endocytic pathway is required for Drosophila Toll innate immune signaling. Proc Natl Acad Sci. 2010, 107 (18): 8322-8327. 10.1073/pnas.1004031107.
Crosby MA, Goodman JL, Strelets VB, Zhang P, Gelbart WM: FlyBase: genomes by the dozen. Nucleic Acids Res. 2007, 35 (suppl 1): D486-D491.
Kriventseva EV, Rahman N, Espinosa O, Zdobnov EM: OrthoDB: the hierarchical catalog of eukaryotic orthologs. Nucleic Acids Res. 2008, 36 (suppl 1): D271-D275.
Karolchik D, Baertsch R, Diekhans M, Furey TS, Hinrichs A, Lu Y, Roskin KM, Schwartz M, Sugnet CW, Thomas DJ: The UCSC genome browser database. Nucleic Acids Res. 2003, 31 (1): 51-54. 10.1093/nar/gkg129.
Birney E, Clamp M, Durbin R: GeneWise and genomewise. Genome Res. 2004, 14 (5): 988-995. 10.1101/gr.1865504.
Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574. 10.1093/bioinformatics/btg180.
Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman TC, Kellis M, Gelbart W, Iyer VN: Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007, 450 (7167): 203-218. 10.1038/nature06341.
Goldman N, Yang Z: A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol Biol Evol. 1994, 11 (5): 725-736.
Whelan S, Goldman N: Distributions of statistics used for the comparison of models of sequence evolution in phylogenetics. Mol Biol Evol. 1999, 16 (9): 1292-10.1093/oxfordjournals.molbev.a026219.
Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995, 57 (1): 289-300.
De Gregorio E, Spellman PT, Rubin GM, Lemaitre B: Genome-wide analysis of the Drosophila immune response by using oligonucleotide microarrays. Proc Natl Acad Sci. 2001, 98 (22): 12590-12595. 10.1073/pnas.221458698.
Irving P, Troxler L, Heuer TS, Belvin M, Kopczynski C, Reichhart J-M, Hoffmann JA, Hetru C: A genome-wide analysis of immune responses in Drosophila. Proc Natl Acad Sci. 2001, 98 (26): 15119-15124. 10.1073/pnas.261573998.
Librado P, Rozas J: DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009, 25 (11): 1451-1452. 10.1093/bioinformatics/btp187.
Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006, 34 (suppl 1): D535-D539.
This work was jointly supported by grants from the National Natural Science Foundation of China (No. 30970348), the Major Program of Natural Science Research of Jiangsu Higher Education Institutions (No. 12KJA180005), the Ph.D. Programs Foundation of Ministry of Education of China (No. 20113207110009) and the Priority Academic Program Development of Jiangsu Higher Education Institutions.
The authors declare that they have no competing interests.
Electronic supplementary material
Additional file 1: Table S1: Summary statistics used in the multivariate analysis. Table S2. Phylogenetic analysis by maximum likelihood of the innate immune pathway genes (MUSCLE alignment). Table S3. Phylogenetic analysis by maximum likelihood of the innate immune pathway genes (PRANK alignment). Table S4. Bivariate correlations of different factors that may influence evolution of genes within a network (Toll and Imd pathways). Table S5. Bivariate correlations between connectivity and expression level of genes after infection. Table S6. Genes involved in the D. melanogaster innate immune pathway. Table S7. Bivariate correlations of different factors that may influence evolution of genes within a network (Toll pathway). Table S8. Bivariate correlations of different factors that may influence evolution of genes within a network (Imd pathway). (PDF 780 KB)
Additional file 2: Multiple sequence alignments of the Toll and IMD pathway orthologs across six Drosophila species. Alignments of the protein-coding sequences were performed using PRANK. Stop codons were removed from the ends of the alignments. Dots indicate the same nucleotide and “–”represents a missing nucleotide relative to sequences in the first row. (PDF 2 MB)
About this article
Cite this article
Han, M., Qin, S., Song, X. et al. Evolutionary rate patterns of genes involved in the Drosophila Toll and Imd signaling pathway. BMC Evol Biol 13, 245 (2013). https://doi.org/10.1186/1471-2148-13-245