Positive natural selection in primate genes of the type I interferon response
BMC Ecology and Evolution volume 21, Article number: 65 (2021)
The Type I interferon response is an important first-line defense against viruses. In turn, viruses antagonize (i.e., degrade, mis-localize, etc.) many proteins in interferon pathways. Thus, hosts and viruses are locked in an evolutionary arms race for dominance of the Type I interferon pathway. As a result, many genes in interferon pathways have experienced positive natural selection in favor of new allelic forms that can better recognize viruses or escape viral antagonists. Here, we performed a holistic analysis of selective pressures acting on genes in the Type I interferon family. We initially hypothesized that the genes responsible for inducing the production of interferon would be antagonized more heavily by viruses than genes that are turned on as a result of interferon. Our logic was that viruses would have greater effect if they worked upstream of the production of interferon molecules because, once interferon is produced, hundreds of interferon-stimulated proteins would activate and the virus would need to counteract them one-by-one.
We curated multiple sequence alignments of primate orthologs for 131 genes active in interferon production and signaling (herein, “induction” genes), 100 interferon-stimulated genes, and 100 randomly chosen genes. We analyzed each multiple sequence alignment for the signatures of recurrent positive selection. Counter to our hypothesis, we found the interferon-stimulated genes, and not interferon induction genes, are evolving significantly more rapidly than a random set of genes. Interferon induction genes evolve in a way that is indistinguishable from a matched set of random genes (22% and 18% of genes bear signatures of positive selection, respectively). In contrast, interferon-stimulated genes evolve differently, with 33% of genes evolving under positive selection and containing a significantly higher fraction of codons that have experienced selection for recurrent replacement of the encoded amino acid.
Viruses may antagonize individual products of the interferon response more often than trying to neutralize the system altogether.
The interferon response plays an important role in defending human cells against viruses . Because viruses replicate within cells of the host, their nucleic acids and proteins are exposed, at least to some degree, to the cellular environment. To exploit this vulnerability of viruses, hosts have evolved numerous intracellular sensors that recognize viral nucleic acids and proteins [1, 2]. When cellular sensors detect virus-specific structures, a signaling cascade is activated which ultimately leads to the production and secretion of one or more interferon proteins [3, 4]. Interferons then produce transcriptional changes in the infected cell, inducing expression of hundreds of host genes (called “interferon-stimulated genes,” or ISGs) that collectively act to limit viral replication . The resulting interferon-stimulated proteins act through a diversity of mechanisms [1, 5]. Additionally, interferons don’t just cause these transcriptional changes in the infected cell; they also signal to neighboring cells, even those that are uninfected, and induce the same transcriptional changes in those cells [6, 7]. In solid tissues, this produces a firewall of protected cells around the infected ones, impeding cell-to-cell spread of the virus.
Interferons use cell-surface receptors to signal and are organized into three classes based on the cell-surface receptor that they use. Type I interferons bind to the interferon α receptor (IFNAR), Type II interferons bind to interferon γ receptor (IFNGR), and Type III interferons bind to the interferon λ receptor (IFNLR) . There are 21 unique genes encoding interferon proteins in the human genome. It remains unclear why there are more interferon proteins than receptors. Different interferon-receptor pairs may result in different signaling and transcriptional induction, but this is still an active area of research .
Viruses are formidable antagonists of the interferon response, and use diverse tactics to degrade, mislocalize, inhibit, or otherwise thwart proteins involved in interferon responses . Viruses are known to target proteins that are both up- and downstream of the production of interferon molecules themselves . As an example, STING is an important component of the signaling pathway leading to interferon production. Many flaviviruses encode proteins that target the host protein STING for degradation or inactivation [10,11,12,13,14]. Some of the same flaviviruses that inactivate STING also inactivate interferon-stimulated transcription factors (e.g., STAT1 and STAT2) as a second measure to fully ensure the interferon response is disabled [15,16,17,18,19,20,21].
Interferon pathways thus constitute a hotbed of antagonistic interactions between hosts and viruses. There is evolutionary pressure on viruses to evade or inhibit the interferon response, and then reciprocal pressure on the host to retain the advantage. This mode of evolution has been analogized as an “arms race,” in that it is ongoing with both parties evolving in response to the other. Host–pathogen arms races are responsible for the massive complexity of our immune system [22,23,24,25,26]. Indeed, many genes in mammalian interferon pathways have been identified as evolving rapidly, consistent with pathogens exerting selection for these proteins to alter key binding interfaces. Specific examples of proteins in the interferon pathway evolving under positive selection include cGAS, OASs, STING, and SAMHD1 [27,28,29]. In Table 1, we have listed some of the other examples of interferon-related genes that are under positive selection, as well as viral interactions that could be responsible. While there are many individual examples, a holistic understanding of whether different parts of the interferon response are evolving under different pressures has yet to be obtained. We provide such an analysis here. This analysis is important because identifying mammalian genes under positive selection, in particular if those genes are relevant to virus replication or defense, has proven itself to be a powerful shortcut in identifying host proteins that interact differently with viruses in one possible animal host versus another . These are the host genes, therefore, that limit virus spillover between species [10, 23, 26, 30,31,32,33,34,35,36,37,38,39,40,41,42].
Curation of multiple sequence alignments
First, we curated lists of genes for our analysis. We separated genes involved in Type I interferon responses into two temporal categories with the dividing line being the expression of interferon-stimulated genes (Fig. 1). The interferon “induction” category contains genes acting upstream of the production of interferon α and β molecules—for instance, genes encoding proteins that identify pathogens, signal, and ultimately produce secreted interferon molecules, as well as the proteins that act downstream of interferon receptors and ultimately lead to the induction of interferon-stimulated genes. Interferon-stimulated genes are those that become expressed or over-expressed in the presence of interferon. We reviewed recent literature and curated lists of between 100 and 150 genes in each of these three categories: induction genes, interferon-stimulated genes, and random genes (using a random gene generator). In order for the evolution of a gene to be assessed, alignments of orthologous gene sequences are analyzed to quantify how substitutions have occurred over time. As such, simian primate orthologs of each of the induction, interferon-stimulated, and random genes were collected from GenBank and used to make a multiple sequence alignment for each gene. After visually inspecting and curating all alignments (see methods for alignment and quality control pipeline), we ended up with high quality multiple sequence alignments for 131 interferon-induction genes, 100 interferon-stimulated genes, and 100 random genes. The lists of genes analyzed can be found in Additional file 1.
Because our goal is to compare evolutionary signatures between each of the three categories of genes, we wanted to confirm that the three datasets were similar in other qualities. First, we assessed the species composition of the datasets. A cladogram is shown of the species from which all sequences derive. Branch width demonstrates the percentage of the 331 multiple sequence alignments in which each species is represented (Fig. 2a). We then plotted the number of species represented in the alignments, for each of the three classes of genes. There are similar distributions of species in all three categories of datasets (Fig. 2b). We also compared the tree lengths of the multiple sequence alignments in each of the three categories. Tree length is a measure of sequence divergence and is the average number of nucleotide substitutions per site in a multiple sequence alignment . The interferon-stimulated gene category appears to have alignments with tree lengths of greater value than either the induction or random gene sets (Fig. 2c). This was an initial hint that interferon-stimulated genes may be more genetically divergent than either induction genes or random genes. We next analyzed these datasets for signatures of positive selection.
Interferon-stimulated genes experience more intense positive selection than interferon induction genes or randomly-selected genes
We analyzed each of the 331 multiple sequence alignments for signatures of recurrent positive natural selection [45,46,47]. Selection can operate on nonsynonymous substitutions (changing the encoded amino acid) and on synonymous mutations (silent, not changing the encoded amino acids). Most genes experience purifying selection, where non-synonymous mutations are strongly selected against over evolutionary time . In contrast, gene regions or specific codons that have experienced repeated rounds of natural selection in favor of protein-altering mutations exhibit a characteristic inflation of the rate of non-synonymous (dN) DNA substitutions compared to synonymous (dS) substitutions (dN/dS > 1). Because nonsynonymous mutations occur more often than synonymous mutations by chance (due to the structure of the genetic code), computational models have been developed that use statistical frameworks to account for these unequal substitution rates [43, 48]. To evaluate patterns of dN/dS in these alignments, we used the statistical program Phylogenetic Analysis by Maximum Likelihood (PAML) . PAML fits multiple sequence alignments to different models of codon evolution and calculates the likelihood of this model given the alignment data and known species tree.
We first wanted to determine if any of our 331 genes are under positive selection and, if so, how this varies between the three categories of genes. We fit each alignment to two codon models, M8 and M8a, which are illustrated in Fig. 3a. M8a is a null model where all codons in the multiple sequence alignment must be placed into one of 11 bins of specific dN/dS values. Ten of these bins are distributed along a beta distribution of dN/dS values bounded between 0 and 1. The 11th bin is defined to have a dN/dS value = 1. M8 is identical, except that the 11th bin can have a dN/dS value greater than one . Once a likelihood is determined for the data being represented by each of these models, a Likelihood Ratio Test is used to determine whether the null model (M8a) should be rejected in favor of the model allowing for positive selection of some codons (M8). After this analysis was performed on all 331 multiple sequence alignments, we ran the Benjamini–Hochberg procedure with an FDR of 10% to help correct for multiple testing and avoid false positives.
We found between 18 and 33 genes in each of the three categories to be under positive selection after Benjamini–Hochberg correction with an FDR of 10% (Fig. 3b). The number of genes under positive selection in the interferon-stimulated category was significantly larger than the number of genes under positive selection in the random category (two-tailed Fisher's exact test, p < 0.05; Fig. 3b). To ensure that this conclusion did not change when the number of genes in each category was equal, we chose 100 induction category genes randomly 100 times and compared the proportion of genes under positive selection in this category to the proportion of genes under positive selection in the interferon-stimulated and random categories for each instance. Ninety-two times out of 100, our results were the same and the number of genes under positive selection in the interferon-stimulated category was significantly larger than the number in the random category. In eight instances, the number of genes under positive selection in the interferon-stimulated category was significantly larger than the number in both the random and induction categories. We have included this analysis in Additional file 1. The genes identified as being under selection are listed in Table 2.
We next evaluated whether the intensity of selection might be different between these gene categories by utilizing other codon models in PAML. We compared the whole-gene (i.e. global) dN/dS values estimated for each multiple sequence alignment, calculated using the PAML model M0 (Fig. 4a, top). M0 is a model that allows only a single bin, with an optimized value of dN/dS, into which all codons must be placed. The average whole-gene dN/dS values for interferon-stimulated genes were significantly different than both those for our random set and the interferon induction set (Kolmogorov–Smirnov test; Fig. 4a, bottom). This suggests either that interferon-stimulated genes are evolving more rapidly than the other genes, or are experiencing more neutral evolution.
To differentiate between these two possibilities, we looked more closely at the codons assigned to the dN/dS > 1 bin in M8. We tested whether a higher percentage of codons were placed in the M8 bin > 1 for interferon-stimulated genes than for genes in the random or induction categories. Indeed, the average percentage of codons that fell in the estimated M8 dN/dS > 1 bin was significantly greater for interferon-stimulated genes (Fig. 4b). This suggests that interferon-stimulated genes are, on average, experiencing more positive natural selection than genes in the other two categories.
If more codons are under positive selection, we might expect fewer codons would be evolving under negative selection (i.e., conserved in sequence). M2, a simple model that allows for positive selection, places all codons into one of three bins: a bin at dN/dS less than one (conserved), a bin at dN/dS = 1 (neutral), and a bin at dN/dS > 1 (positive selection) (Fig. 4c, top). We tested whether PAML placed fewer codons in the M2 bin of dN/dS < 1 for interferon-stimulated genes then for genes in the induction or random category. Indeed, for interferon-stimulated genes the average percentage of codons placed in the M2 bin less than 1 is significantly less than for random genes (Fig. 4c, bottom). This is consistent with interferon-stimulated genes having fewer codons under negative selection (i.e., constraint).
We find that interferon-stimulated genes are evolving more rapidly than canonical interferon induction genes and more rapidly than our sample of random human genes. Rapidly evolving host genes are key in enforcing species barriers to viral spillover . While the entirety of the human immune system is important, only the parts that are functionally divergent from the immune systems of other animals are important in the defense against zoonotic viruses. In other words, any immune obstacle that a virus has already overcome in an animal will not be a barrier to infecting humans unless that obstacle has taken on a different flavor of interaction in the human genome. Arms races are the selective engine that drive rapid sequence evolution at the interaction interfaces between host and virus proteins, as they each jockey to establish or destroy these interactions . This conflict matters because it means that these interactions play out differently in different species, and thus these evolutionary dynamics enforce species barriers to the transmission of pathogens.
We had hypothesized that viruses are more likely to evolve mechanisms to halt the production of antiviral cellular states by antagonizing the initial expression of interferons rather than the individual proteins which produce antiviral states. Intuitively, it makes sense to “turn off the tap” instead of trying to mop up the after-effects of an induced interferon response. Therefore, we expected that the induction pathway would be under greater pressure to evolve rapidly and that we would see a higher signal of positive selection in the induction pathway. Instead, we found that interferon-stimulated genes are evolving more rapidly than both a randomly drawn set of human genes and proteins involved in interferon induction. We have not made conclusions about the strength of positive selection on specific sites or genes in any of these categories—rather, we have analyzed these genes as groups and found significant differences between interferon-stimulated genes and the other two categories. It is possible that induction genes as a whole are under more evolutionary constraint in order to preserve specific functions in their respective signaling pathways. Previous work suggests that interferon genes themselves are evolving under different evolutionary constraints . The interferon molecules and receptors, though much expanded in extant species, is an ancient class of proteins that has had to evolve under the constraints of binding partner compatibility . We show that the interferon receptors are evolving under positive selection, but interferons and interferon-signaling genes may, as a class, be more constrained. In contrast, interferon-stimulated genes may have more flexibility to obtain and tolerate mutations. Because interferon-stimulated genes are sometimes more specific in function or can rely on the redundancies of the induced response, nonsynonymous mutations may be tolerated to a greater extent. For example, mammalian cells have several ways to shut down host and virus translation during viral infection: the IFIT family of proteins, ISG15, and ZAP are all examples of proteins that are induced by interferon and prevent viruses and hosts from translating RNA [67, 68]. Redundancy in this specific antiviral defense might mean that mutations can be more easily tolerated in each individual protein. Interferon-stimulated genes remain relatively understudied in terms of their mechanistic antiviral action . However, they may be at the forefront of the host-virus “arms race” that has implications for pathogenicity of viruses, the ability of viruses to spillover to new hosts, and the evolution of our immune systems. Finally, we note that although we approached the curation of these genes with a particular emphasis on genes important in antiviral pathways, many of the genes that have been analyzed here are involved in the immune system’s defense against other types of pathogens as well. Therefore, the positive natural selection we have identified partly reflects antagonism by other types of pathogen.
Definition of gene categories
The list of 131 interferon induction genes was curated from reviews of interferon signaling pathways [70,71,72,73,74]. We didn’t include genes listed in these reviews that were primarily belonging to DNA damage pathway, since these pathways have been shown to experience positive selection as well [75,76,77]. We do not assume that this “induction” category is a complete list of genes upstream of interferon-stimulated genes expression, but rather have treated it as a representative list of genes known to be implicated in several induction pathways. Any gene mentioned in the reviews that could not be unambiguously identified (i.e. gene name was listed as an alias for multiple genes) was removed from this list. The list of 100 interferon-stimulated genes was curated from published literature [5, 78,79,80]. These genes were verified by the Interferome database , with the criteria that each interferon-stimulated gene was upregulated at least twofold by Type I interferons. A list of random human genes was formed using a random gene set generator . We did not place the same gene in more than one category. If a gene is implicated in canonical induction pathway, but also upregulated by interferons, it was placed in the induction category.
Creation of multiple sequence alignments for each gene
The longest human isoform of each gene, along with any simian primate orthologs available, were collected from the NCBI Gene database. We collected and retained as many primate sequences as possible, including sequences that were labeled as unassigned gene loci, as long as that sequence returned the correct human ortholog in a reciprocal BLAST search back to the human genome. In some cases, primate orthologs contained “n”s suggesting that these bases did not meet certain quality thresholds. These sequences were retained, but note that PAML treats “n’s” as gaps and will therefore not analyze codons in multiple sequence alignments that contain them. Further, any sequence that was marked holistically as “Low Quality” on NCBI were not included. The cDNA sequences were then translated to amino acids and aligned with the MUSCLE algorithm using the Unipro UGENE software  or MEGA . Pal2Nal  was then used to reference this amino acid alignment while producing a final alignment of cDNA by codon. The result was over 300 multiple sequence alignments containing human and primate orthologs of interferon-related or random genes.
Each multiple sequence alignment was then manually inspected and edited. Our pruning and quality control pipeline consisted of these steps. (1) We removed from the alignments any ortholog containing a gap (missing sequence) that spanned > 10% of the length of the cognate human gene. This was done because PAML won’t analyze codon sites containing a gap. We did not remove an ortholog if it had multiple gaps relative to the human sequence, as long as each gapped region was < 10% of the length of the human gene. (2) We removed from the alignments any ortholog which aligned poorly to other sequences in the alignments for a contiguous stretch spanning > 10% of the length of the human gene. We did this because simian primate sequences tend to align with very high identity since divergence is low in this clade , and such regions usually indicate regions of mis-annotation or gene prediction. (3) We trimmed from alignments sequence at either terminal end (starting at start codon and ending at stop codon) if less than ten orthologs in the alignment had the same start or termination site as the human sequence. In this case, we stopped trimming alignments at the first conserved site (or site where amino acid variation tracked with phylogeny). (4) We manually inspected all remaining gaps in the multiple sequence alignments. We deleted codon columns where more than one amino acid misaligned at the edge of a gap. (5) We deleted all regions in the alignments where an ortholog contained more than four amino acids in a row that did not align to any other orthologs in the alignment. (6) After all of these curation steps, multiple sequence alignments containing less than 10 orthologs (including human) were not analyzed further. This is because we have previously shown that the accuracy of evolutionary tests improves as the number of primate species and overall tree length of an alignment increases .
Positive selection was detected using the Phylogenetic Analysis by Maximum Likelihood (PAML) program. The codeml program packaged in PAML accommodates for the differences in rates of transition/transversion substitutions, unequal codon frequencies, and the probabilities of mutation across the codon . PAML requires the codon alignment be accompanied by a phylogenetic tree to accurately identify rates of substitutions. A master phylogenetic tree with the twenty possible primate species was made using Perelman et al. 2011 as a reference and modified as necessary for each gene . In all cases except in the fitting of data to the M8a model we kept the default parameters of the codeml program for each model and allowed omega to be estimated. In fitting data to the M8a model we fixed omega at 1. The tree length of each multiple sequence alignment was determined from the output file of model M0.
PAML fits the multiple sequence alignments to different models of codon substitution . For the analysis outlined in this paper, we used the M0, M2, M8a, and M8 models. We used likelihood ratio tests to determine which model, M8 or M8a, best fit the data for the evolution of each gene. PAML provides a log likelihood (lnL) value for each alignment in both the null and positive selection models. The difference of these values is then doubled, referred to here as “2ΔlnL”, and used to perform Chi-Square tests with a single degree of freedom. We defined a p-value of p < 0.05 allowing us to reject the null hypothesis that there is no difference in how well models M8 and M8a fit the data. These genes were determined to be evolving under positive selection.
In instances where M8a was rejected in favor of M8, specific codons are identified which have elevated rates of nonsynonymous fixed mutations. This is determined by the Bayes empirical Bayes (BEB) method which accounts for sampling errors in the parameters of the model . The codons identified by BEB, and the posterior probability by which they are predicted to fall in the bin > 1, was recorded (Additional file 1).
Availability of data and materials
The datasets produced and/or analyzed during the current study available from the corresponding author on reasonable request.
Fensterl V, Chattopadhyay S, Sen GC. No love lost between viruses and interferons. Ann Rev Virol. 2015;2:549–72. https://doi.org/10.1146/annurev-virology-100114-055249.
Acosta PL, Byrne AB, Hijano DR, Talarico LB. Human type I interferon antiviral effects in respiratory and reemerging viral infections. J Immunol Res. 2020;2020:1–27.
Mesev EV, LeDesma RA, Ploss A. Decoding type I and III interferon signalling during viral infection. Nat Microbiol. 2019;4:914–24.
McNab F, Mayer-Barber K, Sher A, Wack A, O’Garra A. Type I interferons in infectious disease. Nat Rev Immunol. 2015;15:87–103.
Schneider WM, Chevillotte MD, Rice CM. Interferon-stimulated genes: a complex web of host defenses. Annu Rev Immunol. 2014;32:513–45.
Wimmers F, Subedi N, van Buuringen N, Heister D, Vivié J, Beeren-Reinieren I, et al. Single-cell analysis reveals that stochasticity and paracrine signaling control interferon-alpha production by plasmacytoid dendritic cells. Nat Commun. 2018;9:3317.
Goraya MU, Zaighum F, Sajjad N, Anjum FR, Sakhawat I, ur Rahman S. Web of interferon stimulated antiviral factors to control the influenza A viruses replication. Microb Pathogenesis. 2019;139:103919.
Manry J, Laval G, Patin E, Fornarino S, Itan Y, Fumagalli M, et al. Evolutionary genetic dissection of human interferons. J Exp Medicine. 2011;208:2747–59.
Ten G-S. Strategies of interferon evasion by viruses. Cell Host Microbe. 2017;22:176–84.
Stabell AC, Meyerson NR, Gullberg RC, Gilchrist AR, Webb KJ, Old WM, et al. Dengue viruses cleave STING in humans but not in nonhuman primates, their presumed natural reservoir. Elife. 2018;7:e31919.
Ding Q, Gaska JM, Douam F, Wei L, Kim D, Balev M, et al. Species-specific disruption of STING-dependent antiviral cellular defenses by the Zika virus NS2B3 protease. Proc Natl Acad Sci USA. 2018;540:201803406–9.
Yu C-Y, Chang T-H, Liang J-J, Chiang R-L, Lee Y-L, Liao C-L, et al. Dengue virus targets the adaptor protein MITA to subvert host innate immunity. PLoS Pathog. 2012;8:e1002780.
Ding Q, Cao X, Lu J, Huang B, Liu Y-J, Kato N, et al. Hepatitis C virus NS4B blocks the interaction of STING and TBK1 to evade host innate immunity. J Hepatol. 2013;59:52–8.
Aguirre S, Maestre AM, Pagni S, Patel JR, Savage T, Gutman D, et al. DENV inhibits type I IFN production in infected cells by cleaving human STING. PLoS Pathog. 2012;8:e1002934.
Darnell J, Kerr I, Stark G. Jak-STAT pathways and transcriptional activation in response to IFNs and other extracellular signaling proteins. Science. 1994;264:1415–21.
Munoz-Jordan JL, Laurent-Rolle M, Ashour J, Martinez-Sobrido L, Ashok M, Lipkin WI, et al. Inhibition of alpha/beta interferon signaling by the NS4B protein of flaviviruses. J Virol. 2005;79:8004–13.
Ashour J, Laurent-Rolle M, Shi P-Y, Garcia-Sastre A. NS5 of dengue virus mediates STAT2 binding and degradation. J Virol. 2009;83:5408–18.
Mazzon M, Jones M, Davidson A, Chain B, Jacobs M. Dengue virus NS5 inhibits interferon-alpha signaling by blocking signal transducer and activator of transcription 2 phosphorylation. J Infect Dis. 2009;200:1261–70.
Morrison J, Laurent-Rolle M, Maestre AM, Rajsbaum R, Pisanelli G, Simon V, et al. Dengue virus co-opts UBR4 to degrade STAT2 and antagonize type I interferon signaling. PLoS Pathog. 2013;9:e1003265.
Muñoz-Jordán JL, Sánchez-Burgos GG, Laurent-Rolle M, García-Sastre A. Inhibition of interferon signaling by dengue virus. Proc Natl Acad Sci USA. 2003;100:14333–8.
Thurmond S, Wang B, Song J, Hai R. Suppression of type I interferon signaling by flavivirus NS5. Viruses. 2018;10:712.
Barreiro LB, Quintana-Murci L. From evolutionary genetics to human immunology: how selection shapes host defence genes. Nat Rev Genet. 2010;11:17–30.
Meyerson NR, Sawyer SL. Two-stepping through time: mammals and viruses. Trends Microbiol. 2011;19:286–94.
Enard D, Cai L, Gwennap C, Petrov DA. Viruses are a dominant driver of protein adaptation in mammals. Elife. 2016. https://doi.org/10.7554/eLife.12469.
Wilke CO, Sawyer SL. At the mercy of viruses. Elife. 2016;5:e16758.
Warren CJ, Sawyer SL. How host genetics dictates successful viral zoonosis. Plos Biol. 2019;17:e3000217. https://doi.org/10.1371/journal.pbio.3000217.
Hancks DC, Hartley MK, Hagan C, Clark NL, Elde NC. Overlapping patterns of rapid evolution in the nucleic acid sensors cGAS and OAS1 suggest a common mechanism of pathogen antagonism and escape. PLoS Genet. 2015;11:e1005203.
Mozzi A, Pontremoli C, Forni D, Clerici M, Pozzoli U, Bresolin N, et al. OASes and STING: adaptive evolution in concert. Genome Biol Evol. 2015;7:1016–32.
Laguette N, Rahm N, Sobhian B, Chable-Bessia C, Münch J, Snoeck J, et al. Evolutionary and functional analyses of the interaction between the myeloid restriction factor SAMHD1 and the lentiviral Vpx protein. Cell Host Microbe. 2012;11:205–17.
Sawyer SL, Wu LI, Emerman M, Malik HS. Positive selection of primate TRIM5 identifies a critical species-specific retroviral restriction domain. Proc Natl Acad Sci. 2005;102:2832–7.
Meyerson NR, Sharma A, Wilkerson GK, Overbaugh J, Sawyer SL. Identification of owl monkey CD4 receptors broadly compatible with early-stage HIV-1 isolates. J Virol. 2015;89:8611–22.
Rowley PA, Ho B, Bushong S, Johnson A, Sawyer SL. XRN1 is a species-specific virus restriction factor in yeasts. Plos Pathog. 2016;12:e1005890.
Meyerson NR, Warren CJ, Vieira DASA, Diaz-Griferro F, Sawyer SL. Species-specific vulnerability of RanBP2 shaped the evolution of SIV as it transmitted in African apes. Plos Pathog. 2018;14:e1006906.
Sawyer SL, Elde NC. A cross-species view on viruses. Curr Opin Virol. 2012;2:561–8.
Demogines A, Abraham J, Choe H, Farzan M, Sawyer SL. Dual host-virus arms races shape an essential housekeeping protein. Plos Biol. 2013;11:e1001571.
Demogines A, Farzan M, Sawyer SL. Evidence for ACE2-utilizing coronaviruses (CoVs) related to severe acute respiratory syndrome CoV in bats. J Virol. 2012;86:6350–3.
Kaelber JT, Demogines A, Harbison CE, Allison AB, Goodman LB, Ortega AN, et al. Evolutionary reconstructions of the transferrin receptor of Caniforms supports canine parvovirus being a re-emerged and not a novel pathogen in dogs. Plos Pathog. 2012;8:e1002666.
Ng M, Ndungo E, Kaczmarek ME, Herbert AS, Binger T, Kuehne AI, et al. Filovirus receptor NPC1 contributes to species-specific patterns of ebolavirus susceptibility in bats. Elife. 2015;4:e11785.
Johnson WE, Sawyer SL. Molecular evolution of the antiretroviral TRIM5 gene. Immunogenetics. 2009;61:163–76.
Meyerson NR, Zhou L, Guo YR, Zhao C, Tao YJ, Krug RM, et al. Nuclear TRIM25 specifically targets influenza virus ribonucleoproteins to block the onset of RNA chain elongation. Cell Host Microbe. 2017;22(627–638):e7.
Stabell AC, Hawkins J, Li M, Gao X, David M, Press WH, et al. Non-human primate schlafen11 inhibits production of both host and viral proteins. Plos Pathog. 2016;12:e1006066.
Boys IN, Xu E, Mar KB, Cruz-Rivera PCDL, Eitson JL, Moon B, et al. RTP4 is a potent IFN-inducible anti-flavivirus effector engaged in a host-virus arms race in bats and other mammals. Cell Host Microbe. 2020;28(712–723):e9.
Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–91.
Perelman P, Johnson WE, Roos C, Seuánez HN, Horvath JE, Moreira MAM, et al. A molecular phylogeny of living primates. PLoS Genet. 2011;7:e1001342.
Hurst LD. The Ka/Ks ratio: diagnosing the form of sequence evolution. Trends Genet: TIG. 2002;18:486.
Bielawski JP, Yang Z. Maximum likelihood methods for detecting adaptive protein evolution. undefined. 2004;1–34. message:%3CDE5FAA75-E246-496B-881E-5B5E7E7537B5@gmail.com%3E.
Daugherty MD, Malik HS. Rules of engagement: molecular insights from host-virus arms races. Annu Rev Genet. 2012;46:677–700.
Yang Z, Nielsen R. Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage. Mol Biol Evol. 2008;25:568–79.
Smith EE, Malik HS. The apolipoprotein L family of programmed cell death and immunity genes rapidly evolved in primates at discrete sites of host-pathogen interactions. Genome Res. 2009;19:850–8. https://doi.org/10.1101/gr.085647.108.
Lim ES, Malik HS, Emerman M. Ancient adaptive evolution of tetherin shaped the functions of vpu and nef in human immunodeficiency virus and primate lentiviruses. J Virol. 2010;84:7124–34.
Slevogt H, Zabel S, Opitz B, Hocke A, Eitel J, N’Guessan PD, et al. CEACAM1 inhibits Toll-like receptor 2–triggered antibacterial responses of human pulmonary epithelial cells. Nat Immunol. 2008;9:1270–8.
Kohler KM, Kutsch M, Piro AS, Wallace G, Coers J, Barber MF. A rapidly evolving polybasic motif modulates bacterial detection by guanylate binding proteins. Biorxiv. 2019;689554.
Mitchell PS, Patzina C, Emerman M, Haller O, Malik HS, Kochs G. Evolution-guided identification of antiviral specificity determinants in the broadly acting interferon-induced innate immunity factor MxA. Cell Host Microbe. 2012;12:598–604.
Sironi M, Biasin M, Cagliani R, Gnudi F, Saulle I, Ibba S, et al. Evolutionary analysis identifies an MX2 haplotype associated with natural resistance to HIV-1 infection. Mol Biol Evol. 2014;31:2402–14.
Monit C, Morris ER, Ruis C, Szafran B, Thiltgen G, Tsai M-HC, et al. Positive selection in dNTPase SAMHD1 throughout mammalian evolution. Proc Natl Acad Sci. 2019;116:18647–54.
McLysaght A, Baldi PF, Gaut BS. Extensive gene gain associated with adaptive evolution of poxviruses. Proc National Acad Sci. 2003;100:15655–60.
Kerns JA, Emerman M, Malik HS. Positive selection and increased antiviral activity associated with the PARP-containing isoform of human zinc-finger antiviral protein. Plos Genet. 2008;4:e21.
Vasseur E, Boniotto M, Patin E, Laval G, Quach H, Manry J, et al. The evolutionary landscape of cytosolic microbial sensors in humans. Am J Hum Genetics. 2012;91:27–37.
Gayà-Vidal M, Albà MM. Uncovering adaptive evolution in the human lineage. BMC Genomics. 2014;15:599.
Patel MR, Loo Y-M, Horner SM, Gale M, Malik HS. Convergent evolution of escape from hepaciviral antagonism in primates. Plos Biol. 2012;10:e1001282.
Jin W, Wu D-D, Zhang X, Irwin DM, Zhang Y-P. Positive selection on the gene RNASEL: correlation between patterns of evolution and function. Mol Biol Evol. 2012;29:3161–8.
Wlasiuk G, Nachman MW. Adaptation and constraint at toll-like receptors in primates. Mol Biol Evol. 2010;27:2172–86.
Wlasiuk G, Khan S, Switzer WM, Nachman MW. A history of recurrent positive selection at the toll-like receptor 5 in primates. Mol Biol Evol. 2009;26:937–49.
Malfavon-Borja R, Sawyer SL, Wu LI, Emerman M, Malik HS. An Evolutionary screen highlights canonical and noncanonical candidate antiviral genes within the primate TRIM gene family. Genome Biol Evol. 2013;5:2141–54.
Sironi M, Cagliani R, Forni D, Clerici M. Evolutionary insights into host–pathogen interactions from mammalian sequence data. Nat Rev Genet. 2015;16:224–36.
Secombes CJ, Zou J. Evolution of interferons and interferon receptors. Front Immunol. 2017;8:209.
Yánez DC, Ross S, Crompton T. The IFITM protein family in adaptive immunity. Immunology. 2020;159:365–72.
Li MMH, MacDonald MR, Rice CM. To translate, or not to translate: viral and host mRNA regulation by interferon-stimulated genes. Trends Cell Biol. 2015;25:320–9.
Schoggins JW. Interferon-stimulated genes: roles in viral pathogenesis. Curr Opin Virol. 2014;6:40–6.
Hornung V. SnapShot: nucleic acid immune sensors, part 1. Immunity. 2014;41(868–868):e1.
Hornung V. SnapShot: nucleic acid immune sensors, part 2. Immunity. 2014;41(1066–1066):e1.
Kawai T, Akira S. The roles of TLRs, RLRs and NLRs in pathogen recognition. Int Immunol. 2009;21:317–37.
Shuai K, Liu B. Regulation of JAK–STAT signalling in the immune system. Nat Rev Immunol. 2003;3:900–11.
Barber GN. STING-dependent signaling. Nat Immunol. 2011;12:929–30.
Lou DI, Kim ET, Meyerson NR, Pancholi NJ, Mohni KN, Enard D, et al. An Intrinsically disordered region of the DNA repair protein Nbs1 is a species-specific barrier to herpes simplex virus 1 in primates. Cell Host Microbe. 2016;20:178–88.
Lou DI, McBee RM, Le UQ, Stone AC, Wilkerson GK, Demogines AM, et al. Rapid evolution of BRCA1 and BRCA2 in humans and other primates. Bmc Evol Biol. 2014;14:155.
Demogines A, East AM, Lee J-H, Grossman SR, Sabeti PC, Paull TT, et al. Ancient and recent adaptive evolution of primate non-homologous end joining genes. Plos Genet. 2010;6:e1001169.
Kane M, Zang TM, Rihn SJ, Zhang F, Kueck T, Alim M, et al. Identification of interferon-stimulated genes with antiretroviral activity. Cell Host Microbe. 2016;20:392–405.
Schoggins JW, Wilson SJ, Panis M, Murphy MY, Jones CT, Bieniasz P, et al. A diverse range of gene products are effectors of the type I interferon antiviral response. Nature. 2011;472:481–5.
Schoggins JW, Rice CM. Interferon-stimulated genes and their antiviral effector functions. Curr Opin Virol. 2011;1:519–25.
Rusinova I, Forster S, Yu S, Kannan A, Masse M, Cumming H, et al. INTERFEROME v2.0: an updated database of annotated interferon-regulated genes. Nucleic Acids Res. 2013;41:D1040–6.
Čermák V. Molecular Biology Online Apps. undefined. 2020. http://www.molbiotools.com/manual/randomgenesetgenman.html.
Okonechnikov K, Golosova O, Fursov M. Unipro UGENE: a unified bioinformatics toolkit. Bioinformatics. 2012;28:1166–7.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.
Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34(suppl_2):W609–12.
McBee RM, Rozmiarek SA, Meyerson NR, Rowley PA, Sawyer SL. The effect of species representation on the detection of positive selection in primate gene data sets. Mol Biol Evol. 2015;32:1091–6.
We thank Arturo Barbachano-Guerrero, Vanessa Bauer, Maryska Kaczmarek, Emma Worden-Sapper, and Sharon Wu for valuable insight on this work.
This work was sponsored by the National Institutes of Health (DP1-DA-046108 and R01-AI-137011 to SLS; T32-GM08759 to ARG). SLS is a Burroughs Wellcome Fund Investigator in the Pathogenesis of Infectious Disease. Funding bodies played no role in the design of the study, or in the collection, analysis, interpretation of data, or writing of the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Judd, E.N., Gilchrist, A.R., Meyerson, N.R. et al. Positive natural selection in primate genes of the type I interferon response. BMC Ecol Evo 21, 65 (2021). https://doi.org/10.1186/s12862-021-01783-z