- Research article
- Open Access
Different alternative splicing patterns are subject to opposite selection pressure for protein reading frame preservation
BMC Evolutionary Biology volume 7, Article number: 179 (2007)
Alternative splicing (AS) has been regarded capable of altering selection pressure on protein subsequences. Particularly, the frequency of reading frame preservation (FRFP), as a measure of selection pressure, has been reported to be higher in alternatively spliced exons (ASEs) than in constitutively spliced exons (CSEs). However, recently it has been reported that different ASE types – simple and complex ASEs – may be subject to opposite selection forces. Therefore, it is necessary to re-evaluate the evolutionary effects of such splicing patterns on frame preservation.
Here we show that simple and complex ASEs, respectively, have higher and lower FRFPs than CSEs. Since complex ASEs may alter the ends of their flanking exons, the selection pressure on frame preservation is likely relaxed in this ASE type. Furthermore, conservation of the ASE/CSE splicing pattern increases the FRFPs of simple ASEs but decreases those of complex ASEs. Contrary to the well-recognized concept of strong selection pressure on conserved ASEs for protein reading frame preservation, our results show that conserved complex ASEs are relaxed from such pressure and the frame-disrupting effect caused by the insertion of complex ASEs can be offset by compensatory changes in their flanking exons.
In this study, we find that simple and complex ASEs undergo opposite selection pressure for protein reading frame preservation, with CSEs in-between. Simple ASEs have much higher FRFPs than complex ones. We further find that the FRFPs of complex ASEs coupled with flanking exons are close to those of simple ASEs, indicating that neighboring exons of an ASE may evolve in a coordinated way to avoid protein dysfunction. Therefore, we suggest that evolutionary analyses of AS should take into consideration the effects of different splicing patterns and the joint effects of multiple AS events.
Alternative splicing (AS) is a topic of increasing interests because it has been suggested to be an important contributor to transcriptome/proteome complexity, gene function, and a wide variety of biological processes [1–7]. Previous studies have reported that as high as 40~80% of human genes undergo AS [8–12]. Of the observed AS events in mammals, the most common AS event is "cassette exon". It can add or remove an individual exon in a transcript [13–15]. Cassette exons are sometimes referred to as alternatively spliced exons (ASEs) [16–23]. It has been suggested that ASEs and constitutively spliced exons (CSEs, exons that are always included in the transcript) are under different selection pressures and evolve at distinct rates – the former have higher nonsynonymous (Ka) substitution rates but lower synonymous (Ks) substitution rates than the latter [16, 18, 19, 24–26]. ASEs are regarded as under relaxed selection pressure because of their dispensability in transcripts. Also, conserved ASEs (i.e., exons are alternatively spliced in a pair of compared species) have been suggested to be constrained for preservation of the reading frame [18, 22, 27]. Many studies have pointed out that preservation of reading frame may indicate functional selection pressure of an AS event [18, 22, 27–29].
Recently, the Alternative Splicing Database (ASD) project at European Bioinformatics Institute (EBI)  further classifies cassette exons into simple and complex cassette exons ("simple ASEs" and "complex ASEs"). Complex ASEs differ from simple ones in that the former change the lengths of one or both of their flanking exons when they are included in the transcripts, whereas the latter do not (see Fig. 1). Therefore, inclusion of a complex ASE results in simultaneous changes of two or three exons. In contrast, inclusion of a simple ASE does not alter its flanking exon(s) and appear to cause fewer changes. Chen et al. have reported that simple ASEs have higher Ka and lower Ks than CSEs, whereas complex ASEs have evolutionary rates to the opposite of simple ASEs vs. CSEs . They also found that GC contents and codon usage bias are associated with increased Ks values in complex ASEs but not in simple ones . Such observation modified the previous view that ASEs accelerate evolution of protein subsequences. However, whether simple/complex splicing pattern is related to preservation of reading frame has not been investigated.
Results and discussion
Since ASEs in one species may be CSEs in the other (i.e. lineage-specific ASE/CSEs), it is necessary to specify the splicing pattern of the exons studied. Therefore, we classify ASEs into three major groups according to splicing pattern conservation (see Materials and Methods). Each group is subsequently divided into four subsets (Table 1). We then compare the frequencies of reading frame preservation (designated as "FRFP", i.e., the proportions of exons of which the lengths are divisible by 3) between simple and complex ASEs (Fig. 2). For Group A, the FRFPs for human (mouse) simple and complex ASEs are 43.0% (45.7%) and 37.9% (35.5%), respectively. In comparison, the FRFPs of CSEs approximate 40% (39.7% in human and 39.5% in mouse ). It has been well recognized that CSEs have lower FRFPs than ASEs. However, we find that although this is true for simple ASEs (P-values < 0.01 in both human and mouse; all statistical tests used in this section are the Fisher's exact test), it does not seem to hold for complex ASEs. The FRFPs are higher in CSEs than in complex ASEs in both species, though the differences are not highly significant (both P-values > 0.01). Overall, our results indicate that simple and com1plex ASEs are under opposite selection pressure for protein reading frame preservation. Particularly, complex ASEs differ significantly in FRFP from commonly regarded ASEs, which are dominated in number by simple ASEs. We then extract conserved ASEs (Group C) from Group A. Note that "conservation" here refers to the conservation of the ASE/CSE splicing pattern between human and mouse, rather than the simple/complex pattern. We find that the FRFP of Group C simple ASEs increases to 49.8% in human and 53.4% in mouse (Fig. 2). Meanwhile, for Group C complex ASEs, the FRFPs decrease to <35% (34.3% for human; 33.3% for mouse) (Fig. 2). It is obvious that simple ASEs in Group C have higher FRFPs than in Group A, whereas the reverse is true for complex ASEs in both human and mouse. We then compare the FRFPs of simple and complex ASEs with those of CSEs. For simple ASEs, Group A has lower FRFPs than Group C, while both groups have higher FRFPs than those of CSEs (Fig. 3). However, for complex ASEs, the trend is reversed. Even if the expected FRFP of CSEs is set as 45% , the trends still hold well in conserved ASEs. Therefore, simple and complex ASEs seem to cause FRFP changes to the opposite ends when compared with CSEs. Note that the "CSEs" stated above are those with unspecified splicing pattern conservation. We therefore retrieve 21,669 pairs of conserved CSEs for comparison. The FRFPs of conserved CSEs are 38.4% in human and 38.3% in mouse, respectively. These figures further confirm our observations that CSEs tend to have higher FRFP than complex ASEs but lower FRFP than simple ones. Overall, our result supports Chen et al's suggestion that simple and complex ASEs cause evolutionary changes to the contrary ends with CSEs in-between .
To further probe the effects of splicing pattern conservation on frame preservation, we compare the FRFPs between conserved and lineage-specific ASEs (Groups C and B). As shown in Figure 2, for simple ASEs, conservation of ASE/CSE splicing pattern results in an increase in FRFP. In contrast, splicing pattern conservation causes the FRFP to drop in complex ASEs, such observation disobeys the previous view [18, 22, 27] that conserved ASEs have a higher probability to be frame-preserving than lineage-specific ones.
On the other hand, also see Table 1, we find that >70% of the ASEs (either simple or complex) have CSE counterparts in the other species, indicating that AS patterns tend not to be evolutionarily conserved in human and mouse. If only conserved ASEs are considered, the simple splicing pattern has a much higher probability of being conserved between human and mouse than the complex splicing pattern (Table 2). The result indicates that most complex ASEs are lineage-specific.
Another issue of interest is that, since a complex ASE looks like a simple event plus one (or two) exon extension/truncation event(s) (see Fig. 1B), the FRFPs of complex ASEs may in fact reflect the effects of exon extension/truncation events. However, as shown in Table 3, we find that the FRFPs in the lineage-specific exon extension/truncation events are around 50%, whereas in conserved events, the FRFPs significantly increase to over 60% (both P-values < 0.001; Table 3). Such an increase in FRFP towards conserved ASEs is similar to what is observed in simple ASEs. Therefore, exon extension/truncation events and complex ASEs may be under different selection pressures for reading frame preservation. We speculate that a complex splicing event is rather an integrated "module" that requires synchronized changes in neighboring exons, than merely a simple ASE accidentally coupled with exon extension/truncation events. To find support for this hypothesis, we further analyze whether the length changes caused by complex ASEs and their flanking exons can offset the frame-shifting effects of each other and retain the upstream reading frame. We find that the FRFPs of complex ASEs coupled with flanking exons (complex+flanking exons) are close to those of simple ASEs (Fig. 3). In Group C, the FRFPs of complex+flanking exons (49.2% in human and 47.8% in mouse) are significantly higher than those of conserved CSEs (dashed lines in Fig. 3; both P-values < 0.01). Therefore, the selection pressure for frame preservation may apply to transcripts as a whole, but not to complex ASEs per se. Furthermore, our results imply that in an alternatively spliced transcript, neighboring exons of an ASE may evolve in a coordinated way to avoid protein dysfunction.
In sum, one surprising finding of this study is that the FRFP of complex ASEs is lower than that of CSEs. Our result suggests that the frame-shifting effects of complex ASEs are rescued by the compensatory changes in the flanking exons, thus leaving the downstream protein reading frames unaltered. Therefore, complex ASEs appear to be more relaxed from selection pressure than simple ones in terms of reading frame preservation. One possible reason is that most observed ASEs (>80%) are simple ASEs (see Table 1) and the previously analyzed results are likely dominated by the effects of these exons. If we divide ASEs into simple and complex ASEs, the opposite evolutionary effects between them are observed. Previously, we have reported that complex ASEs are under stronger selection pressure against amino acid changes than simple ones . In addition, we find that exons that participate in both simple and complex AS events have intermediate FRFPs, which fall between those of simple and complex ASEs (data not shown). In sum, our results reveal that, simple and complex ASEs have quite distinct evolutionary features. It appears that both simple and complex AS patterns have functional importance in view of the two different forms of selection pressure (protein sequence conservation and reading frame preservation) for which they are constrained. Although the biology of complex ASEs has rarely been documented, it is likely that this ASE type has resulted from a different molecular mechanism and played a different role from that of simple ASEs.
We used 5,176 orthologous gene pairs of human and mouse from the EBI database  and extracted reciprocal best-hit coding exon pairs using the BLAST package (version 2.2.11 from NCBI website). The human and mouse files used to annotate exon types (including the ASE types) were download from ASD (AltSplice Human Release 2 based on Ensembl 27.35a.1 and AltSplice Mouse Release 2 based on Ensembl 27.33c.1 [30, 34]). Based on the above information, also see Table 1, we divided the extracted human-mouse exon pairs into three groups: A. ASE conservation unspecified (i.e., simple/complex ASEs vs. all exons, the ASEs of which the ASE/CSE splicing patterns of the orthologous exons are not limited), B. lineage-specific ASE (i.e., simple/complex ASEs vs. CSEs, the ASEs of which the orthologous exons are CSEs) and C. conserved ASE (i.e., simple/complex ASEs vs. all ASEs) groups. Note that "All exons" include CSEs and all ASEs; whereas "All ASEs" include simple ASEs, complex ASEs, and uncertain ASE type. Groups B and C are subsets of Group A.
The sequences and exon types of Groups A, B, and C human-mouse orthologous exons analyzed in this study are available at our web site .
Alternatively Spliced Exon
Constitutively Sliced Exon
Frequency of Reading Frame Preservation
Bracco L, Kearsey J: The relevance of alternative RNA splicing to pharmacogenomics. Trends Biotechnol. 2003, 21 (8): 346-353. 10.1016/S0167-7799(03)00146-X.
Brett D, Pospisil H, Valcarcel J, Reich J, Bork P: Alternative splicing and genome complexity. Nat Genet. 2002, 30 (1): 29-30. 10.1038/ng803.
Stetefeld J, Ruegg MA: Structural and functional diversity generated by alternative mRNA splicing. Trends Biochem Sci. 2005, 30 (9): 515-521. 10.1016/j.tibs.2005.07.001.
Lipscombe D: Neuronal proteins custom designed by alternative splicing. Curr Opin Neurobiol. 2005, 15 (3): 358-363. 10.1016/j.conb.2005.04.002.
Lareau LF, Green RE, Bhatnagar RS, Brenner SE: The evolving roles of alternative splicing. Curr Opin Struct Biol. 2004, 14 (3): 273-282. 10.1016/j.sbi.2004.05.002.
Modrek B, Lee C: A genomic view of alternative splicing. Nat Genet. 2002, 30 (1): 13-19. 10.1038/ng0102-13.
Novembre FJ, Saucier M, Anderson DC, Klumpp SA, O'Neil SP, Brown CR, Hart CE, Guenthner PC, Swenson RB, McClure HM: Development of AIDS in a chimpanzee infected with human immunodeficiency virus type 1. J Virol. 1997, 71 (5): 4086-4091.
Mironov AA, Fickett JW, Gelfand MS: Frequent alternative splicing of human genes. Genome Res. 1999, 9 (12): 1288-1293. 10.1101/gr.9.12.1288.
Kan Z, Rouchka EC, Gish WR, States DJ: Gene structure prediction and alternative splicing analysis using genomically aligned ESTs. Genome Res. 2001, 11 (5): 889-900. 10.1101/gr.155001.
Modrek B, Resch A, Grasso C, Lee C: Genome-wide detection of alternative splicing in expressed sequences of human genes. Nucleic Acids Res. 2001, 29 (13): 2850-2859. 10.1093/nar/29.13.2850.
Kan Z, States D, Gish W: Selecting for functional alternative splices in ESTs. Genome Res. 2002, 12 (12): 1837-1845. 10.1101/gr.764102.
Kampa D, Cheng J, Kapranov P, Yamanaka M, Brubaker S, Cawley S, Drenkow J, Piccolboni A, Bekiranov S, Helt G, Tammana H, Gingeras TR: Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22. Genome Res. 2004, 14 (3): 331-342. 10.1101/gr.2094104.
Maniatis T, Tasic B: Alternative pre-mRNA splicing and proteome expansion in metazoans. Nature. 2002, 418 (6894): 236-243. 10.1038/418236a.
Black DL, Grabowski PJ: Alternative pre-mRNA splicing and neuronal function. Prog Mol Subcell Biol. 2003, 31: 187-216.
Lopez AJ: Alternative splicing of pre-mRNA: developmental consequences and mechanisms of regulation. Annu Rev Genet. 1998, 32: 279-305. 10.1146/annurev.genet.32.1.279.
Chen FC, Wang SS, Chen CJ, Li WH, Chuang TJ: Alternatively and constitutively spliced exons are subject to different evolutionary forces. Mol Biol Evol. 2006, 23 (3): 675-682. 10.1093/molbev/msj081.
Chen FC, Chuang TJ: The effects of multiple features of alternatively spliced exons on the Ka/Ks ratio test. BMC Bioinformatics. 2006, 7 (1): 259-10.1186/1471-2105-7-259.
Xing Y, Lee C: Evidence of functional selection pressure for alternative splicing events that accelerate evolution of protein subsequences. Proc Natl Acad Sci U S A. 2005, 102 (38): 13526-13531. 10.1073/pnas.0501213102.
Modrek B, Lee CJ: Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss. Nat Genet. 2003, 34 (2): 177-180. 10.1038/ng1159.
Cusack BP, Wolfe KH: Changes in alternative splicing of human and mouse genes are accompanied by faster evolution of constitutive exons. Mol Biol Evol. 2005, 22 (11): 2198-2208. 10.1093/molbev/msi218.
Xing Y, Lee C: Assessing the application of Ka/Ks ratio test to alternatively spliced exons. Bioinformatics. 2005, 21 (19): 3701-3703. 10.1093/bioinformatics/bti613.
Resch A, Xing Y, Alekseyenko A, Modrek B, Lee C: Evidence for a subpopulation of conserved alternative splicing events under selection pressure for protein reading frame preservation. Nucleic Acids Res. 2004, 32 (4): 1261-1269. 10.1093/nar/gkh284.
Sorek R, Ast G: Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. Genome Res. 2003, 13 (7): 1631-1637. 10.1101/gr.1208803.
Hurst LD, Pal C: Evidence for purifying selection acting on silent sites in BRCA1. Trends Genet. 2001, 17 (2): 62-65. 10.1016/S0168-9525(00)02173-9.
Filip LC, Mundy NI: Rapid evolution by positive Darwinian selection in the extracellular domain of the abundant lymphocyte protein CD45 in primates. Mol Biol Evol. 2004, 21 (8): 1504-1511. 10.1093/molbev/msh111.
Iida K, Akashi H: A test of translational selection at 'silent' sites in the human genome: base composition comparisons in alternatively spliced genes. Gene. 2000, 261 (1): 93-105. 10.1016/S0378-1119(00)00482-0.
Thanaraj TA, Clark F, Muilu J: Conservation of human alternative splice events in mouse. Nucleic Acids Res. 2003, 31 (10): 2544-2552. 10.1093/nar/gkg355.
Philipps DL, Park JW, Graveley BR: A computational and experimental approach toward a priori identification of alternatively spliced exons. Rna. 2004, 10 (12): 1838-1844. 10.1261/rna.7136104.
Sorek R, Shemesh R, Cohen Y, Basechess O, Ast G, Shamir R: A non-EST-based method for exon-skipping prediction. Genome Res. 2004, 14 (8): 1617-1623. 10.1101/gr.2572604.
Stamm S, Riethoven JJ, Le Texier V, Gopalakrishnan C, Kumanduri V, Tang Y, Barbosa-Morais NL, Thanaraj TA: ASD: a bioinformatics resource on alternative splicing. Nucleic Acids Res. 2006, 34 (Database issue): D46-55. 10.1093/nar/gkj031.
Chen FC, Chaw SM, Tzeng YH, Wang SS, Chuang TJ: Opposite evolutionary effects between different alternative splicing patterns. Mol Biol Evol. 2007, 24 (7): 1443-1446. 10.1093/molbev/msm072.
de Souza SJ, Long M, Klein RJ, Roy S, Lin S, Gilbert W: Toward a resolution of the introns early/late debate: only phase zero introns are correlated with the structure of ancient proteins. Proc Natl Acad Sci U S A. 1998, 95 (9): 5094-5099. 10.1073/pnas.95.9.5094.
The EBI database. [http://www.ebi.ac.uk/]
The ASD database . [http://www.ebi.ac.uk/asd/]
The sequences of exons analyzed in this study . [http://www.sinica.edu.tw/~trees/Simple_Complex/Reading_frame.htm]
This work is supported by the Genomics Research Center, Academia Sinica, Taiwan; the National Health Research Institutes (NHRI), Taiwan (under contract NHRI-EX96-9408PC); National Science Council, Taiwan (under contract NSC 96-2628-B-001-005-MY3) (above to TJC); and NHRI intramural funding (to FCC). We thank EBI-ASD Web interface for freely downloaded data. Special thanks are due to 2 anonymous reviewers who provided very suggestive and helpful comments to the authors.
The author(s) declares that there are no competing interests.
TJC conceived the study. FCC analyzed the data. TJC and FCC wrote the draft. Both authors read and approved the final manuscript.
About this article
Cite this article
Chen, FC., Chuang, TJ. Different alternative splicing patterns are subject to opposite selection pressure for protein reading frame preservation. BMC Evol Biol 7, 179 (2007). https://doi.org/10.1186/1471-2148-7-179
- Alternative Splice
- Splice Pattern
- Cassette Exon
- Flank Exon
- Exon Pair