Chothia C, Gough J, Vogel C, Teichmann SA: Evolution of the protein repertoire. Science. 2003, 300 (5626): 1701-1703. 10.1126/science.1085371.
Muller A, MacCallum RM, Sternberg MJ: Structural characterization of the human proteome. Genome Res. 2002, 12 (11): 1625-1641. 10.1101/gr.221202.
Vogel C, Bashton M, Kerrison ND, Chothia C, Teichmann SA: Structure, function and evolution of multidomain proteins. Curr Opin Struct Biol. 2004, 14 (2): 208-216. 10.1016/j.sbi.2004.03.011.
Ekman D, Bjorklund AK, Frey-Skott J, Elofsson A: Multi-domain proteins in the three kingdoms of life: orphan domains and other unassigned regions. J Mol Biol. 2005, 348 (1): 231-243. 10.1016/j.jmb.2005.02.007.
Moore AD, Bjorklund AK, Ekman D, Bornberg-Bauer E, Elofsson A: Arrangements in the modular evolution of proteins. Trends Biochem Sci. 2008, 33 (9): 444-451. 10.1016/j.tibs.2008.05.008.
Buljan M, Bateman A: The evolution of protein domain families. Biochem Soc Trans. 2009, 37 (Pt 4): 751-755.
Pal LR, Guda C: Tracing the origin of functional and conserved domains in the human proteome: implications for protein evolution at the modular level. BMC Evol Biol. 2006, 6: 91. 10.1186/1471-2148-6-91.
Apic G, Gough J, Teichmann SA: An insight into domain combinations. Bioinformatics. 2001, 17 (Suppl 1): S83-S89. 10.1093/bioinformatics/17.suppl_1.S83.
Marsh JA, Teichmann SA: How do proteins gain new domains?. Genome Biol. 2010, 11 (7): 126. 10.1186/gb-2010-11-7-126.
Buljan M, Frankish A, Bateman A: Quantifying the mechanisms of domain gain in animal proteins. Genome Biol. 2010, 11 (7): R74. 10.1186/gb-2010-11-7-r74.
Moore AD, Bornberg-Bauer E: The dynamics and evolutionary potential of domain loss and emergence. Mol Biol Evol. 2012, 29 (2): 787-796. 10.1093/molbev/msr250.
Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14 (9): 755-763. 10.1093/bioinformatics/14.9.755.
Capra JA, Williams AG, Pollard KS: ProteinHistorian: tools for the comparative analysis of eukaryote protein origin. PLoS Comput Biol. 2012, 8 (6): e1002567. 10.1371/journal.pcbi.1002567.
Margolin JF, Friedman JR, Meyer WK, Vissing H, Thiesen HJ, Rauscher FJ: Kruppel-associated boxes are potent transcriptional repression domains. Proc Natl Acad Sci U S A. 1994, 91 (10): 4509-4513. 10.1073/pnas.91.10.4509.
Toll-Riera M, Rado-Trilla N, Martys F, Alba MM: Role of low-complexity sequences in the formation of novel protein coding sequences. Mol Biol Evol. 2012, 29 (3): 883-886. 10.1093/molbev/msr263.
Gibbs S, Fijneman R, Wiegant J, van Kessel AG, van De Putte P, Backendorf C: Molecular characterization and evolution of the SPRR family of keratinocyte differentiation markers encoding small proline-rich proteins. Genomics. 1993, 16 (3): 630-637. 10.1006/geno.1993.1240.
Capra JA, Pollard KS, Singh M: Novel genes exhibit distinct patterns of function acquisition and network integration. Genome Biol. 2010, 11 (12): R127. 10.1186/gb-2010-11-12-r127.
Tautz D, Domazet-Loso T: The evolutionary origin of orphan genes. Nat Rev Genet. 2011, 12 (10): 692-702. 10.1038/nrg3053.
Domazet-Loso T, Tautz D: An evolutionary analysis of orphan genes in Drosophila. Genome Res. 2003, 13 (10): 2213-2219. 10.1101/gr.1311003.
Toll-Riera M, Bosch N, Bellora N, Castelo R, Armengol L, Estivill X, Alba MM: Origin of primate orphan genes: a comparative genomics approach. Mol Biol Evol. 2009, 26 (3): 603-612.
Alba MM, Castresana J: Inverse relationship between evolutionary rate and age of mammalian genes. Mol Biol Evol. 2005, 22 (3): 598-606.
Cai JJ, Woo PC, Lau SK, Smith DK, Yuen KY: Accelerated evolutionary rate may be responsible for the emergence of lineage-specific genes in ascomycota. J Mol Evol. 2006, 63 (1): 1-11. 10.1007/s00239-004-0372-5.
Cai JJ, Petrov DA: Relaxed purifying selection and possibly high rate of adaptation in primate lineage-specific genes. Genome Biol Evol. 2010, 2: 393-409. 10.1093/gbe/evq019.
Hubbard TJ, Aken BL, Ayling S, Ballester B, Beal K, Bragin E, Brent S, Chen Y, Clapham P, Clarke L: Ensembl 2009. Nucleic Acids Res. 2009, 37 (Database issue): D690-D697.
Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J: The Pfam protein families database. Nucleic Acids Res. 2012, 40 (Database issue): D290-D301.
Williams AJ, Blacklow SC, Collins T: The zinc finger-associated SCAN box is a conserved oligomerization domain. Mol Cell Biol. 1999, 19 (12): 8526-8535.
Emerson RO, Thomas JH: Gypsy and the birth of the SCAN domain. J Virol. 2011, 85 (22): 12043-12052. 10.1128/JVI.00867-11.
Castresana J, Guigo R, Alba MM: Clustering of genes coding for DNA binding proteins in a region of atypical evolution of the human genome. J Mol Evol. 2004, 59 (1): 72-79.
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W: Initial sequencing and analysis of the human genome. Nature. 2001, 409 (6822): 860-921. 10.1038/35057062.
Rattan R, Narita K, Chien J, Maguire JL, Shridhar R, Giri S, Shridhar V: TCEAL7, a putative tumor suppressor gene, negatively regulates NF-kappaB pathway. Oncogene. 2010, 29 (9): 1362-1373. 10.1038/onc.2009.431.
Ekman D, Bjorklund AK, Elofsson A: Quantification of the elevated rate of domain rearrangements in metazoa. J Mol Biol. 2007, 372 (5): 1337-1348. 10.1016/j.jmb.2007.06.022.
Laurie S, Toll-Riera M, Rado-Trilla N, Alba MM: Sequence shortening in the rodent ancestor. Genome Res. 2012, 22 (3): 478-485. 10.1101/gr.121897.111.
Bjorklund AK, Ekman D, Light S, Frey-Skott J, Elofsson A: Domain rearrangements in protein evolution. J Mol Biol. 2005, 353 (4): 911-923. 10.1016/j.jmb.2005.08.067.
Fong JH, Geer LY, Panchenko AR, Bryant SH: Modeling the evolution of protein domain architectures using maximum parsimony. J Mol Biol. 2007, 366 (1): 307-315. 10.1016/j.jmb.2006.11.017.
Frenkel ZM, Trifonov EN: Origin and evolution of genes and genomes. Crucial role of triplet expansions. J Biomol Struct Dyn. 2012, 30 (2): 201-210. 10.1080/07391102.2012.677771.
Vibranovski MD, Sakabe NJ, de Oliveira RS, de Souza SJ: Signs of ancient and modern exon-shuffling are correlated to the distribution of ancient and modern domains along proteins. J Mol Evol. 2005, 61 (3): 341-350. 10.1007/s00239-004-0318-y.
Weiner J, Beaussart F, Bornberg-Bauer E: Domain deletions and substitutions in the modular protein evolution. FEBS J. 2006, 273 (9): 2037-2047. 10.1111/j.1742-4658.2006.05220.x.
Daubin V, Ochman H: Bacterial genomes as new gene homes: the genealogy of ORFans in E. coli. Genome Res. 2004, 14 (6): 1036-1042. 10.1101/gr.2231904.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.
Notredame C, Higgins DG, Heringa J: T-Coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000, 302 (1): 205-217. 10.1006/jmbi.2000.4042.
Yang Z: PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007, 24 (8): 1586-1591. 10.1093/molbev/msm088.
R: A language and environment for statistical computing. 2007, Vienna (Austria): R Foundation for Statistical Computing