Skip to main content

A survey of nuclear ribosomal internal transcribed spacer substitution rates across angiosperms: an approximate molecular clock with life history effects



A full understanding of the patterns and processes of biological diversification requires the dating of evolutionary events, yet the fossil record is inadequate for most lineages under study. Alternatively, a molecular clock approach, in which DNA or amino acid substitution rates are calibrated with fossils or geological/climatic events, can provide indirect estimates of clade ages and diversification rates. The utility of this approach depends on the rate constancy of molecular evolution at a genetic locus across time and across lineages. Although the nuclear ribosomal internal transcribed spacer region (nrITS) is increasingly being used to infer clade ages in plants, little is known about the sources or magnitude of variation in its substitution rate. Here, we systematically review the literature to assess substitution rate variation in nrITS among angiosperms, and we evaluate possible correlates of the variation.


We summarize 28 independently calibrated nrITS substitution rates ranging from 0.38 × 10-9 to 8.34 × 10-9 substitutions/site/yr. We find that herbaceous lineages have substitution rates almost twice as high as woody plants, on average. We do not find any among-lineage phylogenetic constraint to the rates, or any effect of the type of calibration used. Within life history categories, both the magnitude of the rates and the variance among rates tend to decrease with calibration age.


Angiosperm nrITS substitution rates vary by approximately an order of magnitude, and some of this variation can be attributed to life history categories. We make cautious recommendations for the use of nrITS as an approximate plant molecular clock, including an outline of more appropriate phylogenetic methodology and caveats against over interpretation of results. We also suggest that for lineages with independent calibrations, much of the variation in nrITS substitution rates may come from uncertainty in calibration date estimates, highlighting the importance of accurate and/or multiple calibration dates.


Decades after the idea of a molecular clock was first broached[1], the weight of evidence suggests that rate heterogeneity, not rate constancy, is the norm in molecular evolution (reviewed in [2, 3]) and that an exact molecular clock will not be found. Nevertheless, an approximate clock can be established for a locus with many independent calibrations [4], and would be useful for roughly dating lineages and testing time-dependent evolutionary hypotheses. Local rate-constancy and even a single calibration can allow dating within a particular lineage, but much of the excitement surrounding molecular clocks stems from the possibility of approximating dates for the majority of lineages that lack independent calibrations. This has become commonplace in the animal kingdom, especially with the widely applied mtDNA clock [5, 6], but a general molecular clock able to date relatively recent divergences in plants remains elusive. Because there will be variation among independently calibrated substitution rates, a first step in determining whether a particular locus may be useful as a molecular clock is to ascertain the amount and sources of this rate variation.

In the last decade, the nuclear ribosomal internal transcribed spacer region (nrITS) has revolutionized species-level plant phylogenetics. Because concerted evolution has generally homogenized sequence variation among the numerous ribosomal DNA copies within an individual, direct sequencing of this region is possible for most systems. This, coupled with the availability of universal primers and elevated substitution rates compared to most chloroplast regions, make it especially accessible and appropriate for resolving interspecific phylogenetic relationships [7, 8]. Although reliance on nrITS as the sole source of phylogenetic evidence has come under criticism because of certain features of its evolution [9], it remains the most efficient locus for generating species-level phylogenetic inferences in most plant groups.

The widespread use of the nrITS region at the species level makes it a good candidate for a potential plant molecular clock. Substitution rates have been independently calibrated for a number of plant groups with diverse life histories and growth forms that represent distinct branches of the angiosperm phylogeny. Preliminary surveys of some of these rates suggest that they vary by almost an order of magnitude [10, 11]. However, these limited surveys do not fully evaluate the variation in available nrITS substitution rates nor do they examine possible sources of this variation. Even without a full summary of nrITS rate variation, there is a growing trend in the literature to utilize some published rates for groups that lack their own independent calibration, either by choosing a rate based on life history similarity or phylogenetic relatedness, or by using a range of the more commonly-cited rates, such as Suh (1993) or Sang (1994, 1995). Here, we conduct a systematic search of the literature in order to identify and evaluate more thoroughly nrITS rate variation. Specifically, we report a summary of independently calibrated rates, examine potential correlates that may account for some of the variation in rates, and make conservative recommendations regarding the use of nrITS as an approximate molecular clock for angiosperms.


Literature survey

We identified 29 independent nrITS substitution rates ranging from 0.38 × 10-9 subs/site/yr in Hamamelis to 19 × 10-9 subs/site/yr in Gentiana Sect. Ciminalis (Fig. 1; Table 1). The latter rate was excluded as an outlier from all of our analyses since the calibration was based on only a single base-pair substitution and the rate was greater than double the second highest rate of 8.34 × 10-9 subs/site/year in Soldanella. The mean of the 28 remaining rates was 2.86 × 10-9 subs/site/yr. Of those studies reporting results of rate constancy tests, 12 out of 19 passed or partially passed (Table 1).

Figure 1

The distribution of ITS rates. A frequency histogram of nrITS rates, showing the difference between woody perennial and herbaceous annual/perennial rates. Arrows indicate the average rates for each life history category. The outlier rate for Gentiana Sect. Cimialis (19 × 10-9 subs/site/yr) has been removed.

Table 1 Independently calibrated nrITS substitution rates sorted by magnitude of rate.

Phylogenetic signal

Twenty-one different angiosperm families are represented among the 29 rates, which comprise non-overlapping portions of the Angiosperm evolutionary tree. No significant phylogenetic signal to the rates was detected (p = 0.393), suggesting that phylogenetic relatedness is not an appropriate justification when choosing rates from the literature. Indeed, rates from the family Asteraceae alone span almost the entire range found in Table 1, from 2.5 × 10-9 subs/site/year in Eupatorium to 7.83 × 10-9 subs/site/year in Robinsonia.

Life history effects

The average nrITS substitution rates for herbaceous and woody lineages are significantly different (p = 0.0013, Wilcoxon Rank Sum Test; Fig. 1). The herbaceous annual/perennial rates ranged from 1.72 × 10-9 to 8.34 × 10-9 substitutions/site/year (mean = 4.13 × 10-9 subs/site/yr, N = 10), and the woody perennial rates ranged from 0.38 × 10-9 to 7.83 × 10-9 substitutions/site/year (mean = 2.15 × 10-9 subs/site/yr, N = 18). In our analysis of phylogenetic independent contrasts, we found that nine of ten life history contrasts exhibit a lower rate associated with the woody perennial lineage (Sign Test, p < 0.01). The single opposing contrast involved the Hawaiian silverswords. This rate is based on a clade that includes both the woody perennial silverswords and the herbaceous perennial tarweeds. Although initially coded as woody perennials, this state is considered the derived condition among the tarweeds. Therefore, the rate may be more representative of the herbaceous perennial ancestry in the clade than the derived woody perennial silversword lineage. Our contrast results are robust to either life history classification of the tarweed/silversword rate (alternative coding, p < 0.01).


Range of nrITS substitution rates

This study represents the most comprehensive collection of nrITS substitution rates to date. The largest previous survey [10] included independently calibrated rates from 12 angiosperm lineages. They recommended using a range of rates from 1.72 × 10-9 to 7.83 × 10-9 for dating the diversification of Inga, after removing the rate from Winteraceae because of its longer generation time and possible error associated with the calibration date. The addition of 16 independently calibrated rates in our study introduces seven new rates slower than 1.72 × 10-9, one of which is slower than the Winteraceae rate. Our broader sampling of nrITS substitution rates allowed us to analyze more fully potential sources of variation in these rates.

Life history effects

Striking differences in rates of nrITS evolution between life history categories, with annuals/herbaceous perennials exhibiting significantly higher rates, are consistent with a general pattern of higher substitution rates for annual compared to perennial plants and for herbs compared to woody trees that has been reported from chloroplast [1216] and mitochondrial studies [17]. Life history has also been suggested to affect rates specifically of nrITS substitution ([18, 19] but see [20]), with annual species exhibiting longer nrITS branch lengths than perennials. In a more comprehensive review, however, Whittle and Johnston [21] found no difference in nrITS branch lengths between sister taxa differing in annual versus perennial life history. Our study may be more likely to detect rate differences associated with life history because we are using more distinctly polarized life history categories and our comparisons are across deeper phylogenetic splits, with more time for rate differences to accumulate.

Although the underlying mechanism is unclear, our results agree with previous studies in which shorter generation times are associated with faster rates of molecular evolution [22, 23]. It is difficult to explain how generation time would affect substitution rates in plants, which do not sequester their germ lines and therefore can pass on somatic mutations to their gametes, unless the majority of inherited mutations arise during meiosis and not mitosis. Furthermore, we do not know whether generation time per se is a causative factor of substitution rate variation. Differences in generation time in plants are confounded by correlated differences in growth form [21] and mating system [24], such that perennials are the only plants forming extensive wood and annuals are much more frequently self-fertilizing. A mechanistic understanding of the effects of these factors on rates of molecular evolution will increase the power of molecular clock approaches in plant evolution.

Errors in rate estimation

A potentially confounding factor associated with our life history classes is the type of calibration point used and its potential associated rate biases. Fossil data typically provide a minimum age of a particular lineage, and therefore may be more likely to overestimate a substitution rate, while biogeographic or climatic events provide a maximum age and may be more likely to underestimate a rate. Thus, an unacknowledged correlation between life history and calibration type potentially could confound our results for rate differences between woody and herbaceous plants. These potential biases appear not to be an issue. First, our dataset is well balanced with fossil calibrated rates for eight woody perennial and for six herbaceous annual/perennial groups. There is no overall correlation between type of calibration and life history category (χ2 = 0.156, ns). Second, within each life history category, we see no apparent bias due to the type of calibration. For example, within the herbaceous rates, only one of the five highest rates is calibrated exclusively with fossil evidence. Similarly, of the five slowest woody perennial rates, three are biogeographic and two are fossil calibrated.

The age of the calibration point may affect the precision of the substitution rate estimates, since more recent calibration ages are based on fewer "ticks" of the molecular clock. Older lineages will have more nucleotide substitutions and thus these rate estimates should become more precise, at least until they become biased by sequence saturation, or multiple substitutions at the same site. Among our 29 original rates, calibration ages ranged from 0.1 Ma to 83 Ma. There appears to be substantially more variation in substitution rates based on recent calibration points compared to those from older calibration ages in both life history categories (Fig. 2). An extreme example is the exceptionally high rate from Gentiana sect. Ciminalis, that was more than twice as high as any other rate. This rate was calibrated based on a biogeographic split of only 0.1 Ma after which only one nucleotide substitution occurred [25]. When rate is regressed on age of calibration, separately for each life history category, the residuals exhibit a negative trend with increasing calibration age indicating lower variance for older calibration ages (Fig. 2). Another indication of this trend is found by comparing the variation in residuals for rates with calibration points older and younger than 30 Ma. The mean absolute value of these residuals in the more recently calibrated half is more than three times higher than in the half with older calibrations.

Figure 2

The effects of calibration age. A comparison of nrITS substitution rates with calibration age reveals higher variance at younger calibration points. The decreasing substitution rate with older calibration ages is only significant for the woody perennials. The herbaceous annual/perennial regression is not significant and becomes even less so when the outlier is removed (p = 0.861).

We also found a trend of decreasing rate with the age of calibration, which was significant among woody perennials (Fig. 2). This trend may be due to the models of sequence evolution utilized for estimating sequence divergence. Without correcting for sequence saturation, rate estimates are expected to decrease with increasing time since divergence [5, 6]. In molecular clock calculations, using a model of sequence evolution that does not fully capture sequence saturation and among-site rate variation potentially can lead to large errors in divergence time estimates (reviewed in [3]). Most of the original rates we found used the relatively simple Jukes-Cantor or Kimura 2-parameter models, so this might be a significant issue, particularly for studies with older calibration dates. Ideally, objective criteria for model selection, such as the likelihood ratio tests easily implemented in MODELTEST [26] should be used before calculating a rate.

As a post hoc exploration of the effects of methods of estimating sequence divergence, including the choice of substitution model and the treatment of rate heterogeneity among branches, we reanalyzed the sequence data used to estimate two of the nrITS rates, for the Winteraceae [27] with one of the oldest calibration dates and for Dendroseris [28] with a recent calibration. These are also two of the most frequently cited rates in the literature. For the Winteraceae, our resulting substitution rate estimates, based on a TrN+G substitution model and penalized likelihood rate smoothing, are higher than crude estimates based on uncorrected counts of pairwise differences. However, the uncertainty reported for their biogeographic calibration – the isolation of Pseudowinteraceae in Australia – for which estimates range from 50 to 80 Ma, affected the overall rate estimates substantially more than the different methods of estimating branch lengths (substitutions/site/year × 10-9: 50 Ma calibration crude = 0.52, corrected = 0.65; 80 Ma calibration crude = 0.33, corrected = 0.41). Importantly, for this group with one of the oldest calibrations, which should be more sensitive to the effects of sequence saturation and rate heterogeneity, branch length estimation methods make a small difference in light of the overall range of nrITS substitution rates in our survey, not even affecting its category in the histogram of Figure 1. For Dendroseris, there was no significant rate heterogeneity, and using a more complex substitution model had no effect on the substitution rate estimate. In contrast, there was substantial variation in rates due to uncertainty in the biogeographic calibration, ranging from 4 Ma based on the age of Masatierra Island to 2.6 Ma based on a chloroplast DNA clock calculation (substitutions/site/year × 10-9: 4 Ma calibration = 3.94; 2.6 Ma calibration = 6.06). These reanalyses suggest that fine-tuning the estimates of molecular substitution makes a relatively small change to the overall rate estimate in light of substantial uncertainty in calibration dates. While branch length estimates clearly should be optimized, there is also a real need for more accurate and multiple calibrations to reduce the uncertainty of rate estimates [29].

An approximate plant molecular clock?

The variance we find in nrITS substitution rates is likely the result of both error in individual rate estimates and biological differences among lineages in actual substitution rates. As noted above, substantial error in estimating individual rates can come from uncertainty in the dating of the fossil or biogeographic/climatic event used as a calibration and in the assignment of that calibration to a node in the phylogeny [29, 30]. Further error is contributed by uncertainty in the phylogeny itself, both in the branching topology and the branch length estimates [31]. Another issue is that gene trees are not identical to species trees [3]. Coalescence of nrITS alleles should predate species divergences by a factor related to the effective population size, and any reticulate evolution or lineage sorting could obscure the true tree topology [32]. In attempting to date a particular plant lineage, few studies adequately consider these sources of error.

While some of the variance in the rates we found likely comes from these sources of error, there are also very real differences in nrITS substitution rates within and among lineages. Statistically significant rate heterogeneity was found in seven of the nineteen studies reporting tests of rate constancy. Furthermore, in spite of all the sources of noise in our dataset, we found a substantial effect of life history, with long-lived woody plants exhibiting much slower substitution rates than shorter-lived herbaceous lineages. Thus even if error in substitution rate estimates could be eliminated, the existence of a universal nrITS substitution rate is highly unlikely.

Despite the many sources of error in rate estimates, however, we find a limited distribution of rates with a central tendency, especially when separated by life history category (Fig. 1). We therefore support the cautious use of this locus as a general molecular clock for roughly dating events in plant evolution, but we offer a few important caveats. Borrowing just one or a few rates from the literature, especially based on phylogenetic relatedness, does not adequately account for the nature and extent of variation in the nrITS "clock". Instead, to roughly estimate a date we suggest that, within a life history category, the entire range of rates be considered. Furthermore, before using any rate for dating historical events, whether calculated from an internal calibration or borrowed from the literature, nrITS rate constancy should be tested within the group of interest, ideally with a tree-based likelihood ratio test [33, 34], and any lack of local rate constancy should be considered in the dating methodology [35].

The distribution of rates could then be used to examine specific hypotheses for the timing of events for lineages without calibrations of their own. For example, if there is a hypothesis for the age of a particular node in the phylogeny of a group of taxa with an nrITS dataset, one could calculate the required substitution rate to fit that age to the node and then compare the required rate to the distribution of rates presented here. A good example of this type of hypothesis testing is found in the genus Aquilegia (Ranunculaceae). Hummingbird pollination in this group was suggested to have originated during the mid-Pliocene (ca. 3.5 Ma) based on floristic associations [36]. Hodges et al. [37] calculated the nrITS substitution rate necessary within Aquilegia to fit the hypothesized age of the hummingbird pollination syndrome (0.11% per my = 0.56 × 10-9 subs/site/yr). This is approximately ten times slower than the average annual/perennial herbaceous rate (4.13 × 10-9) and more than three times slower than the slowest annual/perennial herbaceous value. Even using the slowest annual/perennial rate (1.72 × 10-9), the estimated age of hummingbird pollination is only half as old as predicted (approximately 1.66 Ma). Without any justification for such an unusually slow nrITS substitution rate in Aquilegia, hummingbird pollination likely has evolved more recently than originally predicted [37].


The results from our literature survey underscore the popularity of the nrITS region as a tool to date relatively recent historical events in plant evolution. The original rates that we found came from a broad taxonomic spectrum representing a diversity of life histories and were calculated using a variety of calibrations and methodologies. Nevertheless, the rates were consistent to approximately an order of magnitude, or even less for herbaceous lineages considered separately. Although this survey should be considered an ongoing project, given these initial results, the nrITS may be useful as an approximate molecular clock for roughly dating divergences, calculating diversification rates, and hypothesis testing in plant groups that currently lack a fossil or biogeographic calibration of their own.


Literature survey

We surveyed Systematic Botany, Molecular Phylogenetics and Evolution, The American Journal of Botany, Evolution and any references therein, from January 1995 through December 2003. We recorded rates for any study that independently calibrated and reported a rate for ITS1 and ITS2. If multiple rates were reported, we followed the author's recommendation on which was likely to be most accurate, unless their reasoning was based on the rate's similarity to other published rates. In such cases, we took the average of the reported rates. When separate rates were reported for ITS1 and ITS2, a weighted average based on the average length of each nrITS region was calculated. For each rate, we also recorded the predominant growth form/life history [38, 39], rate constancy test results, calibration type, and calibration age.

Phylogenetic signal

Frequently, authors that utilize published rates in their own studies base their choices on phylogenetic relatedness. In order to determine whether such choices are warranted, we tested for any phylogenetic signal in the rates. First we constructed a phylogeny of the taxa represented in our rates survey using the web-based program Phylomatic [40] and then manually edited the tree to reflect recent and more detailed phylogenetic hypotheses for some families [4143] (tree available upon request). The Test for Serial Independence was implemented in the program Phylogenetic Independence v. 2.0 [44, 45] with 1000 randomizations. This test compares the autocorrelation of adjacent branches of the original tree to that of a series of randomized trees.

Life history effects

Generation time has been suggested to affect rates of molecular evolution [22, 23], and thus we classified each rate according to the predominant life history of the clade, either annual/herbaceous perennial or woody perennial, as a proxy for generation time. The herbaceous category consists of relatively short-lived plants that reach sexual maturity within one to a few years, while the woody perennial life history category consists of long-lived lineages. It is possible that phylogenetic relatedness and life history could be confounded. Thus, to be conservative we also tested for rate differences due to life history using phylogenetically independent contrasts. Contrasts were identified with the BRUNCH algorithm in CAIC [46], using the same cladogram as in the above mentioned Test for Serial Independence. BRUNCH locates nodes for which there is a contrast in life history, and then calculates the direction of the rate difference across that node, using each taxon in no more than one contrast. It assumes that if sister taxa have the same life history state their most recent common ancestor also had that state, and that, when branch lengths are unknown, the rate for this ancestor can be calculated as the average rate of the derived taxa.

Reanalysis of Winteraceae and Dendroseris examples

We reanalyzed the sequence data used to estimate the nrITS rates for the Winteraceae (GenBank accessions AY004111-AY004128; [27] Additional File 1) and for Dendroseris [28] Additional File 2. We aligned the sequences with ClustalX and refined the alignments manually. Phylogenetic relationships were estimated in PAUP 4.0b10 [47] using maximum likelihood and a TrN+G substitution model for Winteraceae and a TrNef model for Dendroseris, as chosen by MODELTEST. For Winteraceae, we found significant substitution rate heterogeneity in nrITS using a tree-wide likelihood ratio test for differences between clock-constrained and clock-unconstrained trees (p < 0.01; [34]). We then used penalized likelihood rate smoothing implemented in R8S [35] to estimate branch lengths and substitution rates in the face of this heterogeneity. Within Dendroseris, we did not find significant rate heterogeneity with a tree-wide LRT (p > 0.1), and therefore rate smoothing was unnecessary for estimating substitution rates.


  1. 1.

    Zuckerkandl E, Pauling L: Evolutionary divergence and convergence in proteins. Evolving Genes and Proteins. Edited by: Bryson V, Vogel HJ. 1965, New York: Academic Press, 97-166.

    Google Scholar 

  2. 2.

    Sanderson MJ: Estimating rate and time in molecular phylogenies: Beyond the molecular clock?. Molecular Systematics of Plants II: DNA Sequencing. Edited by: Soltis DE, Soltis PS, Doyle JJ. 1998, Boston: Kluwer Academic Publishers, 242-264.

    Chapter  Google Scholar 

  3. 3.

    Arbogast BS, Edwards SV, Wakeley J, Beerli P, Slowinski JB: Estimating divergence times from molecular data on phylogenetic and population genetic timescales. Annual Review of Ecology and Systematics. 2002, 33: 707-740. 10.1146/annurev.ecolsys.33.010802.150500.

    Article  Google Scholar 

  4. 4.

    Ayala F: Molecular clock mirages. Bioessays. 1999, 21: 71-75. 10.1002/(SICI)1521-1878(199901)21:1<71::AID-BIES9>3.0.CO;2-B.

    Article  CAS  PubMed  Google Scholar 

  5. 5.

    Brown WM, George M, Wilson AC: Rapid evolution of animal mitochondrial DNA. Proceedings of the National Academy of Sciences of the United States of America. 1979, 76 (4): 1967-1971.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  6. 6.

    Brown WM, Prager EM, Wang A, Wilson AC: Mitochondrial DNA sequences of primates: tempo and mode of evolution. Journal of Molecular Evolution. 1982, 18: 225-239. 10.1007/BF01734101.

    Article  CAS  PubMed  Google Scholar 

  7. 7.

    Small RL, Ryburn JA, Cronn RC, Seelanan T, Wendel JF: The tortoise and the hare: Choosing between noncoding plastome and nuclear Adh sequences for phylogeny reconstruction in a recently diverged plant group. American Journal of Botany. 1998, 85 (9): 1301-1315. 10.2307/2446640.

    Article  CAS  PubMed  Google Scholar 

  8. 8.

    Baldwin BG, Sanderson MJ, Porter JM, Wojciechowski MF, Campbell CS, Donoghue MJ: The ITS region of nuclear ribosomal DNA: A valuable source of evidence on angiosperm phylogeny. Annals of the Missouri Botanical Garden. 1995, 82 (2): 247-277. 10.2307/2399880.

    Article  Google Scholar 

  9. 9.

    Alvarez I, Wendel JF: Ribosomal ITS sequences and plant phylogenetic inference. Molecular Phylogenetics and Evolution. 2003, 29 (3): 417-434. 10.1016/S1055-7903(03)00208-2.

    Article  CAS  PubMed  Google Scholar 

  10. 10.

    Richardson JE, Pennington RT, Pennington TD, Hollingsworth PM: Rapid diversification of a species-rich genus of Neotropical rain forest trees. Science. 2001, 293 (5538): 2242-2245. 10.1126/science.1061421.

    Article  CAS  PubMed  Google Scholar 

  11. 11.

    Malcomber ST: Phylogeny of Gaertnera Lam. (Rubiaceae) based on multiple DNA markers: evidence of a rapid radiation in a widespread, morphologically diverse genus. Evolution. 2002, 56 (1): 42-57.

    Article  CAS  PubMed  Google Scholar 

  12. 12.

    Brunsfeld SJ, Soltis PS, Soltis DE, Gadek PA, Quinn CJ, Strenge DD, Ranker TA: Phylogenetic relationships among the genera of Taxodiaceae and Cupressaceae: evidence from rbcL sequences. Systematic Botany. 1994, 19 (2): 253-262. 10.2307/2419600.

    Article  Google Scholar 

  13. 13.

    Conti E, Fischbach A, Sytsma KJ: Tribal relationships in Onagraceae: implications from rbcL sequence data. Annals of the Missouri Botanical Garden. 1993, 80 (3): 672-685. 10.2307/2399853.

    Article  Google Scholar 

  14. 14.

    Bousquet J, Strauss SH, Doerksen AH, Price RA: Extensive variation in evolutionary rate of rbcL gene sequences among seed plants. Proc Natl Acad Sci U S A. 1992, 89 (16): 7844-7848.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  15. 15.

    Gaut BS, Muse SV, Clark WD, Clegg MT: Relative rates of nucleotide substitution at the rbcL locus of monocotyledonous plants. Journal of Molecular Evolution. 1992, 35: 292-303. 10.1007/BF00161167.

    Article  CAS  PubMed  Google Scholar 

  16. 16.

    Wilson M, Gaut B, Clegg M: Chloroplast DNA evolves slowly in the palm family (Arecaceae). Mol Biol Evol. 1990, 7 (4): 303-314.

    CAS  PubMed  Google Scholar 

  17. 17.

    Laroche J, Li P, Maggia L, Bousquet J: Molecular evolution of angiosperm mitochondrial introns and exons. Proceedings of the National Academy of Sciences of the United States of America. 1997, 94 (11): 5722-5727. 10.1073/pnas.94.11.5722.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  18. 18.

    Andreasen K, Baldwin BG: Unequal evolutionary rates between annual and perennial lineages of checker mallows (Sidalcea, Malvaceae): Evidence from 18S-26S rDNA internal and external transcribed spacers. Mol Biol Evol. 2001, 18 (6): 936-944.

    Article  CAS  PubMed  Google Scholar 

  19. 19.

    Ainouche AK, Bayer RJ: Phylogenetic relationships in Lupinus (Fabaceae : Papilionoideae) based on internal transcribed spacer sequences (ITS) of nuclear ribosomal DNA. American Journal of Botany. 1999, 86 (4): 590-607. 10.2307/2656820.

    Article  CAS  PubMed  Google Scholar 

  20. 20.

    Andreasen K: Implications of molecular systematic analyses on the conservation of rare and threatened taxa: Contrasting examples from Malvaceae. Conservation Genetics. 2005, 6 (3): 399-412. 10.1007/s10592-005-4977-z.

    Article  Google Scholar 

  21. 21.

    Whittle C-A, Johnston MO: Broad-scale analysis contradicts the theory that generation time affects molecular evolutionary rates in plants. Journal of Molecular Evolution. 2003, 56 (2): 223-233. 10.1007/s00239-002-2395-0.

    Article  CAS  PubMed  Google Scholar 

  22. 22.

    Gu X, Li W-H: Higher rates of amino acid substitution in rodents than in humans. Molecular Phylogenetics and Evolution. 1992, 1: 211-214. 10.1016/1055-7903(92)90017-B.

    Article  CAS  PubMed  Google Scholar 

  23. 23.

    Ohta T: An examination of generation time effect on molecular evolution. Proc Natl Acad Sci U S A. 1993, 90: 10676-10680.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  24. 24.

    Charlesworth D, Wright S: Breeding systems and genome evolution. Current Opinion in Genetics & Development. 2001, 11 (6): 685-690. 10.1016/S0959-437X(00)00254-9.

    Article  CAS  Google Scholar 

  25. 25.

    Hungerer KB, Kadereit JW: The phylogeny and biogeography of Gentiana L. sect.Ciminalis (Adans.) Dumort: a historical interpretation of distribution ranges in the European high mountains. Perspectives in Plant Ecology, Evolution & Systematics. 1998, 1 (1): 121-135. 10.1078/1433-8319-00055.

    Article  Google Scholar 

  26. 26.

    Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.

    Article  CAS  PubMed  Google Scholar 

  27. 27.

    Suh Y, Thien LB, Reeve HE, Zimmer EA: Molecular evolution and phylogenetic implications of internal transcribed spacer sequences of ribosomal DNA in Winteraceae. American Journal of Botany. 1993, 80 (9): 1042-1055. 10.2307/2445752.

    Article  CAS  Google Scholar 

  28. 28.

    Sang T, Crawford DJ, Kim S-C, Stuessy TF: Radiation of the endemic genus Dendroseris (Asteraceae) on the Juan Fernandez Islands: Evidence from sequences of the ITS regions of nuclear ribosomal DNA. American Journal of Botany. 1994, 81 (11): 1494-1501. 10.2307/2445322.

    Article  CAS  Google Scholar 

  29. 29.

    Magallon SA: Dating lineages: molecular and paleontological approaches to the temporal framework of clades. International Journal of Plant Sciences. 2004, 165 (4 Suppl): S7-S21. 10.1086/383336.

    Article  Google Scholar 

  30. 30.

    Magallon S, Sanderson MJ: Absolute diversification rates in angiosperm clades. Evolution. 2001, 55 (9): 1762-1780.

    Article  CAS  PubMed  Google Scholar 

  31. 31.

    Sanderson MJ, Doyle JA: Sources of error and confidence intervals in estimating the age of angiosperms from rbcL and 18S rDNA data. Am J Bot. 2001, 88 (8): 1499-1516.

    Article  CAS  PubMed  Google Scholar 

  32. 32.

    Nichols R: Gene trees and species trees are not the same. Trends in Ecology & Evolution. 2001, 16 (7): 358-364. 10.1016/S0169-5347(01)02203-0.

    Article  Google Scholar 

  33. 33.

    Huelsenbeck JP, Rannala B: Phylogenetic methods come of age: testing hypotheses in an evolutionary context. Science. 1997, 276 (5310): 227-232. 10.1126/science.276.5310.227.

    Article  CAS  PubMed  Google Scholar 

  34. 34.

    Felsenstein J: Phylogenies from molecular sequences – inference and reliability. Annual Review of Genetics. 1988, 22: 521-565. 10.1146/

    Article  CAS  PubMed  Google Scholar 

  35. 35.

    Sanderson MJ: r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics. 2003, 19 (2): 301-302. 10.1093/bioinformatics/19.2.301.

    Article  CAS  PubMed  Google Scholar 

  36. 36.

    Grant V: Historical development of ornithophily in the western North-American flora. Proceedings of the National Academy of Sciences of the United States of America. 1994, 91 (22): 10407-10411.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  37. 37.

    Hodges SA, Fulton M, Yang JY, Whittall JB: Verne Grant and evolutionary studies of Aquilegia. New Phytologist. 2004, 161 (1): 113-120. 10.1046/j.1469-8137.2003.00950.x.

    Article  Google Scholar 

  38. 38.

    Hickman JC, ed: The Jepson manual: higher plants of California. 1993, Berkeley: University of California Press

  39. 39.

    Mabberley DJ: The plant-book: a portable dictionary of the vascular plants. 1997, Cambridge: Cambridge University Press, 2

    Google Scholar 

  40. 40.

    Webb C, Donoghue M: Phylomatic. Tree version R20031202 edn. 2003, []

    Google Scholar 

  41. 41.

    APG II: An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG II. Botanical Journal of the Linnean Society. 2003, 141 (4): 399-436. 10.1046/j.1095-8339.2003.t01-1-00158.x.

    Article  Google Scholar 

  42. 42.

    Doyle JJ, Doyle JL, Ballenger JA, Dickson EE, Kajita T, Ohashi H: A phylogeny of the chloroplast gene rbcL in the Fabaceae: taxonomic correlations and insights into the evolution of nodulation. American Journal of Botany. 1997, 84: 541-554. 10.2307/2446030.

    Article  CAS  PubMed  Google Scholar 

  43. 43.

    Bayer RJ, Starr JR: Tribal phylogeny of the Asteraceae based on two noncoding chloroplast sequences, the trnL intron and trnL/trnF intergenic spacer. Annals of the Missouri Botanical Garden. 1998, 85 (2): 242-256. 10.2307/2992008.

    Article  Google Scholar 

  44. 44.

    Reeve J, Abouheif E: Phylogenetic Independence. 2003, Version 2.0, []

    Google Scholar 

  45. 45.

    Abouheif E: A method for testing the assumption of phylogenetic independence in comparative data. Evolutionary Ecology Research. 1999, 1: 895-909.

    Google Scholar 

  46. 46.

    Purvis A, Rambaut A: Comparative analysis by independent contrasts (CAIC): an Apple Macintosh application for analysing comparative data. Computational and Applied Bioscience. 1995, 11 (3): 247-251.

    CAS  Google Scholar 

  47. 47.

    Swofford DL: PAUP*: phylogenetic analysis using parsimony ; 4.0b10. 1998, Sunderland, MA: Sinauer Associates,

    Google Scholar 

  48. 48.

    Wen J, Shi S: A phylogenetic and biogeographic study of Hamamelis (Hamamelidaceae), an eastern Asian and eastern North American disjunct genus. Biochemical Systematics & Ecology. 1999, 27 (1): 55-66. 10.1016/S0305-1978(98)00067-2.

    Article  CAS  Google Scholar 

  49. 49.

    Manos PS: Systematics of Nothofagus (Nothofagaceae) based on rDNA spacer sequences (ITS): Taxonomic congruence with morphology and plastid sequences. American Journal of Botany. 1997, 84 (8): 1137-1155. 10.2307/2446156.

    Article  CAS  PubMed  Google Scholar 

  50. 50.

    Leskinen E, Alstrom-Rapaport C: Molecular phylogeny of Salicaceae and closely related Flacourtiaceae: evidence from 5.8 S, ITS 1 and ITS 2 of the rDNA. Plant Systematics & Evolution. 1999, 215 (1–4): 209-227. 10.1007/BF00984656.

    Article  Google Scholar 

  51. 51.

    Wen J: Internal transcribed spacer phylogeny of the Asian and eastern North American disjunct Aralia sect.Dimorphanthus (Araliaceae) and its biogeographic implications. International Journal of Plant Sciences. 2000, 161 (6): 959-966. 10.1086/317563.

    Article  Google Scholar 

  52. 52.

    Boehle U-R, Hilger HH, Martin WF: Island colonization and evolution of the insular woody habit in Echium L.(Boraginaceae). Proceedings of the National Academy of Sciences of the United States of America. 1996, 93 (21): 11740-11745. 10.1073/pnas.93.21.11740.

    Article  CAS  Google Scholar 

  53. 53.

    Savard L, Michaud M, Bousquet J: Genetic diversity and phylogenetic relationships between birches and alders using ITS, 18S rRNA, and rbcL gene sequences. Molecular Phylogenetics & Evolution. 1993, 2 (2): 112-118. 10.1006/mpev.1993.1011.

    Article  CAS  Google Scholar 

  54. 54.

    Li J, Alexander J, Ward T, Del Tredici P, Nicholson R: Phylogenetic relationships of Empetraceae inferred from sequences of chloroplast gene matK and nuclear ribosomal DNA ITS region. Molecular Phylogenetics & Evolution. 2002, 25 (2): 306-315. 10.1016/S1055-7903(02)00241-5.

    Article  CAS  Google Scholar 

  55. 55.

    Vargas P, Morton CM, Jury SL: Biogeographic patterns in Mediterranean and Macaronesian species of Saxifraga (Saxifragaceae) inferred from phylogenetic analyses of ITS sequences. American Journal of Botany. 1999, 86 (5): 724-734. 10.2307/2656582.

    Article  CAS  PubMed  Google Scholar 

  56. 56.

    Xiang Q-Y, Crawford DJ, Wolfe AD, Tang Y-C, Depamphilis CW: Origin and biogeography of Aesculus L. (Hippocastanaceae): A molecular phylogenetic perspective. Evolution. 1998, 52 (4): 988-997. 10.2307/2411231.

    Article  Google Scholar 

  57. 57.

    Thulin M, Lavin M: Phylogeny and biogeography of the Ormocarpum group (Fabaceae): A new genus Zygocarpum from the Horn of Africa region. Systematic Botany. 2001, 26 (2): 299-317.

    Google Scholar 

  58. 58.

    Richardson JE, Weitz FM, Fay MF, Cronk QCB, Linder HP, Reeves G, Chase MW: Phylogenetic analysis of Phylica L. (Rhamnaceae) with an emphasis on island species: evidence from plastid trnL-F and nuclear internal transcribed spacer (ribosomal) DNA sequences. Taxon. 2001, 50 (2): 405-427. 10.2307/1223889.

    Article  Google Scholar 

  59. 59.

    Baum DA, Small RL, Wendel JF: Biogeography and floral evolution of Baobabs (Adansonia, Bombacaceae) as inferred from multiple data sets. Systematic Biology. 1998, 47 (2): 181-207. 10.1080/106351598260879.

    Article  CAS  PubMed  Google Scholar 

  60. 60.

    Schmidt GJ, Schilling EE: Phylogeny and biogeography of Eupatorium (Asteraceae: Eupatorieae) based on nuclear ITS sequence data. American Journal of Botany. 2000, 87 (5): 716-726. 10.2307/2656858.

    Article  CAS  PubMed  Google Scholar 

  61. 61.

    Baldwin BG, Sanderson MJ: Age and rate of diversification of the Hawaiian silversword alliance (Compositae). Proceedings of the National Academy of Sciences of the United States of America. 1998, 95 (16): 9402-9406. 10.1073/pnas.95.16.9402.

    PubMed Central  Article  CAS  PubMed  Google Scholar 

  62. 62.

    Lavin M, Wojciechowski MF, Gasson P, Hughes C, Wheeler E: Phylogeny of robinioid legumes (Fabaceae) revisited: Coursetia and Gliricidia recircumscribed, and a biogeographical appraisal of the Caribbean endemics. Systematic Botany. 2003, 28 (2): 387-409.

    Google Scholar 

  63. 63.

    Kass E, Wink M: Molecular phylogeny and phylogeography of Lupinus (Leguminosae) inferred from nucleotide sequences of the rbcL gene and ITS 1+2 regions of rDNA. Plant Systematics and Evolution. 1997, 208 (3–4): 139-167. 10.1007/BF00985439.

    Article  Google Scholar 

  64. 64.

    Wojciechowski MF, Sanderson MJ, Hu J-M: Evidence on the monophyly of Astragalus (Fabaceae) and its major subgroups based on nuclear ribosomal DNA ITS and chloroplast DNA trnL intron data. Systematic Botany. 1999, 24 (3): 409-437. 10.2307/2419698.

    Article  Google Scholar 

  65. 65.

    Jobst J, King K, Hemleben V: Molecular evolution of the internal transcribed spacers (ITS1 and ITS2) and phylogenetic relationships among species of the family Cucurbitaceae. Molecular Phylogenetics & Evolution. 1998, 9 (2): 204-219. 10.1006/mpev.1997.0465.

    Article  CAS  Google Scholar 

  66. 66.

    Verboom GA, Linder HP, Stock WD: Phylogenetics of the grass genus Ehrharta : evidence for radiation in the summer-arid zone of the South African Cape. Evolution. 2003, 57 (5): 1008-1021.

    Article  PubMed  Google Scholar 

  67. 67.

    von Hagen KB, Kadereit JW: The phylogeny of Gentianella (Gentianaceae) and its colonization of the southern hemisphere as revealed by nuclear and chloroplast DNA sequence variation. Organisms Diversity & Evolution. 2001, 1 (1): 61-79. 10.1078/1439-6092-00005.

    Article  Google Scholar 

  68. 68.

    Wendel JF, Schnabel A, Seelanan T: An unusual ribosomal DNA sequence from Gossypium gossypioides reveals ancient, cryptic, intergenomic introgression. Molecular Phylogenetics & Evolution. 1995, 4 (3): 298-313. 10.1006/mpev.1995.1027.

    Article  CAS  Google Scholar 

  69. 69.

    Sang T, Crawford DJ, Stuessy TF, Silva-O M: ITS sequences and phylogeny of the genus Robinsonia (Asteraceae). Systematic Botany. 1995, 20 (1): 55-64. 10.2307/2419632.

    Article  Google Scholar 

  70. 70.

    Zhang L-B, Comes HP, Kadereit JW: Phylogeny and quaternary history of the European montane/alpine endemic Soldanella (Primulaceae) based on ITS and AFLP variation. American Journal of Botany. 2001, 88 (12): 2331-2345.

    Article  CAS  PubMed  Google Scholar 

  71. 71.

    Rahn K: A phylogenetic study of the Plantaginaceae. Bot J Linn Soc. 1996, 120: 145-98.

    Google Scholar 

Download references


We thank D. Morrow for advice on analyses, J. Yang, T. Near and M. Sanderson for helpful discussions, and B. Igic for helpful comments and criticisms on an earlier version of this manuscript.

Author information



Corresponding author

Correspondence to Kathleen M Kay.

Additional information

Authors' contributions

JW and KK designed and coordinated the study, carried out the literature survey, performed the analyses, and drafted the manuscript. SH provided original direction and helped to revise the manuscript. All authors discussed and interpreted the results and read and approved the final manuscript.

Kathleen M Kay, Justen B Whittall contributed equally to this work.

Electronic supplementary material


Additional File 1: This is the alignment of Winteraceae nrITS from Genbank accessions AY004111-AY004128[27], used in calculating the effects of calibration uncertainty and branch length estimation on substitution rate. It includes PAUP and r8s command blocks and the tree description from the maximum likelihood analysis. (TXT 37 KB)


Additional File 2: This is the alignment of Dendroseris nrITS from Sang [28] used in calculating the effects of calibration uncertainty and branch length estimation on nrITS substitution rate. It includes a PAUP command block and the tree description from the maximum likelihood analysis. (TXT 8 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Kay, K.M., Whittall, J.B. & Hodges, S.A. A survey of nuclear ribosomal internal transcribed spacer substitution rates across angiosperms: an approximate molecular clock with life history effects. BMC Evol Biol 6, 36 (2006).

Download citation


  • Substitution Rate
  • Molecular Clock
  • Diversification Rate
  • Sequence Saturation
  • Life History Effect