Single mitochondrial gene barcodes reliably identify sister-species in diverse clades of birds
BMC Evolutionary Biology volume 8, Article number: 81 (2008)
DNA barcoding of life using a standardized COI sequence was proposed as a species identification system, and as a method for detecting putative new species. Previous tests in birds showed that individuals can be correctly assigned to species in ~94% of the cases and suggested a threshold of 10× mean intraspecific difference to detect potential new species. However, these tests were criticized because they were based on a single maternally inherited gene rather than multiple nuclear genes, did not compare phylogenetically identified sister species, and thus likely overestimated the efficacy of DNA barcodes in identifying species.
To test the efficacy of DNA barcodes we compared ~650 bp of COI in 60 sister-species pairs identified in multigene phylogenies from 10 orders of birds. In all pairs, individuals of each species were monophyletic in a neighbor-joining (NJ) tree, and each species possessed fixed mutational differences distinguishing them from their sister species. Consequently, individuals were correctly assigned to species using a statistical coalescent framework. A coalescent test of taxonomic distinctiveness based on chance occurrence of reciprocal monophyly in two lineages was verified in known sister species, and used to identify recently separated lineages that represent putative species. This approach avoids the use of a universal distance cutoff which is invalidated by variation in times to common ancestry of sister species and in rates of evolution.
Closely related sister species of birds can be identified reliably by barcodes of fixed diagnostic substitutions in COI sequences, verifying coalescent-based statistical tests of reciprocal monophyly for taxonomic distinctiveness. Contrary to recent criticisms, a single DNA barcode is a rapid way to discover monophyletic lineages within a metapopulation that might represent undiscovered cryptic species, as envisaged in the unified species concept. This identifies a smaller set of lineages that can also be tested independently for species status with multiple nuclear gene approaches and other phenotypic characters.
Large scale sequencing of a predefined region of approximately 650 (base pairs) bp of the mitochondrial gene COI, known as DNA barcoding, has two main goals: 1) to develop a species identification system that also allows unknown individuals to be assigned to species; 2) and to enhance the discovery of new species [1–3]. Although DNA barcoding has proved effective in achieving both goals in several large groups of animals [4–11], the efficacy of the tests have been questioned [12–16].
A major test performed on 643 previously recognized species of birds of North America demonstrated the effectiveness of DNA barcoding because 94% possessed unique monophyletic COI clusters [10, 11]. The remaining 6% of the species did not have unique DNA barcodes, indicating that they either were (a) wrongly identified in the past as separate species, (b) closely related species that hybridize regularly, or (c) species losing identity by secondary contact . These groups may be in the indeterminate zone between differentiated populations and distinct species [10, 11]. Critics of DNA barcoding claim that in spite of the impressive number of bird species sampled , the precision of the method was compromised due to insufficient intraspecific sampling, and because comparisons among species were not exclusively from sister-species pairs [12, 15, 17], where taxonomic uncertainty, interspecific hybridization, and incomplete lineage sorting could decrease the effectiveness of the test . The suggested threshold of 10 times the mean intraspecific variation (10 × rule) to screen for splits referred to as 'putative' species  has also been criticized. Moritz and Cicero  reported significantly lower average mitochondrial DNA distances between sister species of birds than levels reported in the barcoding tests of birds [10, 11], although the distances from these sister-species comparisons came from a variety of methods and genes . Meyer and Paulay  tested different threshold methods in COI barcodes of cowries and found extensive overlap of overall intraspecific distances with interspecific distances, resulting in minimum error rates of ~17% to screen for putative new species. Additionally, a simulation study using the neutral coalescent and the Bateson-Dobzhansky-Muller (BDM) model of speciation suggested that mtDNA barcodes will have error rates lower than 10% in assigning individuals to species only when populations have been isolated for more than 4 million generations . A universal-distance cutoff is therefore not an objective criterion to delineate species limits .
Additionally, Hickerson et al.  argued that reciprocal monophyly of mtDNA sequences and the 10 × threshold will likely underestimate species diversity . Tree-based approaches with genetic distances that use reciprocal monophyly for species delimitation can be problematic because aggregations of haplotypes in phylogenetic trees, even when highly supported, do not necessarily imply that they belong to a distinctive taxonomic unit . To address these issues, Rosenberg  proposed a statistical test to test if monophyletic groups in a phylogenetic tree are more likely to represent distinctive taxonomical entities, or are just random branches of lineages within a species. This approach also suggests minimal sample sizes required for inferences to be made about taxonomic distinctiveness from observations of monophyly .
Some of the advantages of using a single mtDNA barcode to identify species are that it has a higher rate of evolution (and thus more mutations), and because matrilineal lineages sort into reciprocally monophyletic clades much faster than nuclear genes . This reduces the incidence of incompletely sorted lineages relative to that expected with nuclear genes. However, recent simulations with multiple nuclear genes indicate that very recently derived species can be identified well before the time to reciprocal monophyly . Additionally, species were correctly delimited in <50% of replicates simulating mtDNA sequences, suggesting that the single gene barcode approach was insufficient to delimit recently diverged species.
In response to the above criticisms we initiated a more comprehensive study of 60 sister-species pairs of birds defined rigorously with multigene phylogenies to determine whether mtDNA barcodes can reliably distinguish closely related sister species. Instead of the much criticized 10× rule, which may not apply in recently diverged sister-species pairs, we use coalescent-based statistical tests for species distinctiveness under reciprocal monophyly . Additionally, we show that even recently diverged sister-species pairs have fixed nucleotide substitutions that serve as diagnostic mtDNA barcodes envisioned in the original analogy. Such diagnostic barcodes are useful not only in quickly identifying known species of birds but also in flagging other recently derived evolutionary lineages that could be analyzed with multilocus methods [21–23] to determine if they represent emergent species.
DNA barcodes distinguish sister-species of birds
Monophyletic clusters of individuals corresponding to species were recovered in a Neighbor-joining (NJ) tree under the Kimura 2-parameter (K2P) model in all the sister-species pairs compared (Table 1, see Additional files 1, 2). Multiple diagnostic characters in the branches of the trees leading to species clusters were detected in all the pairs (see Additional file 1, Figure 1). Bootstrap support at the nodes grouping individuals of the same species varied from 55 to 100%, except for Eastern Meadowlark (Sturnella magna), with the majority of the values (93.1%) above 85% (see Additional file 1). Species with clusters of individuals supported with bootstrap levels below 85% were: Ruby-throated Hummingbird (Archilochus colubris), Black-chinned Hummingbird (Archilochus alexandri), Gunnison Sage-Grouse (Centrocercus minimus), Dusky Grouse (Dendragapus obscurus), Nuttall's Woodpecker (Picoides nuttallii), Jackass Penguin (Spheniscus demersus), and Magellanic Penguin (Spheniscus magellanicus). These species were distinguished by <10 fixed nucleotide substitutional differences or had multiple intraspecific clusters. Probabilities of chance occurrence of reciprocal monophyly arising from random-branching within a single taxon were smaller than the level of significance (α) of 5% (Table 1). Ideally, larger sample sizes are required to increase the power of the test and to confirm reciprocal monophyly over a broad geographic range.
Individuals were correctly assigned to their corresponding species
Individuals from the six species-pairs with adequate samples sizes were picked randomly to query whether they could be assigned correctly to their species using clustering in a NJ tree, fixed mutations, and a statistical test of assignment based on coalescent theory  (Table 2, Figure 2). In all the cases the query individual was correctly assigned to species with posterior probability of 1.0 and correspondingly tiny risk of misassignment (Table 2, Figure 2). When species barcodes were comprised of more than one intraspecific cluster, as in Southern Brown Kiwi (Apteryx australis, Figure 2A), Gull-billed Tern (Gelochelidon nilotica) and Gentoo Penguin (Pygoscelis papua), the query individual was assigned correctly to the each intraspecific cluster (Table 2).
Species level delimitation with the "10 × rule"
Mean among sister-species distances of mtDNA barcodes varied from 0.78% to 11.77%, with 20 out of 60 (28.6%) distances smaller than the 2.7% threshold used to flag potential new species of birds. Among-species distances overlapped maximum within-species distances in 39 of 60 (65%) sister-species pairs. Excluding cases that are likely to represent overlooked species based on other attributes, the overlap was observed in 21 of 60 sister-species pairs (35%, Figure 3A). However, COI sequences in several species were structured in NJ trees into clades that represent geographically structured populations, recognized subspecies or possibly cryptic species (Table 3). The ratios of among-species to within-species distances were above 1 except for western and eastern populations of Eastern Meadowlark (Sturnella magna) which are thought to be two species [11, 25] (Figure 3B).
Plots of corrected COI distances against divergence times revealed that mutations are accumulating roughly linearly in all the groups we evaluated (Figure 4). However, the rates of evolution are variable. For example, shanks accumulate more mutations in COI than do terns and penguins per unit time (Figures 4, and 5A–C). Variation in rates of evolution of COI in different clades of birds mitigates against a universal distance criterion for species recognition, in accordance with previous evidence from a mitogenomic timescale for birds .
Intraspecific variation suggesting potential distinctive taxonomical entities
Six species had distinctive intraspecific clusters with probabilities of chance reciprocal monophyly below a conservative level of α = 1%: Kittlitz's Murrelet (Brachyramphus brevirostris), Gentoo Penguin (Pygoscelis papua), Gull-billed Tern (Gelochelidon nilotica), Eastern Meadowlark (Sturnella magna), Common Redshank (Tringa totanus), and Little Penguin (Eudyptula minor, Table 3, Figure 6). These groups represent recognized subspecies, populations occupying different geographical areas or distinct morphotypes. DNA barcode sequences of Gelochelidon nilotica comprised three intraspecific clusters in NJ trees (Figure 6C, Table 3). Two of the groups had discontinuous beak size distributions (pers. obs.) that were thought to represent Australian and Asian subspecies S. n. macrotarsa and S. n. affinis, respectively . The other group comprised reciprocally monophyletic lineages representing the subspecies S. n. groenvoldi (South America) and S. n. vanrossemini (Russia), but they were poorly sampled (2 samples each) .
Using the test for chance reciprocal monophyly, the Little Penguins of Australia and New Zealand, respectively, currently lumped into Eudyptula minor, are probably two species (Table 3). This conclusion is supported by a high number of fixed differences in the DNA barcodes and in multigene phylogenies  (Table 3, Figure 6A). Other species are comprised of monophyletic groups that could be taxonomically distinctive, although the probabilities of chance reciprocal monophyly are between 1–5%. For example, specimens of Australasian Pipit (Anthus novaeseelandiae) from New Zealand and Australia differ by 4.1% in their barcodes, and Little Terns (Sterna albifrons) from England and Australia differ by about 1%. However, increased sampling of these species is required to properly test whether they represent separate taxonomic entities.
Effectiveness of single gene COI barcodes
Our study of 60 pairs of sister species from a broad range of bird clades showed that closely related pairs could not be distinguished using the 10× rule of among to within species divergence, as predicted by critics of this criterion [12, 15]. Similarly, the suggested threshold genetic distance of 2.7% to flag potential species failed to detect recently evolved sister species, and was further compromised by substantial variation in the rate of COI evolution in different clades and short species divergence times. However, all sister-species pairs were shown to possess unique DNA barcodes by which they could be identified. In particular, the COI sequences of even very closely related sister species were found to have diagnostic combinations of 5–64 fixed substitutional differences that better fit the analogy of a short DNA barcode. Individuals were correctly assigned to each sister species for which we had moderate sample sizes (N ≥ 4) using different lines of evidence: NJ clustering, diagnostic fixed substitutions, and a decision-theoretic framework based on coalescent theory implemented in Assigner . The concern about assigning taxonomically unknown specimens to an existing or new taxon is unlikely to be a serious problem in birds, given the uniqueness of species barcodes and the mature taxonomy of the clade.
Phylogroups of COI sequences representing within-species variation can potentially be confounded with recently diverged sister species, so to objectively discriminate between these two possibilities we applied a statistical test of the null hypothesis that reciprocal monophyly has arisen by random branching of lineages within a single species. The null hypothesis could be rejected in all closely related sister species (P < 0.05), verifying the power of the test. In addition, putative new species were strongly supported by the distinctive signatures of >12 fixed substitutional differences and low probabilities of chance reciprocal monophyly within a single species. For example, the barcodes of Little Penguins from Australia and New Zealand, and of Gentoo Penguins from Macquarie Island and the Falklands, provide strong inferences of separate lineages that may warrant species status for these groups. The existence of separately evolving metapopulation lineages is the species delimitation criterion for a recently proposed unified species concept , though contingent properties such as phenetic, behavioural and reproductive differences need to be assessed in future to provide additional lines of evidence for or against species status. This is not a weakness of a single mtDNA gene barcoding system as has been claimed , but rather is a rapid way to discover monophyletic lineages within a metapopulation that might represent undiscovered cryptic species. The barcoding approach used here can be applied to other organismal groups where individuals of the same species cluster in monophyletic clades despite overlaps in within- and among-species variation . However, will not be applicable in groups with no mitochondrial divergence observed between species pairs (ex. ).
Single gene versus multilocus approaches for species delimitation
One of the most cogent criticisms of single locus mtDNA barcodes is that a pattern of reciprocal monophyly in maternally inherited genes can also arise when female dispersal is very restricted, often contrasting with widespread apparent panmixia of autosomal and paternally inherited genes . However, if sister species have diverged very recently then sufficient time may not have passed for enough mutations in a nuclear gene to have accumulated to reliably track lineage splitting and resolve problems with incomplete sorting of ancestral polymorphism. This in turn can lead to erroneous inference of extensive gene flow in autosomal genes if it is based on single gene trees. In such situations use of multiple nuclear genes is increasingly being touted to help delimit species boundaries [21–23]. Recent simulations in a coalescent-based approach showed that species limits were delimited with high probability depending on the number of loci examined and the timing of species divergence . Ten loci were able to reliably detect species with effective population sizes of 100,000 that diverged in a timeframe (31,000 generations ago) when incomplete lineage sorting would be expected to occur. Obviously, this multilocus approach is currently infeasible for the purpose of barcoding life on the planet, but it will be invaluable for inferring species limits in very recently separated species pairs where mtDNA barcodes alone might not be definitive. The 60 previously identified sister-species pairs of birds we studied had unique mtDNA barcodes that identified them, and each species was characterized by fixed mutational differences that are unlikely to be reduced substantially in number by increased sampling of polymorphic sites. However, species in which well differentiated reciprocally monophyletic clades of COI haplotypes were detected would seem to be fertile ground for further investigation with independent multiple nuclear gene trees in a coalescent framework. For example, the split between Australian and New Zealand populations of Little Penguins was dated at approximately 1.3 Mya using the neutral coalescent method in IM , and a phylogenetic rate of COI evolution of 0.01354 substitutions/site/Myr . Given a generation time of 6.5 years (based age of first breeding of 2.5 years and annual survival of breeding adults 80%  this equates roughly to 200,000 generations, where incomplete lineage sorting of autosomal genes should be reduced unless effective population size is very large . The faster sorting of COI sequences might be an advantage in identifying possible recent speciation events, and they can be combined with nuclear gene sequences in IM to estimate whether the divergence is due to isolation or if gene flow has been ongoing. Thus we view DNA barcodes as useful complements in multigene data sets that might include more than one mtDNA gene , contrary to recent criticisms of maternally inherited genes in species delimitation.
We show that in a broad range of birds even closely related sister species delimited with independent evidence could be identified with mtDNA barcodes and diagnostic substitutions using standard COI sequences. All pairs were characterized by reciprocally monophyletic lineages, and tests of the null hypothesis of random branching within a single species were rejected. Thus in well studied groups like birds, mtDNA barcodes are extremely effective in identifying sister species. In species that are shown by COI barcodes to be comprised of several divergent monophyletic lineages that might flag unrecognized species, it is important to test these splits with multiple independent gene trees in a coalescent framework to guard against the alternative inference of population subdivision via restricted female dispersal. Combination of multiple genes including mtDNA barcodes should counter any biases in species detection and the high variance in associated genetic processes .
To evaluate the performance of COI barcoding in detecting species boundaries of birds we analyzed sister-species pairs defined rigorously by previous phylogenetic studies (Table 1). We excluded species that were known to hybridize to prevent confusion due to introgression, a problem that plagues all methods of species delimitation. In addition, we included species of birds with multiple clusters that might represent unrecognized species. The COI sequences generated and used in this work are deposited in the project "Royal Ontario Museum- Birds 1" in the Completed Projects selection of the Barcode of Life Data System (BOLD , Genbank Accession numbers EU525241–EU525592). COI sequences obtained from previous work are available in the Completed Projects selection of the BOLD, in the "Birds of North America" project [10, 11] (Genbank Accession numbers DQ432694–DQ433261, DQ433274–DQ433846, DQ434243–DQ434805).
DNA extraction and sequencing
DNA was extracted from blood, muscle or liver by phenol, chelex or a membrane purification procedure with glass fiber filtration plates (Acroprep 96 Filter Plate- 1.0 μm Glass, PALL Corporation ). PCR amplification of the 5' end of the COI gene were performed in a 12.5 μL reaction, with a buffer solution containing 10 mM Tris-HCl, pH 8.3, 50 mM KCl, 2.5 mM MgCl2, 0.01% gelatin, and 160 μg/ml bovine serum albumin (BSA) , 0.4 mM dNTPs, 0.2 μM of each primer, 1 U Taq polymerase (Invitrogen), and 20–25 ng of DNA. Cycle conditions were 36 cycles of 94°C for 40 s, 50°C for 40s, and 72°C for 1 m, with an initial denaturation of 94°C for 5 m and a final extension at 72°C for 7 m. Bird universal primers used were as follows: LTyr – TGTAAAAAGGWCTACAGCCTAACGC, (Oliver Haddrath, pers. comm.) and COI907aH2 – GTRGCNGAYGTRAARTATGCTCG, (Rebecca Elbourne, pers. comm.) Amplified segments were purified by excising bands from agarose gels and centrifuging each through a filter tip. Sequences were obtained on an ABI3100 (Applied Biosystems) according to the manufacturers' suggested protocols using the internal primers COIaRt (forward-AACAAACCACAAAGATATCGG, Oliver Haddrath, pers comm.) and COI748Ht (reverse-TGGGARATAATTCCRAAGCCTGG), or alternatively LTyr (primer used in amplification) and COI745h2 (reverse-ACRTGNGAGATRATTCCRAANCCNG, Rebecca Elbourne, pers. comm.). Sequences were checked for ambiguities in Sequencher 4.1.2 (GeneCodes Corp., Ann Arbor, Michigan) and the multiple alignments was performed in MacClade 4 .
Species delimitation with DNA barcodes
To check for reciprocal monophyly in sister-species with DNA barcodes, a Neighbor-Joining (NJ) tree was constructed in PAUP 4.10b  with the Kimura 2 parameter model (K2P). Statistical support was estimated with 1,000 bootstrap replicates in a heuristic search using stepwise addition with 10 random additions of sequences.
Because compound diagnostic characters are a valuable source of information to diagnose species  we filtered variable characters for each sister-species pairs in PAUP 4.10b , and fixed substitutions were selected in MacClade 4 .
The test for chance occurrence of reciprocal monophyly  was applied to the sister-species pairs with α = 5%. We also performed this test on 'intraspecific' clusters of individuals that might represent distinct taxonomical unities, and additional species from which the barcodes were available in our database, or in public databases (Genbank, BOLD, see Table 3). Additionally, as an example on Little Penguins, we used the non-equilibrium coalescent approach implemented in the program IM, where an ancestral population splits into two constant-sized populations in the past and potentially exchange migrants . Modal values of the population mutation parameter (θ), time of population divergence (tpop), time to the most recent common ancestor (TMRCA) and scaled migration rate (M) were obtained from the posterior distributions of these parameters using a Monte Carlo Markov Chain run for 12.26 million generations after a burnin of 100,000 generations.
The correct assignment of individuals to species was performed in a decision-theoretic framework based on coalescent theory in Assigner . The species selected had a ratio of among-species:maximum within-species genetic distances <10, and with N ≤ four individuals (Common Goldeneye, Lincoln's Sparrow, Sandwich Tern, and Gentoo Penguin). The COI sequence of one randomly selected individual was excluded from the matrix and used as the query sequence. For each of the sister species of the pair (target groups), the evolutionary parameter θ (twice the product of the female effective population size and neutral mutation rate) with corresponding maximum likelihood was estimated from the data in FLUCTUATE . These values were used to calculate the likelihood of each of the target groups after re-including the query sequence to be assigned in Assigner .
Distance and threshold estimation
Distances under the K2P model were calculated among sister-species and within-species in MEGA 3.1 . Complete deletion was used in each comparison, to keep the number of base pairs equal in intra- and interspecific comparisons. Because the precision of the mtDNA barcode relies on the expectation that within-species variation is lower than among-species variation , the mean estimate of among species distances and the maximum value of pairwise intraspecific distances were used in the comparisons. The average level of intraspecific variation estimated across 260 species of birds of North America (0.27% of sequence divergence, yielding a threshold of 2.7% sequence divergence)  was used to test the efficacy of the 10 × rule in the sister-species pairs. To evaluate how variation in rates of evolution of COI in different lineages of birds  affect distance comparisons at sister-species levels, we selected six clades of birds for which divergence times have been estimated previously with relaxed clock methods (terns , shanks , alcids , penguins , and kiwis ). K2P distances of species pairs were plotted against divergence times, and COI distances between sister species of Terns, Shanks and Penguins were mapped on the corresponding chronograms.
Hebert PD, Cywinska A, Ball SL, deWaard JR: Biological identifications through DNA barcodes. Proc Biol Sci. 2003, 270 (1512): 313–321-10.1098/rspb.2002.2218.
Stoeckle MY: Taxonomy, DNA and the bar code of life. BioScience. 2003, 53: 2-3. 10.1641/0006-3568(2003)053[0796:TDATBC]2.0.CO;2.
Hebert PD, Gregory TR: The promise of DNA barcoding for taxonomy. Syst Biol. 2005, 54 (5): 852-859. 10.1080/10635150500354886.
Ward RD, Zemlak TS, Innes BH, Last PR, Hebert PD: DNA barcoding Australia's fish species. Philos Trans R Soc Lond B Biol Sci. 2005, 360 (1462): 1847-1857. 10.1098/rstb.2005.1716.
Hebert PD, Penton EH, Burns JM, Janzen DH, Hallwachs W: Ten species in one: DNA barcoding reveals cryptic species in the neotropical Skipper Butterfly Astraptes fulgerator. Proc Natl Acad Sci USA. 2004, 101 (41): 14812-14817. 10.1073/pnas.0406166101.
Janzen DH, Hajibabaei M, Burns JM, Hallwachs W, Remigio E, Hebert PD: Wedding biodiversity inventory of a large and complex Lepidoptera fauna with DNA barcoding. Philos Trans R Soc Lond B Biol Sci. 2005, 360 (1462): 1835-1845. 10.1098/rstb.2005.1715.
Lambert DM, Baker A, Huynen L, Haddrath O, Hebert PD, Millar CD: Is a large-scale DNA-based inventory of ancient life possible?. J Hered. 2005, 96 (3): 279-284. 10.1093/jhered/esi035.
Smith MA, Fisher BL, Hebert PD: DNA barcoding for effective biodiversity assessment of a hyperdiverse arthropod group: the ants of Madagascar. Philos Trans R Soc Lond B Biol Sci. 2005, 360 (1462): 1825-1834. 10.1098/rstb.2005.1714.
Pook CE, McEwing R: Mitochondrial DNA sequences from dried snake venom: a DNA barcoding approach to the identification of venom samples. Toxicon. 2005, 46 (7): 711-715. 10.1016/j.toxicon.2005.07.005.
Kerr KCR, Stoeckle MY, Dove CJ, Weigt LA, Francis CM, Hebert PDN: Comprehensive DNA barcode coverage of North American birds. Mol Ecol Notes. 2007, 7 (4): 535-543. 10.1111/j.1471-8286.2007.01670.x.
Hebert PD, Stoeckle MY, Zemlak TS, Francis CM: Identification of Birds through DNA Barcodes. PLoS Biol. 2004, 2 (10): e312-10.1371/journal.pbio.0020312.
Moritz C, Cicero C: DNA barcoding: promise and pitfalls. PLoS Biol. 2004, 2 (10): e354-10.1371/journal.pbio.0020354.
Meyer CP, Paulay G: DNA barcoding: error rates based on comprehensive sampling. PLoS Biol. 2005, 3 (12): e422-10.1371/journal.pbio.0030422.
Meier R, Shiyang K, Vaidya G, Ng PK: DNA barcoding and taxonomy in Diptera: a tale of high intraspecific variability and low identification success. Syst Biol. 2006, 55 (5): 715-728. 10.1080/10635150600969864.
Hickerson MJ, Meyer CP, Moritz C: DNA barcoding will often fail to discover new animal species over broad parameter space. Syst Biol. 2006, 55 (5): 729-739. 10.1080/10635150600969898.
Rubinoff D: Utility of mitochondrial DNA barcodes in species conservation. Conserv Biol. 2006, 20 (4): 1026-1033.
Will KW, Mishler BD, Wheeler QD: The perils of DNA barcoding and the need for integrative taxonomy. Syst Biol. 2005, 54 (5): 844-851. 10.1080/10635150500354878.
DeSalle R, Egan MG, Siddall M: The unholy trinity: taxonomy, species delimitation and DNA barcoding. Philos Trans R Soc Lond B Biol Sci. 2005, 360 (1462): 1905-1916. 10.1098/rstb.2005.1722.
Rosenberg NA: Statistical tests for taxonomic distinctiveness from observations of monophyly. Evolution. 2007, 61 (2): 317-323. 10.1111/j.1558-5646.2007.00023.x.
Avise JC: Phylogeography: The History and Formation of Species. 2000, Cambridge, Massachusetts , Harvard University Press, 447-
Knowles LL, Carstens BB: Delimiting species without monophyletic gene trees. Syst Biol. 2007, 56 (6): 887-895. 10.1080/10635150701701091.
Maddison WP, Knowles LL: Inferring phylogeny despite incomplete lineage sorting. Syst Biol. 2006, 55: 21-30. 10.1080/10635150500354928.
Edwards SV, Liu L, Pearl DK: High-resolution species trees without concatenation. Proc Natl Acad Sci USA. 2007, 104: 5936-5941. 10.1073/pnas.0607004104.
Abdo Z, Golding GB: A step toward barcoding life: a model-based, decision-theoretic method to assign genes to preexisting species groups. Syst Biol. 2007, 56 (1): 44-56. 10.1080/10635150601167005.
Wells MG: World bird species checklist: With alternative English and scientific names. 1998, Bushey , Worldlist, 671-
Pereira SL, Baker AJ: A mitogenomic timescale for birds detects variable phylogenetic rates of molecular evolution and refutes the standard molecular clock. Mol Biol Evol. 2006, 23 (9): 1731-1740. 10.1093/molbev/msl038.
Rogers DI, Collins P, Jessop RE, Minton CDT, Hassell CJ: Gull-billed Terns in north-western Australia: subspecies identification, moults and behavioural notes. Emu. 2005, 105 (2): 145-158. 10.1071/MU04045.
Molina KC, Erwin RM: The distribution and conservation status of the Gull-billed Tern (Gelochelidon nilotica) in North America. Waterbirds. 2006, 29 (3): 271-295. 10.1675/1524-4695(2006)29[271:TDACSO]2.0.CO;2.
Banks JC, Mitchell AD, Paterson AM: An unexpected pattern of molecular divergence within the Blue Penguin (Eudyptula minor) complex. Notornis. 2002, 49: 29-38.
De Queiroz K: Species concepts and species delimitation. Syst Biol. 2007, 56 (6): 879-886. 10.1080/10635150701701083.
Neigel J, Domingo A, Stake J: DNA barcoding as a tool for coral reef conservation. Coral Reefs. 2007, 26 (3): 487-499. 10.1007/s00338-007-0248-4.
Irwin DE: Phylogeographic breaks without geographical barriers to gene flow. Evolution. 2002, 56 (12): 2383-2394.
Hey J, Nielsen R: Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics. 2004, 167 (2): 747-760. 10.1534/genetics.103.024182.
Saether BE, Lande R, Engen S, Weimerskirch H, Lillegard M, Altwegg R, Becker PH, Bregnballe T, Brommer JE, McCleery RH, Merila J, Nyholm E, Rendell W, Robertson RR, Tryjanowski P, Visser ME: Generation time and temporal scaling of bird population dynamics. Nature. 2005, 436 (7047): 99-102. 10.1038/nature03666.
Hudson RR, Coyne JA: Mathematical consequences of the genealogical species concept. Evolution. 2002, 56 (8): 1557-1565.
Pons J, Barraclough TG, Gomez-Zurita J, Cardoso A, Duran DP, Hazell S, Kamoun S, Sumlin WD, Vogler AP: Sequence-based species delimitation for the DNA taxonomy of undescribed insects. Syst Biol. 2006, 55 (4): 595-609. 10.1080/10635150600852011.
Barcode of Life Data System. [http://www.barcodinglife.org]
Ivanova NV, DeWaard JR, Hebert PDN: An inexpensive, automation-friendly protocol for recovering high-quality DNA. Mol Ecol Notes. 2006, 6: 998-1002. 10.1111/j.1471-8286.2006.01428.x.
Hagelberg E: Mitochondrial DNA from ancient bones. Ancient DNA. Edited by: Herrmann B, Hummel S. 1994, New York , Springer, 195-204.
Maddison WP, Maddison DR: MacClade 4: Analysis of Phylogeny and Character Evolution. 2005, Sunderland , Sinauer Associates, Inc., [http://macclade.org]Version 4.08
Swofford DL: PAUP*: Phylogenetic Analysis Using Parsimony (*and related methods) . 2002, Sunderland , Sinauer Associates, [http://paup.csit.fsu.edu/]4
Ratnasingham S, Hebert PDN: BOLD: The Barcode of Life Data System. Mol Ecol Notes. 2007, 7 (3): 355-364. 10.1111/j.1471-8286.2007.01678.x.
Nielsen R, Wakeley JW: Distinguishing migration from isolation: an MCMC approach. Genetics. 2001, 158: 885-896.
Kuhner MK, Yamato J, Felsenstein J: Maximum likelihood estimation of population growth rates based on the coalescent. Genetics. 1998, 149 (1): 429-434.
Kumar S, Tamura K, Nei M: MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment. Brief Bioinform. 2004, 5 (2): 150-163. 10.1093/bib/5.2.150.
Bridge ES, Jones AW, Baker AJ: A phylogenetic framework for the terns (Sternini) inferred from mtDNA sequences: implications for taxonomy and plumage evolution. Mol Phylogenet Evol. 2005, 35 (2): 459-469. 10.1016/j.ympev.2004.12.010.
Pereira SL, Baker AJ: Multiple gene evidence for parallel evolution and retention of ancestral morphological states in the shanks (Charadriiformes: Scolopacidae). Condor. 2005, 107: 514-526. 10.1650/0010-5422(2005)107[0514:MGEFPE]2.0.CO;2.
Pereira SL, Baker AJ: DNA evidence for a Paleocene origin of the Alcidae (Aves: Charadriiformes) in the Pacific and multiple dispersals across northern oceans. Mol Phylogenet and Evol. 2008, 46 (2): 430-445. 10.1016/j.ympev.2007.11.020.
Baker AJ, Pereira SL, Haddrath OP, Edge KA: Multiple gene evidence for expansion of extant penguins out of Antarctica due to global cooling. Proc Biol Sci. 2006, 273 (1582): 11-17. 10.1098/rspb.2005.3260.
Burbidge ML, Colbourne RM, Robertson HA, Baker AJ: Molecular and other biological evidence supports the recognition of at least three species of brown kiwi. Conserv Genet. 2003, 4: 167-177. 10.1023/A:1023386506067.
The Canadian Barcode of Life Network. [http://www.BOLNET.ca]
Lovette IJ, Rubenstein DR: A comprehensive molecular phylogeny of the starlings (Aves: Sturnidae) and mockingbirds (Aves: Mimidae): congruent mtDNA and nuclear trees for a cosmopolitan avian radiation. Mol Phyl Evol. 2007, 44 (3): 1031-1056. 10.1016/j.ympev.2007.03.017.
Carson RJ, Spicer GS: A phylogenetic analysis of the emberizid sparrows based on three mitochondrial genes. Mol Phylogenet Evol. 2003, 29 (1): 43-57. 10.1016/S1055-7903(03)00110-6.
Pereira SL, Baker AJ, Wajntal A: Combined nuclear and mitochondrial DNA sequences resolve generic relationships within the Cracidae (Galliformes, Aves). Syst Biol. 2002, 51 (6): 946-958. 10.1080/10635150290102519.
Freeman S, Zink RM: A phylogenetic study of the blackbirds based on variation in mitochondrial DNA restriction sites. Syst Biol. 1995, 44 (3): 409-420. 10.2307/2413601.
Friesen VL, Anderson DJ: Phylogeny and evolution of the Sulidae (Aves:Pelecaniformes): a test of alternative modes of speciation. Mol Phylogenet Evol. 1997, 7 (2): 252-260. 10.1006/mpev.1996.0397.
Klicka J, Fry AJ, Zink RM, Thompson CW: A cytochrome-b perspective on Passerina bunting relationships. Auk. 2001, 118 (3): 610-623. 10.1642/0004-8038(2001)118[0610:ACBPOP]2.0.CO;2.
Klicka J, Zink RM: The importance of recent ice ages in speciation: a failed paradigm . Science. 1997, 277: 1666-1669. 10.1126/science.277.5332.1666.
Rusch KM, Thusius K, Ficken MS: The organization of agonistic vocalizations in Ruby-throated Hummingbirds with a comparison to Blackchinned Hummingbirds. Wilson Bulletin. 2001, 113 (4): 425-430. 10.1676/0043-5643(2001)113[0425:TOOAVI]2.0.CO;2.
Baltosser WH: Annual molt in Ruby-throated and Black-chinned Hummingbirds. Condor. 1995, 97 (2): 484-491. 10.2307/1369034.
Moore SM, Weibel AC, Agius A: Mitochondrial DNA phylogeny of the woodpecker genus Veniliornis (Picidae, Picinae) and related genera implies convergent evolution of plumage patterns. Biol J Linnean Soc. 2006, 87: 611-624. 10.1111/j.1095-8312.2006.00586.x.
Austin JJ, Bretagnolle V, Pasquet E: A global molecular phylogeny of the small Puffinus shearwaters and implications for systematics of the Little–audubon’s shearwater complex. Auk. 2004, 121 (3): 847-864. 10.1642/0004-8038(2004)121[0847:AGMPOT]2.0.CO;2.
Wink M, Sauer-Gürth H, Fuchs M: Phylogenetic relationships in owls based on nucleotide sequences of mitochondrial and nuclear marker genes.: 2004; Budapest, Hungary.Edited by: Chancellor RD, Meyburg BU. 2003, World Working Group on Birds of Prey and Owls and Birdlife Hungary, 890-
Livezey BC: Phylogeny and evolutionary ecology of modern seaducks (Anatidae: Mergini). Condor. 1995, 97: 233-255. 10.2307/1368999.
Pierce RJ: Family Recurvirostridae. Handbook of the Birds of the World. Edited by: del Hoyo J, Elliott A, Sargatal J. 1996, Barcelona , Lynx Edicions, 3: 821-
Yamada K, Nishida-Umehara C, Matsuda Y: Characterization and chromosomal distribution of novel satellite DNA sequences of the Lesser Rhea (Pterocnemia pennata) and the Greater Rhea (Rhea americana). Chromosome Res. 2002, 10 (6): 513-523. 10.1023/A:1020996431588.
Klicka J, Zink RM, Winker K: Longspurs and snow buntings: phylogeny and biogeography of a high-latitude clade (Calcarius). Mol Phylogenet Evol. 2003, 26 (2): 165-175. 10.1016/S1055-7903(02)00360-3.
Oyler-McCance SJ, Kahn NW, Burnham KP, Braun CE, Quinn TW: A population genetic comparison of large- and small-bodied sage grouse in colorado using microsatellite and mitochondrial DNA markers. Mol Ecol. 1999, 8 (9): 1457-1465. 10.1046/j.1365-294x.1999.00716.x.
Shawkey MD, Balenger SL, Hill GE, Johnson LS, Keyser AJ, Siefferman L: Mechanisms of evolutionary change in structural plumage coloration among bluebirds (Sialia spp.). J R Soc Interface. 2006, 3 (9): 527-532. 10.1098/rsif.2006.0111.
Bonaccorso E, Peterson AT: A multilocus phylogeny of New World jay genera. Mol Phylogenet Evol. 2007, 42 (2): 467-476. 10.1016/j.ympev.2006.06.025.
Drovetski SV: Molecular phylogeny of grouse: individual and combined performance of W-linked, autosomal, and mitochondrial loci. Syst Biol. 2002, 51 (6): 930-945. 10.1080/10635150290102500.
Cohen BL, Baker AJ, Blechschmidt K, Dittmann DL, Furness RW, Gerwin JA, Helbig AJ, de Korte J, Marshall HD, Palma RL, Peter HU, Ramli R, Siebold I, Willcox MS, Wilson RH, Zink RM: Enigmatic phylogeny of skuas (Aves:Stercorariidae). Proc Biol Sci. 1997, 264 (1379): 181-190. 10.1098/rspb.1997.0026.
Lanyon SM, Omland KE: A molecular phylogeny of the blackbirds (Icteridae): Five lineages revealed by cytochrome-b sequence data. Auk. 1999, 116 (3): 629-639.
Omland KE, Lanyon SM, Fritz SJ: A molecular phylogeny of the New World orioles (Icterus): the importance of dense taxon sampling. Mol Phylogenet Evol. 1999, 12 (2): 224-239. 10.1006/mpev.1999.0611.
Whittingham LA, Sheldon FH, Emlen ST: Molecular phylogeny of jacanas and its implications for morphologic and biogeographic evolution. Auk. 2000, 117 (1): 22-32. 10.1642/0004-8038(2000)117[0022:MPOJAI]2.0.CO;2.
Lucchini V, Hoglund J, Klaus S, Swenson J, Randi E: Historical biogeography and a mitochondrial DNA phylogeny of grouse and ptarmigan. Mol Phylogenet Evol. 2001, 20 (1): 149-162. 10.1006/mpev.2001.0943.
Thomas GH, Wills MA, Szekely T: A supertree approach to shorebird phylogeny. BMC Evol Biol. 2004, 4: 28-10.1186/1471-2148-4-28.
We thank Rebecca Elbourne for providing some of the sequences, and the Zoological Museum University of Copenhagen, South Australian Museum, Louisiana State University Museum of Natural Science, Field Museum of Natural History, University of Michigan Museum of Zoology, American Museum of Natural History, Burke Museum of Natural History and Culture, and Bell Museum of Natural History for kindly permitting us to barcode loaned samples. We thank two anonymous referees for useful comments on the manuscript. This work was supported by funding through the Canadian Barcode of Life Network from Genome Canada through the Ontario Genomics Institute, NSERC, and other sponsors , and the ROM Governors' Fund.
AJB and EST designed the scope of the research. EST carried out the lab work, data assembly, and analysis except for coalescent simulations in IM which were done by AJB. Both authors wrote and approved the final manuscript.
Erika S Tavares and Allan J Baker contributed equally to this work.
Electronic supplementary material
Additional file 1: Sister-species differences in COI barcode sequences. Sister-species pairs and sampling, fixed substitutions (fixed subst.), bootstrap support, mean interspecific K2P distances (Dinter), and maximum intraspecific K2P distance within each species (Dintra). (XLS 18 KB)
Additional file 2: Neighbor-joining tree topology constructed from DNA barcodes of sister species of birds. Neighbor-joining tree topology of ~650 bp of mitochondrial gene COI, under the K2P model and pairwise deletion. File in nexus format, opens in TreeView. (TRE 47 KB)
Authors’ original submitted files for images
About this article
Cite this article
Tavares, E.S., Baker, A.J. Single mitochondrial gene barcodes reliably identify sister-species in diverse clades of birds. BMC Evol Biol 8, 81 (2008). https://doi.org/10.1186/1471-2148-8-81