Skip to main content
  • Research article
  • Open access
  • Published:

Phylogenetic analysis and temporal diversification of mosquitoes (Diptera: Culicidae) based on nuclear genes and morphology



Phylogenetic analyses provide a framework for examining the evolution of morphological and molecular diversity, interpreting patterns in biogeography, and achieving a stable classification. The generic and suprageneric relationships within mosquitoes (Diptera: Culicidae) are poorly resolved, making these subjects difficult to address.


We carried out maximum parsimony and maximum likelihood, including Bayesian, analyses on a data set consisting of six nuclear genes and 80 morphological characters to assess their ability to resolve relationships among 25 genera. We also estimated divergence times based on sequence data and fossil calibration points, using Bayesian relaxed clock methods. Strong support was recovered for the basal position and monophyly of the subfamily Anophelinae and the tribes Aedini and Sabethini of subfamily Culicinae. Divergence times for major culicid lineages date to the early Cretaceous.


Deeper relationships within the family remain poorly resolved, suggesting the need for additional taxonomic sampling. Our results support the notion of rapid radiations early in the diversification of mosquitoes.


Mosquitoes (Diptera: Culicidae) are a monophyletic group of true flies [14], recognizable by their elongate adult mouthparts through which the females of most species feed on vertebrate blood. Mosquitoes occur throughout temperate and tropical regions, and well beyond the Arctic Circle, but are most diverse in tropical forest environments [5]. A bewildering amount of morphological diversity parallels their spectacular radiation into virtually every conceivable collection of water, ranging from a few droplets trapped by plant parts to large bodies of fresh and brackish surface water, making mosquitoes "as ubiquitous as water" [6]. The relationship between human health and those species that are medically important (<200 of 3,524 currently recognized; has driven most mosquito research. Within this small subset of disease vector species, morphological similarities between close relatives (e.g., cryptic or sibling species complexes) continue to pose practical and academic challenges to disease control, conventional taxonomy, and phylogenetic inference. Ironically, it is not the morphological similarity but rather the morphological diversity of mosquitoes that has confounded efforts to delimit many supraspecific groups and reconstruct their evolutionary history.

The Culicidae had an ancient origin, probably in the Jurassic [7, 8], consistent with the fossil record of their sister group Chaoboridae [9]. Unfortunately, the sparse mosquito fossil record sheds no light on evolutionary relationships in the family. Traditional classification of Culicidae based on the phenetic framework of Edwards [7] resulted in arbitrary groupings reflecting intuitive interpretation of morphological similarities. More modern classifications have incorporated important revisions of select genera and tribes based on explicit methodology, but in the absence of a comprehensive application of quantitative methods across the family, the result still does not entirely reflect evolutionary history [reviewed in [2]]. Current classification divides the family into two subfamilies (Anophelinae and Culicinae), 11 tribes, and a minimum of 44 genera [2, 1013] (Table 1).

Table 1 Species of Culicidae included in the phylogenetic analysis, with reference to their classification and distribution [after 2].

Generic-level relationships across all Culicidae have rarely been studied. The first attempt [14] was based on comparative bionomics and morphology using intuitive methods typical of that time. Surprisingly, further attempts using modern cladistic methods were not made for nearly 50 years. The most comprehensive of these phylogenetic re-analyses employed 73 morphological characters to examine the relationships of the 38 genera then recognized [1]. In general, almost none of the hypotheses raised by the 1951 phylogeny were supported in the 1998 reconstruction, with few exceptions including the monophyletic and basal position of the subfamily Anophelinae, and the monophyly of the tribes Sabethini and Culicini. However, most characters were homoplastic - some extensively - and many relationships were inadequately resolved. Although the Harbach and Kitching [1] study challenged traditional generic groupings and reinforced the need for reappraisal, it also suggested that robust recovery of generic-level relationships of Culicidae may be difficult with morphological characters alone.

Only four higher-level phylogenies of Culicidae based on gene sequences have been published, each of which were very taxon-limited in scope. All were able to show that, in agreement with the morphological phylogenies, Anopheles was sister to other sampled genera [4, 1517]. The study by Miller et al. [4], based on four mosquito species, was inconclusive and the more comprehensive study of Shepard et al. [15], based on 18S rDNA sequences of 39 species representing nine genera, was unable to resolve deeper relationships. Of the four studies, only Besansky and Fahey [16] employed a single-copy nuclear protein coding gene to assess relationships. Their study, based on the white gene, included 13 species representing nine genera. When third codon positions were excluded, Anophelinae was recovered as a basal lineage and Sabethini, Culicini and Aedini were recovered as monophyletic, suggesting the potential of protein-coding sequences for reconstructing generic-level relationships within Culicidae.

The importance of sampling multiple genes when attempting to reconstruct species phylogenies is well recognized [18]. Mitochondrial DNA and nuclear ribosomal DNA are convenient targets, due to conserved primer binding sequences and ease of amplification based on their typically high copy number. However, both can be problematic for resolving deep phylogenetic relationships. Mitochondrial DNA may exhibit a high mutation rate, skewed base composition, and even symbiont-induced biases [19, 20]. Beyond base compositional bias, ribosomal DNA also can be exceedingly difficult to align [21]. Given the increasing availability of completely sequenced mosquito genomes, protein-coding nuclear genes represent a viable alternative as well as a rich and largely untapped resource.

In the study reported here, we explored the phylogenetic utility of six nuclear protein-coding genes: arginine kinase, CAD, catalase, enolase, hunchback, and white. As noted above, only white was used previously in mosquitoes [16]. All except catalase have been used in other insect groups: CAD in bees, empidoid flies, and lacewings, among others [2224]; arginine kinase in hymenopterans [25, 26]; enolase in beetles [27]; and hunchback in Hawaiian drosophilids [28]. We sequenced these six genes from 26 mosquito species representing 25 genera, and two chaoborid outgroup species. In addition, 80 morphological characters were scored from these mosquito and outgroup species. Our goals were twofold: (1) to estimate a generic-level phylogeny of Culicidae based on molecular and morphological evidence, and (2) to use fossils and sequence data to infer divergence times for major culicid lineages.


Taxon sampling

Twenty-six species of mosquitoes representing 25 genera were used as ingroup taxa (Table 1). Two chaoborid midges, Eucorethra underwoodi and Chaoborus astictopus, were used as outgroup taxa based on the sister-group relationship between Chaoboridae and Culicidae [3, 29, 30]. Specimens were preserved in 70-100% ethanol at -20°C. Sampling of additional ingroup genera for this study was precluded by unavailability of specimens adequately preserved for molecular analysis; DNA from pinned museum specimens or specimens stored at room temperature for prolonged periods was found to be excessively degraded.

DNA extraction, PCR, and sequencing

Sequences were obtained from six nuclear protein-coding genes (arginine kinase, CAD, catalase, enolase, hunchback, and white) from VectorBase for the three mosquito species with completely sequenced genomes (Anopheles gambiae, Culex quinquefasciatus, and Aedes aegypti), from GenBank for available white sequences (GenBank accession numbers U73829, AF318199, AF318200, U73834, U73827, AF318206, U73835, U73837, AY055811, AF318193, AF318209), or by PCR amplification and direct sequencing. Genes were amplified and sequenced using degenerate primers previously reported in the literature, or designed based on amino acid alignments of respective genes from the three sequenced mosquito species. Primer sequences and their sources are provided in Table 2. Primers were located in regions that would allow amplification within a single large exon, or across exons separated by small introns, to facilitate amplification from genomic DNA templates. Primers were hemi-nested, whereby the first round of amplification based on the outermost pair was followed by alternative second round PCR from which two internal, overlapping fragments were amplified (Figure 1).

Table 2 Primers used in this study.
Figure 1
figure 1

Approximate location and orientation of primers (black arrows) with respect to their target exons, in the context of gene models (or portions thereof) based on the An. gambiae genome. Models were exported from the Transcript View available in VectorBase; protein-coding exons are indicated by black rectangles with Roman numerals; introns by black lines. Gene names and corresponding An. gambiae gene ID (AGAP number) are provided at top left. Not to-scale.

Genomic DNA was extracted from whole mosquitoes using the DNeasy kit (Qiagen, Valencia, CA). PCR reactions (50 μl) contained 20 mM Tris-HCl (pH 8.4), 50 mM KCl, 1.5 mM MgCl2, 200 μM each dNTP, 2.5 U Taq polymerase, 2.4% DMSO, 0.5% BSA, 25 pmol each primer, and 1 μl template DNA (~1/50th-1/500th of the amount extracted from a single mosquito). Amplifications consisted of 1 cycle at 93°C for 1 min; 35 cycles of 94°C for 20 sec, 45°C for 20 sec, and 72°C for 2 min; and a final extension cycle of 72°C for 5 min. After inspection of an aliquot by electrophoresis through a 1.5% agarose gel, products of the expected size were excised from the gel and purified using GeneClean Kit (MP Biomedicals, Irvine, CA), or sequenced directly if only one band was observed. Excess primers and dNTPs were removed by adding 2 U of exonuclease 1, 1 U of shrimp alkaline phosphatase, and 1.8 μl of H2O to 8 μl of PCR product, incubating at 37°C for 15 min, and inactivating at 80°C for 15 min before sequencing. Sequencing was carried out using an Applied Biosystems 3730 × l DNA Analyzer and Big Dye Terminator v3.1 chemistry. Electropherograms were inspected and trimmed using SeqMan II (DNASTAR, Madison, WI). Sequences were deposited in GenBank under accession numbers: arginine kinase, GQ906806-GQ906829; CAD, GQ906830-GQ906854; catalase, GQ906855-GQ906874; enolase, GQ906879-GQ906901; hunchback GQ906902-GQ906926; white, GQ906927-GQ906937.

Morphological characters

Morphological structures were examined in the adult, pupal and larval (fourth-instar) stages. Heads were removed from adult mosquitoes for comparative studies of structures not readily visible in intact specimens. Heads were cleared in 5% sodium hydroxide solution, stained in acid fuchsin and mounted frontodorsal side uppermost in Euparal on microscope slides. Pinned adults were examined under simulated natural light. Dissected genitalia, larvae, and larval and pupal exuviae were studied with differential interference contrast optics. The morphological terminology follows Harbach and Knight [31, 32] and Harbach and Kitching [1].

The 28 species were coded for 80 characters (see additional file 1: FileS1) derived from fourth-instar larvae (24), pupae (12), adults (36), female genitalia (1), and male genitalia (7). The data matrix is shown in Additional file 2: TableS2. Characters were coded from direct observations except in a few cases where structures were missing from available specimens. Some missing data for the immature stages of Chaoborus astictopus were coded from literature sources. Characters that could not be scored due to missing data or absence of homologous structures (e.g., character 20 in three anophelines and two chaoborid species that lack a siphon) were denoted by a "?". All multistate characters were treated as unordered. Most of the structures and states of characters are illustrated in Harbach & Knight [31] and Harbach & Kitching [1].

Phylogenetic analysis

Nucleotide sequence alignments were guided by the corresponding amino acid alignments, using utilities within the program suite EMBOSS [33]; Inferred amino acid sequences were aligned using the program "emma," which provides an interface to ClustalW [34]. The Gonnet matrix [35] was chosen and the resulting alignment was followed by limited manual adjustment. "Tranalign" was then used to align the nucleotide sequence based on the previously aligned amino acid sequence. Introns were removed from aligned gene regions before analysis.

Basic sequence information (pairwise sequence divergence, base composition, statistical tests of homogeneity of base composition, number of variable and parsimony informative characters) was obtained using PAUP* v4.0b10 [36]. In addition, plots of transitions and transversions versus divergence at each codon position were based on observed (uncorrected) p-distances from PAUP*.

Maximum parsimony (MP) analyses were implemented in PAUP* on a phylogenetic data set that included concatenated genes with/without morphological characters. Third codon positions for each gene were removed and gaps were treated as missing data. Heuristic searches consisted of 1000 random sequence additions with tree bisection-reconnection (TBR) branch swapping. Bootstrap support values were based on 500 replicates, each with 10 random additions and TBR branch swapping.

Maximum likelihood (ML) analyses were performed on molecular data sets only, which included both individual/concatenated genes, with/without third codon positions. The ML heuristic searches were performed in PAUP*, using the model of nucleotide substitution and parameter values selected via Modeltest [37]. Values for the substitution matrix, base composition, gamma distribution of among-site rate variation (G) and the proportion of invariant sites (I) are available from the authors on request. Bootstrap resampling was conducted using 1000 replicate neighbour-joining (NJ) trees based on the ML substitution matrix. The Shimodaira-Hasegawa test [[38]; data not shown] was used to test for incongruence between phylogenies suggested by individual genes, or successive combinations of congruent genes.

Bayesian (BI) phylogenetic tree searches were performed in MrBayes 3.1.2 [39] on concatenated gene and gene + morphology data sets using aligned nucleotides, both including and excluding third codon positions, and using concatenated aligned amino acids. For concatenated genes (nucleotides) and genes (nucleotides) + morphology, a mixed model approach was used with model parameters specified per gene partition according to Modeltest and a Markov K + G model for morphology, with branch lengths unlinked and estimated for each partition [40, 41]. Bayesian tree searches using aligned amino acids were carried out in MrBayes 3.1.2 using the WAG model of amino acid evolution (WAG+I+G) [42]. Each Bayesian search was carried out for 10,000,000 generations (sampling every 1000) using four chains (default heating parameters) and a 30% burn-in value. The included Bayesian sets of trees were sampled after likelihood scores reached convergence and the mean split difference values were below 0.02.

Divergence time estimation

Estimates of divergence times for mosquito lineages were calculated using the parametric Bayesian-relaxed clock approach implemented in the programs ESTBRANCHES and MULTIDIVTIME [43] and using the combined gene data set (nucleotides) including third codon positions. Branch lengths and evolutionary rate priors were estimated from the data using the BASEML program in the PAML software package [44] and ESTBRANCHES. Tree topology, minimum and maximum root node age, and fossil-based minimum age constraints are set as user-defined analysis priors. For the tree topology we used the tree recovered from the BI search of combined amino acids (see Phylogenetic Analyses, below). The root max-min age prior between Culicidae and outgroups was set as 230-187 Ma corresponding to the hypothesized age of the Diptera [6, 8] and a fossil assignable to the Chaoboridae [187 Ma; ref [9]], and three lineages were constrained according to fossil-based minimum ages (Toxorhynchites mexicanus, 16 Ma; Culex winchesteri, 34 Ma; Anopheles dominicanus, 34 Ma;[45]. We followed the analytical procedure described in Rutschmann et al. [46] and in the MULTIDIVTIME readme files, and ran the Markov chain for 1.1 × 106 cycles with samples collected every 100 cycles and discarded the first 100,000 cycles as burn-in. We performed the MULTIDIVTIME analysis multiple times from different initial conditions to confirm convergence of the Markov chain on highly similar resulting time estimates and posterior intervals.

Results and Discussion

Sequence variation

Across six genes, the molecular data matrix consisted of 5352 aligned characters, of which 2839 were variable and 2259 were parsimony informative (Table 3). Not surprisingly, most of the variation was found in the third codon (nt3) position. Analysis of base composition of the combined genes for each major taxonomic grouping revealed significant departures from homogeneity at the nt3 position, owing to three groups: Anophelinae, Culicini (Cx. quinquefasciatus) and Ficalbiini (Mimomyia luzonensis) (Table 4). Moreover, plots of transitions and transversions against uncorrected pairwise nucleotide divergences at each codon position suggested saturation of transitions at the nt3 position (Figure 2). These results prompted us to perform ML and BI phylogenetic analyses both with and without the nt3 partition, and to exclude this partition in MP analyses (see Phylogenetic analyses).

Table 3 Character information for genes used in this study.
Table 4 GC content and compositional heterogeneity of major taxonomic groupings based on the species analyzed.
Figure 2
figure 2

Numbers of transitions or transversions at each codon position (nt1, nt2 and nt3) plotted against uncorrected nucleotide divergence for pairwise species comparisons across the combined six-gene data set.

Uncorrected pairwise sequence divergence within the mosquitoes sampled ranged widely, from 10-27% (summarized by major taxonomic groupings in Table 5). However, the average distance between tribes and subfamilies was at the upper end of the range (20% and 23%, respectively), approaching that between mosquitoes and their sister group, the chaoborid midges (26%).

Table 5 Mean pairwise uncorrected p-distances (%) across all six genes.

Phylogenetic analyses

Evidence was found for incongruence among some genes or gene combinations via the Shimodaira-Hasegawa test [38]; data not shown. However, examination of phylogenies resulting from individual genes revealed that topological incongruence was generally limited to certain poorly supported nodes. It has been suggested that different data sets may have a common phylogenetic signal recoverable only upon combined analysis [47, 48], and, under the hypothesis that combining data from multiple genes may potentially overcome misleading signal in individual genes [49], we conducted further analyses to compare results from a concatenated data set with results obtained from ML and BI.

Relationships inferred by MP and ML, including BI, are summarized in Figure 3 and Table 6. All three algorithms, as applied to various data partitions (± nt3; ± morphological characters; nucleotides or amino acids), gave overwhelming support for the monophyly of Culicidae (node O), the monophyly and basal position of the subfamily Anophelinae (node A; gray box), and the monophyly of the tribe Sabethini (node I; gray box). Less conclusive support by ML, but reasonable support by MP and BI, was observed for the monophyly of the tribe Aedini (node C). These results confirm the conclusions of Harbach [2] regarding what was already known about the phylogeny of mosquitoes.

Table 6 Bootstrap support or posterior probabilities for relationships inferred within Culicidae based on combined gene sequences, with or without morphological characters. Nodes refer to Figure 3.
Figure 3
figure 3

Phylogram of relationships among mosquito species, inferred by Bayesian likelihood analysis of combined amino acids. Amount of inferred character change is indicated by the scale bar below. Numbers associated with nodes are Bayesian posterior probabilities above 0.5. Letters associated with nodes refer to bootstrap support values and posterior probabilities estimated from alternative analyses, provided in Table 6. Gray-shaded boxes enclose (from top to bottom) Anophelinae, Aedini and Sabethini.

Support varied for relationships within these well-supported clades. The subfamily Anophelinae was represented in this study by three species from two genera: Bironella (Bironella gracilis) and Anopheles (Anopheles atroparvus, subgenus Anopheles; An. gambiae, subgenus Cellia). Not all analyses supported the monophyly of the genus Anopheles; an alternative relationship of Bi. gracilis + An. atroparvus was also recovered. There is precedence for the paraphyly of Anopheles relative to Bironella in previous morphological [50, 51] and molecular [52] studies. Reliable inference of relationships between these groups may be problematic due to conflicting signals or contemporaneous radiations, but the suggestion of Sallum et al. [50] to redefine Bironella as an informal group within Anopheles seems premature [2, 53].

Within tribe Sabethini, Malaya occupied the most basal position among the taxa sampled, although this placement was not recovered in a subset of BI analyses. The genus Maorigoeldia, containing only a single species exclusive to New Zealand, was sister to Tripteroides (Oriental, Australasian and Palaearctic species), in all cases with 100% bootstrap support or posterior probability of 1.0. These relationships were not recovered in cladistic analyses of morphological data that included representatives of a larger number of genera. Belkin [54] regarded Maorigoeldia to be sister to all other sabethine species. This was supported in the studies of Harbach and Kitching [1], Harbach and Peyton [55], and Harbach et al. [56], but not in the study of Judd [57], which placed Maorigoeldia as the sister group to the New World genera of Sabethini. Whereas Malaya was recovered as the sister of genus Topomyia in the first two of these four studies, it was paired with Limatus in the most derived clade of Sabethini when Harbach et al.[56] included the new genus Kimia in the data set of Harbach and Peyton [55]. Also, Tripteroides (Old World) was recovered as sister to Trichoprosopon (New World), which is supported by shared morphological characters that are unique to these two genera.

Decisive support for monophyly of the New World genera (Limatus, Sabethes, Shannoniana, Trichoprosopon, and Wyeomyia) was found, in agreement with previous studies [2, 57]. Among these genera, a close relationship between Limatus, Sabethes and Wyeomyia was strongly supported by all analyses, but other nodes were unstable. Although not apparent in Figure 3, Shannoniana and Trichoprosopon showed a sister relationship in all but the BI amino acid analysis. However, as indicated in the previous paragraph, the results of cladistic analyses based on morphological data and a larger sample of sabethine taxa casts doubt on these relationships.

The remarkably large tribe Aedini (1255 species, has been the subject of recent efforts to infer higher-level relationships based on morphological characters of all life stages [1013]. Although this has resulted in major changes to classification, phylogenetic resolution has been limited. In the present study, as in the cladistic analyses of extensive morphological data by Reinert et al. [1113], Psorophora was recovered as sister to all other Aedini. Sister-group relationships strongly supported in most cases were Aedes (Stegomyia) + Eretmapodites and Haemagogus + Ochlerotatus. Other relationships within Aedini were less clear. Moreover, no consensus could be reached regarding affinities of any other genera within Culicidae as a whole, outside of Aedini and Sabethini.

Divergence time estimates

A chronogram for Culicidae is given in Figure 4, and corresponding divergence time estimates are provided in Table 7. Based on the taxa sampled and three fossil constraints, earliest divergence within mosquitoes - between the lineages leading to Culicinae and Anophelinae - dates to ~226 Ma. This estimate is in reasonable agreement with Krzywinski et al. [58], who determined the split between Anopheles and Aedes (Stegomyia) to have occurred ~145-200 Ma based on mitochondrial DNA sequences. Although 226 Ma is substantially older than the 118 Ma divergence between Chaoboridae and Culicidae estimated by Bertone et al. [8], the 95% credibility ranges overlap between studies. Because the latter study was aimed at deeper divergences within lower Diptera, Bertone et al. [8] only included two mosquitoes and one chaoborid, possibly accounting for the discrepancy. Moreover, Bertone et al. [8] estimated divergence times from a single gene (28S rDNA) and used only a few fossil calibration points (none close to Culicidae), which may also have contributed to differences in age estimates. We favor the older divergence estimates, as they are consistent with other evidence suggesting that mosquitoes likely originated in the Jurassic [7, 59]. As early as 1923, Edwards [60] surmized that the "origin and phylogenetic history of the Culicidae must go back to well into the Mesozoic Era."

Table 7 Divergence time estimates and credibility intervals (Ma) for nodes in Figure 4.
Figure 4
figure 4

Chronogram of mosquito age divergences with 95% confidence intervals (red bars) Numerical node ages and their 95% confidence intervals are presented in Table 7. Calibration points: Chaoboridae+Culicidae, 230-187 Ma [8, 9]; Toxorhynchites mexicanus fossil, ≥ 16 Ma [66]; Culex winchesteri fossil, ≥ 34 Ma [67]; Anopheles dominicanus fossil, ≥ 34 Ma [45].

Unexpectedly, the split between Anopheles and Bironella was a remarkably shallow 51 Ma. This estimate contradicts Krzywinski et al. [58], who estimated a substantially greater (90-106 Ma) divergence between the Anopheles subgenera, Anopheles and Cellia, represented in this study by An. atroparvus and An. gambiae, respectively. To what extent this incongruence can be explained by limited taxon sampling or biases in the molecules used to infer divergence dates (e.g., strong base composition bias in Anophelinae; Table 4), is presently unknown. However, given the universal agreement that Anopheles occupies an early-branching position among the Culicidae, it seems likely that our crown-group estimates do not accurately reflect the age of this group.

The only well-supported clades of the subfamily Culicinae in the phylogenetic analyses are the tribes Aedini and Sabethini, which apparently arose at similar times (roughly 112 and 115 Ma, respectively) and diversified more recently. Eight genera of the subfamily Culicinae (Aedeomyia, Coquillettidia, Culiseta, Culex, Mimomyia, Orthopodomyia, Toxorhynchites, Uranotaenia), whose relationships were not strongly or consistently recovered, represent the deeper internal branches of the tree. The nodes connecting these branches are not only ancient (exceeding 127 Ma), but also relatively close together in time, occurring within a ~30 million year interval between 127-158 Ma. If these estimates are corroborated in the future by denser taxon sampling and more fossil-based age constraints, they will support the notion of rapid radiations early in the diversification of mosquitoes, potentially explaining the difficulty in attaining a stable phylogeny for these lineages. An early Cretaceous timing for these rapid radiations is consistent with the appearance of angiosperms, a group of plants whose nectar is exploited as an energy source by mosquitoes [61], and whose water-filled parts are the sole habitats occupied by the immature stages of many groups of mosquitoes, notably members of the tribe Sabethini [54].


This study represents one of the few attempts to reconstruct generic-level relationships within Culicidae as a whole, and the only attempt to combine morphological data and molecular characters from multiple genes. Among molecular phylogenetic studies of the family, it more than doubled the number of taxa sampled to date. Yet results were mixed. The ability to recover previously known clades (Anophelinae, Sabethini, and Aedini) was encouraging. However, the deeper relationships among genera could not be resolved unambiguously, potentially due to ancient and rapid radiation, as hypothesized for other insect groups [6]. There has been much debate regarding whether better resolution and support of relationships is achieved through broader taxonomic sampling [62, 63] or sequencing of more loci [64]. The current explosion of sequencing whole genomes from organisms, including mosquitoes, promises many potentially informative genes beyond those included here. On the other hand, the recent study by Wiegmann et al. [49] successfully resolved even deeper divergences and a longstanding controversy in the phylogeny of holometabolous insects, using only six single-copy nuclear genes comprising a similar number of base pairs to that compiled for the present study. Although more molecular, as well as morphological, characters may well prove useful, there is little doubt that broader taxonomic sampling is now the key roadblock. Considering that the mosquito diversity housed in museums is almost invariably preserved in a fashion that has impeded conventional molecular data collection, this roadblock may be substantial. There is an urgent need for fresh museum collections, particularly from under-sampled yet high-biodiversity regions worldwide, and their cryo- or ethanol preservation with vouchers. The present study was limited by what was available in existing collections. Broader taxon sampling is crucial not merely because it may help break up long branches. To the extent that the current generic system of classification includes paraphyletic and polyphyletic groups containing numerous species, it is clear that inclusion of only one or few generic exemplars can be misleading, and that more representative sampling is needed.





million years ago


maximum likelihood


maximum parsimony




polymerase chain reaction.


  1. Harbach RE, Kitching IJ: Phylogeny and classification of the Culicidae (Diptera). Syst Entomol. 1998, 23: 327-370. 10.1046/j.1365-3113.1998.00072.x.

    Article  Google Scholar 

  2. Harbach RE: The Culicidae (Diptera) a review of taxonomy classification. Zootaxa. 2007, 1668: 591-638.

    Google Scholar 

  3. Wood DM, Borkent A: Phylogeny and classification of the Nematocera. Agriculture Canada Monograph. 1989, 1333-1370.

    Google Scholar 

  4. Miller BR, Crabtree MB, Savage HM: Phylogenetic relationships of the Culicomorpha inferred from 18S and 5.8S ribosomal DNA sequences. (Diptera Nematocera). Insect Mol Biol. 1997, 6: 105-114. 10.1111/j.1365-2583.1997.tb00078.x.

    Article  CAS  PubMed  Google Scholar 

  5. Foley DH, Rueda LM, Wikerson RC: Insight into global mosquito biogeography from country species records. J Med Entomol. 2007, 44: 554-567. 10.1603/0022-2585(2007)44[554:IIGMBF]2.0.CO;2.

    Article  PubMed  Google Scholar 

  6. Grimaldi D, Engel MS: Evolution of the Insects. 2005, Cambridge: Cambridge University Press

    Google Scholar 

  7. Edwards FW: Genera Insectorum. Diptera, Fam. Culicidae. Fascicle 194. 1932, Desmet-Verteneuil, Brussels

    Google Scholar 

  8. Bertone MA, Courtney GW, Wiegmann BM: Phylogenetics and temporal diversification of the earliest true flies (Insecta: Diptera) based on multiple nuclear genes. Syst Entomol. 2008, 33: 668-687. 10.1111/j.1365-3113.2008.00437.x.

    Article  Google Scholar 

  9. Borkent A: A world catalogue of fossil and extant Corethrellidae and Chaoboridae (Diptera), with a listing of references to keys, bionomic information and descriptions of each known life stage. Entomol Scand. 1993, 24: 1-24.

    Article  Google Scholar 

  10. Reinert JF, Harbach RE, Kitching IJ: Phylogeny and classification of Aedini (Diptera: Culicidae), based on morphological characters of all life stages. Zool J Linn Soc-Lond. 2004, 142: 289-368. 10.1111/j.1096-3642.2004.00144.x.

    Article  Google Scholar 

  11. Reinert JF, Harbach RE, Kitching IJ: Phylogeny and classification of Ochlerotatus and allied taxa (Diptera: Culicidae: Aedini) based on morphological data from all life stages. Zool J Linn Soc. 2008, 153: 29-114. 10.1111/j.1096-3642.2008.00382.x.

    Article  Google Scholar 

  12. Reinert JF, Harbach RE, Kitching IJ: Phylogeny and classification of Finlaya and allied taxa (Diptera: Culicidae: Aedini) based on morphological data from all life stages. Zool J Linn Soc. 2006, 148: 1-101. 10.1111/j.1096-3642.2006.00254.x.

    Article  Google Scholar 

  13. Reinert JF, Harbach RE, Kitching IJ: Phylogeny and classification of tribe Aedini (Diptera: Culicidae). Zool J Linn Soc. 2009, 157: 700-794. 10.1111/j.1096-3642.2009.00570.x.

    Article  Google Scholar 

  14. Ross HH: Conflict with Culex. Mosq News. 1951, 11: 128-132.

    Google Scholar 

  15. Shepard JJ, Andreadis TG, Vossbrinck CR: Molecular phylogeny and evolutionary relationships among mosquitoes (Diptera: Culicidae) from the northeastern United States based on small subunit ribosomal DNA (18S rDNA) sequences. J Med Entomol. 2006, 43: 443-454. 10.1603/0022-2585(2006)43[443:MPAERA]2.0.CO;2.

    Article  CAS  PubMed  Google Scholar 

  16. Besanky NJ, Fahey GT: Utility of the white gene in estimating phylogenetic relationships among mosquitoes (Diptera: Culicidae). Mol Biol Evol. 1997, 14: 442-454.

    Article  Google Scholar 

  17. Mitchell A, Sperline FAH, Hickey DA: Higher-level phylogeny of mosquitoes (Diptera: Culicidae): mtDNA data support a derived placement for Toxorhynchites. Insect Syst Evol. 2002, 33: 163-174.

    Article  Google Scholar 

  18. Rannala B, Yang Z: Phylogenetic inference using whole genomes. Annu Rev Genomics Hum Genet. 2008, 9: 217-231. 10.1146/annurev.genom.9.081307.164407.

    Article  CAS  PubMed  Google Scholar 

  19. Lin CP, Danforth BN: How do insect nuclear and mitochondrial gene substitution patterns differ? Insights from Bayesian analyses of combined datasets. Mol Phylogenet Evol. 2004, 30: 686-702. 10.1016/S1055-7903(03)00241-0.

    Article  CAS  PubMed  Google Scholar 

  20. Hurst GDD, Jiggins FM: Problems with mitochondrial DNA as a marker in population, phylogeographic and phylogenetic studies: the effects of inherited symbionts. Proc R Soc B. 2005, 272: 1525-1534. 10.1098/rspb.2005.3056.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  21. Danforth BN, Lin C-P, Fang J: How to insect nuclear ribosomal genes compare to protein-coding genes in phylogenetic utility and nucleotide substitution patterns?. Syst Entomol. 2005, 30: 549-562. 10.1111/j.1365-3113.2005.00305.x.

    Article  Google Scholar 

  22. Danforth BN, Fang J, Sipes S: Analysis of family-level relationships in bees (Hymenoptera: Apiformes) using 28S and two previously unexplored nuclear genes: CAD and RNA polymerase II. Mol Phylogenet Evol. 2006, 39: 358-372. 10.1016/j.ympev.2005.09.022.

    Article  CAS  PubMed  Google Scholar 

  23. Moulton JK, Weigmann BM: Evolution and phylogenetic utility of CAD (rudimentary) among Mesozoic-aged Eremoneuran Diptera (Insecta). Mol Phylogenet Evol. 2004, 31: 363-378. 10.1016/S1055-7903(03)00284-7.

    Article  CAS  PubMed  Google Scholar 

  24. Winterton S, Freitas Sd: Molecular phylogeny of the green lacewings (Neuroptera: Chrysopidae). Aust J Entomol. 2006, 45: 235-243. 10.1111/j.1440-6055.2006.00537.x.

    Article  Google Scholar 

  25. Banks JC, Whitfield JB: Dissecting the ancient rapid radiation of microgastrine wasp genera using additional nuclear genes. Mol Phylogenet Evol. 2006, 41: 690-703. 10.1016/j.ympev.2006.06.001.

    Article  CAS  PubMed  Google Scholar 

  26. Hines HM: Historical biogeography, divergence times, and diversification patterns of bumble bees (Hymenoptera: Apidae: Bombus). Syst Biol. 2008, 57: 58-75. 10.1080/10635150801898912.

    Article  PubMed  Google Scholar 

  27. Wild AL, Maddisin DR: Evaluating nuclear protein-coding genes for phylogenetic utility in beetles. Mol Phylogenet Evol. 2008, 48: 877-891. 10.1016/j.ympev.2008.05.023.

    Article  CAS  PubMed  Google Scholar 

  28. Baker RH, DeSalle R: Multiple sources of character information and the phylogeny of Hawaiian drosophilids. Syst Biol. 1997, 46: 654-673.

    Article  CAS  PubMed  Google Scholar 

  29. Oosterbroek P, Courtney G: Phylogeny of the Nematocerous families of Diptera (Insecta). Zool J Linn Soc. 1995, 115: 267-311.

    Article  Google Scholar 

  30. Pawlowski J, Szadziewski R, Kmieciak D, Fahrni J, Bittar G: Phylogeny of the infraorder Culicomorpha (Diptera: Nematocera) based on 28S RNA gene sequences. Syst Entomol. 1996, 21: 167-178. 10.1046/j.1365-3113.1996.d01-5.x.

    Article  Google Scholar 

  31. Harbach RE, Knight KL: Taxonomists' glossary of mosquito anatomy. 1980, Marlton, New Jersey: Plexus Publishing, Inc

    Google Scholar 

  32. Harbach RE, Knight KL: Corrections and additions to Taxonomists' glossary of mosquito anatomy. Mosq Syst. 1982, 13: 201-217.

    Google Scholar 

  33. Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16: 276-277. 10.1016/S0168-9525(00)02024-2.

    Article  CAS  PubMed  Google Scholar 

  34. Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  35. Gonnet GH, Cohen MA, Benner SA: Exhaustive matching of the entire protein sequence database. Science. 1992, 256: 1443-1445. 10.1126/science.1604319.

    Article  CAS  PubMed  Google Scholar 

  36. Swofford DL: PAUP* Phylogenetic Analysis Using Parsimony (*and Other Methods). 2003, Sunderland, Massachusetts: Sinauer Associates, 4

    Google Scholar 

  37. Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.

    Article  CAS  PubMed  Google Scholar 

  38. Shimodaira H, Hasegawa M: Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol Biol Evol. 1999, 16: 1114-1116.

    Article  CAS  Google Scholar 

  39. Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.

    Article  CAS  PubMed  Google Scholar 

  40. Lewis PO: A likelihood approach to estimating phylogeny from discrete morphological character data. Syst Biol. 2001, 50: 913-925. 10.1080/106351501753462876.

    Article  CAS  PubMed  Google Scholar 

  41. Kolaczkowski B, Thornton JW: Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous. Nature. 2004, 431: 980-984. 10.1038/nature02917.

    Article  CAS  PubMed  Google Scholar 

  42. Whelan S, Goldman N: A general empirical model of protein evolution derived from multiple protein families using a maximum likelihood approach. Mol Biol Evol. 2001, 18: 691-699.

    Article  CAS  PubMed  Google Scholar 

  43. Thorne JL, Kishino H: Divergence time and evolutionary rate estimation with multilocus data. Syst Biol. 2002, 51: 689-702. 10.1080/10635150290102456.

    Article  PubMed  Google Scholar 

  44. Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997, 13: 555-556.

    CAS  PubMed  Google Scholar 

  45. Zavortink TJ, Poinar GO: Anopheles (nyssorhynchus) dominicanus sp. n. (Diptera: Culicidae) from Dominican amber. Ann Entomol Soc Am. 2000, 93: 1230-1235. 10.1603/0013-8746(2000)093[1230:ANDSND]2.0.CO;2.

    Article  Google Scholar 

  46. Rutschmann F, Eriksson T, Salim KA, Conti E: Assessing calibration uncertainty in molecular dating: the assignment of fossils to alternative calibration points. Syst Biol. 2007, 56: 591-608. 10.1080/10635150701491156.

    Article  CAS  PubMed  Google Scholar 

  47. Lee MSY: Hidden support from unpromising data sets strongly unites snakes with anguimorph "lizards". J Evol Biol. 2009, 22: 1308-1316. 10.1111/j.1420-9101.2009.01751.x.

    Article  CAS  PubMed  Google Scholar 

  48. Smith JF: Phylogenetic signal common to three data sets: combining data which initially appear heterogeneous. Plant Syst Evol. 2000, 221: 179-198. 10.1007/BF01089293.

    Article  Google Scholar 

  49. Wiegmann BM, Trautwein MD, Kim JW, Cassel BK, Bertone MA: Single-copy nuclear genes resolve the phylogeny of the holometabolous insects. BMC Biol. 2009, 7: 34-10.1186/1741-7007-7-34.

    Article  PubMed Central  PubMed  Google Scholar 

  50. Sallum MAM, Schultz TR, Wilkerson RC: Phylogeny of Anophelinae (Diptera Culicidae) based on morphological characters. Ann Entomol Soc Am. 2000, 93: 745-775. 10.1603/0013-8746(2000)093[0745:POADCB]2.0.CO;2.

    Article  Google Scholar 

  51. Harbach RE, Kitching IJ: Reconsideration of anopheline mosquito phylogeny (Diptera: Calicidae: Anophelinae) based on morphological data. Syst Biodiv. 2005, 3: 345-374. 10.1017/S147720000500174X.

    Article  Google Scholar 

  52. Sallum MAM, Schultz TR, Foster PG, Aronstein K, Wirtz RA, Wiilkerson RC: Phylogeny of Anophelinae (Diptera: Culicidae) based on nuclear ribosomal and mitochondrial DNA sequences. Syst Entomol. 2002, 361-382. 10.1046/j.1365-3113.2002.00182.x.

    Google Scholar 

  53. Krzywinski J, Wilkerson RC, Besansky NJ: Toward understanding Anophelinae (Diptera, Culicidae) phylogeny: insights from nuclear single-copy genes and the weight of evidence. Syst Biol. 2001, 50: 540-556. 10.1080/106351501750435095.

    Article  CAS  PubMed  Google Scholar 

  54. Belkin JN: Mosquitoes of the South Pacific. 1962, Berkeley, CA: University of California Press, 1:

    Google Scholar 

  55. Harbach RE, Peyton EL: Systematics of Onirion, a new genus of Sabethini (Diptera: Culicidae) from the Neotropical Region. Bull Nat Hist Museum Entomol Series. 2000, 69: 115-169.

    Google Scholar 

  56. Harbach RE, Huong Vu Duc, Kitching IJ: Systematics of Kimia, a new genus of Sabethini (Diptera: Culicidae) in the Oriental Region. Proc Entomol Soc Wash. 2007, 109: 102-120.

    Google Scholar 

  57. Judd DD: Review of the systematics and phylogenetic relationships of the Sabethini (Diptera: Culicidae). Syst Entomol. 1996, 21: 129-150. 10.1046/j.1365-3113.1996.d01-1.x.

    Article  Google Scholar 

  58. Krzywinski J, Grushko OG, Besansky NJ: Analysis of the complete mitochondrial DNA from Anopheles funestus: an improved dipteran mitochondrial genome annotation and a temporal dimension of mosquito evolution. Mol Phylogenet Evol. 2006, 39: 417-423. 10.1016/j.ympev.2006.01.006.

    Article  CAS  PubMed  Google Scholar 

  59. Borkent A, Grimaldi DA: The earliest fossil mosquito (Diptera:Culicidae), in mid-Cretaceous Burmese amber. Ann Entomol Soc Am. 2004, 97: 882-888. 10.1603/0013-8746(2004)097[0882:TEFMDC]2.0.CO;2.

    Article  Google Scholar 

  60. Edwards FW: Oligocene mosquitoes in the British Museum; with a summary of our present knowledge concerning fossil Culicidae. Q J Geol Soc London. 1923, 79: 139-155. 10.1144/GSL.JGS.1923.079.01-04.10.

    Article  Google Scholar 

  61. Clements AN: The Biology of Mosquitoes. Development, Nutrition and Reproduction. 1992, London: Chapman & Hall, 1:

    Google Scholar 

  62. Pollock DD, Zwickl DJ, McGuire JA, Hillis DM: Increased taxonomic sampling is advantageous for phylogenetic inference. Syst Biol. 2002, 51: 664-671. 10.1080/10635150290102357.

    Article  PubMed Central  PubMed  Google Scholar 

  63. Hedtek SM, Townsend TM, Hillis DM: Resolution of phylogenetic conflict in large data sets by increased taxon sampling. Syst Biol. 2006, 55: 522-529. 10.1080/10635150600697358.

    Article  Google Scholar 

  64. Rosenberg MS, Kumar S: Incomplete taxon sampling is not a problem for phylogenetic inference. Proc Natl Acad Sci USA. 2001, 98: 0751-10756.

    Google Scholar 

  65. Zwiebel LJ, Saccone G, Zacharopoulou A, Besansky NJ, Favia G, Collins FH, Louis C, Kafatos FC: The white gene of Ceratitis capitata: a phenotypic marker for germline transformation. Science. 1995, 270: 2005-2008. 10.1126/science.270.5244.2005.

    Article  CAS  PubMed  Google Scholar 

  66. Zavortink TJ, Poinar J, G O: Toxorhynchites (Toxorhnchties) mexicanus, n. sp. (Diptera:Culicidae) from Mexican amber: a new world species with old world affinities. Proc Entomol Soc Wash. 2008, 110: 116-125.

    Article  Google Scholar 

  67. Cockerell TDA: The oldest mosquitoes. Nature. 1919, 103: 44-10.1038/103044b0.

    Article  Google Scholar 

Download references


We thank M. Kern for assistance in generating sequence data, and E. C. Holmes for comments and advice on ML analyses. This work would not have been possible without the contributions of specimens by R. Wilkerson (Smithsonian), A. Sallum (University Sao Paolo), B. Harrison (North Carolina DENR Winston-Salem Regional Office), W. Foster (Ohio State University), Lt. Col. R. Cooper (Australian Army Malaria Institute), A. Cornel (University of California, Davis), G. Fritz (Eastern Illinois University), C. Porter (Centers for Disease Control), D. Wesson (Tulane University), P. Howell (MR4, CDC), A. Mori and C. Young (University of Notre Dame). NJB was supported by National Institutes of Health grants AI063508 and AI076584. KRR was supported by an Arthur J. Schmitt Fellowship from the University of Notre Dame.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Nora J Besansky.

Additional information

Authors' contributions

NJB and REH conceived and designed the experiments. KRR designed PCR primers and collected molecular data. REH collected morphological data. KRR, SC, MAB and BMW performed experiments and analyzed data. REH contributed specimens. KRR and NJB wrote the paper with contributions by all authors.

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Reidenbach, K.R., Cook, S., Bertone, M.A. et al. Phylogenetic analysis and temporal diversification of mosquitoes (Diptera: Culicidae) based on nuclear genes and morphology. BMC Evol Biol 9, 298 (2009).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: