Skip to main content

Table 2 The seven datasets, designed to address seven phylogenetic questions

From: The evolutionary history of holometabolous insects inferred from transcriptome-based phylogeny and comprehensive morphological data

Dataset Addressed phylogenetic question Covered subgroups/FcLM clusters (4 clusters per analysis) No. of species No. of OGs Alignment length (aa) Coverage [%] all species Coverage [%] addressed groups
Dataset 1 (complete dataset) Position of Hymenoptera? 1) Hymenoptera 88 1,343 662,107 61.1 100
2) outgroup taxa
3) Mecopterida
4) Neuropteroidea
Dataset 2 Are Neuropteroidea monophyletic? 1) Neuropterida 71 1,303 643,051 65.0 100
2) Mecopterida
3) Coleopterida
4) Hymenoptera
Dataset 3 Are Megaloptera monophyletic? 1) Raphidioptera 4 358 174,065 100 100
2) Corydalidae
3) Sialidae
4) Neuroptera
Dataset 4 Are Neuroptera and Megaloptera sister groups? 1) Raphidioptera 71 540 242,820 72.9 100
2) Megaloptera
3) Neuroptera
4) remaining holometabolans
Dataset 5 Are Coleopterida monophyletic? 1) Neuropterida 71 972 505,528 66.2 100
2) Strepsiptera
3) Coleoptera
4) remaining holometabolans
Dataset 6a a) Are Mecopterida monophyletic? or a) 1) Antliophora 71 1,343 662,107 64.3 100
Dataset 6b b) Are Antliophora + Coleopterida monophyletic? 2) Amphiesmenoptera
3) Neuropteroidea
4) remaining holometabolans
b) 1) Antliophora
2) Amphiesmenoptera
3) Coleopterida
4) remaining holometabolans
Dataset 7 Relationships within Antliophora? 1) Diptera 71 1,101 557,276 66.5 100
2) Siphonaptera
3) Mecoptera
4) remaining holometabolans
  1. For each dataset, we selected four taxonomic groups (clusters), assigned species to one of the groups, and extracted only those ortholog groups (OGs) that contained a sequence of at least one representative of each group. All species that were not assigned to either of the groups were excluded. Coverage [%] all species: Coverage of the dataset in terms of presence of OGs considering all species. Coverage [%] addressed groups: Coverage of the dataset in terms of presence of OGs considering the four groups defined for each dataset, which is, by definition, 100%.