Skip to main content

Reconstructing eight decades of genetic variation in an isolated Danish population of the large blue butterfly Maculinea arion



Fragmentation of terrestrial ecosystems has had detrimental effects on metapopulations of habitat specialists. Maculinea butterflies have been particularly affected because of their specialized lifecycles, requiring both specific food-plants and host-ants. However, the interaction between dispersal, effective population size, and long-term genetic erosion of these endangered butterflies remains unknown. Using non-destructive sampling, we investigated the genetic diversity of the last extant population of M. arion in Denmark, which experienced critically low numbers in the 1980s.


Using nine microsatellite markers, we show that the population is genetically impoverished compared to nearby populations in Sweden, but less so than monitoring programs suggested. Ten additional short repeat microsatellites were used to reconstruct changes in genetic diversity and population structure over the last 77 years from museum specimens. We also tested amplification efficiency in such historical samples as a function of repeat length and sample age. Low population numbers in the 1980s did not affect genetic diversity, but considerable turnover of alleles has characterized this population throughout the time-span of our analysis.


Our results suggest that M. arion is less sensitive to genetic erosion via population bottlenecks than previously thought, and that managing clusters of high quality habitat may be key for long-term conservation.


While gene flow decreases the differentiation among populations, it may increase genetic diversity within them. Population connectivity is therefore important to maintain overall genetic diversity across small local populations that would otherwise "erode" because of drift [1]. The effects of isolation by distance and reduced local population sizes tend to be most visible at the edges of species ranges, as these fringes go through periods of expansion with founder effects and contraction with bottlenecks [2]. Empirical studies on butterflies show that peripheral populations are indeed less diverse than central populations [3], and experience larger population fluctuations due to less favourable conditions [4]. In addition the breeding system will also affect within-population genetic diversity, with asexual species being least diverse and sexual systems being variably affected by deviations from random mating, which may affect effective population size independent of drift [1].

Endangered species often occur in small isolated populations where demographic and environmental stochasticity impose additional risks of local extinction. This has made some researchers question the role that genetic factors play in driving population extinction [5], because genetic factors are likely to be negligible when population decline occurs rapidly. However, when effective population sizes remain moderate, inbreeding over many generations may have marked fitness effects due to increasing disease susceptibility and inbreeding depression [2, 68]. This is because purging tends to remove primarily the few deleterious recessive alleles with large negative effects, and hardly affects the more numerous slightly deleterious alleles [6]. Theory indicates [9] and comparative studies across 170 species have shown [10] that a significant proportion of endangered populations/species have reduced levels of genetic variation compared to related non-endangered species, suggesting that genetic factors often play a role in population extinctions [11].

Researchers have traditionally been forced to evaluate present day diversity of endangered populations against other contemporary populations of the same or closely related species. Such studies are valuable, but since populations rarely have identical demographic and environmental histories, precise identification of the factors that caused extant genetic differences remains impossible. Recent technical advances in the extraction and amplification of old DNA have made the large resources of natural history collections (NHC) available for population genetic studies, providing direct and highly relevant reference points for studies of genetic diversity in endangered populations. Particularly taxa with long histories of collection by entomologists, such as beetles, butterflies and hoverflies, have thus become very useful for long term population studies of genetic change over time.

The number of studies utilizing NHC material for evolutionary genetic studies is increasing, and many focus on past and present genetic diversity in endangered populations [12]. Despite the promises these methods hold for conservation genetics, there are also limitations to the use of historical DNA, and special precautions are required in the experimental and the analytical phase of such work. The highly degraded nature of DNA extracted from historical samples, which increases with age, temperature and water content [13, 14], generally restricts PCR amplification to short fragments (< 200 bp) thus limiting the choice of genetic markers. Nuclear microsatellite markers have proven useful in this context, as they have short and highly polymorphic amplicons [12]. However, historical DNA is not only of low quality but also occurs in very low quantity, increasing the risk of genotype errors caused by cross contamination, allelic dropout or false alleles. The importance of following standard protocols when working with historical samples can therefore not be stressed enough, and the assessment and reporting of genotype error rates is indispensable in order to validate such datasets [1517].

The large blue butterfly, Maculinea arion, is one of many butterfly species that have declined in Europe during the last century, both in terms of population numbers and population connectivity [18]. As a result, many extant populations are considered endangered and only exist because they are actively managed [19]. The physical attractiveness and fascinating biology of M. arion has made the species popular among amateur collectors, so that many European natural history museums hold large collections often with good numbers of specimens collected in particular years that together form attractive time series for single localities. When these series coincide with periods of population decline they provide an outstanding opportunity to analyse how isolation and demographic fluctuations may have affected genetic variation in the past. Such time series are common for M. arion and we exploit such collection material in this study. In particular, we investigated whether/how a recent, severe reduction in population census size and a long history of isolation by distance has affected genetic diversity in the last extant Danish population of M. arion, on the island of Møn (Figure 1). As contemporary reference points we used a cluster of six M. arion populations in south and central Sweden approximately 100-600 km away [20] and as historical reference points we used NHC specimens from the Møn population covering the time period 1930-1975.

Figure 1
figure 1

Maculinea arion in Scandinavia. a) Count data of M. arion imagos on Møn, Denmark. Maximum (solid line) and minimum (dashed line) counts from the best day during the flight season. Imago counts were converted into approximate population census size (# imagos × 3.5) according to Thomas et al. [39]. b) Distribution of M. arion in Denmark and southern Sweden before (open symbols) and after (closed symbols) 1990, 10 km2 UTM grid. Records have been compiled since 1900 by the Atlas Project of Danish Butterflies and the Swedish ArtDatabankens fynddatabas. Danish populations marked by an asterisk went extinct in the late 1990s.


Contemporary reference populations

Two of the nine microsatellite loci that were analysed in the contemporary populations departed from Hardy-Weinberg equilibrium after sequential Bonferroni corrections (Møn; Macu17 and Macu20), but without showing evidence of null alleles. Null alleles were found in Macu8, but only in the Møn population and at low frequencies (0.146). Linkage disequilibrium between pairs of microsatellite loci was found between Macu26 and Macu44, and locus Macu26 was subsequently excluded from further analysis.

The two measures of genetic diversity, allelic richness and expected heterozygosity, differed among the contemporary populations (Figure 2a), but only significantly so for allelic richness (Oneway ANOVA, F6,55 = 3.98, P = 0.004). The Møn population had a medium level of genetic diversity in comparison with the Swedish populations, i.e. lower than the functional metapopulations in Skåne, Öland and Gotland, but higher than three isolated single site populations [20]. The allelic composition was, however, clearly different. While the isolated Swedish populations had few private alleles, the population on Møn had the highest proportion of diagnostic alleles (Figure 2a), even exceeding the numbers found in Skåne, Öland and Gotland. Consequently pairwise F ST values among the six Swedish populations were lower than their equivalents with the Møn population. The population on Møn was genetically most similar to the geographically closest population in Skåne reflecting a general isolation by distance pattern among population (Mantel's test, r = 0.371, P = 0.029; Figure 2b).

Figure 2
figure 2

Genetic diversity and differentiation among Scandinavian M. arion populations. a) Two measures of genetic diversity, expected heterozygosity (open circles) and allelic richness (closed circles), were estimated from genotype data of eight microsatellite loci (mean ± SE). Allelic richness differed significantly among populations (One-way ANOVA, F6,55 = 3.98, P < 0.01). Levels not connected with the same letter are significantly different according to post-hoc Tykey-Kramer HSD. Private allele numbers are given as diamonds. b) Pairwise genetic distances (F ST ) among the seven contemporary study populations. Figures in bold are significant after standard Bonferroni correction (P < 0.05).

Historical reference populations

Only one of 70 DNA extractions of historical samples failed (collection year 1975). Fitting a general linear model to the remaining 69 historical samples revealed that PCR amplification success depended on sample age, the maximum allele length at the specific locus and also the interaction between the two (GLM; sample age: χ2 = 54.47, df = 1, P < 0.0001; maximum allele length: χ2 = 427.78, df = 1, P < 0.0001; interaction: χ2 = 51.65, df = 1, P < 0.0001; Figure 3a). The negative effect of long alleles on amplification success thus increased more than proportionally with sample age. Loci with allele sizes > 166 bp did not amplify consistently in the historical samples (on average 29% amplified, range: 0-100%), and were not used in the further analysis (Figure 3b). Among the microsatellite loci used, genotype error rates per locus were 0.047 on average in historical samples (range: 0.000-0.103) and 0.004 in modern samples (range: 0.000-0.014), an order of magnitude difference (see Additional file 1 Table S1).

Figure 3
figure 3

DNA amplification success in historical samples. a) The longest allele (in base-pairs) amplifying at each locus plotted against the sample age. The size of the symbol indicates the number of samples for which genotyping was attempted at specific loci, and the pie charts give the proportion of samples that successfully amplified (white). Isoclines are given for the amplification success predicted by the GLM model, including sample age and maximum allele length as factors. The amplification success depended on the maximum allele length at the particular locus (χ2 = 427.78, df = 1, P < 0.0001), the age of the sample (χ2 = 54.47, df = 1, P < 0.0001) and the interaction between the two (χ2 = 51.65, df = 1, P < 0.0001). b) The 20 microsatellite markers tested in the study and their observed allele ranges. Only loci with amplification success > 80% were used for the temporal study, corresponding to maximum allele sizes below 166 base pairs (below the dashed line).

Two loci/population combinations departed from Hardy-Weinberg equilibrium after sequential Bonferroni corrections (2005; Macu20 and Macari18). In Macari18 this was due to homozygote excess, and evidence for null alleles was found in four of the sampling years (1940, 1949, 2005 and 2007: frequencies ranging from 0.19-0.25). The loci Macu15 and Macari16 also showed signs of null alleles, but only in sample year 1930 (0.25 and 0.28 frequencies respectively). Due to small sample sizes the presence of null alleles could not be tested for the samples from 1944 and 1959. No linkage disequilibrium was found between any pair of microsatellite loci. Two of the 12 microsatellite loci were monomorphic in all sampling years (Macu30 and Macu31), whereas the remaining ten loci were polymorphic in all populations, except Macari22 in 1944 and Macari16 in 1959.

The two measures of genetic diversity, allelic richness (F 8,80 = 0.326, P = 0.954) and expected heterozygosity (F 8,80 = 0.365, P = 0.936) did not differ significantly between years, nor between historical and contemporary samples (allelic richness: F 1,9 = 6.87, P = 0.133; expected heterozygosity: F 1,9 = 1.82, P = 0.402). Furthermore, there was no difference in the genetic diversity measures when comparing all historical samples vs. the two contemporary samples. Twelve of the 43 alleles were unique to the historical samples, four of which used to be present at relatively high frequencies (≥0.1; see Additional file 2 Figure S1). In contrast only one allele was unique to the contemporary samples, and was found at lower frequencies (2005: 0.022 and 2007: 0.059; see Additional file 2 Figure S1).

Based on the empirical and simulated M-ratios four of the sampling years showed signs of a recent bottleneck, independently of the parameter setting (1930, 1940, 1972 and 2007). The remaining five sampling years showed signs of bottlenecks in the vast majority of parameter combinations, except when the parameters p s (the fraction of mutations larger than a single step) and Δ g (the mean size of larger mutations) were set at their maximum (see Additional file 3 Table S2).

The overall F ST across loci was estimated to be 0.105 (range: 0.024-0.129). The pairwise genetic distances (F ST ) between sampling years were significantly correlated with the temporal difference between samples (Mantel's test, r = 0.647, P < 0.001; Figure 4), i.e. samples that were fewer years apart were more similar genetically.

Figure 4
figure 4

Temporal genetic change. The pairwise genetic distance (F ST /(1- F ST )), corrected for the presence of null alleles) is positively correlated with the pairwise temporal difference (in years) between sample years at the Møn population (Mantel's test, r = 0.647, P < 0.01). Comparisons among contemporary (open symbols), historical (closed symbols) and contemporary-historical (half closed) samples are indicated.


Contemporary vs. historical levels of genetic diversity

We found medium levels of genetic diversity in the Maculinea arion population on Møn compared to six contemporary Swedish populations. Significant differences in genetic diversity among populations could only be detected in allelic richness (Figure 2a), which is known to be more strongly affected by population size reductions than is heterozygosity [6]. The very distinct allelic composition of the Møn population shows that ongoing gene exchange with populations in Skåne is restricted, and has been for a long time. This is in line with recent findings showing that gene flow in M. arion may occur over long distances (~ 100 km) but only if suitable "stepping-stone" sites are found within ca 10 km of one another [20]. The fact that the population on Møn is more closely related to the Skåne populations is likely to reflect a shared history, as the open dots on the map in Figure 1b indicate.

While the southern Swedish populations (Gotland, Öland and Skåne) are thought to have been relatively stable over time, with several local populations within dispersal distance [21], the population on Møn has fluctuated markedly in census size in recent decades, and has been through two documented periods of consistently low numbers (Figure 1a). This might imply that the lower levels of contemporary genetic diversity on Møn compared to these southeastern Swedish populations are related to regular moderate bottlenecking. Similarly large fluctuations of M. arion have been reported in four UK populations. The magnitude of these oscillation around the carrying capacity has been ascribed to scramble competition between caterpillars after their adoption into the host-ant nests [22, see the Mehod section for a dscription of Maculinea arion biology] but the duration of these 'natural' population oscillations are normally shorter than the ones observed on Møn [22], where population census size was reduced to 50-85 individuals over a period of at least six years (Figure 1a).

Despite this, we found no evidence for higher genetic variation in the Møn population prior to the crash around 1991. Levels of allelic richness and heterozygosity did not differ significantly in the studied period covering 77 years (Table 1). Several scenarios may explain this: i) the population reduction was not severe enough to impact the genetic diversity measures. This scenario would be comparable to a study of two species of bumble bees [23] showing that only population reductions by at least 80% resulted in detectable loss of heterozygosity, ii) the number of historical samples was too low to estimate true levels of genetic diversity, or iii) the population was already genetically impoverished before the documented low population size in 1991. We cannot completely exclude that sample sizes in this study may compromise our ability to accurately assess historical levels of genetic diversity, but we believe that the third scenario is the most likely. The exact historical events causing the low level of contemporary genetic diversity on Møn cannot be determined, but the long-term isolation (even prior to 1990, Figure 1b) of the population has undoubtedly had an effect. The populations in northern Sweden show even lower levels of genetic diversity, but analysis of historical data similar to the present study would be needed to reconstruct the causes of these patterns of extant genetic diversity.

Table 1 Summary statistics by sampling year for ten microsatellite loci

Based on generally accepted measures, only two sampling years (1940 and 1972) showed evidence of bottlenecks, i.e. M-ratio < 0.7 [Table 1; but see also 24]. However, when comparing to parameter sets realistic for M. arion, all populations showed signs of having passed through a bottleneck except for combinations using extreme parameter values (see Additional file 3 Table S2). This suggests that the Møn population has not been in mutation-drift equilibrium for many decades, and highlights the importance of evaluating M-ratio values over a wide range of parameter values when the exact mutation model of the used microsatellite loci is unknown and the population specific N e uncertain [25].

Although we did not find temporal changes in genetic diversity, 28% of the sampled alleles were unique to historical samples, with half of these present at frequencies above 10% (see Additional file 2 Figure S1). The high number of so-called ghost alleles, i.e. alleles that are lost in modern samples, is remarkable, given the exhaustive sampling in the contemporary population. Ghost alleles are normally reported as evidence of a more diverse past [2628], which would suggest that the low population census sizes in the early 1990s have had some genetic impact on the Møn population. However, the high temporal turnover in allele frequencies (see Additional file 2 Figure S1) suggests that we cannot rule out that ghost alleles in this study rather reflect the general life history of Maculinea arion. The low mobility of patrolling M. arion males and the extremely heavy juvenile mortality rates mean that N e /N c in M. arion is likely to be very low because moderate population bottlenecks occur in each generation. A continuous turnover of alleles possibly associated with occasional female immigrants from far away [29, 30] would explain the relationship between pairwise genetic distance and temporal difference between samples (Figure 4).

Caveats for using historical DNA samples

Studies in the field of conservation genetics increasingly employ NHC samples to assess past genetic diversity levels [12]. Studies of insects have until very recently been underrepresented in these efforts, despite their potential because of large available time series from distinct populations often dating back to the late nineteenth century. While some studies using NHC samples of > 100 years old report successful amplification of nuclear microsatellite loci with alleles exceeding 250 bp [28, 31], we found that alleles longer than 160 bp did not amplify consistently in historical samples. Moreover, we found that amplification failure of long alleles increased more than linearly with sample age (Figure 3a), which is in accordance with DNA degradation and fragmentation being random processes, leading to an exponential decline in DNA template with increasing amplicon target size [14, 32].

Accordingly, only loci with short allele sizes were used in our present study (Figure 3b), which had the advantage that our genotype error rates (0-10%, see Additional file 1 Table S1) remained at the very low end of the much larger range (0.3-74.6%) reported from other studies using samples of low DNA quantity and quality [33]. This illustrates the importance of assessing the suitability of the genetic markers to be used for such studies, and of reporting genotype error rates that allow independent validation [as advocated by [12], [15], [17]].


Our study shows that the last Maculinea arion population in Denmark is only somewhat genetically impoverished compared to the larger, closest populations in southern Sweden [20]. Contrary to previous opinion, the genetic data of our present study indicate that this pattern is not due to the drastic reduction in population size in the early 1990s, but more likely a consequence of a history of long-term isolation from nearby populations. This emphasises that clusters of interconnected populations are crucial to maintain genetic variation within M. arion populations, as the species' extraordinary lifecycle makes local effective population sizes low. For conservation this implies that efforts should not be restricted to the active management of sites currently occupied by M. arion, but also include restoration of additional suitable sites within the ca. 10 km dispersal range, as is current practice in England and Denmark [34].

While many studies of conservation genetics have a rather pessimistic concluding paragraph, the results of our study indicate that extant M. arion populations in northwest Europe may be somewhat more robust than the dismal rates of extinction during recent decades have suggested. The enormous stochasticity of larval mortality likely imposes such consistent effects of genetic drift that local site-specific adaptations have little opportunity to evolve despite relatively low dispersal rates. This may imply that while neutral alleles are turned over at a fairly high rate, there may not be much room for maintaining genetic variation for life history traits that deviate from the species average. The key issue for M. arion conservation would thus be active site management to secure optimal conditions for both the specific food-plants and host-ants that large blue butterflies require. Once that has been achieved, the lack of variation for core life history traits may in fact facilitate natural recoveries from low numbers if eventual changes in the local habitat conditions remain within the standard M. arion niche. The same characteristics would facilitate re-introduction programs as a source population is unlikely to be differently adapted than the extinct population that it is meant to replace, if they experience similar climatic conditions. Both inferences appear to be consistent with field observations in native and introduced populations [34].


Biology of Maculinea arion

Maculinea arion is a habitat specialist, like the rest of the genus Maculinea (the large blues) and many other members of the tribe Polyommatini (the blues). M. arion caterpillars exploit two resources during their development; a specific food-plant on which they feed during the first three weeks after hatching, and subsequently a specific host-ant in the nests of which they live as obligate predators of ant brood, and where they overwinter and pupate in the late spring after 11-23 months [3537]. This extreme specialization leads to exceedingly high juvenile mortality rates, with 20-40% of a typical population dying in the egg or early larval stages, and mortality among caterpillars inside the host-ant nest representing 80-90% of the total breeding population mortality [22]. Whereas M. arion has relatively little impact on the fitness of the food-plant, its main host-ant, Myrmica sabuleti, experiences dramatic reductions in colony fitness upon infection [22]. The intimate butterfly-ant relationship leads to large oscillations in census population sizes of M. arion, suggesting that genetic diversity may primarily be maintained by gene flow between local low-density populations, rather than substantial effective population sizes at each local site.

The Møn site and demographic surveys

The number and connectedness of M. arion populations in Europe has decreased over the last century. In Denmark M. arion was previously known from approximately 40 localities, but only half of these persisted until the second half of the 20th century [Figure 1b; [30]]. At present only a single population remains, at Høvblege on the island of Møn. M. arion has a long and probably continuous history of occurrence at this site, with specimens in natural history collections dating back to 1926 (among the earliest in the collections). At that time three to four local populations were found on the island within the typical dispersal distance of the species [max dispersal distance is estimated to be at least 10 km based on genetic markers; [20]].

In 1991 a critically low number of imagos was observed flying at the single remaining Høvblege site, which at this point was ca. 8 ha (breeding area). The site, a grassland habitat, had been left almost untouched in the period 1915-1991, allowing for trees, scrub and larger grasses to invade the area with negative consequences for the food-plant distribution. Since 1991 the population has been managed and monitored every year, and now has a breeding area of ca. 10 ha. The number of imagos was determined using transect walks, a standard method for assessing year to year changes in butterfly abundance [38]. Population census sizes were estimated according to Thomas [39], assuming that approximately 1/3 of the total population is flying on the best day of the flight season and that 85% of these can be observed during a thorough survey, i.e. N c ≈ 3.5 × the number of imagos counted on the best day. Recent evaluations of butterfly monitoring methods conclude that caution is needed when estimating population census sizes from transect counts [4043]. The reason being that transect counts are influenced by adult longevity, which is affected by weather patterns and thus vary between years. However, methods such as mark-release-recapture are unfavourable in endangered populations as they may negatively impact the butterflies. Transect counts were therefore consistently used to estimate population sizes in all years, despite yielding cruder estimates.

In 1991-1996 the population size was consistently around 50-85 individuals, but increased to much larger numbers in 1997-2005 (175-440 individuals), to subsequently drop to 70-105 individuals in 2006-2008 (Figure 1a). The lack of empirical data on population size prior to 1991 makes it impossible to estimate the duration and precise magnitude of the population bottleneck around 1991, but according to anecdotal observations the Møn population was very numerous in 1973.

For comparison we used six M. arion populations in south and central Sweden (Figure 1b, see Ugelvig et al. [20] for details on these sampling localities). Population demography surveys are not available for these populations, but amateur lepidopterists maintain that the populations in Skåne, Gotland and Öland have never been close to extinction. Furthermore, although the number of local populations have also declined in these areas [44], a recent study suggests that well functioning meta-populations still exists at these three sites [20]. Conversely, the populations in Uppland, Västergötland and Södermanland are more isolated [21] and unlikely to still be part of a functioning metapopulation [20].


The contemporary samples were collected in the summers of 2005 (n = 46) and 2007 (n = 17) using a non-invasive sampling technique, collecting 2 × 2 mm2 wingtip fragments from adult M. arion butterflies. This sampling technique does not affect survival or flight ability of the butterflies [45; D.R. Nash unpublished data].

The historical samples were kindly provided by the Danish Natural History Museums in Copenhagen and Aarhus, and included specimens collected at Høvblege and the two nearby, now extinct, sites Jydelejet and Møns Klint. Unless there is a need to distinguish them, we will collectively refer to all these sites as Møn. One middle leg per museum specimen was sampled from years in which a reasonable number of specimens were collected, which gave a time series with 4-12 years between samples, covering 77 years (Table 1). After 1975, few M. arion specimens exist in the collections, reflecting the rarity of the butterfly, which was finally declared protected in Denmark in 1992. Forceps used for the collection of legs were cleaned in bleach (1% sodium hypochlorite) between each sampling to prevent cross contamination.

DNA extraction

DNA from the contemporary samples was extracted by homogenizing the wing fragment in a solution of 100 μl 5% chelex-TRIS (10 mM) and 5 μl proteinase K (0.75 units). The samples were then incubated at 56°C for 90 min, boiled at 99°C for 15 min, and centrifuged at 13000 rpm for 3 min. The supernatant was stored at -20°C.

DNA from the historical samples was extracted using a buffer slightly modified from Gilbert et al. [46], which consisted of 10 mM Tris (pH 8), 10 mM NaCl, 2.5 mM EDTA, 5 mM CaCl2, 2% sodium dodecyl sulphate (SDS), 40 mM dithiothreitol (DTT) and 10% proteinase K (final concentrations). The samples were incubated for 24 h at 56°C with gentle agitation. The extracted DNA was purified using a Qiagen PCR purification kit (QIAquick), re-suspended in 30 μl elution buffer and stored at -20°C. Extraction and PCR-setup was performed in dedicated ancient DNA clean-laboratories at the Centre for GeoGenetics at the Natural History Museum in Copenhagen, where only pre-PCR work occurs. According to standard protocols for work with low quality/quantity DNA [17], contamination was monitored at both the extraction and PCR steps by blank controls and all post-PCR procedures were conducted in physically distant laboratories.

Microsatellite amplification and genotype error rates

Two sets of nuclear microsatellite markers were employed corresponding to the two research questions (see Additional file 1 Table S1 for details on all microsatellite loci used in the study). In the comparison between contemporary samples from Møn and Sweden nine microsatellite loci were used; Macu8, Macu9, Macu11, Macu15, Macu17, Macu20, Macu26, Macu44 and Macu45 [with genotype data already existing for the Swedish populations, see [20]]. Of these loci, four were suitable for the temporal study, as only loci with allele sizes ≤160 bp allowed amplification in the historical samples (Macu15, Macu20, Macu26, Macu45). Primers for an additional ten loci were developed by ECOGENICS GmbH (Zürich, Switzerland), specifically targeting loci with short allele sizes (< 200 bp); Macu30, Macu31, Macari02, Macari05, Macari08, Macari16, Macari18, Macari19, Macari22, Macari23. Amplification from 1 μl of DNA extracts was carried out in 12 μl mastermix volumes using AmpliTaq Gold (contemporary samples; Applied Biosystems) or Platinium Taq High Fidelity (historical samples; Invitrogen). The following cycling conditions were used: initial denaturation 5 min at 95°C; 35-40 cycles of 30 s at 95°C, 30 s at the locus specific annealing temperature of 56/57°C, 30 s at 72°C; final elongation of 30 min at 72°C. PCR products were run on an ABI 3031 × l automated sequencer with the GeneScan-500 LIZ size standard and analysed using GENEMAPPER 4.0 (Applied Biosystems).

We applied a multiple tube approach when genotyping the historical samples, as they were expected to be prone to genotype errors such as allelic dropout and false allele amplification [16]. The amount of DNA was limited to that extracted from a single butterfly leg, thus it was not possible to replicate as extensively as originally proposed by Taberlet et al. (1996; 7-10 replicates per genotype). Instead, two independent amplifications were performed for each sample at each locus. If the same genotype was obtained, this was recorded as the consensus genotype. Conversely, if two different genotypes were found (e.g. one homozygote and one heterozygote) a third PCR was conducted. Genotypes were only scored when every allele was observed at least twice, and in cases where a consensus genotype was not found after three PCRs, it was recorded as missing. Genotype error rates were calculated as recommended by Pompanon et al. [15], i.e. the error rate per locus. In the contemporary samples, error rates were estimated by re-genotyping a subset (22%) of the samples.

Statistical analysis

PCR amplification success was analysed by fitting a general linear model (GLM) with binomial errors and logit link, correcting for over-dispersion, and using the number of samples successfully amplifying at each microsatellite locus as the response variable and the total number of samples as the binominal denominator. The maximum allele length amplified at each locus (in base pairs), the age of the samples (in years), and their interaction were used as explanatory variables. The analysis was carried out in JMP 7.02 (SAS Institute Inc.).

Linkage disequilibrium among pairs of microsatellite loci was tested using FSTAT 2.9.3[47]. The program GENALEX 6.3[48] was used to calculate expected and observed heterozygosities for each microsatellite locus, and for testing genotype frequencies against Hardy-Weinberg (HW) equilibrium expectations. When excess homozygosity was found, the program MICRO-CHECKER 2.2.3 [49] was used to check for evidence of null alleles, and their frequencies at different loci were estimated with the program FREENA[50]. High null allele frequencies were found in some of the historical samples, which may affect F-statistics [50, 51]. Pairwise F ST values among the historical samples were re-calculated after applying the ENA correction for null alleles as implemented in FREENA and then correlated with the temporal difference between samples using Mantel tests (Mantel 1967) in FSTAT, using 2000 permutations. The contemporary populations did not show signs of null alleles, and pairwise F ST values among the seven Scandinavian populations were calculated in FSTAT, using 1000 permutations. A second measure of genetic diversity, allelic richness, was computed in FSTAT for both contemporary and historical samples and differences in allelic richness and expected heterozygosity among samples were tested using repeated-measures ANOVA in JMP.

Evidence of recent genetic bottlenecks in the temporal samples was tested using the software developed by Garza and Williamson [24]. The program assumes that a reduction in population size will have a stronger affect on the number of alleles (k) than the range of allele sizes (r), leading to a smaller M-ratio (= k/r) in size-reduced populations compared to equilibrium populations. In order to evaluate the empirical M-ratio, an equilibrium population was simulated based on parameters describing the evolution of the analysed microsatellite loci (Δ g : the mean size of larger mutations, p s : fraction of mutations larger than a single step, and μ: the mutation rate/locus/generation) and the effective population size of pre-bottlenecked populations (N e ). These parameters are difficult to estimate in empirical samples and each sample estimate of M-ratio was thus tested under different evolutionary scenarios as suggested by Guinand and Scribner [25]. The scenarios include: i) a two-phase mutation model with proportions of non one-step mutations in the range 0.00 (SMM model), 0.05, 0.10 and 0.20, ii) Δ g varying between 2 and 4 (Garza and Williamson [24] suggest 3.5 as default setting), and iii) a constant μ of 10-4 /locus/generation, but with N e ranging from 50, 100, 250 and 500 corresponding to θ (= 4 × N e × μ) equal to 0.02, 0.04, 0.1 and 0.2. For each sample, an equilibrium population was simulated 10000 times using these parameter settings. The empirical M-ratio averaged across loci was compared to the distribution of simulated M-ratios, in order to evaluate the likelihood of a bottleneck event having taken place (95% criterion).


  1. Lowe A, Harris S, Ashton P: Ecological genetics: design, analysis, and application. 2004, Malden, MA: Blackwell Publishing

    Google Scholar 

  2. Willi Y, Van Buskirk J, Hoffmann AA: Limits to the adaptive potential of small populations. Annu Rev Ecol Evol Syst. 2006, 37: 433-458. 10.1146/annurev.ecolsys.37.091305.110145.

    Article  Google Scholar 

  3. Besold J, Schmitt T, Tammaru T, Cassel-Lundhagen A: Strong genetic impoverishment from the centre of distribution in southern Europe to peripheral Baltic and isolated Scandinavian populations of the pearly heath butterfly. J Biogeogr. 2008, 35: 2090-2101. 10.1111/j.1365-2699.2008.01939.x.

    Article  Google Scholar 

  4. Thomas JA, Moss D, Pollard E: Increased Fluctuations of Butterfly Populations towards the Northern Edges of Species' Ranges. Ecography. 1994, 17 (3): 215-220. 10.1111/j.1600-0587.1994.tb00096.x.

    Article  Google Scholar 

  5. Lande R: Genetics and demography in biological conservation. Science. 1988, 241 (4872): 1455-1460. 10.1126/science.3420403.

    Article  CAS  PubMed  Google Scholar 

  6. Frankham R: Conservation Genetics. Annu Rev Genet. 1995, 29 (1): 305-327. 10.1146/

    Article  CAS  PubMed  Google Scholar 

  7. Saccheri I, Kuussaari M, Kankare M, Vikman P, Fortelius W, Hanski I: Inbreeding and extinction in a butterfly metapopulation. Nature. 1998, 392: 491-494. 10.1038/33136.

    Article  CAS  Google Scholar 

  8. Schmitt T, Hewitt GM: The genetic pattern of population threat and loss: a case study of butterflies. Mol Ecol. 2004, 13: 21-31. 10.1046/j.1365-294X.2004.02020.x.

    Article  CAS  PubMed  Google Scholar 

  9. Frankham R: Effective population size/adult population size ratios in wildlife: a review. Genet Res. 1995, 66 (02): 95-107. 10.1017/S0016672300034455.

    Article  Google Scholar 

  10. Spielman D, Brook BW, Frankham R: Most species are not driven to extinction before genetic factors impact them. PNAS. 2004, 101 (42): 15261-15264. 10.1073/pnas.0403809101.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Frankham R: Genetics and extinction. Biol Conserv. 2005, 126 (2): 131-140. 10.1016/j.biocon.2005.05.002.

    Article  Google Scholar 

  12. Wandeler P, Hoeck PEA, Keller LF: Back to the future: museum specimens in population genetics. TREE. 2007, 22: 634-642.

    PubMed  Google Scholar 

  13. Lindahl T: Instability and decay of the primary structure of DNA. Nature. 1993, 362: (6422):709-715.

    Article  PubMed  Google Scholar 

  14. Watts P, Thompson D, Allen K, Kemp S: How useful is DNA extracted from the legs of archived insects for microsatellite-based population genetic analyses?. J Insect Conserv. 2007, 11 (2): 195-198. 10.1007/s10841-006-9024-y.

    Article  Google Scholar 

  15. Pompanon F, Bonin A, Bellemain E, Taberlet P: Genotyping errors: causes, consequences and solutions. Nature Rev Genet. 2005, 6: (11):847-846.

    Article  PubMed  Google Scholar 

  16. Taberlet P, Griffin S, Goossens B, Questiau S, Manceau V, Escaravage N, Waits LP, Bouvet J: Reliable genotyping of samples with very low DNA quantities using PCR. Nucl Acids Res. 1996, 24: (16):3189-3194.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Gilbert MTP, Bandelt H-J, Hofreiter M, Barnes I: Assessing ancient DNA studies. Trends in Ecology & Evolution. 2005, 20: (10):541-544.

    Article  Google Scholar 

  18. Wynhoff I: The recent distribution of the European Maculinea species. J Insect Conserv. 1998, 2: 15-27. 10.1023/A:1009636605309.

    Article  Google Scholar 

  19. Van Swaay CAM, Van Strien AJ, Harpke A, Fontaine B, Stefanescu C, Roy D, Maes D, Kühn E, Õunap E, Regan E, et al: The European Butterfly Indicator for Grassland species: 1990-2009. Report VS2010.010. 2010, Wageningen, De Vlinderstichting

    Google Scholar 

  20. Ugelvig LV: Ecological genetics and evolution of the Large Blue butterfly - consequences of an extraordinary lifecycle. PhD thesis. 2010, University of Copenhagen, Department of Biology

    Google Scholar 

  21. Eliason CU, Ryrholm N, Holmer M, Jilg K, Gärdefors U: Fjärilar: Dagfjärilar (Hesperiidae - Nymphalidae). 2005, Artsdatabanken, Sveriges lantbruksuniversitet (SLU), Uppsala

    Google Scholar 

  22. Thomas JA, Clarke RT, Elmes GW, Hochberg ME: Population dynamics in the genus Maculinea (Lepidoptera: Lycaenidae). Insect populations: in theory and in practice. Edited by: Dempster JP, McLean IFG. 1998, Dordrecht: Kluwer Academic Publishers, 261-290.

    Chapter  Google Scholar 

  23. Lozier JD, Cameron SA: Comparative genetic analyses of historical and contemporary collections highlight contrasting demographic histories for the bumble bees Bombus pensylvanicus and B. impatiens in Illinois. Mol Ecol. 2009, 18 (9): 1875-1886. 10.1111/j.1365-294X.2009.04160.x.

    Article  PubMed  Google Scholar 

  24. Garza JC, Williamson EG: Detection of reduction in population size using data from microsatellite loci. Mol Ecol. 2001, 10: 305-318. 10.1046/j.1365-294x.2001.01190.x.

    Article  CAS  PubMed  Google Scholar 

  25. Guinand B, Scribner KT: Evaluation of methodology for detection of genetic bottlenecks: inferences from temporally replicated lake trout populations. C R Biologies. 2003, 326 (Supplement 1): 61-67.

    Article  Google Scholar 

  26. Bouzat JL, Lewin HA, Paige KN: The Ghost of Genetic Diversity Past: Historical DNA Analysis of the Greater Prairie Chicken. Am Nat. 1998, 152 (1): 1-6. 10.1086/286145.

    Article  CAS  PubMed  Google Scholar 

  27. Groombridge JJ, Jones CG, Bruford MW, Nichols RA: Conservation biology: 'Ghost' alleles of the Mauritius kestrel. Nature. 2000, 403 (6770): 616-616. 10.1038/35001148.

    Article  CAS  PubMed  Google Scholar 

  28. Harper GL, Maclean N, Goulson D: Analysis of museum specimens suggests extreme genetic drift in the adonis blue butterfly (Polyommatus bellargus). Biol J Linn Soc. 2006, 88: 447-452. 10.1111/j.1095-8312.2006.00632.x.

    Article  Google Scholar 

  29. Pauler-Fürste R, Kaule G, Settele J: Aspects of the population vulnerability of the large blue butterfly, Glaucopsyche (Maculinea) arion, in south-west Germany. Species survival in fragmented landscapes. Edited by: Settele J, Margules C, Poschlod P, Henle K. 1996, Dordrecht: Kluwer Academic Publishers, 275-281.

    Chapter  Google Scholar 

  30. Nielsen PS, Bittcher J: Overvågning af sortplettet blåfugl Maculinea arion L. Lepidoptera. 2002, 8: 117-130.

    Google Scholar 

  31. Pertoldi C, Hansen MMl, Loeschcke V, Madsen AB, Jacobsen L, Baagoe H: Genetic Consequences of Population Decline in the European Otter (Lutra lutra): An Assessment of Microsatellite DNA Variation in Danish Otters from 1883 to 1993. Proc R Soc B. 2001, 268 (1478): 1775-1781. 10.1098/rspb.2001.1762.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Deagle B, Eveson JP, Jarman S: Quantification of damage in DNA recovered from highly degraded samples - a case study on DNA in faeces. Front Zool. 2006, 3 (1): 11-10.1186/1742-9994-3-11.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Valière N, Bonenfant C, Toïgo C, Luikart G, Gaillard J-M, Klein F: Importance of a pilot study for non-invasive genetic sampling: genotyping errors and population size estimation in red deer. Conserv Genet. 2007, 8 (1): 69-78.

    Article  Google Scholar 

  34. Thomas JA, Simcox DJ, Clarke RT: Successful Conservation of a Threatened Maculinea Butterfly. Science. 2009, 325 (5936): 80-83. 10.1126/science.1175726.

    Article  CAS  PubMed  Google Scholar 

  35. Thomas J, Wardlaw J: The capacity of a Myrmica ant nest to support a predacious species of Maculinea butterfly. Oecologia. 1992, 91: 101-109.

    Article  Google Scholar 

  36. Thomas JA: The ecology and conservation of Maculinea arion and other European species of Large Blue butterfly. Ecology and Conservation of Butterflies. Edited by: Pullin AS. 1995, London: Chapman & Hall, 180-197.

    Chapter  Google Scholar 

  37. Schönrogge K, Wardlaw JC, Thomas JA, Elmes GW: Polymorphic growth rates in myrmecophilous insects. Proceedings of the Royal Society of London Series B-Biological Sciences. 2000, 267 (1445): 771-777. 10.1098/rspb.2000.1070.

    Article  Google Scholar 

  38. Pollard E: A method for assessing changes in the abundance of butterflies. Biol Conserv. 1977, 12 (2): 115-134. 10.1016/0006-3207(77)90065-9.

    Article  Google Scholar 

  39. Thomas JA: The conservation of butterflies in temperate countries: past efforts and lessons for the future. Symposia of the Royal Entomological Society of London. 1984, 333-353.

    Google Scholar 

  40. Nowicki P, Settele J, Henry PY, Woyciechowski M: Butterfly monitoring methods: The ideal and the real world. Isr J Ecol Evol. 2008, 54 (1): 69-88. 10.1560/IJEE.54.1.69.

    Article  Google Scholar 

  41. Dennis RLH, Shreeve TG, Isaac NJB, Roy DB, Hardy PB, Fox R, Asher J: The effects of visual apparency on bias in butterfly recording and monitoring. Biol Conserv. 2006, 128 (4): 486-492. 10.1016/j.biocon.2005.10.015.

    Article  Google Scholar 

  42. Gross K, Kalendra EJ, Hudgens BR, Haddad NM: Robustness and uncertainty in estimates of butterfly abundance from transect counts. Population Ecology. 2007, 49 (3): 191-200. 10.1007/s10144-007-0034-8.

    Article  Google Scholar 

  43. Haddad NM, Hudgens B, Damiani C, Gross K, Kuefler D, Pollock K: Determining optimal population monitoring for rare butterflies. Conservation Biology. 2008, 22 (4): 929-940. 10.1111/j.1523-1739.2008.00932.x.

    Article  PubMed  Google Scholar 

  44. Elmquist H, Nielsen PS: Åtgärdsprogram för bevarande av svartfläckig blåvinge (Maculinea arion). Rapport 5652. 2007, Stockholm, Sweden, Naturvårdsverket

    Google Scholar 

  45. Hamm C, Aggarwal D, Landis D: Evaluating the impact of non-lethal DNA sampling on two butterflies, Vanessa cardui and Satyrodes eurydice. J Insect Conserv. 2010, 14 (1): 11-18. 10.1007/s10841-009-9219-0.

    Article  Google Scholar 

  46. Gilbert MTP, Moore W, Melchior L, Worobey M: DNA Extraction from Dry Museum Beetles without Conferring External Morphological Damage. PLoS ONE. 2007, 2 (3): e272.-

    Article  PubMed  PubMed Central  Google Scholar 

  47. Goudet J: FSTAT (Version 1.2): A computer program to calculate F-statistics. J Hered. 1995, 86 (6): 485-486.

    Google Scholar 

  48. Peakall R, Smouse PE: GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes. 2006, 6: 288-295. 10.1111/j.1471-8286.2005.01155.x.

    Article  Google Scholar 

  49. Van Oosterhout C, Hutchinson WF, Wills DPM, Shipley P: MICRO-CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Mol Ecol Notes. 2004, 4 (3): 535-538. 10.1111/j.1471-8286.2004.00684.x.

    Article  CAS  Google Scholar 

  50. Chapuis MP, Estoup A: Microsatellite null alleles and estimation of population differentiation. Mol Biol Evol. 2007, 24 (3): 621-631.

    Article  CAS  PubMed  Google Scholar 

  51. Dakin EE, Avise JC: Microsatellite null alleles in parentage analysis. Heredity. 2004, 93 (5): 504-509. 10.1038/sj.hdy.6800545.

    Article  CAS  PubMed  Google Scholar 

Download references


We thank A.E. Lomborg for providing the contemporary samples from 2005 and the Natural History Museums in Aarhus and Copenhagen, Denmark, for kindly providing leg samples of M. arion from their collections, in particular Prof. N.P. Kristensen, Dr. O. Karsholt and Dr. S. Kaaber. This study could not have been performed without access to special clean lab facilities, and we are very grateful to the Centre for GeoGenetics at the Natural History Museum in Copenhagen, for providing these. Furthermore, we thank M.T.P. Gilbert and E. Willerslev for sharing their great expertise on working with old DNA. The work was financed by the Danish National Science Research Foundation via a grant to the Centre for Social Evolution.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Line V Ugelvig.

Additional information

Authors' contributions

LVU carried out the experimental work, and analysed the data together with DRN. PSN contributed with the survey data. The study was designed by LVU, JJB and DRN, who also wrote the manuscript. All authors have approved of the final version of the manuscript.

Electronic supplementary material


Additional file 1: Table S1 - Microsatellite loci used in the study. Name, GenBank accession numbers, repeat motif and primer sequences (F: forward, R: reverse primer) are given for newly developed microsatellite loci (for previously developed loci see references below). Product size in base pairs and the optimal annealing temperature in degrees Celsius for M. arion are also provided. N = number of study populations; n = number of genotyped individuals; k = observed number of alleles; Ho = observed heterozygosity. The genotype error rate calculated per locus and the fraction of positive PCRs per locus is given separately for historic samples (1930-1975) and contemporary samples (2005-2007). (PDF 90 KB)


Additional file 2: Figure S1 - Presence of 'Ghost' alleles in the isolated Danish M. arionpopulation. Allele frequencies per microsatellite locus found in each sampling year. Twelve alleles are only present in the historical samples (black arrows), whereas two alleles are unique to the contemporary (and 1975) samples (white arrows). Microsatellite loci name abbreviations: Ma = Macari, Mu = Macu. (PDF 378 KB)


Additional file 3: Table S2 - Detection of recent genetic bottlenecks. Empirical M-ratio values averaged over the ten microsatellite loci for each historic (1930-1957) and contemporary (2005, 2007) sampling year. An equilibrium population was simulated 1000 times for the parameter combination of Θ, Δg and ps, and P-values for the occurrence of a genetic bottleneck computed. Grey cells show evidence of a bottleneck (P< 0.05), whereas black cells show no evidence of a bottleneck. (PDF 122 KB)

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Ugelvig, L.V., Nielsen, P.S., Boomsma, J.J. et al. Reconstructing eight decades of genetic variation in an isolated Danish population of the large blue butterfly Maculinea arion. BMC Evol Biol 11, 201 (2011).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Microsatellite Locus
  • Null Allele
  • Effective Population Size
  • Allelic Richness
  • Swedish Population