- Research article
- Open Access
Split it up and see: using proxies to highlight divergent inter-populational performances in aquaculture standardised conditions
BMC Ecology and Evolution volume 21, Article number: 206 (2021)
Considering wild inter-populational phenotypic differentiation can facilitate domestication and subsequent production of new species. However, comparing all populations across a species range to identify those exhibiting suitable key traits for aquaculture (KTA; i.e. important for domestication and subsequent production) expressions is not feasible. Therefore, proxies highlighting inter-populational divergences in KTA are needed. The use of such proxies would allow to identify, prior to bioassays, the wild population pairs which are likely to present differentiations in KTA expressions in aquaculture conditions. Here, we assessed the relevance of three alternative proxies: (i) genetic distance, (ii) habitat divergence, and (iii) geographic/hydrologic distances. We performed this evaluation on seven allopatric populations of Perca fluviatilis for which divergences in KTA had already been shown.
We showed differences in the correlation degree between the alternative proxy-based and KTA-based distance matrices, with the genetic proxy being correlated to the highest number of KTA. However, no proxy was correlated to all inter-populational divergences in KTA.
For future domestication trials, we suggest using a multi-proxy assessment along with a prioritisation strategy to identify population pairs which are of interest for further evaluation in bioassays.
Inter-populational differentiation is the divergence between allopatric, peripatric, or parapatric conspecific populations (e.g. in fishes  and insects ). It results from specific demographic history, limited gene flow, random genetic drift, and/or local adaptations [3, 4]. Often considered in evolutionary and conservation biology , analysing and exploiting inter-populational differentiation is also of crucial interest for agriculture development. Indeed, populations undergoing inter-populational differentiation process can also present divergent expressions of key traits, which are essential phenotypic traits for domestication and subsequent production (see examples of key traits for fishes in [6, 7]). Therefore, considering population specificities could allow identifying population(s) which might facilitate farming of a new species [7,8,9]. Several success-stories have highlighted the interest of considering inter-populational specificities. For instance, the buff-tailed bumblebee (Bombus terrestris), which has been domesticated to act as pollinator in greenhouses, displays phenotypic differentiations between allopatric wild populations for traits impacting its ability to be industrially produced and to pollinate valuable crops . At the beginning of its production, bumblebee breeders tried to rear several wild populations  and, eventually, identified one with superior characteristics, triggering a flourishing development of the bumblebee production industry . Similarly, the development of the Atlantic salmon (Salmo salar) industry was facilitated by reaping the benefits from inter-populational differentiation. Initial comparisons of several key traits for fish farmers (e.g. growth rate, disease resistance) between wild populations allowed identifying those with higher performances which were then used as cornerstone fish stock of salmon production [12, 13]. Beside new domestication instances, wild intraspecific differentiation can also be used to enhance old-established species farming, because introduction of adaptive traits (i.e. genomic introgression) from wild populations can lead to phenotypic novelty in produced stocks .
Given the potential benefits of considering inter-populational phenotypic differentiation in domestication/farming programs, there is a premium on highlighting wild populations with divergences in key traits. This requires population assessments that can only be achieved through bioassays in which expressions of key traits are evaluated in a common experimental environment (i.e. common garden) close to future farming conditions. Similar assessments in natura would be doubtful because (i) some traits cannot be characterised during fieldworks due to technical reasons, and/or (ii) trait expression is the result of both genetic and environmental factors. The latter point is the main issue because phenotypic plasticity (sensu ) can shape observed divergences between wild populations. However, such divergences would be useless for domestication programs because they will not be (i) observed in ex-situ farming and (ii) conserved over generations in production environment. Only genetically based differentiations are of interest for domestication because production occurs in a system that is often quite different from the wild environment. The common-garden experiments are an efficient way to assess such differentiations since, through population comparison, it can highlight inter-populational divergences with a genetic basis . However, such experiments are cumbersome and expensive procedures and thus all populations inhabiting the species distribution range cannot be pragmatically compared by this approach. This raises the need of developing a strategy to limit the number of populations that should be further investigated by common-garden experiments. Randomly selecting some populations cannot seen as a solution because populations potentially valuable for aquaculture could be missed out. Moreover, from a phenotypic perspective, populations are not all notably divergent from each other, making comprehensive assessments uselessly time/money consuming. Therefore, one solution relies on limiting the number of populations to be compared before performing bioassays by (i) gathering populations that are likely undifferentiated and (ii) highlighting populations/population groups that most likely display specificities or, at least, divergences in key traits.
Already available intraspecific taxonomic classifications like subspecies , historically widely defined (e.g. [18, 19]), could provide the needed population sorting. Although such classifications are not based directly on pieces of information about key traits (e.g. genes coding for key traits), their potential relevance was highlighted with the B. terrestris domestication in which inter-populational differentiation of key traits matches with the subspecies . However, these taxonomic statuses have been fiercely criticised because (i) their relevance for systematics and (ii) criteria that should be considered for their definitions are doubtful [19, 20]. This has triggered a trend to eliminate the trinomial designation in several species groups (e.g. [21, 22]). In conservation biology, evolutionarily significant units, management units, distinct population segments, or designatable units, have been proposed to complement existing taxonomy and/or to go beyond systematists’ feud [23,24,25]. However, their interpretations and definition criteria vary among species and scientists (e.g. [26, 27]). Overall, debates and lack of consensus in systematics, evolutionary biology, and biological conservation make a ready-to-use population sorting approach unavailable for species domestication/production programs. Moreover, such classifications can highlight populations that are diagnosably distinct from other conspecific population groups but cannot detect the clines of variation that could lead to inter-populational divergences for key trait without clear geographic boundaries (e.g. [1, 28]). Yet, such divergences could also be useful for domestication programs. Therefore, there is a need to establish which approach(es) could be used to efficiently highlight populations that most likely display divergences in key trait expressions prior to their assessment by bioassays. This would ultimately allow selecting wild populations which need to be evaluated in farming conditions to determine which ones are the most interesting for further domestication and production.
One potential approach can be based on proxies which (i) are correlated with key traits and (ii) can be easily studied at large-scale. It is obvious that a good proxy should consist of using differentiation in genes under selection and coding for key traits. However, related methodologies, whether quantitative trait locus mapping or genome-wide association studies, are still underway, present some intrinsic limitations (e.g. explaining only a part of the phenotypic variance, cost and time consuming, knowledge of genome features), and/or are so far only developed on a limited number of species [29,30,31,32]. Therefore, alternative pragmatic proxies of phenotypic differentiation are needed.
A first alternative proxy consists of using neutral genes (i.e. locus that does not influence fitness). Indeed, divergences in such genes is widely used to highlight population groups, connected by a low or null gene flow, which underwent divergent demographic histories . Considering previous observations (e.g. [33, 34]), one could expect that such groups could have also acquired some phenotypic specificities, including in key traits for production. Indeed, genetic differentiation in neutral markers has previously been suggested as, at least partially, indicative of differentiation in genes coding for quantitative loci (, but see opposite opinion in ). Therefore, neutral loci-based genetic distance could be a relevant proxy to discriminate populations that are most likely different in their key traits. A second alternative proxy is based on the degree of divergence of a phenotypic trait [18, 37] (e.g. behavioural, eco-chemical, or morphological features). Such features can be used to evaluate phenotypic distance that could be linked to genetic divergence [38, 39] and, therefore, potentially highlight divergences in key traits. A third alternative proxy relies on using the degree of habitat divergence to maximise the detection of population groups with different key trait expressions. Indeed, populations inhabiting different environments may be phenotypically divergent  because resulting distinct selective pressures act as driving forces for phenotypic differentiation, notably through genetic-based local adaptations [41, 42] that could impact key trait expression. Finally, a fourth alternative could be to consider spatial distance as a relevant proxy. Indeed, given the limited dispersal ability, individuals that are far apart tend to be genetically more divergent than individuals that are spatially close .
Overall, all these alternative strategies could provide a solution to highlight wild population groups with divergences in key traits for species production. However, the relevance of these alternative proxies is still unsettled because there is no comparison of their efficiency to discriminate populations which would likely express divergences in key trait expressions. Here, we aim to compare this relevance to highlight inter-populational divergences in the expression of key traits for aquaculture (KTA). We consider a part of the life cycle (larval stage) of seven allopatric populations of the European perch (Perca fluviatilis) as a test case. We evaluate a set of KTA considered as important for the larval production of this species [7, 33, 44]: (i) growth traits (here growth rate, final growth heterogeneity, initial and final lengths) , (ii) development (survival rate, deformity rate, swim bladder inflation rate) and nutrition (yolk sac volume) traits [46, 47], (iii) behavioural traits that are aggressiveness , inter-individual distances which reflect group structure , and swimming activity . First, we evaluate these KTA expressions in the seven European perch populations. Second, we perform a retro-assessment of the three proxies on the same populations using the bioassays’ results. Finally, we compare the degrees of correlation between alternative proxy-based and KTA-based distance matrices to identify the most relevant proxies to highlight inter-populational divergences in KTA. Ultimately, we aim at providing guidelines for the application of proxies through a prioritisation procedure. This latter allows limiting the costly step of common garden experiment to the population pairs that are the most likely to display divergent KTA expressions.
KTA evaluation in aquaculture standardised conditions
Inter-populational differences were assessed using Kruskal-Wallis or ANOVA F-tests (see Methods for details). There was no statistically significant difference between populations for aggressive rate (K = 8.85, df = 6, p-value = 0.18) and length heterogeneity (F(6,14) = 0.86, p-value = 0.55).
A significant statistical differentiation between populations was found for survival rate (F(6,14) = 9.45, p-value = 5.75 × 10–4), swim bladder inflation rate (F(6,14) = 22.73, p-value = 6.79 × 10–6), deformity rate (F(6,14) = 11.29, p-value = 1.12 × 10–4), specific growth rate (F(6,14) = 8.64, p-value = 4.69 × 10–4), length at hatching (F(6,14) = 17.33, p-value = 2.85 × 10–5), final length (F(6,14) = 8.35, p-value = 5.58 × 10–4), yolk sac volume (F(6,14) = 32.95, p-value = 1.77 × 10–7), activity (K = 20.24, df = 6, p-value = 0.003), and inter-individual distances (F(6,56) = 5.59, p-value = 1.35 × 10–4). All inter-populational comparisons are available in Additional file 1: Fig. S1.
Correlation degrees of the different proxies with the KTA-based matrices
The series of successive multivariate analyses using multi-regression on distance matrices coupled with commonality analyses (further referred as “MRDM-CA”) allowed identifying several suppressors (see Methods for methodology details). Therefore, some KTA are only correlated with a single proxy remaining after suppressors’ removal (Table 1). Overall, MRDM-CA results indicate that significant global MRDM R2 are relatively high (> 10%), and that all KTA are associated with a positive and significant correlation with a unique proxy-based matrix (when considering only proxies for which the commonality analysis [CA] unique contribution U > 5%), except for final length which is not correlated with any proxy (Table 1). The geographic distance proxy is not correlated with any KTA-based distance matrix while the genetic distance proxy is the one which is correlated to the highest number of KTA-based distances matrices. However, no proxy shows a systematic significant contribution with all KTA. Considering genetic, habitat, and hydrologic distance proxies, all KTA are correlated with at least one of the proxies, except for final length (Table 1). Regarding Mantel tests, results are available in Additional file 2: Fig S2 and are globally congruent with MRDM-CA analyses. Overall, for each KTA, the proxy which was the most correlated in the Mantel test is the one which was significant in the MRDM-CA analysis.
A prioritisation procedure based on the sum of KTA-based or proxy-based distances was applied to our dataset. This procedure involved a k-mean clustering approach aiming at dividing all population pairs into k clusters (i.e. population pairs belonging to different clusters should be considered as different in terms of KTA or proxy; see Methods for details). The results obtained for the prioritisation strategy are presented in Table 2.
When looking at proxies independently, the highest proxy-distance value corresponds to different population pairs depending on the proxy considered (Table 2). The prioritisation procedure indicates three clusters with the A cluster grouping populations presenting the highest values for the sum of proxy-based distances (Table 2). Populations belonging to cluster A mostly belong to clusters 1 and 2 when considering the sum of KTA-based distances and present high numbers of KTA statistically differentiated (Table 2).
Assessment of KTA expression differentiation
Because we used a common garden experiment, we assumed that observed differentiations in the KTA expression have a genetic basis and are likely not due to phenotypic plasticity. Nevertheless, we used individuals sampled at the egg stage in the wild. Therefore, we cannot rule out that a part of KTA expression divergences is shaped by transgenerational effects or phenotypic plasticity [15, 51]. However, we argue that our experimental design has minimised as far as possible such potential biases. Another potential bias relies on the variable number of egg ribbons used between populations which could have influenced KTA results. However, we minimised the female specificity bias (i.e. maternal effects) by taking a large number of egg ribbons for each population.
Single proxy approach: no silver bullet
We showed that no single proxy-based distance matrix is correlated with all KTA-based distance matrices. We found four significant correlations for genetic distance and two significant correlations for hydrologic distance and habitat divergence when we compared them with KTA-based distance matrices (Table 1). We also observed that each KTA is only significantly correlated with one proxy (Table 1).
Since inter-populational divergences can be shaped by (i) gene flow disruption/limitation and/or (ii) local adaptation to specific selective pressures, we were expecting that the assessments based on alternative proxies could lead to different results. Indeed, relationships between these two divergence-triggering factors and the alternative proxy-based distance matrices are different.
Although larger spatial disjunction, using geographic or hydrologic distance, increases the likelihood to reinforce these divergence-triggering factors (e.g. ), inter-populational differentiation can happen between adjacent populations, for instance due to a strong ecological barrier (e.g. for Perca fluviatilis ) or behavioural processes (e.g. natal homing behaviour ). Detections of such differentiations at small geographic scale are thus unlikely with spatial disjunction a priori (e.g. divergent results between hydrologic and survival rate distance with ISO, KIE, and BAL, see https://figshare.com/s/1067e47637c5572caaf7).
The habitat divergence proxy can theoretically overcome limitations of spatial disjunction proxies because it could highlight specific selective pressures at any geographic scale and, thus, inter-populational divergences (e.g. [54, 55]). Nevertheless, we observed only few significant correlations between KTA and habitat divergence. This can be explained because the habitat divergence proxy assessment most likely fails to highlight populations undergoing different selective pressures. First, the selection of relevant variables involved in local selective pressure is still hard to achieve. Indeed, the importance of environmental variables on population evolution is species-specific (and unsettled for P. fluviatilis) and cannot be known without long and difficult bioassays (e.g. ). Second, relevant environmental data on freshwater ecosystems are unevenly available for the different regions of the world. For instance, in our study, the worldwide databases of freshwater environmental variables (e.g. ) do not cover the whole studied sampling area. This led us to use terrestrial data in order to extrapolate aquatic environment characteristics (see similar strategy in [58, 59]), although it could potentially blur studied lake specificities. Optimally, analyses should be performed using data specific to lakes (e.g. lake physico-chemical parameters, feeding base) to improve the use of this proxy but the unavailability of sufficient data limits this strategy. Beside these difficulties, the habitat divergence proxy assessment can fail because extensive gene flow from adjacent regions can limit local adaptations  and, thus, minimise KTA expression divergences, even if specific environmental pressure occurs locally.
One could expect that the genetic distance proxy would be the most efficient to highlight populations with divergent KTA expressions. Indeed, it could detect populations with genetic divergences due to gene flow limitation/disruption triggered by spatial/hydrologic distance (i.e. isolation by distance), geographic barriers (i.e. geographic isolation), ecological barriers, or behavioural specificity (e.g. natal homing behaviour). However, our results highlight that not all KTA are correlated with the genetic distance proxy. Moreover, some traits which are not correlated with the genetic distance proxy are correlated with another proxy. To explain the absence of correlation of some KTA with the genetic distance proxy, it could be hypothesised that populations could have diverged recently, implying that few divergences in neutral markers had time to appear [60, 61].
How to better catch the divergence: a multi-proxy strategy
Because there is no proxy-based distance correlated with all KTA-based distances, highlighting populations that likely display KTA expression divergences should be based on the use of several proxies. Genetic distance and habitat divergence proxies are theoretically complementary to maximise the detection of such populations. On the one hand, the genetic distance proxy is relevant to highlight divergence by distance or populations with divergent demographic histories while the habitat divergence proxy can reflect local adaptations which are not (already) visible in neutral markers. On the other hand, populations occurring in similar habitats can display phenotypic divergences, including in KTA. Thus, the use of these two proxies allows taking into consideration both gene flow disruption/limitation and local adaptation to specific selective pressures. However, these proxies can be sometimes misleading due to their intrinsic limitations (see before), particularly the habitat divergence proxy (unless performing the analysis with more relevant lake variables). One way to mitigate their limitations relies on the combination of these two proxies with the hydrologic distance proxy (according to our results, the geographic distance proxy can be beneficially replaced by hydrologic distance proxy, which is more relevant for aquatic species and is the only one correlated to some KTA-based distance matrices). The hydrologic distance proxy could be correlated with both phenotypic differentiation by distance and local adaptations between distant populations. Moreover, the hydrologic distance proxy is correlated in our study with two KTA which are not correlated neither with genetic distance or habitat divergence proxies. Even though its calculation might be time and system memory-consuming, it requires no specific information except for geographic coordinates which makes it simple and useable in all scenarios. Here, we promote the use of a multi-proxy approach to pinpoint a population or population group to be assessed for KTA expression divergences. Even if some proxies appear as less efficient, their simple evaluation requiring little information (e.g. geographic distance from GPS coordinates) makes their use valuable to increase chances of finding out divergences in KTA. Another potential alternative would rely on ranking the KTA since not all KTA have the same importance to stakeholders. This strategy would consist of choosing the proxy which is correlated to the most important KTA (e.g. growth rate and survival rate ). However, this strategy is not optimal since a successful domestication process requires the favourable expression of several key traits involved in various biological functions.
Regarding the multi-proxy strategy, it is likely that the different proxies drive to divergent conclusions regarding which populations should be considered. This implies integrating all proxies in the same decision framework to highlight population pairs that should be considered in common garden experiments. Recommending for such experiments any population pair that is somewhat divergent for one or more proxies would fail to limit the number of populations involved in costly bioassays. Therefore, prioritisation procedures should be developed.
An example of prioritisation procedure exemplified with our test-case species
We recommend evaluating population/population group pairs according to both their degree of divergences in each proxy and the degree of congruence in divergence of all proxies. We applied here a prioritisation procedure on our dataset to validate this procedure.
Firstly, in this example, depending on the proxy considered, the conclusion regarding which population pairs are the most interesting would differ (Table 2). For instance, considering the genetic proxy, the highest distance is seen between VAL and BAL while with hydrologic and habitat proxies, the comparisons VAL-HOH and GEN-HOH would be promoted, respectively. This confirms the need for a multi-proxy approach and a prioritisation procedure to make a consensus between proxies. Second, the k-mean clustering approach allows to highlight groups of population pairs for which sum values can be considered as similar. The highest proxy-based distance sum values were shown for cluster A. When comparing to KTA-based clustering, its groups population pairs belonging to clusters 1 and 2, except for five population pairs, and exclude all population pairs belonging to cluster 3. Population pairs belonging to cluster A also present a high number of KTA significantly differentiated. The match between clusters obtained with KTA-based distances and proxy-based distances is not perfect but this is expected since (i) not all KTA are correlated to each proxy and (ii) the correlation value is inferior to 1. Overall, this procedure allows to limit the number of population pairs, here to the nine population pairs belonging to cluster A. Any of those nine population pairs could be considered for further bioassays and would be likely to present divergences in KTA. The final number of pairs that will be effectively evaluated in bioassays will also depend on other pragmatic and economic factors which will further decrease the final number of pairs considered. Those factors include, among others, biological sampling possibilities and limitations (e.g. TRACES rules), distance from the aquaculture facilities which impacts sampling costs, availability of experimental facilities, as well as financial and human means.
Our results show differences in the correlation degree between the alternative proxy-based and key trait-based distance matrices. Although we show that the genetic proxy is the most correlated to phenotypic key traits, no proxy was correlated to all inter-populational divergences in key traits. Therefore, we argue that the use of a multi-proxy approach is the most efficient and a population prioritisation strategy is proposed to optimize it.
We obtained this conclusion by performing a retrospective study (i.e. after having performed bioassays) and P. fluviatilis populations of interest have been highlighted in previous studies [33, 44]. For this species, the population prioritisation might be mainly useful to integrate wild specimens from highlighted populations to improve current fish stocks since European perch is already produced. However, compared to other well-known aquacultural fish species, the domestication process is quite recent in P. fluviatilis  and this study could therefore optimise the choice of the founder populations in new fish farms.
We argue that the multi-proxy approach can be applied on new candidate species for aquaculture for which KTA information are completely unknow. Indeed, it constitutes a first step, prior to domestication, that allows identifying population pairs which are likely differentiated in KTA and which are worth to be considered for bioassays. This will limit the trial-and-error approach and the random selection of populations which are further assessed in bioassays. Moreover, the calculation of those proxies will also provide additional information such as genetic diversity/variability, which are useful in the following domestication steps to avoid inbreeding issues and estimate potential for selective breeding programs . Overall, the use of the multi-proxy approach will be useful to help further aquaculture development with other fish species. However, it still requires to be validated on other test-case species prior to large-scale application to aquaculture.
Test case species
The European perch is a freshwater species which is widespread across a diverse range of habitats in Eurasia . Due to its high socio-economic value (i.e. food market, recreational interest), this species started to be produced in the 1990’s, mostly in intensive recirculated aquaculture systems (RAS) [62, 64]. A geographic differentiation was already shown for P. fluviatilis in standardised conditions for KTA related to growth (e.g. [65, 66]), development (e.g. [33, 67]), and behaviour (e.g. [44, 49]). This makes the species an appropriate test-case to assess the relevance of alternative proxies to highlight KTA differentiation among wild populations.
KTA evaluation in aquaculture standardised conditions
All procedures used in this study were in accordance with national and international guidelines for protection of animal welfare (Directive 2010/63/EU). This study was conducted with the approval Animal Care Committee of Lorraine (CELMA n°66) and the French Ministry of Higher Education, Research, and Innovation (APAFIS13368-2018020511226118, APAFIS17164-2018101812118180).
Egg ribbons were collected during the spawning seasons across two years (May 2018 and April–May 2019). Seven lakes were sampled (Fig. 1): Valkea-Müstajärvi (VAL; 2018, Finland; WGS84: 61° 13′ 08″ N, 25° 07′ 05″ E), Iso-Valkjärvi (ISO; 2018, Finland; WGS84: 60° 57′ 21″ N, 26° 13′ 3″ E), Kierzlinskie (KIE; 2019, Poland; WGS84: 53° 47′ 54″ N, 20° 44′ 45″ E), Geneva (GEN; 2019, France; WGS84: 46° 22′ 7.20″ N, 6° 27′ 14.73″ E), Bourget (BOU; 2019, France; WGS84: 45° 44′ 12.469″ N, 5° 52′ 1.617″ E), Hohen Sprenzer (HOH; 2019, Germany; WGS84: 53° 55′ 10.369″ N, 12° 13′ 6.005″ E), and Balaton (BAL; 2019, Hungary; WGS84: 46° 54′ 23.375″ N, 18° 2′ 43.119″ E). After transportation, 13 to 32 ribbons per lake were incubated at 13 °C at 400 lx (for incubation details, see [33, 49]).
The experiment, performed in a RAS, started at one day post-hatching (dph) until the end of weaning, at 26 dph. For each population, after hatching, larvae from the different egg ribbons were mixed and transferred to three green internal-wall cylindro-conical tanks (three replicates per population) at a density of 50 larvae/L at our experimental platform of aquaculture (Unit of Animal Research and Functionality of Animal Products, University of Lorraine, Vandœuvre-lès-Nancy, France). Temperature was gradually raised to 20 °C (1 °C rise per day). Photoperiod was 12L:12D and light intensity stayed constant during the lighting period at 400 lx at the water surface (with simulation of dawn and dusk for 30 min). Larvae were hand-fed from 3 days post-hatching, seven times a day, during the illuminated period, every 1h30, with newly hatched Artemia nauplii (Sep-Art, INVE). At 16 dph, weaning started: Artemia ration was diminished by 25% every three days while ration of dry feed [BioMar (Nersac, France), 300 µm until 21 dph, then 500 µm] was increased in the same proportions. After 25 dph, larvae were only fed with dry feed. Tanks were cleaned daily after first-feeding and dead larvae were removed and counted. Oxygen and temperature were checked daily while nitrite and ammonium concentrations and pH were monitored three times a week. At the end of the experiment, larvae left in each tank were counted and sorted according to the presence/absence of swim bladder inflation (following protocol used in ) and skeletal deformities. Additional experimental details, including water parameters values, are provided in Toomey et al. .
Survival rate was calculated using the following formula: Nf*100/(Ni − Ns), in which Nf is the final number of larvae counted at the end of experiment, Ni the initial number of individuals, and Ns the number of larvae sampled along the experiment (i.e. sampling for behaviour experiments, see below). In order to evaluate growth traits, 30 larvae per population (i.e. ten larvae per cylindro-conical) were sampled the first and last days of the experiment. After sampling, larvae were euthanized with an overdose of MS-222 and preserved in formalin 4%. Individuals were measured for total length in ImageJ  (± 0.01 mm). Specific growth rate (SGR) was calculated using the following formula: SGR = 100*(ln(Lf) − ln(Li))*∆T−1 where Li and Lf are respectively the initial and final length and ∆T the length of experiment. Final growth heterogeneity was calculated in the following way: CVLf/CVLi where CV is the coefficient of variation (100*standard deviation/mean) and Li and Lf the initial and final length, respectively. Swim bladder inflation rate was calculated in the following way: 100*(SB + /Nf) with SB + the number of larvae with swim bladder and Nf the final number of larvae. Deformity rate was evaluated using the following formula: 100*(Nm/Nf) with Nm the number of deformed larvae (visible skeletal deformities) and Nf the final number of larvae. Aggressiveness, including enucleation which is a specific aggressive behaviour in P. fluviatilis , was evaluated based on the daily examination of dead larvae using the following formula: (Ne + Nt)/Nd where Ne is the number of enucleated larvae (i.e. missing one eye), Nt the number of truncated larvae (cannibalism type I; ), and Nd the number of dead larvae counted between five and 26 dph (not possible to count dead larvae the first five days but aggressive interactions are reported to start at later stages [45, 72]).
The protocol to evaluate inter-individual distances  and activity is adapted from [49, 74]. All methods were performed in accordance with the relevant guidelines and regulations. In a nutshell, each population was evaluated at 25 and 26 dph. For each population, 90 larvae (i.e. 30 larvae per cylindro-conical tank) were sampled the day before the experiment and transferred to aquaria (58 L; 80 lx) at 20 °C. After one night of acclimatisation, populations were tested by groups of ten larvae which were placed in circular arenas (10 lx, 30 cm diameter, 1.5 cm of water depth) and filmed (three arenas tested simultaneously; three replicates per cylindro-conical tank, nine replicates per population). After 30 min acclimatisation, the following 30 min were used to evaluate activity and inter-individual distances. At the end of the experiment, larvae were euthanized with an overdose of MS-222 for further length measurements. Mean total lengths of larvae tested from VAL, ISO, KIE, GEN, BOU, HOH, and BAL were respectively 12.90 ± 0.62 mm, 14.05 ± 0.55 mm, 10.90 ± 0.73 mm, 11.81 ± 1.01 mm, 11.77 ± 0.48 mm, 11.35 ± 0.72 mm, and 10.62 ± 0.47 mm. Images were extracted from videos every five minutes (six images per video) to evaluated inter-individual distances (i.e. mean of distances between a given individual and all the other individuals of the group; group cohesion indicator ). Analyses were made in ImageJ . For each image, coordinates of each individual (using the middle point between the eyes) were noted. This allowed measuring distances between a given individual and the other individuals of the group and all these distances were averaged to obtain one value per individual. The mean of values of all group members were averaged per image. Finally, and the mean between all image values allowed getting a mean value of inter-individual distances per replicate . Regarding activity, one image per second was extracted for six consecutive seconds every five minutes. Distance swam every second for five seconds was calculated and the mean allowed obtaining the distance swam per second for each individual. The mean between individual values allowed getting an activity for each image series. The mean between image series allowed calculating the activity per replicate.
Distance matrix for the different proxies
In the present study, we did not use any phenotypic proxy because morphological and/or behavioural information was not available for all wild perch populations. Moreover, we did not estimate such a proxy on larvae reared in the bioassays because these phenotypic traits can be strongly influenced by the environment (e.g. ). Therefore, we only evaluated three proxies: (i) genetic distance, (ii) habitat divergence, and (iii) geographic/hydrologic distance.
Genetic distance proxy
The genetic assessment of the seven populations was performed on 10 individuals per population collected randomly at the end of the experiment. Larvae were stored in 99% ethanol at − 20 °C until analyses. Samples were sent to Genoscreen (Lille, France) for DNA isolation, marker amplification, and Sanger sequencing. Three mitochondrial regions were studied (D-loop of control region, cytochrome b, and 16S rRNA; see references for primers in ) following the protocol available in Toomey et al. . Both strands of each PCR product were sequenced. Consensus sequences of mitochondrial regions were computed and edited using CodonCode Aligner 7.1.2 (CodonCode Corporation, Dedham, Massachusetts, USA). There was no uncertainty in the consensus sequences. The P. fluviatilis origin of each sequence was verified using BLAST . Sequence alignment was performed in MAFFT (default parameters; ). Translation to proteins for Cytb was performed in Mesquite 3.20 . A tandemly repeated array was identified in D-loop (previously reported in previous P. fluviatilis studies; ) and mutations in the repeated array were coded as a single mutational step for further analyses. Mitochondrial markers were concatenated within a single alignment for further analyses using Mesquite. Haplotype sequences were deposited in GenBank (GenBank accession numbers: MN939382 to MN939395). Genetic distances between populations were calculated in R using ɸST distance  in SPADS .
Habitat divergence proxy
We used 19 bioclimatic data (BIO1 to BIO19; https://www.worldclim.org/data/bioclim.html) as well as annual mean solar radiation (kJ m−2 day−1), annual mean wind speed (m s−1), and annual mean water vapor pressure (kPa) from WorldClim Version2 to assess sampling place environments. We considered these abiotic environmental parameters (at the sampling location) recorded between 1970 and 2000 at a resolution of 10 arc‐minutes . We also used four additional variables: lake area (km2), catchment area (km2), average and maximum depths (m), and altitude (m). All variables were standardised to zero mean and unit variance due to their different scales of measurement (package “vegan” ). A principal component analysis was performed (Additional file 3: Table S1; packages “MASS”  and “factoextra” ) using the Kaiser–Guttman rule to determine the number of axes to retain (here the first four axes representing 98.0% of the variance). Coordinates for each population replicate were extracted from the first four axes and a distance matrix using the Bray–Curtis distance method was calculated (R package “vegan”).
Geographic and hydrologic distance proxies
Geographic distance matrix was set through the calculation of the great-circle distances (i.e. shortest distance over the earth’s surface) according to the “Vincenty” (ellipsoid) method  implemented in the R package “geosphere” . Because this geographic distance is a priori not relevant for fish species, we also estimated a hydrologic distance using an algorithm based on circuit theory and implemented in the program Circuitscape . Rivers and lakes shapefiles were respectively extracted from the Catchment Characterisation and Modelling river and catchment database v2  and HydroSHEDS  and from Natural Earth (www.naturalearthdata.com) databases, and jointly rasterised on a geo-referenced grid with a resolution of 0.5 arcmin. The algorithm implemented in Circuitscape computes pairwise electric resistances by treating a raster as a grid of electric resistance or conductance. In the present case, we used Circuitscape to compute hydrologic distances measured as electric resistance between locations when treating the rivers raster as a grid of electric conductance. One of the advantages of the Circuitscape algorithm is that it considers the contribution of several potential pathways to compute pairwise distances (resistances) between locations. Specifically, we assigned a value of 1 to terrestrial raster cells that were not crossed by a river or occupied by a lake and a value of 1 + k for terrestrial raster cells crossed by a river or occupied by a lake, the parameter k thus defining to what extent raster cells crossed by a river are more conductible (i.e. facilitate movement) than raster cells that are not crossed by a river. To explore the impact of k on the outcome of our analyses, we tested three different values for k: k = 10, 100, and 1000. These three different values led to highly similar results and did not alter our conclusions (results not shown). Therefore, we only report results based on hydrologic distances computed with k = 100.
Relevance of the different proxies
In order to create a trait-by-trait distance matrix, we first checked the statistical differentiation (p-value < 0.05) of phenotypic traits between populations: we tested normality of distribution and homogeneity of variances using a Shapiro–Wilk test and a Levene test, respectively (R package “lawstat” ). When assumptions were not respected, data were log-transformed. To check the influence of the cylindro-conical tank on results, we compared linear (phenotypic traits as fixed factors, no random factor) and linear mixed models (cylindro-conical tanks as random factor and phenotypic traits as fixed factors; R package “lmer” ) using the corrected Akaike Information Criterion (AICc; R package “qpcR” . For most traits, there was no significant influence of the cylindro-conical tank on the model. Therefore, we used one-way analyses of variance (ANOVA F-test), followed by Tukey post hoc tests (R package “stats”), to evaluate differences between populations. When the effect of the cylindro-conical tank was significant, we performed the ANOVA on the linear mixed model and estimated marginal means were calculated (R package “emmeans” . Regarding aggressive rate and activity, assumptions were not met despite log-transformation. Thus, Kruskal–Wallis H tests and Dunn post hoc test (R package “PMCMR”  were used. Traits presenting no significant differentiation between populations were excluded from further analyses.
A trait-by-trait distance matrix was created using the Euclidean method (R package “vegan”, three replicates per population). All distance matrices are available in Figshare (https://figshare.com/s/1067e47637c5572caaf7). We performed multivariate analyses using multi-regression on distance matrices (1000 permutations) coupled with commonality analyses (MRDM-CA) using R packages “ecodist”  and “yhat” . For these analyses, all proxy-based distance matrices were preliminary standardised. The multi-regression on distance matrices (MRDM) analysis allows estimating the Pearson correlation coefficient r (i.e. direct effect of the proxy on the response variable, irrespectively from the influence of other proxies) and beta weights (i.e. total effect of the proxy on the response variable, considering the contribution of other proxies). The commonality analysis (CA) is a detailed variance-partitioning procedure which allows taking into consideration collinearity between proxies . This analysis estimates the unique (“U”; amount of variance in the KTA accounted for by each single proxy) and common (“C”; variance jointly explained by several proxies) contributions of each proxy. After the first round of MRDM-CA, total suppressors were identified and discarded because they can be responsible for artefactual relationships among variables. This can allow purifying the relationships between the remaining proxies and the response variable (i.e. in our case the KTA-based distance matrices) . Successive MRDM-CA were performed until all suppressors were removed. A proxy was considered as a total suppressor when unique contribution is counter-balanced by its (negative) common contribution (classical suppression) and/or when regression and correlation coefficients are of opposite signs (cross-over suppression; [99, 100]). In addition, we investigated correlations between trait-based and proxy-based distance matrices using Mantel tests in R (R package “vegan”; 9,999 permutations). Four Mantel tests were performed for each KTA: (i) ɸST genetic distance vs KTA-based distance matrix, (ii) habitat divergence vs KTA-based distance matrix, (iii) geographic distance vs KTA-based distance matrix, and (iv) hydrologic distance vs KTA-based distance matrix.
We here propose a prioritisation procedure based on our dataset. First, all KTA-distance and proxy-distance matrices were standardised between 0 and 1 (all raw proxy and KTA matrices are available at https://figshare.com/s/1067e47637c5572caaf7). Second, the average distance value among replicates was calculated for each proxy/KTA of each population pair. Third, the sum of the three proxy-distance values was calculated per population pair. Similarly, the sum of all KTA-distance values was also calculated per population pair. Relevant ranking of population pairs according to the sum of KTA or proxy values must take into account that pairs with similar sum values should be regarded as equivalent and no prioritisation among them cannot be proposed (i.e. indifference between pairs). Therefore, we chose to use a k-means clustering procedure to determine indifference groups of population pairs. It aims at partitioning population pairs into k clusters in which each population belongs to the cluster with the nearest centroid. For each sum variable (i.e. sum of KTA distances and sum of proxy distances), the optimal number of clusters (k) was determined using the Elbow method (R package “factoextra” ). Then, a k-means clustering was performed for dividing all population pairs into k clusters for each sum variable independently (R package “stats”).
Availability of data and materials
Corrected Akaike Information Criterion
Multi-Regression on Distance Matrices
Recirculation Aquaculture System
Specific growth rate
Key traits for aquaculture
Blanck A, Lamouroux N. Large-scale intraspecific variation in life-history traits of European freshwater fish. J Biogeogr. 2007;34(5):862–75.
Lecocq T, Brasero N, De Meulemeester T, et al. An integrative taxonomic approach to assess the status of Corsican bumblebees: implications for conservation. Anim Conserv. 2015;18(3):236–48.
Mayr E. Animal species and evolution. Massachusetts: Harvard University Press; 1963. p. 797.
Avise JC. Phylogeography: the history and formation of species. Cambridge: Harvard University Press; 2000. p. 447.
Frankham R, Ballou JD, Briscoe DA. Introduction to conservation genetics. 2nd ed. Cambridge: Cambridge University Press; 2010. p. 618.
Vandeputte M, Garouste R, Dupont-Nivet M, et al. Multi-site evaluation of the rearing performances of 5 wild populations of European sea bass (Dicentrarchus labrax). Aquaculture. 2014;424–425(March):239–48.
Toomey L, Fontaine P, Lecocq T. Unlocking the intraspecific aquaculture potential from the wild biodiversity to facilitate aquaculture development. Rev Aquac. 2020;12(4):2212–27.
Lecocq T, Toomey L. A workflow to design new directed domestication programs to move forward current and future insect production. Anim Front. 2021;11(3):69–77.
Lecocq T. Insects: the disregarded domestication histories. In: Teletchea F, editor. Animal domestication. London: IntechOpen; 2020.
Velthuis HHW, van Doorn A. A century of advances in bumblebee domestication and the economic and environmental aspects of its commercialization for pollination. Apidologie. 2006;37(4):421–51.
Lecocq T, Rasmont P, Harpke A, et al. Improving international trade regulation by considering intraspecific variation for invasion risk assessment of commercially traded species: the Bombus terrestris case. Conserv Lett. 2016;9(4):281–9.
Gunnes K, Gjedrem T. Selection experiments with salmon IV growth of Atlantic Salmon during two years in the sea. Aquaculture. 1978;15:19–33.
Gjedrem T. The first family-based breeding program in aquaculture. Rev Aquac. 2010;2(1):2–15.
Warschefsky E, Varma Penmetsa R, Cook DR, et al. Back to the wilds: tapping evolutionary adaptations for resilient crops through systematic hybridization with crop wild relatives. Am J Bot. 2014;101(10):1791–800.
Pigliucci M, Murren CJ, Schlichting CD. Phenotypic plasticity and evolution by genetic assimilation. J Exp Biol. 2006;209(Pt 12):2362–7.
Hutchings JA. Old wine in new bottles: reaction norms in salmonid fishes. Heredity (Edinb). 2011;106(3):421–37.
Mayr E. Systematics and the origin of species. New York: Columbia University Press; 1942. p. 372.
Haig SM, Beever EA, Chambers SM, et al. Taxonomic considerations in listing subspecies under the U.S. endangered species act. Conserv Biol. 2006;20(6):1584–94.
Braby MF, Eastwood R, Murray N. The subspecies concept in butterflies: has its application in taxonomy and conservation biology outlived its usefulness? Biol J Linn Soc. 2012;106(4):699–716.
Phillimore AB, Owens IPF. Are subspecies useful in evolutionary and conservation biology? Proc R Soc B Biol Sci. 2006;273(1590):1049–53.
Mulcahy DG. Phylogeography and species boundaries of the western North American Nightsnake (Hypsiglena torquata): revisiting the subspecies concept. Mol Phylogenet Evol. 2008;46(3):1095–115.
Torstrom SM, Pangle KL, Swanson BJ. Shedding subspecies: The influence of genetics on reptile subspecies taxonomy. Mol Phylogenet Evol. 2014;76(1):134–43.
Frankham R, Ballou JD, Briscoe DA. Resolving taxonomic uncertainties and defining management units. In: Frankham R, Ballou JD, Briscoe DA, editors. A Primer of conservation genetics. Cambridge: Cambridge University Press; 2004. p. 101–22.
Mee JA, Bernatchez L, Reist JD, et al. Identifying designatable units for intraspecific conservation prioritization: a hierarchical approach applied to the lake whitefish species complex (Coregonus spp.). Evol Appl. 2015;8(5):423–41.
May SE, Medley KA, Johnson SA, et al. Combining genetic structure and ecological niche modeling to establish units of conservation: a case study of an imperiled salamander. Biol Conserv. 2011;144(5):1441–50.
Ryman N, Utter F, Laikre L. Protection of intraspecific biodiversity of exploited fishes. Rev Fish Biol Fish. 1995;5(4):417–46.
Fraser DJ, Bernatchez L. Adaptive evolutionary conservation: towards a unified concept for defining conservation units. Mol Ecol. 2001;10(12):2741–52.
Heibo E, Magnhagen C, Vøllestad LA. Latitudinal variation in life-history traits in Eurasian perch. Ecology. 2005;86(12):3377–86.
Yue GH. Recent advances of genome mapping and marker-assisted selection in aquaculture. Fish Fish. 2014;15(3):376–96.
Robledo D, Palaiokostas C, Bargelloni L, et al. Applications of genotyping by sequencing in aquaculture breeding and genetics. Rev Aquac. 2018;10(3):670–82.
Palaiokostas C, Houston R. Genome-wide approaches to understanding and improving complex traits in aquaculture species. CAB Rev. 2018;12(055):1–10.
Geng X, Zhi D, Liu Z. Genome-wide association studies of performance traits high-density linkage mapping. In: Zhanjiang JL, editor. Bioinformatics in aquaculture: principles and methods. Wiley: Hoboken; 2017. p. 415–33.
Toomey L, Lecocq T, Pasquet A, et al. Finding a rare gem: identification of a wild biological unit with high potential for European perch larviculture. Aquaculture. 2021;530:735807.
Lecocq T, Coppée A, Michez D, et al. The alien’s identity: consequences of taxonomic status for the international bumblebee trade regulations. Biol Conserv. 2016;195:169–76.
Merilä J, Crnokrak P. Comparison of genetic differentiation at marker loci and quantitative traits. J Evol Biol. 2001;14(6):892–903.
Holderegger R, Kamm U, Gugerli F. Adaptive vs. neutral genetic diversity: implications for landscape genetics. Landsc Ecol. 2006;21(6):797–807.
Craig JM, Crampton WGR, Albert JS. Revision of the polytypic electric fish Gymnotus carapo (Gymnotiformes, Teleostei), with descriptions of seven subspecies. Zootaxa. 2017;4318(3):401–38.
Joyce DA, Dennis RLH, Bryant SR, et al. Do taxonomic divisions reflect genetic differentiation? A comparison of morphological and genetic data in Coenonympha tullia (Müller) Satyrinae. Biol J Linn Soc. 2009;97:214–327.
Karakousis Y, Triantaphyllidis C, Economidis PS. Morphological variability among seven populations of brown trout, Salmo trutta L., Greece. J Fish Biol. 1991;38(6):807–17.
Carvalho GR. Evolutionary aspects of fish distribution: genetic variability and adaptation. J Fish Biol. 1993;43:53–73.
Schluter D. Ecology and the origin of species. Trends Ecol Evol. 2001;16(7):372–80.
Nakazato T, Bogonovich M, Moyle LC. Environmental factors predict adaptive phenotypic differentiation within and between two wild Andean tomatoes. Evolution (N Y). 2008;62(4):774–92.
Meirmans PG. The trouble with isolation by distance. Mol Ecol. 2012;21(12):2839–46.
Toomey L, Lecocq T, Bokor Z, et al. Comparison of single- and multi-trait approaches to identify best wild candidates for aquaculture shows that the simple way fails. Sci Rep. 2020;10(1):11564.
Kestemont P, Dabrowski K, Summerfelt RC. Biology and culture of percid fishes: Principles and practices. 2015. 1–901pp.
Boglione C, Gavaia P, Koumoundouros G, et al. Skeletal anomalies in reared European fish larvae and juveniles. Part 1: normal and anomalous skeletogenic processes. Rev Aquac. 2013;5:99–120.
Woolley LD, Qin JG. Swimbladder inflation and its implication to the culture of marine finfish larvae. Rev Aquac. 2010;2(4):181–90.
Kestemont P, Jourdan S, Houbart M, et al. Size heterogeneity, cannibalism and competition in cultured predatory fish larvae: Biotic and abiotic influences. Aquaculture. 2003;227(1–4):333–56.
Toomey L, Bláha M, Mauduit E, et al. When behavioural geographic differentiation matters: inter-populational comparison of aggressiveness and group structure in the European perch. Aquac Int. 2019;27(5):1177–91.
Boisclair D, Leggett WC. The importance of activity in bioenergetics models applied to actively foraging fishes. Can J Fish Aquat Sci. 1989;46(11):1859–67.
Youngson NA, Whitelaw E. Transgenerational epigenetic effects. Annu Rev Genomics Hum Genet. 2008;9(1):233–57.
Wright S. Isolation by distance. Genetics. 1943;28(2):114–38.
Bergek S, Björklund M. Cryptic barriers to dispersal within a lake allow genetic differentiation of Eurasian perch. Evolution (N Y). 2007;61(8):2035–41.
Jørgensen HB, Hansen MM, Bekkevold D, et al. Marine landscapes and population genetic structure of herring (Clupea harengus L.) in the Baltic Sea. Mol Ecol. 2005; 14(10):3219–34.
Langerhans RB, Chapman LJ, Dewitt TJ. Complex phenotype-environment associations revealed in an East African cyprinid. J Evol Biol. 2007;20(3):1171–81.
Chabot D, Claireaux G. Environmental hypoxia as a metabolic constraint on fish: the case of Atlantic cod, Gadus morhua. Mar Pollut Bull. 2008;57(6–12):287–94.
Domisch S, Amatulli G, Jetz W. Near-global freshwater-specific environmental variables for biodiversity analyses in 1 km resolution. Sci Data. 2015;2:150073.
Sherwood JL, Stites AJ, Dreslik MJ, et al. Predicting the range of a regionally threatened, benthic fish using species distribution models and field surveys. J Fish Biol. 2018;93(5):972–7.
Knouft JH, Anthony MM. Climate and local abundance in freshwater fishes. R Soc Open Sci. 2016;3:160093.
Arif I, Khan H. Molecular markers for biodiversity analysis of wildlife animals: a brief review. Anim Biodivers Conserv. 2009;38(9):989–1008.
Cruaud A, Gautier M, Galan M, et al. Empirical assessment of RAD sequencing for interspecific phylogeny. Mol Biol Evol. 2014;31(5):1272–4.
Fontaine P, Teletchea F. Domestication of the Eurasian Perch (Perca fluviatilis). In: Animal domestication. IntechOpen, London; 2019.
Stepien CA, Haponski AE. Taxonomy, distribution, and evolution of the Percidae. In: Kestemont P, Dabrowski K, Summerfelt RC, editors. Biology and culture of Percid fishes. New York: Springer; 2015. p. 3–16.
Fontaine P. L’élevage de la perche commune, une voie de diversification pour l’aquaculture continentale. INRA Prod Anim. 2004;17(3):189–93.
Vanina T, Gebauer R, Toomey L, et al. Genetic and aquaculture performance differentiation among wild allopatric populations of European perch (Percidae, Perca fluviatilis). Aquaculture. 2019;503:139–45.
Vanina T, Gebauer R, Toomey L, et al. Seeking for the inner potential: comparison of larval growth rate between seven populations of Perca fluviatilis. Aquac Int. 2019;27(4):1055–64.
Pimakhin A, Zak J. Effect of body size on swim bladder inflation in intensively cultured Eurasian perch larvae from different locations. In: World Aquaculture. 2014. p. 37–41.
Henderson-Arzapalo A, Lemm C, Hawkinson J, et al. Tricaine used to separate phase-I striped bass with uninflated gas bladders from normal fish. Progr Fish-Culturist. 1992;54(2):133–5.
Schneider CA, Rasband WS, Eliceiri KW. NIH Image to ImageJ: 25 years of image analysis. Nat Methods. 2012;9(7):671–5.
Jourdan S, Fontaine P, Boujard T, et al. Influence of daylength on growth, heterogeneity, gonad development, sexual steroid and thyroid levels, and N and P budgets in Perca fluviatilis. Aquaculture. 2000;186:253–65.
Baras E. Cannibalism in fish larvae: what have we learned? In: Qin JG, editor. Larval fish aquaculture. New York: Nova Science; 2013. p. 167–99.
Baras E, Kestemont P, Mélard C. Effect of stocking density on the dynamics of cannibalism in sibling larvae of Perca fluviatilis under controlled conditions. Aquaculture. 2003;219(1–4):241–55.
Buske C, Gerlai R. Early embryonic ethanol exposure impairs shoaling and the dopaminergic and serotoninergic systems in adult zebrafish. Neurotoxicol Teratol. 2011;6:698–707.
Colchen T, Teletchea F, Fontaine P, et al. Temperature modifies activity, inter-individual relationships and group structure in fish. Curr Zool. 2017;63(2):157–83.
Svanbäck R, Eklöv P. Genetic variation and phenotypic plasticity: causes of morphological and dietary variation in Eurasian perch. Evol Ecol Res. 2006;8(1):37–49.
Toomey L, Dellicour S, Vanina T, et al. Getting off the right foot: integration of spatial distribution of genetic variability for aquaculture development and regulations, the European perch case. Aquaculture. 2020;521:734981.
Zhang Z, Schwartz S, Wagner L, et al. A greedy algorithm for aligning DNA sequences. J Comput Biol. 2000;7(1–2):203–14.
Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20(4):1160–6.
Maddison WP, Maddison DR. Mesquite: a modular system for evolutionary analysis. 2001.
Nesbø CL, Arab MO, Jakobsen KS. Heteroplasmy, length and sequence variation in the mtDNA control regions of three percid fish species (Perca fluviatilis, Acerina cernua, Stizostedion lucioperca). Genetics. 1998;148(4):1907–19.
Excoffier L, Smouse PE, Quattro JM. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics. 1992;131(2):479–91.
Dellicour S, Mardulyn P. SPADS 1.0: a toolbox to perform spatial analyses on DNA sequence data sets. Mol Ecol Resour. 2014;14(3):647–51.
Fick SE, Hijmans RJ. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int J Climatol. 2017;37(12):4302–15.
Oksanen FJ, Blanchet G, Kindt R, et al. Vegan: Community Ecology Package. 2011. (Tertiary Vegan: Community Ecology Package).
Venables W, Ripley B. Modern applied statistics with S. 4th ed. New York: Springer; 2002.
Kassambara A, Mundt F. factoextra: Extract and visualize the results of multivariate data analyses. 2017.
Vincenty T. Direct and inverse solutions of geodesics on the ellipsoid with application of nested equations. Surv Rev. 1975;23(176):88–93.
Hijmans RJ. geosphere: Spherical Trigonometry. R package version 1.5–10. 2019.
Shah VB, Mcrae B. Circuitscape: A Tool for Landscape Ecology. In: Varoquaux G, Vaught T, Millman J, editors. Proceedings of the 7th Python in Science Conference. 2008. p. 62–6.
Vogt J, Soille P, De Jager A, et al. A pan-European River and Catchment Database. OPOCE. 2007.
Lehner B, Verdin K, Jarvis A. New global hydrography derived from spaceborne elevation data. Eos, Trans Am Geophys Union. 2008;89(10):93.
Gastwirth JL, Gel YR, Hui W, et al. Lawstat - Tools for Biostatistics, Public Policy, and Law. 2015.
Kuznetsova A, Brockhoff PB, Christensen RHB. lmerTest package: tests in linear mixed effects models. J Stat Softw. 2017;82(13):1–26.
Spiess A-N. qpcR: Modelling and Analysis of Real-Time PCR Data. R Package version 14-1. 2018.
Lenth R. emmeans: Estimated Marginal Means, aka Least-Squares Means. R Package version 1351. 2019.
Pohlert T. PMCMR: Calculate Pairwise Multiple Comparisons of Mean Rank Sums. 2014.
Goslee SC, Urban DL. The ecodist package for dissimilarity-based analysis of ecological data. J Stat Softw. 2007;22(7):1–19.
Nimon K, Lewis M, Kane R, et al. An R package to compute commonality coefficients in the multiple regression case: an introduction to the package and a practical example. Behav Res Methods. 2008;40(2):457–66.
Prunier JG, Colyn M, Legendre X, et al. Multicollinearity in spatial genetics: separating the wheat from the chaff using commonality analyses. Mol Ecol. 2015;24(2):263–83.
Dellicour S, Gerard M, Prunier JG, et al. Distribution and predictors of wing shape and size variability in three sister species of solitary bees. PLoS ONE. 2017;12(3):e0173109.
We gratefully acknowledge Yannick Ledoré (URAFPA, France), Zoltán Bokor, Árpád Ferincz, Adam Staszny (SZIU, Hungary), Fodor Ferenc (Balatoni Halgazdálkodási Nonprofit Zrt, Hungary), Sami Vesala, Katja Kulo, and Pasi Ala-Opas (LUKE, Finland), Espinat Laurent, and Goulon Chloé (INRAE, France) for their help with sampling. Special thanks to Joëlle Couturier (URAFPA, France). Authors acknowledge Fabrice Teletchea (URAFPA, France), Nicolas Poulet (AFB, France), and Edwige Quillet (INRAE, France) for their helpful comments and discussions on this experimental design.
LT was supported by a grant from the French Ministère de l'Enseignement Supérieur et de la Recherche. SD is supported by the Fonds National de la Recherche Scientifique (FNRS, Belgium). This work was supported by the région Grand Est (DomPop project).
Ethics approval and consent to participate
All procedures used in this study were in accordance with national and international guidelines for protection of animal welfare (Directive 2010/63/EU). This study was conducted with the approval Animal Care Committee of Lorraine (CELMA n°66) and the French Ministry of Higher Education, Research, and Innovation (APAFIS13368-2018020511226118, APAFIS17164-2018101812118180).
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Principal component analysis biplot representing environmental variables (in blue) and populations (dots in grey) using the first two axes. Populations: BAL: Balaton, VAL : Valkea-Müstajärvi, ISO: Iso-Valkjärvi, KIE: Kierzlinskie, GEN: Geneva, BOU: Bourget, HOH: Hohen Sprenzer. BIO1 to BIO19 correspond to the bioclimatic variables from Worldclim. Vap, wind, and srad correspond to water vapor pressure, wind speed, and solar radiation, respectively.
Barplots representing results obtained for all key traits studied (n = 3 per population, except for activity and inter-individual distances for which n=9). Different letters indicate significant differences between populations (p-value<0.05) using post-hoc tests.
Correlation results between proxy-based (geographic, hydrologic, genetic, and habitat proxies) and KTA-based distance matrices with r the Mantel correlation value and its associated p-value (significant r (p-value<0.05) are indicated in bold).
About this article
Cite this article
Toomey, L., Dellicour, S., Kapusta, A. et al. Split it up and see: using proxies to highlight divergent inter-populational performances in aquaculture standardised conditions. BMC Ecol Evo 21, 206 (2021). https://doi.org/10.1186/s12862-021-01937-z
- Perca fluviatilis