Dispersal ability, habitat characteristics, and sea-surface circulation shape population structure of Cingula trifasciata (Gastropoda: Rissoidae) in the remote Azores Archipelago
BMC Ecology and Evolution volume 21, Article number: 128 (2021)
In the marine realm, dispersal ability is among the major factors shaping the distribution of species. In the Northeast Atlantic Ocean, the Azores Archipelago is home to a multitude of marine invertebrates which, despite their dispersal limitations, maintain gene flow among distant populations, with complex evolutionary and biogeographic implications. The mechanisms and factors underlying the population dynamics and genetic structure of non-planktotrophic gastropods within the Azores Archipelago and related mainland populations are still poorly understood. Here, the rissoid Cingula trifasciata is studied to clarify its population structure in the Northeast Atlantic Ocean and the factors shaping it, with a special focus on intra-archipelagic dynamics.
Coupling microsatellite genotyping by amplicon sequencing (SSR-GBAS) and mitochondrial datasets, our results suggest the differentiation between insular and continental populations of Cingula trifasciata, supporting previously raised classification issues and detecting potential cryptic diversity. The finding of connectivity between widely separated populations was startling. In unique ways, dispersal ability, habitat type, and small-scale oceanographic currents appear to be the key drivers of C. trifasciata’s population structure in the remote Azores Archipelago. Dispersal as non-planktotrophic larvae is unlikely, but its small-size adults easily engage in rafting. Although the typical habitat of C. trifasciata, with low hydrodynamics, reduces the likelihood of rafting, individuals inhabiting algal mats are more prone to dispersal. Sea-surface circulation might create dispersal pathways for rafts, even between widely separated populations/islands.
Our results show that gene flow of a marine non-planktotrophic gastropod within a remote archipelago can reveal unanticipated patterns, indicating that life in such areas is still far from well understood. We expect this work to be a starting point for the application of SSR-GBAS to other non-model marine invertebrates, providing insights into their population dynamics at distinct geographical scales and into hidden diversity. How widespread the role played by the complex interaction between functional traits, ecological features, and sea-surface circulation is in the population structure of marine invertebrates can be further addressed by expanding this approach to more taxa.
Species’ ranges are shaped by multiple biotic and abiotic factors; however, in the marine realm, dispersal ability is one of the main factors influencing the distribution of taxa, with significant evolutionary and biogeographic implications [1, 2]. Dispersal is frequently related to the duration and behaviour of marine invertebrates’ larval stages, classified either as planktotrophic or non-planktotrophic (np), the latter comprising lecithotrophic and direct developers [3, 4]. Among the dispersal strategies reviewed by Winston, rafting is the most relevant mechanism followed by epibenthic, shallow-water (< 50 m depth) np-invertebrates in temperate Atlantic waters [2, 6]. Details concerning larval development are known for only a few gastropods (e.g. [7,8,9,10,11,12]) and remain unclear for most species, in particular small ones. Dispersal pathways and processes are therefore poorly understood in many marine invertebrates. In particular, the characteristics of dispersal could help explain differentiation and genetic structure in response to ecological and geographical constraints.
Rissoidae Gray, 1847 is one of the best-known families of microgastropods. Research has been conducted over the past decades on distinct aspects of their biology, ecology, and evolution [10, 13,14,15,16,17,18,19,20,21,22,23,24]. The family comprises the largest number of small-sized marine gastropod species, conspicuous around the world and with 546 species in the Atlantic Ocean and Mediterranean Sea [13, 19, 23]. Among these, Cingula trifasciata (J. Adams, 1800) is found throughout the Northeast Atlantic Ocean, being reported for the Azores Archipelago, the Iberian Peninsula, and up to the Bay of Biscay and the British Isles [24,25,26,27,28,29]. This microgastropod species is commonly found in intertidal areas with low hydrodynamics and mesotidal regimes, which are exposed during low tide and submerged during high tide. A typical habitat for C. trifasciata is protected and enclosed gravel intertidal areas (Fig. 1a), especially beneath gravel and boulders that provide shelter and environmental stability [13, 14, 30]. The species is also present in algal turf (Fig. 1b), which in turn provides protection from wave action, predation, and desiccation during low tide. Several features of the rissoid C. trifasciata increase the likelihood of initiating dispersal by rafting, namely: (1) its minute size (< 5 mm); (2) its high abundance; (3) its association with intertidal habitats; and (4) the secretion of mucus from a posterior pedal gland that allows juvenile and adult rissoids to suspend themselves from the surface film.
Nevertheless, C. trifasciata is a np-species (direct developer), laying one to four eggs in algae . It is expected to be geographically restricted, as the lack of a free-swimming stage negatively affects the dispersal ability at early-life stages [6, 32, 33]. Its wide distribution across the NE Atlantic Ocean and reports on the Mediterranean Sea—Ceuta [34, 35]; Adriatic Sea —challenge the expectations for a np-species , although examples of other np-species with wide biogeographic ranges exist (e.g. Lasea spp. , Calyptraeid gastropods , trochids ). The np nature of C. trifasciata suggests that long-distance dispersal strategies other than rafting are unlikely. If that is the case, surface water currents should play a major role in shaping C. trifasciata genetic variation. These patterns may be especially intricate in a remote volcanic oceanic island system such as the Azores Archipelago.
The Northeast Atlantic Ocean surface circulation is characterized by the complex dynamics of the North Atlantic’s subtropical gyre (Fig. 2a). Near the Grand Banks off Newfoundland, the Gulf Stream branches into the North Atlantic Current system to the north and the Azores Current, which flows southeastwards towards Madeira and the Gulf of Cádiz [39,40,41]. In this area, nine volcanic oceanic islands spread over 650 km along a west-northwest to east-southeast orientation, forming the remote Azores Archipelago, located hundreds of kilometres from other landmasses and divided into three island groups: Eastern Group (Santa Maria and São Miguel), Central Group (Terceira, Graciosa, Pico, Faial, and São Jorge), and Western Group (Flores and Corvo) (Fig. 2b). Oceanographic circulation in this region is distinctive: east of the islands, the Azores Current and associated front flow between 30 and 37.5°N on time scales ranging from months to decades (mean 33.9 ± 1.3°N); north of the Azores Front, and mostly at subsurface levels centred around 36°N, the Azores Counter Current circulates westward [40, 43,44,45].
Using molecular tools, important biogeographical questions can be addressed, such as: (1) is there gene flow between continental and insular populations of the same species?; (2) how is the species dispersing within the archipelago, among distant populations?; (3) what are the drivers and factors influencing dispersal in the Azores? Similar questions have been posed in previous studies that intended to clarify patterns and processes affecting marine gastropods in the Azores [24, 48].
Mitochondrial markers display characteristics that make them powerful tools for the inference of molecular variability and population genetic structure, namely maternal inheritance, absence of genetic recombination, fast evolutionary rate, and increased susceptibility to the effects of genetic drift. Among markers of this type, the protein-coding Cytochrome Oxidase subunit I (COI) is one of the most widely used genetic markers for intraspecific analysis and population dynamics inference in animals [49, 50]. Among marine gastropods, COI provides a particularly good phylogeographic signal within the superfamily Rissooidea [16, 51, 52], to which Cingula trifasciata belongs. Microsatellites (SSR), due to their codominant nature, biparental inheritance, high heterozygosity and polymorphism levels, and multi-allelic character, are more informative and powerful than other markers [53,54,55,56].
During the past decades, microsatellite sets have been developed and characterized for marine gastropods, especially for abundant, commercially interesting or threatened species (e.g. [57,58,59,60,61,62,63,64]). These sets of SSR markers appear to be skewed towards large-sized genera, such as Nucella, Littorina, Buccinum, Haliotis and Concholepas. SSR markers, sometimes coupled with other molecular markers, have mainly been used in marine gastropods for paternity studies [65,66,67], phylogeography and species delimitation [68, 69], population structure [70,71,72,73,74], as well as dispersal and connectivity associated to larval development [75,76,77,78]. Microgastropods tend to be understudied when compared to larger relatives, requiring special attention in field surveys and still associated with taxonomic classification issues . This negative bias in the knowledge of microgastropods is also found in the application of molecular techniques to improve the study of population dynamics and phylogenetic questions.
In this work we investigate the processes shaping the genetic diversity and structure patterns of C. trifasciata, with a special focus on the influence of sea surface circulation. This was done by applying a traditional analysis of the COI molecular marker from several populations and developing a set of primers for microsatellite markers for this non-model species, using the SSR-GBAS approach, a method that relies on amplicon sequencing with second generation sequencing techniques to determine genotypes at microsatellite containing loci. By doing so, we aim to increase the current knowledge of the widespread rissoid C. trifasciata, as well as to clarify the intraspecific genetic structure and population dynamics of this species in the NE Atlantic Ocean, with an emphasis on its behaviour in the remote Azorean islands. With this study, we intend to understand the role of habitat and small-scale surface currents among islands in shaping the genetic structure in a remote archipelago.
Sequence data and genetic analyses
The COI dataset comprised a total of 75 sequences, with seven to 14 individuals per population. A total of 44 haplotypes were distinguished in the 658 bp alignment, none shared between populations from the Azores and Vigo. Among Azorean haplotypes, the estimates of evolutionary divergence ranged from 0.2 to 2.6% (see Additional file 1: Table S2). Unexpectedly high levels of divergence (~ 2%) were detected within the populations at Graciosa and Santa Maria, but a manual check of the chromatograms confirmed that the reported variation is real and not a consequence of misreads. Differences between the two populations at Graciosa Island seem to be negligible, so they are considered a single population for interpretation purposes. The lowest inter-locality differentiation levels are found in the comparisons of haplotypes from the central group, namely São Jorge, Pico, and Graciosa islands, and some haplotypes from the western locality of Mosteiros on São Miguel Island. Haplotypes from Vigo show low within-population divergence (average 0.23%), but considerably high divergence from the Azorean haplotypes, ranging from 3.6 to 5%.
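The divergence percentages above are pairwise comparisons between aligned haplotypes. As a minimal sketch, and assuming an uncorrected p-distance (this section does not state which distance or substitution model was used), such values could be computed as follows. The haplotype names and sequences in the toy example are hypothetical.

```python
from itertools import combinations


def p_distance(seq_a, seq_b, skip_chars="-N?"):
    """Proportion of differing sites between two aligned sequences.

    Sites with a gap or ambiguous base in either sequence are skipped
    (pairwise deletion). This is an assumption, not the paper's stated method.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must come from the same alignment")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in skip_chars or b in skip_chars:
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def pairwise_divergence(haplotypes):
    """Return {(name1, name2): p-distance} for every pair of haplotypes."""
    return {
        (n1, n2): p_distance(s1, s2)
        for (n1, s1), (n2, s2) in combinations(haplotypes.items(), 2)
    }


# Hypothetical 10 bp toy alignment (the real alignment is 658 bp).
toy = {"H1": "ACGTACGTAC", "H2": "ACGTACGTAT", "H3": "ACGAACGTNT"}
dist = pairwise_divergence(toy)
for pair, d in dist.items():
    print(pair, f"{100 * d:.1f}%")
```

Multiplying the returned proportion by 100 gives values on the same scale as the percentages quoted in the text.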
The distribution of mitochondrial haplotypes (Fig. 3) reflects the estimates of evolutionary divergence, with congruent relationships inferred by the TCS network and UPGMA tree. Only two haplotypes are shared by two or more populations (H14 among Mosteiros (São Miguel Island) and both populations of Graciosa Island; H16 among Mosteiros, Graciosa, Pico, and São Jorge), the remaining being exclusive to the population in which they are found. This level of genetic differentiation and uniqueness of the mitochondrial set is observed even within the same island, as Mosteiros and Caloura (São Miguel Island) share no haplotypes, although the former shares haplotypes with the more distant central islands. The closely related haplotypes from Pico, São Jorge, and Graciosa form one major star-shaped group in the TCS network (Fig. 3a) that also includes one haplotype exclusive to Santa Maria Island (H4), another exclusive to Caloura (H11), and several from Mosteiros. The remaining haplotypes form a dispersed second group, connected by several inferred mutation steps, which comprises haplotypes from the eastern (Santa Maria and most Caloura and Mosteiros haplotypes from São Miguel) and western (Flores) groups of Azorean islands. Haplotypes 25 (Pico) and 28 (São Jorge) were found in this second group of the network, while haplotypes from Graciosa, Mosteiros, and Caloura can be found in both groups. Haplotypes from Vigo (H42-44) form an isolated cluster, not connected to the Azorean haplotypes. The reconstruction of a Rissoidae phylogenetic tree (Fig. 4) allowed us to clarify the divergence levels within the family and to ascertain the position of C. trifasciata from the Azores and Vigo. The separation of the samples from the mainland and the islands is well supported (99%) in the phylogeny. A BLASTN search of VIG sequences revealed 100% cover and 95–96% identity with several C. trifasciata haplotypes from the Azores, but 99–100% identity with a shorter Galician sequence deposited in the database (KU695304; 55% query cover).
Pairwise F-statistics between populations were estimated for the COI dataset (cf. Additional file 1: Table S3 for details). Comparatively high and significant levels of differentiation, over 0.8, were inferred between Vigo and Azorean populations. Among Graciosa, Pico, and São Jorge Islands, the FST levels were low and non-significant, whereas Mosteiros (São Miguel Island, eastern group) shows only significant differences with Graciosa and Pico from the central group. The analysis revealed significant and high FST values in the pairwise comparisons with Flores Island. In comparison with the remaining estimates, the pairwise comparisons involving Flores, Santa Maria, and Caloura retrieved higher FST levels, suggesting higher isolation of these populations.
SSR marker discovery
For the SSR marker discovery, the MiSeq runs for the two samples produced 766,960 and 1,382,293 paired reads, respectively for PSM4350 and FSC5131. A total of 445,578 and 248,419 reads, which passed the quality control and merging steps were screened for SSR motifs. Of these, respectively, 450 and 8,673 reads containing SSR motifs complied with the criteria defined for the SSR search pipeline. These comprised 203 di-, 123 tri-, 107 tetra-, and 17 pentanucleotide repeats for PSM4350; and 2168 di-, 1477 tri-, 1303 tetra-, and 254 pentanucleotide repeats for FSC5131. A total of 42 primer pairs for C. trifasciata were designed only from sequences containing penta- and tetranucleotide repeats. Eleven out of the 42 designed primers failed to amplify in the single PCR test phase. The remaining primers were included in 4 multiplex primer mixes and are provided in Additional file 1: Table S1.
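To illustrate what screening merged reads for tetra- and pentanucleotide motifs can look like, the sketch below uses a regular expression with a backreference. It is only an illustration: the search criteria of the actual SSR pipeline are not given in this section, so the minimum repeat numbers in `MIN_REPEATS` are hypothetical, as is the toy read.

```python
import re

# Hypothetical thresholds (motif length -> minimum number of tandem repeats);
# the criteria used by the actual SSR search pipeline are not stated here.
MIN_REPEATS = {4: 5, 5: 4}


def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit.

    'ATAT' (= 'AT' x 2) and 'AAAA' are rejected; 'TAAA' is accepted.
    """
    return motif not in (motif + motif)[1:-1]


def find_ssrs(read, min_repeats=MIN_REPEATS):
    """Return (motif, repeat_count, start) for each SSR found in a read."""
    hits = []
    for motif_len, n_min in min_repeats.items():
        # Group 2 captures the motif; \2{n_min-1,} requires further copies.
        pattern = re.compile(
            r"(([ACGT]{%d})\2{%d,})" % (motif_len, n_min - 1)
        )
        for match in pattern.finditer(read.upper()):
            motif = match.group(2)
            if not is_primitive(motif):
                continue  # e.g. a dinucleotide run matched as a tetramer
            repeats = len(match.group(1)) // motif_len
            hits.append((motif, repeats, match.start()))
    return hits


# Hypothetical read containing six copies of TAAA flanked by other bases.
toy_read = "GG" + "TAAA" * 6 + "CC"
ssr_hits = find_ssrs(toy_read)
print(ssr_hits)  # [('TAAA', 6, 2)]
```

Restricting the search to longer motifs mirrors the choice, described in the text, of designing primers only from penta- and tetranucleotide repeats.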
SSR data analysis and SSR-GBAS genotyping
For the SSR-GBAS genotyping, a total of 2,543,202 paired reads were produced, which were reduced to 1,008,047 after quality control, merging, and primer demultiplexing steps. Raw reads can be accessed through the BioProject PRJNA702169. The number of reads per marker ranged from 1,794 to 121,460, with CT42_TAAA having the highest number of sequences. The number of sequences per sample varied between 38 (PS4380) and 56,441 (CAL5184). After filtering the matrix and excluding the samples with more than 60% missing data across all markers, the dataset was reduced to seven individuals from Santa Maria, nine from Caloura and two from Mosteiros (São Miguel), six from São Jorge, 11 from Pico, five from Graciosa, one from Flores, and three from Vigo (Galicia, Spain). As only one sample was successfully genotyped with SSR-GBAS, the population from Flores Island was excluded from further analyses. Although they had been successfully amplified in the single PCR test, the following markers were excluded because they failed in the multiplex step or in the Illumina MiSeq run, resulting in missing data or data unfit for further analysis: CT6_TTGT and CT17_TTTG were excluded due to non-specific amplification; CT1_TGTT, CT3_GAATA, and CT10_GTTT were excluded for yielding over 60% missing data. Thus, a total of 26 markers and 43 samples were suitable for population genetic analyses of C. trifasciata.
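The 60% missing-data threshold described above can be sketched as a simple filter over a genotype matrix. The data layout (a dictionary of samples, each mapping marker names to a genotype or `None`), the order of the two filtering steps (markers first, then samples), and all names in the toy matrix are assumptions made for illustration only.

```python
def missing_fraction(calls):
    """Fraction of entries in a list of genotype calls that are None."""
    return sum(call is None for call in calls) / len(calls)


def filter_matrix(genotypes, max_missing=0.60):
    """Drop markers, then samples, with more than max_missing missing data.

    genotypes: {sample: {marker: (allele1, allele2) or None}}
    The marker-then-sample order is an assumption, not the paper's protocol.
    """
    samples = list(genotypes)
    markers = sorted({m for calls in genotypes.values() for m in calls})
    kept_markers = [
        m for m in markers
        if missing_fraction([genotypes[s].get(m) for s in samples])
        <= max_missing
    ]
    kept = {}
    for sample in samples:
        row = {m: genotypes[sample].get(m) for m in kept_markers}
        if row and missing_fraction(list(row.values())) <= max_missing:
            kept[sample] = row
    return kept


# Hypothetical toy matrix: alleles are given as amplicon lengths.
toy_matrix = {
    "S1": {"M1": (200, 204), "M2": (150, 150), "M3": None},
    "S2": {"M1": (200, 200), "M2": None, "M3": None},
    "S3": {"M1": None, "M2": None, "M3": None},
    "S4": {"M1": (204, 208), "M2": (150, 154), "M3": None},
}
filtered = filter_matrix(toy_matrix)
print(sorted(filtered))  # ['S1', 'S2', 'S4']
```

In the toy matrix, marker M3 is dropped because it is missing in every sample, and sample S3 is dropped because it has no calls for the remaining markers.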
Population structure analyses
Genetic diversity measures
In the WAI (Whole Amplicon Information) dataset, the markers analysed had between 6 and 41 alleles (Additional file 1: Fig. S1a), which can be found in the GenBank database under the accession numbers MW623085-363. The polymorphic information content (PIC) per locus ranged between 0.43 and 0.954, with only 4 out of 26 markers displaying PIC values below 0.50. Although no markers were monomorphic across the complete WAI dataset, some markers were monomorphic in some populations (Table 1): CT23_TTTA in Caloura; CT24_CAAA in Graciosa; CT31_AATA in Graciosa; CT38_TCAT in Caloura; plus eight monomorphic markers in the Vigo population (data not shown). Not accounting for markers that were monomorphic within the studied populations, the complete WAI dataset showed Ho varying from 0.235 to 0.921 and He from 0.448 to 0.956. Within each of the five populations with five or more individuals (Additional file 1: Fig. S1b), the number of alleles (Na) ranged from 3.346 (São Jorge) to 4.538 (Santa Maria), all with a frequency over 5%, except in Pico. The number of effective alleles (Ne) was lower than Na in all populations, ranging from 2.357 (Pico) to 3.317 (Santa Maria). Ho and He followed the same trend across all populations when analysing the dataset per population, with Ho levels varying between 0.394 (São Jorge) and 0.620 (Santa Maria), and He between 0.498 (São Jorge) and 0.623 (Santa Maria). Private alleles were detected in all populations.
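For readers less familiar with these diversity measures, the sketch below computes Na, Ne, Ho, He, and PIC for a single locus from diploid genotypes, using the standard textbook definitions (He = 1 − Σp², Ne = 1/Σp², and PIC following Botstein et al.). It is not the software used in the study; the uncorrected He (no small-sample correction) and the toy genotypes are assumptions for illustration.

```python
from collections import Counter


def allele_frequencies(genotypes):
    """Allele frequencies from a list of diploid genotypes (a1, a2)."""
    alleles = [allele for genotype in genotypes for allele in genotype]
    counts = Counter(alleles)
    return {allele: n / len(alleles) for allele, n in counts.items()}


def locus_summary(genotypes):
    """Na, Ne, Ho, He and PIC for one locus (missing genotypes excluded)."""
    freqs = list(allele_frequencies(genotypes).values())
    sum_p2 = sum(p * p for p in freqs)
    # PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    cross = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    observed_het = sum(a != b for a, b in genotypes) / len(genotypes)
    return {
        "Na": len(freqs),           # number of alleles
        "Ne": 1 / sum_p2,           # effective number of alleles
        "Ho": observed_het,         # observed heterozygosity
        "He": 1 - sum_p2,           # expected heterozygosity (uncorrected)
        "PIC": 1 - sum_p2 - cross,  # polymorphic information content
    }


# Hypothetical locus with three alleles in four individuals.
toy_genotypes = [("A", "A"), ("A", "B"), ("B", "C"), ("A", "C")]
summary = locus_summary(toy_genotypes)
print(summary)
```

For the toy locus the allele frequencies are 0.5, 0.25, and 0.25, which gives He = 0.625, Ne ≈ 2.667, Ho = 0.75, and PIC ≈ 0.555. As in the reported data, Ne is lower than Na whenever allele frequencies are uneven.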
Tests for Hardy–Weinberg Equilibrium (HWE; Table 1) per marker and population revealed deviations at one locus in Caloura (CT5_TGAA) and three loci in Pico (CT23_TTTA, CT33_CATT, CT38_CATT). The potential causes were further evaluated with estimates of null allele frequency and Wright’s Fixation index (FIS). Errors in genotype assignment were ruled out as a cause of the deviations from HWE after checking the marker plots generated by the Allele Length calling script. The deviations were caused either by the dominance of null alleles (CT5_TGAA in Caloura), inferred from FreeNA and checked in the original matrix, by an excess of homozygotes indicated by FIS (CT33_CATT in Pico), or by both (CT23_TTTA in Pico). A total of six out of 26 markers deviated from HWE in the population from São Jorge (CT14_CAGA, CT18_AAAG, CT23_TTTA, CT24_CAAA, CT31_AATA, CT37_AAAT) due to an excess of homozygotes suggested by the FIS levels, which led to higher estimated frequencies of null alleles despite the absence of real null alleles. In the Vigo population (data not shown), three loci failed to amplify (CT23_TTTA, CT30_TGTC, and CT5_TGAA), resulting in null alleles across all the samples of this population; the negative values of FIS at several loci suggest an excess of heterozygotes in this population.
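Wright's fixation index relates observed to expected heterozygosity as FIS = 1 − Ho/He, so positive values indicate an excess of homozygotes and negative values an excess of heterozygotes. As a rough worked example, the sketch below applies this formula to the population-level mean Ho and He reported above for São Jorge and Santa Maria. These are not the per-locus FIS values used in the study, only an illustration of the direction of the effect.

```python
def fixation_index(ho, he):
    """Wright's FIS = 1 - Ho/He for one locus or one population mean."""
    if he <= 0:
        raise ValueError("expected heterozygosity must be positive")
    return 1.0 - ho / he


# Mean (Ho, He) per population as reported in the text.
reported_means = {
    "São Jorge": (0.394, 0.498),
    "Santa Maria": (0.620, 0.623),
}
fis_by_population = {
    pop: fixation_index(ho, he) for pop, (ho, he) in reported_means.items()
}
for pop, fis in fis_by_population.items():
    print(f"{pop}: FIS = {fis:.3f}")
```

The value for São Jorge (about 0.21) is clearly positive, in line with the homozygote excess described for that population, whereas Santa Maria is close to zero.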
Genetic and spatial structure analyses
The PCoA analysis of the complete WAI dataset (Fig. 5a–c) revealed a clear distinction between the populations from the Azores Archipelago (Santa Maria, Caloura, Mosteiros, Graciosa, Pico, São Jorge) and the one from Vigo (Galicia, Spain). Repeating the PCoA without the Vigo population clarified the similarities among Azorean populations (Fig. 5d–f), each forming smaller, distinct subclusters. In both dimensions, Mosteiros (São Miguel) is positioned between the populations from Graciosa and Pico, whereas Pico and Caloura show considerable dispersion along all the axes. Except for the Mosteiros population, the eastern populations seem to be more divergent than the remaining ones.
The STRUCTURE analysis was congruent with the PCoA results, with K-value of 6 estimated as optimal (Fig. 6), separating each population individually except for Pico and Mosteiros, which cluster together. At this K-value, Pico, Santa Maria, and Vigo populations each form a well-defined cluster, agreeing with the pattern retrieved from the PCoA. We have also considered lower K-values to evaluate potential correspondence between the hierarchy cluster and geographical distribution of the populations studied (Fig. 6). At K = 2, Vigo shares similar patterns of variation with the Azorean populations of Caloura and Santa Maria, whereas the second cluster is dominated by Pico and São Jorge individuals. The remaining populations, located in between these groups, show a considerable degree of admixture. At K = 3, Santa Maria and Vigo are assigned to one new cluster, while the other two clusters show evidences of admixture in some populations. Vigo forms a new cluster when K = 4 and, finally, at K = 5 it is assigned to an isolated cluster with no signal in other populations. From K = 3 to K = 5, the pattern roughly corresponds to a gradual transition of clusters from West to East. The assignment of Graciosa to a new cluster only when K = 6 falls out of this pattern, as does the unexpected placement of Mosteiros samples in the cluster together with Pico. For K-values over 7 (cf. Additional file 1: Fig. S2), cluster assignment becomes increasingly unclear for Azorean populations, and no further population sub-structure is inferred; Vigo population remains isolated.
The AMOVA analysis (Table 2), performed among regions (Azores and Vigo) and populations, revealed that 25% of the variation in the complete WAI dataset is found among regions, whereas the differentiation among populations accounts for 13% of the overall diversity. Regarding the F-statistics, a significant (p < 0.01) FST value of 0.385 was estimated for the overall WAI dataset, whereas the pairwise FST estimates for populations are detailed in Additional file 1: Table S4 (cf. Figure 8 for a summary). As expected, the highest differentiation is detected between Vigo and all the Azorean populations, with FST values over 0.35 in all pairwise comparisons. The lowest FST values are observed in Mosteiros vs. Pico (0.110), Mosteiros vs. Graciosa (0.119), and Graciosa vs. Pico (0.122), in congruence with the previous results of PCoA and STRUCTURE, suggesting gene flow between these populations.
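The overall FST can be related to the AMOVA percentages through the usual hierarchical partition, in which FST is the share of total variance found among regions plus among populations. The sketch below assumes a three-level partition and derives the remaining within-population share as 100 − 25 − 13 = 62%, which is an inference from the reported percentages rather than a value quoted from Table 2.

```python
def hierarchical_f_statistics(var_regions, var_pops, var_within):
    """F-statistics from the variance components of a hierarchical AMOVA.

    F_RT: among regions relative to total
    F_SR: among populations within regions
    F_ST: among populations relative to total
    """
    total = var_regions + var_pops + var_within
    if total <= 0 or var_pops + var_within <= 0:
        raise ValueError("variance components must sum to a positive value")
    return {
        "F_RT": var_regions / total,
        "F_SR": var_pops / (var_pops + var_within),
        "F_ST": (var_regions + var_pops) / total,
    }


# Percentages reported in the text; the within share is derived (assumption).
amova_stats = hierarchical_f_statistics(25.0, 13.0, 100.0 - 25.0 - 13.0)
for name, value in amova_stats.items():
    print(f"{name} = {value:.3f}")
```

With the rounded percentages this gives FST = 0.38, consistent with the reported overall value of 0.385 once rounding of the variance components is taken into account.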
With this work we address the biogeography and the processes shaping the genetic structure and evolutionary dynamics of Cingula trifasciata in the NE Atlantic Ocean, with a special focus on the Azores Archipelago. The results indicate a divergence between insular and continental populations of this microgastropod and the existence of genetic structure within the Azores Archipelago on a small spatial scale. This indicates dispersal barriers for the non-planktotrophic species that could form the starting point of population differentiation. To our knowledge, this is the first time that the SSR-GBAS protocol has been used to generate a large set of de novo SSR primers for a marine microgastropod. Compared to previous studies in other marine invertebrate taxa, a higher number of loci was amplified here, contributing to an informative dataset even with a low number of individuals. Thus, this methodology has the potential to be applied in similar study systems, increasing the number of markers successfully developed for marine taxa. These topics are further discussed in the next sections.
A de novo SSR primer dataset for Cingula trifasciata
In this study, a total of 26 SSR loci of C. trifasciata were developed, characterized, and genotyped using Illumina MiSeq sequence information according to the SSR-GBAS methodology described earlier [91,92,93]. Second-generation sequencing technologies revolutionized the discovery and genotyping of microsatellites, solving technical drawbacks posed by the traditional genotyping techniques [56, 92, 94,95,96,97]. The application of next-generation sequencing (NGS) to non-model species, for which a reference genome is usually not available, is achieved through downsizing approaches for the analysis of a small subset of loci, ultimately contributing to the development of genotyping by sequencing [56, 97]. The approach is frequently used in population genetics studies, detecting more microsatellite markers per sample and unique alleles in a fast and cost-effective way [56, 91, 92, 96,97,98]. Amplicon sequencing increases statistical power, even with a low number of sequenced markers, and improves the resolution of population genetic structure, making it suitable even for large-scale de novo population genetic studies in non-model species [56, 94, 97]. Moreover, this approach yields the complete sequence of the loci analysed, including the flanking region, the repeat motif, and potential SNPs (Single Nucleotide Polymorphisms), overcoming the homoplasy typical of SSR studies [92, 96, 98]. The recent SSR-GBAS (Simple Sequence Repeats, Genotyping by Amplicon Sequencing) workflow calls alleles based on the whole amplicon information (WAI), including allele length (AL) and SNPs. It results in a higher number of alleles recovered, higher marker variability and information content, and a lower influence of homoplasy, which together provide a better resolution of the SSR dataset. A script for data analysis was provided by de Barba et al. and another was later described by Curto et al.
Relying on approaches suggested by Illumina, the SSR-GBAS methodology follows a cost-effective optimization of a high level of multiplexing, up to 10 markers. The results are easily reproduced and analysed resorting to bioinformatic tools, allowing the automation of allele calling and reduction of traditional SSR artefacts. The SSR-GBAS method is useful for the study of small-scale systems, non-model organisms, and specific research questions, yielding potential to become an important tool to be incorporated in long-term screening projects and meta-analysis, along with data from other sources/teams, comparable to phylogenetic data collections. The SSR-GBAS methodology has been described in detail by [91,92,93] and summarized herewith.
In congruence with the previous studies using SSR-GBAS, a set of highly informative markers was obtained for the non-model, marine microgastropod, with moderate to high PIC values according to the classification scheme . Only loci with penta- and tetranucleotide repeats were included to reduce the complexity of PCR and artefacts that interfere with the call of alleles . Following this approach, allele call could be done unambiguously with no need to include manual edits after visual control of genotypes . Dinucleotides more frequently produce stutter bands that make allele determination difficult and, in the case of SSR-GBAS, additionally result in a higher error rate in the determination of single nucleotide polymorphisms, when alleles that differ by one repeat unit contain a SNP [92, 100, 101]. The additive signal of allele and stutter influences the base frequency at the SNP position. In our case, the penta- and tetranucleotide repeat loci comprised a high amount of information so the inclusion of the shorter repeat unit motifs was not necessary.
In addition to being highly informative, this newly generated set of SSR markers retrieved overall diversity parameters consistent with values reported for other marine gastropods [64, 76, 77]. Deviations from HWE, as found here in some populations (Caloura, Pico, São Jorge, and Vigo), are frequently reported in marine invertebrates [70, 72, 73, 102]. Reproductive patterns have been suggested to contribute to the excess of homozygotes in marine invertebrates [73, 103]. C. trifasciata possesses np-larvae with low dispersal ability, which contributes on the one hand to the high genetic structure detected and on the other to the homozygote excess in some populations. Nevertheless, these deviations should not affect the biogeographical interpretations regarding the Azorean populations, as their prevalence in other gastropod species has been shown not to influence the population structure inferred (e.g. [68, 73]). High variability in the flanking regions might contribute to the inference of null alleles in the Vigo population. Our results suggest that Azorean and Vigo populations constitute distinct evolutionary lineages, thus the specificity of the SSR primers, designed from Azorean samples, might be lower for individuals from Vigo. Therefore, the application of this set of SSR primers to individuals from Vigo is comparable to a situation of cross-species amplification, with the potential to negatively affect the success of the protocol in the related species [92, 105].
Although laboratory and analytical methodologies have developed fast in the past decades, molecular markers for molluscs are scarce and mostly based on traditional methods, as microsatellites and allozymes [73, 76, 106], also as a consequence of the problems arising during DNA extraction of molluscan tissues . The combination of markers with different information content has been shown to improve the sensitivity of population genetic inferences . However, the few datasets so far available do not truly represent the genetic and geographic differentiation in marine gastropods, and the lack of widely used molecular markers hampers comparative analyses of markers’ suitability in a study system . The development of de novo molecular markers, ideally transversal to related taxa, is urgently needed to increase our understanding on connectivity in the marine realm and evolution of invertebrates with different ecological and biological characteristics. SSR-GBAS methodology can be an important tool to achieve these goals, being easy to implement and fast to develop, as shown here with C. trifasciata [91, 92]. Datasets generated with this methodology, comprising variability from the repetition motif and SNP in the flanking region, are informative in distinguishing evolutionary entities. In understudied groups, as marine invertebrates, this type of markers might even be useful to uncover cryptic patterns of diversity, as detected in this work and further discussed in the next section.
Differentiation between insular and continental populations
The analysis of the mitochondrial marker COI of Cingula trifasciata revealed differentiation between samples from the Azores Archipelago and Vigo (Galicia, Spain). A complete separation of networks at the 95% connection limit, as found here for the Azorean and Vigo networks (Fig. 3), has previously been pointed out as indicative of deep differences, and is even considered by some to denote different species. Divergence levels between haplotypes from the Azores and Vigo ranged from 3.6 to 5% (cf. Additional file 1: Table S2) and this distinction becomes evident in a phylogenetic tree (Fig. 4). Within the superfamily Rissooidea, to which C. trifasciata belongs, the COI marker is an adequate diagnostic tool for species status because of its strong and reliable phylogeographic signal. We have further assessed evolutionary divergence levels of COI within the family Rissoidae, which range from 2.9 to 19.9% within the several genera analysed (Additional file 1: Table S5). The divergence between C. trifasciata from the Azores and Vigo falls within this interval and is surprisingly high for conspecific individuals. A similar level of differentiation (2.9%) occurs between Alvania mediolittoralis (Gofas 1989) and A. formicarum (Gofas 1989), two recognized species occurring in the Azores Archipelago with distinctive morphological characters. The inclusion of these Alvania species in a calibrated molecular phylogeny of Rissoidae has shown them to have diverged quite recently, about 0.36 million years ago (Ma), with an uncertainty range of 0.12–0.61 Ma, explaining the relatively low divergence levels. Following this rationale, we propose that a divergence event is currently acting on Cingula trifasciata, causing a split between the evolutionary lineages from the Azores and Vigo. The differentiation between insular and mainland (Vigo) populations is supported by the COI and WAI datasets, and is evident in both clustering analyses (Figs. 5, 6).
In the STRUCTURE analysis, samples from Vigo cluster together with no signs of admixture even when K = 2, and at K = 4 they form an individual cluster that is not admixed with other populations. The maintenance of this pattern and the absence of further sub-structuring for K > 6 support the uniqueness of Vigo’s samples relative to Azorean populations.
Although true allopatric isolation in the marine realm is difficult to accomplish, the large distances across deep-water and unsuitable habitats between locations could hamper dispersal and reduce gene flow between populations thought to have similar ecological requirements and means of dispersal [2, 19, 111]. Lower dispersal leads to the accumulation of genetic variants exclusive to each population and perhaps to partial isolation of the insular populations in relation to the mainland, here reflected in the genetic differences in the COI and SSR datasets. This scenario of insular isolation is not exclusive to C. trifasciata. Differentiation of Azorean populations, often together with high intra-archipelagic diversity levels, has been described for other marine organisms, such as the gastropod genus Patella and several fishes [113,114,115,116,117,118], but also for terrestrial organisms (e.g. Nyctalus azoreum [119, 120]; starlings).
Morphological differences between Azorean and Iberian individuals of C. trifasciata have previously been reported, both in easily observable characters (e.g. broader brown bands on the whitish shell; cf. Fig. 7) and in detailed observations of the tentacles and snout. Doubts raised two decades ago regarding the assignment of Azorean C. trifasciata to the European taxon are now supported by molecular data from mitochondrial and SSR markers. These results reinforce the importance of coupling morphological, ecological, and molecular data to gain a broader perspective of biodiversity, especially in such an understudied environment as the marine realm. Following the results of this study and the life-history traits of C. trifasciata, we propose that the insular and mainland populations are currently experiencing a split, constituting distinct evolutionary lineages. As C. trifasciata is the only known representative of the genus in the NE Atlantic Ocean, further studies including more populations across the established distribution range of the species in the NE Atlantic Ocean, as well as from the putative populations in the Mediterranean Sea, might clarify questions regarding its current taxonomic classification.
Population structure of Cingula trifasciata in the Azores Archipelago
The analyses of mitochondrial and SSR datasets provide insights at two different time scales, and both point to a strong genetic structure of C. trifasciata within the Azores Archipelago, suggesting low dispersal among islands. For Graciosa, Flores, and Mosteiros, the discordant number of individuals analysed for the two datasets can be explained by older samples or poorer preservation, which cause DNA degradation and make COI comparatively easier to amplify. The main findings regarding the relationships among Azorean populations of C. trifasciata are summarized in a geographical map of the archipelago (Fig. 8).
Mitochondrial loci such as COI are used for sequence analysis and are suited to the study of older divergence events and the molecular differentiation of species. In turn, SSRs are a multilocus method in which differences are defined via allele frequency, making them especially useful to study spatial patterns of diversity at smaller time and geographic scales. In that regard, the mitochondrial results suggest the occurrence of two haplogroups: 1) a star-like haplogroup, mainly from the central group of the Azores (Pico, São Jorge, and Graciosa), which suggests a recent differentiation in the region; 2) a linear branched haplogroup, mainly from the eastern (Santa Maria and São Miguel) and western (Flores) groups, as a sign of stable, isolated populations with less frequent gene flow. Several explanations can be hypothesised for this observation, such as the retention of ancestral polymorphisms, as Flores is separated from Santa Maria and São Miguel by more than 585 km of deep, unsuitable waters. Restrictions to gene flow of Flores’ populations had already been reported in the marine realm for Patella candei d'Orbigny, 1839 and P. ulyssiponensis Gmelin, 1791. Future fieldwork in more islands and localities in the Azores might allow more accurate inferences and reduce the number of unknown/unsampled mitochondrial haplotypes. Based on the WAI dataset, a stronger genetic structure is detected among populations. The clustering pattern in STRUCTURE roughly corresponds to a gradual East–West transition, except for the assignment of Graciosa and Mosteiros samples (cf. Fig. 6). These deviations from the geographical pattern indicate a recent colonization from the Central group or the recent exchange of individuals due to unusual patterns of circulation between these island groups.
Therefore, even though the Azorean islands are relatively isolated for long periods of time, occasional dispersal events through rafting and extreme weather might allow the exchange of individuals among distant populations [2, 48, 122].
Dispersal ability, habitat type, and oceanic circulation patterns within the archipelago appear to be the main drivers of population structure in the Azores. The finding of strong genetic structure of C. trifasciata in the Azores is consistent with the np-larval development and habitat preferences of this microgastropod [51, 73,74,75, 77, 123, 124]. The lack of a free-swimming dispersal stage during the larval development of C. trifasciata likely hampers dispersal ability at early-life stages [6, 32, 33]. Considering the direct development strategy, most of the dispersal must be ensured by early juveniles and adults in occasional rafting in algal patches [2, 5, 125], as reported for other direct developers (e.g. Batillaria cumingi (Crosse, 1862)). C. trifasciata shows the common characteristics of rafters [126, 127], namely its minute size and presence among/close to algal patches in the intertidal. Rafting can be the process behind some inter-island dispersal events in the Azores Archipelago.
In the Azores, C. trifasciata and other microgastropod fauna are easily found in protected sites, under boulders in enclosed tide pools with low hydrodynamics and mesotidal regimes [e.g. Cerco da Caloura, Poça da Barra, and Fajã da Caldeira do Santo Cristo, Mosteiros]. C. trifasciata has been reported in similar habitats outside of the Azores (Lough Hyne, Ireland). This rissoid has also been reported in Ceuta and Praia de Lobos (Santa Maria Island) associated with gravel/boulders, which are uncovered during low tide but are prone to wave action in bad weather conditions or at high tide. In the absence of such conditions, they are found in lower numbers in algal substrates or among big boulders in protected intertidal areas of the Azorean islands, as reported for Flores and Graciosa islands. The influence of habitat on the likelihood of dispersal events of juveniles and adults by rafting, thus affecting population structure, has been proposed for Steromphala spp. In our study system, one expects juveniles to drive genetic exchange in the gravel/boulder habitats, depending on the oceanographic features in the area. If the fauna is mainly associated with algal substrate, the chances of rafting in algae to distant localities are higher [2, 6]. The habitat type seems to play a major role at São Miguel Island, particularly at the enclosed locality of Caloura [14, 30, 34]. Migration events from and to Caloura seem to be rare, as the environment might prevent juveniles from leaving the protected area and the low algal coverage reduces the chances of adult rafting. The surprisingly low connectivity between Caloura and Mosteiros, just 45 km apart, can be attributed to the geomorphological characteristics of the southern coast of São Miguel Island, which features long stretches of sandy habitats unsuitable for C. trifasciata, making gene flow virtually impossible between these populations.
Mosteiros closely resembles populations from Graciosa and Pico (central group), located 215 and 225 km away, respectively. The sea-surface circulation regimes near the westernmost tip of São Miguel Island, where Mosteiros is located, might favour the frequent dispersal of individuals to or from the Central Group. Within the central islands, gene flow seems to be frequent and results point to recent haplotype differentiation. The distance separating the islands sampled in the central group (20 km Pico-São Jorge; 35 km São Jorge-Graciosa) is probably small enough not to constitute a barrier to occasional migration and demographic expansion, as supported by the low mitochondrial differentiation, at levels comparable to other gastropods [65, 106].
The success of rafting episodes and maintenance of connectivity are influenced by the complex sea-surface circulation patterns and hydrological conditions around the Azores Archipelago [129,130,131]. Although surface spatial differences between islands are attributed to large-scale circulation in the NE Atlantic, patterns at fine scale close to island shores are far from well understood but are likely related to colder waters from upwelling or mixing processes. Lagrangian transport pathways in the NE Atlantic Ocean are useful to understand potential rafting routes and major transport directions in the open ocean [130, 131]. These studies show the complexity of superficial (0–5 m) connectivity in the Azores area and among island groups, supporting our inferences regarding gene flow and exchange of individuals among populations: the western islands are the most isolated, with lower chances of exchange with the remaining islands; high connectivity levels are expected within the Central Group; the Eastern Group is isolated to some extent but with some degree of connectivity with the Central Group [130, 131]. While the direction and circulation of the transport pathways within and among island groups await further studies, one cannot ignore the potential role of temporary disruptions of these circulation patterns in the episodic long-distance transport of individuals, often due to extreme weather events lashing the Azores. Future long-term studies regarding oceanographic circulation and extreme events in the NE Atlantic might provide the necessary data to better explain the genetic structuring of C. trifasciata and other np marine invertebrates in the Azores Archipelago.
Combining mitochondrial and SSR datasets provides the opportunity to detect hidden patterns of cryptic diversity and to get a clearer perspective of gene flow among populations. Differentiation between C. trifasciata from the Azores and Vigo suggests the existence of potential cryptic diversity within the genus. Based on genetic data and life-history traits, these constitute distinct evolutionary lineages. Deep insights into the spatial genetic structure of a microgastropod in the isolated Azores Archipelago were also reached with this study, providing crucial knowledge to better understand gene flow and connectivity among islands and considerably isolated populations. Several aspects seem to work in concert to shape the genetic structure of C. trifasciata in the Azores. Its np nature hinders dispersal during the larval stage, but occasional exchange of individuals must occur through rafting or extreme weather events. Our results, especially the SSR dataset, provide a preliminary picture of these patterns in the remote islands of the Azores. A more comprehensive interpretation can only be achieved with further studies in related research areas, namely oceanography, but also by expanding molecular approaches to other marine invertebrates. Without a thorough understanding of fine-scale circulation patterns within the Azores, unexpected connections, such as the one detected between Mosteiros and Graciosa, remain difficult to explain. The application of the SSR-GBAS approach to other marine invertebrates might be the key to generating comparative datasets and to determining how widespread the patterns inferred for C. trifasciata are. How general the complex interaction between larval development type, ecological traits related to the habitat occupied, and sea-surface circulation patterns is in shaping the population dynamics and genetic structure of marine invertebrates can be further addressed by expanding the taxa studied.
Understanding differentiation levels and patterns of diversity at the regional scale might be a useful proxy for other intertidal invertebrate species, and might help to implement conservation and management strategies for shore habitats in a way that preserves connectivity in the remote Azores Archipelago.
With this work we aim to investigate the processes shaping the intraspecific genetic diversity and population dynamics of C. trifasciata in the NE Atlantic Ocean, with a special focus on its behaviour and factors shaping the genetic structure in the remote Azores Archipelago.
Sampling and DNA isolation
Cingula trifasciata specimens from the Azores Archipelago (Portugal) and Vigo (Spain) were used in this study, either recently collected by the authors in intertidal habitats or retrieved from collections (Table 3). Fresh samples from Flores Island were collected by integral algal scraping with 20 × 20 cm squares. In other Azorean localities, numerous populations of C. trifasciata are sheltered under coastal boulder deposits in intertidal pools: Santa Maria, São Miguel, São Jorge, and Pico. All the recently collected specimens were stored in 96% ethanol and deposited in the Marine Molluscs Collection of the Department of Biology of the University of the Azores (DBUA). Permits for sampling were issued by the respective authorities in the Azores (Direção Regional da Ciência e Tecnologia, Governo Regional dos Açores; AMP 2018/014, CCIP 24/2019/DRCT, CCIP 35/2019/DRCT). Samples from two localities in Graciosa island and from a NE Atlantic population in Vigo were loaned, respectively, from the DBUA and CIBIO-InBIO molluscs’ collections. A total of 75 samples were considered in this study.
Due to the minute size of the specimens, total genomic DNA (gDNA) was extracted from the entire animal, removed from the shell when possible, following the manufacturer’s instructions for the column-based commercial kit PureLink® Genomic DNA (Invitrogen™). gDNA was eluted in a volume of 40 μl and its quality assessed based on the absorbance ratios measured with Nanodrop®2000. An electrophoretic run in 0.8% agarose gel at 100 V for 50 min was performed to evaluate DNA integrity.
Sequence data and genetic analyses
The fragment of the mitochondrial cytochrome oxidase subunit I (COI) was amplified in 25 μl volumes, containing 12.5 μl of QIAGEN Multiplex PCR Master Mix (Qiagen, CA, USA), 5 µL of each primer at a concentration of 2 μM, and 2.5 µL of gDNA. The amplification of COI was achieved with the primers LCO1490/HCO2198 or jgLCO1490/jgHCO2198. The following cycling profile was applied to both markers: 95 °C for 15 min; 35 cycles of 95 °C for 30 s, 50/55 °C for 1 min, 72 °C for 30 s; 72 °C for 10 min. PCR products were checked by electrophoresis. Purification of the PCR products and bi-directional Sanger sequencing were performed by a commercial facility (Genewiz, Leipzig, Germany) using the same primers as for PCR.
Geneious 8.1.9 was used to manually check the generated chromatograms for potential misreads. The reviewed mitochondrial coding COI sequences were inspected for stop codons and putative pseudogenes by translating them into amino acids with the ExPASy Translate Tool. All the sequences generated during this study were deposited at GenBank, under the accession numbers MW518062-117 and MW518858-63. Additional sequences of C. trifasciata from the Azores, publicly available at GenBank, were included in the COI dataset: MG652395-MG562406. The COI dataset was aligned with the Clustal Omega algorithm via Web Services by EMBL-EBI and the 75 sequences were reduced to 44 haplotypes. Raw (p) distances among haplotypes of C. trifasciata were calculated in MEGA v7 to estimate evolutionary divergence and sequence diversity. The frequency, distribution, and genetic structure of the haplotypes in the populations were further examined in a statistical parsimony haplotype network at the 95% connection limit, generated with the software TCS v1.21. The output was rendered using the web-based program tcsBU, which allows the genetic structure retrieved by TCS to be overlaid with the geographical structure of the populations studied. A UPGMA tree of C. trifasciata haplotypes was obtained with Geneious 8.1.9, and bootstrap support was calculated with 1,000 replicates. The evolutionary divergence levels within the family Rissoidae were further evaluated with a maximum-likelihood analysis conducted in RAxML v8.2.7, under the GTR model of substitution and based on 18 COI sequences of different rissoid species retrieved from GenBank (cf. Additional file 1: Table S5 for details) and C. trifasciata from the Azores and Vigo. Pairwise FST estimates with COI haplotypes of C. trifasciata and Rissoidae species were performed in Arlequin v3.5, with statistical significance tested by 1023 permutations and considered for p-values below 0.01.
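To make the haplotype reduction and the raw (p) distance concrete, the following is a minimal Python sketch, not the procedure implemented in MEGA or TCS. The function names and the toy sequences are hypothetical, and skipping sites with gaps or ambiguous bases (pairwise deletion) is an assumption of this sketch rather than a setting reported in the text.

```python
from collections import Counter
from itertools import combinations


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap or an ambiguous base are
    skipped (pairwise deletion); this is an assumption of the sketch.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in valid and b in valid:
            compared += 1
            if a != b:
                differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


def collapse_haplotypes(aligned: dict) -> Counter:
    """Count how many samples carry each distinct aligned sequence."""
    return Counter(seq.upper() for seq in aligned.values())


# Hypothetical toy alignment, for illustration only.
toy = {
    "s1": "ACGTACGTAC",
    "s2": "ACGTACGTAC",
    "s3": "ACGTTCGTAC",
    "s4": "ACGAACGTAT",
}
haplotypes = collapse_haplotypes(toy)
for hap_1, hap_2 in combinations(haplotypes, 2):
    print(hap_1, hap_2, round(p_distance(hap_1, hap_2), 3))
```

Collapsing by exact string identity treats sequences that differ only by gaps or ambiguity codes as distinct haplotypes, so a real dataset would need those cases handled before counting.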
SSR marker discovery
Two low-coverage Illumina MiSeq runs, from two C. trifasciata individuals (PSM4350 from Graciosa and FSC5131 from São Jorge), were conducted for marker development and raw reads can be accessed through the BioProject PRJNA702169, at GenBank. Library preparation and sequencing using shot-gun genomic libraries without enrichment on the Illumina MiSeq paired-end (PE) 300 bp were performed at the Genomics Service Unit, Ludwig-Maximilian University Munich, Germany. The resulting reads, after quality check with FastQC software, were processed by Trimmomatic v0.39 to trim adapters and low-quality regions (Phred score < 20) in forward and reverse reads, which were then merged using Usearch v11. Merged reads were used as input for the identification of sequences containing microsatellite (SSR) motifs with the SSR_pipeline’s script SSR_search.py. For the SSR search, the following quality control parameters were defined to ensure that the sequence contains: 1) a minimum of 30 bp flanking regions on both sides of the motif; 2) a minimum of five repeats for penta- and tetranucleotides; 3) a minimum of seven repeats for trinucleotides; 4) a minimum of nine repeats for dinucleotides. These less stringent parameters allowed the extraction of a considerable number of sequences containing SSR motifs, which were manually checked to remove sequences containing interrupted motifs, more than one repetitive motif, or long mononucleotide stretches (> 6 bp). If this filtering step resulted in a low final number of usable reads, sequences containing mononucleotide repeats were maintained.
Primer3, as implemented in Geneious 8.1.9, was chosen to design primers, as a batch under manual control. The following parameters were set for the primer design: length of 19 to 22 bp, optimal melting temperature of 55 °C, GC content between 20 and 80% with an optimum of 50%, and amplification product size between 300 and 450 bp. Only primers producing amplicons comprising the complete SSR repetitive motif in the first or last 300 bp were selected, ensuring its coverage by one of MiSeq’s paired reads. This is crucial to avoid problems in the overlap of the paired reads during the merging step of the bioinformatics pipeline [91, 92]. We added recognition sequences corresponding to the Illumina adapter to the selected primer pairs: part of the P5 motif (TCTTTCCCTACACGACGCTCTTCCGATCT) elongated the forward primer, whereas part of the P7 motif (CTGGAGTTCAGACGTGTGCTCTTCCGATCT) was added to the reverse primer. These recognition sequences serve as linkers for a second index PCR using primers containing the eight bp indexes and TruSeq adapters (P5: AATGATACGGCGACCACCGAGATCTACAC [Index] ACACTCTTTCCCTACACGACG; and P7: CAAGCAGAAGACGGCATACGAGAT [Index] TGACTGGAGTTCAGACGTGT) [91, 92]. The primers designed from the initial two Illumina MiSeq runs were individually tested using gDNA from two specimens of C. trifasciata. PCR reactions were conducted in a final volume of 10 µL: 5 µL of QIAGEN Multiplex PCR Master Mix (Qiagen, CA, USA), 1 µL of each primer at 1 µM, 1 µL of gDNA diluted in a 1:3 proportion, and water. The cycling profile was applied as follows: 95 °C for 15 min; 30 cycles of 95 °C for 30 s, 55 °C for 1 min, and 72 °C for 1 min; and a final extension at 72 °C for 10 min. The PCR results were visualized after an electrophoretic run in 1.5% agarose gel at 80 V.
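The search thresholds listed above can be illustrated with a short Python sketch. This is not the SSR_search.py script itself: the function names, the regular-expression approach, and the toy read are assumptions made for illustration, and the manual checks described in the text (interrupted motifs, multiple motifs per read, long mononucleotide stretches) are not reproduced here.

```python
import re

# Minimum number of repeats per motif length, as stated in the text.
MIN_REPEATS = {2: 9, 3: 7, 4: 5, 5: 5}
MIN_FLANK = 30  # bp required on each side of the repeat


def is_multiple_of_shorter_unit(motif: str) -> bool:
    """True if the motif is itself a repeat of a shorter unit (e.g. AGAG, AAA)."""
    n = len(motif)
    return any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_ssrs(read: str) -> list:
    """Return (motif, repeat_count, start, end) for perfect repeats in a read."""
    read = read.upper()
    hits = []
    for motif_len, min_rep in MIN_REPEATS.items():
        # One copy of the motif followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(read):
            motif = match.group(1)
            if is_multiple_of_shorter_unit(motif):
                continue  # already covered by the shorter motif class
            left_flank = match.start()
            right_flank = len(read) - match.end()
            if left_flank >= MIN_FLANK and right_flank >= MIN_FLANK:
                repeats = (match.end() - match.start()) // motif_len
                hits.append((motif, repeats, match.start(), match.end()))
    return hits


# Hypothetical toy read: 30 bp flank, (AG)10, 30 bp flank.
left = "GATTACAGGC" * 3
right = "TCCGAATGCA" * 3
print(find_ssrs(left + "AG" * 10 + right))
```

A regular expression keeps the sketch short, but it only finds perfect repeats; a production search would also need to handle reverse-complement motif equivalence and overlapping candidates.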
Primers that successfully generated amplicons of the expected size were combined in three mixes of eight primer pairs and one mix of seven primer pairs, each with a final concentration of 1 µM (Additional file 1: Table S1).
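As a small illustration of the primer constraints and adapter tails described above, the sketch below filters a candidate primer by length and GC content and then adds the partial P5 and P7 motifs quoted in the text. The helper names and the example primer sequences are hypothetical, and placing the tail at the 5' end of each primer is an assumption of this sketch. Melting temperature and amplicon size, which Primer3 handles, are not modelled.

```python
# Partial Illumina adapter motifs as quoted in the text.
P5_TAIL = "TCTTTCCCTACACGACGCTCTTCCGATCT"
P7_TAIL = "CTGGAGTTCAGACGTGTGCTCTTCCGATCT"


def gc_content(seq: str) -> float:
    """GC content of a sequence, in percent."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def passes_design_filters(primer: str) -> bool:
    """Length 19-22 bp and GC content between 20 and 80%, as in the text."""
    return 19 <= len(primer) <= 22 and 20.0 <= gc_content(primer) <= 80.0


def add_adapter_tails(forward: str, reverse: str) -> tuple:
    """Prepend the P5 tail to the forward and the P7 tail to the reverse primer.

    Assumes the tails are added at the 5' end of each locus-specific primer.
    """
    return P5_TAIL + forward, P7_TAIL + reverse


# Hypothetical locus-specific primers, for illustration only.
fwd, rev = "ACGTTGCATGCAAGCTTGCA", "TGCAAGCTTGCATGCAACGT"
if passes_design_filters(fwd) and passes_design_filters(rev):
    tailed_fwd, tailed_rev = add_adapter_tails(fwd, rev)
    print(tailed_fwd)
    print(tailed_rev)
```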
Multiplex PCR and Illumina sequencing
Multiplex amplification was achieved with PCR reactions of 5 µL, containing 2.5 µL of QIAGEN Multiplex PCR Master Mix (Qiagen, CA, USA), 0.5 µL of each primer mix, and 2 µL of gDNA diluted in a 1:3 proportion. Cycling conditions were the same as those used in the single PCR to test the primers individually. For each sample, and following Curto et al.’s (2019) protocol, equal volumes of PCR products from the different primer mixes were pooled into a final volume of 6 µL. PCR clean-up, aimed at removing unused primers and primer-dimer products, was conducted using the magnetic bead technology offered by the Agencourt AMPure XP PCR Purification kit (Beckman Coulter Inc., Bree, CA, USA), applying some modifications to the standard protocol. The total volume of pooled PCR product (6 µL) was mixed with 4.3 µL of AMPure XP beads, followed by a 5 min incubation period at room temperature. The beads, to which DNA bound after the first step, were captured by an inverted magnetic bead extraction device, VP407‐AM‐N (V&P Scientific, INC.), and afterwards washed twice for 45 s in 200 µL of 80% ethanol. The beads were then dried at room temperature for 5 min and the DNA finally eluted in 17 µL of elution buffer (10 mM Tris–HCl, pH 8.3) at 65 °C.
Once purified, the multiplex PCR product underwent a second PCR, hereafter designated index-PCR, for the assignment of indexes to each sample. A unique combination of forward and reverse indexes was carefully chosen, so that each sample can be unambiguously identified after the MiSeq run. A 10 µL PCR reaction was performed, comprising 5 µL of QIAGEN Multiplex PCR Master Mix (Qiagen, CA, USA), 1 µL of each index primer at 1 µM, and 1 µL of pooled purified PCR product. The cycling conditions were applied as follows: 95 °C for 15 min; 10 cycles of 95 °C for 30 s, 58 °C for 60 s, and 72 °C for 60 s; 72 °C for 5 min. The index-PCR product, from 5’ to 3’, comprises: P5 motif for flow cell hybridization, index 1 with a length of 8 bp, P5 sequencing primer, specific forward primer, target DNA for sequencing, specific reverse primer, P7 sequencing primer, 8 bp long index 2, P7 motif for flow cell hybridization. After visualizing the index-PCR products in 1.5% agarose gel stained with GelRed (Biotium), all the samples were pooled in equal volumes of 2 µL. The Illumina MiSeq run for PE 300 bp sequencing at the Genomics Service Unit at Ludwig-Maximilians-Universität München, Germany, using the pooled index-PCR product as input, produced sequences to be analysed with a genotyping by amplicon sequencing (GBAS) protocol.
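The requirement that every sample receives a unique forward/reverse index combination can be sketched in a few lines of Python. The index-primer templates below are those quoted in the "SSR marker discovery" section; the function names, sample labels, and 8 bp index sequences are hypothetical placeholders, not the indexes used in the study.

```python
from itertools import product

# Index-primer templates as quoted in the text; the index sits between the parts.
P5_PREFIX, P5_SUFFIX = "AATGATACGGCGACCACCGAGATCTACAC", "ACACTCTTTCCCTACACGACG"
P7_PREFIX, P7_SUFFIX = "CAAGCAGAAGACGGCATACGAGAT", "TGACTGGAGTTCAGACGTGT"


def assign_index_pairs(samples: list, fwd_indexes: list, rev_indexes: list) -> dict:
    """Give each sample a distinct (forward index, reverse index) combination."""
    combos = list(product(fwd_indexes, rev_indexes))
    if len(samples) > len(combos):
        raise ValueError("not enough index combinations for all samples")
    return dict(zip(samples, combos))


def build_index_primers(fwd_index: str, rev_index: str) -> tuple:
    """Assemble the full P5 and P7 index primers around the two indexes."""
    return (P5_PREFIX + fwd_index + P5_SUFFIX,
            P7_PREFIX + rev_index + P7_SUFFIX)


# Hypothetical 8 bp indexes and sample labels, for illustration only.
fwd_idx = ["ATCACGTT", "CGATGTTT", "TTAGGCAT"]
rev_idx = ["ACAGTGGT", "GCCAATGT", "CAGATCTG"]
samples = ["sample_%02d" % i for i in range(1, 10)]
layout = assign_index_pairs(samples, fwd_idx, rev_idx)
for name, (i5, i7) in layout.items():
    print(name, i5, i7)
```

With n forward and m reverse indexes, n × m samples can be multiplexed, which is why combinatorial dual indexing needs far fewer primers than one unique index per sample.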
SSR data analysis and SSR-GBAS genotyping
Raw FASTQ sequence data (reads R1 and R2), automatically extracted by the MiSeq equipment based on the index combinations and each corresponding to a different sample, were downloaded from Illumina BaseSpace. These Illumina reads, containing all sequences per index, underwent quality control and merging with FastQC, Trimmomatic v0.39, and Usearch v11 (cf. “SSR marker discovery” section). Genotyping and analysis of each sample for all the SSR loci were performed with the SSR_GBS_pipeline scripts available at GitHub [92, 143]. Script 1 (primer_demultiplex.py) demultiplexed the merged fastq files by identifying the primers on both sides of the merged reads and sorting them by locus. Script 2 (CountLengths.sh) calculated the number of occurrences of each sequence length, excluding sequences below the 250 bp threshold. With Script 3, potential alleles and histograms were plotted according to sequence length, setting a minimum of 10 reads to define a valid genotype. The stutter effect in the defined alleles is also assessed by Script 3, as described previously [91, 92]. A codominant matrix in .csv format, based on the allele length information of each sample, is generated and used as input for the downstream recovery of sequence alleles, which considers SNP variation within alleles of the same size along with the sequence length variation determined at this point. Sequence_Allele_Call.py (Script 4) constitutes the second part of the SSR_GBS_pipeline.py, for the extraction of the corresponding reads and definition of a consensus sequence for each length-based allele. It also allows the detection of possible SNP variation and calls alleles based on sequence information. This dataset is hereafter referred to as WAI.
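The length-based step of the genotyping can be pictured with the following Python sketch. It is not the SSR_GBS_pipeline code: the 250 bp length cut-off and the 10-read minimum come from the text, whereas the function name, the interpretation of the 10-read minimum as the total of retained reads, and the heterozygote rule (second most frequent length at 50% or more of the most frequent one) are assumptions made only for illustration. Stutter correction is not modelled.

```python
from collections import Counter

MIN_LENGTH = 250   # reads shorter than this are discarded (from the text)
MIN_READS = 10     # minimum reads to accept a genotype (from the text)
HET_RATIO = 0.5    # assumed threshold for calling a second allele


def call_length_genotype(reads: list):
    """Call a diploid genotype from read lengths at one locus of one sample.

    Returns a sorted tuple of two allele lengths, or None when too few
    reads remain after the length filter.
    """
    lengths = Counter(len(read) for read in reads if len(read) >= MIN_LENGTH)
    if sum(lengths.values()) < MIN_READS:
        return None
    ranked = lengths.most_common(2)
    top_length, top_count = ranked[0]
    if len(ranked) == 2 and ranked[1][1] >= HET_RATIO * top_count:
        return tuple(sorted((top_length, ranked[1][0])))
    return (top_length, top_length)


# Hypothetical reads: two length classes plus short off-target products.
reads = ["A" * 300] * 12 + ["A" * 304] * 9 + ["A" * 200] * 50
print(call_length_genotype(reads))
```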
Population structure analyses
Descriptive population genetics and marker variability analyses, as well as evaluation of genetic structure patterns, were performed on the codominant WAI dataset of Cingula trifasciata. The two populations of Graciosa island were analysed together, as no major differences were found between them. Genetic diversity indices, null alleles, and deviations from Hardy–Weinberg Equilibrium (HWE) in the WAI dataset were estimated only for populations with five or more individuals analysed: Santa Maria, Caloura, Graciosa, Pico, and São Jorge. FreeNA was used to estimate the frequency of null alleles per marker and population, using the allele length (AL) dataset, as required by the software. GenAlEx v6.5 [87, 88] was used to check the number of alleles per marker and allelic patterns per population: number of alleles (Na), number of private (Npa) and effective (Ne) alleles, and observed (Ho) and expected (He) heterozygosity. Although excluded from the inference of genetic diversity, monomorphic markers were kept in the dataset for downstream analyses, as they are informative. Tests for deviations from HWE and Wright’s Fixation index (FIS) were performed by marker and population in GenAlEx. The remaining analyses based on the WAI dataset included information from all individuals studied, belonging to the seven populations sampled. The software Cervus v3.0.7 was used to estimate the overall polymorphism information content (PIC) of each marker. Genetic structure patterns in the dataset were inferred with a Principal Coordinate Analysis (PCoA), as implemented in GenAlEx v6.5 [87, 88], and with STRUCTURE v2.3.4 [89, 90] using the complete dataset, making it possible to assess the informativeness of the developed SSR markers. PCoA, performed both with the complete dataset and excluding Vigo to clarify the position of the Azorean populations, evaluates genetic structure among populations without assumptions of HWE and based on absolute genetic distances between individuals.
With the number of clusters (K) varying between 2 and 10, STRUCTURE was run with 10 independent replicates of 500,000 generations each, following a burn-in period of 100,000 and maintaining the default settings for the admixture model and correlated allele frequencies. The online program Structure Harvester was used to validate multiple K-values for optimal detection of genetic structure, according to the Delta-K method and inferring the K that best suits the data from hundreds of iterations. The results from STRUCTURE across the K-values were summarized and graphically displayed with the online pipeline CLUMPAK. A hierarchical analysis of molecular variance (AMOVA), as implemented in GenAlEx v6.5 [87, 88], was performed to evaluate the differentiation between populations and regions, with a simultaneous estimate of pairwise population FST values.
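For readers less familiar with the allelic indices named above, the sketch below computes Na, Ne, Ho, and He for a single locus using the standard textbook definitions (He = 1 − Σp², Ne = 1 / Σp², Ho = proportion of heterozygous individuals). It is a didactic Python illustration with hypothetical genotypes, not a re-implementation of GenAlEx, and it omits the unbiased small-sample correction for He.

```python
from collections import Counter


def locus_diversity(genotypes: list) -> dict:
    """Na, Ne, Ho and He for one locus in one population.

    `genotypes` is a list of (allele_1, allele_2) tuples; missing
    genotypes are given as None and ignored.
    """
    scored = [g for g in genotypes if g is not None]
    n = len(scored)
    if n == 0:
        raise ValueError("no scored genotypes at this locus")
    allele_counts = Counter(allele for g in scored for allele in g)
    freqs = [count / (2 * n) for count in allele_counts.values()]
    homozygosity = sum(p * p for p in freqs)  # sum of squared frequencies
    return {
        "Na": len(allele_counts),
        "Ne": 1.0 / homozygosity,
        "Ho": sum(a != b for a, b in scored) / n,
        "He": 1.0 - homozygosity,
    }


# Hypothetical length-based genotypes at one locus, for illustration only.
example = [(300, 300), (300, 304), (304, 304), (300, 304), None]
print(locus_diversity(example))
```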
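The Delta-K criterion mentioned above can be sketched as follows. This Python example uses a simplified form, ΔK = |mean L(K+1) − 2·mean L(K) + mean L(K−1)| / sd L(K), computed from the per-replicate log-likelihoods at each K. Taking the second difference on replicate means and using the sample standard deviation are assumptions of this sketch, the function name is hypothetical, and the log-likelihood values are invented for illustration, not results from this study.

```python
from statistics import mean, stdev


def delta_k(lnp_by_k: dict) -> dict:
    """Delta-K for every K that has both neighbours K-1 and K+1.

    `lnp_by_k` maps K to the list of log-likelihoods from the
    replicate runs at that K (at least two replicates per K).
    """
    means = {k: mean(values) for k, values in lnp_by_k.items()}
    result = {}
    for k in sorted(lnp_by_k):
        if k - 1 not in means or k + 1 not in means:
            continue  # undefined at the smallest and largest K
        second_difference = abs(means[k + 1] - 2 * means[k] + means[k - 1])
        spread = stdev(lnp_by_k[k])
        if spread > 0:
            result[k] = second_difference / spread
    return result


# Invented log-likelihoods for two replicates per K, for illustration only.
toy_runs = {
    2: [-1000.0, -1002.0],
    3: [-900.0, -902.0],
    4: [-890.0, -892.0],
    5: [-885.0, -887.0],
}
scores = delta_k(toy_runs)
best_k = max(scores, key=scores.get)
print(scores, best_k)
```

Because the statistic needs both neighbours, it cannot be evaluated at the lowest or highest K tested, which is one reason to run a wider range of K than the number of clusters expected.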
Availability of data and materials
All the mitochondrial COI sequences of Cingula trifasciata generated in this study have been deposited in the GenBank database, under the accession numbers MW518062-117 and MW518858-63. Raw reads from the low-coverage whole-genome sequencing library used for marker development and the WAI dataset can be accessed through the BioProject PRJNA702169. The WAI allele sequences were submitted to the GenBank database, under the accession numbers MW623085-363. Public access to the databases is open.
Abbreviations
AMOVA: Hierarchical analysis of molecular variance
COI: Cytochrome oxidase subunit I
DBUA: Department of Biology of the University of the Azores
GBAS: Genotyping by Amplicon Sequencing
PCoA: Principal Coordinates Analysis
PCR: Polymerase Chain Reaction
SNP: Single Nucleotide Polymorphism
SSR: Simple Sequence Repeats/microsatellites
SSR-GBAS: Simple Sequence Repeats - Genotyping by Amplicon Sequencing
WAI: Whole Amplicon Sequencing
Robinson LM, Elith J, Hobday AJ, Pearson RG, Kendall BE, Possingham HP, Richardson AJ. Pushing the limits in marine species distribution modelling: lessons from the land present challenges and opportunities. Glob Ecol Biogeogr. 2011;20:789–802. https://doi.org/10.1111/j.1466-8238.2010.00636.x.
Ávila SP. Unravelling the patterns and processes of evolution of marine life in oceanic islands: a global framework. In: Fernández-Palacios JM, de Nascimento L, Hérnandez JC, Clemente S, González A, Díaz-González JP, editors. Climate Change Perspectives From the Atlantic: Past, Present and Future. Tenerife: Universidad de La Laguna; 2013. p. 95–125.
Thorson G. Reproduction and larval development of Danish marine bottom invertebrates. Meddelelser fra Kommissionen for Danmarks Fiskeri- og Havundersøgelser, Serie Plankton. 1946;4:1–523.
Thorson G. Reproduction and larval ecology of marine bottom invertebrates. Biol Rev Camb Philos Soc. 1950;25:1–45.
Winston JE. Dispersal in marine organisms without a pelagic larval phase. Integr Compar Biol. 2012;52:447–57. https://doi.org/10.1093/icb/ics040.
Scheltema RS. On dispersal and planktonic larvae of benthic invertebrates: an eclectic overview and summary of problems. Bull Mar Sci. 1986;39:290–322.
Pechenik JA. The relationship between temperature, growth rate, and duration of planktonic life in larvae of the gastropod Crepidula fornicata. J Exp Mar Biol Ecol. 1984;74:241–57. https://doi.org/10.1016/0022-0981(84)90128-X.
Johannesson K. The paradox of Rockall: why is a brooding gastropod (Littorina saxatilis) more widespread than one having a planktonic larval dispersal stage (L. littorea)? Mar Biol. 1988;99:507–13. https://doi.org/10.1007/BF00392558.
Oliverio M. Larval development and allozyme variation in the East Atlantic Columbella (Gastropoda: Prosobranchia: Columbellidae). Sci Mar. 1995;59:77–86.
Ávila SP, Melo PJ, Lima A, Amaral A, Martins AMF, Rodrigues A. Reproductive cycle of the rissoid Alvania mediolittoralis Gofas, 1989 (Mollusca, Gastropoda) at São Miguel island (Azores, Portugal). Invertebr Reprod Dev. 2008;52:31–40.
Crothers JH. Common topshells: An introduction to the biology of Osilinus lineatus with notes on other species in the genus. F Stud. 2001;10:115–60.
Modica MV, Russini V, Fassio G, Oliverio M. Do larval types affect genetic connectivity at sea? Testing hypothesis in two sibling marine gastropods with contrasting larval development. Mar Environ Res. 2017;127:92–101. https://doi.org/10.1016/j.marenvres.2017.04.001.
Ponder WF. A review of the Genera of the Rissoidae (Mollusca: Mesogastropoda: Rissoacea). Rec Aust Museum. 1984;4:1–221. https://doi.org/10.3853/j.0812-7387.4.1985.100.
Gofas S. The littoral Rissoidae and Anabathridae of São Miguel, Azores. In: Martins AMF, editor. The Marine Fauna and Flora of the Azores. (Proceedings of the First International Workshop of Malacology, Vila Franca Do Campo, São Miguel, Azores). Açoreana, Supplement 2; 1990. p. 97–134.
Gofas S. Rissoidae (Mollusca: Gastropoda) from northeast Atlantic seamounts. J Nat Hist. 2007;41:779–885.
Davis GM, Wilke T, Spolsky C, Qiu CP, Qiu DC, Xia MY, et al. Cytochrome Oxidase I-based phylogenetic relationships among the Pomatiopsidae, Hydrobiidae, Rissoidae and Truncatellidae (Gastropoda: Caenogastropoda: Rissoacea). Malacologia. 1998;40:251–66.
Ávila SP. The shallow-water Rissoidae (Mollusca, Gastropoda) of the Azores and some aspects of their ecology. Iberus. 2000;18:51–76.
Costa AC, Ávila SP. Macrobenthic mollusc fauna inhabiting Halopteris spp. subtidal fronds in São Miguel island, Azores. Sci Mar. 2001;65:117–26.
Ávila SP, Goud J, Martins AMF. Patterns of Diversity of the Rissoidae (Mollusca: Gastropoda) in the Atlantic and the Mediterranean Region. Sci World J. 2012. https://doi.org/10.1100/2012/164890.
Ávila SP, Melo C, Silva L, Ramalho RS, Quartau R, Hipólito A, et al. A review of the MIS 5e highstand deposits from Santa Maria Island (Azores, NE Atlantic): Palaeobiodiversity, palaeoecology and palaeobiogeography. Quat Sci Rev. 2015;114:126–48. https://doi.org/10.1016/j.quascirev.2015.02.012.
Criscione F, Ponder WF. A phylogenetic analysis of rissooidean and cingulopsoidean families (Gastropoda: Caenogastropoda). Mol Phylogenet Evol. 2013;66:1075–82. https://doi.org/10.1016/j.ympev.2012.11.026.
Cordeiro R, Ávila SP. New species of Rissoidae (Mollusca, Gastropoda) from the Archipelago of the Azores (northeast Atlantic) with an updated regional checklist for the family. ZooKeys. 2015;480:1–19. https://doi.org/10.3897/zookeys.480.8599.
Criscione F, Ponder WF, Köhler F, Takano T, Kano Y. A molecular phylogeny of Rissoidae (Caenogastropoda: Rissooidea) allows testing the diagnostic utility of morphological traits. Zool J Linn Soc. 2016;179:23–40. https://doi.org/10.1111/zoj.12447.
Baptista L, Santos AM, Cabezas MP, Cordeiro R, Melo C, Ávila SP. Intertidal or subtidal/circalittoral species: which appeared first? A phylogenetic approach to the evolution of non-planktotrophic species in Atlantic Archipelagos. Mar Biol. 2019;166:1–16. https://doi.org/10.1007/s00227-019-3536-y.
Duff M le, Hily C. La zone intertidale du site Natura 2000 de Guisseny - Inventaire des habitats marins. Brest; 2001.
Davidson IC. Structural gradients in an intertidal hard-bottom community: examining vertical, horizontal, and taxonomic clines in zoobenthic biodiversity. Mar Biol. 2005;146:827–39. https://doi.org/10.1007/s00227-004-1478-4.
Cordeiro R, Borges JP, Martins AMF, Ávila SP. Checklist of the littoral gastropods (Mollusca: Gastropoda) from the Archipelago of the Azores (NE Atlantic). Biodivers J. 2015;6:855–900.
Borges LMSS, Hollatz C, Lobo J, Cunha AM, Vilela AP, Calado GG, et al. With a little help from DNA barcoding: Investigating the diversity of Gastropoda from the Portuguese coast. Sci Rep. 2016;6:20226. https://doi.org/10.1038/srep20226.
Miralles L, Ardura A, Arias A, Borrell YJ, Clusa L, Dopico E, et al. Barcodes of marine invertebrates from north Iberian ports: Native diversity and resistance to biological invasions. Mar Pollut Bull. 2016;112:183–8. https://doi.org/10.1016/j.marpolbul.2016.08.022.
Ávila SP. Zonação intertidal de uma comunidade malacológica na “Poça da Barra”, uma lagoa localizada na plataforma costeira da Vila das Lajes do Pico, Açores. Açoreana. 1998;8:457–85.
Buršić M, Iveša L, Jaklin A, Arko PM. A preliminary study on the diversity of invertebrates associated with Corallina officinalis Linnaeus in southern Istrian peninsula. Acta Adriat. 2019;60:127–35. https://doi.org/10.32582/aa.60.2.2.
Scheltema RS. Planktonic and non-planktonic development among prosobranch gastropods and its relationship to the geographic range of species. In: Ryland JS, Tyler PA, editors. Reproduction, Genetics and Distribution of Marine Organisms. Fredensborg: Olsen and Olsen; 1989. p. 183–8.
Scheltema RS. The relevance of passive dispersal for the biogeography of Caribbean mollusks. Am Malacol Bull. 1995;11:99–115.
Ponder WF. A gravel beach shelled micro-gastropod assemblage from Ceuta, Strait of Gibraltar, with the description of a new truncatelloidean genus. Bull du Muséum Natl d’histoire Nat Sect A, Zool Biol Écol Anim. 1990;12:291–311.
Tringali LP. Marine malacological records (Gastropoda: Prosobranchia, Heterobranchia, Opisthobranchia and Pulmonata) from Torres de Alcalé, Mediterranean Morocco, with the description of a new philinid species. Bollettino Malacologico. 2001;37:207–22.
Ó Foighil D. Planktotrophic larval development is associated with a restricted geographic range in Lasaea, a genus of brooding, hermaphrodite bivalves. Mar Biol. 1989;103:349–58. https://doi.org/10.1007/BF00397269.
Collin R. Phylogenetic relationships among calyptraeid gastropods and their implications for the biogeography of marine speciation. Syst Biol. 2003;52:618–40.
Donald KM, Kennedy M, Spencer HG. Cladogenesis as the result of long-distance rafting events in South Pacific topshells (Gastropoda, Trochidae). Evolution. 2005;59:1701–11. https://doi.org/10.1111/j.0014-3820.2005.tb01819.x.
Käse RH, Krauss W. The Gulf Stream, the North Atlantic Current, and the origin of the Azores Current. In: Krauss W, editor. The warmwatersphere of the North Atlantic Ocean. Berlin, Stuttgart: Gebrüder Borntraeger; 1996. p. 291–337.
Carracedo LI, Gilcoto M, Mercier H, Pérez FF. Seasonal dynamics in the Azores–Gibraltar Strait region: A climatologically-based study. Prog Oceanogr. 2014;122:116–30. https://doi.org/10.1016/j.pocean.2013.12.005.
Klein B, Siedler G. On the origin of the Azores Current. J Geophys Res: Oceans. 1989;94:6159–68. https://doi.org/10.1029/JC094iC05p06159.
Fründt B, Waniek J. Impact of the Azores Front propagation on deep ocean particle flux. Cent Eur J Geosci. 2012;4:531–44. https://doi.org/10.2478/s13533-012-0102-2.
Onken R. The Azores Countercurrent. J Phys Oceanogr. 1993;23:1638–46. https://doi.org/10.1175/1520-0485.
Alves MLGR, de Verdière A. Instability Dynamics of a Subtropical Jet and Applications to the Azores Front Current System: Eddy-Driven Mean Flow. J Phys Oceanogr. 1999;29:837–64. https://doi.org/10.1175/1520-0485.
Comas-Rodriguez I, Hernandez-Guerra A, Fraile-Nuez E, Benitez-Barrios VM, Perez-Hernandez MD, et al. The Azores Current System from a meridional section at 24.5°W. J Geophys Res. 2011;116:C09021. https://doi.org/10.1029/2011JC007129.
Portuguese Hydrographic Institute. 2014. https://www.hidrografico.pt/op/33.
Baptista L, Santos AM, Melo CS, Rebelo AC, Madeira P, Cordeiro R, et al. Untangling the origin of the newcomer Phorcus sauciatus (Mollusca: Gastropoda) in a remote Atlantic archipelago. Mar Biol. 2021;168:9. https://doi.org/10.1007/s00227-020-03808-5.
Ladoukakis ED, Zouros E. Evolution and inheritance of animal mitochondrial DNA: rules and exceptions. Journal of Biological Research-Thessaloniki. 2017;24:1–7. https://doi.org/10.1186/s40709-017-0060-4.
Avise JC. Molecular markers, natural history and evolution. 2nd ed. Sunderland, Massachusetts: Sinauer Associates Inc.; 2004.
Wilke T, Davis GM. Infraspecific mitochondrial sequence diversity in Hydrobia ulvae and Hydrobia ventrosa (Hydrobiidae: Rissooidea: Gastropoda): Do their different life histories affect biogeographic patterns and gene flow? Biol J Linn Soc. 2000;70:89–105. https://doi.org/10.1006/bijl.1999.0388.
Wilke T, Falniowski A. The genus Adriohydrobia (Hydrobiidae: Gastropoda): polytypic species or polymorphic populations? J Zool Syst Evol Res. 2001;39:227–34. https://doi.org/10.1046/j.1439-0469.2001.00171.x.
Ellegren H. Microsatellites: simple sequences with complex evolution. Nat Rev Genet. 2004;5:435–45. https://doi.org/10.1038/nrg1348.
Bhargava A, Fuentes F. Mutational Dynamics of Microsatellites. Mol Biotechnol. 2010;44:250–66. https://doi.org/10.1007/s12033-009-9230-4.
Putman AI, Carbone I. Challenges in analysis and interpretation of microsatellite data for population genetic studies. Ecol Evol. 2014;4:4399–428. https://doi.org/10.1002/ece3.1305.
Farrell ED, Carlsson JEL, Carlsson J. Next gen pop gen: Implementing a high-throughput approach to population genetics in boarfish (Capros aper). R Soc Open Sci. 2016;3:160651. https://doi.org/10.1098/rsos.160651.
Kawai K, Hughes RN, Takenaka O. Isolation and characterization of microsatellite loci in the marine gastropod Nucella lapillus. Mol Ecol Notes. 2001;1:270–2. https://doi.org/10.1046/j.1471-8278.2001.00103.x.
Dupont L, Viard F. Isolation and characterization of highly polymorphic microsatellite markers from the marine invasive species Crepidula fornicata (Gastropoda: Calyptraeidae). Mol Ecol Notes. 2003;3:498–500. https://doi.org/10.1046/j.1471-8286.2003.00491.x.
McInerney CE, Allcock AL, Johnson MP, Prodöhl PA. Characterization of polymorphic microsatellites for the periwinkle gastropod, Littorina littorea (Linnaeus, 1758) and their cross-amplification in four congeners. Conserv Genet. 2009;10:1417–20. https://doi.org/10.1007/s10592-008-9750-7.
Weetman D, Hauser L, Shaw PW, Bayes MK. Microsatellite markers for the whelk Buccinum undatum. Mol Ecol Notes. 2005;5:361–2. https://doi.org/10.1111/j.1471-8286.2005.00926.x.
Cárdenas L, Daguin C, Castilla JC, Viard F. Isolation and characterization of 11 polymorphic microsatellite markers for the marine gastropod Concholepas concholepas (Brugière, 1789). Mol Ecol Notes. 2007;7:464–6. https://doi.org/10.1111/j.1471-8286.2006.01619.x.
Beldade R, Bell CA, Raimondi PT, George MK, Miner CM, Bernardi G. Isolation and characterization of 8 novel microsatellites for the black abalone, Haliotis cracherodii, a marine gastropod decimated by the withering disease. Conserv Genet Resour. 2012;4:1071–3. https://doi.org/10.1007/s12686-012-9709-3.
Vega-Retter C, Briones M, Véliz D. Characterization of sixteen microsatellite loci from the marine gastropod Monetaria caputdraconis (Gastropoda: Cypraeidae) by next generation sequencing. Rev Biol Mar Oceanogr. 2016;51:695–8. https://doi.org/10.4067/S0718-19572016000300021.
López-Márquez V, García-Jiménez R, Calvo M, Templado J, Machordom A. Isolation of microsatellite loci for the endangered vermetid gastropod Dendropoma lebeche using Illumina MiSeq next generation sequencing technology. Mol Biol Rep. 2018;45:2775–81. https://doi.org/10.1007/s11033-018-4346-x.
Brante A, Fernández M, Viard F. Microsatellite evidence for sperm storage and multiple paternity in the marine gastropod Crepidula coquimbensis. J Exp Mar Biol Ecol. 2011;396:83–8. https://doi.org/10.1016/j.jembe.2010.10.001.
le Cam S, Riquet F, Pechenik JA. Paternity and gregariousness in the sex-changing sessile marine gastropod Crepidula convexa: comparison with other protandrous Crepidula species. J Hered. 2014;105:397–406.
Xue D, Zhang T, Liu J-X. Microsatellite evidence for high frequency of multiple paternity in the marine gastropod Rapana venosa. PLoS ONE. 2014;9:e86508. https://doi.org/10.1371/journal.pone.0086508.
Kemppainen P, Panova M, Hollander J, Johannesson K. Complete lack of mitochondrial divergence between two species of NE Atlantic marine intertidal gastropods. J Evol Biol. 2009;22:2000–11. https://doi.org/10.1111/j.1420-9101.2009.01810.x.
Teske PR, Sandoval-Castillo J, Waters J, Beheregaray LB. An overview of Australia’s temperate marine phylogeography, with new evidence from high-dispersal gastropods. J Biogeogr. 2017;44:217–29. https://doi.org/10.1111/jbi.12783.
Dupont L, Bernas D, Viard F. Sex and genetic structure across age groups in populations of the European marine invasive mollusc, Crepidula fornicata L. (Gastropoda). Biol J Linn Soc. 2007;90:365–74. https://doi.org/10.1111/j.1095-8312.2007.00731.x.
Ribeiro PA, Branco M, Hawkins SJ, Santos AM. Recent changes in the distribution of a marine gastropod, Patella rustica, across the Iberian Atlantic coast did not result in diminished genetic diversity or increased connectivity. J Biogeogr. 2010;37:1782–96. https://doi.org/10.1111/j.1365-2699.2010.02330.x.
Pálsson S, Magnúsdóttir H, Reynisdóttir S, Jónsson ZO, Örnólfsdóttir EB. Divergence and molecular variation in common whelk Buccinum undatum (Gastropoda: Buccinidae) in Iceland: a trans-Atlantic comparison. Biol J Linn Soc. 2013;111:145–59. https://doi.org/10.1111/bij.12191.
Cahill AE, Viard F. Genetic structure in native and non-native populations of the direct-developing gastropod Crepidula convexa. Mar Biol. 2014;161:2433–43. https://doi.org/10.1007/s00227-014-2519-2.
Wort EJG, Chapman MA, Hawkins SJ, Henshall L, Pita A, Rius M, et al. Contrasting genetic structure of sympatric congeneric gastropods: Do differences in habitat preference, abundance and distribution matter? J Biogeogr. 2019;46:369–80. https://doi.org/10.1111/jbi.13502.
Bell JJ. Similarity in connectivity patterns for two gastropod species lacking pelagic larvae. Mar Ecol Prog Ser. 2008;357:185–94. https://doi.org/10.3354/MEPS07301.
Donald KM, Keeney DB, Spencer HG. Contrasting population makeup of two intertidal gastropod species that differ in dispersal opportunities. J Exp Mar Biol Ecol. 2011;396:224–32. https://doi.org/10.1016/j.jembe.2010.10.028.
Donald KM, McCulloch GA, Dutoit L, Spencer HG. Population structure of the New Zealand whelk, Cominella glandiformis (Gastropoda: Buccinidae), suggests sporadic dispersal of a direct developer. Biol J Linn Soc. 2020;130:49–60. https://doi.org/10.1093/biolinnean/blaa033.
Quintero-Galvis JF, Bruning P, Paleo-López R, Gomez D, Sánchez R, Cárdenas L. Temporal variation in the genetic diversity of a marine invertebrate with long larval phase, the muricid gastropod Concholepas concholepas. J Exp Mar Biol Ecol. 2020;530–531:151432. https://doi.org/10.1016/j.jembe.2020.151432.
Albano PG, Sabelli B, Bouchet P. The challenge of small and rare species in marine biodiversity surveys: Microgastropod diversity in a complex tropical coastal environment. Biodivers Conserv. 2011;20:3223–37. https://doi.org/10.1007/s10531-011-0117-x.
Zhang Z, Schwartz S, Wagner L, Miller W. A greedy algorithm for aligning DNA sequences. J Comput Biol. 2000;7:203–14. https://doi.org/10.1089/10665270050081478.
Clement M, Posada D, Crandall KA. TCS: A computer program to estimate gene genealogies. Mol Ecol. 2000;9:1657–9. https://doi.org/10.1046/j.1365-294x.2000.01020.x.
Santos AM, Cabezas MP, Tavares AI, Xavier R, Branco M. tcsBU: a tool to extend TCS network layout and visualization. Bioinformatics. 2016. https://doi.org/10.1093/bioinformatics/btv636.
Sneath P, Sokal RR. Unweighted Pair Group Method with Arithmetic Mean. In: Numerical Taxonomy. San Francisco: Freeman; 1973. p. 230–234.
Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–9. https://doi.org/10.1093/bioinformatics/bts199.
Stamatakis A. RAxML Version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014. https://doi.org/10.1093/bioinformatics/btu033.
Peakall R, Smouse PE. GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes. 2006;6:288–95. https://doi.org/10.1111/j.1471-8286.2005.01155.x.
Peakall R, Smouse PE. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research-an update. Bioinformatics. 2012;28:2537–9. https://doi.org/10.1093/bioinformatics/bts460.
Tibihika PD, Curto M, Dornstauder-Schrammel E, Winter S, Alemayehu E, Waidbacher H, et al. Application of microsatellite genotyping by sequencing (SSR-GBS) to measure genetic diversity of the East African Oreochromis niloticus. Conserv Genet. 2019;20:357–72. https://doi.org/10.1007/s10592-018-1136-x.
Curto M, Winter S, Seiter A, Schmid L, Scheicher K, Barthel LMF, et al. Application of a SSR-GBS marker system on investigation of European Hedgehog species and their hybrid zone dynamics. Ecol Evol. 2019;9:2814–32. https://doi.org/10.1002/ece3.4960.
Lanner J, Gstöttenmayer F, Curto M, Geslin B, Huchler K, Orr MC, et al. Evidence for multiple introductions of an invasive wild bee species currently under rapid range expansion in Europe. BMC Ecol Evol. 2021;21:71. https://doi.org/10.1186/s12862-020-01729-x.
Guichoux E, Lagache L, Wagner S, Chaumeil P, Léger P, Lepais O, et al. Current trends in microsatellite genotyping. Mol Ecol Resour. 2011;11:591–611. https://doi.org/10.1111/j.1755-0998.2011.03014.x.
Castoe TA, Poole AW, de Koning APJ, Jones KL, Tomback DF, Oyler-McCance SJ, et al. Rapid microsatellite identification from Illumina paired-end genomic sequencing in two birds and a snake. PLoS ONE. 2012;7:e30953. https://doi.org/10.1371/journal.pone.0030953.
de Barba M, Miquel C, Lobréaux S, Quenette PY, Swenson JE, Taberlet P. High-throughput microsatellite genotyping in ecology: improved accuracy, efficiency, standardization and success with low-quantity and degraded DNA. Mol Ecol Resour. 2016;17:492–507. https://doi.org/10.1111/1755-0998.12594.
Darby BJ, Erickson SF, Hervey SD, Ellis-Felege SN. Digital fragment analysis of short tandem repeats by high-throughput amplicon sequencing. Ecol Evol. 2016;6:4502–12. https://doi.org/10.1002/ece3.2221.
Vartia S, Villanueva-Cañas JL, Finarelli J, Farrell ED, Collins PC, Hughes GM, et al. A novel method of microsatellite genotyping-by-sequencing using individual combinatorial barcoding. R Soc Open Sci. 2016;3:150565. https://doi.org/10.1098/rsos.150565.
Botstein D, White R, Skolnick M. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet. 1980;32:314–31.
Litt M, Hauge X, Sharma V. Shadow bands seen when typing polymorphic dinucleotide repeats — Some causes and cures. Biotechniques. 1993;15:280–4.
Ginot F, Bordelais I, Nguyen S, Gyapay G. Correction of some genotyping errors in automated fluorescent microsatellite analysis by enzymatic removal of one base overhangs. Nucleic Acids Res. 1996;24:540–1. https://doi.org/10.1093/nar/24.3.540.
Addison JA, Hart M. Spawning, copulation, and inbreeding coefficients in marine invertebrates. Biol Lett. 2005;1:450–3. https://doi.org/10.1098/rsbl.2005.0353.
Zouros E, Foltz DW. Possible explanations of heterozygote deficiency in bivalve molluscs. Malacologia. 1984;25:583–91.
Papetti C, Schiavona L, Milan M, Lucassen M, Cavaco JA, Paterno M, et al. Genetic variability of the striped venus Chamelea gallina in the northern Adriatic Sea. Fish Res. 2018;201:68–78. https://doi.org/10.1016/j.fishres.2018.01.006.
Turini FG, Steinert C, Heubl G, Bringmann G, Lombe BK, Mudogo V, et al. Microsatellites facilitate species delimitation in Congolese Ancistrocladus (Ancistrocladaceae), a genus with pharmacologically potent naphthylisoquinoline alkaloids. Taxon. 2014;63:329–41. https://doi.org/10.12705/632.36.
Weersing K, Toonen RJ. Population genetics, larval dispersal, and connectivity in marine systems. Mar Ecol Prog Ser. 2009;393:1–12. https://doi.org/10.3354/meps08287.
Sokolov EP. An improved method for DNA isolation from mucopolysaccharide-rich molluscan tissues. J Molluscan Stud. 2000;66:573–5. https://doi.org/10.1093/mollus/66.4.573.
Sunnucks P. Efficient genetic markers for population biology. Trends Ecol Evol. 2000;15:199–203. https://doi.org/10.1016/s0169-5347(00)01825-5.
Teske PR, Golla TR, Sandoval-Castillo J, Emami-Khoyi A, van der Lingen CD, von der Heyden S, et al. Mitochondrial DNA is unsuitable to test for isolation by distance. Sci Rep. 2018;8:1–9. https://doi.org/10.1038/s41598-018-25138-9.
Hart MW, Sunday J. Things fall apart: Biological species form unconnected parsimony networks. Biol Lett. 2007;3:509–12. https://doi.org/10.1098/rsbl.2007.0307.
Palumbi SR. Genetic divergence, reproductive isolation, and marine speciation. Annu Rev Ecol Syst. 1994;25:547–72. https://doi.org/10.1146/annurev.es.25.110194.002555.
Waters JM, Craw D. Cyclone-driven marine rafting: storms drive rapid dispersal of buoyant kelp rafts. Mar Ecol Prog Ser. 2018;602:77–85. https://doi.org/10.3354/meps12695.
Lee HJ, Boulding EG. Spatial and temporal population genetic structure of four northeastern Pacific littorinid gastropods: the effect of mode of larval development on variation at one mitochondrial and two nuclear DNA markers. Mol Ecol. 2009;18:2165–84. https://doi.org/10.1111/j.1365-294X.2009.04169.x.
Kojima S, Hayashi I, Kim D, Iijima A, Furota T. Phylogeography of an intertidal direct-developing gastropod Batillaria cumingi around the Japanese Islands. Mar Ecol Prog Ser. 2004;276:161–72. https://doi.org/10.3354/meps276161.
Sá-Pinto A, Branco M, Sayanda D, Alexandrino P. Patterns of colonization, evolution and gene flow in species of the genus Patella in the Macaronesian Islands. Mol Ecol. 2007;17:519–32. https://doi.org/10.1111/j.1365-294X.2007.03563.x.
Aurelle D, Guillemaud T, Afonso P, Morato T, Wirtz P, Santos RS, et al. Genetic study of Coris julis (Osteichtyes, Perciformes, Labridae) evolutionary history and dispersal abilities. Comptes Rendus Biol. 2003;326:771–85. https://doi.org/10.1016/j.crvi.2003.08.001.
Domingues VS, Santos RS, Brito A, Almada VC. Historical population dynamics and demography of the Eastern Atlantic pomacentrid Chromis limbata (Valenciennes, 1833). Mol Phylogenet Evol. 2006;40:139–47. https://doi.org/10.1016/j.ympev.2006.02.009.
Domingues VS, Santos RS, Brito A, Alexandrou M, Almada VC. Mitochondrial and nuclear markers reveal isolation by distance and effects of Pleistocene glaciations in the northeastern Atlantic and Mediterranean populations of the white seabream (Diplodus sargus, L.). J Exp Mar Biol Ecol. 2007;346:102–13. https://doi.org/10.1016/j.jembe.2007.03.002.
González-Wangüemert M, Cánovas F, Pérez-Ruzafa A, Marcos C, Alexandrino P. Connectivity patterns inferred from the genetic structure of white seabream (Diplodus sargus L.). J Exp Mar Biol Ecol. 2010;383:23–31. https://doi.org/10.1016/j.jembe.2009.10.010.
Francisco SM, Almada VC, Faria C, Velasco EM, Robalo JI. Phylogeographic pattern and glacial refugia of a rocky shore species with limited dispersal capability: the case of Montagu’s blenny (Coryphoblennius galerita, Blenniidae). Mar Biol. 2014;161:2509–20. https://doi.org/10.1007/s00227-014-2523-6.
Stefanni S, Castilho R, Sala-Bozano M, Robalo JI, Francisco SM, Santos RS, et al. Establishment of a coastal fish in the Azores: recent colonisation or sudden expansion of an ancient relict population? Heredity. 2015;115:527–37. https://doi.org/10.1038/hdy.2015.55.
Salgueiro P, Palmeirim JM, Ruedi M, Coelho MM. Gene flow and population structure of the endemic Azorean bat (Nyctalus azoreum) based on microsatellites: implications for conservation. Conserv Genet. 2008;9:1163–71. https://doi.org/10.1007/s10592-007-9430-z.
Salgueiro P, Coelho MM, Palmeirim JM, Ruedi M. Mitochondrial DNA variation and population structure of the island endemic Azorean bat (Nyctalus azoreum). Mol Ecol. 2004;13:3357–66. https://doi.org/10.1111/j.1365-294X.2004.02354.x.
Neves VC, Griffiths K, Savory FR, Furness RW, Mable BK. Are European starlings breeding in the Azores archipelago genetically distinct from birds breeding in mainland Europe? Eur J Wildl Res. 2010;56:95–100. https://doi.org/10.1007/s10344-009-0316-x.
Thiel M, Haye PA. The ecology of rafting in the marine environment. III. Biogeographical and evolutionary consequences. Oceanogr Mar Biol. 2006;44:323–429. https://doi.org/10.1201/9781420006391.ch7.
Thiel M, Gutow L. The ecology of rafting in the marine environment. I. The floating substrata. Oceanogr Mar Biol. 2005;42:181–264. https://doi.org/10.1201/9780203507810.ch6.
Thiel M, Gutow L. The ecology of rafting in the marine environment. II. The rafting organisms and community. Oceanogr Mar Biol. 2005;43:279–418. https://doi.org/10.1201/9781420037449.ch7.
Ávila SP, Medeiros A, Martins AMF, Silva A, Melo C, Gomes C, et al. Lajes do Pico “À Ban-baxe-muro.” Publiçor; 2011.
Silva A, Brotas V, Valente A, Sá C, Diniz T, Patarra RF, et al. Coccolithophore species as indicators of surface oceanographic conditions in the vicinity of Azores islands. Estuar Coast Shelf Sci. 2013;118:50–9. https://doi.org/10.1016/j.ecss.2012.12.010.
Sala I, Caldeira RMA, Estrada-Allis SN, Froufe E, Couvelard X. Lagrangian transport pathways in the northeast Atlantic and their environmental impact. Limnol Oceanogr Fluids Environ. 2013;3:40–60. https://doi.org/10.1215/21573689-2152611.
Sala I, Harrison CS, Caldeira RMA. The role of the Azores Archipelago in capturing and retaining incoming particles. J Mar Syst. 2016;154:146–56. https://doi.org/10.1016/j.jmarsys.2015.10.001.
Folmer O, Black M, Hoeh W, Lutz R, Vrijenhoek R. DNA primers for amplification of mitochondrial Cytochrome C Oxidase subunit I from diverse metazoan invertebrates. Mol Mar Biol Biotechnol. 1994;3:294–9.
Geller J, Meyer C, Parker M, Hawk H. Redesign of PCR primers for mitochondrial Cytochrome C Oxidase subunit I for marine invertebrates and application in all-taxa biotic surveys. Mol Ecol Resour. 2013;13:851–61. https://doi.org/10.1111/1755-0998.12138.
ExPASy Translate Tool. https://web.expasy.org/translate/. Accessed 15 July 2020.
GenBank Database. https://www.ncbi.nlm.nih.gov/genbank/. Accessed 15 Jan 2021.
McWilliam H, Li W, Uludag M, Squizzato S, Park YM, Buso N, et al. Analysis tool web services from the EMBL-EBI. Nucleic Acids Res. 2013;41:W597–600. https://doi.org/10.1093/nar/gkt376.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33:1870–4. https://doi.org/10.1093/molbev/msw054.
Excoffier L, Lischer HEL. Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10:564–7. https://doi.org/10.1111/j.1755-0998.2010.02847.x.
Andrews S. FastQC - A quality control tool for high throughput sequence data. 2010. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20. https://doi.org/10.1093/bioinformatics/btu170.
Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–1. https://doi.org/10.1093/bioinformatics/btq461.
Miller MP, Knaus BJ, Mullins TD, Haig SM. SSR_pipeline: A bioinformatic infrastructure for identifying microsatellites from paired-end Illumina high-throughput DNA sequencing data. J Hered. 2013;104:881–5. https://doi.org/10.1093/jhered/est056.
Untergasser A, Cutcutache I, Koressaar T, Ye J, Faircloth BC, Remm M, et al. Primer3—New capabilities and interfaces. Nucleic Acids Res. 2012;40:e115. https://doi.org/10.1093/nar/gks596.
GitHub mcurto – SSR-GBS-pipeline. https://github.com/mcurto/SSR-GBS-pipeline. Accessed 15 Apr 2020.
Chapuis MPM, Estoup A. Microsatellite null alleles and estimation of population differentiation. Mol Biol Evol. 2007;24:621–31. https://doi.org/10.1093/molbev/msl191.
Holla S, Khan J, Sowjanya MS, Shashidhar H. Monomorphic molecular markers are as informative as polymorphic molecular markers. Indian J Genet Plant Breed. 2014;74:596. https://doi.org/10.5958/0975-6906.2014.00896.7.
Kalinowski ST, Taper ML, Marshall TC. Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment. Mol Ecol. 2007;16:1099–106. https://doi.org/10.1111/j.1365-294X.2007.03089.x.
Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.
Hubisz M, Falush D, Stephens M, Pritchard JK. Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour. 2009;9:1322–32. https://doi.org/10.1111/j.1755-0998.2009.02591.x.
Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics. 2003;164:1567–87.
Earl DA, VonHoldt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2012;4:359–61.
Kopelman NM, Mayzel J, Jakobsson M, Rosenberg NA, Mayrose I. Clumpak: a program for identifying clustering modes and packaging population structure inferences across K. Mol Ecol Resour. 2015;15:1179–91. https://doi.org/10.1111/1755-0998.12387.
Excoffier L, Smouse PE, Quattro JM. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics. 1992;131:479–91.
The authors thank the governmental entities that issued the sampling licenses and acknowledge the access provided to samples deposited in the collections of the Department of Biology of the University of the Azores and CIBIO-InBIO. Eva Dornstauder‐Schramler provided technical assistance during the development of the primer set. Open access funding was provided by the University of Natural Resources and Life Sciences Vienna (BOKU). We thank the reviewers for their comments and suggestions on the manuscript.
This work was supported by Fundação para a Ciência e Tecnologia, IP (PhD grant SFRH/BD/135918/2018 to LB; research contract IF/00465/2015 to SPA). It also benefited from FEDER funds through the Operational Programme for Competitiveness Factors (COMPETE), from national funds through FCT, Fundação para a Ciência e Tecnologia, IP (projects UID/BIA/50027/2013, POCI-01–0145-FEDER-006821), and from regional funds through Direção Regional para a Ciência e Tecnologia (DRCTM1.1. a/005/Funcionamento-C-/2018). It was further supported by FEDER funds (85%) and by funds of the Regional Government of the Azores (15%) through Programa Operacional Açores 2020, in the scope of the projects “VRPROTO”: Virtual Reality PROTOtype: the geological history of “Pedra-que-pica”: ACORES-01–0145-FEDER-000078, and “AZORESBIOPORTAL – PORBIOTA”: ACORES-01–0145-FEDER-000072. This work is included in The Austrian Barcode of Life (ABOL) Initiative.
Ethics approval and consent to participate
The fresh material from the Azores Archipelago was sampled with the permission of the respective regional authorities (Direção Regional da Ciência e Tecnologia, Governo Regional dos Açores), who issued the permits AMP 2018/014, CCIP 24/2019/DRCT, and CCIP 35/2019/DRCT. Material from the Marine Molluscs Collection of the Department of Biology of the University of the Azores and from CIBIO-InBIO’s marine invertebrate collection was loaned with the permission of the curators.
Consent for publication
Competing interests
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Complete list of all SSR primers designed in this study. Includes the sequences of the forward and reverse primers (5’-3’), the repeat motif and number of repeats in the original sequence used for primer design, the primer mix in which each primer was included for the multiplex PCR, and the allele length range (bp).
Table S2. Estimates of evolutionary divergence (raw p-distances) between COI haplotypes of Cingula trifasciata. The analysis, performed in MEGA v7, considered 44 haplotypes and 658 bp after excluding positions containing gaps and/or missing data for each sequence pair analysed. The populations of the sequences collapsed within each haplotype are given in the “Pops” column, as follows: SMA-Santa Maria, CAL-Caloura, MOS-Mosteiros, GRW-Graciosa, PIX-Pico, FSC-São Jorge, FLW-Flores, VIG-Vigo.
Table S3. Pairwise FST values between populations of Cingula trifasciata, based on the COI dataset.
Table S4. Pairwise FST values between populations of Cingula trifasciata, based on the complete WAI dataset.
Table S5. Estimates of evolutionary divergence (raw p-distances) between COI sequences of several Rissoidae species. The analysis, performed in MEGA v7, considered 18 sequences and 658 bp after excluding positions containing gaps and/or missing data for each sequence pair analysed. For sequences retrieved from GenBank, the accession number (AN) and species identification are provided in the table. Highlighted in bold are the divergence levels between the recognized species Alvania formicarum and A. mediolittoralis, and between Cingula trifasciata from the Azores and Vigo.
Figure S1. Genetic diversity of the SSR dataset under study. a) Variability measures estimated for each of the 26 SSR loci in the complete whole amplicon information (WAI) dataset: number of alleles (Na) and polymorphic information content (PIC); b) Genetic diversity patterns across the five populations of Cingula trifasciata with more than five individuals: number of alleles (Na), number of different alleles with frequency over 5%, number of effective alleles (Ne), number of private alleles (Npa), expected (He) and observed (Ho) heterozygosity.
Figure S2. Genetic structure analysis of the complete WAI dataset of Cingula trifasciata. Inferred with STRUCTURE v2.3.4 [5, 6], reporting the results from K = 7 to K = 10.
Cite this article
Baptista, L., Meimberg, H., Ávila, S.P. et al. Dispersal ability, habitat characteristics, and sea-surface circulation shape population structure of Cingula trifasciata (Gastropoda: Rissoidae) in the remote Azores Archipelago. BMC Ecol Evo 21, 128 (2021). https://doi.org/10.1186/s12862-021-01862-1
- Cingula trifasciata
- Population structure