Skip to main content

The adult shell matrix protein repertoire of the marine snail Crepidula is dominated by conserved genes that are also expressed in larvae

Abstract

Mollusca is a morphologically diverse phylum, exhibiting an immense variety of calcium carbonate structures. Proteomic studies of adult shells often report high levels of rapidly-evolving, ‘novel’ shell matrix proteins (SMPs), which are hypothesized to drive shell diversification. However, relatively little is known about the phylogenetic distribution of SMPs, or about the function of individual SMPs in shell construction. To understand how SMPs contribute to shell diversification a thorough characterization of SMPs is required. Here, we build tools and a foundational understanding of SMPs in the marine gastropod species Crepidula fornicata and Crepidula atrasolea because they are genetically-enabled mollusc model organisms. First, we established a staging system of shell development in C. atrasolea for the first time. Next, we leveraged previous findings in C. fornicata combined with phylogenomic analyses of 95 metazoan species to determine the evolutionary lineage of its adult SMP repertoire. We found that 55% of C. fornicata’s SMPs belong to molluscan orthogroups, with 27% restricted to Gastropoda, and only 5% restricted at the species level. The low percentage of species-restricted SMPs underscores the importance of broad-taxon sampling and orthology inference approaches when determining homology of SMPs. From our transcriptome analysis, we found that the majority of C. fornicata SMPs that were found conserved in C. atrasolea were expressed in both larval and adult stages. We then selected a subset of SMPs of varying evolutionary ages for spatial-temporal analysis using in situ hybridization chain reaction (HCR) during larval shell development in C. atrasolea. Out of the 18 SMPs analyzed, 12 were detected in the larval shell field. These results suggest overlapping larval vs. adult SMP repertoires. Using multiplexed HCR, we observed five SMP expression patterns and three distinct cell populations within the shell field. These patterns support the idea that modular expression of SMPs could facilitate divergence of shell morphological characteristics. Collectively, these data establish an evolutionary and developmental framework in Crepidula that enables future comparisons of molluscan biomineralization to reveal mechanisms of shell diversification.

Peer Review reports

Background

Calcium carbonate biomineralization is a significant innovation in the evolution of divergent body forms, and is thought to have evolved independently at least 20 times in metazoans (Fig. 1A) [1,2,3]. Within the phylum Mollusca alone it is hypothesized that calcium carbonate biomineralization evolved multiple times, contributing to a rich diversity of shell shape, size, color, microstructure, and mineral content (Fig. 1B) [4,5,6,7,8,9,10]. During molluscan shell biomineralization, specialized epithelial cells in the mantle tissue secrete organic materials into an enclosed extrapallial space, including polysaccharides, proteoglycans, and shell matrix proteins (SMPs) [11, 12]. Secreted SMPs self-assemble to create an extracellular matrix that facilitates the precipitation of calcium carbonate, and SMPs become occluded in the shell [13,14,15]. Shell proteomic comparative studies have identified only a few SMPs with homologs across molluscan species [16], suggesting that SMP repertoires include many rapidly evolving, lineage-specific proteins [5, 17]. The presence of lineage-specific SMPs has led to the hypothesis that at the protein structure level, SMP diversity may have driven shell diversification and contributed to the acquisition of novel shell morphologies [6, 12, 18]. One approach to test this hypothesis is to (1) identify a species’ SMP repertoire through proteomics, (2) determine the evolutionary age of its SMPs, (3) determine the spatiotemporal expression patterns of the SMPs across life history stages, and (4) functionally perturb conserved and lineage-restricted SMP gene products to assess the morphological consequences of losing specific SMPs. All four lines of analysis should be feasible in the same organism. Two such species are the marine slipper snails Crepidula fornicata and Crepidula atrasolea [19,20,21,22]. In the present study we determine the evolutionary age of all known C. fornicata SMPs and examine the expression patterns for a subset of lineage-specific SMPs during larval shell development. These results set the stage for future work to determine how SMPs may contribute to novel shell morphology and evolution.

Fig. 1
figure 1

Crepidula fornicata and Crepidula atrasolea are complementary model systems for biomineralization. Phylogeny of calcium carbonate biomineralization throughout the Metazoa (A; Gilbert et al. 2022) and the Mollusca (B; Kocot et al. 2020). Triangles indicate taxa that produce calcium carbonate skeletons; squares indicate taxa that have a published biomineral proteome as of 2023; circles indicate taxa that have conducted in situ hybridization for shell matrix proteins (SMPs). Comparison of indirect larval development in C. fornicata vs. direct development in C. atrasolea (C). Adult shells of the two species (D). Organism silhouettes are from PhyloPic (www.phylopic.org) with the exception of Crepidula and Polyplacophora

While many studies examine SMPs that are occluded in adult shells, it is also important to study SMPs that are occluded in the larval shell. In gastropods, larval shell formation begins after gastrulation when the posterior dorsal epithelium thickens and becomes known as the shell field [23]. In the literature, “shell field” and “larval mantle” both refer to the larval tissues responsible for shell formation [23,24,25,26]. The shell field (which will eventually mature into the adult mantle tissue) secretes SMPs throughout larval development when shell morphology is first established [23]. Larval shells and adult shells can differ in microstructure, mineral content, and SMP composition [27,28,29,30]. Most studies identify SMPs use adult shells as the source material and relatively few studies focus on comparing SMP repertoires at different life history stages. While several studies highlight differences in the SMP content of larval vs. adult shells (e.g. in bivalves [29,30,31], similarities have also been identified. For example, we recently showed that several of the most highly-expressed adult C. fornicata SMPs are also expressed in the larval shell field [19]. Although functional perturbation is required to conclusively determine gene function, the expression of adult SMPs in the larval shell field is highly suggestive of a role in shell biomineralization. By examining SMPs during larval shell development, when shell morphology is first established, we can study the role of SMPs in the acquisition of new shell morphologies and shell diversity over developmental time. The deployment of SMPs during development could change over evolutionary time scales, contributing to shell diversity.

Shell diversification may also rely on the division of the mantle tissue into discrete regions of SMP expression, with each region responsible for producing specific shell characteristics [24, 32]. Distinct regions of SMP expression within the mantle have been observed in multiple molluscan species [24, 33, 34]. For example, in adult bivalves, SMPs involved in nacreous-layer formation are found in the dorsal region of the mantle epithelium [34, 35], whereas SMPs implicated in prismatic-layer biomineralization are localized to the ventral portion of the mantle epithelium [36, 37]. The initial regionalization of SMPs during larval shell development could enable adult mantle modularity, and modularity has been hypothesized to drive novel shell features [32]. One could imagine that if an SMP responsible for prismatic-layer biomineralization is expressed in a new region within the shell field, that could result in a novel shell microstructure, promoting shell diversification [32]. To further explore such possibilities, we need to account for how the evolutionary age of an SMP relates to its expression domain within the shell field or mantle. For example, if lineage-restricted SMPs are responsible for specialized shell characteristics, perhaps they are localized to specific regions of the shell field. Likewise, a reasonable hypothesis is that conserved SMPs, which contribute to fundamental processes of biomineralization (e.g. calcium-regulation), may be more broadly expressed in shell field cells compared to lineage-restricted SMPs.

To understand how conserved vs. lineage-restricted SMPs might contribute to shell construction and diversification, accurate estimates of the evolutionary ages of SMPs are necessary. However, a finding from multiple proteomic comparisons of SMPs using BLAST approaches is that homologs could not be identified between the shell proteomes of different species [34, 38, 39]. One main conclusion stemming from the lack of homology in BLAST hits was that SMPs were deemed highly “lineage-specific”, meaning they were found only within a particular clade [36, 40, 41]. In many cases, the phylogenetic level to which SMPs are restricted remains unclear due to the small sample sizes of available shell proteomes relative to the number of representative species [12]. Additionally, previous comparative studies relied heavily on BLAST approaches, which have been shown to generate false-negatives for sequences that are short and repetitive (for which many SMPs are susceptible), resulting in an underestimate of orthologs between different species [42]. With the emergence of more molluscan shell proteomes, transcriptomes, and genomes we can now use new orthology inference approaches to identify orthologs with more confidence [9, 10, 19, 31, 38, 43,44,45,46]. These new approaches (e.g. OrthoFinder) [46] compensate for gene-length bias, and allow us to more confidently determine the evolutionary age of SMPs and hypothesize how SMPs influence shell morphology.

Here we present a toolkit of resources for studying molluscan SMP biology in two Crepidula species that are amenable to gene perturbation techniques (Fig. 1C and D). In particular we focus on C. atrasolea (a congener of the established model C. fornicata) that is an excellent species for studying shell development due to its availability year round in laboratory culture [20, 22]. We use our previous proteomic and transcriptomic work from C. fornicata [19] to identify the evolutionary age of its adult SMP repertoire and identify SMPs that are shared at different taxonomic levels with molluscs and metazoans. We then adapted in situ hybridization chain reaction (HCR) to Crepidula spp for the first time to examine the spatiotemporal expression patterns of a subset of SMPs during larval stages. With these data we revisit how SMP diversity and deployment during development might control shell evolution.

Methods

Animal culture

C. atrasolea [47] adults were maintained in the Lyons lab at the Scripps Institution of Oceanography, San Diego, California, USA. The animals were kept at 27 °C, in 100 mm x 20 mm plastic petri dishes (Corning # 430,167) with daily filtered natural seawater changes, and fed daily with 0.002% Phyto-Feast (Reed Mariculture Inc., Campbell, CA). Clutches of eggs inside their capsules were collected from the adults using forceps, as previously described in Henry et al. 2017, and placed in 35 mm x 10 mm plastic petri dishes (Falcon # 351,007) with bottle top-filtered (0.2 μm pore size; Thermo Scientific # 291–4520) natural sea water with streptomycin (2ug/mL; Sigma # S6501-100G) and penicillin (0.6 ug/mL; Sigma # 13752-5G-F) [20]. The collected embryos were then raised at room temperature ( 20 °C) inside their capsules until reaching the appropriate stages needed for fixation. Since C. atrasolea eggs are fertilized internally, the developmental age (days or hours post fertilization) of embryos was approximated based on hours or days since the embryo was uncleaved. The development of embryos within a single clutch is asynchronous, resulting in uncleaved, 2-cell, and 4-cell embryos all in the same clutch. In addition to days post fertilization (dpf), older embryos were also staged based on the morphology of their developing mouth, velar lobes, foot rudiment, and shell field. When embryos reached a desired stage, they were decapsulated under a dissecting scope using forceps in a gelatin coated 35 mm x 10 mm plastic petri dish prior to fixation or RNA extraction.

Embryo fixation and fluorescent staining for shell development characterization

For embryos with ciliated velar lobes (starting at the organogenesis stage; Fig. 1C), individuals were relaxed in 7.5% magnesium chloride (dissolved in filtered natural seawater; Sigma-Aldrich #M7304) at benchtop for 30 min prior to fixation. Decapsulated embryos were fixed at  20 °C with 4% formaldehyde (Thermo Scientific # 28,908) in double filtered sea water for 60 min on a shaking plate followed by three 1 min 1X PBS Tween (1X Phosphate Buffered Saline: Gibco #70011-044, 0.5% Tween 20: Sigma-Aldrich P1379-100 mL) washes followed by three 10 min 1X PBS Tween washes. Animals were then dehydrated in methanol with three 5 min 100% methanol washes followed by three 1 min 100% methanol washes. Fixed individuals were stored at -80 °C in 100% methanol.

To visualize glycoproteins in the shell matrix, embryos were stained with wheat germ agglutinin (WGA), a lectin with a high affinity to N-acetylglucosamine and N‐acetylneuraminic acid residues [48,49,50]. Selected samples fixed in methanol (detailed above) were rehydrated into 5X SSCT solution (5X sodium chloride sodium citrate: Invitrogen # 15,557,044, 0.1% Tween 20) following 5 min 75% methanol 25% 5X SSCT, 50% methanol 50% 5X SSCT, 25% methanol 75% 5X SSCT, and two 100% 5X SSCT washes. Embryos were then stained in the dark with 0.001 mg/ml WGA (Alexa Fluor™ 594 Conjugate; Invitrogen W11262) and counterstained with 0.001 mg/ml hoechst in 5X SSCT for either 2–3 h at 20 °C or overnight at 4 °C overnight. Embryos were then rinsed with the 5X SSCT solution three times for 5 min at 20 °C. Samples were stored in the dark at 4 °C, and imaged within a week. Imaging methods are detailed below.

RNA extraction and long read RNA sequencing

Three embryonic samples (Samples 1–3) spanning zygote through early juvenile development, and three samples (Sample 4–6) of C. atrasolea adults or their organs were collected for RNA extraction. Each embryonic RNA sample was made from two separate pools of embryos where RNA extractions were performed separately. Sample 1 contains one pool (n = 279) of uncleaved zygotes (0–8 hpf), 25-cell (24 hpf), and compaction ( 36 hpf) staged embryos; and a second pool (n = 186) of rectangular stage embryos (5 dpf). Sample 2 includes one pool (n = 180) of early organogenesis stage embryos (7 dpf) and a second pool (n = 168) of late organogenesis stage embryos (8–11 dpf). Sample 3 includes one pool (n = 133) of pre-hatch juveniles (14 dpf) and a second pool (n = 55) of post-hatched juveniles (14–15 dpf). Pools of embryos were kept separate and transferred to 1.5 mL tubes for RNA extraction. RNA from the separate pools of embryos were combined into the three samples as described above then sent for sequencing. Three samples (Sample 4–6) of C. atrasolea adults or their organs were collected for RNA extraction. Sample 4 contains one entire adult male specimen of C. atrasolea. Samples 5–6 contain RNA extracted from individually dissected organs from a single adult female. Sample 5 contains RNA only extracted from the mantle. Sample 6 contains RNA from Sample 5 along with RNA extracted from the remaining organs: head, goot, gill, visceral mass. Each dissected organ was kept separate for RNA extraction (supplementary methods S.1, Supplementary Material), and later recombined in equal parts to form Sample 6 prior to library preparation (supplementary methods S.2.1, Supplementary Material).

Total RNA was extracted using TRIzol (ThermoFisher), according to the manufacturer instructions (Invitrogen Doc. Part No. 15,596,026.PPS). Briefly, TRIzol was added to the collection tubes, and tubes were immediately frozen by placing them into a dry ice ethanol bath (70% ethanol, dry ice). Once frozen, mortar and pestle were used to completely homogenize tissues followed by the addition of chloroform to achieve separation of RNA in the aqueous layer. Precipitation of RNA from the aqueous layer was performed using 100% isopropanol, and after centrifugation, the RNA pellet was cleaned of impurities using 70% ethanol followed by resuspension in nuclease free water. For more details on RNA quality and quantity please refer to supplementary methods S.2.1, Supplementary Material. Six RNA samples (described above) were sent to the Roy J. Carver Biotechnology Center at the University of Illinois at Urbana-Champaign where the library preparation and sequencing were performed. 300 ng of total RNA were converted to cDNA with the Iso-Seq Express Oligo Kit. cDNA was barcoded with the Barcoded Overhang Adaptor Kit 8 A, and converted into a library with the SMRTBell Express Template Prep kit 2.0 (Pacific Biosciences). The libraries were pooled in equal concentration and the pool was sequenced on 1 SMRTcell 8 M on a PacBio Sequel IIe using the CCS sequencing mode and a 30hs movie time.

Hybrid transcriptome assembly and annotation

Both short read transcriptomes generated by Illumina sequencing and long read transcriptomes generated by Iso-seq were used to construct a high coverage transcriptome for C. atrasolea (supplementary methods S.3, Supplementary Material). The long read transcriptomes were obtained as described in the supplementary methods S.3.1. The short read transcriptomes were downloaded from the NCBI Sequence Read Archive under accession number (SRP114839) [20]. We used rnaSPAdes (version 3.15.4) [51] to construct the hybrid transcriptome with both the short read and the long read data. The resulting transcriptome contains 848,512 transcripts, with a median length of 631 nt. CD-HIT (version 4.8.1) [52] was then used to merge redundant transcripts, resulting in 699,466 transcripts. Transcripts from non-eukaryotic species were then removed using alien index code [53], resulting in a transcriptome with 694,602 transcripts. The transcriptome was uploaded to TSA database where it underwent further adapter and contamination screening steps using FCS-GX (Astashyn et al., 2024). After foreign contamination was removed, the final accepted transcriptome uploaded to NCBI/TSA database contained 693,671 transcripts. We inferred a C. atrasolea proteome from this hybrid transcriptome using TransDecoder (version 5.5.0) [54] and used InterproScan (version 5.52) [55] to generate annotations for the inferred protein products. For additional details on the hybrid transcriptome and proteome assembly and annotation please see supplementary methods S.3.

To assess the quality of the transcriptome, BUSCO (version 5.3.2) [56] was used to score its completeness against Metazoa as well as Mollusca databases (metazoa_odb10 and mollusca_odb10). The hybrid transcriptome scored high completeness against both databases, with 99.2% for Metazoa (S:24.9%, D:74.3%, F:0.5%, M:0.3%) and 91.7% for Mollusca (S:27.0%, D:64.7, F:1.9%, M:6.4%).

Identification of C. atrasolea SMPs in different IsoSeq libraries

The long read IsoSeq data consist of 6 individual samples including samples of adult tissue and samples spanning development (for more information see “RNA extraction and Long read RNA sequencing” above). To determine how many of the C. atrasolea SMP orthogroups are found in both embryonic development and adult tissues, we compared SMP orthogroup sequences with the raw IsoSeq data from each of the individual samples. We used Blast + command line to blast (tblastn) the sequences found in each SMP orthogroup to the individual IsoSeq sample sequences. If any of the sequences comprising the SMP ortholog had a blast hit to any of the individual samples, it was included as present in that sample.

Species proteomes and data curation for orthology inference

In total 95 metazoan species proteomes were obtained from both publicly available databases and from a previously computed Orthofinder run (supplementary file S.1, Supplementary Materials) [43]. The 95 proteomes included in the Orthofinder run span 16 different metazoan phyla (including 61 molluscan species) and represent multiple sub-phylum-level metazoan clades. Additionally, 15 biomineral proteomes, which were for species whose proteomes were already included in the 95 proteomes (supplementary file S.1, Supplementary Materials), were analyzed using Orthofinder.

Orthofinder2 analysis to identify orthogroups

To determine whether SMPs had orthologous copies in other species, orthology inference was used to identify conserved orthogroups within the previously mentioned input species proteomes using Orthofinder2 [46]. First, an all-vs-all DIAMOND (version 0.9.15) [57] search provided pairwise similarity scores for all sequences, and was used by Orthofinder2 to normalize for gene-length-bias and perform clustering of sequences into orthogroups using the MCL clustering algorithm [58]. Genes contained in each orthogroup are considered orthologs [46]. Second, sequences found within orthogroups were aligned using MAFTT (version 7.221) [59], and gene tree inference was performed using FastTree [60] to generate gene trees from multiple sequence alignments (supplementary figures S1: S13, Supplementary Materials).

Kinfin analysis to identify lineage-restricted orthogroups

Orthogroups for each of the previously identified 185 C. fornicata SMPs were produced by Orthofinder2 and analyzed by Kinfin (version 1.0.3) [61] to identify lineage-restricted orthogroups at the species (C. fornicata), genus (Crepidula), family (Calyptraeidae), order (Littorinimorpha), class (Gastropoda), and phylum (Mollusca) levels. Based on these classifications, and at various times referred to in the text, the term “metazoan orthogroup” is used to refer to orthogroups that have molluscan and non-molluscan taxa represented. It is noted that non- metazoan species (e.g. eukaryotic outgroups) were not included in the Kinfin and Orthofinder analysis. Thus, the “metazoan orthogroups” likely contain many genes that are inherited from a more distant common ancestor to eukaryotes and metazoans. The broader term “molluscan orthogroup” is used to refer to orthogroups that contain only molluscan taxa. The term “lineage-restricted” is used to refer to orthogroups that contain proteins from more than one species that form a respective monophyletic clade; an exception being for a species-restricted gene (genes that were not found in any orthogroup).

SMPs selected for in situ hybridization chain reaction (HCR)

HCR was used to examine the expression of SMPs from different lineage-restricted levels in C. atrasolea during mid organogenesis and veliger stages, prioritizing SMPs that were previously found differentially expressed in the mantle compared to the head, foot, and gill tissues of C. fornicata [19]. In total, the in situ expression patterns of 18 SMPs in 13 orthogroups were examined by HCR in C. atrasolea (supplementary file S.2, Supplementary Materials). These orthogroups contain 9 metazoan-restricted, 4 molluscan-restricted, and 5 gastropod-restricted SMPs. A C. fornicata-restricted SMP, CfSMP6, from orthogroup OG0088976 was also selected for HCR analysis. To annotate orthogroups, BLAST searches for all proteins within an orthogroup were performed against GenBank and their best reciprocal BLAST hit descriptions for all proteins were considered to arrive at a putative annotation for the orthogroup. As a result, 4 orthogroups returned BLAST hit descriptions of uncharacterized proteins, while the remaining 9 orthogroups returned descriptions of proteins that have been previously implicated in biomineralization including chitin-binding, calmodulin, lectin-like, and cysteine-protease (supplementary file S.1; supplementary file S.2, Supplementary Materials).

In situ HCR

Following the HCR probe design protocol described in Kuehn et al. (2022), a custom software (ÖzpolatLab-HCR, 2021) was used to generate 15–30 DNA oligo probe pairs to C. atrasolea messenger RNA (mRNA) for each of the 18 genes of interest (supplementary file S.2, Supplementary Materials). We used our newly generated C. atrasolea transcriptome to design DNA probe pairs for our genes of interest. Using these DNA probe sequences, custom DNA oligos pools (50 pmol DNA oPools Oligo Pool) were ordered from Integrated DNA Technologies (Coralville, Iowa, USA). Probes were stored in nuclease-free and RNase-free H2O at a concentration of 1 pmol at -80 °C.

The HCR protocol was based upon those previously published for whole embryos in Kuehn et al. (2021) and Choi et al. (2018) [62, 63], with modifications to better suit our samples. Methanol fixed embryos were re-hydrated into 5X SSCT buffer before a 30 min pre-hybridization at 37 °C in probe hyb buffer (30% formamide: VWR # JT4028-1, 5X sodium chloride sodium citrate, 9 mM citric acid (pH 6.0): Sigma-Aldrich # C1909, 0.1% Tween 20, 50 µg/ml heparin: Sigma-Aldrich # H3393, 1X Denhardt’s solution: Thermo Scientific # AAJ63135AE, 10% dextran sulfate: Sigma-Aldrich # S4030). Embryos were then incubated in probe solution (1 pmol probe in probe hyb buffer) for 20–24 h at 37 °C. The probe solution was then removed with probe wash buffer (30% formamide, 5X sodium chloride sodium citrate, 9 mM citric acid (pH 6.0), 0.1% Tween 20, and 50 µg/ml heparin; at 37 °C) and 5X SSCT buffer (at 20 °C) rinses. Next, samples were incubated in the hairpin solution for 22–24 h at 20 °C. The following day, hairpins were removed with 5X SSCT buffer washes. Samples were then counterstained with 0.001 mg/ml hoechst in 5X SSCT (detailed above), and stored at 4 °C for up to a week. For additional details please refer to supplementary methods S.5, Supplementary Materials. Improved mRNA expression detection of CaSMP1 in the shell field was observed using HCR vs. colorimetric in situ hybridization (supplementary fig. S14, Supplementary Materials), and thus HCR was the preferred method for examining the expression profiles of C. atrasolea SMPs.

Signal specificity was confirmed with control samples that were hybridized without a probe set. In the B3 546 fluorophore hairpin-only control veliger embryos, very faint labeling in the larval kidney, velum, and ocellus (supplementary fig. S15; fig. S16; fig. S17, Supplementary Materials) was noted. Faint background labeling of the larval kidney and ocellus was also noted in veliger staged B1 647 controls (supplementary fig. S15, Supplementary Materials).

Mounting and imaging

Following Hoechst and/or WGA staining, individual specimens were mounted in either 80% Glycerol (Promega H5433) 20% 5X SSCT or 2% methyl cellulose in 5X SSCT (for posterior imaging) on Rain-X-coated (ITW Global Brands, Houston, TX) 22 × 50 mm glass coverslips for fluorescent imaging on a Zeiss LSM 700 on the 20x objective with an AxioCam HRm camera. Fluorescent Z-stack images were acquired in ZenBlack (Zeiss), then processed in ImageJ and Adobe Photoshop (Adobe Inc., California, USA). Samples for dark field images were mounted as described above, and imaged on a Zeiss Axio Imager M2 on the 10x objective with an AxioCam 506 Color camera. Darkfield Z-stack images were acquired in ZenBlue (Zeiss), Z-stacks were then flattened and focused in Helicon Focus (HeliconSoft (RRID: SCR_014462), and cropped in ImageJ. Figures were created in Adobe Illustrator (Adobe Inc., California, USA).

Results

Characterization and timing of shell development in Crepidula atrasolea

To establish C. atrasolea as a model research organism for shell development in gastropods, we generated a detailed staging scheme examining stage-specific morphological features of shell development (Fig. 2). Refining the published staging scheme in [20], we based our stages on development at 20 °C with a focus on morphological changes to the shell field, the embryonic and larval shell-forming tissue. We found that wheat germ agglutinin (WGA), traditionally used to label cell membrane glycoproteins, marks the shell field and secreted shell material throughout larval development.

Fig. 2
figure 2

Characterization and timing of shell development in Crepidula atrasolea. Dark field and confocal images of fixed C. atrasolea embryos from approximately 8 days post-fertilization (dpf) to approximately 15 dpf at 20 °C. Fixed embryos were stained with wheat germ agglutinin (WGA) and imaged at the 20x objective on a confocal microscope. Cartoons highlight changes in shell morphology during development. In mid ovoid staged embryos (A), WGA stains nuclear membranes throughout the embryo. WGA begins to mark the shell field starting at the late ovoid stage (B). WGA continues to mark the developing shell throughout the organogenesis stages (C-E). WGA also marks the velar lobes, larval kidney, and foot in veliger staged embryos (F-K) Hoechst is shown in gray and WGA in yellow. fr, foot rudiment; ft, foot; hv, head vesicle; lk, larval kidney; in, intestine; ir, intestinal rudiment; oc, ocellus; sf, shell field; sfe, shell field edge; st, stomodeum; tn, tentacles; ve, velar lobes. Scale bars each represent 100 μm

During larval development, the dorsal epithelium invaginates to produce a transient structure, known as the shell gland [23, 64], where organic shell secretions are first detected [23, 65]. The shell gland tissue evaginates to form the shell field. We first detected organic shell secretions–as assayed by WGA staining–in the shell field during ovoid stages (8–9 dpf). Larval structures, including the foot rudiment, stomodeum, shell field, and intestinal rudiment, were first visible in mid ovoid embryos at  8 dpf, when elongation along the anterior-posterior axis is apparent (Fig. 2A). During the mid ovoid stage, the posteriorly-located shell field is translucent and rounded. Translucent ventral structures, including the velar lobes and foot rudiment, become more prominent. At the mid ovoid stage, WGA intensely stains the outlines of cell nuclei throughout the embryo and very weakly marks the developing shell field in the posterior terminus (Fig. 2A’’). The late ovoid stage ( 9 dpf) is distinguished by the presence of two ciliated velar lobes, a lengthened foot rudiment, and a flattened shell field along the dorsal-posterior end of the embryo (Fig. 2B). At this stage, glycoprotein secretion from the shell field cells into the extracellular matrix is first strongly visible, as indicated by bright WGA staining (Fig. 2B’).

During organogenesis stages (10–12 dpf), larval shell accretion corresponds to elongation along the anterior-posterior axis (Fig. 2C: Fig. 2E). In the early organogenesis stage ( 10 dpf), the shell field begins to diverge from the midline for the first time, as it rounds out in the posterior of the embryo (Fig. 2C). WGA staining is visible in the larval shell secretions and larval kidney during early organogenesis (Fig. 2C’’). At the mid I organogenesis stage ( 11 dpf), the shell field elongates along the anterior-posterior axis, larval organs continue to develop, and pigmented ocelli are visible (Fig. 2D). The larval shell field thickens and expands in the mid II organogenesis stage (12 dpf), and the pigmented ocelli are pronounced (Fig. 2E). Increased WGA staining highlights shell field expansion during mid I organogenesis (Fig. 2D’) and mid II organogenesis (Fig. 2E’).

In veliger stages ( 13 dpf to  20 dpf), the larval shell is fully formed, and larval organs are well developed (Fig. 2F: Fig. 2K). At the early veliger stage, an enlarged foot extending ventrally along the body, enhanced velar lobes, and paired tentacles are also present (Fig. 2F and G). The WGA staining of secreted shell materials (e.g. glycoproteins) diminished during the veliger stages. At mid veliger stages ( 14 dpf) the shell is angled upwards (Fig. 2H and I), and then flattens out to cover the entire animal during late veliger stages (Fig. 2J and K). During the veliger stages, the yolk reserves begin to diminish and are no longer visible in the hatched juvenile ( 21 dpf).

Most SMPs have conserved orthologs with only a few being species-restricted to C. fornicata

We asked whether C. fornicata SMPs have orthologs in other species, or whether most are species-restricted SMPs. Furthermore, we asked whether or not we could determine the evolutionary age of those SMPs through an orthology inference approach. To identify orthologs of C. fornicata in C. atrasolea, we used our newly assembled hybrid C. atrasolea transcriptome (described in methods above). Using the orthology inference program Orthofinder, we examined the proteomes of 95 metazoan species to determine the level at which 185 SMPs from C. fornicata are restricted to. In total, Orthorfinder assigned 89.1% (n =  3,777,255 of 4,237,370) of the input proteins to 213,404 orthogroups. We examined the Orthofinder results and determined that 95% (n = 175 of 185) of the SMPs were assigned to 145 orthogroups containing two or more species (Fig. 3; Fig. 4). In total, we found that 40% (74 of 185) of SMPs were in orthogroups that contain taxa from at least one molluscan species and one non-molluscan species, which we classify as a “metazoan orthogroup” (see Methods). Additionally, we determined that the remaining 55% (101 of 185) of SMPs are found within orthogroups that contained only molluscan taxa, which we classify as a “molluscan orthogroup”.

Fig. 3
figure 3

Shell matrix proteins of Crepidula fornicata clustered into orthogroups. Rows represent SMP orthogroups that were generated through ortholog inference techniques, and ordered based on the taxonomic level to which they are found to be lineage-restricted. Columns depict counts of proteins per taxon with their order (tree depicted above columns) based on current understandings of their phylogenetic positions (Kocot et al. 2011; Laumer et al. 2015; Laumer et al. 2018; Cunha and Giribet 2019). Gray boxes indicate presence of at least one protein from a taxon in an orthogroup. White boxes indicate that no protein was found for a respective taxon in an orthogroup. Green boxes indicate the presence of a skeletal matrix protein in the given orthogroup

Fig. 4
figure 4

Lineage-restricted shell matrix proteins selected for HCR in Crepidula atrasolea. Rows represent SMP orthogroups that were examined by HCR in C. atrasolea. They are ordered based on the taxonomic level to which they are found to be lineage-restricted (Metazoa-restricted, Gastropoda-restricted, and Crepidula-restricted). Columns depict counts of proteins per taxon with their order (tree depicted above columns) based on current understandings of their phylogenetic positions (Kocot et al. 2011; Laumer et al. 2015; Laumer et al. 2018; Cunha and Giribet 2019). Gray and white boxes indicate presence or absence of respective species within SMP orthogroups. Green boxes indicate the presence of a skeletal matrix protein in the given orthogroup

These molluscan orthogroups were further sub-divided into multiple lineage-restricted SMPs: 5% (n = 9 of 185) Molluscan-restricted; 27% (n = 51) Gastropod-restricted; 5% (n = 9) Littorinimorpha-restricted; 17% (n = 32) Crepidula-restricted (Fig. 3). We observed that Crepidula-restricted SMP orthogroups were responsible for nearly one sixth of all SMP orthogroups, and that of the two other Crepidula species in the dataset, C. atrasolea had orthologs for 91% (n = 168 of 185) of all SMPs, compared to 72% (n = 134 of 185) for C. navicella. Interestingly, in the IsoSeq data the majority of C. fornicata SMPs conserved in C. atrasolea (116/167) were found in both larval and adult stages (supplementary file S3, Supplementary Materials). We note that fewer C. atrasolea SMP orthologs (167 vs. 168) were detected using only IsoSeq data compared to the Orthofinder findings, likely due to the combined transcriptome being more complete. Our results showed that only 5% (10 of 185) of SMPs were not found in any other species, suggesting that few species-restricted SMPs exist in the shell proteome. Taken together, these results demonstrate that the majority of SMPs have conserved orthologs while few SMPs are actually novel to C. fornicata.

Shared shell field-specific expression of SMP1 in C. atrasolea and C. fornicata embryos

To confirm that gastropod-restricted SMPs have comparable expression patterns in C. fornicata and C. atrasolea, we compared larval expression of Shell Matrix Protein 1 (SMP1), the most upregulated SMP in the adult C. fornicata mantle [19], in the two species. The species comparison of SMP1 also demonstrated that we can apply previous proteomic and transcriptomic findings from C. fornicata to C. atrasolea. We found exclusive shell field expression of SMP1 in both C. fornicata and C. atrasolea (Fig. 5). The CaSMP1 homolog we found contains an extracellular matrix-binding reeler domain, which is also found in C. fornicata. Based on the morphology of the velar lobes and foot rudiment, we identified late ovoid and late organogenesis stage embryos as the most comparable stages between the two species (Fig. 1C). Using HCR, we found that expression of CfSMP1 and CaSMP1 was confined to cells within the shell field at the ovoid stages (Fig. 5A and C), and a stronger ring of expression was seen in the shell field edge during organogenesis (Fig. 5B and D). We used WGA to stain the larval shell field in C. fornicata and C. atrasolea embryos (Fig. 5B: Fig. 5D). WGA staining of the shell field in C. fornicata was very faint at the ovoid stage (Fig. 5A’’’) and most prominent at the organogenesis stage (Fig. 5B’’’). Meanwhile, WGA staining was bright in both ovoid (Fig. 5C’’’) and organogenesis staged embryos of C. atrasolea (Fig. 5D’’’). In both species, the WGA staining encompasses the SMP1 expressing cells (Fig. 5A: Fig. 5D). Together, SMP1 expression and WGA staining revealed a more dorsal-lateral positioned, and flatter, shell field in C. fornicata embryos compared to the larval shell field of C. atrasolea.

Fig. 5
figure 5

Shell field expression of Shell Matrix Protein 1 in Crepidula fornicata and Crepidula atrasolea embryos. Using hybridization chain reaction (HCR), mRNA expression was detected for SMP1 in both species (A-D). Wheat germ agglutinin (WGA) was used to mark the larval shell. CfSMP1 is expressed throughout the shell field in late ovoid staged C. fornicata embryos (A), and more intensely around the shell field edge during organogenesis (B). Similar expression patterns are seen for CaSMP1 in late ovoid staged (C) and organogenesis staged C. atrasolea embryos (D). Hoechst is shown in gray, SMP1 in magenta, and WGA in yellow. fr, foot rudiment; ft, sf, shell field; ve, velar lobes; lk, larval kidney. Scale bars represent 100 μm

12 out of 18 SMPs examined exhibit regionalized expression in the larval C. atrasolea shell field

To identify SMPs expressed in the shell field of C. atrasolea, we characterized the spatial-temporal expression of 18 different SMPs from 13 different orthologous groups at the late organogenesis and veliger stages. Out of the 18 genes we examined, 12 were expressed in the shell field, and five (CaSMP1, CaSMP10, CaSMP9, CaSMP17, and Ca211122) were exclusively expressed in the shell field (Fig. 6; supplementary fig. S18; supplementary file S.2, Supplementary Materials). We found six SMPs without shell field expression in C. atrasolea at the late organogenesis and veliger stages (supplementary fig. S19, Supplementary Materials). Many of the SMPs we examined were expressed in other tissues outside of the shell field, including in the statocyst (supplementary file S2, Supplementary Materials). Overall, we found two general SMP expression patterns in the shell field: SMPs with restricted expression in the shell field edge (Fig. 6A: Fig. 6F; supplementary fig. S18, Supplementary Materials), and SMPs with broader expression throughout the shell field (Fig. 6G: Fig. 6L; supplementary fig. S18, Supplementary Materials). Within these general domains, we found 5 distinct expression patterns (Fig. 7A and B). Based on the spatial-temporal location of these expression patterns, we identified three primary regions of the shell field: the outer edge, the inner edge, and the broader shell field (Fig. 7A and C).

Fig. 6
figure 6

Shell field expression of SMPs in Crepidula atrasolea at organogenesis and veliger stages. Using hybridization chain reaction, mRNA expression was detected for Ca211122 (A and B), Ca163409 (C and D), Ca94322 (E and F), CaSMP10 (G and H), CaSMP4 (I and J), and CaSMP12 (K and L). Hoechst is shown in gray and each SMP in cyan. ft, foot; sf, shell field; sfe, shell field edge; tn, tentacles; ve, velar lobes; ls, larval statocyst; lk, larval kidney. Scale bars in each represent 100 μm

Fig. 7
figure 7

Summary of regionalized SMP expression during larval shell development in Crepidula atrasolea. Summary table showing the lineage-restriction of each SMP, the best blast hit description, and shell field region(s) of expression (A). Circles in (A) indicate presence or absence of expression in the outer edge (outer Sf), inner edge (inner Sf), and/or the broader shell field (broad Sf). Half circles indicate presence of expression in one stage (late organogenesis or veliger) but not both. Cartoons depicting 5 different SMP expression patterns within the shell field of organogenesis and veliger stage C. atrasolea embryos (B). The SMPs are color coded in (A) based on their expression patterns shown in (B). Cartoons showing regionalization of the shell field into 3 potential zones of biomineralization in late organogenesis and veliger stage C. atrasolea embryo (C). These zones are highlighted in different colors: outer shell field edge in dark blue, inner shell field edge in light blue, and broader shell field in dark gray (C)

Restricted shell field edge expression of uncharacterized molluscan and gastropod-specific SMPs

We found restricted shell field edge expression of Ca211122, a gastropod-restricted c-type lectin domain-containing SMP, in C. atrasolea (Figs. 4 and 6A and B). In late organogenesis stage embryos, Ca21122 expression was strongest in a continuous band of cells along the shell field edge (Fig. 6A). During the veliger stages, Ca211122 persisted in the shell field edge and was weakly expressed in the dorsal-anterior portion of the shell field (Fig. 6B). We found another mollusc-specific uncharacterized SMP, Ca163409, with restricted expression in the shell field edge (Fig. 6C and D). In mid organogenesis stage embryos, the uncharacterized molluscan SMP, Ca163409, was expressed in a subset of ventral-laterally positioned shell field cells (Fig. 6C’). The most prominent expression was detected adjacent to the shell field edge (Fig. 6C’’). In veliger stage embryos, Ca163409 was expressed in a small discrete group of cells located within the anterior portion of the shell field, adjacent to the shell field edge (Fig. 6D). We noted variability in the number of shell field cells labeled between specimens, and also noted anterior expression in the head of one individual (supplementary fig. S20, Supplementary Materials).

Restricted shell field edge expression of molluscan-specific calmodulin SMP

We identified a molluscan-specific calmodulin-like 7 gene, Ca94322, with strong expression in one ring of cells around the shell field edge of the larval shell field (Fig. 6E). Ca94322 expression diminished in cells further from the shell field edge (Fig. 6E’’). At the veliger stage, the most intense expression was seen within a continuous band of Ca94322-positive cells between the outer and inner shell field edge (Fig. 6F and F’’). During the veliger stages, Ca94322 expression expanded to the anterior portion of the foot, and diminished levels were detected throughout the head of the embryo (Fig. 6F). The shell field edge-specific patterns of Ca94322 were similar to those reported in P. fucata [66] and H. asinina [67, 68]. These P. fucata and H. asinina calmodulin genes are not orthologs to Ca94322, but were instead placed in another calmodulin orthogroup with Calmodulin-A, CaSMP20.

Broad shell field expression of SMPs from metazoan orthogroups

Mucin and mucin-like genes are cell surface glycoproteins [69]. Mucins are secreted by epithelial cells and aid biomineralization of vertebrate bones, teeth, and cartilage [70]. Mucin-like genes have been previously identified in molluscan mantle tissue, and are hypothesized to aid shell biomineralization [71,72,73]. In C. atrasolea, we identified two mucin-like genes, CaSMP10 (Fig. 6G: Fig. 6H) and CaSMP9 (supplementary fig. S18, Supplementary Materials), in the metazoan orthogroup (Fig. 4). We found that these genes are expressed throughout shell field cells during organogenesis and veliger stages (Fig. 6G: Fig. 6H). While expression is distributed throughout the shell field, the brightest signal is detected in the dorsal-anterior cells of the veliger shell field (Fig. 6H’’).

We found four coiled-coil domain-containing SMPs (CaSMP3, CaSMP4, CaSMP5, and CaSMP12) in a metazoan orthogroup, with patchy expression throughout the shell field at both the late organogenesis and veliger stages (Figs. 6I and 6L: supplementary fig. S18, Supplementary Materials). In the late organogenesis stages, CaSMP4 & CaSMP12 expressing cells were also located in the statocyst (as seen in Fig. 6I and K). While there was some individual variability in the intensity of signal, we also observed bright expression in a band of squamous cells in the shell field edge during organogenesis (Fig. 6I’ and Fig. 6K’). In the veliger stages, patchy expression of these genes persisted throughout the shell field, with a brighter, continuous band of cells labeled in the shell field edge (Fig. 6J’’’; Fig. 6L’’’). CaSMP4 positive cells were highly concentrated in an anterior-ventral patch within the shell field where the shell is curved to one side (Fig. 6J).

Co-expression of CaSMP3 and CaSMP20 mRNA reveals three distinct shell field cell populations and confirms shell field regionalization during larval development

To confirm that different regions of SMP expression (e.g. broader vs. shell field edge specific regions) correspond to distinct shell field cell populations, we examined the spatial-temporal expression of CaSMP3 (broader shell field domain) and CaSMP20 (shell field edge specific region) throughout larval shell development (Fig. 8). We confirmed that CaSMP20 and CaSMP3 have two discrete regions and one overlapping region of expression in late ovoid to veliger staged embryos (Fig. 8). We identified at least three shell field cell populations during larval development: a CaSMP3 + population in the shell field, a CaSMP20 + population lining the edge of the shell field, and a CaSMP3+/CaSMP20 + population in between.

Fig. 8
figure 8

Co-expression of CaSMP3 and CaSMP20 mRNA throughout embryonic shell development in Crepidula atrasolea. Right lateral (A), left lateral (B), and posterior views (C) of late ovoid embryos show co-expression of CaSMP3 (magenta) and CaSMP20 (green) in the shell field. These two expression domains continue during early organogenesis (D-F) and mid organogenesis (G-H). In the veliger stages, laterally asymmetric expression is most apparent in the shell field edge (I-J). Hoechst is shown in gray, CaSMP3 in magenta, and CaSMP20 in green. ft, foot; sf, shell field; sfe, shell field edge; tn, tentacles; ve, velar lobes; ls, larval statocyst. Scale bars each represent 100 μm unless otherwise specified

Specifically, at the late ovoid stage, posteriorly located CaSMP3 positive cells were surrounded by a fainter ring of CaSMP20 expression, with a small population of cells expressing both CaSMP3 and CaSMP20 (Fig. 8A: Fig. 8C). At the early organogenesis stage three cell populations are discernible by a patch of CaSMP3 expression in posterior end, a ring of CaSMP3 and CaSMP20 co-expression around the shell field, and a second, outer ring of CaSMP20 expression in the shell field edge (Fig. 8D: Fig. 8F). At this stage, these three cell populations were most visible on the posterior end (Fig. 8F’ and Fig. 8F’’). In mid organogenesis embryos, CaSMP3 expressing cells were located throughout the shell field, CaSMP3 and CaSMP20 co-expressing cells were concentrated to a band along the shell field edge, and CaSMP20 expressing cells were situated along a second band above the CaSMP3 and CaSMP20 positive cell population (Fig. 8G’ and Fig. 8H’). At the veliger stage, we saw a laterally asymmetric distribution of shell field cell populations (Fig. 8I and J). There was only one, thicker band of CaSMP20 positive cells on the left lateral side (Fig. 8I’’’), with the CaSMP3 and CaSMP20 co-expressing cell population restricted to a smaller patch of the shell field edge (Fig. 8I’’). In contrast, on the right lateral side there were two prominent bands of CaSMP20 expression within the shell field edge (Fig. 8J’’’). CaSMP3 and CaSMP20 were co-expressed in cells of the second, posterior-most band (Fig. 8J’’). The third cell population, marked by CaSMP3 expression, was located throughout the shell field (Fig. 8J’’’’).

We also observed larval expression of CaSMP3 and CaSMP20 outside of the shell field, with higher levels of CaSMP20 expression in anterior and ventral tissues compared to CaSMP3 (Fig. 8). From the late ovoid to veliger stages, the CaSMP20 signal was detected faintly in patches throughout the embryo and more prominently in three non-shell field structures: the posterior edge of the foot rudiment, the head vesicle, and the velar lobes (Fig. 8A: Fig. 8F). Apart from the shell field, CaSMP3 was only expressed in the larval statocysts (Fig. 8A: Fig. 8H). Taken together, CaSMP3 and CaSMP20 exhibit co-expression in the shell field and mark SMP expressing cell populations during shell development in C. atrasolea. Given the different SMP expression patterns detected (Fig. 7B), there are at least three different cell populations in the larval shell field of C. atrasolea.

Discussion

C. fornicata and C. atrasolea: complementary models for studying molluscan biomineralization

In our previous study [19], we used the large-bodied species C. fornicata to extract peptides from its shell and perform differential gene expression analysis on different adult organs to determine which shell matrix proteins are highly expressed in the mantle. We also performed a preliminary in situ hybridization screen to determine which of these highly-expressed SMPs were expressed earlier in the larval shell field [19]. But to make a more comprehensive analysis of SMP expression in development–and to ultimately knock out specific SMPs in the future–we chose to switch to C. atrasolea [20, 22]. While C. fornicata is an indirect developer and has a long generation time (> year), C. atrasolea is a direct-developer with a generation time of just  4 months. Even a modest colony (several dozen mature reproductive females and an equal number of younger males) can provide embryos year-round in a recirculating sea water system [20, 22], making it the more practical species for day to day use.

We found comparable expression patterns of the gastropod-specific SMP1 gene in the shell field of both species, which suggests that previous finds in C. fornicata can be applied to work in C. atrasolea. We also found 10 SMPs that are expressed in the adult stage of C. fornicata [19], that are also expressed during larval shell development in C. atrasolea, suggesting shared SMP expression in the adult mantle and larval shell field. Leveraging data from both species, we provide a comprehensive analysis of larval SMPs and build a foundation of knowledge that can be used to design future studies. For example, these two species could be used to study what, if any, aspects of SMP expression are related to the direct versus indirect life cycle. Or, SMPs could be functionally perturbed, since both species are amenable to CRISPR/Cas9 genome editing [20,21,22].

“Adult SMPs” are also expressed during larval shell development in C. atrasolea

Proteomic investigations using larval shells in bivalves revealed drastically different repertoires of SMPs during larval and adult shells. Zhao et al. (2018) identified only 4 SMPs that were shared between larval and adult bivalves; Carini et al. (2019) found 45 larval SMPs with only four sequences sharing similar domains with their adult complement; and Cavallo et al. (2022) identified only 1 out of 5 adult SMPs with larval expression in the Antarctic bivalve Laternula elliptica. By contrast, we found many SMPs that are expressed in adult and larval stages in C. fornicata and C. atrasolea. There are a few possible explanations for the discrepancy between our results and studies in bivalves. First, adult and larval biomineralization might be more similar in gastropods than they are in bivalves. Second, it might reflect the different methods used to identify SMPs: we have not sequenced larval shell field tissue or performed proteomics on the larval shell, thus missing larval-specific SMPs. More research dedicated to comparisons of larval and adult biomineralization will be very useful, especially if the same methods and analyses are applied to both stages, in the same species.

Previous approaches underestimate evolutionary conservation of SMP orthogroups

Understanding the phylogenetic relationships of SMPs is essential for tracing the evolution of biomineralization in marine invertebrates. Within the past two decades, a surprising outcome from proteomic investigations of biomineral structures is the preponderance of lineage-restricted shell matrix proteins [74]. Examples in bivalves include 52% in Pinctada maxima, 54% in Mya truncata, and 66% Mytilus edulis of SMPs had no BLAST hit [75]. These observations have been informed in part by BLAST pairwise similarity E-value scores against large sequence databases, under the assumption that high pairwise similarity-scoring SMPs may have evolved more recently because they lack homologous reciprocal BLAST hits. Indeed, close examination of the sequences of lineage-restricted SMPs has shown that they contain long stretches of repetitive, low complexity regions [40], which are generally a hallmark of recently-diverging genes [76]. Likewise, in our previous study we took a BLAST approach and determined that 29% of C. fornicata SMPs had no best-reciprocal BLAST hit against sequences in GenBank, and that many of these SMPs contain long stretches of repetitive low complexity domains [19].

However, a lack of reciprocal BLAST hit is not always a reliable measure of lineage restriction or recent divergence events, as it can be an artifact due to homology detection failure in heuristic BLAST searches, which result in high numbers of false-negatives, especially in short and rapidly evolving genes [42]. This is due to the BLAST algorithm placing greater emphasis on pairwise-similarity scores that are directly correlated with gene-length and without regard to variable sequence evolution. This detection failure suggests that previously classified ‘species-restricted’ SMPs that were assessed through BLAST approaches could have homology outside of their own lineages, but due to their highly divergent sequences, their degree of similarity does not meet a certain threshold by BLAST standards. Reassessment of previous classifications using an orthology inference approach is needed to confirm these observations.

A second explanation for homology detection failure is the underrepresentation of species in GenBank. We observed a large number of C. fornicata SMPs are shared by two other Crepidula species: C. navicella and C. atrasolea. These two species were found to have orthologs for 91% (C. atrasolea) and 72% (C. navicella) of C. fornicata SMPs. Without these two species, the number of fornicata-restricted SMPs would increase from 5 to 23%, the latter being similar to our previous findings that 29% of C. fornicata SMPs had no BLAST hits. This discrepancy could be explained by underrepresented taxa, especially from related species, being absent in BLAST databases like GenBank. This leaves open the possibility that other underrepresented taxa, in addition to gene-length-bias, may result in homology detection failure. These results underscore the importance of broad taxon sampling, and data accessibility, for future studies of molluscan biomineralization genes. In the course of this study we constructed a new, comprehensive C. atrasolea transcriptome that should aid future work. Likewise, our Orthofinder dataset will be useful for studies of biomineralization in other molluscan systems.

Crepidula SMPs have regionalized shell field expression

Molluscan mantle modularity has been widely observed in adult and juvenile tissue [32, 33, 36, 66, 77, 78]. By contrast fewer studies focused on larval tissue [24, 26, 65, 79]. During larval shell development of Lymnaea stagnalis, Herlitze et al. (2018) detected regionalized expression of SMPs to specific zones of the shell field. Similarly, in C. atrasolea we found distinct regions of SMP expression: broad expression throughout the shell field, and more restricted expression in the shell field edge. Through HCR and confocal imaging, we were able to attain a high cellular and subcellular resolution of the mRNA localization, and identified additional domains of SMP expression within the shell field. Given that the SMPs we examined were initially found in adult tissue, we hypothesize that the larval shell field regionalization of SMPs may correspond to similar regions in the adult mantle.

Different cell populations of the shell field have been identified in the bivalve Crassostrea gigas [79] and in multiple gastropod species [24, 26, 65, 80]. In the whelk Tritia obsoleta, a band of cells with high proliferation rate occupies the outermost portion of the shell field, which we refer to as the “shell field edge”, also known as the “leading edge” or “aperture growth zone” [26]. We examined the spatial-temporal expression of multiple SMPs within the same individual to identify distinctive SMP-expressing cell populations. We identified CaSMP3 expressing cells located throughout the shell field, CaSMP3 + and CaSMP20 + cells in the inner edge, and CaSMP20 expressing cells in the outer edge. Interestingly, we also found CaSMP3 + cells in the larval statocysts, small sensing organs that contain calcified stones known as statoliths. Sleight (2023) recently hypothesized that non-skeletal biomineralizing cells, such as those found in the statocyst, may represent an evolutionary precursor to biomineralizing cells in the molluscan shell field. The expression of CaSMP3, CaSMP5, CaSMP12, and CaSMP4 in the statocysts of C. atrasolea larvae supports this hypothesis, and suggests that there are a subset of conserved SMPs shared between the statocyst and shell field. Our study provides a characterization of different SMP-expressing cell populations, and SMP regionalization within the shell field of C. atrasolea. To determine how cellular diversity contributes to the evolution of shell biomineralization, SMP-expressing cell populations should be compared in molluscs with diverse biomineralized structures.

Evolutionary lineage of SMPs partly explains shell field modularity

One way by which shell diversification could have arisen is through the incorporation of novel SMPs, or co-option of conserved SMPs, in the biomineralization gene regulatory network. SMPs that have homologs outside of molluscs, for example in cnidarians or vertebrates, are considered ‘highly conserved’ genes likely inherited from a distant last common ancestor of metazoans, or bilaterians, respectively. Because the functions of these highly conserved genes have often been characterized in genetic model organisms, we know that their proteins mediate fundamental cellular processes like calcium regulation (e.g. calmodulin) or bicarbonate production (e.g. carbonic anhydrase) that may have been co-opted for biomineralization functions [2, 17, 30, 40, 41, 66, 81]. We found that the majority of C. atrasolea SMPs broadly-expressed in the shell field belonged to metazoan orthogroups (n = 6/7), and the majority of SMPs with a more restricted shell field edge expression were lineage-restricted to Mollusca or Gastropoda (n = 4/5). Although it is a small sample size, the trend may suggest that SMPs in Gastropod-specific and Mollusca-specific orthogroups tend to be expressed in more specific regions of the shell field. There are exceptions to this trend; for example, SMP20 is a metazoan SMP that was found restricted to the shell field edge, and SMP17 is a gastropod-restricted SMP, with a broader shell field expression domain. It is important to note that we have yet to fully characterize all species-restricted SMPs to understand how their evolutionary lineage affects shell regionalization. Our preliminary data show that the shell field regionalization of C.fornicata-specific CfSMP6 (OG0088976) varied between different larval stages (supplementary fig. S21, Supplementary Material). The expression of CfSMP6 was restricted to fewer shell field regions at the veliger stage compared to the late organogenesis stage of larval development (supplementary fig. S21, Supplementary Material). Although this represents only one species-restricted SMP, this may suggest dynamic regionalization of species-restricted SMPs during larval development. Based on our data, we hypothesize that highly conserved, metazoan SMPs involved in fundamental biomineralization processes [2], tend to be expressed in multiple shell field regions (i.e., broader shell field and shell field edge expression) compared to more recently-diverged, lineage-restricted SMPs, which may be restricted to singular shell field regions (e.g., shell field edge). On the other hand, conserved SMPs with specific shell field and/or restricted shell field edge expression may have adopted specialized roles in molluscan biomineralization by acquiring new expression domains. To test this hypothesis, future work can now compare the expression of conserved metazoan SMPs we have identified in Crepidula across multiple metazoan phyla.

Examining both the evolutionary age and regionalization of SMPs during larval shell development allows us to more precisely test hypotheses about the mechanisms that underpin mollusc shell divergence. Together, our findings point to the possibility that other mechanisms, such as changes to the modular expression of SMPs, also contribute to shell diversification. Future studies should prioritize broad taxon sampling, interspecies comparisons, and data accessibility.

Conclusions

Here we present an evolutionary and developmental framework to allow future interspecies comparisons of molluscan biomineralization. We used orthogroup analysis to determine the evolutionary lineage of previously identified adult SMPs in C. fornicata, and found only 5% of those SMPs to be species-restricted. This small percentage suggests that there may be other drivers of the molluscan shell diversification beyond novel or species-specific SMPs. HCR analysis of a subset of SMPs in C. atrasolea larvae demonstrates expression in both the adult mantle and the larval shell field, suggesting that the adult and larval SMP complements may be more similar than those reported in other taxa. These findings highlight similarities in the SMP repertoires of adult and larval biomineralization. We also show that both highly conserved and lineage-restricted SMPs show distinct shell regionalization patterns; conserved SMPs tend to have broader expression throughout the shell field, while lineage-restricted SMPs show more restricted expression in the shell field edge. Together, these results suggest that the evolutionary age of an SMP may influence the modular shell field expression of SMPs during shell field development. Future gene perturbation experiments can use the Crepidula SMPs we have described to test the function of specific SMPs in biomineralization. Our work highlights the importance of integrating knowledge across taxa and developmental stages to better understand the role of SMPs in molluscan biomineralization and its diversification.

Data availability

The IsoSeq raw reads used to assemble the C. atrasolea transcriptome are currently available through NCBI’s SRA database under BioProject PRJNA1018959, and the assembled hybrid transcriptome was deposited to DDBJ/ENA/GenBank under the accession GKTO00000000. The version described in this paper is the first version, GKTO01000000. Orthofinder, Kinfin, and their accompanying input files and commands for job submission are available through Dryad (https://doi.org/10.5061/dryad.zpc866tf1). The SMP sequences used to design HCR probes are available through NCBI's GenBank (accession numbers PP639282- PP639299).

References

  1. Knoll AH. Biomineralization and evolutionary history. Rev Mineral Geochem. 2003;54(1):329–56.

    Article  CAS  Google Scholar 

  2. Gilbert PUPA, Bergmann KD, Boekelheide N, Tambutté S, Mass T, Marin F, et al. Biomineralization: integrating mechanism and evolutionary history. Sci Adv. 2022;8(10):eabl9653.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  3. Sleight VA. Cell type and gene regulatory network approaches in the evolution of spiralian biomineralisation. Brief Funct Genomics. 2023;22(6):509–16.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  4. Chateigner D, Hedegaard C, Wenk HR. Mollusc shell microstructures and crystallographic textures. J Struct Geol. 2000;22(11):1723–35.

    Article  Google Scholar 

  5. McDougall C, Aguilera F, Degnan BM. Rapid evolution of pearl oyster shell matrix proteins with repetitive, low-complexity domains. J R Soc Interface. 2013;10(82):20130041.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Kocot KM, Aguilera F, McDougall C, Jackson DJ, Degnan BM. Sea shell diversity and rapidly evolving secretomes: insights into the evolution of biomineralization. Front Zool. 2016;13:23.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Williams ST. Molluscan shell colour. Biol Rev Camb Philos Soc. 2017;92(2):1039–58.

    Article  PubMed  Google Scholar 

  8. Murdock DJE. The biomineralization toolkit and the origin of animal skeletons. Biol Rev Camb Philos Soc. 2020;95(5):1372–92.

    Article  PubMed  Google Scholar 

  9. Varney RM, Speiser DI, McDougall C, Degnan BM, Kocot KM. The Iron-Responsive Genome of the Chiton Acanthopleura granulata. Genome Biol Evol. 2021;13(1). https://doi.org/10.1093/gbe/evaa263

  10. Varney RM, Yap-Chiongco MK, Mikkelsen NT, Kocot KM. Genome of the lepidopleurid chiton Hanleya Hanleyi (Mollusca, Polyplacophora). F1000Res. 2022;11:555.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  11. Arias JL, Fernández MS. Polysaccharides and proteoglycans in calcium carbonate-based biomineralization. Chem Rev. 2008;108(11):4475–82.

    Article  PubMed  CAS  Google Scholar 

  12. Marin F. Mollusc shellomes: past, present and future. J Struct Biol. 2020;212(1):107583.

    Article  PubMed  CAS  Google Scholar 

  13. Falini G, Albeck S, Weiner S, Addadi L. Control of Aragonite or Calcite Polymorphism by Mollusk Shell Macromolecules. Science. 1996;271(5245):67–9.

    Article  Google Scholar 

  14. Thompson JB, Paloczi GT, Kindt JH, Michenfelder M, Smith BL, Stucky G, et al. Direct observation of the transition from calcite to aragonite growth as induced by abalone shell proteins. Biophys J. 2000;79(6):3307–12.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  15. Xie J, Liang J, Sun J, Gao J, Zhang S, Liu Y, et al. Influence of the Extrapallial Fluid of Pinctada fucata on the Crystallization of Calcium Carbonate and Shell Biomineralization. Cryst Growth Des. 2016;16(2):672–80.

    Article  CAS  Google Scholar 

  16. Song X, Liu Z, Wang L, Song L. Recent advances of shell matrix proteins and cellular orchestration in marine molluscan shell biomineralization. Front Mar Sci. 2019;6. https://doi.org/10.3389/fmars.2019.00041/full

  17. Aguilera F, McDougall C, Degnan BM. Co-option and De Novo Gene Evolution Underlie Molluscan Shell Diversity. Mol Biol Evol. 2017;34(4):779–92.

    PubMed  PubMed Central  CAS  Google Scholar 

  18. Kocot KM, McDougall C, Degnan BM. Developing perspectives on molluscan shells, part 1: introduction and molecular biology. Physiology of Molluscs. Apple Academic; 2017. pp. 17–58.

  19. Batzel GO, Moreno BK, Lopez LS, Nguyen CK, Livingston BT, Joester D, et al. Proteomic and transcriptomic analyses in the Slipper snail Crepidula fornicata uncover Shell Matrix genes expressed during adult and larval biomineralization. Integr Org Biol. 2022;4(1):obac023.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  20. Henry JQ, Lesoway MP, Perry KJ, Osborne CC, Shankland M, Lyons DC. Beyond the sea: Crepidula Atrasolea as a spiralian model system. Int J Dev Biol. 2017;61(8–9):479–93.

    Article  PubMed  CAS  Google Scholar 

  21. Perry KJ, Henry JQ. CRISPR/Cas9-mediated genome modification in the mollusc, Crepidula fornicata. Genesis. 2015;53(2):237–44.

    Article  PubMed  CAS  Google Scholar 

  22. Lyons DC, Henry JQ. Slipper snail tales: how Crepidula fornicata and Crepidula atrasolea became model molluscs. Curr Top Dev Biol. 2022;147:375–99.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  23. Kniprath E. Ontogeny of the molluscan shell field: a review. Zool Scr. 1981;10(1):61–79.

    Article  Google Scholar 

  24. Herlitze I, Marie B, Marin F, Jackson DJ. Molecular modularity and asymmetry of the molluscan mantle revealed by a gene expression atlas. Gigascience. 2018;7(6). https://doi.org/10.1093/gigascience/giy056

  25. Wollesen T, Scherholz M, Rodríguez Monje SV, Redl E, Todt C, Wanninger A. Brain regionalization genes are co-opted into shell field patterning in Mollusca. Sci Rep. 2017;7(1):5486.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Johnson AB, Fogel NS, Lambert JD. Growth and morphogenesis of the gastropod shell. Proc Natl Acad Sci U S A. 2019;116(14):6878–83.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  27. Lee SW, Jang YN, Ryu KW, Chae SC, Lee YH, Jeon CW. Mechanical characteristics and morphological effect of complex crossed structure in biomaterials: fracture mechanics and microstructure of chalky layer in oyster shell. Micron. 2011;42(1):60–70.

    Article  PubMed  Google Scholar 

  28. Auzoux-Bordenave S, Brahmi C, Badou A, de Rafélis M, Huchette S. Shell growth, microstructure and composition over the development cycle of the European abalone Haliotis tuberculata. Mar Biol. 2015;162(3):687–97.

    Article  Google Scholar 

  29. Carini A, Koudelka T, Tholey A, Appel E, Gorb SN, Melzner F, et al. Proteomic investigation of the blue mussel larval shell organic matrix. J Struct Biol. 2019;208(3):107385.

    Article  PubMed  CAS  Google Scholar 

  30. Zhao R, Takeuchi T, Luo YJ, Ishikawa A, Kobayashi T, Koyanagi R, et al. Dual gene repertoires for larval and adult shells reveal molecules essential for Molluscan Shell formation. Mol Biol Evol. 2018;35(11):2751–61.

    PubMed  PubMed Central  CAS  Google Scholar 

  31. Cavallo A, Clark MS, Peck LS, Harper EM, Sleight VA. Evolutionary conservation and divergence of the transcriptional regulation of bivalve shell secretion across life-history stages. R Soc Open Sci. 2022;9(12):221022.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  32. Jackson DJ. Mantle Modularity underlies the plasticity of the Molluscan Shell: supporting data from Cepaea nemoralis. Front Genet. 2021;12:622400.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  33. Jolly C, Berland S, Milet C, Borzeix S, Lopez E, Doumenc D. Zona localization of shell matrix proteins in mantle of Haliotis tuberculata (Mollusca, Gastropoda). Mar Biotechnol. 2004 Nov-Dec;6(6):541–51.

  34. Marie B, Joubert C, Tayalé A, Zanella-Cléon I, Belliard C, Piquemal D, et al. Different secretory repertoires control the biomineralization processes of prism and nacre deposition of the pearl oyster shell. Proc Natl Acad Sci U S A. 2012;109(51):20986–91.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  35. Suzuki M, Saruwatari K, Kogure T, Yamamoto Y, Nishimura T, Kato T, et al. An acidic matrix protein, Pif, is a key macromolecule for nacre formation. Science. 2009;325(5946):1388–90.

    Article  PubMed  CAS  Google Scholar 

  36. Takeuchi T, Endo K. Biphasic and dually coordinated expression of the genes encoding major shell matrix proteins in the pearl oyster Pinctada Fucata. Mar Biotechnol. 2006;8(1):52–61.

    Article  CAS  Google Scholar 

  37. Funabara D, Ohmori F, Kinoshita S, Koyama H, Mizutani S, Ota A, et al. Novel genes participating in the formation of prismatic and nacreous layers in the pearl oyster as revealed by their tissue distribution and RNA interference knockdown. PLoS ONE. 2014;9(1):e84706.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Zhang Y, Mao F, Mu H, Huang M, Bao Y, Wang L, et al. The genome of Nautilus pompilius illuminates eye evolution and biomineralization. Nat Ecol Evol. 2021;5(7):927–38.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Mann K, Jackson DJ. Characterization of the pigmented shell-forming proteome of the common grove snail Cepaea nemoralis. BMC Genomics. 2014;15:249.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Marie B, Jackson DJ, Ramos-Silva P, Zanella-Cléon I, Guichard N, Marin F. The shell-forming proteome of Lottia gigantea reveals both deep conservations and lineage-specific novelties. FEBS J. 2013;280(1):214–32.

    Article  PubMed  CAS  Google Scholar 

  41. Feng D, Li Q, Yu H, Kong L, Du S. Identification of conserved proteins from diverse shell matrix proteome in Crassostrea gigas: characterization of genetic bases regulating shell formation. Sci Rep. 2017;7:45754.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  42. Weisman CM, Murray AW, Eddy SR. Many, but not all, lineage-specific genes can be explained by homology detection failure. PLoS Biol. 2020;18(11):e3000862.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  43. Goodheart JA, Rio RA, Taraporevala NF, Fiorenza RA, Barnes SR, Morrill K et al. A chromosome-level genome for the nudibranch gastropod Berghia stephanieae helps parse clade-specific gene expression in novel and conserved phenotypes. bioRxiv. 2023; https://doi.org/10.1101/2023.08.04.552006

  44. Bu L, Zhong D, Lu L, Loker ES, Yan G, Zhang SM. Compatibility between snails and schistosomes: insights from new genetic resources, comparative genomics, and genetic mapping. Commun Biol. 2022;5(1):940.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  45. Bai Y, Liu S, Hu Y, Yu H, Kong L, Xu C, et al. Multi-omic insights into the formation and evolution of a novel shell microstructure in oysters. BMC Biol. 2023;21(1):204.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  46. Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20(1):238.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Collin R. Phylogeny of the Crepidulaplana (Gastropoda: Calyptraeidae) cryptic species complex in North America. Can J Zool. 2000;78(8):1500–14.

    Article  Google Scholar 

  48. Mishkind M, Raikhel NV, Palevitz BA, Keegstra K. Immunocytochemical localization of wheat germ agglutinin in wheat. J Cell Biol. 1982;92(3):753–64.

    Article  PubMed  CAS  Google Scholar 

  49. Fodero LR, Sáez-Valero J, Barquero MS, Marcos A, McLean CA, Small DH. Wheat germ agglutinin-binding glycoproteins are decreased in Alzheimer’s disease cerebrospinal fluid. J Neurochem. 2001;79(5):1022–6.

    Article  PubMed  CAS  Google Scholar 

  50. Baddeley D, Crossman D, Rossberger S, Cheyne JE, Montgomery JM, Jayasinghe ID, et al. 4D super-resolution microscopy with conventional fluorophores and single wavelength excitation in optically thick cells and tissues. PLoS ONE. 2011;6(5):e20645.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  51. Bushmanova E, Antipov D, Lapidus A, Prjibelski AD. rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data. Gigascience. 2019;8(9). https://doi.org/10.1093/gigascience/giz100

  52. Fu L, Niu B, Zhu Z, Wu S, Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012;28(23):3150–2.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  53. Ryan JF. alien_index: identify potential non-animal transcripts in animal transcriptomes. Github; [cited 2023 Dec 3]. https://github.com/josephryan/alien_index

  54. Haas BJ, Papanicolaou A. TransDecoder 5.5. 0. GitHub; 2015.

  55. Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C, et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30(9):1236–40.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  56. Seppey M, Manni M, Zdobnov EM. BUSCO: Assessing Genome Assembly and Annotation Completeness. Methods Mol Biol. 2019;1962:227–45.

    Article  PubMed  CAS  Google Scholar 

  57. Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12(1):59–60.

    Article  PubMed  CAS  Google Scholar 

  58. van Dongen S. Graph clustering via a discrete uncoupling process. SIAM J Matrix Anal Appl, 30:121–141. SIAM J Matrix Anal Appl. 2008;30(1):121–41.

  59. Katoh K, Misawa K, Kuma KI, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30(14):3059–66.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  60. Price MN, Dehal PS, Arkin AP. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol. 2009;26(7):1641–50.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  61. Laetsch DR, Blaxter ML. KinFin: Software for Taxon-Aware analysis of clustered protein sequences. G3. 2017;7(10):3349–57.

  62. Kuehn E, Clausen DS, Null RW, Metzger BM, Willis AD, Özpolat BD. Segment number threshold determines juvenile onset of germline cluster expansion in Platynereis dumerilii. J Exp Zool B Mol Dev Evol. 2022;338(4):225–40.

    Article  PubMed  Google Scholar 

  63. Choi HMT, Schwarzkopf M, Fornace ME, Acharya A, Artavanis G, Stegmaier J et al. Third-generation in situ hybridization chain reaction: multiplexed, quantitative, sensitive, versatile, robust. Development. 2018;145(12). https://doi.org/10.1242/dev.165753

  64. Eyster LS, Morse MP. Early Shell formation during Molluscan Embryogenesis, with New studies on the surf clam, Spisula solidissima1. Integr Comp Biol. 2015;24(4):871–82.

    Google Scholar 

  65. Hohagen J, Jackson DJ. An ancient process in a modern mollusc: early development of the shell in Lymnaea stagnalis. BMC Dev Biol. 2013;13:27.

    Article  PubMed  PubMed Central  Google Scholar 

  66. Li S, Xie L, Zhang C, Zhang Y, Gu M, Zhang R. Cloning and expression of a pivotal calcium metabolism regulator: calmodulin involved in shell formation from pearl oyster (Pinctada fucata). Comp Biochem Physiol B Biochem Mol Biol. 2004;138(3):235–43.

    Article  PubMed  Google Scholar 

  67. Jackson DJ, McDougall C, Green K, Simpson F, Wörheide G, Degnan BM. A rapidly evolving secretome builds and patterns a sea shell. BMC Biol. 2006;4:40.

    Article  PubMed  PubMed Central  Google Scholar 

  68. Jackson DJ, Wörheide G, Degnan BM. Dynamic expression of ancient and novel molluscan shell genes during ecological transitions. BMC Evol Biol. 2007;7:160.

    Article  PubMed  PubMed Central  Google Scholar 

  69. Strous GJ, Dekker J. Mucin-type glycoproteins. Crit Rev Biochem Mol Biol. 1992;27(1–2):57–92.

    Article  PubMed  CAS  Google Scholar 

  70. Midura RJ, Hascall VC. Bone sialoprotein–a mucin in disguise? Glycobiology. 1996;6(7):677–81.

    Article  PubMed  CAS  Google Scholar 

  71. Marin F, Corstjens P, de Gaulejac B, de Vrind-De Jong E, Westbroek P. Mucins and molluscan calcification: molecular characterization of mucoperlin, a novel mucin-like protein from the nacreous shell layer of the fan mussel Pinna Nobilis (Bivalvia, Pteriomorphia). J Biol Chem. 2000;275(27):20667–75.

    Article  PubMed  CAS  Google Scholar 

  72. Marin F, Luquet G. Molluscan biomineralization: the proteinaceous shell constituents of Pinna nobilis L. Mater Sci Engineering: C. 2005;25(2):105–11.

    Article  Google Scholar 

  73. Zeng D, Guo X. Mantle Transcriptome provides insights into biomineralization and growth regulation in the Eastern Oyster (Crassostrea virginica). Mar Biotechnol. 2022;24(1):82–96.

    Article  CAS  Google Scholar 

  74. Clark MS. Molecular mechanisms of biomineralization in marine invertebrates. J Exp Biol. 2020;223(Pt 11). https://doi.org/10.1242/jeb.206961

  75. Arivalagan J, Yarra T, Marie B, Sleight VA, Duvernois-Berthet E, Clark MS, et al. Insights from the Shell Proteome: biomineralization to Adaptation. Mol Biol Evol. 2017;34(1):66–77.

    Article  PubMed  CAS  Google Scholar 

  76. Toll-Riera M, Radó-Trilla N, Martys F, Albà MM. Role of low-complexity sequences in the formation of novel protein coding sequences. Mol Biol Evol. 2012;29(3):883–6.

    Article  PubMed  CAS  Google Scholar 

  77. Sudo S, Fujikawa T, Nagakura T, Ohkubo T, Sakaguchi K, Tanaka M, et al. Structures of mollusc shell framework proteins. Nature. 1997;387(6633):563–4.

    Article  PubMed  CAS  Google Scholar 

  78. Zhang C, Zhang R. Matrix proteins in the outer shells of molluscs. Mar Biotechnol. 2006;8(6):572–86.

    Article  Google Scholar 

  79. Liu G, Huan P, Liu B. Identification of three cell populations from the shell gland of a bivalve mollusc. Dev Genes Evol. 2020;230(1):39–45.

    Article  PubMed  CAS  Google Scholar 

  80. Nederbragt AJ, van Loon AE, Dictus WJAG. Expression of Patella vulgata orthologs of engrailed and dpp-BMP2/4 in adjacent domains during molluscan shell development suggests a conserved compartment boundary mechanism. Dev Biol. 2002;246(2):341–55.

    Article  PubMed  CAS  Google Scholar 

  81. Le Roy N, Jackson DJ, Marie B, Ramos-Silva P, Marin F. Carbonic anhydrase and metazoan biocalcification: a focus on molluscs. Key Eng Mater. 2016;672:151–7.

    Article  Google Scholar 

Download references

Acknowledgements

We thank the Hamdoun Lab (UC San Diego) for the use of their confocal, and the Ozpolat Lab (Washington University) and the Katz Lab (UMass Amherst) for advice about adapting HCR to Crepidula embryos. We also thank J.D. Lambert and T.K. Dao (University of Rochester) for assembling and providing the Tritia obsoleta transcriptome that was used in our analysis.

Funding

This work was funded by a National Science Foundation Faculty Early Career Development (CAREER) Award [1943606] and a National Institute of General Medical Sciences Maximizing Investigators’ Research Award (MIRA) [R35GM133673] to DCL.

Author information

Authors and Affiliations

Authors

Contributions

RNLA, GOB, and DCL conceived and designed the study; RNLA, GOB, GR, YW, SN, and JAG collected data; RNLA, GOB, YW, SN, and JAG performed data analysis; RNLA, GOB, YW, MPL, JAG, and DCL contributed to data interpretation; RNLA, GOB, GR, YW, SN, JAG, and DCL read and approved of the final manuscript.

Corresponding author

Correspondence to Deirdre C. Lyons.

Ethics declarations

Ethics approval and consent of participants

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lopez-Anido, R.N., Batzel, G.O., Ramirez, G. et al. The adult shell matrix protein repertoire of the marine snail Crepidula is dominated by conserved genes that are also expressed in larvae. BMC Ecol Evo 24, 120 (2024). https://doi.org/10.1186/s12862-024-02237-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12862-024-02237-y

Keywords