Skip to main content
  • Research article
  • Open access
  • Published:

Unusual conservation among genes encoding small secreted salivary gland proteins from a gall midge



In most protein-coding genes, greater sequence variation is observed in noncoding regions (introns and untranslated regions) than in coding regions due to selective constraints. During characterization of genes and transcripts encoding small secreted salivary gland proteins (SSSGPs) from the Hessian fly, we found exactly the opposite pattern of conservation in several families of genes: the non-coding regions were highly conserved, but the coding regions were highly variable.


Seven genes from the SSSGP-1 family are clustered as one inverted and six tandem repeats within a 15 kb region of the genome. Except for SSSGP-1A2, a gene that encodes a protein identical to that encoded by SSSGP-1A1, the other six genes consist of a highly diversified, mature protein-coding region as well as highly conserved regions including the promoter, 5'- and 3'-UTRs, a signal peptide coding region, and an intron. This unusual pattern of highly diversified coding regions coupled with highly conserved regions in the rest of the gene was also observed in several other groups of SSSGP-encoding genes or cDNAs. The unusual conservation pattern was also found in some of the SSSGP cDNAs from the Asian rice gall midge, but not from the orange wheat blossom midge. Strong positive selection was one of the forces driving for diversification whereas concerted homogenization was likely a mechanism for sequence conservation.


Rapid diversification in mature SSSGPs suggests that the genes are under selection pressure for functional adaptation. The conservation in the noncoding regions of these genes including introns also suggested potential mechanisms for sequence homogenization that are not yet fully understood. This report should be useful for future studies on genetic mechanisms involved in evolution and functional adaptation of parasite genes.


Insect salivary glands are the main organs for producing proteins that are injected into hosts [1]. Plant-feeding insects, especially those with sucking mouthparts, inject proteins and other substances into host plants to facilitate mouthpart penetration, partially digest food before ingestion, and suppress plant defense [24]. Substances, including proteins with regulatory roles that can alter host physiology, are referred to as effectors [5]. Pathogens, including bacteria, fungi, oomycetes, and nematodes, deliver various effector proteins into host tissues [58]. Substantial evidence suggests that some of the salivary proteins injected into host plants by insects also act as effectors to suppress defense and/or reprogram physiological pathways of host plants [3, 5, 912]. Gall midges (Cecidomyiidae), a large family of plant-feeding insects, apparently secrete effectors into host tissues, inducing various forms of plant outgrowth (galls) and altering other aspects of host physiology [13, 14]. Plant galls contain a zone of "metabolic habitat modification" in which the parasite experiences a selective advantage because of enhanced nutrition and reduced plant defense [15]. Several organic compounds and enzymes injected into host plants by galling insects have been identified, including amino acids, auxin, proteases, oxidases, and pectinases [13], but the general composition of the proteins delivered into host plants by gall midges has not yet been fully characterized.

The Hessian fly, Mayetiola destructor, is the most destructive insect pest of wheat worldwide [16]. Because of its importance in agriculture, intriguing behavior, ease of maintenance in culture, and relatively well-characterized genetics, Hessian fly is becoming a model species for studying insect-plant interactions [17, 18]. Hessian fly does not induce the formation of an outgrowth gall, but nutritive cells with similarity to those inside macroscopic galls are formed at the larval feeding site [19]. Larvae do not cause extensive tissue damage to host plants, with their specialized mandibles making only a pair of small holes [19, 20]. Nevertheless, wheat plants become permanently and irreversibly stunted after 4-5 days of feeding by a single larva [9]. Even if larvae are removed, growth of wheat seedlings cannot be restored [9, 20], suggesting that larvae inject substances into host plants that dramatically alter biochemical and physiological pathways of the attacked plant [21, 22].

As the first step to identify some of those proteins that are injected into host plants, we have previously generated numerous ESTs from cDNAs derived from dissected salivary glands of Hessian fly first instar larvae [23, 24]. The majority of the salivary gland transcripts encode small proteins (50 to 200 amino acids) with typical secretion signal peptides at the N-termini. We refer to these proteins as "small secreted salivary gland proteins" (SSSGPs). Here we report unusual conservation patterns of SSSGP-encoding genes and we discuss potential mechanisms for gene evolution and functional adaptation.

Results and Discussion

Unconventional conservation of SSSGP-encoding genes

The SSSGP-1 gene family includes seven members and is clustered as one inverted and six tandemly repeated genes within a 15 kb region of the genome (Figure 1A). The predicted structures of the genes were verified by comparing the genomic sequences with cDNA clones corresponding to genes SSSGP-1A, SSSGP-1B1, SSSGP-1C1 and SSSGP-1D1 (a cDNA for SSSGP-1E1 has yet to be identified). All seven genes have a common structure, including a conserved putative promoter region, a 5'-untranslated region (5'-UTR), a signal peptide-coding region (SPCR), an intron, a mature protein-coding region (MPCR), and a 3'-untranslated region (3'-UTR; Figure 1B). Intergenic regions are small, ranging from 0.2 to 1 kB (Genbank accession: GU196316). Among the seven genes, SSSGP-1A2, present in the inverted repeat, was apparently recently duplicated and encodes an identical protein with SSSGP-1A1. The other six genes consist of highly diversified MPCRs as well as highly conserved regions, including the promoter region, 5'- and 3'-UTRs, SPCR, and the intron (Figure 1B, Additional file 1, Figure S1A). The predicted proteins are almost identical in their putative signal peptides, but share little similarity among the mature proteins (Figure 1C). This extreme pattern of diversification in MPCR, which we refer to here as super-diversification, coupled with strong conservation in other regions was also observed in several other groups of SSSGP-encoding genes (Additional file 1, Figure S1) or cDNAs from Hessian fly (Table 1, Additional file 2, Figure S2).

Figure 1
figure 1

Genomic organization and structural comparison of the Hessian fly SSSGP-1 family members. A: SSSGP-1 family members derived by sequencing a BAC clone made from biotype GP. B: Nucleotide sequence comparison of the seven SSSGP-1 genes. Comparisons were divided into the promoter region (Promoter), 5'-untranslated region (5'-UTR), signal peptide coding region (SPCR), an intron, mature protein coding region (MPCR), and 3'-untranslated region (3'-UTR). The numbers in boxes are average scores and score range (in parentheses) derived from pair-wise comparisons of all possible combinations of the genes (see Materials and Methods). Red color indicates conserved regions. Blue color indicates diversified regions. The lowest score for any pairwise comparison in the MPCR was 13. Unrelated random sequences can produce scores as high as 15. The actual alignments of these genes are shown in Additional file 1, Figure S1A. C: Sequence alignments of putative proteins. Dashes represent gaps in the sequence alignments. The first 18 amino acids constitute a putative signal peptide.

Table 1 Similarity of different regions among cDNAs from different gene groups

Except for the common features of diversification/conservation, there are no noticeable sequence or structural similarities between the different groups of SSSGP genes, and no apparent sequence similarities could be detected among different groups with currently available alignment methods such as BLAST. Most groups of SSSGP genes contain one intron (Additional file 1, Figures S1A, S1C, S1D). However, one group lacks introns (Additional file 1, Figure S1B) and several other groups contain multiple introns (Additional file 1, Figure S1E). For those genes containing introns, the first (or the sole) intron is located either at the boundary between the SPCR and MPCR, or within the SPCR (Additional file 1, Figure S1). The positions of intron/exon boundaries are generally conserved among members within a group. However, deletions or shifts in intron/exon boundaries occur in gene groups with multiple introns (Additional file 1, Figure S1E). For all gene groups, multiple members in each group are clustered within short chromosome regions in the Hessian fly genome (Additional file 3, Figure S3).

To determine if such a genetic phenomenon exists in other gall midges, a similar analysis of salivary gland cDNAs was conducted on two other related insects, the orange wheat blossom midge (Sitodiplosis mosellana) and the Asian rice gall midge (Orseolia oryzae). Approximately 8,500 cDNAs from the wheat blossom midge and 3,500 from the Asian rice gall midge were sequenced. In each case, a similar proportion (45-50%) of cDNA clones was found encoding different SSSGPs. Forty-eight different groups of putative SSSGPs were identified from the wheat blossom midge while 25 different groups of putative SSSGPs were identified from the Asian rice gall midge. Comparative analysis revealed that cDNAs and their encoded proteins from the Asian rice midge, wheat blossom midge, and Hessian fly were typically found to be species-specific; cDNAs from one species shared no detectable sequence similarity with those from the other two species, consistent with the rapidly evolving nature of SSSGP-encoding genes. The species-specific nature of SSSGP-encoding genes was further confirmed by PCR and by Southern blot analysis. No PCR amplification could be achieved using primer pairs designed according to cDNAs from another species. Similarly, no cross hybridization could be observed on Southern blots using cDNA probes from a different species (data not shown). The typical unconventional conservation pattern of SSSGP-encoding genes observed in Hessian fly was also found in some of the SSSGP-encoding transcripts of the Asian rice midge (Additional file 2, Figure S2G), but not in any transcripts of the wheat blossom midge. This observation indicates that the unconventional conservation of SSSGP-encoding genes might be linked to adaption to environmental changes such as a change in host plants. Even though they live on different plant species, the Asian rice midge and Hessian fly larvae share a similar feeding mechanism. Larvae of both species feed on the meristem of a leaf-sheath within a plant, and their survival strictly depends on their ability to induce the formation of nutritive cells of plant tissue at the feeding site, to inhibit plant growth, and to suppress host defense [17, 19, 25]. Wheat blossom midges, on the other hand, feed on developing wheat seeds and either do not require extensive manipulation of host plants such as growth inhibition [19], or manipulate host plants in different ways.

Several genes from different mosquito species have been found encoding diverse secreted salivary proteins and some of these genes are also organized as tandem repeats [26]. Diverse toxic small peptides have been found in the venoms of predatory cone snails [27]. However, the extreme cases described here with a very short (100 to 500 bp), highly diversified segment followed by a very short (~500 bp), highly conserved segment arranged as multiple tandem repeats has not been found in any other organisms.

Strong positive selection on SSSGPloci and alleles

Strong positive (diversifying) selection appears to be one of the forces driving diversifications in MPCRs. Highly diversified members with less than 80% sequence identity within MPCRs did not produce meaningful alignments for analyzing nonsynonymous to synonymous substitution ratio (dN/dS), but the fact that the coding regions are hard to align is itself evidence for fast evolution by positive selection or other mechanisms such as Y-family polymerases [28]. Analysis of moderately diversified group members with 80 to 95% sequence identity in their MPCRs all yielded dN/dS above one (Table 2, Additional file 4, Figure S4). One pair of group members produced a dN/dS ratio above 18, indicating very strong positive selection. Due to the small size, similar sequences with greater than 95% sequence identity within MPCRs did not possess sufficient nucleotide substitutions to confidently discern evolutionary patterns through analyzing dN/dS. However, a different analysis of similar sequences derived from different alleles also produced strong evidence for positive selection (below).

Table 2 Evidence for positive selection on SSSGP group members

Multiple transcripts corresponding to genes SSSGP-1A1, SSSGP-1B1, and SSSGP-1C1 were isolated from three different Hessian fly populations. These different transcripts were likely derived from different alleles since evidence from in situ hybridization, Southern blots with genomic and BAC DNA samples, and primer specific PCR suggests a single locus for this gene family (Additional file 5, Figure S5). The ratio between nonsynonymous and synonymous substitutions was 1.5 or more within the MPCR, but less than 0.9 in the SPCR (Table 3, Additional files 6 and 7), again indicating positive selection in MPCRs for different alleles.

Table 3 Evidence for positive selection on different alleles (Additional file 6, Figure S6)

Evidence for positive selection is not common but has been demonstrated at several different types of genes controlling interactions between organisms that are mediated by molecular recognition. Typical examples are defense-related genes including the major histocompatibility complex [29], immunoglobulins [30], defensins [31], plant resistance genes [32], plant chitinase genes [33], and pathogen effector genes [34]. The strong positive selection observed in SSSGP-encoding genes indicated that SSSGPs are also likely involved in interactions between Hessian fly and other organisms. Considering that Hessian fly larvae live within host plants, some of these SSSGPs may be secreted into host plants as effector proteins with a role in the insect's virulence. In plant-herbivore interactions, successful pathogens and parasitic arthropods not only require a large number of genes coding for effector proteins to suppress innate defense of host plants [35], but also require the ability to change this arsenal in response to shifts in the host population [36]. Evolution of plant populations in parasite recognition and surveillance systems thus provides strong selection for counter changes in effector proteins from parasites [36, 37]. The Hessian fly has been very successful in adaptation to changes in host plant populations [16, 17]. The super-diversification in SSSGP genes may have provided the genetic basis for the development of counter-resistance in Hessian fly in response to changes in host plants.

Concerted homogenization of noncoding regions

Very strong selection for divergence could account for rapid divergence of MPCR but the high homology of the other regions of the genes is difficult to explain. Recombination between gene-family members, particularly those arranged in tandem arrays, acts to homogenize their sequences so they evolve in a concerted fashion [38, 39]. Typically, however, this homogenization occurs throughout the whole gene and even the intergenic regions, not just specific domains in the genes. While crossover events would tend to homogenize the whole array, smaller gene conversion events might homogenize smaller regions. Little is known about recombination in gall midges, but conversion tracts at the Rosey locus of Drosophila have been found to be in the order of a few hundred base pairs [40]. Differences in sequence affinity among the various sub-regions of the SSSGP-1 family members corroborate frequent recombination in short DNA regions during Hessian fly evolution (Figure 2). The homogenization could be confined to termini of the genes if the conversion events were initiated near the ends of the genes or in intergenic regions. The nature of recombination hotspots varies between species [41], but they are commonly initiated intergenically [42], possibly at specific sequence motifs [43] or regulatory regions. Sequence heterogeneity in the MPCR due to strong positive selection could, in turn, affect the length of conversion tracks or how the recombination intermediates are resolved; conversion or crossover events [44]. If the sequence homogeneity of the SSSGP-encoding families was caused by concerted evolution from short conversion tracks initiated in the flanking regions, one would expect introns in the middle of the larger genes to be less homogenized. This is in fact what was observed in the SSSGP-2 family; noticeably, several introns (introns 22, 23, 26, 27, 35, 36, 37) were rearranged or deleted (Additional file 1, Figure S1E). The coding regions of the two SSSGP-2 family members correspond to approximately 950 nucleotides with 35 introns.

Figure 2
figure 2

Phylogenetic tree for different regions of SSSGP-1 family members inferred using the Neighbor-joining method implemented in MEGA.

To explore whether functional adaptation might explain conservation of certain regions of gene families [45], we analyzed the patterns of transcript levels corresponding to specific genes under different conditions (Figure 3). In general, SSSGPs with higher sequence similarity in the promoter regions had more similar patterns of gene expression (Figures 2, 3). SSSGP-1A1, SSSGP-1A2, SSSGP-1B1, and SSSGP-1C1, whose promoters were very similar (Figure. 2C), also exhibited similar expression patterns among tissues (Figure 3A) and developmental stages (Figure 3B), and among insects interacting with different plant genotypes (Figure 3C). The promoters of SSSGP-1C2, and SSSGP-1E1 were also similar to each other (Figure 2C), and these two genes also exhibited similar transcription patterns. However, the genes in the first group (SSSGP-1A1, SSSGP-1A2, SSSGP-1B1, and SSSGP-1C1) and the second group (SSSGP-1C2 and SSSGP-1E1) had strikingly different expression patterns (Figure 3). Small differences in the transcription patterns among members in the same promoter group were also observed. For example, SSSGP-1C2 was expressed abundantly in 0.5-day old larvae (Figure 3B, 1C2), whereas little SSSGP-1E1 expression was observed in the same larvae (Figure 3B, 1E1). These differences could indicate that small differences in the promoter (or other regulatory elements in other regions) of the genes can fine-tune the level of transcripts to satisfy specific requirements. These observations suggest that the conservation/diversification of the promoter regions has been strictly driven by functional adaptation.

Figure 3
figure 3

Distribution and abundance of transcripts corresponding to specific SSSGP-1 family members. A: Transcript distribution among tissues was determined using 3-day old biotype GP larvae. The remains after removing salivary glands, gut, and Malphigian tubules were designated as carcass. B: Transcript abundance in 0.5 to 12-day old larvae on susceptible wheat plants (cultivar 'Newton'). C: Transcript abundance in 0.5 to 4-day old (dying) larvae on resistant wheat (cultivar 'Molly' containing H13 R-gene). Primer pairs and methods are shown in Additional file 8, Table S1.

The homogenization of 5'- and possibly even 3'-UTRs may also have a functional basis. Because UTRs play critical roles in post-transcriptional regulation of gene expression [46, 47], we speculate that the SSSGP UTRs are critical for proper post-transcriptional regulation. For example, part of the conserved UTRs could serve as elements for binding with regulatory proteins or as pairing sites for interacting with micro-RNAs that may affect RNA stability or translation efficiency [48]. Multiple layers of gene regulation may be needed to ensure spatial and tissue-specific expression and prompt response of SSSGP-encoding genes to changes of host and other environmental conditions.

Functional division of SSSGPs: initiators and maintainers

SSSGPs appear to have a division of labor, with "initiators" expressed only immediately after the start of feeding and "maintainers" expressed at later stages in the time course of feeding and plant response. Initiators, such as SSSGP-1C2 and SSSGP-1E in the SSSGP-1 family, were predominantly expressed in salivary glands (Figure 3A) at early stage of larval development (Figure 3B), and their expression was elevated at later time points in larvae feeding on resistant plants (Figure 3C). These observations are consistent with the postulation that initiators are secreted into plant tissue as effectors to manipulate plant cells. Hessian fly suppresses plant defense and induces the formation of nutritive cells within the first couple of days [9, 19]. Once the insect has successfully manipulated host plants, one would expect that the expression of initiators is no longer needed. Indeed the manipulation of wheat seedlings is irreversibly achieved within the first few days following the Hessian fly initial attack [9]. The elevated expression of initiators in larvae feeding on resistant plants at later stages may reflect the fact that Hessian fly larvae continue to secret effectors to counter increased plant defense in these plants [21, 22].

Maintainers, such as SSSGP-1A, SSSGP-1C1, and SSGP-1C1, were also expressed in other tissues besides the salivary glands (Figure 3A). The proteins produced in Malphigian tubules and carcass are unlikely to play a role in interaction with host plants, but could play a role in regulating Hessian fly symbiotic or associated microbes in insect tissues [49]. In addition, some SSSGPs could also play a role in regulating secondary microbial infection of the host tissues damaged at the feeding site [50]. The maintainers may possess antimicrobial activity, and are under selection pressure from changes in microbial populations. Further research on the network of these initiators and maintainers encoded by rapidly evolving genes will shed light on the biology and feeding behavior of gall midges.


In this study, we observed an unconventional conservation pattern in genes encoding SSSGPs in the Hessian fly. In the SSSGP-encoding genes, noncoding regions are highly conserved whereas regions coding for mature proteins are highly diversified. Rapid diversification in mature SSSGPs suggests that the genes are under selection pressure for functional adaptation. Considering the fact that most SSSGP-encoding genes are exclusively expressed in salivary glands, it is likely that rapid diversification in SSSGP-encoding genes is for the insect to counter changes in host plants for virulence. The conservation in the noncoding regions of these genes including introns also suggested potential mechanisms for sequence homogenization that are not yet fully understood. This report should be useful for future studies on genetic mechanisms involved in evolution and functional adaptation of parasite genes.


DNA libraries and sequencing

cDNA libraries and sequencing were as described previously [23, 24]. A BAC library with 5× coverage was made from biotype GP Hessian fly larvae through a commercial contract with Amplicon Express (Pullman, WA). The BAC library contains inserts with average size of ~150 kB ligated into Hind III of pECBA1. A positive BAC clone, 10A23, was identified by screening the BAC library with a cDNA probe corresponding to the SSSGP-1C1 gene. A shotgun library with average sizes of 1.5 kB was made with 10 times coverage of the BAC clone 10A23, again through a commercial contract with Amplicon Express. The shotgun library was sequenced using ABI 3730 DNA analyzer at Kansas State University DNA sequencing facility. The shotgun sequences were assembled using Cap3 [51] and confirmed by PCR amplification and resequencing. The sequence of the whole BAC clone is 130 kB and was deposited into Genbank with accession number GU196316. The 15 kB cluster was located in the middle region toward 5'-end of the BAC.

Quantitative real-time polymerase chain reaction (qRT-PCR) analysis

RNA extraction, reverse-transcription and real-time PCR were carried out as described previously [24]. Two hundred larvae or tissues from 200 larvae were collected and pooled for RNA isolation for each replicate. Three biological replicates were included for each analysis. The ratio between abundances on resistant plants and the corresponding ones on susceptible plants were calculated. Primers used for PCR reactions are listed in Additional file 8, Table S1.

Sequence analysis and comparison

Sequence alignments and comparison were conducted using ClustalW [52]. For pairwise comparison, each sequence was compared with every other sequence. Scores for individual alignments are calculated based on the method of Wilbur and Lipman [53]. The higher the score is for a pairwise alignment, the higher the degree of conservation is between the two aligned sequences. Average scores were derived by dividing the sum of all pair-wise scores with the number of alignments. Score range was the lowest score to the highest score among all pair-wise alignments. For analysis of nucleotide substitutions, pair-wise alignments were obtained using ClusterW. Nonsynomonous (dN) and synomonous (dS) substitution ratios (dN/dS) were obtained using PAML42 [54].

Phylogenetic trees were produced based on neighbor joining and maximum likelihood using MEGA4 [55].

Southern blot analysis

Hessian fly genomic DNA was isolated following a salting out protocol [56]. For Southern blot, 10 μg of purified genomic DNA was digested with individual restriction enzymes. The digested DNA fragments were separated on a 0.8% agarose gel and blotted onto GeneScreen membrane (Perkin Elmer, Beltsville, MD). The membranes were then hybridized separately to individual probes of cDNAs from either the Hessian fly, or Asian rice midge, or wheat blossom midge. cDNA probes were produced with 32P dCTP using a random labeling kit from Stratagene (La Jolla, CA). Hybridization was carried out overnight at 42°C in a plastic bag containing a 15-mL hybridization solution, which consisted of 10% dextran sulfate, 1% SDS, 1 M NaCl, pH 8.0. After hybridization, the membranes were washed twice with 2 × SSC at room temperature for 30 min, twice with 2 × SSC (0.3 M sodium chloride and 30 mM tri-Sodium citrate dihydrate, pH 7.0) plus 1% SDS at 65°C for 30 min, and twice with 0.1 × SSC plus 1% SDS at room temperature for 30 min. Images were visualized by exposing the membranes to Kodak SR-5 X-ray film overnight.


  1. Chapman RF: The insects: structure and function (Chapman eds.). 1998, Cambridge Univeristy press, Cambridge, UK, 12-36. 4

    Chapter  Google Scholar 

  2. Miles PW: Aphid saliva. Biol Rev. 1999, 74: 41-85. 10.1017/S0006323198005271.

    Article  Google Scholar 

  3. Tjallingii WF: Salivary secretions by aphids interacting with proteins of phloem wound responses. J Exp Bot. 2006, 57: 739-745. 10.1093/jxb/erj088.

    Article  CAS  PubMed  Google Scholar 

  4. Mutti NS, Pappan LK, Begum K, Pappan K, Chen M-S, Park Y, Reese JC, Reeck GR: A protein from the salivary glands of the pea aphid, Acyrthosiphon pisum, is essential in feeding on a host plant. Proc Natl Acad Sci USA. 2008, 105: 9965-9969. 10.1073/pnas.0708958105.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  5. Grant SR, Fisher EJ, Chang JH, Mole BM, Dangl JL: Subterfuge and manipulation: Type III effector proteins of phytopathogenic bacteria. Ann Rev Microbiol. 2006, 60: 425-449. 10.1146/annurev.micro.60.080805.142251.

    Article  CAS  Google Scholar 

  6. De Wit PJGM, Mehrab IR, Van den Burg HA, Stergiopoulo SI: Fungal effector proteins: past, present and future. Mol Plant Pathol. 2009, 10: 735-747. 10.1111/j.1364-3703.2009.00591.x.

    Article  CAS  PubMed  Google Scholar 

  7. Kamoun S: A catalogue of the effector secretome of plant pathogenic oomycetes. Ann Rev Phytopathol. 2006, 44: 41-60. 10.1146/annurev.phyto.44.070505.143436.

    Article  CAS  Google Scholar 

  8. Pate N, Hamamouch N, Li C, Hewezi T, Hussey RS, Baum TJ, Mitcchum MG, Davis EL: A nematode effector protein similar to annexins in host plants. J Exp Bot. 2010, 61: 235-248. 10.1093/jxb/erp293.

    Article  Google Scholar 

  9. Byers RA, Gallun RL: Ability of Hessian fly to stunt winter wheat. 1. Effect of larval feeding on elongation of leaves. J Econ Entomol. 1972, 65: 955-958.

    Article  Google Scholar 

  10. Bede JC, Musser RO, Felton GW, Korth KL: Caterpillar herbivory and salivary enzymes decrease transcript levels of Medicago truncatula genes encoding enzymes in terpeniod biosynthesis. Plant Mol Biol. 2006, 60: 519-531. 10.1007/s11103-005-4923-y.

    Article  CAS  PubMed  Google Scholar 

  11. Musser RO, Hum-Musser SM, Eichenseer H, Peiffer M, Ervin G, Murphy JB, Felton GW: Herbivory: Caterpillar saliva beats plant defences. Nature. 2002, 416: 599-600. 10.1038/416599a.

    Article  CAS  PubMed  Google Scholar 

  12. Weech M-H, Chapleau M, Pan L, Ide C, Bede JC: Caterpillar saliva interferes with induced Arabidopsis thaliana defence responses via the systemic acquired resistance pathway. J Exp Bot. 2008, 59: 2437-2448. 10.1093/jxb/ern108.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  13. Hori K: Insect secretions and their effect on plant growth. Biology of Insect-Induced Galls. Edited by: Shorthouse JD, Rohfritsch D. 1992, Oxford University Press, New York, 118-140.

    Google Scholar 

  14. Dieleman FL: Effect of gall midge infestation on plant growth and growth regulating substances. Ento Exp Appl. 1969, 12: 745-749. 10.1007/BF00297101.

    Article  Google Scholar 

  15. Goethals K, Vereecke D, Jaziri M, Van Montagu M, Holsters M: Leafy gall formation by Rhodococcus fascians. Annu Rev Phytopathol. 2001, 39: 27-52. 10.1146/annurev.phyto.39.1.27.

    Article  CAS  PubMed  Google Scholar 

  16. Hatchett JH, Starks KJ, Webster JA: Insect and mite pests of wheat. Wheat and Wheat improvement. Agronomy Monograph No. 1987, 13: 625-675.

    Google Scholar 

  17. Harris MO, Stuart JJ, Mohan M, Nair S, Lamb RJ, Rohfritsch O: Grasses and gall midges: plant defense and insect adaptation. Annu Rev Entomol. 2003, 48: 549-577. 10.1146/annurev.ento.48.091801.112559.

    Article  CAS  PubMed  Google Scholar 

  18. Stuart JJ, Chen MS, Harris M: Hessian fly. Genome Mapping and Genomics in Animals, Volume 1: Genome Mapping and Genomics in Arthropods. Edited by: Hunter, Kole. 2008, Springer, Berlin, Heidelberg, New York, 93-100. full_text.

    Chapter  Google Scholar 

  19. Harris MO, Freeman TP, Rohfritsch O, Anderson KG, Payne SA, Moore JA: Virulent Hessian fly (Diptera: Cecidomyiidae) larvae induce a nutritive tissue during compatible interactions with wheat. Ann Entomol Soc Am. 2006, 99: 305-316. 10.1603/0013-8746(2006)099[0305:VHFDCL]2.0.CO;2.

    Article  Google Scholar 

  20. Stuart JJ, Hatchett JH: Morphogenesis and cytology of the salivary gland of the Hessian fly, Mayetiola destructor (Diptera: Cecidomyiidae). Ann Entomol Soc Am. 1987, 80: 475-482.

    Article  Google Scholar 

  21. Liu XL, Bai J, Huang L, Zhu L, Liu X, Weng N, Reese JC, Harris M, Stuart JJ, Chen MS: Gene expression of different wheat genotypes during attack by virulent and avirulent Hessian fly (Mayetiola destructor) larvae. J Chem Ecol. 2007, 33: 2171-2194. 10.1007/s10886-007-9382-2.

    Article  CAS  PubMed  Google Scholar 

  22. Zhu L, Liu XM, Liu X, Jeannotte R, Reese JC, Harris M, Stuart JJ, Chen MS: Hessian fly (Mayetiola destructor) attack causes dramatic shift in carbon and nitrogen metabolism in wheat. Mol Plant-Microbe Interact. 2008, 21: 70-78. 10.1094/MPMI-21-1-0070.

    Article  PubMed  Google Scholar 

  23. Chen MS, Fellers JP, Stuart JJ, Reese JC, Liu XM: A group of related cDNAs encoding secreted proteins from Hessian fly [Mayetiola destructor (Say)] salivary glands. Insect Mol Biol. 2004, 13: 101-108. 10.1111/j.1365-2583.2004.00465.x.

    Article  CAS  PubMed  Google Scholar 

  24. Chen MS, Zhao HX, Zhu YC, Scheffler B, Liu XM, Liu X, Hulbert S, Stuart JJ: Analysis of transcripts and proteins expressed in the salivary glands of Hessian fly (Mayetiola destructor) larvae. J Insect Physiol. 2008, 54: 1-16. 10.1016/j.jinsphys.2007.07.007.

    Article  CAS  PubMed  Google Scholar 

  25. Bennett J, Bentur JS, Pasalu IC, Krishnaiah K: New approaches to gall midge resistance. Proceedings of the International Workshop. 2004, Hyderabad, India, 1-23. 22-24 November 1998

    Google Scholar 

  26. Calvo E, Mans BJ, Andersen JF, Ribeiro MC: Function and evolution of a mosquito salivary protein family. J Biol Chem. 2006, 281: 1935-1942. 10.1074/jbc.M510359200.

    Article  CAS  PubMed  Google Scholar 

  27. Olivera BM, Rivier J, Clark C, Ramilo CA, Corpuz GP, Abogadie FC, Mena EE, Woodward SR, Hillyard DR, Cruz LJ: Diversity of conus neuropeptides. Science. 1990, 249: 257-263. 10.1126/science.2165278.

    Article  CAS  PubMed  Google Scholar 

  28. Lehmann AR: New functions for Y family polymerases. Mol Cell. 2006, 24: 493-495. 10.1016/j.molcel.2006.10.021.

    Article  CAS  PubMed  Google Scholar 

  29. Hughes AL, Nei M: Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection. Nature. 1998, 335: 167-170. 10.1038/335167a0.

    Article  Google Scholar 

  30. Tanaka R, Nei M: Positive Darwinian selection observed at the variable-region genes of immunoglobulins. Mol Biol Evol. 1989, 6: 447-459.

    CAS  PubMed  Google Scholar 

  31. Hughes AL, Yeager M: Coordinated amino acid changes in the evolution of mammalian defensins. J Mol Evol. 1997, 44: 675-682. 10.1007/PL00006191.

    Article  CAS  PubMed  Google Scholar 

  32. Michelmore RW, Meyers BC: Clusters of resistance genes in plants evolve by divergent selection and a birth-and-death process. Genome Res. 1998, 8: 1113-1130.

    CAS  PubMed  Google Scholar 

  33. Bishop JG, Dean AM, Mitchell-Olds T: Rapid evolution in plant chitinases: Molecular targets of selection in plant-pathogen coevolution. Proc Natl Acad Sci USA. 2000, 97: 5322-5327. 10.1073/pnas.97.10.5322.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  34. Morgan W, Kamoun S: RXLR effectors of plant pathogenic oomycetes. Curr Opin Microbiol. 2007, 10: 332-338. 10.1016/j.mib.2007.04.005.

    Article  CAS  PubMed  Google Scholar 

  35. Boller T, He SY: Innate immunity in plants: An arms race between pattern recognition receptors in plants and effectors in microbial pathogens. Science. 2009, 324: 742-744. 10.1126/science.1171647.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  36. Burdon JJ, Thrall PH: Coevolution of plants and their pathogens in natural habitats. Science. 2009, 324: 755-756. 10.1126/science.1171663.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  37. Bent AF, Machey D: Elicitors, effectors and R genes: The new paradigm and a lifetime supply of questions. Annu Rev Phytopathol. 2007, 45: 399-436. 10.1146/annurev.phyto.45.062806.094427.

    Article  CAS  PubMed  Google Scholar 

  38. Dover GA: Molecular drive: a cohesive mode of species evolution. Nature. 1982, 199: 111-117. 10.1038/299111a0.

    Article  Google Scholar 

  39. Liao D: Concerted evolution. Nature Encyclopedia of the Human Genome. Edited by: Cooper DN. 2003, Nature Publishing Group, London, 1: 938-942.

    Google Scholar 

  40. Hilliker AJ, Harauz G, Reaume AG, Gray M, Clark SH, Chovnick A: Meiotic gene conversion tract length distribution within the rosy locus of Drosophila melanogaster. Genetics. 1994, 137: 1019-1026.

    PubMed Central  CAS  PubMed  Google Scholar 

  41. Cromie GA, Hyppa RW, Cam HP, Farah JA, Grewal SI, Smith GR: A discrete class of intergenic DNA dictates meiotic DNA break hotspots in fission yeast. PLoS Genet. 2007, 3: e141-10.1371/journal.pgen.0030141.

    Article  PubMed Central  PubMed  Google Scholar 

  42. Mézard C: Meiotic recombination hotspots in plants. Biochem Soc Trans. 2006, 34: 531-534. 10.1042/BST0340531.

    Article  PubMed  Google Scholar 

  43. Steiner WW, Steiner EM, Girvin AR, Plewik LE: Novel nucleotide sequence motifs that produce hotspots of meiotic recombination in Schizosaccharomeces prombe. Genetics. 2009, 182: 459-469. 10.1534/genetics.109.101253.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  44. Jeffreys AJ, May CA: Intense and highly localized gene conversion activity in human meiotic crossover hot spots. Nat Genet. 2004, 36: 151-156. 10.1038/ng1287.

    Article  CAS  PubMed  Google Scholar 

  45. Hurst LD, Smith GC: The evolution of concerted evolution. Proc R Soc Lond B. 1998, 265: 121-127. 10.1098/rspb.1998.0272.

    Article  Google Scholar 

  46. Black BL, Lu J, Olson EN: The MEF2A 3' untranslated region functions as a cis-acting translational repressor. Mol Cell Biol. 1997, 17: 2756-2763.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  47. Di Liegro CM, Bellafiore M, Izquierdo JM, Rantanen A, Cuezva JM: 3'-untranslated regions of oxidative phosphorylation mRNAs function in vivo as enhancers of translation. Biochem J. 2000, 15: 109-115. 10.1042/0264-6021:3520109.

    Article  Google Scholar 

  48. He L, Hannon GJ: MicroRNAs: small RNAs with a big role in gene regulation. Nat Rev Genet. 2004, 5: 522-531. 10.1038/nrg1379.

    Article  CAS  PubMed  Google Scholar 

  49. Bourtzis K, Miller TA: Insect Symbiosis. 2006, CRC Press/Taylor &Francis, New York, 2: 1-276.

    Chapter  Google Scholar 

  50. Boosalis GM: Hessian fly in relation to the development of crown and basal stem rot of wheat. Phytopathol. 1954, 44: 224-229.

    Google Scholar 

  51. Huang X, Madan A: CAP3: A DNA sequence assembly program. Genome Res. 1999, 46: 37-45.

    Article  Google Scholar 

  52. Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD: Multiple sequence alignment with the Clustal series of programs. Nucl Acid Res. 2003, 31: 3497-3500. 10.1093/nar/gkg500.

    Article  CAS  Google Scholar 

  53. Wilbur WJ, Lipman DJ: Rapid similarity searches of nucleic acid and protein data banks. Proc Natl Acad Sci USA. 1983, 80: 726-730. 10.1073/pnas.80.3.726.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  54. Yang Z: PAML 4: Phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007, 24: 1586-1591. 10.1093/molbev/msm088.

    Article  CAS  PubMed  Google Scholar 

  55. Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.

    Article  CAS  PubMed  Google Scholar 

  56. Chen MS, Liu X, Zhu YC, Reese JC, Wilde GE: Genes encoding a group of related small secreted proteins from the gut of Hessian fly larvae [Mayetiola destructor (Say)]. Insect Sci. 2006, 13: 339-348. 10.1111/j.1744-7917.2006.00102.x.

    Article  CAS  Google Scholar 

Download references


Mention of commercial or proprietary product does not constitute endorsement by the USDA. The authors thank Drs Frank White and Richard Beeman for reviewing an earlier version of the manuscript. Hessian fly voucher specimens (No. 150) are located in the KSU Museum of Entomological and Prairie Arthropod Research, Kansas State University, Manhattan, Kansas. This research was supported by USDA-ARS and a grant from USDA-NRI (2004-03099).

Author information

Authors and Affiliations


Corresponding author

Correspondence to Ming-Shun Chen.

Additional information

Authors' contributions

MSC participated in sequence analysis and manuscript preparation. XL participated in library construction and sequence analysis. ZY involved in phylogenetic analysis and bioinformatics. HZ did real-time PCR analysis. RHS characterized cDNAs from rice midge and wheat midge. JJS and SH participated in data analysis and manuscript preparation. All authors have read and approved the final manuscript.

Electronic supplementary material

Additional file 1: Figure S1: Sequence alignments of different groups of SSSGP-encoding genes. (DOC 4 MB)

Additional file 2: Figure S2: Sequence alignments of different groups of SSSGP-encoding cDNAs. (DOC 4 MB)

Additional file 3: Figure S3: Evidence for clustered organization of SSSGP-encoding genes. (DOC 109 KB)

Additional file 4: Figure S4: Alignments of moderately diversified SSSGPgroup members (cDNAs). (DOC 2 MB)

Additional file 5: Figure S5: Evidence for single location of genes in the SSSGP-1family. (DOC 496 KB)


Additional file 6: Figure S6: Sequence alignment of similar SSSGP-encoding cDNAs (presumably derived from different alleles). (DOC 221 KB)

Additional file 7: Figure S7: Sequence alignment of cDNAs encoding ribosomal proteins. (DOC 212 KB)

Additional file 8: Table S1: Primers used for PCR reactions. (DOC 36 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Chen, MS., Liu, X., Yang, Z. et al. Unusual conservation among genes encoding small secreted salivary gland proteins from a gall midge. BMC Evol Biol 10, 296 (2010).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: