Skip to main content

Convergent evolution of heat-inducibility during subfunctionalization of the Hsp70 gene family



Heat-shock proteins of the 70 kDa family (Hsp70s) are essential chaperones required for key cellular functions. In eukaryotes, four subfamilies can be distinguished according to their function and localisation in different cellular compartments: cytosol, endoplasmic reticulum, mitochondria and chloroplasts. Generally, multiple cytosol-type Hsp70s can be found in metazoans that show either constitutive expression and/or stress-inducibility, arguing for the evolution of different tasks and functions. Information about the hsp70 copy number and diversity in microbial eukaryotes is, however, scarce, and detailed knowledge about the differential gene expression in most protists is lacking. Therefore, we have characterised the Hsp70 gene family of Paramecium caudatum to gain insight into the evolution and differential heat stress response of the distinct family members in protists and to investigate the diversification of eukaryotic hsp70s focusing on the evolution of heat-inducibility.


Eleven putative hsp70 genes could be detected in P. caudatum comprising homologs of three major Hsp70-subfamilies. Phylogenetic analyses revealed five evolutionarily distinct Hsp70-groups, each with a closer relationship to orthologous sequences of Paramecium tetraurelia than to another P. caudatum Hsp70-group. These highly diverse, paralogous groups resulted from duplications preceding Paramecium speciation, underwent divergent evolution and were subject to purifying selection. Heat-shock treatments were performed to test for differential expression patterns among the five Hsp70-groups as well as for a functional conservation within Paramecium. These treatments induced exceptionally high mRNA up-regulations in one cytosolic group with a low basal expression, indicative for the major heat inducible hsp70s. All other groups showed comparatively high basal expression levels and moderate heat-inducibility, signifying constitutively expressed genes. Comparative EST analyses for P. tetraurelia hsp70s unveiled a corresponding expression pattern, which supports a functionally conserved evolution of the Hsp70 gene family in Paramecium.


Our analyses suggest an independent evolution of the heat-inducible cytosol-type hsp70s in Paramecium and in its close relative Tetrahymena, as well as within higher eukaryotes. This result indicates convergent evolution during hsp70 subfunctionalization and implies that heat-inducibility evolved several times during the course of eukaryotic evolution.


The environmental stress response in all organisms as diverse as pro- and eukaryotes is generally coupled with a remarkable change in gene expression patterns and an enhanced synthesis of several ‘stress proteins’ [1]. Because they were first described in Drosophila melanogaster larvae that were accidentally exposed to elevated temperatures [2], these stress-related proteins were called heat-shock proteins (Hsps). Extensive research on Hsps revealed also a constitutive expression of some members of these proteins, suggesting that they are also essential in maintaining the cellular functions under normal physiological conditions. These members are therefore designated as heat-shock cognate proteins (Hscs) [3, 4]. Furthermore, Hsps respond not only to increased temperatures, but also chemicals, heavy metals, UV light, hypoxia and other stressors can induce their synthesis [5, 6].

Some of the most important and well investigated Hsps are the members of the 70 kDa heat-shock protein family (Hsp70s). They belong to the highest conserved proteins and are present in almost all species, except for some archaea. Prokaryotic Hsp70 (DnaK) proteins share about 50% amino acid identity with eukaryotic Hsp70s. All known Hsp70 proteins exhibit highly conserved amino acid sequences and domain structures, such as a conserved N-terminal ATPase domain, a region with protease sensitive sites and a peptide binding domain at the C-terminal region (reviewed in e.g. [79]). Due to this high conservation, Hsp70 has been widely used as a suitable phylogenetic marker in molecular evolution. It has been applied to deep phylogenetic relationships (such as between animals and fungi) or to confirm the monophyly of Metazoa [10], as well as relationships between archaea and Gram-positive bacteria or Gram-negative bacteria and eukaryotes [1113]. Hsp70 genes and proteins have also been used for phylogenetic studies of different protozoan parasites such as Trypanosoma or Leishmania[14, 15], as well as of non-parasitic protozoans such as Euplotes or Paramecium[16, 17].

In eukaryotes, several of these Hsp70 proteins that belong to four subfamilies are encoded by the nuclear genome (reviewed in e.g. [18, 19]). The discrimination of the four Hsp70 subfamilies corresponds to the intracellular localisation of the Hsp70 proteins in the major compartments of the cell: the cytosol, the endoplasmic reticulum (ER) and the organelles mitochondria and chloroplasts [11]. These multigene family members emerged from numerous duplication events or replicative transpositions and evolved by a combination of birth-and-death processes, gene conversion events and purifying selection (e.g. [2022]). Therefore, caution is called for when using Hsp70s as phylogenetic marker, because paralogy can distort phylogenetic relationships [23]. While the evolution of all Hsp70 family members is not yet completely understood, the pathways for the evolution of the nucleus-encoded Hsp70s of the cell organelles are well known. The mitochondrial and the chloroplast homologs are derivatives from an endosymbiotic alphaproteobacterium and cyanobacterium, respectively, followed by a subsequent horizontal gene transfer of the hsp70 homologs to the nucleus. Further, the evolutionary relationship between the ER- and cytosol-type hsp70s indicates that these genes have arisen very early in the common eukaryotic ancestor by gene duplication [11, 12, 24].

All members of the Hsp70 family carry out molecular chaperone functions that facilitate correct protein folding, as well as membrane translocation and the subsequent refolding of proteins [25]. Through the course of evolution, these multiple Hsp70 family members, which can be found in pro- and eukaryotes, have acquired different chaperone tasks and functions (e.g. [26, 27]). For example, in all multicellular eukaryotes studied so far, some cytosol-type members show a constitutive expression under normal physiological conditions (Hsc70s) while others are highly inducible (Hsp70s) in consequence of several stress conditions (below, we will use the term Hsp70 to refer to both the inducible and constitutive 70 kDa chaperones). Detailed knowledge about the number, diversity and differential gene expression of the Hsp70 family members in most unicellular eukaryotes is, however, scarce. For example, the yeast genome of Saccharomyces cerevisiae encodes fourteen Hsp70-like genes [28] comprising cytosol-type hsp70s that show constitutive expression (SSA1, SSA2) as well as heat-inducibility (SSA1, SSA3; e.g. [3, 29]). Further, the genome sequencing projects of the ciliate species and model eukaryotes Tetrahymena thermophila[30] and Paramecium tetraurelia[31] unveiled several putative hsp70 genes with conserved Hsp70 domains. In Paramecium, whole genome duplications and subsequent gene loss events have shaped the Hsp70 gene family [31], but it is still uncertain whether subfunctionalization or gene dosage constraints are the main forces for hsp70 duplicate retention. While the major constitutively expressed and heat-inducible cytosol-type hsp70s have been identified for T. thermophila[32], the differential expression pattern of the Hsp70 family in P. tetraurelia has remained unclear. Additionally, the evolutionary relationships of the hsp70 homologs from these closely related microbial eukaryotes are vague. Therefore, we have investigated the Hsp70 multigene family of another paramecia species, Paramecium caudatum, by sequencing an hsp70 cDNA clone library.

Based on the resultant sequence data, we performed phylogenetic and comparative sequence analyses to gain insight into the evolution of the Hsp70 gene family in Paramecium. To disentangle the evolutionary relationships among ciliate hsp70s, which are in certain cases affected by whole genome duplications, we included not only a comprehensive set of homologous hsp70s of different pro- and eukaryotes, but also data from recent ciliate genome projects: Tetrahymena, Ichthyophthirius and Oxytricha. We further developed a target specific RT-qPCR assay for P. caudatum hsp70s in order to demonstrate an expected differential gene expression among the family members and to distinguish constitutively expressed and heat-induced hsp70s. Using these data, we tested for a hypothesised functional conservation of Paramecium hsp70 orthologs via comparative EST analyses and investigated the diversification of eukaryotic hsp70s to clarify the evolution of heat-inducibility within the 70 kDa molecular chaperone family.


Characterisation of P. caudatum hsp70s

Sequencing of sixty putative hsp70 gene fragments, amplified from cDNA with degenerated primers, revealed eleven different hsp70 nucleotide sequences (1363bp – 1378bp) from the Hsp70/DnaK-family of Paramecium caudatum. When the majority of the obtained sequences became redundant, we stopped sequencing of further clones of our hsp70 cDNA clone library. Therefore, we cannot exclude the possibility of additional homologs expressed in P. caudatum, but a sequence homology search (BLASTn/p) clearly assigned the eleven homologs to the three major Hsp70-subfamilies: cytosol (CY), endoplasmic reticulum (ER) and mitochondria (MT). These analyses further revealed the expression of one MT, five CY and five ER related Hsp70 proteins in P. caudatum. The sequences were designated according to their origin and sequence similarity as follows: PcHsp70CY1a, PcHsp70CY1b, PcHsp70CY1c, PcHsp70CY2a, PcHsp70CY2b, PcHsp70ER1a, PcHsp70ER1b, PcHsp70ER2a, PcHsp70ER2b, PcHsp70ER2c, PcHsp70MT1a.

The homologs PcHsp70CY1a and PcHsp70CY1b differed at only 4 nucleotide positions, and encode the same amino acid sequence. The homologs PcHsp70ER1a and PcHsp70ER1b differed by only one ‘C’ to ‘T’ substitution, resulting also in the same amino acid sequence. PcHsp70ER2a, PcHsp70ER2b and PcHsp70ER2c showed 1 to 3 nucleotide substitutions, but were indiscriminative at the amino acid level too (Additional file 1: Table S1). Because we used a proofreading polymerase for PCR amplification and a high fidelity RNA-dependent DNA polymerase for cDNA synthesis, these nucleotide substitutions can be considered authentic. The analyses of the deduced amino acid sequences revealed seven different homologous sequences, with an averaged pairwise distance of 0.38 and a maximum pairwise distance of 0.66. The amplified fragments for all CY homologs as well as PcHsp70ER1a and PcHsp70ER1b encode 459 amino acids (aa); PcHsp70ER2a, PcHsp70ER2b and PcHsp70ER2c encode 457 aa and PcHsp70MT1a encodes 454 aa.

All identified sequences shared two amino acid signature motifs (Figure 1B/E; see also Additional file 2: Figure S1), which are highly conserved regions and present in all eukaryotic Hsp70 family members (note: the forward primer Hsp70ForDeg targeted the Hsp70 family signature 1). The consensus sequences are as follows: Hsp70 family signature 2 [VI]-[FY]-D-L-G(3)-T-F-D-[VI]-S-[IL]-L and Hsp70 family signature 3 [VI]-[VI]-L-V-G(2)-[SM]-[TS]-R-[IM]-P-K-[VI]-[QR]-[QEDK], where the square brackets show a list of acceptable amino acids at the given position and the numerical values between parentheses indicate repetition of the respective amino acid [33]. Another conserved signature, the putative ATP/GTP-binding site motif A (P-loop), could be identified in all eleven homologs, albeit with relatively high variation (Figure 1A, see also Additional file 2: Figure S1). As expected, the putative bipartite nuclear localization signal, a targeting sequence essential for the selective translocation of nucleo-/cytoplasmatic proteins into the nucleus, could be detected for the CY homologs only [34]. Its consensus sequence is [KR]-K-x(10)-L-R(2)-L-R, where the symbol ‘x’ indicates a position of any acceptable amino acid (Figure 1C, see also Additional file 2: Figure S1). Additionally, all CY and ER homologs possess the R-A-[KR]-F-E-E-L consensus motif that serves as a potential signature for eukaryotic non-organellar Hsp70 proteins [35].

Figure 1

Amino acid sequence logos of P. caudatum Hsp70 motifs. (A) putative ATP/GTP-binding site motif A (P-loop), (B) Hsp70 family signature 2, (C) putative bipartite nuclear localization signal, (D) potential signature for eukaryotic non-organellar Hsp70 proteins, and (E) Hsp70 family signature 3. The size of each letter is proportional to the frequency of occurrence of the respective amino acid in a multiple sequence alignment. The letters are sorted with the most frequent one on top and the overall height of each stack indicates the information content of the sequence at the given position (0–4 bits). The sequence logos were prepared from sequences of all P. caudatum hsp70s (A,B,E), from the cytosolic and ER-type homologs (D), and from the cytosolic hsp70 genes only (C).

Evolutionary relationships

As illustrated in the phylogenetic Hsp70 tree in Figure 2 showing the three major Hsp70-subfamilies (CY, ER, MT) among different eukaryotes, the ascertained P. caudatum sequences integrated well within the separate subfamilies with high support values. In addition, all ciliate sequences clustered into clades in which the Paramecium homologs formed monophyla. These analyses also revealed the existence of different Hsp70-groups within the different P. caudatum Hsp70 subfamilies. Here, two distinct Hsp70-groups within the cytosolic as well as the ER-type subfamily could be detected. Each of these P. caudatum Hsp70-groups from the same subfamily (CY or ER) showed a closer relationship to orthologous Hsp70 sequences of Paramecium tetraurelia than to another P. caudatum Hsp70-group. The group CY-A consisted of the three in-paralogs PcHsp70CY1a, PcHsp70CY1b and PcHsp70CY1c; the group CY-B contained PcHsp70CY2a and PcHsp70CY2b; the group ER-A comprised the genes PcHsp70ER1a and PcHsp70ER1b; and group ER-B included PcHsp70ER2a, PcHsp70ER2b and PcHsp70ER2c. The mitochondrial homolog PcHsp70MT1a was determined as group MT for further analyses, even though we could detect only one mitochondrial hsp70 gene sequence. Since we cannot confirm complete locus homozygosity of the clonal P. caudatum strain that we used in this study, the detected within-group paralogs (in-paralogs) might constitute alleles. However, the amitotic division of the ciliate macronucleus (MAC) during vegetative growth can lead to an unequal partitioning of the alleles to daughter MACs [36], resulting in the expression of only one of the two alleles. In conjunction with the detected low mutation rate in Paramecium[37] and the comparatively high sequence divergence of the CY-B in-paralogs (2.5%), we would argue in this case for in-paralogs rather than alleles.

Figure 2

Phylogenetic reconstruction of homologous hsp70 sequences from different pro- and eukaryotes. The alignment was based on a comprehensive amino acid alignment containing 97 homologs that show the typical Hsp70 family signatures. Sequences obtained within this study are shown in boldface. The alignment length was constricted to 489 amino acids, including gaps. The illustrated tree is based on the Maximum-Likelihood calculations using the LG+I+Γ protein evolution model with 1,000 rapid bootstrap replicates. The Archaea DnaK sequences served as outgroup. Numbers at the nodes (occasionally indicated by an arrow) represent support values for the Maximum-Likelihood (above line) and Bayesian analysis (below line), respectively. The three major Hsp70-subfamilies—cytosol (CY), endoplasmic reticulum (ER) and mitochondria (MT)—are indicated by squared brackets to the right. Vertical bars to the right of the tree designate the five different Hsp70-groups (CY-A, CY-B, ER-A, ER-B, MT) detected in Paramecium. Highly heat inducible cytosol-type hsp70 genes (if known) are specified by an asterisk. Filled circles at the nodes represent ancient and intermediary gene duplication events, while filled triangles at the nodes point to speciation within Paramecium. Rounded brackets to the right exemplify evolutionary relationships among the Paramecium Hsp70 homologs and are defined in the upper left box frame. See Additional file 4: Figure S2 for the uncollapsed tree and its electronic version deposited in TreeBASE under accession number TB2:S13746.

The detection of eleven putative hsp70 homologs in P. caudatum is in concordance with former studies on the Hsp70 multigene family in other ciliate species (cf. [16]). The P. tetraurelia genome project [31, 38], for example, revealed fifteen putative hsp70 sequences in MAC DNA, while only thirteen possess the typical conserved domain structures with an N-terminal ATPase and a C-terminal peptide-binding domain. However, our motif analyses revealed that only ten homologs exhibit all three Hsp70 family signatures, indicating the existence of potential hsp70 pseudogenes in the P. tetraurelia MAC genome. In addition, genome analyses for Tetrahymena thermophila revealed thirteen putative hsp70 genes featuring conserved Hsp70 domains [32]. Similar to P. tetraurelia, only seven hsp70 homologs comprise the three Hsp70 family signatures. Interestingly, while we could find all orthologous sequences of these seven T. thermophila hsp70s in its close relative T. malaccensis by utilizing the Tetrahymena Comparative Database, we could not detect the heat-inducible ortholog of hsp70-2 in T. borealis (see Figure 2). Furthermore, in Ichthyophthirius multifiliis and Oxytricha trifallax we could detect only one CY- and one ER-type hsp70 homolog comprising all three Hsp70 family signatures.

Modes of evolution

The five putative Hsp70-groups detected in Paramecium showed consistent phylogenetic patterns according to the assumed evolutionary relationship between the species (Figure 2), thereby supporting the model of divergent evolution in the Paramecium Hsp70 gene family. Additionally, a codon-based Z-test [39] on all P. caudatum hsp70s revealed evidence for purifying selection, which is indicated by higher numbers of synonymous (d S ) to non-synonymous (d N ) nucleotide differences per site in pairwise comparisons (Table 1). While the paralogs between the two Hsp70-groups per subfamily (‘within species out-paralogs’, cf. Figure 2) show a comparatively high amino acid divergence of up to 19%, the potential paralogs within each group (‘in-paralogs’, cf. Figure 2) show a rather low nucleotide diversity (0.1 – 2.5%). This high similarity of the in-paralogs could be indicative of very recent duplication events or might be explained by purifying selection (d N < d S ) or gene conversion (d N = d S ). The results of a codon-based Z-test for neutral evolution revealed that the null hypothesis of strict-neutrality (d N = d S ) could not be rejected for the CY-B and ER-type in-paralogs (Additional file 3: Table S2), while the sequence similarity among the CY-A in-paralogs seems to be caused by purifying selection (d N < d S ). A test for gene conversion events revealed significant evidence (p < 0.05) for only two very short regions (18bp and 53bp) between out-paralogs, which might in fact be misidentifications due to possible misalignment artefacts. Please note that the substitutions within the ER and CY-A groups are very few and therefore do not allow for definite conclusions of the within-group comparisons; on the other hand, the discrepancy between the CY-A and CY-B group is obvious and implies different modes of evolution.

Table 1 Estimates of codon-based evolutionary divergence between P. caudatum Hsp70 genes

Differential gene expression

The group classification with CY-A, CY-B, ER-A, ER-B and MT was used for the expression analyses, since RT-qPCR primers and MGB™-probes were specifically designed to amplify or bind the respective hsp70 sequences within one group. The stability analysis of the reference genes GAPDH and EF-1α using the geNorm v.3.5 applet [40] revealed a high stability of both genes. The averaged expression stability value was 0.054 and the pairwise variation value (0.115) was below the proposed cut-off value of 0.15. We therefore used these two genes to normalise the hsp70 mRNA levels in all further analyses.

As illustrated in Figure 3A, the CY-A relative expression level (relCY-A = 0.60) at optimum temperature (28°C) was the highest among all five Hsp70-groups. While the basal expression levels of the ER-A, ER-B and MT group were comparatively similar (relER-A = 0.46, relER-B = 0.51, relMT = 0.47), the CY-B levels instead were considerably lower (relCY-B = 0.06), indicating that these genes (PcHsp70CY2a and PcHsp70CY2b) were transcribed at low basal levels under normal physiological conditions. After heat-shock exposure for two hours at 34°C, we could detect changes of the hsp70 transcript levels (Figure 3A). While the reference genes GAPDH and EF-1α showed comparatively stable expression levels between 28°C (relGAPDH = 0.70, relEF-1α = 0.98) and 34°C (relGAPDH = 0.67, relEF-1α = 0.95), the heat shock provoked an induction of all different Hsp70-groups. After heat shock, the relative transcription level of CY-A (relCY-A = 0.72) was higher than that of GAPDH (rel GAPDH = 0.67) and still showing the highest hsp70 mRNA expression level. The CY-B group, however, was highly induced (relCY-B = 0.64), resulting in the second most expressed Hsp70-group.

Figure 3

Relative expression amounts of different Hsp70-groups in Paramecium . (A) Relative transcript quantity of the P. caudatum Hsp70-groups CY-A, CY-B, ER-A, ER-B and MT at optimum temperature (28°C) and after heat shock (34°C) were assessed by RT-qPCR; levels were normalised to the highest and lowest observed cycle threshold (Ct) value of the whole data set and expressed in arbitrary units as relative mRNA amount (lowest Ct value = 1.0, highest Ct value = 0.0). All values are shown as mean ± standard error (n = 5). (B) Relative transcript amount of Hsp70-groups derived from EST libraries of P. tetraurelia cultured at 27°C or 35°C. Library screening was performed using the Local Blast engine in BioEdit (e-value cut-off of e < E-100) for orthologous EST sequences of the five P. caudatum Hsp70-groups using reference nucleotide sequences of P. tetraurelia. Resulting EST counts were used to calculate relative expression levels of the five Hsp70-groups in transcripts per million.

Performing relative expression analyses using REST 2009 unveiled a significant up-regulation of all Hsp70-groups (Figure 4). While the transcription level of the CY-A, ER-A, ER-B and the MT group were slightly up-regulated between 1.7- and 3.0-fold, CY-B was strikingly induced by thermal stress with an averaged up-regulation of 84.2-fold. This indicates that the CY-B genes (PcHsp70CY2a and PcHsp70CY2b) are the major inducible forms of the Hsp70 multigene family in P. caudatum. Note that the fairly high standard errors as indicated in the Whisker-Box plots in Figure 4 resulted from the inclusion of the PCR efficiency into the relative expression calculation of REST 2009.

Figure 4

Relative mRNA transcript induction of the five P. caudatum Hsp70-groups after heat shock. Expression ratios were assessed by RT-qPCR using 28°C as control treatment and two-hour incubation at 34°C as heat-shock treatment. Transcript levels of the Hsp70-groups CY-A, CY-B, ER-A, ER-B and MT were normalised to the reference genes GAPDH and EF-1a. Relative expression ratios (fold-change) were calculated using the PCR-efficiency based method in the programme REST 2009. Results are illustrated as Whisker-box plots where the dotted line represents the sample median (n = 5) and the box area encompasses 50% of all observations. The grey dashed line defines the value of no change in relative expression.

The screening of two P. tetraurelia EST libraries for orthologous sequences of the detected five P. caudatum Hsp70-groups revealed that all corresponding P. tetraurelia hsp70s seem to be up-regulated during heat exposure to 35°C, compared to normal physiological conditions at 27°C (Figure 3B). While EST counts for CY-B related genes could not be detected at 27°C, these genes were highly abundant at 35°C. Therefore, the CY-B related homologs seem to represent the major heat inducible hsp70 genes of this multigene family in P. tetraurelia as well.


Phylogenetic analyses based on an Hsp70 amino acid alignment reliably assigned the detected P. caudatum hsp70 genes to the respective Hsp70-subfamilies cytosol (CY), endoplasmic reticulum (ER) and mitochondria (MT) (Figure 2). These analyses further showed not only that these homologs can be divided into the three subfamilies, but they also revealed the existence of five putative Hsp70-groups in P. caudatum: one MT group consisting of only one homolog, but two distinct groups within the CY-, as well as two groups within the ER-type subfamily. Here, each CY- and ER-type Hsp70-group shows a closer relationship to putative orthologous sequences of P. tetraurelia than to the other P. caudatum Hsp70-group of the same subfamily (Figure 2). These findings suggest a gene duplication event before the speciation of P. caudatum and P. tetraurelia, but obviously after the divergence of Paramecium and Tetrahymena (another closely related oligohymenophorean ciliate) since both Paramecium CY- and ER-type homologs form monophyletic clades (see Figure 2). Therefore, the last common ancestor of Paramecium and Tetrahymena should have had one functional CY- and one ER-type hsp70 homolog, while the common ancestor of P. caudatum and P. tetraurelia has possessed one functional gene of each of the four non-organellar Hsp70-groups, meaning one CY-A, CY-B, ER-A and ER-B homolog.

Using an RT-qPCR approach, we could show considerable differences among the five assigned Hsp70-groups of Paramecium caudatum. These analyses revealed the group CY-A as the major constitutively expressed Hsp70-group in P. caudatum at optimum physiological temperatures. Even though this study showed a significant up-regulation in mRNA levels of the CY-A group members after heat shock (~3.0-fold), this does not necessarily cause the rejection of their Hsc70 (heat-shock cognate protein 70) affiliation, since many constitutively expressed hsp70s can be induced under specific conditions (e.g. [4144]). We have also seen that the Hsp70-group CY-B holds a very low mRNA abundance at optimum growth temperatures, but was highly induced after heat shock (~84-fold). This induction resulted in the second most abundant Hsp70s during heat stress and suggests that the two genes PcHsp70Cy2a and PcHsp70Cy2b of group CY-B represent the major heat-inducible homologs in P. caudatum.

As previously mentioned, the P. caudatum Hsp70-groups CY-A and CY-B showed a closer relationship to orthologous Hsp70 sequences of P. tetraurelia than to each other. The comparative analysis of two P. tetraurelia EST libraries (constructed from RNA of cells grown at 27°C or 35°C) revealed comparable expression patterns between the P. caudatum and the P. tetraurelia Hsp70-groups (cf. Figure 3A and 3B). Here, the CY-A group homologs are also highly expressed under normal physiological conditions, indicating constitutively expressed hsp70 genes, while the CY-B related homologs appear to represent the major heat-inducible hsp70s in P. tetraurelia as well. This result suggests conserved functions and a general expression pattern of the Paramecium Hsp70 gene family, at least between P. caudatum and P. tetraurelia. In this context it is interesting to note that Tetrahymena features also constitutively expressed and heat-inducible CY-hsp70s [32, 45], which form together with the single Ichthyophthirius multifiliis CY-hsp70 homolog a sister clade to all Paramecium CY-homologs (Figure 2). In this cytosolic Ichthyophthirius–Tetrahymena clade, the constitutively expressed T. thermophila hsp70-4 gene clusters together with orthologs of T. malaccensis, T. borealis and I. multifiliis by forming a sister clade to all other Tetrahymena CY-homologs including the heat-inducible T. thermophila hsp70-2 gene. Furthermore, while heat-inducible and constitutively expressed CY-hsp70s are common in higher eukaryotes too, they obviously do not form clear separated clades (cf. Additional file 4: Figure S2). In conjunction with our findings for ciliate hsp70s, this strongly indicates that heat-inducibility in cytosolic Hsp70s evolved several times independently during eukaryote evolution; at least twice within closely related unicellular eukaryotes and within metazoans. This result provides a striking example of convergent evolution during subfunctionalization among eukaryotes.

Our relative expression analyses also demonstrated that the P. caudatum Hsp70-groups ER-A and ER-B showed comparatively similar transcription levels at optimum temperatures, but the relative amount was considerably lower compared to the major constitutively expressed Hsp70-group CY-A. Heat treating the P. caudatum cells significantly induced the mRNA expression of both ER-type groups, but only to a comparatively small amount of 2.0-fold for ER-A and 2.7-fold for ER-B. Our study, therefore, partially supports the findings of Hori and Fujishima [46], who showed only trace amounts of ER-type hsp70 mRNA when P. caudatum cells were cultured at 25°C, but also an up-regulation of approx. 4-fold when cells were heat shocked at 35°C. On the other hand, they suggested a predominant expression of ER-type hsp70s because they could not detect cytosolic homologs [46]. This is in contrast to our results showing that the cytosolic Hsp70-group CY-A encompassed the major constitutively expressed hsp70s, while group CY-B covered the major inducible homologs. Further, the relative mRNA levels of the ER-groups were nearly 23% less of the relative cytosolic mRNA transcript amount at optimum temperatures. And after heat stress, the cytosolic hsp70 transcription level was about 2.5-fold higher than that of all ER-type homologs, mainly because of the striking induction of the CY-B group, but also due to the significant up-regulation of the CY-A group hsp70s. Nevertheless, studies on other eukaryotic organisms revealed that ER-type Hsp70 proteins, mostly designated as GRP78 or BiP, are abundant proteins in animal cells as well [47]. They are primarily induced by glucose depletion [48], and also due to cadmium exposure [49], during apoptosis [50] or due to the presence of unfolded or misfolded proteins in the ER. However, they are not significantly heat-inducible [51], while P. caudatum ER-type hsp70s seem to be. Therefore, experiments on the gene expression of these ER-type hsp70 genes investigating the effect of endosymbionts [46, 52] or other stressors would be valuable in unveiling their role in the stress response of Paramecium.

In this study, we further identified one hsp70 gene encoding for a mitochondrial Hsp70 protein (mtHsp70). These proteins are required for the translocation of cytosolic preproteins across the mitochondrial membrane as well as the subsequent folding reactions in the mitochondrial matrix [53, 54]. In most organisms, organelle-specific Hsp70s are generally encoded by a single gene [55], but in Saccharomyces cerevisiae three distinct mtHsp70 genes are described, and P. tetraurelia seems to have at least two functional mthsp70s. Humans seemingly hold only one chaperone-active mtHsp70 protein (HSPA9B), while another highly similar protein (HSPA9A) plays a major role in the import of preproteins into the mitochondria. The single mtHsp70 gene PcHsp70MT1a that we have observed in P. caudatum showed a constitutive expression pattern (Figure 3A), suggesting its essential role for mitochondria to function properly under normal physiological conditions. The 1.7-fold up-regulation in mRNA levels after heat shock, which is comparable to S. cerevisiae or Blastocladiella emersonii mtHsp70s [27, 53], implies also its key role for proper mitochondrial function under heat stress. Though elevated temperatures can lead to enhanced oxidative stress, this might be overcome by an increased protein import into the mitochondria or an enhanced chaperone activity. Whether the mthsp70 gene in P. caudatum covers both chaperone- and preprotein translocation activity and whether the P. tetraurelia mthsp70s have undergone subfunctionalization after gene duplication should be determined by future studies addressing Hsp70 protein interactions.

The phylogenetic and evolutionary patterns detected within the Paramecium Hsp70 gene family support the model of divergent evolution, while the different Hsp70-groups seem to have evolved under functional and structural constraints. On the other hand, in relation to the further investigated ciliate species and metazoans, these patterns indicate distinct evolutionary pathways and convergent evolution (cf. Figure 2). Therefore, this multigene family seems to be differentially affected also in ciliates by gene duplication events and/or whole genome duplications, by gene loss and retention, subfunctionalization and/or pseudogenization. The finding of a similar basal gene duplication pattern of the CY- and the ER-type hsp70s in Paramecium, for example, is indicative of a whole-genome duplication (WGD) event within the last common ancestor of P. caudatum and P. tetraurelia. It also may point to an intermediary WGD that occurred within the genus Paramecium, because the most recent WGD is supposed to coincide with rapid speciation events that gave rise to the P. ‘aurelia’ complex (cf. Figure five in [31]). Further, an old WGD is thought to have occurred before the divergence of Paramecium and Tetrahymena[31], but is still under question due to recent studies that failed to detect evidence for WGD in T. thermophila and on the Ichthyophthirius/Tetrahymena branch [56, 57].

The age of these WGD events would be of major significance to clarify speciation and radiation events in ciliates as well as to understand the dynamics of gene loss and retention or the metabolic adaptation after a WGD [58]. For example, the dates of these duplications could be interesting in consideration of the phylogenetic relationships among the genus Paramecium, whether these events occurred within the last common Paramecium ancestor or before/after certain speciation events. Insights into the different scenarios would be of importance to potentially support the proposed subdivision of the genus Paramecium into the four subgenera Chloroparamecium, Helianter, Cypriostomum and Paramecium[59]. Here, further analyses of the Hsp70 multigene family of at least three additional Paramecium species (e.g., Paramecium [Chloroparamecium] bursaria, Paramecium [Helianter] putrinum and Paramecium [Cypriostomum] calkinsi) would be required. However, due to a missing reliable molecular clock for ciliate divergence times (e.g. approx. 1500 to 600 Ma for Paramecium/Tetrahymena divergence [60, 61]) and recent findings on P. tetraurelia that indicate extraordinarily reduced DNA mutation rates [37], more genome data would be favourable to estimate the dates of Paramecium species divergence or the timing of WGD events in the ciliate phylum.

Assuming a WGD event within the last common ancestor of P. caudatum and P. tetraurelia, the lack of MT paralogs in P. caudatum has to be the result of pseudogenization and a rapid gene loss. Even though many of the duplicates of the recent WGD in P. tetraurelia are functionally redundant and are supposed to be progressively lost, most of the gene duplicates did not go through such a rapid elimination [31]. On the other hand, the four putative mthsp70s in P. tetraurelia, which seem to comprise two functional genes and two pseudogenes, are suggestive of an intermediary WGD event causing the Hsp70-group differentiation in Paramecium. Since our study on P. caudatum was based on cDNA and all the detected homologs show typical Hsp70 family signatures and motifs as well as no premature stop codons or frameshifts, they can be considered as functional hsp70 genes. Therefore, we have not detected either potentially expressed or unexpressed hsp70 pseudogenes, which would represent a hallmark of the birth-and-death process after duplication events. Hence, to disentangle in more detail the different patterns of concerted and non-concerted evolution that shaped the P. caudatum Hsp70 gene family, further gDNA/genome analyses would be necessary to unveil potential hsp70 pseudogenes in P. caudatum. Moreover, comparative genome analyses can uncover the gene order and orientation of recent hsp70s to infer pre-duplication states and to predict patterns of duplicate loss and retention. Such analyses can further improve our understanding of the evolution of such gene families in Paramecium and other microbial eukaryotes.


The results of this study demonstrate first, that the Hsp70 multigene family of Paramecium caudatum comprises several homologous genes that can be assigned to three major Hsp70-subfamilies: one mitochondrial, five cytosolic and five endoplasmic-reticulum-related hsp70s. Phylogenetic analyses further revealed two separate Hsp70-groups within the cytosolic as well as the ER-type homologs, showing a closer relationship to Paramecium tetraurelia orthologs than to each other. These groups are derivatives of duplication events preceding speciation and have evolved in a conservative fashion under purifying selection. Second, our results revealed significant differences in the expression levels of these genes as well as significant heat-shock effects. While we observed constitutively expressed hsp70 genes for all three cellular compartments, indicating the general importance of these proteins, our analyses also unveiled a major inducible Hsp70-group. This group was assigned as a cytosolic member with a low basal expression level, but a striking up-regulation of approx. 84-fold after heat shock. A similar expression pattern could be found in P. tetraurelia via the analyses of different EST libraries, which suggests conserved functions of orthologous hsp70s, at least between these two Paramecium species. On the other hand, the constitutively expressed and heat-induced cytosol-type hsp70s evolved independently in Paramecium and in its close relative Tetrahymena, but also in higher eukaryotes. This fact provides evidence of convergent evolution during subfunctionalization of eukaryote hsp70s. In addition, the herein developed RT-qPCR assay provides a crucial tool for future selective studies examining the effect of several stresses on the hsp70 gene expression in Paramecium, a complex unicellular eukaryote that is an excellent model organism in cellular physiology, developmental biology, genome evolution and epigenetic inheritance.


Parameciumstock and culture conditions

The Paramecium caudatum stock GRL-1, isolated from a freshwater sample of a natural habitat in Greece, Livadia, was grown in a modified 0.25% Cerophyl infusion [62] inoculated with Pseudomonas fluorescens, as described previously [63]. The clonal stock culture was initiated from a single Paramecium cell to minimize locus heterozygosity. Wheat Grass Powder, used as the nutrient supply for the Cerophyl infusion, was purchased from GSE Vertrieb GmbH. The P. fluorescens strain SBW25 EeZY-6KX [64] was acquired from the University of Oxford. Cultures were maintained in microprocessor-controlled, cooled incubators obtained from BINDER GmbH (Type KB 53).

RNA isolation and cDNA preparation

RNA was isolated from 2 ml paramecia culture (~3500 cells ml-1) maintained at 25°C, and from 2 ml culture heat shocked for one hour at 35°C. Total RNA was prepared using the acid phenol-guanidinium thiocyanate procedure [65] with commercial TRIzol® Reagent (Invitrogen Life Technologies) as specified in the manufacturer’s protocol. The purified RNA was treated with RNase-free TURBO™ DNase (Ambion® Inc.), which was subsequently removed with DNase Inactivation Reagent (Ambion® Inc.) following the manufacturer’s protocol. The concentration and purity of total RNA was determined by absorption at OD260, OD260/280, OD260/230 and by gel electrophoresis. cDNA synthesis was carried out according to the instructions given in the SuperScript® III Reverse Transcriptase protocol (Invitrogen Life Technologies) using oligo(dT)18 primers (Fermentas Life Sciences).

PCR amplification and cDNA-library construction

Degenerated primers were designed for conserved regions of the Hsp70 family on the basis of an amino acid as well as a sequence alignment including hsp70 homologous genes of three major Hsp70-subfamilies (cytosol, endoplasmic reticulum, mitochondria) from different eukaryotes. The forward primer Hsp70ForDeg corresponded to the amino acid sequence IGIDLGT: 5-ATW GGH ATH GAY TTR GGW AC-3 and the reverse primer Hsp70RevDeg corresponded to the amino acid sequence PQIEVTF: 5-TCR AAD GWR ACT TCR ATT TRW GG-3. The amplification was performed in a final volume of 50 μl containing 20 ng cDNA, 1 mM of each primer, 1 U Phusion™ High-Fidelity DNA Polymerase (New England Biolabs Inc.), 1 × Phusion™ HF Buffer with 2 mM MgCl2, 200 μM of each dNTP and 1.5 μl DMSO-d6 (Sigma-Aldrich®). PCR conditions were as follows: one minute initial denaturation (98°C); 35 cycles of fifteen seconds at 98°C, thirty seconds at 50°C and fifty seconds at 72°C; and a final extension step of seven minutes at 72°C. PCR products were visualised using agarose gel electrophoresis with ethidium bromide staining and bands of the expected size (~1.6 kb) were excised. DNA was extracted using the NucleoSpin® Extract II kit following the manufacturer’s protocol. The purified DNA fragments were tailed by adding a 3 terminal ‘A’ overhang onto the PCR products using a non-proofreading Taq polymerase (Fermentas Life Sciences) and subsequently ligated into a pGEM®-T Vector (Promega GmbH) according to the manufacturer’s instructions. After transformation of competent E. coli JM109 cells, sixty of more than 200 clones that contained the expected insert size were sequenced with standard vector primers using the BigDye® Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems). Sequencing reactions were performed and analyzed in both directions and with specifically designed internal sequencing primers for full-length sequencing on an ABI 3100 Genetic Analyzer (Applied Biosystems).

To determine the levels of differential gene expression of the diverse hsp70 homologs within subsequent RT-qPCR runs, we further amplified two potential reference genes: the glyceraldehyde-3-phosphate dehydrogenase gene (GAPDH) and the elongation factor-1 alpha gene (EF-1α). Degenerated oligonucleotides, targeted to conserved regions of known GAPDH and EF-1α gene sequences from Paramecium tetraurelia, Tetrahymena thermophila and T. pyriformis, were used as primers to amplify the respective P. caudatum genes. The primers GAPDHFor1: 5-GGT AGA TTR GTW TTR AGA GC-3 and GAPDHRev: 5-CCS CAY TCR TTR TCR TAC CA-3 amplified a ~1kb fragment, and the primers EF1aFor_Deg: 5-GGW AAR TCH ACY WCH WSH GGT CAC-3 and EF1aRev_Deg: 5-GCR ACS RYT WVY TTC ATR TCT C-3 produced a ~1.4 kb fragment. Amplification and subsequent procedures were carried out according to the Hsp70 methods as described above with changes in the annealing temperatures of 56°C for GAPDH and 53°C for EF-1α. Furthermore, ten clones of the respective genes were sequenced to estimate the intra-individual sequence divergence.

cDNA sequence analyses and phylogenetic reconstruction

Sequence data were assembled using the programme Vector NTI v.10.1 (Invitrogen Corp.) and compared with the non-redundant sequence database using NCBI-BLAST to assign the hsp70 sequences to the three Hsp70-subfamilies: cytosol, ER and mitochondria. Sequence statistics, calculation of synonymous (d S ) and non-synonymous (d N ) site differences, and codon-based Z-tests using the Nei-Gojobori model [39] to estimate selective pressures were carried out with MEGA5 [66]. The automated GENECONV algorithm [67, 68] as implemented in the Recombination Detection Program (RDP4 Beta 4.18 [69]) was applied to detect potential gene conversion events between the different hsp70 homologs using the auto-mask option. Motif analyses were done with InterProScan [70] implemented in Geneious Pro v.5.4.2, which was also used for sequence logo calculations [71]. The P. caudatum Hsp70 amino acid sequences were aligned to homologous genes from different prokaryotes and eukaryotes available in GenBank, or derived from several genome databases (see Additional file 5: Table S3 for a complete list of accession numbers, descriptions and the sources used) using Probalign v.1.3 [72]. The final alignment contained 97 taxa and was manually trimmed to the length of P. caudatum Hsp70s, resulting in an overall length of 489 amino acids including gaps. The programme ProtTest v.3.0 [73] was used to find the most appropriate amino acid substitution model of the Hsp70 protein evolution required for Maximum-Likelihood (ML) calculations and Bayesian analysis (BA). The best-fit according to AICc was reported for the LG model [74] with I = 0.084 and Γ = 0.965. Maximum likelihood analyses were conducted with RAxML v.7.3.0 [75, 76] using the PROTGAMMALG model. The statistical robustness of the nodes of the inferred best-scoring ML tree was determined by 1,000 rapid bootstrap replicates. The Bayesian analysis was performed with MrBayes v.3.1.2 [77] that was customized to use the LG+I+Γ model. We carried out two independent runs for 2,000,000 generations, with a sampling frequency of 1,000 generations. Each run used four chains, one cold and three heated. From the sampled trees, we discarded the first 25% as burn-in, ensuring stable likelihood values. The remaining trees were used to compute the consensus tree. The programmes Probalign and the fast hybrid versions of RAxML and MrBayes were run on the CIPRES Science Gateway v.3.3 server [78]. The resulting phylogenetic trees were visualized and arranged for publication using FigTree v.1.3.1 and Corel Draw X4.

Reverse transcription quantitative real-time PCR

RNA was extracted from 3 × 1 ml P. caudatum culture (~750 cells ml-1) maintained at 28°C as the control treatment (optimum temperature) and from paramecia that were heat shocked for two hours at 34°C. The length of the heat treatment was chosen because of the results of a previous time-course experiment (1 h, 2 h and 4 h) revealing this duration as an appropriate value with comparatively high hsp70 gene induction and no Paramecium mortality (data not shown). The respective treatment temperatures were set to values around the previously determined optimum and maximum growth temperatures of P. caudatum[79]. The experiment was run with five biological replicates. RNA was prepared using RNeasy Mini spin columns (QIAGEN) following the manufacturer’s protocol for total RNA purification from animal cells. The buffer RLT was furnished with β-mercaptoethanol and pelleted paramecia cells from 1 ml culture were lysed for five minutes in 350 μl buffer RLT. One volume of 70% ethanol was added to each of the three corresponding lysates, which were consecutively transferred to the RNeasy Mini spin columns. All further steps were carried out according to the instructions, including the optional drying and the second elution step. Potentially leftover genomic DNA was digested by an RNase-free TURBO™ DNase (Ambion® Inc.) treatment. The DNase was subsequently removed with DNase Inactivation Reagent (Ambion® Inc.) following the manufacturer’s protocol. The RNA concentrations, purities and integrities of all preparations were determined by absorption at OD260, OD260/280 and OD260/230, by gel electrophoresis and with the Agilent 2100 Bioanalyzer™ (Agilent Technologies). The mean RNA yield per sample was 161±12 ng/μl (mean±SEM) and the RNA integrity number (RIN) for all specimens was ≥ 9.2 indicating high quality RNA [80]. For each replicate, 500 ng RNA was reverse transcribed into double stranded cDNA using the RevertAid™ H- M-MuLV reverse transcriptase and a mixture of oligo(dT)18 and random hexamer primers (Fermentas Life Sciences) in a final volume of 20 μl according to the manufacturer’s instructions.

The differential mRNA expression levels for five different Hsp70-groups (CY-A, CY-B, ER-A, ER-B, and MT) were quantified with an RT-qPCR assay using an ABI PRISM® 7300 Sequence Detector System (Applied Biosystems). The two reference genes, GAPDH and EF-1α, were used for normalization. The target specific primers and MGB™ hydrolysis probes (Table 2, see also Additional file 6: Figure S3) were designed and validated according to Applied Biosystems relative quantification guidelines using Primer Express® v.2.0 (Applied Biosystems). Each sample of five biological replicates was analyzed in technical triplicates. Each 20 μl reaction contained 1 × reaction buffer (Invitrogen, Platinum® quantitative PCR Supermix-UDG with Rox), 200 μM of each dNTP, 900 nM of each target specific primer, 250 nM of the target specific probe, 5 mM MgCl2 and 10 ng cDNA (10 ng of reverse transcribed RNA), or 10 ng RNA within the –RT control, respectively. Thermal cycling conditions were as follows: two minutes at 50°C as the initial step to activate the uracil glycosylase (UNG), two minutes at 95°C to inactivate the UNG and as the initial denaturation step, followed by 40 cycles for fifteen seconds at 95°C and thirty seconds at 60°C.

Table 2 Nucleotide sequences for the specific primers and MGB™ probes, melting temperatures ( T m ), fragment lengths and PCR efficiencies

Statistical analyses of RT-qPCR

The stability of the two reference genes GAPDH and EF-1α were evaluated with the geNorm v.3.5 applet [40]. The amplification efficiency-based method of the programme REST 2009 v.2.0.13 [81] was used for gene normalization and quantification. The significance of the differences between the controls (28°C) and the heat-shock treatments (34°C) was determined with a strong randomization (iteration) test as implemented in the programme. The test was performed with 10,000 random reallocations of samples and controls between the groups by counting the number of times the relative expression on the randomly assigned group was greater than the sample data (REST 2009, software manual).

Comparative EST library screening

To unveil if the detected differential hsp70 gene expression of P. caudatum is conserved among Paramecium, two EST libraries of P. tetraurelia were screened for specific gene sequences. The EST libraries were constructed at Genoscope - Centre National de Séquençage, France. Both libraries were created from total RNA of vegetative cells, using the CloneMiner cDNA library construction kit (Invitrogen). The first library was derived from P. tetraurelia cells grown at 27°C (NCBI dbEST ID 20983) and the second library from cells heat treated at 35°C (NCBI dbEST ID 20987). Both libraries were screened for orthologous EST sequences of the five P. caudatum Hsp70-groups using following P. tetraurelia nucleotide sequences (see GenBank accession numbers) as references: CY-A = CR932269 and CR933372; CY-B = CR933371, CR933370 and CR933369; ER-A = CR932268 and CR932267, ER-B = CR932266, MT = AF031366 and XM_001461342.1. Screening was performed using the Local Blast engine in Bioedit v. [82] with an e-value cut-off of e < E-100. EST counts for each P. tetraurelia ortholog were used to calculate an approximate expression level of the five Hsp70-groups quoted in transcripts per million.


  1. 1.

    Schlesinger MJ, Ashburner M, Tissieres A: Heat shock, from bacteria to man. 1982, Cold Spring Harbour: Cold Spring Harbour Laboratory Press

    Google Scholar 

  2. 2.

    Ritossa F: A new puffing pattern induced by temperature shock and DNP in Drosophila. Experientia. 1962, 18 (12): 571-573.

    CAS  Article  Google Scholar 

  3. 3.

    Boorstein WR, Craig EA: Transcriptional regulation of Ssa3, an Hsp70 gene from Saccharomyces cerevisiae. Mol Cell Biol. 1990, 10 (6): 3262-3267.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  4. 4.

    Mayer MP, Bukau B: Hsp70 chaperones: Cellular functions and molecular mechanism. Cell Mol Life Sci. 2005, 62 (6): 670-684.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  5. 5.

    Nover L: Heat shock response. 1991, Boca Raton, FL: CRC Press

    Google Scholar 

  6. 6.

    Pirkkala L, Nykanen P, Sistonen L: Roles of the heat shock transcription factors in regulation of the heat shock response and beyond. FASEB J. 2001, 15 (7): 1118-1131.

    CAS  PubMed  Article  Google Scholar 

  7. 7.

    Bukau B, Weissman J, Horwich A: Molecular chaperones and protein quality control. Cell. 2006, 125 (3): 443-451.

    CAS  PubMed  Article  Google Scholar 

  8. 8.

    Daugaard M, Rohde M, Jaattela M: The heat shock protein 70 family: Highly homologous proteins with overlapping and distinct functions. FEBS Lett. 2007, 581 (19): 3702-3710.

    CAS  PubMed  Article  Google Scholar 

  9. 9.

    Hartl FU: Molecular chaperones in cellular protein folding. Nature. 1996, 381 (6583): 571-580.

    CAS  PubMed  Article  Google Scholar 

  10. 10.

    Borchiellini C, Boury-Esnault N, Vacelet J, Le Parco Y: Phylogenetic analysis of the Hsp70 sequences reveals the monophyly of metazoa and specific phylogenetic relationships between animals and fungi. Mol Biol Evol. 1998, 15 (6): 647-655.

    CAS  PubMed  Article  Google Scholar 

  11. 11.

    Boorstein WR, Ziegelhoffer T, Craig EA: Molecular evolution of the Hsp70 multigene family. J Mol Evol. 1994, 38 (1): 1-17.

    CAS  PubMed  Article  Google Scholar 

  12. 12.

    Gupta RS, Golding GB: Evolution of Hsp70 gene and its implications regarding relationships between archaebacteria, eubacteria, and eukaryotes. J Mol Evol. 1993, 37 (6): 573-582.

    CAS  PubMed  Article  Google Scholar 

  13. 13.

    Gupta RS, Singh B: Phylogenetic analysis of 70 kD heat shock protein sequences suggests a chimeric origin for the eukaryotic cell nucleus. Curr Biol. 1994, 4 (12): 1104-1114.

    CAS  PubMed  Article  Google Scholar 

  14. 14.

    Fraga J, Montalvo AM, De Doncker S, Dujardin JC, Van der Auwera G: Phylogeny of Leishmania species based on the heat-shock protein 70 gene. Infect Genet Evol. 2010, 10 (2): 238-245.

    CAS  PubMed  Article  Google Scholar 

  15. 15.

    Simpson AGB, Gill EE, Callahan HA, Litaker RW, Roger AJ: Early evolution within kinetoplastids (Euglenozoa), and the late emergence of trypanosomatids. Protist. 2004, 155 (4): 407-422.

    PubMed  Article  Google Scholar 

  16. 16.

    Budin K, Philippe H: New insights into the phylogeny of eukaryotes based on ciliate Hsp70 sequences. Mol Biol Evol. 1998, 15 (8): 943-956.

    CAS  PubMed  Article  Google Scholar 

  17. 17.

    Hori M, Tomikawa I, Przybos E, Fujishima M: Comparison of the evolutionary distances among syngens and sibling species of Paramecium. Mol Phyl Evol. 2006, 38 (3): 697-704.

    CAS  Article  Google Scholar 

  18. 18.

    Gething MJ, Sambrook J: Protein folding in the cell. Nature. 1992, 355 (6355): 33-45.

    CAS  PubMed  Article  Google Scholar 

  19. 19.

    Tavaria M, Gabriele T, Kola I, Anderson RL: A hitchhiker's guide to the human Hsp70 family. Cell Stress Chaperon. 1996, 1 (1): 23-28.

    CAS  Article  Google Scholar 

  20. 20.

    Nikolaidis N, Nei M: Concerted and nonconcerted evolution of the Hsp70 gene superfamily in two sibling species of nematodes. Mol Biol Evol. 2004, 21 (3): 498-505.

    CAS  PubMed  Article  Google Scholar 

  21. 21.

    Bettencourt BR, Feder ME: Hsp70 duplication in the Drosophila melanogaster species group: How and when did two become five?. Mol Biol Evol. 2001, 18 (7): 1272-1282.

    CAS  PubMed  Article  Google Scholar 

  22. 22.

    Kudla G, Helwak A, Lipinski L: Gene conversion and GC-content evolution in mammalian Hsp70. Mol Biol Evol. 2004, 21 (7): 1438-1444.

    CAS  PubMed  Article  Google Scholar 

  23. 23.

    Martin AP, Burg TM: Perils of paralogy: Using HSP70 genes for inferring organismal phylogenies. Syst Biol. 2002, 51 (4): 570-587.

    PubMed  Article  Google Scholar 

  24. 24.

    Karlin S, Brocchieri L: Heat shock protein 70 family: Multiple sequence comparisons, function, and evolution. J Mol Evol. 1998, 47 (5): 565-577.

    CAS  PubMed  Article  Google Scholar 

  25. 25.

    Becker J, Craig EA: Heat-shock proteins as molecular chaperones. Eur J Biochem. 1994, 219 (1–2): 11-23.

    CAS  PubMed  Article  Google Scholar 

  26. 26.

    Feder ME, Hofmann GE: Heat-shock proteins, molecular chaperones, and the stress response: Evolutionary and ecological physiology. Annu Rev Physiol. 1999, 61: 243-282.

    CAS  PubMed  Article  Google Scholar 

  27. 27.

    Genevaux P, Georgopoulos C, Kelley WL: The Hsp70 chaperone machines of Escherichia coli: a paradigm for the repartition of chaperone functions. Mol Microbiol. 2007, 66 (4): 840-857.

    CAS  PubMed  Article  Google Scholar 

  28. 28.

    Matsumoto R, Akama K, Rakwal R, Iwahashi H: The stress response against denatured proteins in the deletion of cytosolic chaperones SSA1/2 is different from heat-shock response in Saccharomyces cerevisiae. BMC Genomics. 2005, 6:

    Google Scholar 

  29. 29.

    Park HO, Craig EA: Positive and negative regulation of basal expression of a yeast HSP70 gene. Mol Cell Biol. 1989, 9 (5): 2025-2033.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  30. 30.

    Eisen JA, Coyne RS, Wu M, Wu DY, Thiagarajan M, Wortman JR, Badger JH, Ren QH, Amedeo P, Jones KM: Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote. PLoS Biol. 2006, 4 (9): 1620-1642.

    CAS  Article  Google Scholar 

  31. 31.

    Aury JM, Jaillon O, Duret L, Noel B, Jubin C, Porcel BM, Segurens B, Daubin V, Anthouard V, Aiach N: Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia. Nature. 2006, 444 (7116): 171-178.

    CAS  PubMed  Article  Google Scholar 

  32. 32.

    Feng LF, Chang Y, Yuan DX, Miao W: Expression analysis of 5 hsp70 genes in Tetrahymena thermophila [Article in Chinese]. Dongwuxue Yanjiu. 2011, 32 (3): 267-276.

    CAS  PubMed  Google Scholar 

  33. 33.

    Sigrist CJA, Cerutti L, Hulo N, Gattiker A, Falquet L, Pagni M, Bairoch A, Bucher P: PROSITE: A documented database using patterns and profiles as motif descriptors. Brief Bioinform. 2002, 3 (3): 265-274.

    CAS  PubMed  Article  Google Scholar 

  34. 34.

    Dingwall C, Laskey RA: Nuclear targeting sequences - a consensus?. Trends Biochem Sci. 1991, 16 (12): 478-481.

    CAS  PubMed  Article  Google Scholar 

  35. 35.

    Rensing SA, Maier UG: Phylogenetic analysis of the stress-70 protein family. J Mol Evol. 1994, 39 (1): 80-86.

    PubMed  Article  Google Scholar 

  36. 36.

    Blackburn EH, Karrer KM: Genomic reorganization in ciliated protozoans. Annu Rev Genet. 1986, 20: 501-521.

    CAS  PubMed  Article  Google Scholar 

  37. 37.

    Sung W, Tucker AE, Doak TG, Choi E, Thomas WK, Lynch M: Extraordinary genome stability in the ciliate Paramecium tetraurelia. Proc Natl Acad Sci USA. 2012, 109 (47): 19339-19344.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  38. 38.

    Arnaiz O, Cain S, Cohen J, Sperling L: ParameciumDB: a community resource that integrates the Paramecium tetraurelia genome sequence with genetic data. Nucleic Acids Res. 2007, 35: D439-D444.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  39. 39.

    Nei M, Gojobori T: Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 1986, 3 (5): 418-426.

    CAS  PubMed  Google Scholar 

  40. 40.

    Vandesompele J, De Preter K, Pattyn F, Poppe B, Van Roy N, De Paepe A, Speleman F: normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol. 2002, 3 (7): research0034–-research0034.11.

    PubMed Central  PubMed  Article  Google Scholar 

  41. 41.

    Santacruz H, Vriz S, Angelier N: Molecular characterization of a heat shock cognate cDNA of zebrafish, hsc70, and developmental expression of the corresponding transcripts. Dev Genet. 1997, 21 (3): 223-233.

    CAS  PubMed  Article  Google Scholar 

  42. 42.

    Hill JK, Thomas CD, Huntley B: Climate and habitat availability determine 20th century changes in a butterfly's range margin. Proc R Soc Lond Ser B-Biol Sci. 1999, 266 (1425): 1197-1206.

    Article  Google Scholar 

  43. 43.

    Goldbaum O, Richter-Landsberg C: Stress proteins in oligodendrocytes: differential effects of heat shock and oxidative stress. J Neurochem. 2001, 78 (6): 1233-1242.

    CAS  PubMed  Article  Google Scholar 

  44. 44.

    Fangue NA: Intraspecific variation in thermal tolerance and heat shock protein gene expression in common killifish, Fundulus heteroclitus. J Exp Biol. 2006, 209 (15): 2859-2872.

    CAS  PubMed  Article  Google Scholar 

  45. 45.

    Yu T, Barchetta S, Pucciarelli S, La Terza A, Miceli C: A novel robust heat-inducible promoter for heterologous gene expression in Tetrahymena thermophila. Protist. 2012, 163 (2): 284-295.

    CAS  PubMed  Article  Google Scholar 

  46. 46.

    Hori M, Fujishima M: The endosymbiotic bacterium Holospora obtusa enhances heat-shock gene expression of the host Paramecium caudatum. J Eukaryot Microbiol. 2003, 50 (4): 293-298.

    CAS  PubMed  Article  Google Scholar 

  47. 47.

    Chang SC, Wooden SK, Nakaki T, Kim YK, Lin AY, Kung L, Attenello JW, Lee AS: Rat gene encoding the 78-kDa glucose-regulated protein Grp78 - Its regulatory sequences and the effect of protein glycosylation on its expression. Proc Natl Acad Sci USA. 1987, 84 (3): 680-684.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  48. 48.

    Tillman JB, Mote PL, Walford RL, Spindler SR: Structure and regulation of the mouse GRP78 (BiP) promoter by glucose and calcium ionophore. Gene. 1995, 158 (2): 225-229.

    CAS  PubMed  Article  Google Scholar 

  49. 49.

    Schröder HC, Hassanein HMA, Lauenroth S, Koziol C, Mohamed T, Lacorn M, Steinhart H, Batel R, Müller WEG: Induction of DNA strand breaks and expression of HSP70 and GRP78 homolog by cadmium in the marine sponge Suberites domuncula. Arch Environ Contam Toxicol. 1999, 36 (1): 47-55.

    PubMed  Article  Google Scholar 

  50. 50.

    Aoki T, Koike T, Nakano T, Shibahara K, Kondo S, Kikuchi H, Honjo T: Induction of Bip mRNA upon programmed cell death of differentiated PC12 cells as well as rat sympathetic neurons. J Biochem. 1997, 121 (1): 122-127.

    CAS  PubMed  Article  Google Scholar 

  51. 51.

    Kozutsumi Y, Segal M, Normington K, Gething MJ, Sambrook J: The presence of malfolded proteins in the endoplasmic reticulum signals the induction of glucose-regulated proteins. Nature. 1988, 332 (6163): 462-464.

    CAS  PubMed  Article  Google Scholar 

  52. 52.

    Hori M, Fujii K, Fujishima M: Micronucleus-specific bacterium Holospora elegans irreversibly enhances stress gene expression of the host Paramecium caudatum. J Eukaryot Microbiol. 2008, 55 (6): 515-521.

    CAS  PubMed  Article  Google Scholar 

  53. 53.

    Craig EA, Gambill BD, Nelson RJ: Heat shock proteins: Molecular chaperones of protein biogenesis. Microbiol Rev. 1993, 57 (2): 402-414.

    CAS  PubMed Central  PubMed  Google Scholar 

  54. 54.

    Okamoto K, Brinker A, Paschen SA, Moarefi I, Hayer-Hartl M, Neupert W, Brunner M: The protein import motor of mitochondria: a targeted molecular ratchet driving unfolding and translocation. EMBO J. 2002, 21 (14): 3659-3671.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  55. 55.

    Kabani M, Martineau CN: Multiple hsp70 isoforms in the eukaryotic cytosol: mere redundancy or functional specificity?. Curr Genomics. 2008, 9 (5): 338-248.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  56. 56.

    Coyne RS, Hannick L, Shanmugam D, Hostetler JB, Brami D, Joardar VS, Johnson J, Radune D, Singh I, Badger JH: Comparative genomics of the pathogenic ciliate Ichthyophthirius multifiliis, its free-living relatives and a host species provide insights into adoption of a parasitic lifestyle and prospects for disease control. Genome Biol. 2011, 12 (10): R100-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  57. 57.

    Eisen JA, Coyne RS, Wu M, Wu D, Thiagarajan M, Wortman JR, Badger JH, Ren Q, Amedeo P, Jones KM: Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote. PLoS Biol. 2006, 4 (9): e286-

    PubMed Central  PubMed  Article  Google Scholar 

  58. 58.

    van Hoek MJ, Hogeweg P: Metabolic adaptation after whole genome duplication. Mol Biol Evol. 2009, 26 (11): 2441-2453.

    CAS  PubMed  Article  Google Scholar 

  59. 59.

    Fokin S: Morphological and molecular investigations of Paramecium schewiakoffi sp. nov. (Ciliophora, Oligohymenophorea) and current status of distribution and taxonomy of Paramecium spp. Eur J Protistol. 2004, 40 (3): 225-243.

    Article  Google Scholar 

  60. 60.

    Parfrey LW, Lahr DJG, Knoll AH, Katz LA: Estimating the timing of early eukaryotic diversification with multigene molecular clocks. Proc Natl Acad Sci USA. 2011, 108 (33): 13624-13629.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  61. 61.

    Wright ADG, Lynn DH: Maximum ages of ciliate lineages estimated using a small subunit rRNA molecular clock: Crown eukaryotes date back to the paleoproterozoic. Arch Protistenkd. 1997, 148 (4): 329-341.

    Article  Google Scholar 

  62. 62.

    Sonneborn TM: Methods in Paramecium research. Methods in Cell Physiology. Edited by: Prescott DM. 1970, New York: Academic Press, 241-339.

    Google Scholar 

  63. 63.

    Krenek S, Berendonk TU, Petzoldt T: Thermal performance curves of Paramecium caudatum: A model selection approach. Eur J Protistol. 2011, 47 (2): 124-137.

    PubMed  Article  Google Scholar 

  64. 64.

    Bailey MJ, Lilley AK, Thompson IP, Rainey PB, Ellis RJ: Site directed chromosomal marking of a fluorescent pseudomonad isolated from the phytosphere of sugar beet; Stability and potential for marker gene transfer. Mol Ecol. 1995, 4 (6): 755-763.

    CAS  PubMed  Article  Google Scholar 

  65. 65.

    Chomczynski P, Sacchi N: Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal Biochem. 1987, 162 (1): 156-159.

    CAS  PubMed  Article  Google Scholar 

  66. 66.

    Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: Molecular Evolutionary Genetics Analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28 (10): 2731-2739.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  67. 67.

    Padidam M, Sawyer S, Fauquet CM: Possible emergence of new geminiviruses by frequent recombination. Virology. 1999, 265 (2): 218-225.

    CAS  PubMed  Article  Google Scholar 

  68. 68.

    Sawyer SA: A computer package for the statistical detection of gene conversion. 2007, Louis: Distributed by the author, Department of Mathematics, Washington University in St, available at

    Google Scholar 

  69. 69.

    Martin DP, Lemey P, Lott M, Moulton V, Posada D, Lefeuvre P: RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics. 2010, 26 (19): 2462-2463.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  70. 70.

    Zdobnov EM, Apweiler R: InterProScan–an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001, 17 (9): 847-848.

    CAS  PubMed  Article  Google Scholar 

  71. 71.

    Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990, 18 (20): 6097-6100.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  72. 72.

    Roshan U, Livesay DR: Probalign: multiple sequence alignment using partition function posterior probabilities. Bioinformatics. 2006, 22 (22): 2715-2721.

    CAS  PubMed  Article  Google Scholar 

  73. 73.

    Darriba D, Taboada GL, Doallo R, Posada D: ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics. 2011, 27 (8): 1164-1165.

    CAS  PubMed  Article  Google Scholar 

  74. 74.

    Le SQ, Gascuel O: An improved general amino acid replacement matrix. Mol Biol Evol. 2008, 25 (7): 1307-1320.

    CAS  PubMed  Article  Google Scholar 

  75. 75.

    Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006, 22 (21): 2688-2690.

    CAS  PubMed  Article  Google Scholar 

  76. 76.

    Stamatakis A, Hoover P, Rougemont J: A rapid bootstrap algorithm for the RAxML web servers. Syst Biol. 2008, 57 (5): 758-771.

    PubMed  Article  Google Scholar 

  77. 77.

    Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574.

    CAS  PubMed  Article  Google Scholar 

  78. 78.

    Miller MA, Pfeiffer W, Schwartz T: The CIPRES science gateway: a community resource for phylogenetic analyses. Proceedings of the 2011 TeraGrid Conference: Extreme Digital Discovery. 2011, Salt Lake City, Utah: ACM, 1-8.

    Chapter  Google Scholar 

  79. 79.

    Krenek S, Petzoldt T, Berendonk TU: Coping with temperature at the warm edge - Patterns of thermal adaptation in the microbial eukaryote Paramecium caudatum. PLoS One. 2012, 7 (3): e30598-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  80. 80.

    Schroeder A, Mueller O, Stocker S, Salowsky R, Leiber M, Gassmann M, Lightfoot S, Menzel W, Granzow M, Ragg T: The RIN: an RNA integrity number for assigning integrity values to RNA measurements. BMC Mol Biol. 2006, 7: 3-

    PubMed Central  PubMed  Article  Google Scholar 

  81. 81.

    Pfaffl MW, Horgan GW, Dempfle L: Relative expression software tool (REST©) for group-wise comparison and statistical analysis of relative expression results in real-time PCR. Nucleic Acids Res. 2002, 30 (9): e36-

    PubMed Central  PubMed  Article  Google Scholar 

  82. 82.

    Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999, 41: 95-98.

    CAS  Google Scholar 

Download references


We thank T.G. Doak and C. Bleidorn for helpful suggestions and discussions. An earlier version of this manuscript has been improved following the valuable comments and corrections of three anonymous reviewers. This study has been partially supported by the German Research Foundation (DFG) within the priority programmes AQUASHIFT (BE-2299/3-1, 3–2) and Host-Parasite Coevolution (BE-2299/5-1) and by the European Commission FP7-PEOPLE-2009-IRSES project CINAR PATHOBACTER (247658). We gratefully acknowledge further financial support by the DFG and the Open Access Publication Funds of the TU Dresden.

Author information



Corresponding author

Correspondence to Sascha Krenek.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

SK participated in the study design, carried out the molecular, experimental and phylogenetic investigations, performed the statistical analyses and drafted the manuscript. MS contributed to the discussion and interpretation of the results and manuscript preparation. TUB conceived of this study, participated in its design and coordination and manuscript preparation. All authors read and approved the final manuscript.

Electronic supplementary material

Additional file 1: Table S1: Pairwise evolutionary distances between P. caudatum Hsp70 sequences. (PDF 23 kb) (PDF 24 KB)

Additional file 2: Figure S1: Paramecium caudatum Hsp70 amino acid alignment with indicated motifs and family signatures. (PDF 25 KB)

Additional file 3: Table S2: Codon-based test of purifying selection for analysis between P. caudatum Hsp70 sequences. (PDF 19 KB)

Additional file 4: Figure S2: Uncollapsed Maximum-Likelihood tree of pro- and eukaryotic Hsp70s. (PDF 39 KB)


Additional file 5: Table S3: List of accession numbers, descriptions and sources used for phylogenetic analyses. (XLSX 19 KB)


Additional file 6: Figure S3: Paramecium caudatum Hsp70 nucleotide sequence alignment with indicated binding sites for primer and MGB™ TaqMan® hydrolysis probes. (PDF 1 MB)

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Krenek, S., Schlegel, M. & Berendonk, T.U. Convergent evolution of heat-inducibility during subfunctionalization of the Hsp70 gene family. BMC Evol Biol 13, 49 (2013).

Download citation


  • Ciliate
  • Convergent evolution
  • DnaK
  • Gene duplication
  • Heat-inducibility
  • Heat-shock proteins
  • Molecular chaperones
  • Paramecium
  • RT-qPCR
  • Temperature stress