- Research article
- Open Access
A survey of green plant tRNA 3'-end processing enzyme tRNase Zs, homologs of the candidate prostate cancer susceptibility protein ELAC2
BMC Evolutionary Biology volume 11, Article number: 219 (2011)
tRNase Z removes the 3'-trailer sequences from precursor tRNAs, which is an essential step preceding the addition of the CCA sequence. tRNase Z exists in the short (tRNase ZS) and long (tRNase ZL) forms. Based on the sequence characteristics, they can be divided into two major types: bacterial-type tRNase ZS and eukaryotic-type tRNase ZL, and one minor type, Thermotoga maritima (TM)-type tRNase ZS. The number of tRNase Zs is highly variable, with the largest number being identified experimentally in the flowering plant Arabidopsis thaliana. It is unknown whether the presence of multiple tRNase Zs, as found in A. thaliana, is common to the plant kingdom. Also unknown is the extent of sequence and structural conservation among tRNase Zs from the plant kingdom.
We report the identification and analysis of candidate tRNase Zs in 27 fully sequenced genomes of green plants, the great majority of which are flowering plants. It appears that green plants contain multiple distinct tRNase Zs predicted to reside in different subcellular compartments. Furthermore, while the bacterial-type tRNase ZSs are present only in basal land plants and green algae, the TM-type tRNase ZSs are widespread in green plants. The protein sequences of the TM-type tRNase ZSs identified in green plants are similar to those of the bacterial-type tRNase ZSs but have distinct features, including the TM-type flexible arm, the variant catalytic HEAT and HST motifs, and a lack of the PxKxRN motif involved in CCA anti-determination (inhibition of tRNase Z activity by CCA), which prevents tRNase Z cleavage of mature tRNAs. Examination of flowering plant chloroplast tRNA genes reveals that many of these genes encode partial CCA sequences. Based on our results and previous studies, we predict that the plant TM-type tRNase ZSs may not recognize the CCA sequence as an anti-determinant.
Our findings substantially expand the current repertoire of the TM-type tRNase ZSs and hint at the possibility that these proteins may have been selected for their ability to process chloroplast pre-tRNAs with whole or partial CCA sequences. Our results also support the coevolution of tRNase Zs and tRNA 3'-trailer sequences in plants.
tRNA 3'-end maturation is a process through which the 3'-trailer sequence of precursor tRNAs (pre-tRNAs) is removed, and processed tRNAs acquire the CCA end which is absolutely essential for tRNA aminoacylation and protein synthesis (for reviews, see [1–3]). In prokaryotes, this process can be either exonucleolytic or endonucleolytic depending on whether the 3'-CCA sequence is genomically encoded. CCA-containing pre-tRNAs are generally processed by the exonucleases that tend to stop removing nucleotides from the 3'-end upon encountering the transcriptionally encoded CCA, whereas CCA-less pre-tRNAs are processed by a 3'-endonuclease termed tRNase Z (also termed RNase Z or 3'-tRNase; for reviews, see [4–7]) that cleaves immediately after the N73 discriminator nucleotide (the first unpaired base after the acceptor stem) to allow subsequent addition of the CCA sequence.
Unlike prokaryotic pre-tRNAs, eukaryotic nuclear and organellar pre-tRNAs generally lack the 3'-CCA sequence (which is added post-transcriptionally) and their 3'-trailer sequences are removed by tRNase Z. Also unlike prokaryotic pre-tRNAs, eukaryotic nuclear pre-tRNAs contain oligo(U) at their 3'-ends, which is recognized and bound by the La protein (for reviews see [2, 8]). In the budding yeast Saccharomyces cerevisiae and fission yeast Schizosaccharomyces pombe, the endonucleolytic cleavage of nuclear pre-tRNAs requires the presence of the yeast La protein [9, 10]. In the absence of the yeast La protein, the 3'-trailer sequence of nuclear pre-tRNAs is trimmed by 3'-exoribonucleases including Rex1p. However, organellar pre-tRNAs lack terminal oligo(U). Furthermore, unlike nuclear pre-tRNAs, which are typically monocistronic, most organellar pre-tRNAs are polycistronic [12, 13].
tRNase Z is present in all kingdoms of life. It exists in two forms: tRNase ZS [300-400 amino acids (aa)] and tRNase ZL (700-800 aa), which are encoded by different genes. It is believed that the tRNase ZL gene has evolved from a tandem duplication of the tRNase ZS gene, followed by divergence of the sequence . In prokaryotes, only tRNase ZS is identified. By contrast, all eukaryotes possess tRNase ZL, and some have both forms.
The species distribution of tRNase Z is complex. The majority of eukaryotic species analyzed to date, including S. cerevisiae, the fruit fly Drosophila melanogaster and the nematode worm Caenorhabditis elegans, contain a single tRNase ZL [15–17]. In contrast, S. pombe has two tRNase ZLs [18, 19]. Interestingly, two tRNase ZLs and two tRNase ZSs have been experimentally identified in the flowering plant Arabidopsis thaliana. In humans, one tRNase ZS (also termed ELAC1) and one tRNase ZL (also termed ELAC2) are found. Our BLAST searches against public genomic and expressed sequence tag (EST) databases reveal that, with few exceptions, vertebrates contain one tRNase ZL and one tRNase ZS (a detailed description of tRNase Z protein distribution in the animal kingdom will be provided elsewhere).
tRNase Z belongs to the metallo-β-lactamase (MBL) superfamily [14, 21–24]. The typical MBL domain contains five conserved sequence motifs termed Motifs I-V. Motifs I and IV each harbor an invariant Asp, Motif II (HxHxDH), which is also called the His motif, is the signature motif of the superfamily, whereas Motifs III and V each contain a conserved His residue. Structural studies of tRNase ZSs from E. coli, T. maritima and B. subtilis [25–28] and mutation analyses of tRNase Zs from a variety of species [29–35] reveal that the His and Asp residues of Motifs II-V form the active site for coordination of two catalytic zinc ions. In particular, the Asp residue of Motif II may participate in both zinc ion coordination and act as a general base to generate a hydroxide ion for nucleophilic attack on the scissile phosphodiester bond at the cleavage site [25, 29]. The Asp residue of Motif I is also catalytically important and appears to stabilize the catalytic site .
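As an illustration of how the Motif II signature described above could be located in a protein sequence, the following is a minimal Python sketch. It assumes only the HxHxDH pattern given in the text (x being any residue); the function name and the toy sequence are hypothetical and are not taken from this study.

```python
import re

# Motif II ("His motif") as written in the text: H-x-H-x-D-H, where x is any residue.
# A lookahead is used so that overlapping occurrences would also be reported.
HIS_MOTIF = re.compile(r"(?=(H.H.DH))")


def find_his_motif(protein):
    """Return a list of (1-based start position, matched residues) for each HxHxDH hit."""
    seq = protein.upper()
    return [(m.start() + 1, m.group(1)) for m in HIS_MOTIF.finditer(seq)]


if __name__ == "__main__":
    # Toy sequence for illustration only, not a real tRNase Z.
    print(find_his_motif("MKLTFLGHSHGDHAAG"))
```

A pattern match of this kind only flags a candidate site; whether the residues are positioned to coordinate zinc still has to be judged from an alignment or a structure.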
Besides tRNase Zs, some nucleic acid processing enzymes are also members of the MBL superfamily. Most of these proteins belong to the β-CASP (MBL-associated CPSF Artemis SNM1/PSO2) subfamily of the MBL superfamily. This subfamily includes the 73-kD subunit of the cleavage and polyadenylation specificity factor (CPSF-73) and its yeast homolog Ysh1p, which are involved in endonucleolytic cleavage of pre-mRNA; the Integrator complex subunit 11 (Int11), which is involved in the 3'-end formation of small nuclear RNAs (snRNAs); bacterial RNase J, which participates in rRNA 5'-end maturation and RNA decay; and the eukaryotic Pso/Snm1/Artemis proteins, which function in DNA repair and V(D)J recombination. However, unlike tRNase Zs, β-CASP proteins contain conserved β-CASP sequence motifs in place of Motif V.
tRNase Z is distinguished from other MBL members by its unique substrate binding domain termed the flexible arm (also termed the exosite). Based on flexible arm type, there are two major types (bacterial- and eukaryotic-types) and one minor type [T. maritima (TM)-type] of tRNase Zs. The bacterial-type tRNase Zs, which are present predominantly in bacteria, possess the bacterial-type flexible arm. The bacterial-type flexible arm is ~55 aa in length and contains the Gly- and Pro-rich GP motif (GxPxGP, sometimes GxPPGP). The eukaryotic-type tRNase Zs, which are found only in eukaryotes, contain the ELAC2-type flexible arm. This type of flexible arm also harbors the GP motif and is ~62 aa long, slightly longer than the bacterial-type flexible arm.
The TM-type tRNase Z was believed to be the minor type at the time of discovery since it was found only in T. maritima and A. thaliana . The flexible arm found in TM-type tRNase Zs appears to be shorter (~30 aa) and lacks the GP motif but instead contains one short basic residue-rich region . In addition, both the bacterial- and eukaryotic-type tRNase Zs contain the PxKxRN, HEAT and HST motifs, which form part of loop structures, whereas the TM-type tRNase Z lacks these motifs [33, 40, 41]. The PxKxRN motif has been suggested to function in CCA anti-determination (tRNase Z activity is inhibited by 3'-CCA) [25, 33], whereas the HEAT and HST motifs have been suggested to play a role in facilitating proton transfer at the final stage of reaction [25, 29, 40].
tRNase Z has diverse functions besides its primary role in tRNA 3'-end processing. This is perhaps best exemplified by ELAC2, which serves a multitude of functions within cells. Recent studies have shown that ELAC2 is involved in the generation of MALAT1, a cancer-associated long noncoding RNA that participates in regulation of pre-mRNA splicing, as well as tRNA-derived small RNAs [43, 44] and viral microRNAs (miRNAs) [45, 46]. Overexpression of ELAC2 delays cell cycle progression, suggesting that ELAC2 may be involved in cell cycle control either directly or indirectly via its role as a tRNA processing enzyme. ELAC2 also potentiates the TGF-β (transforming growth factor-β)/Smad-induced transcriptional response, indicating a role for ELAC2 in TGF-β/Smad signaling-mediated growth arrest. Interestingly, a recent study has shown that destruction of human mitochondria through depletion of mitochondrial DNA results in down-regulation of ELAC2 and a delay in cell cycle progression. Since ELAC2 may be involved in cell cycle regulation, it may link mitochondrial function and cell cycle control. It is important to note that ELAC2 is a candidate prostate cancer susceptibility gene, as its mutations are associated with prostate cancer. However, the underlying mechanisms are unknown. In S. cerevisiae, either inactivating mutations or overexpression of tRNase ZL causes a petite phenotype, suggesting that the action of tRNase ZL may be related to mitochondrial function. In addition, the S. cerevisiae tRNase ZL has also been suggested to play a role in 35S rRNA processing.
The study of tRNase Z evolution has been facilitated by the increasing availability of genome sequences. A previous study showed that only tRNase ZS is found in bacteria and that its presence in bacteria is widespread . We recently reported on a systematic survey of tRNase Zs in fungi . Our analysis reveals that while the majority of fungal species contain one tRNase ZL, all four sequenced Schizosaccharomyces species contain two distinct tRNase ZLs either demonstrated or predicted to be localized to the nucleus and mitochondria, respectively. In addition, the presence of tRNase ZS in fungi is restricted to the phylum Basidiomycota and the basal fungal phyla.
Green plants (Viridiplantae) represent a monophyletic group of land plants and green algae that evolved near the base of the tree of eukaryotic life. Flowering plants (angiosperms), which are typically polyploid, represent the largest, most diverse and most evolutionarily advanced phylum of land plants, making up 90% of the plant kingdom. They can be divided into two major groups: dicotyledons (dicots), which account for the majority of the angiosperm species, and monocotyledons (monocots). At present, there are at least 27 sequenced and annotated genomes representing the major taxonomic groups within green plants, although the majority of them are those of flowering plants. The public availability of these genome sequences enabled us to identify tRNase Zs in green plants and to study their evolution.
In this study, we undertook a comprehensive survey of candidate tRNase Zs from annotated green plant genomes. To understand the evolutionary relationships among green plant tRNase Zs, we further conducted a phylogenetic analysis of these newly identified candidates. Finally, we presented a detailed sequence analysis of tRNase Zs with the intent of further delineating the distinct features of green plant tRNase Zs.
Identification of candidate green plant tRNase Zs
To extend our previous study of tRNase Z diversity and evolution, we searched public genome databases for putative green plant tRNase Zs with significant matches to known bacterial and eukaryotic tRNase Zs. Since most of the candidate sequences identified from the databases are computationally generated without subsequent manual annotation, many predictions are likely to contain errors. Therefore, we verified each candidate. We first validated each prediction by reciprocal searches against GenBank. In back-searches, a candidate was confirmed if the reverse BLAST search also gave tRNase Z hits among the top matches. Accuracy of prediction was further evaluated by multiple sequence alignment. All discordant candidate sequences were checked manually for possible errors including sequencing errors, intron mispredictions and gaps in the genome sequences. We found that many candidate sequences are apparently incomplete or contain annotation errors. For example, the predicted coding sequence of the monkeyflower (Mimulus guttatus) MguTRZ2 (Phytozome accession no. mgv1a024577 m.g) in the database was incomplete, lacking the N-terminal region. We were able to predict this region from the genomic DNA sequence based on sequence similarity. The predicted full-length coding region of MguTRZ2 encodes 364 aa. As another example, the sequence annotated as the candidate castor bean (Ricinus communis) tRNase ZL (Phytozome accession no. 30146.t000117) appears to be mispredicted due to the presence of sequence gaps. Thus, this sequence was excluded from the list.
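The back-search criterion described above can be sketched as a simple filter over the descriptions of the top hits returned by a reverse BLAST search. This is only an illustrative sketch: the keyword list, the cutoff of five top hits and the function name are assumptions made here, since the text does not specify how many top matches were inspected or how hits were recognized as tRNase Z.

```python
# Hypothetical keywords used to recognize a tRNase Z hit from its description line.
# These are an assumption for illustration; the study does not list its criteria.
TRNASE_Z_KEYWORDS = ("trnase z", "rnase z", "ribonuclease z", "elac1", "elac2")


def is_confirmed(hit_descriptions, top_n=5):
    """Return True if any of the first `top_n` hit descriptions mentions tRNase Z.

    `hit_descriptions` is a list of description strings, ordered from best to
    worst hit, as parsed from a reverse BLAST search of the candidate.
    """
    top_hits = hit_descriptions[:top_n]
    return any(
        keyword in description.lower()
        for description in top_hits
        for keyword in TRNASE_Z_KEYWORDS
    )


if __name__ == "__main__":
    hits = ["hypothetical protein", "ribonuclease Z [Arabidopsis thaliana]"]
    print(is_confirmed(hits))
```

A keyword filter of this kind is only a first pass; as noted above, candidates were further evaluated by multiple sequence alignment and manual inspection.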
Several incorrect predictions are apparently caused by the presence of non-canonical GC-AG splice site pairs. While the GT donor splice site is the canonical 5'-splice site for introns in eukaryotic genes, GC donor splice sites account for the majority of the non-canonical donor splice sites. Thus, for those candidates that could not be accurately predicted by the conventional FGENESH program, we carried out gene prediction using FGENESH_GC, a new version of the FGENESH program that includes the non-canonical GC dinucleotide in donor splice sites. Indeed, by doing so, we could predict some exons encoding missing conserved motifs. For example, the HEAT motif was originally missing in several annotated candidate tRNase ZLs from flowering plants including papaya (Carica papaya CpaTRZ3), cassava (Manihot esculenta MesTRZ4), Medicago (Medicago truncatula MtrTRZ3) and black cottonwood (Populus trichocarpa PtrTRZ3). After re-evaluation of intron splice sites using FGENESH_GC, we were able to recover their HEAT motifs. Because the intron sequences of some candidate sequences appear to be extremely difficult to predict correctly, we could not conclusively rule out the possibility of errors in certain candidates.
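To make the distinction between GT-AG and GC-AG introns concrete, the sketch below classifies a predicted intron by its terminal dinucleotides. It is not part of FGENESH or FGENESH_GC and does not reproduce their algorithms; the function name, the coordinate convention (0-based, end-exclusive) and the toy sequence are assumptions for illustration.

```python
def classify_intron(genomic, start, end):
    """Classify the intron genomic[start:end] by its donor and acceptor dinucleotides.

    Coordinates are 0-based and end-exclusive (an assumed convention).
    """
    intron = genomic[start:end].upper()
    if len(intron) < 4:
        raise ValueError("intron too short to contain donor and acceptor sites")
    donor, acceptor = intron[:2], intron[-2:]
    if acceptor == "AG":
        if donor == "GT":
            return "GT-AG (canonical)"
        if donor == "GC":
            return "GC-AG (non-canonical)"
    return f"{donor}-{acceptor} (other)"


if __name__ == "__main__":
    # Toy gene fragment: exon, intron with a GC donor site, exon.
    exon1, intron, exon2 = "ATGGC", "GCAAGTTTTAG", "GATAA"
    genomic = exon1 + intron + exon2
    print(classify_intron(genomic, len(exon1), len(exon1) + len(intron)))
```

Running such a check over annotated intron coordinates is one way to flag gene models whose missing motifs might be recovered by allowing GC donor sites.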
In total, 54 candidate tRNase ZS and 32 candidate tRNase ZL were identified from 27 green plant species including 21 flowering plants, 1 moss, 1 lycophyte and 4 green algae (Additional file 1). The names of these candidates follow the A. thaliana tRNase Z nomenclature . Of these, only tRNase Zs from A. thaliana have been experimentally characterized . While most of these sequenced genomes examined belong to the flowering plants, the availability of the genome sequences from two basal land plants (the bryophyte Physcomitrella patens and the lycophyte Selaginella moellendorffii) and four green algae (Chlamydomonas reinhardtii, Volvox carteri, Micromonas pusilla and Ostreococcus lucimarinus) allows for evaluation of differences between flowering plant tRNase Zs and those from basal land plants and green algae.
The flowering plants examined to date appear to contain multiple tRNase Zs. Foxtail millet (Setaria italica) contains the largest number of tRNase Zs (5) so far identified in a flowering plant. Most flowering plant species have a single tRNase ZL. In contrast, six flowering plants including two Arabidopsis species (A. thaliana and Arabidopsis lyrata) harbor two tRNase ZLs. It should be noted that the presence of two tRNase ZLs is not species-specific, since the plant species containing two tRNase ZLs come from diverse taxonomic groups.
Unlike most flowering plants, which possess two tRNase ZSs, two members of the Panicoideae subfamily of grasses, sorghum (Sorghum bicolor) and foxtail millet, have three and four tRNase ZSs, respectively. The presence of more than two tRNase ZSs appears not to be a general feature of grasses, since three other grass plants including rice (Oryza sativa), Brachypodium (Brachypodium distachyon) and maize (Zea mays) contain only two tRNase ZSs. Calculation of the percentage identity and similarity between candidate tRNase Zs from these two grass species shows strong conservation of the proteins at the amino acid level, with the highest identity (93%) and similarity (95%) between S. bicolor SbiTRZ2 and S. italica SitTRZ2 (Additional files 2 and 3). To our surprise, all candidate tRNase ZSs identified in flowering plants are highly similar and belong to the TM-type tRNase ZS (see below for a detailed discussion).
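For readers unfamiliar with how a percentage identity of this kind is obtained, the following minimal sketch computes it from two rows of a pairwise alignment. The denominator used here (aligned columns in which neither sequence has a gap) is an assumption; the text does not state which convention or program was used, and other conventions give somewhat different values. Percentage similarity would additionally require a substitution matrix or residue grouping and is not shown.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of identical residues over columns where neither row has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    pairs = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a != "-" and b != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


if __name__ == "__main__":
    # Toy alignment rows for illustration only.
    print(round(percent_identity("MKT-LLG", "MRTALLG"), 1))
```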
The number and type of tRNase Zs appear to be highly variable in the two primitive plant species and four green algae (Table 1 and Additional file 1). The largest number of tRNase Zs is found in the moss P. patens, which contains two TM-type tRNase ZSs, one bacterial-type tRNase ZS and one tRNase ZL. In contrast, the two green algae C. reinhardtii and V. carteri have the fewest tRNase Zs, each comprising one TM-type tRNase ZS and one tRNase ZL. The two other green algae, M. pusilla and O. lucimarinus, each contain one TM-type tRNase ZS, one bacterial-type tRNase ZS and one tRNase ZL. The lycophyte S. moellendorffii, which has the smallest genome size of any land plant reported, contains two tRNase ZLs in addition to one TM-type tRNase ZS.
Interestingly, tRNase Z-like proteins (TLP) which apparently lack one or more conserved motifs of tRNase Zs necessary for the enzymatic activity of the protein are widespread in basal land plant and green algal genomes that have been analyzed here (Table 1 and Additional file 1). Many tRNase Z-like proteins appear to lack the flexible arm but contain all other conserved motifs of tRNase Zs (Figure 1 and see below for a more detailed discussion of the motifs). At one extreme, one S. moellendorffii tRNase Z-like protein (SmoTLP2) appears to lack all of the conserved motifs.
Based on their sizes, tRNase Z-like proteins can be divided into tRNase ZS-like and tRNase ZL-like proteins, which are comparable in size to tRNase ZSs and tRNase ZLs, respectively. The number and form of tRNase Z-like proteins vary among the species, being largest in C. reinhardtii (two tRNase ZS-like and two tRNase ZL-like proteins) and next largest in S. moellendorffii (two tRNase ZS-like proteins and one tRNase ZL-like protein). The basal land plant P. patens and the two green algae M. pusilla and O. lucimarinus each contain one tRNase ZL-like protein, whereas the green alga V. carteri contains one tRNase ZS-like and one tRNase ZL-like protein. In contrast, black cottonwood (P. trichocarpa) appears to be the only species among the flowering plant genomes examined that contains a tRNase ZL-like protein. This species contains one tRNase ZL-like protein in which the second His in the His motif is mutated to Gln (data not shown).
Prediction of subcellular localization of candidate tRNase Zs from flowering plants
To help understand the function of candidate tRNase Zs, we predicted in silico the subcellular localization of each of the flowering plant tRNase Zs using different bioinformatic prediction programs. Most flowering plants have two tRNase ZSs (tRNase ZS1 and tRNase ZS2). All tRNase ZS1 proteins apparently lack any predictable signal sequences and are therefore predicted to be cytoplasmic proteins, with the exception of O. sativa tRNase ZS1 (OsaTRZ1), which is predicted to be localized in the chloroplasts (Table 2). On the other hand, all tRNase ZS2 proteins except P. trichocarpa tRNase ZS2 (PtrTRZ2) contain a predicted chloroplast targeting signal (Table 2). The predicted chloroplast targeting signal of PtrTRZ2 is unusual in that it is only seven residues long, whereas the majority of the known chloroplast targeting signals consist of 20-80 N-terminal amino acid residues. In the two flowering plants (S. bicolor and S. italica) carrying more than two tRNase ZSs, only one tRNase ZS (tRNase ZS2) from each species is predicted to be chloroplast-localized, whereas the rest are predicted to be cytoplasmic (Table 2).
Besides tRNase ZS, flowering plants also contain one (tRNase ZL1) or two (tRNase ZL1 and tRNase ZL2) tRNase ZLs. Most tRNase ZL1 proteins contain a predicted mitochondrial targeting signal between two putative translational initiation sites at the N-terminus and also have a nuclear localization signal (Table 3). Some tRNase ZL2 proteins have a putative mitochondrial targeting signal, while others are predicted to have both nuclear and mitochondrial targeting signals. The lengths of the putative mitochondrial targeting signals found in these candidates are within the expected size range (20-80 aa). Consistent with the prediction, A. thaliana tRNase ZL1 (AthTRZ3) was found in both the nucleus and the mitochondria. However, A. thaliana tRNase ZL2 (AthTRZ4), which is predicted to have both nuclear and mitochondrial targeting signals, is actually found only in the mitochondria.
To gain insights into the evolutionary relationship among plant tRNase Zs, we reconstructed the phylogeny of 86 candidate sequences using Bayesian phylogenetics. Although most of the plant species are flowering plants, they are taxonomically diverse. In addition to tRNase Zs from plants, we also included tRNase ZS from T. maritima as an outgroup. Phylogenetic analysis reveals the presence of two well-supported clades: one comprising all TM-type tRNase ZSs, and the other comprising all tRNase ZLs and bacterial-type tRNase ZSs (Figure 2). The latter clade is further divided into two subclades, of which one contains all tRNase ZLs, and the other the bacterial-type tRNase ZSs. Notably, two tRNase ZLs (AlyTRZ4 and AthTRZ4) from the two Arabidopsis species (A. thaliana and A. lyrata) together form a group sister to a group formed by another two tRNase ZLs (AlyTRZ3 and AthTRZ3) from the same two species. Likewise, the two tRNase ZLs found in some species including Aquilegia coerulea (AcoTRZ3 and AcoTRZ4), Glycine max (GmaTRZ3 and GmaTRZ4) and S. moellendorffii (SmoTRZ2 and SmoTRZ3) are sister to each other with a posterior probability value of 1. These observations suggest that tRNase ZL gene duplication took place in certain species.
Conservation of candidate green plant tRNase ZSs
To assess the presence and conservation of motifs in candidate green plant tRNase Zs, we performed multiple sequence alignments of the identified tRNase Z sequences. Candidate tRNase ZSs are analyzed first. A list of aligned tRNase ZSs from representative green plants is shown in Figures 1 and 3. For comparison, tRNase ZSs from T. maritima, B. subtilis, E. coli, the cyanobacterium Synechocystis sp. PCC 6803 and humans are included as needed. A full list of all aligned green plant tRNase ZSs is presented in Additional file 4.
Sequence comparison reveals that except for the N-terminal regions, tRNase ZSs from the flowering plants appear to be more similar to each other than to either the basal land plants or green algae (Figure 3). Furthermore, the sequences of the green plant tRNase ZSs are highly divergent from those of T. maritima, B. subtilis, E. coli and human tRNase ZSs. For example, A. thaliana tRNase ZS1 (AthTRZ1) and tRNase ZS2 (AthTRZ2) exhibit only 17% - 24% and 16% - 21% identity (25% - 35% and 27% - 31% similarity), respectively, with those from T. maritima, B. subtilis, E. coli and humans. Overall, the sequence conservation between the green plant tRNase ZSs and non-plant species is largely confined to the highly conserved motifs of tRNase Zs.
Unexpectedly, careful examination of the sequences of green plant tRNase ZSs reveals that most of the proteins possess several unique features that distinguish them from bacterial-type tRNase ZSs and thus justify their classification as TM-type tRNase ZSs. The sequence logos for motifs unique to TM-type tRNase ZSs are presented in Figure 4. First, most green plant tRNase ZSs harbor the TM-type flexible arm. The plant TM-type flexible arms show only weak protein sequence homology to the bacterial-type flexible arms, and have distinctive features including the absence of the GP motif and the presence of a consensus sequence KLKxxYxxLxGxxIxxLK, here termed the KL motif (Figures 3 and 4). This Lys- and Leu-rich motif was previously unappreciated in A. thaliana, likely due to the limited number of plant sequences available at the time the analysis was performed. As might be expected, there are variations in the consensus sequence (Figures 3 and 4).
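Because the KL motif tolerates variation, a strict pattern match can miss genuine instances. The sketch below instead slides the consensus KLKxxYxxLxGxxIxxLK along a sequence and counts how many of its nine defined positions are matched. It is an illustrative approach only, not the method used in this study (which relied on multiple sequence alignment and sequence logos); the function names and toy sequences are hypothetical.

```python
# Consensus of the KL motif as given in the text; 'x' marks an unconstrained position.
KL_CONSENSUS = "KLKxxYxxLxGxxIxxLK"


def score_window(window):
    """Count the defined consensus positions that the window matches exactly."""
    return sum(
        1
        for consensus, residue in zip(KL_CONSENSUS, window)
        if consensus != "x" and consensus == residue
    )


def best_kl_match(protein):
    """Return (1-based start, score) of the best-scoring window, or None if too short."""
    seq = protein.upper()
    width = len(KL_CONSENSUS)
    if len(seq) < width:
        return None
    best_start = max(
        range(len(seq) - width + 1),
        key=lambda i: score_window(seq[i:i + width]),
    )
    return best_start + 1, score_window(seq[best_start:best_start + width])


if __name__ == "__main__":
    # Toy sequence containing a perfect instance of the consensus.
    print(best_kl_match("AAAAKLKAAYAALAGAAIAALKAA"))
```

A score threshold for calling a motif instance would have to be chosen empirically; none is proposed here.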
Second, most green plant tRNase ZSs lack the PxKxRN motif normally present on the amino side of Motif I. Instead, they share a Gly-rich consensus sequence EGxSxxG in an analogous position to the PxKxRN motif (Figure 3). In some cases, variants in the consensus sequence are found (Figures 3 and 4). Notably, there is very weak sequence similarity between this motif and the corresponding region in T. maritima tRNase ZS.
Third, most green plant tRNase ZSs contain conserved xExT and HxH motifs in place of the HEAT and HST motifs, respectively. The xExT motif is related to the HEAT motif except that the invariant His is replaced by a Thr/Met/Val/Leu residue and the invariant Ala is often replaced by Ser, or occasionally Met, Thr and Cys (Figures 3 and 4). In contrast, the HxH motif is related to the HST motif, except that the conserved Ser is mostly replaced by Thr, and the conserved Thr is substituted by His (Figures 3 and 4). Based on the structural and mutagenesis studies, it has been suggested that the Glu of the HEAT motif and His of the HST motif play a role in facilitating proton transfer at the final stage of reaction [25, 29, 40]. In particular, the His residue is implicated as the proton donor. It is highly likely that the conserved Glu-His pair from the xExT and HxH motifs in the TM-type tRNase ZSs may also participate in the terminal proton transfer reaction.
Multiple sequence alignment of bacterial-type tRNase ZSs from the basal plant P. patens (PpaTRZ3) and green algae reveals that in addition to the well conserved Motifs I-V, these candidates possess a bacterial-type flexible arm containing the GP, HEAT and HST motifs, and the variant PxKxRN motif, in which Lys is replaced with Leu (Figure 1).
Conservation of tRNase ZLs in green plants
Since tRNase ZL can be divided into N-terminal and C-terminal halves, which are related by weak sequence similarity, we aligned these two halves separately. The alignments of the N-terminal and C-terminal halves of representative candidate tRNase ZLs from diverse green plant species are shown in Figures 5 and 6, respectively. Two non-plant eukaryotic tRNase ZLs from D. melanogaster and humans were included for comparison. A complete list of all aligned green plant tRNase ZLs identified here is given in Additional file 5.
Except for the N-terminal regions, candidate tRNase ZLs from land plants share a high degree of overall sequence similarity to each other. In contrast, candidate tRNase ZLs from the four green algae show relatively low overall sequence similarity to each other and to the land plant tRNase ZLs. In particular, their N-terminal halves are highly divergent compared to those from the land plants. Furthermore, sequence similarity between land plant tRNase ZLs and their homologs in green algae, D. melanogaster and humans is limited to the conserved motifs identified in tRNase Zs.
The N-terminal halves of all green plant tRNase ZLs contain a conserved eukaryotic-type flexible arm domain containing the GP motif. In addition, the N-terminal halves of the proteins also contain a pseudo-PxKxRN motif and pseudo-Motifs I and II, which differ from their original motifs in many positions including residues critical for tRNase Z function (Figure 5). These pseudo-motifs were previously identified in candidate fungal tRNase ZLs. In contrast, the C-terminal halves of green plant tRNase ZLs harbor conserved Motifs I-V and the PxKxRN, HEAT and HST motifs (Figure 6). These motifs appear in the same relative order in all the sequences in which they are present.
Many chloroplast tRNA genes encode a portion of the 3'-CCA sequence
It has been suggested that many chloroplast tRNA genes encode partial CCA sequences . However, this conclusion was primarily based on the examination of the 3'-flanking sequences of tRNA genes from three chloroplast genomes. To determine if this conclusion holds true when more chloroplast genome sequences are now available for analysis, we examined the presence of tRNA genes encoding whole or partial CCA sequences in 15 chloroplast genomes. The results obtained were tabulated in Table 4.
Indeed, we found that many tRNA genes in the chloroplast genomes encode partial CCA sequences, which could serve as part of the CCA sequence (Table 4). For instance, in the A. thaliana chloroplast genome, 31% of tRNA genes encode the whole or partial CCA sequences. Moreover, many chloroplast tRNA genes encode the first base of the CCA sequence. For example, of the 37 A. thaliana chloroplast tRNA genes examined, 11 had C after the discriminator.
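The kind of check described above can be sketched as follows: given the genomic sequence immediately downstream of the discriminator nucleotide, count how many leading nucleotides match C, C, A in order. Treating a partial CCA as a prefix of CCA (C or CC) is an assumption based on the example of a C following the discriminator; the function name and toy flanking sequences are hypothetical and are not taken from Table 4.

```python
def encoded_cca_length(flank_3prime):
    """Return how many leading nucleotides of the 3' flank match C, C, A in order (0 to 3).

    `flank_3prime` is the genomic sequence that starts right after the
    discriminator nucleotide. 3 means a whole encoded CCA, 1 or 2 a partial one.
    """
    matched = 0
    for expected, actual in zip("CCA", flank_3prime.upper()):
        if actual != expected:
            break
        matched += 1
    return matched


if __name__ == "__main__":
    # Toy 3'-flanking sequences for illustration only.
    for flank in ("CCATTT", "CCGT", "CTTA", "ACCA"):
        print(flank, encoded_cca_length(flank))
```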
The presence of multiple tRNase Zs in green plants
Unlike C. elegans, D. melanogaster, humans and most fungal species examined to date, green plants are unique in that they possess multiple tRNase Zs. Of the 27 complete green plant genomes analyzed, the majority of them encode two tRNase ZSs and one or two tRNase ZLs. It is reasonable to expect that the existence of multiple tRNase Zs would be common to green plants. This phenomenon appears to have arisen from genome doubling (polyploidy), which seems to be a driving force in plant evolution and variation . However, there seems to be no correlation between the number of tRNase Zs and genome size (Table 5).
An unexpected observation in this study is that most green plant tRNase ZSs, which are clearly distinct from the bacterial-type tRNase ZS, represent the TM-type tRNase ZS. A comparison of motifs found in three different types of tRNase Zs from green plants is provided in Figure 7. Our phylogenetic analysis also supports the existence of both the TM- and bacterial-types of tRNase ZSs in green plants (Figure 2). The TM-type tRNase ZS was previously found only in the hyperthermophilic bacterium T. maritima and the flowering plant A. thaliana, and thus, it was originally thought to be a minor type. Our data greatly expand the repertoire of this type of tRNase Z. Although it remains to be determined if the TM-type tRNase ZS is also widespread in other taxonomic groups, the prevalence of the TM-type tRNase ZS in green plants suggests that this type might be plant-specific.
In contrast to the TM-type tRNase ZS, which is widespread in the green plants, the bacterial-type tRNase ZS is only present in the basal land plants and green algae. Sequence analysis reveals that the two types of tRNase ZSs are related to but distinct from each other, suggesting that they likely arose from the same ancestral tRNase ZS gene, followed by sequence divergence. Since the flowering plants possess only the TM-type tRNase ZS, the bacterial-type tRNase ZS has apparently been lost during the course of plant evolution whereas the TM-type tRNase ZS has been maintained. Interestingly, the bacterial-type tRNase ZS is also found in humans. Although the precise function of human tRNase ZS is unknown, it may play a role in degradation of a subset of microRNAs (miRNA) in the cytoplasm .
It is likely that the multiple tRNase Zs found in the flowering plants are localized in different subcellular compartments. In most of the flowering plants examined, the two TM-type tRNase ZSs (tRNase ZS1 and tRNase ZS2) are predicted to be either in the cytosol or in the chloroplast, whereas one tRNase ZL (tRNase ZL1) is predicted to contain both nuclear and mitochondrial targeting signals. The second tRNase ZL (tRNase ZL2) found in some plants either contains or lacks a predicted mitochondrial targeting signal. These predictions are generally supported by recent subcellular localization studies of A. thaliana tRNase Zs. A. thaliana tRNase ZS1 (AthTRZ1) and tRNase ZS2 (AthTRZ2) are localized in the cytoplasm and chloroplast, respectively, whereas A. thaliana tRNase ZL1 (AthTRZ3) is targeted to both the nucleus and mitochondria, and tRNase ZL2 (AthTRZ4) is targeted to the mitochondria alone.
Based on their predicted subcellular localization, it is most likely that tRNase ZS2 is involved in the 3'-end processing of pre-tRNAs in the chloroplasts and that tRNase ZL1 performs this role in both the nucleus and mitochondria, whereas tRNase ZL2 either serves as a back-up for mitochondrial pre-tRNA 3'-end processing or performs a specialized, yet-to-be-discovered function. The function of tRNase ZS1 is unknown. It may play a role in repair of incorrectly processed 3'-ends of tRNAs in the cytosol. Alternatively, tRNase ZS1 could also participate in nuclear pre-tRNA 3'-end processing, as suggested by the observation that deletion of both A. thaliana tRNase ZS1 (AthTRZ1) and tRNase ZL1 (AthTRZ3) results in a lethal phenotype.
The plant TM-type tRNase ZSs may not recognize the CCA sequence as an anti-determinant
The CCA triplet following the discriminator generally inhibits tRNase Z activity, with the first C at nucleotide position 74 having the greatest effect [25, 33, 58]. This anti-determinant property of the CCA sequence is thought to prevent removal of the 3'-terminal CCA sequence from mature tRNAs, and thus to avoid futile cycles of CCA addition and removal [33, 58–60]. However, unlike other types of tRNase Zs, the TM-type tRNase Z does not appear to recognize a CCA sequence that lies downstream of the discriminator, within the trailer sequence, as an anti-determinant. The recombinant T. maritima tRNase ZS cleaves after the CCA sequence, which is encoded by all but one tRNA gene, leaving the CCA sequence intact. Similarly, the recombinant A. thaliana tRNase ZS can process pre-tRNAs with the 3'-terminal CCA sequences embedded in the trailer sequence, albeit at a different position. Like the recombinant proteins, a partially purified spinach chloroplast tRNase Z fraction can also cleave pre-tRNAs containing complete or partial CCA sequences after the first C (C74), regardless of the sequence of the flanking region. Importantly, the remainder of the CCA sequence can be added by chloroplast tRNA nucleotidyltransferase.
However, A. thaliana tRNase ZS appears to be unusual in that it can also cleave off the 3'-terminal CCA sequence from mature tRNAs in vitro. Since mature tRNAs must be protected from counterproductive cleavage by tRNase ZS in vivo, it is likely that the plant chloroplast protein acquires the ability to recognize the 3'-terminal CCA sequence as the mature tRNA 3'-end via cofactors.
Sequence determinants potentially involved in cleavage specificity
The mechanisms responsible for cleavage site selection and CCA inhibition are not fully understood, but appear to involve a combination of sequence features. First, the flexible arm has been suggested to contribute to cleavage site selection and the inhibitory effect of the CCA sequence. This tRNase Z-specific element is located on the opposite side of the active site and binds primarily the D and T loops of the pre-tRNA [28, 39, 63]. Notably, the flexible arms of the T. maritima and plant tRNase ZSs lack the GP motif but contain the KL motif. Additionally, they are significantly shorter than those of other tRNase Zs possessing the anti-determinant function. Structural studies reveal that although the TM- and bacterial-type flexible arms share a similar overall structure, which is composed of a compact globular domain and an extended two-stranded stalk and protrudes from the protein core, they have different globular domains [25–28, 64]. In the TM-type flexible arm, the globular domain consists of one very short α-helix, one long helix and one 3₁₀-helix, whereas in the bacterial-type flexible arm, it is composed of two α-helices, two β-strands and one 3₁₀-helix. It would be interesting to know how the differences in the sequence features, length and topology of the flexible arm contribute to cleavage specificity by the enzymes.
Another motif possibly involved in cleavage site selection is Motif I, which appears to participate in binding the acceptor stem of pre-tRNA substrates [33, 62]. In vitro studies using the recombinant T. maritima tRNase ZS have suggested that the Ser31 and Thr33 residues of Motif I are involved in cleavage site selection, with the former residue being more critical. However, only individual, but not simultaneous, mutations of these two non-Gln residues to Gln (which is found at the corresponding positions in other tRNase Zs with the CCA anti-determinant) in T. maritima tRNase ZS affect cleavage site selection.
In addition to the flexible arm and Motif I, the PxKxRN motif, in particular the two basic residues in the motif, has also been suggested to be involved in the cleavage specificity and, by inference, CCA inhibition of tRNase Zs, since this motif is absent from the T. maritima and A. thaliana tRNase ZSs. Strikingly, tRNase ZSs found in all flowering plants examined so far also lack this motif (Figure 3). However, the archaeon Methanococcus jannaschii tRNase ZS harbors a PxKxRN motif but cannot recognize the CCA sequence as an anti-determinant in vitro, suggesting that other sequence elements may also be involved in the cleavage specificity of tRNase Z. Alternatively, it has been suggested that the high enzyme concentration used in the assay may contribute to the lack of a CCA anti-determinant effect with the M. jannaschii tRNase ZS.
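The presence or absence of a short motif such as PxKxRN can be checked with a simple pattern scan. The following is a minimal sketch, assuming that "x" stands for any amino acid; the function name and the two toy sequences are hypothetical and are not taken from any real tRNase Z.

```python
import re

def find_pxkxrn(protein_seq):
    """Return 0-based start positions of every PxKxRN match in a protein sequence."""
    seq = protein_seq.upper()
    # "." stands for the variable positions (x); the lookahead also reports
    # overlapping matches, should any occur.
    return [m.start() for m in re.finditer(r"(?=P.K.RN)", seq)]

# Hypothetical toy sequences, used only to illustrate the scan.
toy_with_motif = "MSTAPGKTRNLLVDG"     # contains P-G-K-T-R-N
toy_without_motif = "MSTAPGATRNLLVDG"  # Lys replaced by Ala, so no match

print(find_pxkxrn(toy_with_motif))     # [4]
print(find_pxkxrn(toy_without_motif))  # []
```

A scan of this kind only reports whether the motif is present in the sequence; as noted above for M. jannaschii tRNase ZS, presence of the motif does not by itself establish CCA anti-determination.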
Why is the TM-type tRNase ZS developed for chloroplast pre-tRNA 3'-end processing?
The discovery that the TM-type tRNase ZS is widespread in green plants raises the question of why plants adopted the TM-type tRNase ZS over other types of tRNase Z for the 3'-end processing of chloroplast pre-tRNAs. One possible explanation is that the TM-type tRNase ZS identified in plants has evolved to adapt to chloroplast pre-tRNA 3'-end processing. It has previously been suggested, based on a limited number of available chloroplast tRNA genes, that many plant chloroplast tRNA genes encode C74. To see if this conclusion could be extended to more flowering plants, we have examined the 3'-flanking region of tRNA genes in additional chloroplast genomes. Indeed, many chloroplast tRNA genes seem to encode partial CCA sequences (Table 4).
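The kind of check described here can be illustrated with a short sketch that classifies a 3'-flanking sequence by how much of CCA it encodes. It assumes that the flank begins immediately after the discriminator (that is, at position 74); the function names and the example flanks are hypothetical, and this is not the procedure used to compile Table 4.

```python
def encoded_cca_prefix(flank_3prime):
    """Return the longest prefix of 'CCA' ('', 'C', 'CC' or 'CCA') that starts the flank."""
    flank = flank_3prime.upper().replace("U", "T")  # accept RNA or DNA letters
    target = "CCA"
    n = 0
    while n < len(target) and n < len(flank) and flank[n] == target[n]:
        n += 1
    return target[:n]

def classify_flank(flank_3prime):
    """Label a flank as encoding a complete, partial or no CCA sequence."""
    prefix = encoded_cca_prefix(flank_3prime)
    if prefix == "CCA":
        return "complete"
    if prefix:
        return "partial"
    return "none"

# Hypothetical flanking sequences, starting right after the discriminator.
for flank in ["CCATTG", "CCTTAG", "CATTGA", "TTAGCA"]:
    print(flank, encoded_cca_prefix(flank) or "-", classify_flank(flank))
```

In this sketch a flank beginning with C or CC counts as a partial CCA sequence, which corresponds to genes that encode C74 alone or C74 and C75.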
The development of the TM-type tRNase ZS may be particularly important for chloroplasts, which have a limited but sufficient number of self-encoded tRNA species and do not import tRNAs [65, 66]. As all possible codons are used in the chloroplast protein-encoding genes, all chloroplast tRNA species appear to be used in protein synthesis. Thus, efficient 3'-end processing of each pre-tRNA by tRNase Z may be critical in ensuring maximum efficiency in chloroplast protein synthesis.
The presence of candidate tRNase Z-like proteins in green plants
tRNase ZS-like proteins have previously been found in cyanobacteria, including Synechocystis sp. PCC 6803, and in fungi, whereas tRNase ZL-like proteins have not been reported. In Synechocystis, one tRNase ZS-like protein (sll1036) has been identified. This protein does not exhibit any tRNase Z activity in vitro, consistent with the lack of some of the most conserved motifs of tRNase Zs in the protein sequence. Since the candidate tRNase Z-like proteins found in basal land plants and green algae lack either all or some of the conserved motifs of tRNase Zs, they most likely possess no tRNase Z activity.
It has been suggested that tRNase ZL arose from the fusion of duplicated tRNase ZS genes with further sequence diversification. It is possible that the tRNase ZS-like proteins found in the basal land plants and green algae represent relics of the original tRNase ZS that were mutated during diversification of eukaryotic tRNase Z genes. Alternatively, these proteins may play species-specific roles, as suggested for cyanobacterial tRNase ZS-like proteins. The predominant presence of tRNase Z-like proteins in the basal land plants and green algae also suggests that duplication of ancestral tRNase ZS genes may have occurred early in green plant evolution.
This study represents the first large-scale identification and analysis of green plant tRNase Zs. Our survey of current plant genome databases shows that green plants possess multiple tRNase Zs, which include one or two tRNase ZLs and two tRNase ZSs. One tRNase ZL is predicted to participate in 3'-end processing of nuclear and mitochondrial pre-tRNAs, whereas the other is likely to provide a backup for mitochondrial pre-tRNA processing. It appears that most tRNase ZSs, which are widespread throughout the green plants, belong to a minor but highly distinct type of tRNase ZS (TM-type). In contrast, the typical bacterial-type tRNase ZS is restricted to basal land plants and green algae. The apparent lack of the bacterial-type tRNase ZS in flowering plants suggests that while both types were present in the basal land plants, the bacterial-type tRNase ZS was discarded in favor of the TM-type during plant evolution. Based on our results and previous studies, we propose that, like T. maritima tRNase ZS, the TM-type tRNase ZSs found in green plants do not recognize the CCA sequence as an anti-determinant and that this type of tRNase ZS arose to accommodate the 3'-end processing of chloroplast pre-tRNAs with partial or whole CCA sequences. This unusual property of green plant tRNase ZSs is likely due to multiple sequence determinants, including the TM-type-specific flexible arm comprising the KL motif, Motif I and the lack of the PxKxRN motif.
However, it should be noted that bioinformatics analysis alone cannot resolve possible differences in cleavage specificity among TM-type tRNase ZSs. The complete understanding of the mechanisms of the cleavage specificities of the TM-type tRNase ZSs awaits the determination of the structures of these enzymes bound to pre-tRNA and the accumulation of more precise biochemical data.
Plant genome database search and protein sequence analysis
Candidate tRNase Zs were identified by BLAST and PSI-BLAST searches against the genome databases using known tRNase Zs as query sequences. The databases used include Phytozome (http://www.phytozome.net/), the NCBI nonredundant protein sequence database (http://blast.ncbi.nlm.nih.gov/Blast.cgi), the Joint Genome Institute (JGI; http://www.jgi.doe.gov/) and the Universal Protein Resource (UniProt; http://www.uniprot.org/). An E-value cutoff of 0.001 was used in all searches. With this value, no β-CASP protein or other MBL protein was found. The resulting sequences were subjected to validation as described. The splicing pattern was verified using the FGENESH and FGENESH_GC programs provided at the Softberry website (http://linux1.softberry.com/berry.phtml?topic=fgenesh). Subcellular localization of proteins was predicted using web-based programs such as MITOPROT (http://ihg2.helmholtz-muenchen.de/ihg/mitoprot.html), PSORT (http://psort.hgc.jp/form.html) and ChloroP (http://www.cbs.dtu.dk/services/ChloroP/). Multiple sequence alignments were done with Clustal W.
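Applying an E-value cutoff to search results can be sketched as follows. The example assumes that hits are available in the standard 12-column BLAST tabular layout (query, subject, percent identity, alignment length, mismatches, gap opens, query start/end, subject start/end, E-value, bit score); this is an assumption made for illustration, since the searches described above were run through web interfaces, and the hit lines and identifiers shown are hypothetical.

```python
E_VALUE_CUTOFF = 0.001  # cutoff stated in the text

def filter_hits(tabular_lines, cutoff=E_VALUE_CUTOFF):
    """Keep (subject_id, evalue) pairs whose E-value does not exceed the cutoff."""
    kept = []
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        subject_id = fields[1]
        evalue = float(fields[10])  # 11th column holds the E-value
        if evalue <= cutoff:
            kept.append((subject_id, evalue))
    return kept

# Hypothetical hit lines, not real search output.
example_hits = [
    "query1\tcandA\t45.2\t280\t140\t5\t1\t275\t10\t290\t2e-45\t160",
    "query1\tcandB\t28.0\t90\t60\t3\t20\t105\t33\t120\t0.5\t30.1",
]

print(filter_hits(example_hits))  # [('candA', 2e-45)]
```

Hits passing the cutoff would still need the manual validation mentioned above, since an E-value threshold alone does not distinguish tRNase Zs from tRNase Z-like proteins.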
Phylogenetic analysis was performed using the Bayesian approach, with T. maritima tRNase ZS (TmaTrz1) as an outgroup, as described. Briefly, full-length amino acid sequences of candidate plant tRNase Zs and TmaTrz1 were aligned by using Clustal W implemented in MEGA 5.0. After excluding gaps and the ambiguous sites, we used ProtTest 2.4 to choose the most appropriate evolutionary model for our data set. The phylogenies were estimated by Bayesian inference with MrBayes 3.1.2 using a mixture of the fixed amino acid models and I + G distribution. Statistical confidence was assessed by using Markov Chain Monte Carlo (MCMC) sampling approaches. Two separate runs including a total of four independent tree searches were conducted. All searches consisted of one 'cold' and three 'heated' Markov chains estimated for 10⁷ generations, and every 1000 generations were sampled. The burn-in parameter was estimated by plotting -lnL against the generation number using Tracer 1.4.1 (http://beast.bio.ed.ac.uk/Tracer), and the retained trees were used to estimate the consensus tree and the Bayesian posterior probabilities.
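As a rough check on the sampling scheme described above, the number of trees saved per run follows from simple division. The sketch below assumes a chain length of 10^7 generations and a sampling interval of 1000 generations; the burn-in of 2500 samples is a purely hypothetical value, since the burn-in is estimated from the -lnL trace rather than stated as a number, and the starting state is not counted as a sample.

```python
def sample_counts(generations, sample_every, burnin_samples):
    """Return (samples saved per run, samples retained after burn-in)."""
    saved = generations // sample_every  # starting state not counted
    if burnin_samples > saved:
        raise ValueError("burn-in exceeds the number of saved samples")
    return saved, saved - burnin_samples

GENERATIONS = 10**7    # chain length, as read from the text
SAMPLE_EVERY = 1000    # sampling interval from the text
BURNIN_SAMPLES = 2500  # hypothetical; not given in the text
N_RUNS = 2             # two separate runs, as described

saved, retained = sample_counts(GENERATIONS, SAMPLE_EVERY, BURNIN_SAMPLES)
print(saved, retained, retained * N_RUNS)  # 10000 7500 15000
```

Under these assumptions each run saves 10,000 samples, and the consensus tree would be built from whatever remains once the burn-in chosen from the trace plot is discarded.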
- tRNase Z:
tRNA 3' endonuclease
- tRNase ZS:
the short form of tRNase Z
- tRNase ZL:
the long form of tRNase Z
- β-CASP:
MBL-associated CPSF Artemis SNM1/PSO2
- CPSF:
the cleavage and polyadenylation specificity factor
- MBL:
the metallo-β-lactamase superfamily
- TGF-β:
transforming growth factor-β
Phizicky EM, Hopper AK: tRNA biology charges to the front. Genes Dev. 2010, 24: 1832-1860. 10.1101/gad.1956510.
Bayfield MA, Yang R, Maraia RJ: Conserved and divergent features of the structure and function of La and La-related proteins (LARPs). Biochim Biophys Acta. 2010, 1799: 365-378.
Maraia RJ, Lamichhane TN: 3' processing of eukaryotic precursor tRNAs. WIREs RNA. 2011, 2: 362-375. 10.1002/wrna.64.
Ceballos M, Vioque A: tRNase Z. Protein Pept Lett. 2007, 14: 137-145. 10.2174/092986607779816050.
Hartmann RK, Gossringer M, Spath B, Fischer S, Marchfelder A: Chapter 8 The Making of tRNAs and More - RNase P and tRNase Z. Prog Nucleic Acid Res Mol Biol. 2009, 85C: 319-368.
Redko Y, Li de la Sierra-Gallay I, Condon C: When all's zed and done: the structure and function of RNase Z in prokaryotes. Nat Rev Microbiol. 2007, 5: 278-286. 10.1038/nrmicro1622.
Spath B, Canino G, Marchfelder A: tRNase Z: the end is not in sight. Cell Mol Life Sci. 2007, 64: 2404-2412. 10.1007/s00018-007-7160-5.
Wolin SL, Cedervall T: The La protein. Annu Rev Biochem. 2002, 71: 375-403. 10.1146/annurev.biochem.71.090501.150003.
Van Horn DJ, Yoo CJ, Xue D, Shi H, Wolin SL: The La protein in Schizosaccharomyces pombe: a conserved yet dispensable phosphoprotein that functions in tRNA maturation. RNA. 1997, 3: 1434-1443.
Yoo CJ, Wolin SL: The yeast La protein is required for the 3' endonucleolytic cleavage that matures tRNA precursors. Cell. 1997, 89: 393-402. 10.1016/S0092-8674(00)80220-2.
Copela LA, Fernandez CF, Sherrer RL, Wolin SL: Competition between the Rex1 exonuclease and the La protein affects both Trf4p-mediated RNA quality control and pre-tRNA maturation. RNA. 2008, 14: 1214-1227. 10.1261/rna.1050408.
Schafer B: RNA maturation in mitochondria of S. cerevisiae and S. pombe. Gene. 2005, 354: 80-85.
Sugita M, Sugiura M: Regulation of gene expression in chloroplasts of higher plants. Plant Mol Biol. 1996, 32: 315-326. 10.1007/BF00039388.
Tavtigian SV, Simard J, Teng DH, Abtin V, Baumgard M, Beck A, Camp NJ, Carillo AR, Chen Y, Dayananth P, Desrochers M, Dumont M, Farnham JM, Frank D, Frye C, Ghaffari S, Gupte JS, Hu R, Iliev D, Janecki T, Kort EN, Laity KE, Leavitt A, Leblanc G, McArthur-Morrison J, Pederson A, Penn B, Peterson KT, Reid JE, Richards S, et al: A candidate prostate cancer susceptibility gene at chromosome 17p. Nat Genet. 2001, 27: 172-180. 10.1038/84808.
Chen Y, Beck A, Davenport C, Chen Y, Shattuck D, Tavtigian SV: Characterization of TRZ1, a yeast homolog of the human candidate prostate cancer susceptibility gene ELAC2 encoding tRNase Z. BMC Mol Biol. 2005, 6: 12. 10.1186/1471-2199-6-12.
Dubrovsky EB, Dubrovskaya VA, Levinger L, Schiffer S, Marchfelder A: Drosophila RNase Z processes mitochondrial and nuclear pre-tRNA 3' ends in vivo. Nucleic Acids Res. 2004, 32: 255-262. 10.1093/nar/gkh182.
Smith MM, Levitan DJ: The Caenorhabditis elegans homolog of the putative prostate cancer susceptibility gene ELAC2, hoe-1, plays a role in germline proliferation. Dev Biol. 2004, 266: 151-160. 10.1016/j.ydbio.2003.10.016.
Zhao Z, Su W, Yuan S, Huang Y: Functional conservation of tRNase ZL among Saccharomyces cerevisiae, Schizosaccharomyces pombe and humans. Biochem J. 2009, 422: 483-492. 10.1042/BJ20090743.
Gan X, Yang J, Li J, Yu H, Dai H, Liu J, Huang Y: The fission yeast Schizosaccharomyces pombe has two distinct tRNase ZLs encoded by two different genes and differentially targeted to the nucleus and mitochondria. Biochem J. 2011, 435: 103-111. 10.1042/BJ20101619.
Canino G, Bocian E, Barbezier N, Echeverria M, Forner J, Binder S, Marchfelder A: Arabidopsis encodes four tRNase Z enzymes. Plant Physiol. 2009, 150: 1491-1502.
Aravind L: An evolutionary classification of the metallo-beta-lactamase fold proteins. In Silico Biol. 1999, 1: 69-91.
Daiyasu H, Osaka K, Ishino Y, Toh H: Expansion of the zinc metallo-hydrolase family of the beta-lactamase fold. FEBS Lett. 2001, 503: 1-6. 10.1016/S0014-5793(01)02686-2.
Dominski Z: Nucleases of the metallo-beta-lactamase family and their role in DNA and RNA metabolism. Crit Rev Biochem Mol Biol. 2007, 42: 67-93. 10.1080/10409230701279118.
Schiffer S, Rosch S, Marchfelder A: Assigning a function to a conserved group of proteins: the tRNA 3'-processing enzymes. EMBO J. 2002, 21: 2769-2777. 10.1093/emboj/21.11.2769.
de la Sierra-Gallay IL, Pellegrini O, Condon C: Structural basis for substrate binding, cleavage and allostery in the tRNA maturase RNase Z. Nature. 2005, 433: 657-661. 10.1038/nature03284.
Ishii R, Minagawa A, Takaku H, Takagi M, Nashimoto M, Yokoyama S: Crystal structure of the tRNA 3' processing endoribonuclease tRNase Z from Thermotoga maritima. J Biol Chem. 2005, 280: 14138-14144. 10.1074/jbc.M500355200.
Kostelecky B, Pohl E, Vogel A, Schilling O, Meyer-Klaucke W: The crystal structure of the zinc phosphodiesterase from Escherichia coli provides insight into function and cooperativity of tRNase Z-family proteins. J Bacteriol. 2006, 188: 1607-1614. 10.1128/JB.188.4.1607-1614.2006.
Li de la Sierra-Gallay I, Mathy N, Pellegrini O, Condon C: Structure of the ubiquitous 3' processing enzyme RNase Z bound to transfer RNA. Nat Struct Mol Biol. 2006, 13: 376-377. 10.1038/nsmb1066.
Minagawa A, Takaku H, Ishii R, Takagi M, Yokoyama S, Nashimoto M: Identification by Mn2+ rescue of two residues essential for the proton transfer of tRNase Z catalysis. Nucleic Acids Res. 2006, 34: 3811-3818. 10.1093/nar/gkl517.
Minagawa A, Takaku H, Takagi M, Nashimoto M: A novel endonucleolytic mechanism to generate the CCA 3' termini of tRNA molecules in Thermotoga maritima. J Biol Chem. 2004, 279: 15688-15697. 10.1074/jbc.M313951200.
Schilling O, Vogel A, Kostelecky B, Natal da Luz H, Spemann D, Spath B, Marchfelder A, Troger W, Meyer-Klaucke W: Zinc- and iron-dependent cytosolic metallo-beta-lactamase domain proteins exhibit similar zinc-binding affinities, independent of an atypical glutamate at the metal-binding site. Biochem J. 2005, 385: 145-153. 10.1042/BJ20040773.
Vogel A, Schilling O, Meyer-Klaucke W: Identification of metal binding residues for the binuclear zinc phosphodiesterase reveals identical coordination as glyoxalase II. Biochemistry. 2004, 43: 10379-10386. 10.1021/bi049703+.
Zareen N, Hopkinson A, Levinger L: Residues in two homology blocks on the amino side of the tRNase Z His domain contribute unexpectedly to pre-tRNA 3' end processing. RNA. 2006, 12: 1104-1115. 10.1261/rna.4206.
Zareen N, Yan H, Hopkinson A, Levinger L: Residues in the conserved His domain of fruit fly tRNase Z that function in catalysis are not involved in substrate recognition or binding. J Mol Biol. 2005, 350: 189-199. 10.1016/j.jmb.2005.04.073.
Spath B, Kirchner S, Vogel A, Schubert S, Meinlschmidt P, Aymanns S, Nezzar J, Marchfelder A: Analysis of the functional modules of the tRNA 3' endonuclease (tRNase Z). J Biol Chem. 2005, 280: 35440-35447. 10.1074/jbc.M506418200.
Baillat D, Hakimi MA, Naar AM, Shilatifard A, Cooch N, Shiekhattar R: Integrator, a multiprotein mediator of small nuclear RNA processing, associates with the C-terminal repeat of RNA polymerase II. Cell. 2005, 123: 265-276. 10.1016/j.cell.2005.08.019.
Britton RA, Wen T, Schaefer L, Pellegrini O, Uicker WC, Mathy N, Tobin C, Daou R, Szyk J, Condon C: Maturation of the 5' end of Bacillus subtilis 16S rRNA by the essential ribonuclease YkqC/RNase J1. Mol Microbiol. 2007, 63: 127-138. 10.1111/j.1365-2958.2006.05499.x.
Mathy N, Benard L, Pellegrini O, Daou R, Wen T, Condon C: 5'-to-3' exoribonuclease activity in bacteria: role of RNase J1 in rRNA maturation and 5' stability of mRNA. Cell. 2007, 129: 681-692. 10.1016/j.cell.2007.02.051.
Schilling O, Spath B, Kostelecky B, Marchfelder A, Meyer-Klaucke W, Vogel A: Exosite modules guide substrate recognition in the ZiPD/ElaC protein family. J Biol Chem. 2005, 280: 17857-17862.
Karkashon S, Hopkinson A, Levinger L: tRNase Z catalysis and conserved residues on the carboxy side of the His cluster. Biochemistry. 2007, 46: 9380-9387. 10.1021/bi700578v.
Levinger L, Hopkinson A, Desetty R, Wilson C: Effect of changes in the flexible arm on tRNase Z processing kinetics. J Biol Chem. 2009, 284: 15685-15691. 10.1074/jbc.M900745200.
Wilusz JE, Freier SM, Spector DL: 3' end processing of a long nuclear-retained noncoding RNA yields a tRNA-like cytoplasmic RNA. Cell. 2008, 135: 919-932. 10.1016/j.cell.2008.10.012.
Haussecker D, Huang Y, Lau A, Parameswaran P, Fire AZ, Kay MA: Human tRNA-derived small RNAs in the global regulation of RNA silencing. RNA. 2010, 16: 673-695. 10.1261/rna.2000810.
Lee YS, Shibata Y, Malhotra A, Dutta A: A novel class of small RNAs: tRNA-derived RNA fragments (tRFs). Genes Dev. 2009, 23: 2639-2649. 10.1101/gad.1837609.
Bogerd HP, Karnowski HW, Cai X, Shin J, Pohlers M, Cullen BR: A mammalian herpesvirus uses noncanonical expression and processing mechanisms to generate viral MicroRNAs. Mol Cell. 2010, 37: 135-142. 10.1016/j.molcel.2009.12.016.
Diebel KW, Smith AL, van Dyk LF: Mature and functional viral miRNAs transcribed from novel RNA polymerase III promoters. RNA. 2010, 16: 170-185. 10.1261/rna.1873910.
Korver W, Guevara C, Chen Y, Neuteboom S, Bookstein R, Tavtigian S, Lees E: The product of the candidate prostate cancer susceptibility gene ELAC2 interacts with the gamma-tubulin complex. Int J Cancer. 2003, 104: 283-288. 10.1002/ijc.10945.
Noda D, Itoh S, Watanabe Y, Inamitsu M, Dennler S, Itoh F, Koike S, Danielpour D, Ten Dijke P, Kato M: ELAC2, a putative prostate cancer susceptibility gene product, potentiates TGF-beta/Smad-induced growth arrest of prostate cells. Oncogene. 2006, 25: 5591-5600. 10.1038/sj.onc.1209571.
Mineri R, Pavelka N, Fernandez-Vizarra E, Ricciardi-Castagnoli P, Zeviani M, Tiranti V: How do human cells react to the absence of mitochondrial DNA?. PLoS One. 2009, 4: e5713. 10.1371/journal.pone.0005713.
Peng WT, Robinson MD, Mnaimneh S, Krogan NJ, Cagney G, Morris Q, Davierwala AP, Grigull J, Yang X, Zhang W, Mitsakakis N, Ryan OW, Datta N, Jojic V, Pal C, Canadien V, Richards D, Beattie B, Wu LF, Altschuler SJ, Roweis S, Frey BJ, Emili A, Greenblatt JF, Hughes TR: A panoramic view of yeast noncoding RNA processing. Cell. 2003, 113: 919-933. 10.1016/S0092-8674(03)00466-5.
Zhao W, Yu H, Li S, Huang Y: Identification and analysis of candidate fungal tRNA 3'-end processing endonucleases tRNase Zs, homologs of the putative prostate cancer susceptibility protein ELAC2. BMC Evol Biol. 2010, 10: 272. 10.1186/1471-2148-10-272.
Vogel A, Schilling O, Spath B, Marchfelder A: The tRNase Z family of proteins: physiological functions, substrate specificity and structural properties. Biol Chem. 2005, 386: 1253-1264. 10.1515/BC.2005.142.
Jarvis P, Robinson C: Mechanisms of protein import and routing in chloroplasts. Curr Biol. 2004, 14: R1064-1077. 10.1016/j.cub.2004.11.049.
Pfanner N, Neupert W: The mitochondrial protein import apparatus. Annu Rev Biochem. 1990, 59: 331-353. 10.1146/annurev.bi.59.070190.001555.
Oommen A, Li XQ, Gegenheimer P: Cleavage specificity of chloroplast and nuclear tRNA 3'-processing nucleases. Mol Cell Biol. 1992, 12: 865-875.
Adams KL, Wendel JF: Polyploidy and genome evolution in plants. Curr Opin Plant Biol. 2005, 8: 135-141. 10.1016/j.pbi.2005.01.001.
Takahashi M, Takaku H, Nashimoto M: Regulation of the human tRNase ZS gene expression. FEBS Lett. 2008, 582: 2532-2536. 10.1016/j.febslet.2008.06.020.
Mohan A, Whyte S, Wang X, Nashimoto M, Levinger L: The 3' end CCA of mature tRNA is an antideterminant for eukaryotic 3'-tRNase. RNA. 1999, 5: 245-256. 10.1017/S1355838299981256.
Nashimoto M: Distribution of both lengths and 5' terminal nucleotides of mammalian pre-tRNA 3' trailers reflects properties of 3' processing endoribonuclease. Nucleic Acids Res. 1997, 25: 1148-1154. 10.1093/nar/25.6.1148.
Pellegrini O, Nezzar J, Marchfelder A, Putzer H, Condon C: Endonucleolytic processing of CCA-less tRNA precursors by RNase Z in Bacillus subtilis. EMBO J. 2003, 22: 4534-4543. 10.1093/emboj/cdg435.
Schiffer S, Rosch S, Marchfelder A: Recombinant RNase Z does not recognize CCA as part of the tRNA and its cleavage efficiency is influenced by acceptor stem length. Biol Chem. 2003, 384: 333-342. 10.1515/BC.2003.039.
Minagawa A, Ishii R, Takaku H, Yokoyama S, Nashimoto M: The flexible arm of tRNase Z is not essential for pre-tRNA binding but affects cleavage site selection. J Mol Biol. 2008, 381: 289-299. 10.1016/j.jmb.2008.05.016.
Hopkinson A, Levinger L: Effects of conserved D/T loop substitutions in the pre-tRNA substrate on tRNase Z catalysis. RNA Biol. 2008, 5: 104-111. 10.4161/rna.5.2.6086.
Ishii R, Minagawa A, Takaku H, Takagi M, Nashimoto M, Yokoyama S: The structure of the flexible arm of Thermotoga maritima tRNase Z differs from those of homologous enzymes. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2007, 63: 637-641. 10.1107/S1744309107033623.
Michaud M, Cognat V, Duchene AM, Marechal-Drouard L: A global picture of tRNA genes in plant genomes. Plant J. 2010, 66: 80-93.
Pfitzinger H, Guillemaut P, Weil JH, Pillay DT: Adjustment of the tRNA population to the codon usage in chloroplasts. Nucleic Acids Res. 1987, 15: 1377-1386. 10.1093/nar/15.4.1377.
Ceballos-Chavez M, Vioque A: Sequence-dependent Cleavage Site Selection by RNase Z from the Cyanobacterium Synechocystis sp. PCC 6803. J Biol Chem. 2005, 280: 33461-33469. 10.1074/jbc.M504691200.
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG: Clustal W and Clustal X version 2.0. Bioinformatics. 2007, 23: 2947-2948. 10.1093/bioinformatics/btm404.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Mol Biol Evol. 2011
Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21: 2104-2105. 10.1093/bioinformatics/bti263.
Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.
Ezraty B, Dahlgren B, Deutscher MP: The RNase Z homologue encoded by Escherichia coli elaC gene is RNase BN. J Biol Chem. 2005, 280: 16542-16545. 10.1074/jbc.C500098200.
The Arabidopsis Genome Initiative: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408: 796-815. 10.1038/35048692.
The International Brachypodium Initiative: Genome sequencing and analysis of the model grass Brachypodium distachyon. Nature. 2010, 463: 763-768.
Ming R, Hou S, Feng Y, Yu Q, Dionne-Laporte A, Saw JH, Senin P, Wang W, Ly BV, Lewis KL, Salzberg SL, Feng L, Jones MR, Skelton RL, Murray JE, Chen C, Qian W, Shen J, Du P, Eustice M, Tong E, Tang H, Lyons E, Paull RE, Michael TP, Wall K, Rice DW, Albert H, Wang ML, Zhu YJ, et al: The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature. 2008, 452: 991-996. 10.1038/nature06856.
Huang S, Li R, Zhang Z, Li L, Gu X, Fan W, Lucas WJ, Wang X, Xie B, Ni P, et al: The genome of the cucumber, Cucumis sativus L. Nat Genet. 2009, 41: 1275-1281. 10.1038/ng.475.
Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, Xu D, Hellsten U, May GD, Yu Y, Sakurai T, Umezawa T, Bhattacharyya MK, Sandhu D, Valliyodan B, Lindquist E, Peto M, Grant D, Shu S, Goodstein D, Barry K, Futrell-Griggs M, Abernathy B, Du J, Tian Z, Zhu L, et al: Genome sequence of the Palaeopolyploid soybean. Nature. 463: 178-183.
International Rice Genome Sequencing Project: The map-based sequence of the rice genome. Nature. 2005, 436: 793-800. 10.1038/nature03895.
Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, Schein J, Sterck L, Aerts A, Bhalerao RR, Bhalerao RP, Blaudez D, Boerjan W, Brun A, Brunner A, Busov V, Campbell M, Carlson J, Chalot M, Chapman J, Chen GL, Cooper D, Coutinho PM, Couturier J, Covert S, Cronk Q, et al: The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313: 1596-1604. 10.1126/science.1128691.
Chan AP, Crabtree J, Zhao Q, Lorenzi H, Orvis J, Puiu D, Melake-Berhan A, Jones KM, Redman J, Chen G, Cahoon EB, Gedil M, Stanke M, Haas BJ, Wortman JR, Fraser-Liggett CM, Ravel J, Rabinowicz PD: Draft genome sequence of the oilseed species Ricinus communis. Nat Biotechnol. 28: 951-956.
Doust AN, Kellogg EA, Devos KM, Bennetzen JL: Foxtail millet: a sequence-driven grass model system. Plant Physiol. 2009, 149: 137-141. 10.1104/pp.108.129627.
Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, Haberer G, Hellsten U, Mitros T, Poliakov A, Schmutz J, Spannagl M, Tang H, Wang X, Wicker T, Bharti AK, Chapman J, Feltus FA, Gowik U, Grigoriev IV, Lyons E, Maher CA, Martis M, Narechania A, Otillar RP, Penning BW, Salamov AA, Wang Y, Zhang L, Carpita NC, et al: The Sorghum bicolor genome and the diversification of grasses. Nature. 2009, 457: 551-556. 10.1038/nature07723.
Velasco R, Zharkikh A, Troggio M, Cartwright DA, Cestaro A, Pruss D, Pindo M, Fitzgerald LM, Vezzulli S, Reid J, Malacarne G, Iliev D, Coppola G, Wardell B, Micheletti D, Macalma T, Facci M, Mitchell JT, Perazzolli M, Eldredge G, Gatto P, Oyzerski R, Moretto M, Gutin N, Stefanini M, Chen Y, Segala C, Davenport C, Dematte L, Mraz A, et al: A high quality draft consensus sequence of the genome of a heterozygous grapevine variety. PLoS One. 2007, 2: e1326. 10.1371/journal.pone.0001326.
Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang J, Fulton L, Graves TA, Minx P, Reily AD, Courtney L, Kruchowski SS, Tomlinson C, Strong C, Delehaunty K, Fronick C, Courtney B, Rock SM, Belter E, Du F, Kim K, Abbott RM, Cotton M, Levy A, Marchetto P, Ochoa K, Jackson SM, Gillam B, et al: The B73 maize genome: complexity, diversity, and dynamics. Science. 326: 1112-1115.
We thank three anonymous reviewers for improving the quality of the manuscript, and Jingjing Peng and Wei Zhao for valuable discussions. This work was supported in part by grants from the National Science Foundation of China (31070703), the Priority Academic Program Development of Jiangsu Higher Education Institutions and Nanjing Normal University (2007104XGQ0148).
The authors declare that they have no competing interests.
LF, ZW and JL performed online database searches and sequence analysis. JY carried out a phylogenetic analysis. YH conceived this study, analyzed the data and drafted the manuscript. All authors have read and approved the final version of the manuscript.
Electronic supplementary material
Additional file 1: Distribution of candidate tRNase Zs identified in green plants. Abbreviations for species names are indicated in the parentheses. +The number of amino acids in plant tRNase Z and tRNase Z-like proteins. *Indicates that mispredicted sequences obtained from the databases have been corrected. ?Indicates the sequence could not be correctly predicted. (DOC 160 KB)
Additional file 2: Pairwise sequence comparisons of S. bicolor and S. italica tRNase ZSs. The accession numbers for proteins are listed in Additional file 1. The pairwise percent identity (I) and percent similarity (S) between tRNase ZSs from S. bicolor and S. italica were calculated using the Clustal W program. (DOC 34 KB)
About this article
Cite this article
Fan, L., Wang, Z., Liu, J. et al. A survey of green plant tRNA 3'-end processing enzyme tRNase Zs, homologs of the candidate prostate cancer susceptibility protein ELAC2. BMC Evol Biol 11, 219 (2011). https://doi.org/10.1186/1471-2148-11-219
- tRNA precursor (pre-tRNA)
- tRNase Z
- post-transcriptional processing
- green plant