The evolutionary origin of the Runx/CBFbeta transcription factors – Studies of the most basal metazoans
BMC Evolutionary Biology volume 8, Article number: 228 (2008)
Members of the Runx family of transcriptional regulators, which bind DNA as heterodimers with CBFβ, are known to play critical roles in embryonic development in many triploblastic animals such as mammals and insects. They are known to regulate basic developmental processes such as cell fate determination and cellular potency in multiple stem-cell types, including the sensory nerve cell progenitors of ganglia in mammals.
In this study, we detect and characterize the hitherto unexplored Runx/CBFβ genes of cnidarians and sponges, two basal animal lineages that are well known for their extensive regenerative capacity. Comparative structural modeling indicates that the Runx-CBFβ-DNA complex from most cnidarians and sponges is highly similar to that found in humans, with changes in the residues involved in Runx-CBFβ dimerization in either of the proteins mirrored by compensatory changes in the binding partner. In situ hybridization studies reveal that Nematostella Runx and CBFβ are expressed predominantly in small isolated foci at the base of the ectoderm of the tentacles in adult animals, possibly representing neurons or their progenitors.
These results reveal that Runx and CBFβ likely functioned together to regulate transcription in the common ancestor of all metazoans, and the structure of the Runx-CBFβ-DNA complex has remained extremely conserved since the human-sponge divergence. The expression data suggest a hypothesis that these genes may have played a role in nerve cell differentiation or maintenance in the common ancestor of cnidarians and bilaterians.
Developmental processes occur throughout an animal's entire life history, during both pre-adult and adult stages. Just as patterning and morphogenesis are required to sculpt the body during embryogenesis and larval development, similar cellular differentiation pathways must be continuously re-deployed in the adult to compensate for the turnover of differentiated cells, to generate context-specific specialized cell types, and to repair damaged tissues. For example, in adult mammals, cellular differentiation pathways are activated to replace various cells of the hematopoietic lineage , to differentiate ova , to develop mature hair follicles , and to heal wounds . Many such "adult developmental" processes are thought to depend upon stem cells exhibiting varying degrees of developmental potency . Additionally, the same developmental processes may occur in those adult animals that are capable of extensively regenerating missing body parts .
One family of genes known to be involved in both pre-adult and adult development is the Runx family of transcription factors. Runx proteins are important for myriad developmental processes in both protostomes (e.g., Drosophila and Caenorhabditis) and deuterostomes (e.g., vertebrates and echinoderms) (reviewed in [7, 8]). The Runx protein in a basal deuterostome, the sea urchin Strongylocentrotus purpuratus was found to be involved in basic developmental processes in the embryo and larva such as control of cell differentiation and survival [9–11], while an isoform of one of the zebrafish Runx genes products, Runx2b was recently found be a maternal factor acting as a ventral determining regulator in the earliest stages of axis formation in the embryo, further illustrating the importance of these factors in multiple aspects and phases of animal development . Importantly, Runx factors have been also widely shown to play particularly key roles in cell fate determination and the maintenance of stem cell populations [13–15].
Some metazoans, including C. elegans and the sea urchin, have a single Runt-like gene, while others, including Drosophila and Fugu, may harbor up to four genes [7, 8]. Mammalian genomes contain three Runx genes, each of which is critical for several different developmental processes occurring during embryogenesis: Runx1 is important for hematopoiesis and neurogenesis of distinct CNS neurons and nociceptive neurons in the dorsal root ganglion (DRG) [16–18], Runx2 is critical for skeletal morphogenesis [19–23], and Runx3 is important for neurogenesis of proprioceptive neurons in the DRG and for hematopoiesis [24, 25]. Mammalian Runx proteins are also important for recurring developmental processes in the adult. For instance, loss of Runx1 in the adult causes hematopoietic abnormalities . Overexpression of Runx2 in adults increases bone turnover leading to multiple fractures . Runx3 controls CD8 cells, dendritic cells and CD4 Th1 cells development in adult stages .
Runx genes regulate transcription of target genes by binding to a variant of the consensus binding site RCCRCA (R = purine). Cis-regulatory Runx binding sites have been found in proximal promoter regions, as distant as a few thousand base pairs from the start methionine, in introns, and even within the coding region of genes whose transcription they regulate [29–31]. Runx binding is mediated by a highly conserved DNA-binding domain termed the Runt domain (RD) (reviewed in ). While the RD can bind DNA independently, in vivo binding usually occurs with its heterodimeric binding partner CBFβ. The protein CBFβ lacks a DNA-binding domain, but when bound to Runx, it increases the DNA-binding affinity of the RD several fold, substantially enhancing its transcriptional activity (reviewed in ). All presently known Runx proteins terminate in a short conserved pentapeptide motif, VWRPY, which binds the co-repressor Groucho and allows them to act as transcriptional repressors under certain conditions . A post-translational control mechanism for mammalian Runx DNA-binding activity occurs via two cysteine 'redox switch' residues [34, 35]. Interestingly, due to their ability to either activate or suppress target genes , when Runx genes are mutated or their regulation is perturbed, they can act as either tumor suppressors or tumor inducers.
While Runx has been intensively studied in some triploblastic animals, little is known about its presence and function in the more ancient "basal" animal lineages such as cnidarians and sponges. These lineages are critical to understanding the early functional evolution of Runx because they diverged from the main metazoan stem lineage prior to the divergence of protostomes and deuterostomes (Figure 1). Over 99% of all extant animal species belong to the Triploblastica (or the Bilateria), including deuterostomes (e.g., vertebrates and sea urchins) and protostomes (e.g., insects and nematodes). Additionally, basal animals may be particularly informative about the evolution of developmental genes required during pre-adult and adult stages because they exhibit much greater regenerative capacity than many triploblastic model systems. For example, both sponges and cnidarians are able to completely regenerate from very small body fragments, and even from aggregates of single cells [37, 38]. In cnidarians, this extensive regenerative capacity extends to the nervous system, which is able to regenerate in its entirety after injury. It is experimentally possible to eliminate the entire cnidarian nervous system, obtaining viable epithelial animals which regenerate a complete and functional nerve network when repopulated with interstitial stem cells [39, 40]. During the regeneration process in hydra, pluripotent interstitial stem cells differentiate to form new nerve cells, gland cells and stinging neurons (nematocytes). In addition, the differentiated cells at the regenerating region undergo de-differentiation to form epithelial stem cells, which then re-differentiate to complete the regeneration process . Sponges have only four cell types, and one of these cell types, the archaeocyte, is a stem-cell lineage that gives rise to the three other differentiated cell types [42, 43]. Thus, cnidarians and sponges are excellent model organisms to study how developmental potency can be maintained in both stem cells and differentiated cells, and how de-differentiation and differentiation processes are initiated and controlled during de-novo creation of form upon regeneration.
In an effort to illuminate the early evolution of Runx and CBFβ genes and explore a possible ancestral connection with stem-cell potency and regeneration, we used computational methods to identify Runx and CBFβ orthologs from basal animals and then subjected them to phylogenetic analyses, comparative structural modeling, and expression studies. In the course of this work we identified Runx and CBFβ genes in cnidarians and sponges, revealing that these genes existed in the ancestral metazoan, an organism which lived perhaps as much as ~300 million years before the previously known earliest Runx and CBFβ genes from the bilaterian ancestor (Figure 1). The sequence of the Runt domain is highly conserved between sponges (Amphimedon queenslandica and Oscarella carmela), cnidarians (the sea anemone Nematostella vectensis, the freshwater hydra, Hydra magnipapillata, and the staghorn coral, Acropora millepora), and triploblastic animals. Likewise, the heterodimeric Runx/CBFβ transcription factor complex and its mode of DNA-binding appear to be highly conserved between sponges, cnidarians, and humans. In situ hybridization studies in adult Nematostella reveal expression in isolated cells of the tentacles, which may represent sensory neurons or their pluripotent progenitors. This possibility is intriguing because Runx factors are required for the development of sensory neurons in mammals [44–47]
Novel Runx and CBFβ genes in basal metazoans
To detect Runx and CBFβ genes in phylogenetically basal organisms, we used bioinformatic approaches to search EST and genomic databases for Runx and CBFβ in basal metazoans and unicellular protozoans. This search revealed well-conserved Runx and CBFβ gene pairs in the sea anemone Nematostella vectensis, the freshwater hydra Hydra magnipapillata, and the marine demosponge Amphimedon queenslandica. We also identified a Runx gene in the freshwater demosponge Oscarella carmela and a CBFβ gene in the coral Acropora millepora. Neither Runx nor CBFβ were detected in the genomes of any unicellular protozoans (Table 1).
The Nematostella Runx gene (Nv-Runx) encodes a predicted protein 496 amino acids long. It contains four exons, the first two of which encode the highly conserved Runt domain (RD). Within this domain, the Nematostella sequence is 81% identical and 93% similar to human RUNX1 (Figure 2A, B). The third and fourth exons encoding the C-terminus of the protein do not exhibit sequence similarity to other Runx proteins, nor to any other protein housed in the NCBI GenBank non-redundant database . Nevertheless, as in other Runx proteins, the Nv-Runx C-terminal region, residues 148–495, is enriched in proline (13%), serine (9%) and threonine (8%) residues. The anemone protein ends with the canonical C-terminal VWRPY Groucho binding motif . A characteristic intron at nucleotide position 328 within the RD, conserved amongst protostomes and deuterostomes , is also conserved in Nv-Runx. The second RD exon of Nematostella is identical in size to the corresponding exon in all deuterostome Runx genes (105 bp) that have been described. The closest similarity in gene structure is for the echinoderm Runx where the N-terminal part of the RD is encoded by a single exon while in other deuterostomes it spans two exons .
The Hydra magnipapillata Runx gene (Hm-Runx) encodes a shorter protein, 418 amino acids in length. The RD of Hm-Runx is also highly conserved with human Runx1 (82% identical and 92% similar). Hm-Runx also terminates in the Groucho binding motif VWRPY, but the remainder of this C-terminal region bears no obvious homology to any protein housed in GenBank. It is enriched in serine residues (14.8%), but interestingly, unlike the sea anemone protein, it is not enriched in proline (5.4%) or threonine (2.7%). The gene structure of Hm-Runx is similar to that of Nv-Runx and may thus reflect the ancestral condition for cnidarian Runx genes.
We found partial Runx transcripts from the freshwater demosponge Oscarella carmela (Oc-Runx) and the marine demosponge Amphimedon queenslandica (Aq-Runx) that encode well-conserved RDs. Oc-Runx and Aq-Runx share ~61% and 69% of residues with human RUNX1 in the 128 amino acid RD, respectively (Figure 2 and Additional file 1). The incomplete ESTs we identified do not encode the complete carboxy-terminus of the proteins, thus we cannot tell whether these sponge proteins also contain the Groucho-binding motif at their C-termini. However, all of the Runx proteins we have identified contain the two cysteine residues that act as redox switches in mammals (Figure 2, ). Notably, the second of these two cysteine residues (aligned residue 32 in Additional file 1) functions as a redox switch in deuterostomes but not in protostomes (being replaced by a serine residue in the latter). This position is a cysteine in both cnidarians and in one of the two sponge sequences (A. queenslandica), while being a valine in the freshwater sponge (Oscarella carmela).
A search of the NCBI EST database revealed three cnidarian CBFβ ESTs – one from Nematostella (Nv-CBFβ), one from Hydra magnipapillata (Hm-CBFβ) and one from a pre-settlement stage of the coral Acropora millepora (Am-CBFβ). The cnidarian ESTs encode proteins that are highly conserved relative to their bilaterian counterparts, e.g., the Nematostella protein is 59% identical and 79% similar to the human CBFβ isoform 2 (Figure 2D, Additional file 2). The anemone and coral sequences are very similar to each other (74% identity and 93% similarity), whereas the Hydra protein is markedly more distant (58% identical and 80% similar to both the Nematostella and A. millepora variants). This reflects the ancient divergence of the medusozoan lineage, which includes Hydra, from the anthozoan lineage, which includes corals and sea anemones (Figure 1, [49, 50]). A putative CBFβ protein identified in the genomic traces of the sponge Amphimedon queenslandica (Aq-CBFβ) bears almost as much similarity to human CBFβ as do cnidarian CBFβ proteins (54% identity, 73% similarity). Interestingly, Aq-CBFβ shares less identity with the 3 cnidarian CBFβ proteins (~47%) than it does with the human variant (54%). The sequence predicted based upon Blast homology includes exons from 2 genomic traces which cannot be directly assembled (Table 1).
The gene structure of CBFβ also appears to be highly conserved between cnidarians and deuterostome bilaterians. The Nematostella CBFβ (Nv-CBFβ) locus comprises six exons, and the positions of the first four exon boundaries are conserved relative to the CBFβ of the purple sea urchin, Strongylocentrotus purpuratus  (Figure 2C). The sixth exon is unique to Nematostella and encodes only the three final amino acid residues at the C-terminus of the protein, the stop codon, and the 3' UTR. The entire CBFβ transcript is encoded by a single exon gene in protostomes, suggesting that a retrotransposition event may have occurred deep within the protostome lineage.
Cnidarian and Sponge Runx and CBFβ possess the structural elements needed for heterodimerization and DNA binding
In triploblastic animals, Runx and CBFβ proteins are known to cooperate in the regulation of gene expression by binding DNA as heterodimers (see  and references therein). To test whether the cnidarian and sponge variants of these proteins possess the structural determinants necessary for them to interact and bind DNA together in a manner homologous to the human complex, we modeled their three dimensional structure using the human AML1(RUNX1)RD-CBFβ-DNA complex as a modeling template (see Methods). Unless noted otherwise, all residue numbers described in the text below correspond to the human RUNX1 and CBFβ proteins (see Additional files 3, 4, 5 for a detailed comparison of the amino acids and their positions in the cnidarian and sponge proteins).
Figure 3A shows a general view of the model of the Nematostella RD-CBFβ dimer in complex with DNA, with the functional residues involved in RD-CBFβ dimerization and DNA binding highlighted as ball-and-stick models. The RD contains two distinct functional surfaces – one involved in DNA binding (composed of 10 functional residues) and one mediating dimerization with CBFβ. CBFβ itself does not directly interact with the DNA double helix. Strikingly, all 10 residues of the DNA binding face of all of the cnidarian and sponge RD's are completely conserved with their human counterpart, strongly indicating that these proteins bind DNA in a similar manner (Figure 3A, Additional file 3 and 6) and prefer the same DNA sequence motif.
In three of the four organisms – Nematostella, Hydra and Amphimedon – the dimerization faces of Runx and CBFβ are conserved, and when amino acid residues differ from those of the human proteins these differences do not alter the general structure of the protein complex (Figure 3C–E, G–I, Additional files 4, 5). This can best be demonstrated with a more detailed analysis of the Nematostella Runx-CBFβ-DNA complex: in Nv-Runx, 13 of the 19 residues on the CBFβ binding face are conserved, including all P and G residues that may be important for the folding of the polypeptide chain (Figure 3C, Additional file 4). Four additional residues (M106, Y113, S114 and T147) are replaced by amino acids with similar physical-chemical characteristics, and only two residues (F153 and H163) are significantly different. The latter two residues are found in the outer rim of the interface where protein-protein contacts are generally less conserved. Similarly, 12 of the 18 residues involved in Runx binding by the human CBFβ are identical in Nv-CBFβ. Four additional residues have conserved physical-chemical characteristics (F17, T60, S65 and M101), and 2 residues (Q67 and P100) are significantly different (Figure 3G, Additional file 5).
It is noteworthy that the latter residues, while different, are well accommodated within the structure of the proteins: F153 of human Runx, which is in a hydrophobic environment (L102, T104, T121, T151), is replaced by K in Nv-Runx, whose side chain is capable of making most of the interactions with the hydrophobic environment. The positively charged end of this K makes an H-bond with the S that replaces T104. H163 in human Runx1 makes an exposed interaction (and therefore of small energetic contribution) with D66; it is replaced by C in Nv-Runx, which cannot form such an interaction but can interact with the nearby backbone carbonyl. P100 of human CBFβ, which makes hydrophobic contacts with Y85 and I109, is replaced by H in Nv-CBFβ. This residue can form a hydrogen bond with the backbone carbonyl of F69. Q67 of human CBFβ is solvent exposed in the separated molecule: it is replaced by H in Nematostella without losing or gaining particular contacts.
Interestingly, while not all of the residues in the RD-CBFβ interface are conserved between Nematostella and humans, a change in the residue of one protein is accompanied by a reciprocal change in its binding partner (Figure 3B). One such change involves residues F153 and M106 in human Runx1 RD, which interface with S65 and Q67 of CBFβ. Nematostella differs at both positions in its RD, where F153 and M106 are replaced by K121 and A73 in the Nematostella protein; these changes in the Runx protein are mirrored by changes in CBFβ, where S65 and Q67 are replaced by T65 and H67 (Figure 3E). Another reciprocal change involves H163 of human Runx1 RD, which interfaces with T62 and F17 in CBFβ; the corresponding residues in Nematostella are C131 (in RD) and S62 and M17 (in CBFβ), respectively. This suggests that while the specific residues may differ between taxa, these proteins have co-evolved to maintain their functional interaction.
In contrast to the Runx proteins from the cnidarians and the demosponge Amphimedon, the CBFβ binding face of the sponge Oscarella carmela Runx protein is quite distinct, with many non-conservative amino acid substitutions found both in the periphery and in the center of the CBFβ binding region (Figure 3F). This suggests either that the corresponding CBFβ protein in this sponge is considerably different, or that the Oc-Runx does not bind a CBFβ protein at all.
Phylogenetic history of Runx and CBFβ
Runx and CBFβ genes have previously been reported in many different phyla of triploblastic animals where a single species may contain multiple Runx or CBFβ genes. The detection of Runx and CBFβ genes in cnidarians and sponges allows, for the first time, reconstruction of their early evolutionary history, in the period preceding the evolutionary origin of Triploblasts.
Figure 4A depicts a phylogeny of Runx proteins from sponges, cnidarians, chordates, echinoderms, and arthropods. Sequences from the nematodes C. elegans, C. brigsae, and D. coronatus were excluded from this analysis as they are highly divergent from the other metazoan Runx proteins, and their inclusion would tend to cause long-branch attraction artifacts. An analysis including the nematode species is presented in Additional file 6 (see Additional file 9 for taxon IDs). The tree is rooted using the two sponge sequences. Consistent with the prevailing view of organismal phylogeny, this gene tree groups the two sponge sequences, the eighteen triploblast sequences, the six chordate sequences, the two urchin sequences, and the ten arthropod sequences into mutually exclusive clades, with each clade receiving bootstrap support > 75%. The Runx sequences do not recover a monophyletic Deuterostomia as the two sea urchin sequences (Strongylocentrotus and Heliocidaris) group more closely with insects than with chordates (although the bootstrap support for this grouping is relatively low). Neither does the Runx tree support cnidarian monophyly, as the sea anemone and hydra sequences appear as successive outgroups at the base of the triploblast clade. Interestingly, this tree implies that the cnidarian proteins have evolved relatively slowly. In particular, the sea anemone Runx appears to have undergone the least amount of evolutionary change of any taxon since the time of the sponge-triploblast ancestor. The anemone may therefore represent our best source of information regarding the ancestral Runx protein. As the human Runx paralogs (Runx1–3) and the insect Runx paralogs (e.g., Drosophila Lozenge, Runt, RuntA, and RuntB) fall into distinct and well supported clades, this tree implies that the ancestral Runx gene underwent independent gene duplication in the deuterostome and protostome lineages.
Figure 4B depicts a phylogeny of CBFβ proteins that is rooted using the single sequence from the sponge Amphimedon. Once again, the nematode sequences were found to be highly divergent from other metazoan CBFβ proteins, and they were excluded from this analysis. A tree containing these sequences is presented as Additional file 7 (see Additional file 10 for taxon IDs). Like the Runx tree, this gene tree recovers the monophyly of a number of accepted taxonomic groups. It groups the three cnidarian sequences, the two sea urchin sequences, the five insect sequences, the two ascidian sequences and the eight vertebrate sequences. Furthermore, the vertebrate clade is subdivided among three well established monophyletic taxa: the two jawless fishes (Eptatretus and Petromyzon), the two ray-finned fishes (Danio and Oryzias), and the four tetrapods (Xenopus, Gallus, Mus, and Homo). Once again, as in the Runx tree, the sea urchin sequences (Strongylocentrotus and Paracentrotus) do not group with other deuterostome sequences or even with the other echinoderm sequence in the analysis, the sea star Asterina. The tree is not consistent with triploblast monophyly because the cnidarian sequences fall within a cluster of deuterostome sequences.
In addition to the apparent reciprocity of amino acid changes that we have observed in the interacting portions of Runx and CBFβ, the anomalous position of the sea urchin sequences on both the Runx and CBFβ trees could be another reflection of co-evolution between these proteins. To explore this hypothesis further, we employed a variation on the protein co-evolution test proposed by Sato and coworkers  (Figure 5). When each of the phylogenies described above is limited to only those 11 taxa contained in both phylogenies, and the tree topologies are constrained to the accepted organismal phylogeny (Figure. 5A), the resultant branch lengths on the two trees are significantly and strongly correlated (Figure 5B, R2 = 0.83; Linear Least Regression test for significance: F1,17 = 85.11, p < 0.0001). In contrast, the branch lengths from a glyceraldehyde-phosphate-dehydrogenase phylogeny, which is a "house-keeping" gene serving as a control, are not correlated with the Runx branch lengths (F1,17 = 2.73, p = 0.1167) and only weakly correlated with the branch lengths from the CBFβ phylogeny (F1,17 = 9.31, p = 0.0072, R2= 0.35).
Runx and CBFβ are expressed in overlapping, but not identical, regions of the Nematostellahead and tentacle ectoderm
To determine if expression of Nv-Runx and Nv-CBFβ is spatially restricted along the primary body axis of the adult polyp we performed reverse-transcription polymerase chain reaction (RT-PCR) on equal amounts of total RNA isolated from four different body regions: (1) tentacles, (2) head region, (3) body column, and (4) physal region. Using two different primer pairs, Runx expression was detected in the tentacles and head/pharyngeal region but not the body column or physa (Figure 6). High levels of CBFβ expression were observed in the tentacles and head/pharyngeal regions; although a lower level of CBFβ expression could also be detected in the body column. No CBFβ expression was observed in the physal region.
We next used whole mount in-situ hybridization, followed by cryosectioning, to characterize the spatial expression of Runx and CBFβ. In agreement with the RT-PCR results, Nv-Runx was expressed mainly in the tentacles, with the strongest expression seen in the tentacle tips (Figure 7A). We found that the expression is limited to small foci found exclusively in the ectoderm, often in tight association with nematocytes (Figure 7G). Sporadic Runx-expressing cells were also seen in the outer body wall in the pharyngeal region (Figure 7I). The same general pattern of Runx expression was observed in ten different anemones using two different anti-sense RNA probes – a 450 nucleotide probe corresponding to exons 1–3 of the gene (containing the Runt domain; Figure 2A) as well as a 1.5 kb probe corresponding to the entire ORF (data not shown). No staining was observed with the complementary negative control sense-strand RNA probes (Figure 7A). While the general pattern of expression was similar in all of the individual anemones, possible differences in the relative level of expression were observed between different animals within the same experiment (compare Figure 7, panels B and C).
Similar to Runx, CBFβ is expressed in the tentacle ectoderm, with the strongest staining in the tentacle tips (Figure 8, panels A-C). Also like Runx, the CBFβ-expressing cells are located primarily in the basal layer of the ectoderm and in close association with large microbasic mastigophore nematocytes, which are a major component of this cell-rich region of the animal (Figure 8. panels F-H). However, in contrast to Runx, some of the individual animals assayed also expressed CBFβ in the mouth region (Figure 8, panels D, G, and H).
Runx and CBFβ Evolution
A primary motivation for performing the phylogenetic analysis was to decipher the relationships among the multiple paralogs that are found in some taxa, and to pinpoint the ancestral lineages in which gene duplications must have occurred in the Runx and/or CBFβ genes. The presence of only a single Runx locus in Porifera and Cnidaria, two of the most ancient animal phyla, along with the fact that both the maximum likelihood (ML) and neighbor-joining (NJ) trees can be rooted such that the sponge and cnidarian sequences emerge from the base of the tree, supports the hypothesis that the eumetazoan ancestor possessed only a single Runx locus. Subsequently, it appears that this ancestral Runx gene underwent independent duplications within arthropods and within vertebrates. While the non-insect arthropods are only known to possess single Runx genes, and the Runt1 sequences from spider (Cupennius), mite (Tetranychus), and crayfish (Pacifastacus) cluster together, distinct Runt, RunxA, and Lozenge genes are found in both Anopholes and Drosophila. Drosophila also possesses a fourth Runx paralog, RuntB. This requires at least three gene duplications among dipteran insects, with at least two of these gene duplications clearly predating the divergence between fruit fly and mosquito. While only single Runx proteins are known from the non-vertebrate chordates (Branchiostoma floridae, B. lanceolatum, and Ciona), there are three human paralogs (Runx 1–3), which cluster together on the tree. This is not surprising given that two rounds of whole genome duplication are known to have occurred early in vertebrate evolution, prior to the divergence of the ray-finned fishes from the lobe-finned fishes. The genomes of pufferfishes (Fugu and Tetraodon), which are ray-finned fishes, contain four Runx genes [53, 54], three of which appear as orthologs of the mammalian Runx1–3. The additional Runx gene has a relatively short coding region; it has been suggested that it represents an ancestral vertebrate Runx that was subsequently lost in the tetrapod lineage . Zebrafish also contain four Runx genes, however two of those are clearly Runx2 orthologs (Runx2a and Runx2b) that likely arose from partial or whole genome duplication in the ray-finned fish lineage [55, 56]. The two zebrafish Runx2 paralogs have different functions, including the aforementioned role of Runx2b in dorso-ventral axis formation . Thus, the evolution of this gene family displays dynamic changes in gene number as a result of gene duplications but also of possible gene loss.
With respect to CBFβ, the presence of only a single known CBFβ homolog in all taxa except the fruit fly, including other insects (e.g., Apis and Bombyx) points to a single, relatively recent gene duplication, in the lineage leading to Drosophila.
On both the Runx and CBFβ trees, two sea urchin sequences group together with robust bootstrap support (100%), but fail to group with other deuterostome taxa as expected. In the case of Runx, the two echinoid sequences (Strongylocentrotus and Heliocidaris) appear most closely related to an arthropod clade, although with relatively low bootstrap support (65%). In the case of CBFβ, the two echinoid sequences (Strongylocentrotus and Paracentrotus) fail to group with a fellow echinoderm, the starfish Asterina, instead appearing most closely related to a sequence from a snail (Biomphalaria). The bootstrap support for this particular grouping is <40%. The low bootstrap support indicates that the echinoids are not exhibiting a strong affinity for protostome sequences so much as they are exhibiting a lack of affinity for the more closely related deuterostome sequences. The simplest explanation for this pattern, which is also supported by echinoid-specific residues in the sequence alignments (Additional files 1, 2) is that these proteins have undergone echinoid-specific sequence changes that obscure their underlying affinity to other deuterostome sequences.
For both proteins, the amino acid substitution rates differ between lineages. In particular, both Runx and CBFβ have diverged rapidly in Drosphila and Caenorhabditis, resulting in the fruit fly and nematode proteins being less similar to their human homologs than are the cnidarian proteins, despite being more closely related to humans by some 50 million years. Drosophila Runx RD's share, on average, ~72% of residues with human Runx proteins (as compared to 81% for Nematostella) while C. elegans and C. briggsae share, on average, only ~48% residues with human RD. Likewise, the CBFβ proteins from cnidarians share 57–59% of the residues with human CBFβ, respectively, whereas Drosphila and C. elegans share only ~55% and 22%, respectively. These data are consistent with a number of recent studies that have noted greater genomic similarity between humans and anthozoan cnidarians than between humans and the major protostome model systems [57–59]. It remains to be seen whether the gene duplication events of both Runx and CBFβ in various lineages are correlated with a diversification of the roles and regulatory networks in these organisms. Such questions may be addressed through functional studies of proteins and computational predictions of regulatory networks.
Runx and CBFβ binding, co-evolution, and co-expression
The majority of the amino acids involved in the binding of CBFβ to Runx, and the binding of the Runx-CBFβ protein complex to DNA, are highly conserved between human and Nematostella. Our comparative protein modeling results show a high degree of stabilizing selection within the Runt domain (RD) and the RD-interacting face of CBFβ. Further, the complete conservation within the metazoa of residues necessary for interaction between the RD and DNA suggests that the RD-CBFβ-DNA complex has been essentially conserved throughout the ~700 million year history of the eumetazoa, in animals whose morphology, ecology, and physiology are as diverse as terrestrial vertebrates, aerial arthropods and marine invertebrates. Interestingly, the CBFβ-binding faces of the sponge proteins are more divergent. Indeed, the Oscarella Runx represents the longest branch on the Runx phylogeny. This is interesting given that we failed to recover a CBFβ homolog from this sponge. This higher degree of sequence divergence within the CBFβ-binding face of O. carmela Runx combined with the lack of CBFβ expression in developmentally mixed EST pools could indicate CBFβ loss within this demosponge or indicate that this demosponge may deserve a basal position within the metazoan lineage, pre-dating the origin of CBFβ. Targeted sequencing for CBFβ within genomic sequence of O. carmela may help resolve this.
Notably, where the Nematostella CBFβ protein differs from human, there appear to be compensatory changes in the Nematostella RD. The possibility of compensatory substitutions in the RD and CBFβ proteins of sea anemone and human suggests that functional constraints require that directional selection on either protein may promote commensurate divergence in its partner. To test this hypothesis over broad phylogenetic distances, we employed a variation on the protein co-evolution test proposed by . Our results indicate a significant and highly predictive relationship between the branch lengths of Runx and CBFβ phylogenetic trees, providing further support that the evolutionary rates of these two domains are correlated.
However, shared evolutionary histories could result in significant correlations between any pair of proteins . For example, any two proteins will exhibit a correlation if they have evolved in clock-like fashion on the same underlying phylogeny, or if the species from which they are derived differ substantially in their overall rates of molecular evolution. To evaluate the contribution of shared history to the observed correlation, we can compare proteins that do not interact functionally. We observed no correlation between the molecular evolution of Runx and GAPDH, a well-conserved 'house-keeping' gene, and we observed only a weak correlation between CBFβ and GAPDH (Figure 5B). Taken together with the reciprocal amino acid changes that have occurred over deep evolutionary divergences, this strongly suggests that these two proteins have co-evolved from sponges and cnidarians to humans to maintain their ability to co-regulate the transcription of target genes.
A role for Runx and CBFβ in nerve cells of adult Nematostella?
In the adult anemone, Nv-Runx is expressed in scattered ectodermal cells, mainly in the tentacles, and predominantly in the tentacle tips. Scattered cells in the body wall ectoderm also express Runx. Similarly, Nv-CBFβ is expressed most strongly in the ectoderm of the tentacle tips. However, in contrast to Runx, CBFβ is also expressed in the mouth and upper portion of the pharynx. Thus, Nv-Runx and Nv-CBFβ are expressed in similar, but possibly not completely overlapping regions of the anemone head and tentacles. Here, as in other systems, including mammals  and sea-urchins , CBFβ likely functions together with Runx in the sites where they overlap. At other sites, where CBFβ is expressed in the absence of Runx, CBFβ may have other roles, which may involve other partners .
The tentacles of sea anemones, where the expression of both Runx and CBFβ is most predominant, are the part of the animal involved in prey capture and manipulation. This region is relatively poor in cell types, with the majority of differentiated cell types present involved either in prey capture (nematocytes, spirocytes) or in its detection (sensory neurons and ganglion cells) (Figure 7H). The nematocytes themselves do not seem to express either Runx or CBFβ, nor do the gland/mucous cells, which are found exclusively in the apical part of the ectoderm (Runx and CBFβ expression is seen mainly in the mid and basal parts of the ectoderm). Thus, we speculate that the cells expressing Runx and CBFβ are probably neuronal cells. Alternatively, they could be precursors of one of the above differentiated cell types – most likely of nematocytes and spirocytes, which are highly enriched in the tentacles, especially towards the distal tips.
In vertebrates, Runx proteins are known to play critical roles in neural differentiation. For example, in mice, Runx1 and Runx3 influence the development and survival of sensory neurons in several ganglia types such as dorsal root ganglia [24, 25, 44, 45, 47]. In mice, Runx3 is essential for development of the monosynaptic stretch reflex arc [24, 25, 62] the simplest and most ancient neuronal circuit. Based on these findings, it was speculated that a Runx3 ortholog may play an analogous role in the highly similar anemone nematocysts response circuit . The spatial expression of Runx and CBFβ in adult Nematostella suggests that these proteins might also be involved in neural differentiation.
Taken together, our results suggest a putative role for Runx in the maintenance of specific neurons in the tentacles of adult anemones, in addition to the diversity of other networks and functions in which Runx is also likely involved. In vertebrates, Runx proteins regulate disparate functions at different developmental stages, in each case by interacting with a different regulatory network . It is noteworthy that an EST encoding the Acropora ortholog of CBFβ was detected in a library from a pre-settlement planula larva stage . This suggests that in corals Runx and CBFβ are expressed during embryonic and larval development. Future studies involving microarray analysis of gene expression, CHIP-ChIP analysis of Nv-Runx target genes, and gene knockdown experiments may help identify the various roles played by Nv-Runx, Nv-CBFβ, and their regulatory networks at different stages in the life history of Nematostella.
In this study, we showed that Runx and CBFβ are found in the most basal metazoans, cnidarians and sponges, and that in these animals they probably interact and bind DNA in a manner similar to that found in humans. Structural motifs involved in the regulation of the Runx-CBFβ pair, such as the Runx C-terminal Groucho binding motif and the cysteine redox switches, are also conserved, suggesting that other parts of the regulatory network in which Runx and CBFβ are embedded are also evolutionarily conserved. Both Runx and CBFβ are expressed mainly in small cells within the ectoderm of the tentacles, the identity of which still await proof, but may be sensory neurons or their precursors.
Maintenance of Anemones
Nematostella vectensis was obtained from the Aquatic Resources Division of the Marine Biological Laboratory at Woods Hole, as well as from the stock maintained at the Finnerty laboratory at Boston University. The anemones were housed in glass dishes of 50% natural seawater/50% distilled water and maintained at 20°C as described previously . The anemones were fed three times a week with freshly hatched nauplii of Artemia salina, and the water was changed weekly.
Identification of novel Runx/CBFβ proteins
Assembled genomes, genomic trace sequences, and/or expressed sequence tags from the following animals were interrogated utilizing a combination of keyword and sequence homology searches: Phylum Protista, Monosiga brevicollis and Proterospongia-like sp. ATCC50818; Phylum Porifera, Oscarella carmela and Amphimedon queenslandica; and Phylum Cnidaria, (class Anthozoa) Nematostella vectensis and (class Hydrozoa) Hydra magnipapillata. The genomes of several additional protozoan species were also searched (Table 1). Monosiga and Proterospongia ESTs were searched at Choanobase [65, 66] using tblastx . A total of 11178 ESTs for Oscarella carmela were downloaded from NCBI's Genbank database and searched locally using tblastx, as was a draft assembly of the Hydra magnipapillata genome (downloaded from the J. Craig Venter Institute . The Nematostella genome was searched using keyword and blastp/tblastx searches of the assembled genome and expressed sequences at the Nematostella vectensis Genomics Database (StellaBase) site [69, 70]. Genomic and coding sequences from other taxa were searched at NCBI (Table 1).
We utilized the default alignment parameters (protein distance matrix = GONNET 250; gap open penalty = 10.0; end gap penalty = -1.0; gap extension penalty = 0.2; gap distance penalty = 4) in ClustalX  to generate multiple sequence alignments for Runx and CBFβ proteins [7, 72], Additional files 1, 2). Numerous previously unidentified Runx and CBFβ sequences from diverse taxa were parsed and assembled from the NCBI dbEST database and from whole genome shotgun sequences after tblastn searches (Table 1, . The conserved 128 amino acid Runt domain [7, 72] and 135 CBFβ amino acid residues were used for phylogenetic inference (Additional files 1 and 2, respectively).
ProtTest was used to determine which of 80 possible amino acid substitution models is most appropriate for each dataset . Both the tree topology and the model of protein evolution were optimized simultaneously. The Akaike Information Criteria with a modification to control for small sample size (AICc, with alignment length representing sample size; ) was used to discriminate among models. Phylogenetic analyses of both Runx and CBFβ revealed rapid lineage divergence in the nematodes (C. elegans, C. brigsae, and Diploscapter coronatus), which suggests that the placement of these taxa could be confounded by long-branch attraction (Additional files 6 and 7, respectively). Because of this, phylogenetic analyses were performed with and without nematode worms for both proteins. The best model identified for Runx was RtREV  + γ (ΔAICc = 0). This model was employed in a maximum-likelihood analysis of Runx sequences using the program Phyml . Since RtREV is not available in the Phylip software package (version 3.65), the neighbor-joining analysis was performed using the 2nd best model: JTT  + γ (excluding nematodes: ΔAICc = 4.74, including nematodes: ΔAICc = 15.40). For the CBFβ sequences, JTT + γ was used for both maximum likelihood (Phyml) and neighbor-joining (Phylip) analyses for both datasets (e.g., with and without nematodes). The support for individual nodes was assessed using the bootstrap re-sampling method with 1000 replicates .
Distance test for Co-evolution
To determine if the Runt domain and the CBFβ protein are co-evolving, we culled the phylogenetic dataset to include only those taxa for which both Runt and CBFβ sequences are available. While constraining the tree topology such that it reflects the accepted evolutionary relationships among the 11 species for which we identified both proteins , we used parsimony (PAUP; Sinauer Associates, Sunderland, Massachusetts) to infer the number of amino acid substitutions that occurred along each branch for both the Runt domain and CBFβ protein. The branch lengths for each tree were plotted against each other. The strength of the correlation between corresponding branch lengths was evaluated using Pearson product moment correlations. Each protein was also independently compared to glyceraldehyde phosphate dehydrogenase (GAPDH) using the same approach. GAPDH is a "housekeeping gene" that serves as a control, because its sequence evolution is not expected to be functionally constrained by the evolution of either Runx or CBFβ. Correlations were considered significant if α < 0.05, as assessed by a linear least squares regression significance test.
Comparative Modeling of Runx1(RD)-DNA and Runx-CBFβ interfaces in Sponges and Cnidaria
Model structures of the Runt and CBFβ domains of Nematostella, Hydra, Amphimedon and Oscarella were constructed based on the X-ray structure of the human AML1(RUNX1)-CBFβ-DNA complex (PDB # 1H9D, ). The Runt domains from the cnidarians and sponges have a high sequence similarity with the human protein (see Results, below). The Nematostella sequence has the same length as the human Runx1 RD sequence whereas the hydra and sponge sequences are 1- or 2-residues longer. The inserts map to a loop that is not involved in either DNA or CBFβ binding. The Nematostella, Hydra, Amphimedon and human variants of CBFβ are also highly similar. The major difference between Nematostella and human CBFβ is the loss of a five amino acid stretch in the Nematostella sequence (Figure 2D). This deletion maps to a loop between Pro70 and Thr80 in the human CBFβ sequence whose structure is not resolved in the experimental structure of RD-CBFβ complex and is not involved in AML1(RUNX1)-CBFβ-DNA complex formation. This loop was not modeled. The Hydra protein has one amino acid deleted in this region, whereas the Amphimedon CBFβ domain shows a 2-residue insert the same region. The overall high degree of conservation strongly suggests that the overall folds of the Runx(RD) and CBFβ domains studied here are similar to the folds of the corresponding human domains. Therefore, the Cα atoms were constrained to their initial positions during the energy minimization, except for the modified loop in Runx(RD). Models were constructed using the 'Homology' module of InsightII (Accelrys Inc., San Diego CA), soaked in a layer of water and energy minimized using the 'Discover-3' module, while employing the CVFF force field.
The likelihood of the RD-CBFβ interaction in Nematostella, Hydra and A. queenslandica was estimated by analyzing the conservation of the interface residues. Additional files 3, 4, 5 indicate interface residues which are buried more than 15% at the interface in the human RD-CBFβ-DNA complex (the 15% limit excludes residues at the edge of the interface whose contribution to the stability of the complex may be marginal). The surface area of each residue which is buried at the interface was estimated as the difference between its exposed surface area in the separated domain and in complex. Exposed surface areas were calculated using the Homology module.
RT-PCR and Cloning of Nv-Runx and Nv-CBFβtranscripts
RNA was isolated from adult Nematostella anemones using Tri-Reagent according to the manufacturer's protocols (Invitrogen, Carlsbad, CA, USA). Reverse-Transcription Polymerase Chain Reaction (RT-PCR) was used to analyze the expression of Runx and CBFβ transcripts in different body regions of the anemone. Adult anemones were first anesthetized by gradually supplementing the culture medium with 14% w/v MgCl2·6H2O. Transverse sections were made using a razor blade to divide each animal into four distinct axial regions: (1) tentacles, (2) head/pharyngeal region (containing tentacle bases), (3) body column, and (4) physal region. First-strand cDNAs were synthesized using RevertAid™ H Minus First Strand cDNA Synthesis Kit (Fermentas, Life Sciences, EU). Runx and CBFβ were amplified from these cDNA pools in a Mastercycler gradient PCR machine (Eppendorf) using Taq DNA polymerase (Fermentas) under the conditions specified by the manufacturer. The oligonucleotides used were: Nv-Runx TCTCCGAATTTTGTGTGCAG and TTCTACTGCGGAATCCATCC of exon 1 and exon 3 respectively; Nv-CBFβ GAAAACTCGCTCGTGAATCC and AGTCTTTGCTTCGCCTGTTC of exon 1 and exon 5 respectively and actin primers CATGGAATTGTCACAAACTGG and CACACTTCATGATGGAGTTG that are general for cnidarians. PCR products were cloned into pGEM-T-Easy (Promega), and sequenced at the Hebrew University Genome Center.
In situ hybridization was performed using the protocol reported by Martinez and co-workers  with several modifications. Briefly, medium sized adult Nematostella (3 days after last feeding) were transferred into a small dish, allowed to relax for about one hour and anesthetized as described above. After about 10 minutes, the anemones did not respond to tactile stimulation, at which point they were fixed in 4% paraformaldehyde (PFA) in NM for at least four hours at room temperature. The animals were dehydrated through a methanol series, rehydrated into PBST (Phosphate Buffered Saline containing 0.1% Tween-20), and digested for 20 minutes with 0.3 U/ml Proteinase K (Roche). The digestion was terminated by incubation in 4% Glycine in PBST for ten minutes. The animals were then washed twice in PBST, twice in 0.1 M triethanolamine (pH 7.8) twice in 0. 25% (v/v) acetic anhydride in 0.1 M triethanolamine, twice more in PBST, and then refixed for one hour in 4% PFA in PBST. After several more washes in PBST, the anemones were washed once in 50% PBST/50% HS (Hybridization Solution: 50% formamide, 0.1% Tween 20, 5 × SSC, 0.1% CHAPS, 1× Denhardt's solution, 100 μg/ml heparin and 200 μg/ml tRNA), once in 100% HS, and prehybridized in 100% HS for at least two hours at 55°C. Hybridization was performed in HS containing 0.05 ng/μl of the appropriate DIG-labeled probe (produced using the DIG RNA labeling kit, Roche) for 48–72 hours at 55°C. Following hybridization, the animals were taken through a series of seven washes, each lasting 1 hour at 55°C (100% HS/tRNA; 100%, HS/2 × SSC; 75% HS/2 × SSC, 50% HS/2 × SSC; 25% HS/2 × SSC; 2 × SSC/1% CHAPS, 2 × SSC/1% CHAPS). The animals were then transferred to MAB (Maleic Acid Buffer: 0.1 M Maleic Acid, 0.15 M NaCl, pH 7.5), blocked for at least two hours in blocking solution (20% Goat Serum, 1% Bovine Serum Albumin, 0.1% Sodium Azide in MAB) and incubated overnight at 4°C with alkaline phosphatase-conjugated anti-digoxigenin antibodies (Roche) that had been preadsorbed to adult Nematostella. Following incubation with the antibody, the animals were washed extensively in MAB over a 24-hour period at room temperature and transferred into NTMT (0.1 M NaCl, 0.05 M MgCl2 and 0.1% Tween-20 in 0.1 M Tris-HCl, pH 9.5). Presence of the antibody was detected using the color substrate NBT-BCIP (Roche). The staining reaction was terminated by extensive washing with PBST. Pictures were taken with either an Olympus BX51 microscope equipped with an Optronics camera or an Olympus dissecting microscope (Carl Zeiss, Jena, Germany) equipped with an Olympus DP10 CCD camera (Tokyo, Japan).
Sectioning of animals after whole mount in-situhybridization
Following whole-mount in situ hybridization, animals were washed extensively in PBST, infiltrated with 30% sucrose in PBST as a cryoprotectant, immersed in Tissue-Tek medium and frozen on dry ice or liquid nitrogen. Ten-micron sections were taken using a Leica CM1850 cryostat (Wetzlar, Germany). The sections were counterstained in Eosin and mounted in glycerol without dehydration and tissue clearing, as this can cause loss of the color precipitate. Micrographs were takes with an Olympus BX51 microscope equipped with an Optronics camera. For the histological sections of the mouth region, an anemone was taken after whole mount in-situ hybridization with a control probe, refixed in 4% PFA in PBST, dehydrated into isopropanol and embedded in Agar-100 resin. Two-micron sections were taken with an Ultratome-3 (LKB-Prodikter) and stained with Methylene Blue.
Hartenstein V: Blood cells and blood cell development in the animal kingdom. Annu Rev Cell Dev Biol. 2006, 22: 677-712. 10.1146/annurev.cellbio.22.010605.093317.
Stocco C, Telleria C, Gibori G: The Molecular Control of Corpus Luteum Formation, Function, and Regression. Endocr Rev. 2007, 28 (1): 117-149. 10.1210/er.2006-0022.
Stenn KS, Paus R: Controls of hair follicle cycling. Physiol Rev. 2001, 81 (1): 449-494.
Page-McCaw A, Ewald AJ, Werb Z: Matrix metalloproteinases and the regulation of tissue remodelling. Nat Rev Mol Cell Biol. 2007, 8 (3): 221-233. 10.1038/nrm2125.
Shi C, Zhu Y, Su Y, Cheng T: Stem cells and their applications in skin-cell therapy. Trends in Biotechnology. 2006, 24 (1): 48-52. 10.1016/j.tibtech.2005.11.003.
Bosch TC: Why polyps regenerate and we don't: towards a cellular and molecular framework for Hydra regeneration. Dev Biol. 2007, 303 (2): 421-433. 10.1016/j.ydbio.2006.12.012.
Rennert J, Coffman JA, Mushegian AR, Robertson AJ: The evolution of Runx genes I. A comparative study of sequences from phylogenetically diverse model organisms. BMC Evol Biol. 2003, 3: 4-10.1186/1471-2148-3-4.
Levanon D, Groner Y: Structure and regulated expression of mammalian RUNX genes. Oncogene. 2004, 23 (24): 4211-4219. 10.1038/sj.onc.1207670.
Coffman JA, Dickey-Sims C, Haug JS, McCarthy JJ, Robertson AJ: Evaluation of developmental phenotypes produced by morpholino antisense targeting of a sea urchin Runx gene. BMC Biol. 2004, 2: 6-10.1186/1741-7007-2-6.
Dickey-Sims C, Robertson AJ, Rupp DE, McCarthy JJ, Coffman JA: Runx-dependent expression of PKC is critical for cell survival in the sea urchin embryo. BMC Biol. 2005, 3: 18-10.1186/1741-7007-3-18.
Robertson AJ, Dickey-Sims C, Ransick A, Rupp DE, McCarthy JJ, Coffman JA: CBFbeta is a facultative Runx partner in the sea urchin embryo. BMC Biol. 2006, 4: 4-10.1186/1741-7007-4-4.
Flores MV, Lam EY, Crosier KE, Crosier PS: Osteogenic transcription factor Runx2 is a maternal determinant of dorsoventral patterning in zebrafish. Nat Cell Biol. 2008, 10 (3): 346-352. 10.1038/ncb1697.
North TE, Stacy T, Matheny CJ, Speck NA, de Bruijn MF: Runx1 is expressed in adult mouse hematopoietic stem cells and differentiating myeloid and lymphoid cells, but not in maturing erythroid cells. Stem Cells. 2004, 22 (2): 158-168. 10.1634/stemcells.22-2-158.
Nimmo R, Antebi A, Woollard A: mab-2 encodes RNT-1, a C. elegans Runx homologue essential for controlling cell proliferation in a stem cell-like developmental lineage. Development. 2005, 132 (22): 5043-5054. 10.1242/dev.02102.
Raveh E, Cohen S, Levanon D, Negreanu V, Groner Y, Gat U: Dynamic expression of Runx1 in skin affects hair structure. Mech Dev. 2006
Wang Q, Stacy T, Binder M, Marin-Padilla M, Sharpe AH, Speck NA: Disruption of the Cbfa2 gene causes necrosis and hemorrhaging in the central nervous system and blocks definitive hematopoiesis. Proc Natl Acad Sci U S A. 1996, 93 (8): 3444-3449. 10.1073/pnas.93.8.3444.
Nucifora G, Rowley JD: AML1 and the 8;21 and 3;21 translocations in acute and chronic myeloid leukemia. Blood. 1995, 86 (1): 1-14.
de Bruijn MF, Speck NA, Peeters MC, Dzierzak E: Definitive hematopoietic stem cells first develop within the major arterial regions of the mouse embryo. Embo J. 2000, 19 (11): 2465-2474. 10.1093/emboj/19.11.2465.
Otto F, Thornell AP, Crompton T, Denzel A, Gilmour KC, Rosewell IR, Stamp GW, Beddington RS, Mundlos S, Olsen BR, Selby PB, Owen MJ: Cbfa1, a candidate gene for cleidocranial dysplasia syndrome, is essential for osteoblast differentiation and bone development. Cell. 1997, 89 (5): 765-771. 10.1016/S0092-8674(00)80259-7.
Ducy P, Zhang R, Geoffroy V, Ridall AL, Karsenty G: Osf2/Cbfa1: a transcriptional activator of osteoblast differentiation. Cell. 1997, 89 (5): 747-754. 10.1016/S0092-8674(00)80257-3.
Lian JB, Javed A, Zaidi SK, Lengner C, Montecino M, van Wijnen AJ, Stein JL, Stein GS: Regulatory controls for osteoblast growth and differentiation: role of Runx/Cbfa/AML factors. Crit Rev Eukaryot Gene Expr. 2004, 14 (1-2): 1-41. 10.1615/CritRevEukaryotGeneExpr.v14.i12.10.
Lian JB, Stein GS, Javed A, van Wijnen AJ, Stein JL, Montecino M, Hassan MQ, Gaur T, Lengner CJ, Young DW: Networks and hubs for the transcriptional control of osteoblastogenesis. Rev Endocr Metab Disord. 2006
Komori T, Yagi H, Nomura S, Yamaguchi A, Sasaki K, Deguchi K, Shimizu Y, Bronson RT, Gao YH, Inada M, Sato M, Okamoto R, Kitamura Y, Yoshiki S, Kishimoto T: Targeted disruption of Cbfa1 results in a complete lack of bone formation owing to maturational arrest of osteoblasts. Cell. 1997, 89 (5): 755-764. 10.1016/S0092-8674(00)80258-5.
Levanon D, Bettoun D, Harris-Cerruti C, Woolf E, Negreanu V, Eilam R, Bernstein Y, Goldenberg D, Xiao C, Fliegauf M, Kremer E, Otto F, Brenner O, Lev-Tov A, Groner Y: The Runx3 transcription factor regulates development and survival of TrkC dorsal root ganglia neurons. Embo J. 2002, 21 (13): 3454-3463. 10.1093/emboj/cdf370.
Inoue K, Ozaki S, Shiga T, Ito K, Masuda T, Okado N, Iseda T, Kawaguchi S, Ogawa M, Bae SC, Yamashita N, Itohara S, Kudo N, Ito Y: Runx3 controls the axonal projection of proprioceptive dorsal root ganglion neurons. Nat Neurosci. 2002, 5 (10): 946-954. 10.1038/nn925.
Growney JD, Shigematsu H, Li Z, Lee BH, Adelsperger J, Rowan R, Curley DP, Kutok JL, Akashi K, Williams IR, Speck NA, Gilliland DG: Loss of Runx1 perturbs adult hematopoiesis and is associated with a myeloproliferative phenotype. Blood. 2005, 106 (2): 494-504. 10.1182/blood-2004-08-3280.
Geoffroy V, Kneissel M, Fournier B, Boyde A, Matthias P: High bone resorption in adult aging transgenic mice overexpressing cbfa1/runx2 in cells of the osteoblastic lineage. Mol Cell Biol. 2002, 22 (17): 6222-6233. 10.1128/MCB.22.17.6222-6233.2002.
Fainaru O, Woolf E, Lotem J, Yarmus M, Brenner O, Goldenberg D, Negreanu V, Bernstein Y, Levanon D, Jung S, Groner Y: Runx3 regulates mouse TGF-beta-mediated dendritic cell function and its absence results in airway inflammation. Embo J. 2004, 23 (4): 969-979. 10.1038/sj.emboj.7600085.
Otto F, Lubbert M, Stock M: Upstream and downstream targets of RUNX proteins. J Cell Biochem. 2003, 89 (1): 9-18. 10.1002/jcb.10491.
Coffman JA: Runx transcription factors and the developmental balance between cell proliferation and differentiation. Cell Biol Int. 2003, 27 (4): 315-324. 10.1016/S1065-6995(03)00018-0.
Westendorf JJ, Hiebert SW: Mammalian runt-domain proteins and their roles in hematopoiesis, osteogenesis, and leukemia. J Cell Biochem. 1999, Suppl 32-33: 51-58. 10.1002/(SICI)1097-4644(1999)75:32+<51::AID-JCB7>3.0.CO;2-S.
Adya N, Castilla LH, Liu PP: Function of CBFbeta/Bro proteins. Semin Cell Dev Biol. 2000, 11 (5): 361-368. 10.1006/scdb.2000.0189.
Levanon D, Goldstein RE, Bernstein Y, Tang H, Goldenberg D, Stifani S, Paroush Z, Groner Y: Transcriptional repression by AML1 and LEF-1 is mediated by the TLE/Groucho corepressors. Proc Natl Acad Sci U S A. 1998, 95 (20): 11590-11595. 10.1073/pnas.95.20.11590.
Akamatsu Y, Ohno T, Hirota K, Kagoshima H, Yodoi J, Shigesada K: Redox regulation of the DNA binding activity in transcription factor PEBP2. The roles of two conserved cysteine residues. J Biol Chem. 1997, 272 (23): 14497-14500. 10.1074/jbc.272.23.14497.
Kagoshima H, Akamatsu Y, Ito Y, Shigesada K: Functional dissection of the alpha and beta subunits of transcription factor PEBP2 and the redox susceptibility of its DNA binding activity. J Biol Chem. 1996, 271 (51): 33074-33082. 10.1074/jbc.271.51.33074.
Blyth K, Cameron ER, Neil JC: The RUNX genes: gain or loss of function in cancer. Nat Rev Cancer. 2005, 5 (5): 376-387. 10.1038/nrc1607.
Gierer A, Berking S, Bode H, David CN, Flick K, Hansmann G, Schaller H, Trenkner E: Regeneration of hydra from reaggregated cells. Nat New Biol. 1972, 239 (91): 98-101.
Technau U, Cramer von Laue C, Rentzsch F, Luft S, Hobmayer B, Bode HR, Holstein TW: Parameters of self-organization in Hydra aggregates. Proc Natl Acad Sci U S A. 2000, 97 (22): 12127-12131. 10.1073/pnas.97.22.12127.
Campbell RD: Elimination by Hydra interstitial and nerve cells by means of colchicine. J Cell Sci. 1976, 21 (1): 1-13.
Campbell RD, Josephson RK, Schwab WE, Rushforth NB: Excitability of nerve-free hydra. Nature. 1976, 262 (5567): 388-390. 10.1038/262388a0.
Holstein TW, Hobmayer E, Technau U: Cnidarians: an evolutionarily conserved model system for regeneration?. Dev Dyn. 2003, 226 (2): 257-267. 10.1002/dvdy.10227.
Muller WE: The stem cell concept in sponges (Porifera): Metazoan traits. Semin Cell Dev Biol. 2006, 17 (4): 481-491. 10.1016/j.semcdb.2006.05.006.
Muller WE, Korzhev M, Le Pennec G, Muller IM, Schroder HC: Origin of metazoan stem cell system in sponges: first approach to establish the model (Suberites domuncula). Biomol Eng. 2003, 20 (4-6): 369-379. 10.1016/S1389-0344(03)00055-8.
Theriault FM, Nuthall HN, Dong Z, Lo R, Barnabe-Heider F, Miller FD, Stifani S: Role for Runx1 in the proliferation and neuronal differentiation of selected progenitor cells in the mammalian nervous system. J Neurosci. 2005, 25 (8): 2050-2061. 10.1523/JNEUROSCI.5108-04.2005.
Theriault FM, Roy P, Stifani S: AML1/Runx1 is important for the development of hindbrain cholinergic branchiovisceral motor neurons and selected cranial sensory neurons. Proc Natl Acad Sci U S A. 2004, 101 (28): 10343-10348. 10.1073/pnas.0400768101.
Chen CL, Broom DC, Liu Y, de Nooij JC, Li Z, Cen C, Samad OA, Jessell TM, Woolf CJ, Ma Q: Runx1 determines nociceptive sensory neuron phenotype and is required for thermal and neuropathic pain. Neuron. 2006, 49 (3): 365-377. 10.1016/j.neuron.2005.10.036.
Inoue KI, Ito K, Osato M, Lee B, Bae SC, Ito Y: The transcription factor Runx3 represses the neurotrophin receptor TrkB during lineage commitment of dorsal root ganglion neurons. J Biol Chem. 2007
The NCBI Entrez protein database. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=Protein]
Bridge D, Cunningham CW, Schierwater B, DeSalle R, Buss LW: Class-level relationships in the phylum Cnidaria: evidence from mitochondrial genome structure. Proc Natl Acad Sci U S A. 1992, 89 (18): 8750-8753. 10.1073/pnas.89.18.8750.
Bridge D, Cunningham CW, DeSalle R, Buss LW: Class-level relationships in the phylum Cnidaria: molecular and morphological evidence. Mol Biol Evol. 1995, 12 (4): 679-689.
Li Z, Lukasik SM, Liu Y, Grembecka J, Bielnicka I, Bushweller JH, Speck NA: A mutation in the S-switch region of the Runt domain alters the dynamics of an allosteric network responsible for CBFbeta regulation. J Mol Biol. 2006, 364 (5): 1073-1083. 10.1016/j.jmb.2006.10.002.
Sato T, Yamanishi Y, Horimoto K, Kanehisa M, Toh H: Partial correlation coefficient between distance matrices as a new indicator of protein-protein interactions. Bioinformatics. 2006, 22 (20): 2488-2492. 10.1093/bioinformatics/btl419.
Glusman G, Kaur A, Hood L, Rowen L: An enigmatic fourth runt domain gene in the fugu genome: ancestral gene loss versus accelerated evolution. BMC Evol Biol. 2004, 4: 43-10.1186/1471-2148-4-43.
Ng CE, Osato M, Tay BH, Venkatesh B, Ito Y: cDNA cloning of Runx family genes from the pufferfish (Fugu rubripes). Gene. 2007, 399 (2): 162-173. 10.1016/j.gene.2007.05.014.
Christoffels A, Koh EG, Chia JM, Brenner S, Aparicio S, Venkatesh B: Fugu genome analysis provides evidence for a whole-genome duplication early during the evolution of ray-finned fishes. Mol Biol Evol. 2004, 21 (6): 1146-1151. 10.1093/molbev/msh114.
Flores MV, Tsang VW, Hu W, Kalev-Zylinska M, Postlethwait J, Crosier P, Crosier K, Fisher S: Duplicate zebrafish runx2 orthologues are expressed in developing skeletal elements. Gene Expr Patterns. 2004, 4 (5): 573-581. 10.1016/j.modgep.2004.01.016.
Darling JA, Reitzel AR, Burton PM, Mazza ME, Ryan JF, Sullivan JC, Finnerty JR: Rising starlet: the starlet sea anemone, Nematostella vectensis. Bioessays. 2005, 27 (2): 211-221. 10.1002/bies.20181.
Kortschak RD, Samuel G, Saint R, Miller DJ: EST analysis of the cnidarian Acropora millepora reveals extensive gene loss and rapid sequence divergence in the model invertebrates. Curr Biol. 2003, 13 (24): 2190-2195. 10.1016/j.cub.2003.11.030.
Sullivan JC, Reitzel AM, Finnerty JR: A high percentage of introns in human genes were present early in animal evolution: evidence from the basal metazoan Nematostella vectensis. Genome Inform. 2006, 17 (1): 219-229.
Pazos F, Ranea JA, Juan D, Sternberg MJ: Assessing protein co-evolution in the context of the tree of life assists in the prediction of the interactome. J Mol Biol. 2005, 352 (4): 1002-1015. 10.1016/j.jmb.2005.07.005.
Westfall JA, Elliott CF, Carlin RW: Ultrastructural evidence for two-cell and three-cell neural pathways in the tentacle epidermis of the sea anemone Aiptasia pallida. J Morphol. 2002, 251 (1): 83-92. 10.1002/jmor.1075.
Kramer I, Sigrist M, de Nooij JC, Taniuchi I, Jessell TM, Arber S: A role for Runx transcription factor signaling in dorsal root ganglion sensory neuron diversification. Neuron. 2006, 49 (3): 379-393. 10.1016/j.neuron.2006.01.008.
Levanon D, Glusman G, Bettoun D, Ben-Asher E, Negreanu V, Bernstein Y, Harris-Cerruti C, Brenner O, Eilam R, Lotem J, Fainaru O, Goldenberg D, Pozner A, Woolf E, Xiao C, Yarmus M, Groner Y: Phylogenesis and regulated expression of the RUNT domain transcription factors RUNX1 and RUNX3. Blood Cells Mol Dis. 2003, 30 (2): 161-163. 10.1016/S1079-9796(03)00023-8.
Hand C, Uhlinger KR: The culture, sexual and asexual reproduction and growth of the sea-anemone Nematostella vectensis. Biological Bulletin. 1992, 182: 169-176. 10.2307/1542110.
The King lab site. [http://mcb.berkeley.edu/labs/king/blast/]
King N, Hittinger CT, Carroll SB: Evolution of key cell signaling and adhesion protein families predates animal origins. Science. 2003, 301 (5631): 361-363. 10.1126/science.1083853.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.
The Craig Venter Institute site. [https://research.venterinstitute.org/files/hydra_magnipapillata_WGS_assembly]
The Nematostella vectensis Genomics Database. [http://stellabase.com]
Sullivan JC, Ryan JF, Watson JA, Webb J, Mullikin JC, Rokhsar D, Finnerty JR: StellaBase: the Nematostella vectensis Genomics Database. Nucleic Acids Res. 2006, 34 (Database issue): D495-9. 10.1093/nar/gkj020.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25 (24): 4876-4882. 10.1093/nar/25.24.4876.
Kagoshima H, Shigesada K, Kohara Y: RUNX regulates stem cell proliferation and differentiation: Insights from studies of C. elegans. J Cell Biochem. 2007, 100 (5): 1119-1130. 10.1002/jcb.21174.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410.
Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21 (9): 2104-2105. 10.1093/bioinformatics/bti263.
Wagenmakers EJ, Farrell S: AIC model selection using Akaike weights. Psychon Bull Rev. 2004, 11 (1): 192-196.
Dimmic MW, Rest JS, Mindell DP, Goldstein RA: rtREV: an amino acid substitution matrix for inference of retrovirus and reverse transcriptase phylogeny. J Mol Evol. 2002, 55 (1): 65-73. 10.1007/s00239-001-2304-y.
Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52 (5): 696-704. 10.1080/10635150390235520.
Jones DT, Taylor WR, Thornton JM: The rapid generation of mutation data matrices from protein sequences. Computer Applications in the Biosciences. 1992, 8 (2): 275-282.
Felsenstein J: Phylogenies from molecular sequences: inference and reliability. Annu Rev Genet. 1988, 22: 521-565. 10.1146/annurev.ge.22.120188.002513.
Peterson KJ, Butterfield NJ: Origin of the Eumetazoa: testing ecological predictions of molecular clocks against the Proterozoic fossil record. Proc Natl Acad Sci U S A. 2005, 102 (27): 9547-9552. 10.1073/pnas.0503660102.
Bravo J, Li Z, Speck NA, Warren AJ: The leukemia-associated AML1 (Runx1)--CBF beta complex functions as a DNA-induced molecular clamp. Nat Struct Biol. 2001, 8 (4): 371-378. 10.1038/86264.
Martinez DE, Dirksen ML, Bode PM, Jamrich M, Steele RE, Bode HR: Budhead, a fork head/HNF-3 homologue, is expressed during axis formation and head specification in hydra. Dev Biol. 1997, 192 (2): 523-536. 10.1006/dbio.1997.8715.
Guindon S, Lethiec F, Duroux P, Gascuel O: PHYML Online--a web server for fast maximum likelihood-based phylogenetic inference. Nucleic Acids Res. 2005, 33 (Web Server issue): W557-9. 10.1093/nar/gki352.
The authors would like to dedicate this paper to the memory of Eliahu Zlotkin, thanking him for not muzzling the ox as it treadth the grain. Our thanks also go to Nicole King (UC Berkeley) for public accessibility of the sponge EST database. This study was supported by NSF grant IBN-0212773 (JRF), BU SPRInG grant 20-202-8103-9 (JRF), NSF grant FP-91656101-0 (JRF and JCS) and the Israel Science Foundation grant 825/07 (UG).
JCS, DS, JRF and UG conceived and designed the study. DS, UG, KS and JCS discovered and analyzed the genomic structure of the Nv-Runx gene. JCS searched various databases for Runx and CBFβ together with DS and performed the phylogenetic analyses together with AMR and JRF. DS performed the molecular work together with UG and HM contributed to the gene expression studies. ME performed the structural modeling and analyzed the data in collaboration with DS, DL, YG and UG. KS participated in the genomic search for Runx binding sites together with JCS. All authors participated in writing and approval of the final manuscript.
James C Sullivan, Daniel Sher contributed equally to this work.
Electronic supplementary material
Additional file 2: CBFβ alignment used for phylogenetic inference. The sources of the sequences and the abbreviations for taxa are provided in Additional file 10. (PPT 604 KB)
Additional file 6: Runx phylogeny, including nematode lineages. To test for long-branch attraction, the the phylogenetic relationships among Runx proteins were estimated with and without nematode worms. Neither the topology nor the bootstrap support for individual nodes was significantly affected by exclusion of nematode sequences (compare against Figure 4A). (PPT 462 KB)
Additional file 7: CBFβ phylogeny, including nematode lineages. To test for long-branch attraction, the CBFβ phylogeny was created with and without nematode worms. Neither topology nor bootstrap support was significantly affected by exclusion of nematode sequences (compare against Figure 4B). (PPT 182 KB)
Additional file 8: The dimerization interfaces of Nv-Runx and Nv-CBFβ with the position of the interacting amino acid residues noted. Runx is shown in blue and CBFβ in green. Non conservative replacements are indicated in red. Note that these replacements occur at the edge of the interface. Also, they are reciprocated in the other protein, thus in the RD-CBFβ complex the replacements F153->K121 and H163->C131 in RD are in spatial proximity to the replacements Q67->H67 and F17->M17 in CBFβ, respectively. (PPT 730 KB)
Additional file 9: Genebank accession numbers and taxon ID's of taxa used in phylogenetic analyses of the Runt domain. (DOC 54 KB)
Additional file 10: Genebank accession numbers and taxon ID's of taxa used in phylogenetic analyses of CBFβ. (DOC 54 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Sullivan, J.C., Sher, D., Eisenstein, M. et al. The evolutionary origin of the Runx/CBFbeta transcription factors – Studies of the most basal metazoans. BMC Evol Biol 8, 228 (2008). https://doi.org/10.1186/1471-2148-8-228