The evolution of Brassica napus FLOWERING LOCUST paralogues in the context of inverted chromosomal duplication blocks
BMC Evolutionary Biology volume 9, Article number: 271 (2009)
The gene FLOWERING LOCUS T (FT) and its orthologues play a central role in the integration of flowering signals within Arabidopsis and other diverse species. Multiple copies of FT, with different cis-intronic sequence, exist and appear to operate harmoniously within polyploid crop species such as Brassica napus (AACC), a member of the same plant family as Arabidopsis.
We have identified six BnFT paralogues from the genome of B. napus and mapped them to six distinct regions, each of which is homologous to a common ancestral block (E) of Arabidopsis chromosome 1. Four of the six regions were present within inverted duplicated regions of chromosomes A7 and C6. The coding sequences of BnFT paralogues showed 92-99% identities to each other and 85-87% identity with that of Arabidopsis. However, two of the paralogues on chromosomes A2 and C2, BnA2.FT and BnC2.FT, were found to lack the distinctive CArG box that is located within intron 1 that has been shown in Arabidopsis to be the binding site for theFLC protein. Three BnFT paralogues (BnA2.FT, BnC6.FT.a and BnC6.FT.b) were associated with two major QTL clusters for flowering time. One of the QTLs encompassing two BnFT paralogues (BnC6.FT.a and BnC6.FT.b) on chromosome C6 was resolved further using near isogenic lines, specific alleles of which were both shown to promote flowering. Association analysis of the three BnFT paralogues across 55 cultivars of B. napus showed that the alleles detected in the original parents of the mapping population used to detect QTL (NY7 and Tapidor) were ubiquitous amongst spring and winter type cultivars of rapeseed. It was inferred that the ancestral FT homologues in Brassica evolved from two distinct copies, one of which was duplicated along with inversion of the associated chromosomal segment prior to the divergence of B. rapa (AA) and B. oleracea (CC). At least ten such inverted duplicated blocks (IDBs) were identified covering a quarter of the whole B. napus genome.
Six orthologues of Arabidopsis FT were identified and mapped in the genome of B. napus which sheds new light on the evolution of paralogues in polyploidy species. The allelic variation of BnFT paralogues results in functional differences affecting flowering time between winter and spring type cultivars of oilseed Brassica. The prevalent inverted duplicated blocks, two of which were located by four of the six BnFT paralogues, contributed to gene duplications and might represent predominant pathway of evolution in Brassica.
Timing of the onset of flowering is an important agronomic trait affecting crop production. To meet the challenges of climate change, and the need to adapt crops to a wider range of growing environments, it is necessary to coordinate flowering within the context of seasonal variations in order to ensure the greatest possibility of pollination, and thus consistently high seed yield. The genetic basis of variation in flowering time is now well understood in Arabidopsis. Forward and reverse genetics has allowed identification of genes in the context of environmental and developmental cues that mediate the onset of flowering, and allowed detailed characterization of the photoperiod, vernalization, gibberellin and autonomous pathways. Major integrators of these pathways include FLOWERING LOCUS T (FT), along with SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 (SOC1) and LEAFY (LFY) [1–5]. FT induces flowering in response to long days and is a direct target of the nuclear protein CONSTANS (CO) in leaves [6–8]. Two proteins within the vernalization pathway, FLOWERING LOCUS C (FLC) and SHORT VEGETATIVE PHASE (SVP), bind to the CArG box within intron 1 and promoter region of FT, respectively, to repress its expression [9, 10]. FT is expressed in the phloem of leaves, with the small protein moving as a long-distance signal to the shoot apical meristem (SAM) where it interacts with FD, a bZIP transcription factor, to form a complex of FT/FD heterodimer. This then activates the floral meristem identity genes APETALA 1 (AP1) and FRUITFUL (FUL) to promote flowering [7, 8, 11, 12].
Although FT plays a central and indispensable role to induce flowering, interpreting the exact roles of different FT paralogues is difficult, owing to variation in the structure and number of FT family members across plant taxa. For example in rice, at least three FT-like genes (OsFTL1-OsFTL3) promote flowering, although there are thirteen such genes within the genome, corresponding to eight in the ancestral grass genome [13–16]. It has been shown that the protein encoded by Hd3a, corresponding to OsFTL2, is a mobile signal in rice, and moves from the leaf to the SAM to induce flowering . A complex situation also exists in barley, where five FT-like genes are found, with HvFT1 being highly expressed under long day conditions at the time of transition from vegetative to reproductive growth, and HvFT2 and HvFT4 expressed later in development. HvFT3 is a candidate gene for a major flowering-time QTL, and the expression level of HvFT5 is very low in short days, corresponding to a predicted stop codon within the protein at residue 69 . Four FT-like cDNAs have been cloned in poplar, of which only PtFT2 has been shown to shorten juvenile phase and promote seasonal flowering . Although Brassica species share a common ancestor with Arabidopsis and diverged at 14.5 to 20.4 MYA [20–22], no FT paralogues had previously been identified, apart from 3 RFLP probes from Arabidopsis FT having been mapped to the A2 (N2) and C6 (N16) linkage groups of B. napus .
B. napus is a major oil crop in temperate regions worldwide with approximately 52 million tonnes of seed production per year (2007-2008; http://www.worldoil.com/). Results of comparative mapping between the Arabidopsis and Brassica genomes suggest that numerous regions homologous to the Arabidopsis genome are triplicated within diploid species of Brassica [24–27]. The B. rapa (2n = 20, AA) and B. oleracea (2n = 18, CC) genomes are essential conserved intact within the amphidiploid species B. napus (2n = 38, AACC), which appears to have been synthesized within the past ten thousand years [24–26, 28–31]. Recently, more detailed evidence for the ancestral segmental chromosomal duplications leading to effective triplication in Brassica diploids has been obtained from partial genome sequencing and comparative chromosome painting [22, 25, 32–34]. However, comparative sequence analysis indicates that various mechanisms of genome evolution have contributed to many situations where either more or less than three paralogous genes, corresponding to single orthologues in Arabidopsis, are present in the Brassica A or C genome [26, 28, 35–37].
Although there are five homologs of FT in Arabidopsis, i.e. TERMINAL FLOWER 1 (TFL1), TWIN SISTER OF FT (TSF), ARABIDOPSIS THALIANA CENTRORADIALIS (ATC), BROTHER OF FT AND TFL1 (BFT), and MOTHER OF FTAND TFL1 (MFT), their functions are very diverged from FT, and only share 56-82% identity at the amino acid sequence level, with TSF having the highest similarity . This information allows us to identify FT orthologues from the B. napus genome unambiguously. We sought to characterize the role that ancestral segmental genome duplication events had played on the number and function of FT orthologues within B. napus, and to establish how this had contributed to the ability to adapt to contrasting agricultural systems, such as those associated with spring and winter seasonal crop types. This paper presents the results of cloning and characterizing the BnFT paralogues, and discusses the evolution of BnFT paralogues within B. napus.
Isolation and genetic mapping of BnFTparalogues
Thirty-five potential FT orthologue-containing BAC clones were identified by Southern blot screening of the JBnB BAC library, which was developed from the doubled haploid B. napus cv. Tapidor . Eleven of these were verified by PCR amplification. These BACs were initially grouped according to intron length (4 groups), and then grouped according to the polymorphism within intron 2 (6 groups) (Additional file 1). Full-length genomic sequences of these six BnFT paralogues were isolated from representative BACs by amplification and PCR walking, with primers designed from one copy of BrFT (designated as BrA2.FT) located in chromosome A2 of B. rapa (Figure 1, Table 1).
The six BnFT paralogues were mapped using PCR-derived markers to four linkage groups of the TNDH genetic map, two in the A genome and two in the C genome (Figure 2). The BnFT paralogues within B. napus were named as follows: BnA2.FT, BnA7.FT.a, BnA7.FT.b, BnC2.FT, BnC6.FT.a and BnC6.FT.b according to their mapped chromosomes. In silico mapping indicates that all of the genomic regions containing BnFT paralogues corresponded to block E in chromosome 1 of Arabidopsis , although block E was inversely duplicated on linkage groups A7 and C6, where it forms inverted duplication blocks (IDBs). Based on marker identity, the BnFT paralogues in the IDBs were close to the junctions of each duplicated block (Figure 2).
According to marker identity, ten IDBs were detected within the TNDH linkage map associated with eight chromosomes (Figure 3, Additional file 2 ). The IDBs covered a quarter of the whole linkage groups of B. napus that could align with the genome of Arabidopsis (Table 2). Interestingly, blocks located in chromosome 3 and 5 of Arabidopsis [26, 39] rarely corresponded with the IDBs in the genome of B. napus. The majority of IDBs were detected in the A genome and not in the corresponding regions of the C genome, since most markers had been developed from B. rapa BACs.
BnFTparalogues: candidate genes of QTL for flowering time?
Two BnFT paralogues, BnC6.FT.a and BnC6.FT.b, were mapped previously to a major QTL cluster region for flowering time, which was detected in all winter-cropped environments, with the Tapidor alleles contributing to accelerated flowering . The BnA2.FT was newly identified to be located within the confidence interval region of the QTL cluster on chromosome A2 in two winter-cropped environments, with the NY7 allele promoting flowering (Figure 4A). To test whether the BnC6.FT.a and BnC6.FT.b represented candidate genes of QTL for flowering time, a DH line harbouring the complete C6 QTL cluster in Tapidor was backcrossed to NY7 for four generations, and two near isogenic lines (NILs) with different combinations of introgressed segments on C6, NI-5 and NI-9, were selected. Both NILs flowered much earlier than NY7 (Figure 4B). NI-5, which had a small additional introgressed fragment containing two BnC6.FT alleles of Tapidor, significantly flowered earlier than NI-9 which lacked Tapidor alleles (Figure 4B). To further confirm whether the BnFT candidates significantly affected flowering time in B. napus, BnC6.FT.a, BnC6.FT.b and BnA2.FT were subjected to association analysis with 35 spring and 20 winter cultivars (Additional file 3). The flowering time of the accessions was investigated in vernalization-free conditions, and the genotypes tested with the PCR markers. The NY7 and Tapidor alleles of three BnFT paralogues were significantly (P < 0.01) ubiquitous in the spring and winter type cultivars (Table 3). In particular, two paralogues, BnC6.FT.a and BnC6.FT.b, showed 95% similarity with Tapidor alleles in winter type cultivars (Table 3). The functional differences observed between spring and winter cultivars implied that the expression of BnFT candidates was repressively regulated prior to vernalization.
Characterization of BnFTparalogues
The canonical organization of FT into four exons and three introns as found in Arabidopsis (Figure 5B) is well conserved across all the FT-like genes identified in rice, poplar and barley genomes. However, some variation of intron size is observed, with the third intron of HvFT1 being absent. The number and exon size of all BnFT paralogues in B. napus were identical to their orthologues in Arabidopsis. However, the intron size of all these genes, especially intron 2, was found to differ with that in Arabidopsis (Figure 5B). Thus a larger ORF is found for BnA2.FT and a smaller ORF for BnC2.FT. In both cases a single base deletion is present within intron 1 that disrupts the CArG box, the binding motif for FLC protein to repress FT expression prior to vernalization in Arabidopsis .
Over the coding sequence, the six BnFT paralogues had 85-87% identity with AtFT, 81-83% with AtTSF [GenBank: NM_118156], and 92-99% with each other (Table 4). Interestingly, the paralogues within homeologous regions of the B. napus A and C genomes, such as BnA2.FT and BnC2.FT, showed the highest nucleotide identity (99%), whilst the paralogues present within inverse duplicated regions, such as BnC6.FT.a and BnC6.FT.b, showed higher identity (97%) than among other pairs of paralogues, such as BnA2.FT and BnC6.FT.a (92%). The degree of amino acid identity showed the same relationships as the coding sequences (Table 4). The BnFT paralogues showed much higher similarity with AtFT than with AtTSF, indicating that conserved amino acids had been substituted at some sites, although the proteins are likely to perform similar functions.
Phylogenetic and evolutionary analysis of FT-like genes across plant taxa
The phylogenetic relationships among FT-like genes from B. napus, B. rapa, Arabidopsis, poplar, rice and barley were analyzed, with a phylogenetic tree constructed from the amino acid sequences of these genes. Four groups could be identified: Hd3a [GenBank: AB052943] and HvFT1 [GenBank: EU331816], ptFT2 [GenBank: AY515152], AtFT [GenBank: NM_105222] and Brassica FT genes (Figure 5A). The classification of species was consistent with established divergence of plant taxa. It was clear that Brassica FT genes fell into three groups: BnA7.FT.b and BnC6.FT.b in group 1, BnA7.FT.a and BnC6.FT.a in group 2, and BnA2.FT, BnC2.FT and BrA2.FT in group 3 (Figure 5A).
The Ks values between AtFT and eight Brassica FT genes (from 0.33 to 0.36) (Table 5) suggest a divergence time of 12 to 13 MYA between Brassica and Arabidopsis (Figure 5C). The Ks values between FT genes evolved from Bx.FT-1 (Figure 5A) and Bx.FT-2 (0.17 to 0.19), Bx.FT-2a and Bx.FT-2b (0.071 to 0.072) (Table 5) suggest that the first and second duplications may have occurred in Brassica ~6-7 MYA and 2.5 MYA, respectively. The divergence of A and C genomes (0.041-0.06) may have occurred at 1.5-2 MYA, with natural B. napus (0.0033-0.0058) (Table 5) calculated to have emerged approximately 0.12-0.21 MYA (Figure 5C). The order of the evolutionary events calculated by Ks values is consistent with the established phylogenetic relationship of the FT genes.
We isolated and mapped six BnFT paralogues from B. napus. Those regions containing BnFT spanned all E segments of Arabidopsis chromosome 1, as previously reported in B. napus [26, 39, 40]. Based on marker identity additional IDBs were identified in A7 and C6 [41, 42]. However, our primary interest was to identify the BnFT paralogues within B. napus and to understand the evolutionary processes and functional consequences associated with their duplication.
Characterization of BnFTparalogues
Although the FT family members within Arabidopsis, Brassica, rice, barley and poplar have similar exon/intron structures (apart from barley HvFT1), the number of paralogous genes differs. The single FT gene in Arabidopsis has been ascribed a "florigen" function that responds to long days [6, 12]. In contrast, rice, a short day plant, has thirteen FT-like genes, of which only the Hd3a protein has been shown to have a "florigen" function, with the roles of other paralogues largely unknown [13, 14, 17]. Five FT-like genes are found in barley and play different roles . B. napus is an allotetraploid derived from interspecific hybridization of B. rapa and B. oleracea [43, 44] and is more closely related to Arabidopsis. The six BnFT paralogues exhibit high levels of nucleotide identity to AtFT within coding sequences, which initially suggested that all BnFT may have a similar function contributing to the induction of flowering. It is of intrinsic and agronomic interest to determine how the presence of six BnFT paralogues may cooperate to regulate onset of flowering in B. napus. Based on the analysis of the cis and coding sequences of the closely related BnFT paralogues, it should be possible to dissect the relative timing and contribution of locus-specific paralogues via expression profiles. One may expect that the presence of multiple paralogous copies provides B. napus with additional capacity for exquisite tuning of the network of signals that are integrated from the pathways leading to flowering. This tuning will be mediated in different cultivars through differential expression of alleles at each paralogous locus, and such differentiation will be particularly pronounced between winter and spring type cultivars. More complex interactions with the growing environment may also arise from variation in the epigenetic status of the paralogous genes.
Multiple pathways are integrated by FT to control flowering , with FT being the direct target of CO, FLC and SVP proteins in Arabidopsis [9, 10, 45]. In Brassica species, several homologues of Arabidopsis flowering pathway genes have been identified, such as four BnCO copies (BnCOa1, BnCOa9, BnCOb1 and BnCOb9) in B. napus , five BoFLC paralogues (BoFLC1, BoFLC2, BoFLC3, BoFLC4 and BoFLC5) in B. oleracea [47, 48], and four BrFLC copies (BrFLC1, BrFLC2, BrFLC3 and BrFLC5) in B. rapa . Based on the evidence from B. rapa and B. oleracea, between eight or ten BnFLC paralogues are anticipated within the whole genome of B. napus. It is tempting to speculate that distinct BnFT copies may be the targets of one or more BnCO and BnFLC paralogues in B. napus. A characteristic CArG box, which acts as the FLC protein binding site, was detected in the first intron of four BnFT paralogues. However, it was absent in BnA2.FT, BnC2.FT and BrA2.FT (Figure 5B). These difference in the structural features of the BnFT paralogues strongly suggested that they may have undergone functional differentiation and regulatory variation in the context of polyploidy. It has been suggested that redundancy may create subtle fitness advantages that might only be evident in particular stages of the life cycle, or under particular environmental conditions . Force et al.  suggest that complementary degenerative mutations in different regulatory elements of duplicated genes can facilitate the preservation of both duplicates, thereby increasing the long-term opportunities for the evolution of new gene functions. Interestingly, the CArG box was also not detected in PtFT2, Hd3a and HvFT1 (Figure 5B) indicating that polyploidy may provide such FT orthologues with the opportunity to explore sequence variation that enables more diverse or subtle functionality within the context of more complex regulatory mechanisms and pathways, harmonized within their own genome.
Earlier genetic mapping revealed that three RFLP markers probed with Arabidopsis FT mapped to the A2 (N2) and C6 (N16) linkage groups of B. napus , although this did not account for the full number of expected BnFT paralogues, nor the available sequence information. In this study, we established a preliminary allelic relationship between BnFT paralogues and flowering time QTL, and extended this interpretation through association mapping analysis. Firstly, the NI-5 that harbored a small introgression fragment containing Tapidor BnFT alleles flowered much earlier than NI-9 (Figure 4B). This indicated that Tapidor might possess alleles of BnFT with much stronger flower-promoting functional effect than those of NY7. Secondly, association mapping with three BnFT candidates indicated that the alleles of NY7 and Tapidor were significantly prevalent in the spring and winter type rapeseeds, respectively (Table 3). The flowering time pathways in Arabidopsis are better defined than in B. napus, and it is well known that correct flowering time ensures the greatest chance of pollination, higher seed yield and oil content, and therefore reproduction of crop cultivars. Accurate predictive combination of flowering time QTLs, including those associated with a Brassica "florigen" function will accelerate the selection of cultivars with appropriate flowering times for different regions, especially where there is variation in latitude and seasonal temperatures associated with winter environments. The different regulatory mechanism and pathway of flowering between winter and spring types are now able to be analyzed further.
Brassica FTgenes and genome evolution
Comparative mapping between Arabidopsis and Brassica species led Lagercrantz et al.  to be the first to propose an ancestral segmental triplication affecting the complete Brassica genome. Subsequently, this view has been substantiated by compelling evidence from genetic and cytogenetic studies across most of the Brassiceae taxa, which indicate this was achieved by a series of distinct duplication events [25, 32, 33]. However, the detailed mechanisms underlying the process of sequential duplications are not yet well characterized. Based on the sequences of four paralogous B. rapa BAC clones and the homologous 124-kb segment of A. thaliana chromosome 5, Yang et al.  deduced that three paralogous subgenomes of B. rapa emerged through duplications 13 to 17 MYA, very soon after the Arabidopsis and Brassica divergence occurred at 17 to 18 MYA. Using BAC-FISH techniques, three or six copies of the contig have been identified from 18 species of Brassicaceae , and the process of ancestral allohexaploidy Brassica genome was further revealed as hybridization between genomes of the ancient diploid and tetraploid .
Here, we identified six BnFT paralogues in B. napus and determined that each of the A and C genomes contained three copies, which is consistent with the established view of Brassica genome triplication. Moreover, phylogenetic analysis of complete BnFT paralogues in B. napus allowed us to determine a clear evolutionary relationship between each other. It is apparent that the Brassica FT paralogous genes and the unique Arabidopsis FT share the same ancestor. The extant Brassica FT genes derived from two ancestral lineages (Bx), Bx.FT-1 and Bx.FT-2. Subsequently Bx.FT-2 gave rise to two highly identical FT copies, Bx.FT-2a and Bx.FT-2b, and finally Bx.FT-1 together with Bx.FT-2a and Bx.FT-2b formed the final triplication of BnFT genes in A and C genomes (Figure 5A). Thus, we propose an evolutionary pathway of Brassica FT genes (Figure 5C) which is in good agreement with the model of diploid Brassica genome evolution via hexaploidization proposed by Ziolkowski et al. . It is not possible to define the evolutionary divergences accurately based solely upon FT family sequence comparison. However, the divergence time calculated by Ks values placed the likely evolutionary events within a distinct order which was in accordance with the established phylogenetic relationship of the FT genes.
Interestingly, the two chromosome fragments where Bx.FT-2a and Bx.FT-2b are located are within IDBs that appear to represent the most recent duplication event, as indicated by the high sequence identity observed between the orthologues. Moreover, the prevalence of IDBs throughout the genome of B. napus (Figure 3, Table 2) suggests that IDBs have been a universal and efficient pathway in the evolutionary development of Brassica. IDBs with its dosage effect should generate raw genetic materials for the evolution that can be modified subsequently by natural selection just as general DNA duplications do , and the occasionally happened pairing between opposite duplicating segments in the meiosis might bring new variations. The homeologous IDBs in different chromosomes would enhance the chance of homeologous reciprocal or nonreciprocal translocations which were already found in several Canadian and Australian cultivars of B. napus where IDBs exchanged between A7 and C6 [53, 54]. Lyask et al.  found that the most frequent chromosome rearrangements involving the At4-b contig are inversions, whilst comparative mapping has indicated that 43% of Brassica genomicregions with homeology to Arabidopsis chromosomes involved inversions . It has been suggested that rearrangements such as translocations or inversions might reduce or prevent undesirable pairing and recombination between homeologous chromosomes/chromosome regions and lead to reproductive isolation between populations, and eventually contribute to the speciation processes . Our observation of duplication events leading to inverted duplication segments is in accordance with previous reports on inverted chromosome segments duplicated in the C6 (O6) linkage group of B. oleracea [41, 42]. However, this differs from other reports where only two E segments of Arabidopsis chromosome 1 were found in A and C genomes, respectively [40, 56].
The six BnFT paralogues have very high identity between their coding sequences, but vary between their corresponding introns. The CArG box within intron 1 was absent from BnA2.FT and BnC2.FT, which may lead to functional divergence. However, BnA2.FT along with two paralogues on chromosome C6, BnC6.FT.a and BnC6.FT.b, was associated with two major QTL clusters for flowering time indicating that the "florigen" of B. napus may be functionally differentiated between winter and spring type cultivars. The BnFT paralogues share the same ancestral gene with the single FT of Arabidopsis, and have evolved via several duplications and divergence resulting from whole genome polyploidization and the formation of inverted duplication blocks. The characterization of the six BnFT paralogues in B. napus increases our understanding of Brassica genome evolutionary pathways involving genome triplication via multi-stage processes.
A B. napus doubled haploid  population, designated as TNDH and consisting of 202 lines, was generated from an F1 resulting from a cross between a Tapidor DH line (hereafter referred to as Tapidor), a European winter cultivar, and Ningyou7 DH (hereafter referred to as NY7), a Chinese semi-winter cultivar . Near isogenic lines (NILs) were developed with NY7 as the recurrent parent and Tapidor as the donor parent. Fifty-five B. napus cultivars (in 2006) and two NILs (in 2008) were sown in a field plot located in Gansu Province, one of the spring rapeseed regions in China, with 10 plants maintained for each line. NY7 was grown as a control in both years. The period from sowing the seeds to the appearance of the first flower for each accession was recorded as flowering time. The student's t test (Family wide error rate P < 0.05) was used to test the significance of the variation in flowering time for NILs and NY7. Winter cultivars did not flower under these field conditions.
BAC library screening and analysis of clones
A 692 bp FT genomic DNA sequence of (exon2 to exon4) with 87% of identity to FT of Arabidopsis for the coding sequence was isolated from Tapidor using primer pairs AtFT-1 and AtFT-2 (Table 1) designed from an mRNA sequence of FT (GenBank accession NM_105222). The 692 bp-FT-probe was used to screen the JBnB BAC library which was constructed from genomic DNA of Tapidor by Dr. Ian Bancroft, John Innes Centre, UK . Positive BACs were verified with PCR amplification using the AtFT-1/AtFT-2 primer pairs. BnFT paralogues were isolated from six BACs using a set of primers (Figure 1, Table 1) designed from a FT orthologues (designated as BrA2.FT) of B. rapa present within the BAC clone KBrB070J11 that had been mapped to chromosome A2 http://brassica.bbsrc.ac.uk/cgi-bin/gbrowse/jic_brassica/. The PCR products were sequenced, and then assembled to form a contig that was compared with the sequence of BrA2.FT. A cDNA sequence from B. oleracea (GenBank accession EH425279) was designated as BoC6.FT.b, due to the 99% identity with BnC6.FT.b.
The TNDH linkage map containing 786 markers  was used to map the BnFT paralogues with the primers designed from the BnFT sequences of Tapidor (Table 1). The mapped BnFT paralogues were assigned names according to their linkage group, and where two copies were located in the same linkage group, they were distinguished by suffixes with "a" or "b" corresponding to the order of the markers.
In this paper, we abbreviate the full gene nomenclature for Brassica genes as outlined by Ostergaard & King , so that Bna.FT becomes BnFT. In order to distinguish between copies of genes located on specific chromosomes we on occasion indicate this thus: BnC6.FT.a is on chromosome C6.
The coding sequences and amino acids of BnFT paralogues were predicted using the software "Softberry FGENESH" online service http://linux1.softberry.com/berry.phtml. Multiple alignments of the deduced amino acid sequences of FT from Arabidopsis and Brassica species along with PtFT2 (Populus deltoides), Hd3a (Oryza sativa) and HvFT1 (Hordeum vulgare) were performed using ClustalW http://www.ebi.ac.uk/. Percentage similarities between pairs of FT orthologues were calculated using Align http://blast.ncbi.nlm.nih.gov/bl2seq/wblast2.cgi. Phylogenetic relationships were established using MEGA version 4.1  with neighbour-joining based on Kimura's  two-parameter model. Putative motifs in intron1 regions were identified using the "Softberry NSITE-PL" online service http://linux1.softberry.com/berry.phtml.
Phylogenetic and evolutionary sequence analysis
The coding sequences of all the FT orthologues were aligned for further analysis. The fraction of synonymous substitutions (Ks) was obtained using K-Estimator version 6.1 . To estimate the timing of evolution for the different duplication events, we used a median Ks value for each orthologous pair between two blocks. Calculations for the dating of the evolutionary events were carried out using a synonymous mutation rate of 1.4 × 10-8 substitutions per synonymous site per year, which was established for the CHALCONE SYNTHASE gene in eudicots . Divergence time (T) was estimated using the equation T = Ks/2 × 1.4 × 10-8 .
The blocks associated with inverted duplications within the B. napus genome were identified using the TNDH linkage map and confirmed by the colinear array of Arabidopsis loci based on marker identity in the linkage groups http://www.arabidopsis.org/wublast/index2.jsp.
Baurle I, Dean C: The timing of developmental transitions in plants. Cell. 2006, 125: 655-664. 10.1016/j.cell.2006.05.005.
Boss PK, Bastow RM, Mylne JS, Dean C: Multiple pathways in the decision to flower: enabling, promoting, and resetting. Plant Cell. 2004, 16 (Suppl): S18-31. 10.1105/tpc.015958.
Jack T: Molecular and genetic mechanisms of floral control. Plant Cell. 2004, 16 (Suppl): S1-17. 10.1105/tpc.017038.
Moon J, Lee H, Kim M, Lee I: Analysis of flowering pathway integrators in Arabidopsis. Plant Cell Physiol. 2005, 46: 292-299. 10.1093/pcp/pci024.
Putterill J, Laurie R, Macknight R: It's time to flower: the genetic control of flowering time. Bioessays. 2004, 26: 363-373. 10.1002/bies.20021.
Kardailsky I, Shukla VK, Ahn JH, Dagenais N, Christensen SK, Nguyen JT, Chory J, Harrison MJ, Weigel D: Activation tagging of the floral inducer FT. Science. 1999, 286: 1962-1965. 10.1126/science.286.5446.1962.
Turck F, Fornara F, Coupland G: Regulation and identity of florigen: FLOWERING LOCUS T moves center stage. Annu Rev Plant Biol. 2008, 59: 573-594. 10.1146/annurev.arplant.59.032607.092755.
Wigge PA, Kim MC, Jaeger KE, Busch W, Schmid M, Lohmann JU, Weigel D: Integration of spatial and temporal information during floral induction in Arabidopsis. Science. 2005, 309: 1056-1059. 10.1126/science.1114358.
Li D, Liu C, Shen L, Wu Y, Chen H, Robertson M, Helliwell CA, Ito T, Meyerowitz E, Yu H: A repressor complex governs the integration of flowering signals in Arabidopsis. Dev Cell. 2008, 15: 110-120. 10.1016/j.devcel.2008.05.002.
Searle I, He Y, Turck F, Vincent C, Fornara F, Krober S, Amasino RA, Coupland G: The transcription factor FLC confers a flowering response to vernalization by repressing meristem competence and systemic signaling in Arabidopsis. Genes Dev. 2006, 20: 898-912. 10.1101/gad.373506.
Abe M, Kobayashi Y, Yamamoto S, Daimon Y, Yamaguchi A, Ikeda Y, Ichinoki H, Notaguchi M, Goto K, Araki T: FD, a bZIP protein mediating signals from the floral pathway integrator FT at the shoot apex. Science. 2005, 309: 1052-1056. 10.1126/science.1115983.
Corbesier L, Vincent C, Jang S, Fornara F, Fan Q, Searle I, Giakountis A, Farrona S, Gissot L, Turnbull C, Coupland G: FT protein movement contributes to long-distance signaling in floral induction of Arabidopsis. Science. 2007, 316: 1030-1033. 10.1126/science.1141752.
Chardon F, Damerval C: Phylogenomic analysis of the PEBP gene family in cereals. J Mol Evol. 2005, 61: 579-590. 10.1007/s00239-004-0179-4.
Izawa T, Oikawa T, Sugiyama N, Tanisaka T, Yano M, Shimamoto K: Phytochrome mediates the external light signal to repress FT orthologs in photoperiodic flowering of rice. Genes Dev. 2002, 16: 2006-2020. 10.1101/gad.999202.
Paterson AH, Bowers JE, Peterson DG, Estill JC, Chapman BA: Structure and evolution of cereal genomes. Curr Opin Genet Dev. 2003, 13: 644-650. 10.1016/j.gde.2003.10.002.
Salse J, Piegu B, Cooke R, Delseny M: New in silico insight into the synteny between rice (Oryza sativa L.) and maize (Zea mays L.) highlights reshuffling and identifies new duplications in the rice genome. Plant J. 2004, 38: 396-409. 10.1111/j.1365-313X.2004.02058.x.
Tamaki S, Matsuo S, Wong HL, Yokoi S, Shimamoto K: Hd3a protein is a mobile flowering signal in rice. Science. 2007, 316: 1033-1036. 10.1126/science.1141753.
Faure S, Higgins J, Turner A, Laurie DA: The FLOWERING LOCUS T-like gene family in barley (Hordeum vulgare). Genetics. 2007, 176: 599-609. 10.1534/genetics.106.069500.
Hsu CY, Liu Y, Luthe DS, Yuceer C: Poplar FT2 shortens the juvenile phase and promotes seasonal flowering. Plant Cell. 2006, 18: 1846-1861. 10.1105/tpc.106.041038.
Blanc G, Hokamp K, Wolfe KH: A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Res. 2003, 13: 137-144. 10.1101/gr.751803.
Bowers JE, Chapman BA, Rong J, Paterson AH: Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 2003, 422: 433-438. 10.1038/nature01521.
Yang TJ, Kim JS, Kwon SJ, Lim KB, Choi BS, Kim JA, Jin M, Park JY, Lim MH, Kim HI, et al: Sequence-level analysis of the diploidization process in the triplicated FLOWERING LOCUS C region of Brassica rapa. Plant Cell. 2006, 18: 1339-1347. 10.1105/tpc.105.040535.
Udall JA, Quijada PA, Osborn TC: Detection of chromosomal rearrangements derived from homologous recombination in four mapping populations of Brassica napus L. Genetics. 2005, 169: 967-979. 10.1534/genetics.104.033209.
Lagercrantz U: Comparative mapping between Arabidopsis thaliana and Brassica nigra indicates that Brassica genomes have evolved through extensive genome replication accompanied by chromosome fusions and frequent rearrangements. Genetics. 1998, 150: 1217-1228.
Lysak MA, Koch MA, Pecinka A, Schubert I: Chromosome triplication found across the tribe Brassiceae. Genome Res. 2005, 15: 516-525. 10.1101/gr.3531105.
Parkin IA, Gulden SM, Sharpe AG, Lukens L, Trick M, Osborn TC, Lydiate DJ: Segmental structure of the Brassica napus genome based on comparative analysis with Arabidopsis thaliana. Genetics. 2005, 171: 765-781. 10.1534/genetics.105.042093.
Ziolkowski PA, Kaczmarek M, Babula D, Sadowski J: Genome evolution in Arabidopsis/Brassica: conservation and divergence of ancient rearranged segments and their breakpoints. Plant J. 2006, 47: 63-74. 10.1111/j.1365-313X.2006.02762.x.
Lukens L, Zou F, Lydiate D, Parkin I, Osborn T: Comparison of a Brassica oleracea genetic map with the genome of Arabidopsis thaliana. Genetics. 2003, 164: 359-372.
Morinaga T: Interspecific hybridization in Brassica. VI The cytology of F1 hybrids of B. juncea and B. nigra. Cytologia. 1934, 6: 62-67.
Rana D, Boogaart van den T, O'Neill CM, Hynes L, Bent E, Macpherson L, Park JY, Lim YP, Bancroft I: Conservation of the microstructure of genome segments in Brassica napus and its diploid relatives. Plant J. 2004, 40: 725-733. 10.1111/j.1365-313X.2004.02244.x.
U N: Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization. Jpn J Bot. 1935, 7: 389-452.
Lysak MA, Berr A, Pecinka A, Schmidt R, McBreen K, Schubert I: Mechanisms of chromosome number reduction in Arabidopsis thaliana and related Brassicaceae species. Proc Natl Acad Sci USA. 2006, 103: 5224-5229. 10.1073/pnas.0510791103.
Lysak MA, Koch MA, Beaulieu JM, Meister A, Leitch IJ: The dynamic ups and downs of genome size evolution in Brassicaceae. Mol Biol Evol. 2009, 26: 85-98. 10.1093/molbev/msn223.
Town CD, Cheung F, Maiti R, Crabtree J, Haas BJ, Wortman JR, Hine EE, Althoff R, Arbogast TS, Tallon LJ, et al: Comparative genomics of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy. Plant Cell. 2006, 18: 1348-1359. 10.1105/tpc.106.041665.
Babula D, Kaczmarek M, Barakat A, Delseny M, Quiros CF, Sadowski J: Chromosomal mapping of Brassica oleracea based on ESTs from Arabidopsis thaliana: complexity of the comparative map. Mol Genet Genomics. 2003, 268: 656-665.
Lan TH, DelMonte TA, Reischmann KP, Hyman J, Kowalski SP, McFerson J, Kresovich S, Paterson AH: An EST-enriched comparative map of Brassica oleracea and Arabidopsis thaliana. Genome Res. 2000, 10: 776-788. 10.1101/gr.10.6.776.
Osborn TC: The contribution of polyploidy to variation in Brassica species. Physiologia Plantarum. 2004, 121: 531-536. 10.1111/j.1399-3054.2004.00360.x.
Kobayashi Y, Kaya H, Goto K, Iwabuchi M, Araki T: A pair of related genes with antagonistic roles in mediating flowering signals. Science. 1999, 286: 1960-1962. 10.1126/science.286.5446.1960.
Long Y, Shi J, Qiu D, Li R, Zhang C, Wang J, Hou J, Zhao J, Shi L, Park BS, et al: Flowering time quantitative trait loci analysis of oilseed Brassica in multiple environments and genomewide alignment with Arabidopsis. Genetics. 2007, 177: 2433-2444.
Schranz ME, Lysak MA, Mitchell-Olds T: The ABC's of comparative genomics in the Brassicaceae: building blocks of crucifer genomes. Trends Plant Sci. 2006, 11: 535-542. 10.1016/j.tplants.2006.09.002.
Howell EC, Armstrong SJ, Barker GC, Jones GH, King GJ, Ryder CD, Kearsey MJ: Physical organization of the major duplication on Brassica oleracea chromosome O6 revealed through fluorescence in situ hybridization with Arabidopsis and Brassica BAC probes. Genome. 2005, 48: 1093-1103. 10.1139/g05-069.
Ryder CD, Smith LB, Teakle GR, King GJ: Contrasting genome organisation: two regions of the Brassica oleracea genome compared with collinear regions of the Arabidopsis thaliana genome. Genome. 2001, 44: 808-817. 10.1139/gen-44-5-808.
Osborn TC, Kole C, Parkin IAP, Sharpe AG, et al: Comparison of flowering time genes in Brassica rapa, B. napus and Arabidopsis thaliana. Genetics. 1997, 146: 1123-1129.
Parkin I, Sharpe A, Keith D, Lydiate D: Identification of the A and C genomes of amphidiploid Brassica napus (oilseed rape). Genome. 1995, 38: 1122-1131.
Samach A, Onouchi H, Gold SE, Ditta GS, Schwarz-Sommer Z, Yanofsky MF, Coupland G: Distinct roles of CONSTANS target genes in reproductive development of Arabidopsis. Science. 2000, 288: 1613-1616. 10.1126/science.288.5471.1613.
Robert LS, Robson F, Sharpe A, Lydiate D, Coupland G: Conserved structure and function of the Arabidopsis flowering time gene CONSTANS in Brassica napus. Plant Mol Biol. 1998, 37: 763-772. 10.1023/A:1006064514311.
Lin SI, Wang JG, Poon SY, Su CL, Wang SS, Chiou TJ: Differential regulation of FLOWERING LOCUS C expression by vernalization in cabbage and Arabidopsis. Plant Physiol. 2005, 137: 1037-1048. 10.1104/pp.104.058974.
Razi H, Howell EC, Newbury HJ, Kearsey MJ: Does sequence polymorphism of FLC paralogues underlie flowering time QTL in Brassica oleracea?. Theor Appl Genet. 2008, 116: 179-192. 10.1007/s00122-007-0657-3.
Schranz ME, Quijada P, Sung S-B, Lukens L, et al: Characterization and effects of the replicated flowering time gene FLC in Brassica rapa. Genetics. 2002, 162: 1457-1468.
Cooke J, Nowak MA, Boerlijst M, Maynard-Smith J: Evolutionary origins and maintenance of redundant gene expression during metazoan development. Trends Genet. 1997, 13: 360-364. 10.1016/S0168-9525(97)01233-X.
Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J: Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999, 151: 1531-1545.
De Grassi A, Lanave C, Saccone C: Genome duplication and gene-family evolution: the case of three OXPHOS gene families. Gene. 2008, 421: 1-6. 10.1016/j.gene.2008.05.011.
Howell EC, Kearsey MJ, Jones GH, King GJ, Armstrong SJ: A and C Genome Distinction and Chromosome Identification in Brassica napus by Sequential Fluorescence in Situ Hybridization and Genomic in Situ Hybridization. Genetics. 2008, 180: 1849-1857. 10.1534/genetics.108.095893.
Osborn TC, Butrulle DV, Sharpe AG, Pickering KJ, Parkin IA, Parker JS, Lydiate DJ: Detection and effects of a homeologous reciprocal transposition in Brassica napus. Genetics. 2003, 165: 1569-1577.
Rieseberg LH: Chromosomal rearrangements and speciation. Trends Ecol Evol. 2001, 16: 351-358. 10.1016/S0169-5347(01)02187-5.
Panjabi P, Jagannath A, Bisht NC, Padmaja KL, Sharma S, Gupta V, Pradhan AK, Pental D: Comparative mapping of Brassica juncea and Arabidopsis thaliana using Intron Polymorphism (IP) markers: Homoeologous relationships, diversification and evolution of the A, B and C Brassica genomes. BMC Genomics. 2008, 9: 113-10.1186/1471-2164-9-113.
Qiu D, Morgan C, Shi J, Long Y, Liu J, Li R, Zhuang X, Wang Y, Tan X, Dietrich E, et al: A comparative linkage map of oilseed rape and its use for QTL analysis of seed oil and erucic acid content. Theor Appl Genet. 2006, 114: 67-80. 10.1007/s00122-006-0411-2.
Shi J, Li R, Qiu D, Jiang C, Long Y, Morgan C, Bancroft I, Zhao J, Meng J: Unraveling the Complex Trait of Crop Yield With QTL Mapping in Brassica napus. Genetics. 2009, 182: 851-61. 10.1534/genetics.109.101642.
Ostergaard L, King GJ: Standardized gene nomenclature for the Brassica genus. Plant Methods. 2008, 4: 10-10.1186/1746-4811-4-10.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.
Kimura M: A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980, 16: 111-120. 10.1007/BF01731581.
Comeron JM: K-Estimator: calculation of the number of nucleotide substitutions per site and the confidence intervals. Bioinformatics. 1999, 15: 763-764. 10.1093/bioinformatics/15.9.763.
Koch MA, Haubold B, Mitchell-Olds T: Comparative evolutionary analysis of chalcone synthase and alcohol dehydrogenase loci in Arabidopsis, Arabis,and related genera (Brassicaceae). Mol Biol Evol. 2000, 17: 1483-1498.
The authors thank Professor Zaiyun Li from Huazhong Agricultural University for critical reading and revising the manuscript, Jinna Hou and Ruiyuan Li for experimental help. Financial support for this work was provided by the National Key Basic Research and Development Plan of China (2006CB101600) and National 863 High Technology Program, P. R. China (2006AA10Z108 to J. Zhao). GK is funded by the UK Biotechnology and Biological Sciences Research Council.
JW performed the isolation and mapping of BnFT paralogues, analysis of IDBs, phylogenetic relationship and Brassica FT evolution, and contributed extensively to the writing of the manuscript. YL mapped the QTL for flowering time, compared the whole genome of B. napus to Arabidopsis with TNDH map, and advised JW for the experiment. BW collected the flowering time data in Gansu. JL screened the JBnB BAC library with Tapidor 692 bp-FT-probe. CJ constructed the new version of the TNDH linkage map. JM first conceived of the ideas, guided the data analysis and revised the manuscript. LS and JZ secured funds (2006AA10Z108) and revised the manuscript. GK critically read and improved the manuscript both in terms of academic content and expression of English. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 2: Marker information within IDBs. Markers within the IDBs and corresponding orthologous Arabidopsis gene models. (PDF 54 KB)
Additional file 3: Information relating to the cultivars of B. napus. The phenotype and genotype of spring and winter type B. napus grown in a spring environment. (PDF 43 KB)
About this article
Cite this article
Wang, J., Long, Y., Wu, B. et al. The evolution of Brassica napus FLOWERING LOCUST paralogues in the context of inverted chromosomal duplication blocks. BMC Evol Biol 9, 271 (2009). https://doi.org/10.1186/1471-2148-9-271