Time-scale of minor HIV-1 complex circulating recombinant forms from Central and West Africa
BMC Evolutionary Biology volume 16, Article number: 249 (2016)
Several HIV-1 circulating recombinant forms with a complex mosaic structure (CRFs_cpx) circulate in central and western African regions. Here we reconstruct the evolutionary history of some of these complex CRFs (09_cpx, 11_cpx, 13_cpx and 45_cpx) and further investigate the dissemination dynamic of the CRF11_cpx clade by using a Bayesian coalescent-based method.
The analysis of two HIV-1 datasets comprising 181 pol (36 CRF09_cpx, 116 CRF11_cpx, 20 CRF13_cpx and 9 CRF45_cpx) and 125 env (12 CRF09_cpx, 67 CRF11_cpx, 17 CRF13_cpx and 29 CRF45_cpx) sequences pointed to quite consistent onset dates for CRF09_cpx (~1966: 1958–1979), CRF11_cpx (~1957: 1950–1966) and CRF13_cpx (~1965: 1958–1973) clades; while some divergence was found for the estimated date of origin of CRF45_cpx clade [pol = 1970 (1964–1976); env = 1960 (1952–1969)]. Phylogeographic reconstructions indicate that the HIV-1 CRF11_cpx clade most probably emerged in Cameroon and from there it was first disseminated to the Central Africa Republic and Chad in the early 1970s and to other central and western African countries from the early 1980s onwards. Demographic reconstructions suggest that the CRF11_cpx epidemic grew between 1960 and 1990 with a median exponential growth rate of 0.27 year−1, and stabilized after.
These results reveal that HIV-1 CRFs_cpx clades have been circulating in Central Africa for a period comparable to other much more prevalent HIV-1 group M lineages. Cameroon was probably the epicenter of dissemination of the CRF11_cpx clade that seems to have experienced a long epidemic growth phase before stabilization. The epidemic growth of the CRF11_cpx clade was roughly comparable to other HIV-1 group M lineages circulating in Central Africa.
The HIV-1 group M epidemic started to spread in Kinshasa (Democratic Republic of Congo - DRC) long before the identification of the first AIDS cases in western countries [1, 2]. While still confined to Central Africa, the HIV-1 group M diversified into several lineages known nowadays as subtypes (named A-D, F-H, J and K) and inter-subtype recombinant forms . Recombinants between HIV-1 subtypes are designated as circulating recombinant forms (CRFs; 79 described to date [Los Alamos HIV database; http://www.hiv.lanl.gov/]) if the variant is found in at least three individuals with no direct epidemiological linkage, and, if the CRF is composed by sequences originating from more than two subtypes, it is classified as complex . The CRFs are increasingly becoming relevant to the HIV-1 epidemic, since the global proportion of all CRFs combined increased from 11.5% in 2000–2003 to 16% in 2004–2007 .
Some of the complex CRFs (including CRF04_cpx , CRF06_cpx , CRF09_cpx , CRF11_cpx , CRF13_cpx , CRF18_cpx , CRF25_cpx , CRF27_cpx , CRF37_cpx , CRF45_cpx  and CRF49_cpx ) carry fragments of rare subtypes (e.g., subtypes H, J and K) and divergent unclassified (U) lineages likely derived from parental strains that may predate the current subtypes . Some of these complex CRFs are widely dispersed in a given African region and reaching particularly high prevalence (40–50%) in certain countries, such as the CRF06_cpx in Burkina Faso  and the CRF11_cpx in the Central African Republic . Others complex CRFs circulate at a very low prevalence (<5%) throughout several countries from West (CRF09_cpx and CRF49_cpx) [15, 17, 19–21] and Central (CRF13_cpx, CRF45_cpx) [22–25] Africa. The remaining of those complex CRFs were sporadically detected in Africa, but have successfully disseminated to other locations, as the CRF04_cpx in Greece and Cyprus [26, 27] and the CRF18_cpx in Cuba .
Information about the time-scale, migration routes and population dynamics of those complex CRFs (CRFs_cpx) are scarce. Previous studies conducted by our group support that the CRF06_cpx epidemic in West Africa probably originated from the regional dissemination of a single founder strain introduced in Burkina Faso around the late 1970s , while the CRF18_cpx epidemic in Cuba probably resulted from the local expansion of a single founder strain introduced in the country at the early 1990s . Although these estimates supports a relative recent origin for CRF06_cpx and CRF18_cpx epidemics in West Africa and Cuba, these and other CRFs_cpx carrying fragments of rare subtypes and U lineages probably arose in Central Africa several years earlier. The precise onset dates of the CRFs_cpx clades at the epicenter, however, remain largely unknown.
In the present study, we reconstructed the time-scale of the CRFs 09_cpx, 11_cpx, 13_cpx and 45_cpx as well as the spatial and demographic dissemination dynamics of the CRF11_cpx, by using two comprehensive data sets of HIV-1 pol (n = 181) and env (n = 125) sequences sampled in Central and West Africa over a period of 27 years.
HIV-1 CRFs_cpx sequences datasets
All CRF09_cpx, CRF11_cpx, CRF13_cpx and CRF45_cpx (CRFs09/11/13/45_cpx) sequences with information about country of origin and sampling date were retrieved from the Los Alamos HIV Sequence Database (Los Alamos HIVdb, www.hiv.lanl.gov). The sequences covered the entire protease and partial reverse transcriptase (PR/RT) regions of pol gene corresponding to HXB2 coordinates 2253 to 3272 (CRFs_cpx pol dataset), and the V3 region of the env-gp120 gene corresponding to HXB2 coordinates 7041 to 7345 (CRFs_cpx env dataset). All sequences of the CRFs09/11/13/45_cpx’s parental subtypes (subtypes J and K for pol fragment and subtypes A, A1, A2 for env fragments) from Central and West African countries were also retrieved from Los Alamos HIVdb and included in the final datasets. Sequences were aligned using CLUSTAL X v.2 program , followed by manual editing.
Genetic classification of HIV-1 CRFs_cpx
The subtype classification of all pol and env sequences here included was initially verified with REGA HIV subtyping tool v.3  and COMET v.2  and further confirmed by Maximum Likelihood (ML) phylogenetic and bootscanning analysis. The ML tree was inferred with PhyML program  using an online web server  under the GTR + I + Γ4 nucleotide substitution model selected using the jModeltest v.2 program, and the SPR branch-swapping algorithm for heuristic tree search. The consistency of the tree topology was estimated with approximate likelihood-ratio test  based on a Shimodaira-Hasegawa-like procedure (SH-aLRT). All CRFs_cpx sequences were inspected to verify if their mosaic profile were the expected according to the published CRF breakpoint locations in the Los Alamos HIVdb database (Additional file 1: Figure S1) by bootscanning analysis using Simplot software v.3.5.1 . Bootstrap values supporting branching of query and reference sequences were determined by Neighbor-Joining trees constructed using the Kimura two-parameter model based on 100 re-samplings, with a 300 bp sliding window moving in steps of 10 bases.
The phylogenetic tree, evolutionary rate (μ, nucleotide substitutions per site per year, subst./site/year) and the age of the most recent common ancestor (TMRCA, years) of HIV-1 CRFs09/11/13/45_cpx epidemics circulating in Central and West African regions were jointly estimated using a Bayesian Markov Chain Monte Carlo (MCMC) approach implemented in BEAST v1.8.0 [37, 38] along with BEAGLE v2.1 library to perform parallelization . Analyzes were performed using the GTR + I + Γ4 nucleotide substitution model, an uncorrelated lognormal relaxed molecular clock model  with informative substitution rate priors for the pol (1.5 × 10−3–3.0 × 10−3 subst./site/year) and env (4 × 10−3–8 × 10−3 subst./site/year)  genomic regions, and a Bayesian Skyline coalescent tree prior . MCMC chains were run for 108generations and adequate chain mixing was checked, after excluding an initial 10% burn-in for each run, by calculating the effective sample size (ESS) using TRACER v1.6 program . Maximum clade credibility (MCC) trees were summarized from the posterior distribution of trees with TreeAnnotator and visualized with FigTree v1.4 .
Phylogeographic and demographic analyses
The spatiotemporal and demographic dynamics of dissemination of the HIV-1 CRF11_cpx clade were reconstructed using BEAST v1.8.0 as previously described. Migration events throughout the pol and env phylogenetic history were reconstructed by applying a reversible discrete Bayesian phylogeographic model  and a continuous-time Markov chain rate reference prior  and latter summarized using the SPREAD v.1.0.6 application . The effective population size through time was initially estimated using a Bayesian Skyline coalescent model  and estimates of the population growth rate were subsequently obtained using different parametric models (logistic, exponential and expansion) . The fittest model to the demographic signal contained in CRF11_cpx pol and env dataset was chosen after model comparison using the log marginal likelihood estimation based on path sampling (PS) and stepping-stone sampling (SS) methods . MCMC chains were run for 108 generations and adequate chain mixing was checked as previously described. Graphical representations of the effective number of infections through time were generated by programs TRACER v1.6  and GraphPad Prism 6 (GraphPad Software).
Selection of HIV-1 CRFs_cpx pol and env sequences
Most CRFs_cpx-like pol (99.4%) and env (98.5%) sequences obtained from Los Alamos HIVdb were correctly genotyped since displayed the same mosaic structures (Additional file 1: Figure S1) and branched in highly supported monophyletic clades (Additional file 2: Figure S2 and Additional file 3: Figure S3) with corresponding CRFs_cpx reference sequences. These analyses also identified 67 sequences erroneously classified and misannotated in the Los Alamos HIVdb (Additional file 4: Table S1). Two pol sequences incorrectly annotated as subtype J and CRF11_cpx were reclassified as CRF11_cpx and CRF13_cpx, respectively (Additional file 2: Figure S2). One CRF09_cpx env sequence branched within the CRF11_cpx clade, while 57 subtype A/A1/A2 env sequences branched within the CRFs09/11/13/45_cpx clades radiations and were thus reclassified accordingly (Additional file 3: Figure S3). This approach resulted in two final datasets composed by 181 CRFs_cpx-like pol sequences and 125 CRFs_cpx-like env sequences, sampled between 1984 and 2011 from 16 countries of Central and West Africa (Additional file 5: Table S2 and Additional file 6: Table S3) that were used for the subsequent analyses.
The ML env phylogenetic tree also allowed drawing inferences about the evolutionary origin of the parental viruses that originated the CRFs_cpx, once it was reconstructed from a common subtype A genomic segment. The subtype A segments of CRF11_cpx and CRF13_cpx lineages form a highly supported cluster (SH-aLRT = 0.96) within the subtype A/A1 radiation that also comprised 16 basal sequences originated almost exclusively from Central Africa (DRC/Congo). In contrast, the subtype A segments of CRF09_cpx and CRF45_cpx lineages branched outside the subtype A/A1 radiation as very early divergent lineages. Two sequences from the DRC branched basally to the CRF45_cpx clade (SH-aLRT = 0.85), whereas no basal sequences to the CRF09_cpx clade were identified. The A/A1 env sequences from Central Africa that clustered basally to the CRFs_cpx clades with high support (SH-aLRT > 0.90) were combined with the CRFs09/11/13/45_cpx dataset to aid Bayesian evolutionary and temporal analyses.
Time-scale of HIV-1 CRFs_cpx clades
Bayesian analyses of both pol and env datasets confirmed that all sequences from a given CRFs_cpx formed highly supported monophyletic clades (posterior probability, PP > 0.90) (Figs. 1 and 2) with an overall weak geographic structure (Additional file 7: Figure S4). The median evolutionary rate calculated under a relaxed molecular clock model was 1.6 × 10−3 subst./site/year for pol gene and 4.3 × 10−3subst./site/year for env gene. The coefficient of rate variation for both genes was significantly higher than zero (Table 1), thus supporting the use of a relaxed molecular clock model. The median TMRCA obtained from both HIV-1 datasets point to quite consistent onset dates for the CRF09_cpx (pol = 1968, env = 1965), CRF11_cpx (pol = 1958, env = 1957) and CRF13_cpx (pol = 1966, env = 1964) clades (Table 1). A slightly younger median TMRCA for the CRF45_cpx clade was obtained for pol (1970) than for env (1960) datasets. This can be probably attributed to the much smaller sample size of the CRF45_cpx-like pol dataset (n = 9) when compared to the env dataset (n = 29).
Spatial and demographic dissemination dynamics of the CRF11_cpx clade
Phylogeographic and demographic reconstructions were only performed for the CRF11_cpx clade, since it was the only one that comprised a number of pol and env sequences large enough (n > 30) to provide accurate estimates.
The evolutionary parameters obtained from both CRF11_cpx datasets were almost identical to those estimated from the combined CRFs09/11/13/45_cpx datasets (Table 2). The patterns of viral migration across time reconstructed from both pol and env phylogenies were very similar and indicated that the CRF11_cpx clade most probably emerged in Cameroon (posterior state probability ≥ 0.98) around the early 1960s (Figs. 3 and 4). From Cameroon, the CRF11_cpx was first disseminated to Chad and the Central African Republic between 1970 and 1980, and to other neighboring Central (DRC, Equatorial Guinea and Gabon) and West (Nigeria) African countries from the early 1980s onwards. Secondary disseminations of the CRF11_cpx from the Central African Republic to Cameroon/Gabon and from Chad to Cameroon were also recovered by the pol and env datasets, respectively.
The changes in the effective population size (Ne) of the CRF11_cpx clade over time estimated from both pol and env datasets were also very similar. The Bayesian skyline plot (BSP) coalescent analysis indicated that the CRF11_cpx clade experienced an initial phase of exponential growth, followed by a decline in growth rate from the mid-1980s (Fig. 5 and Table 2). Consistent with this result, the logistic growth model was pointed as the best-fit parametric demographic model (log Bayes Factor > 20) by both PS and SS methods (Additional file 8: Table S4) and then it was used to estimate the epidemic growth rate of the CRF11_cpx epidemic in Central Africa. According to the logistic growth coalescent model, the CRF11_cpx expanded between 1960 and 1990 with a median growth rate of 0.27 year−1 (pol) and 0.28 year−1 (env) (Fig. 5 and Table 2).
The pervasive recombination of the HIV-1 at the early stages of the group M epidemic generated a large array of complex CRFs at the epicenter in Central Africa that usually circulates at low prevalence [5–15]. In this study, we compile a quite large number of HIV-1 pol and env sequences from four complex recombinants (CRF09_cpx, CRF11_cpx, CRF13_cpx and CRF45_cpx) displaying unique mosaic structures in pol and a common subtype A fragment in env and estimate their onset dates. Most CRFs_cpx-like pol sequences used here were correctly annotated in the Los Alamos HIVdb. A significant fraction (46%) of the CRFs_cpx-like env sequences here recovered, by contrast, corresponded to West and Central African sequences originally classified as subtype A/A1/A2, but that clearly branched within the CRFs_cpx radiations and should be thus reclassified.
The inspection of the env ML and Bayesian phylogenetic trees revealed that the subtype A env segments that originated the ancestors of CRF11_cpx and CRF13_cpx seems to have derived from a common lineage that currently circulates in DRC and Congo. Other interesting observation was that while subtype A env segment of CRF11_cpx and CRF13_cpx fall within the subtype A/A1 radiation, the subtype A env segments of CRF09_cpx and CRF45_cpx branched as early divergent lineages basally to the root of subtype A/A1. This observation is consistent with the notion that CRF09_cpx and CRF45_cpx likely originated from viruses that diverged close to (or even before) the time of the HIV-1 subtype A progenitor . Our results also points that some of these early divergent lineages are still circulating in the DRC, once two HIV-1 subtype A-like sequences from that country branched basally to the CRF45_cpx clade. The contributions of these lineages to the genesis of some CRFs indicate that they had epidemiological relevance during the early stages of the HIV-1 group M epidemic [16, 49].
It was suggested that the low frequencies of many ancient HIV-1 divergent lineages in the human population was caused by its absence during the initial migratory wave of variants that triggered the global epidemic . Similarly, the overall low prevalence of the CRFs_cpx lineages carrying fragments of those ancient may reflect a later emergence of these variants when compared to more prevalent HIV-1 subtypes and CRFs. The evolutionary analyses performed here, however, support that complex CRFs probably started to circulate in Central Africa between the late 1950s and the late 1960s, which coincides with the estimated onset date of several prevalent HIV-1 group M clades including: subtype A1 (TMRCA ~ 1955) , subtype C (TMRCA ~ 1955–1965) [41, 50, 51], subtype F1 (TMRCA ~ 1960–1970) [52, 53], subtype G (TMRCA ~ 1965–1970) [41, 54], the CRF01_AE (TMRCA ~ 1970–1975) [41, 55, 56], and the CRF02_AG (TMRCA ~ 1965–1975) [41, 57, 58]. The estimated TMRCA also overlaps with a period of demographic transition of the HIV-1 group M in the DRC around 1960 (95% HPD: 1952–1968), from an early phase of relatively slow exponential growth to a second phase of faster exponential growth . Thus, the early establishment of most HIV-1 group M subtypes and CRFs in the DRC was probably shaped by the same factors and occurred at around the same time, despite significant disparities in their final epidemic outcomes.
Alternatively, the current low prevalence of many ancient HIV-1 divergent lineages and CRFs_cpx lineages may reveals a lower transmissibility of those variants when compared with the globally circulating HIV-1 clades . Some evidences, however, also argued against this hypothesis. First, the CRF06_cpx and CRF11_cpx clades comprise a large fraction (40–50%) of HIV-1 infections in Burkina Faso  and the Central African Republic , respectively, and the CRF18_cpx was successfully disseminated in Cuba . Second, coalescent estimations of the exponential growth rates of the CRF06_cpx (~0.8 year−1) and CRF18_cpx (~0.6 year−1) clades in West Africa  and Cuba , respectively, were similar to that estimated for highly prevalent HIV-1 lineages including: subtype B in Western countries (~0.5–1.5 year−1) [59–62], subtype C in Brazil (~0.5–0.9 year−1) [63, 64], and subtype G (~0.7–1.0 year−1) and CRF02_AG (~0.6 year−1) in West Africa [54, 58]. These observations demonstrate that, in specific settings, the CRFs_cpx clades were able to seed large epidemics and to spread at rates comparable to the most prevalent HIV-1 group M clades.
Our demographic reconstructions also indicate that the epidemic growth rate seems to vary for different CRFs_cpx. According to our estimations, the CRF11_cpx expanded in Central Africa between 1960 and 1990 with a median growth rate of ~0.3 year−1 (95% HPD: 0.2–0.4 year−1), a value significantly lower than that estimated for the CRF06_cpx and CRF18_cpx epidemics in West Africa and Cuba, respectively. The epidemic growth rate of the CRF11_cpx, however, was comparable to that estimated for some subtype G (0.3–0.6 year−1) and CRF02_AG (0.3–0.5year−1) clades circulating in Cameroon [54, 58] and to that estimated for the HIV-1 group M (0.2–0.3 year−1) in the DRC during 1960–1990 . We propose that differences in epidemic growth rates across HIV-1 African lineages most probably resulted from ecological determinants, although differences in viral transmissibility properties might be also responsible for the growth rate variances in some cases .
Spatial accessibility has been pointed as a major driving force of HIV-1 spread within Africa, and the central African region displayed a much lower spatial connectivity than western, eastern, and southern sub-Saharan regions . The CRF06_cpx clade most probably entered in Burkina Faso and from there was disseminated to other neighboring western African countries . According to our phylogeographic reconstructions, the epicenter and most important hub of dissemination of the CRF11_cpx clade was Cameroon, from where the virus spread to neighboring Central African countries (the Central African Republic, Chad, Gabon and Equatorial Guinea). Thus, the dissemination of the CRF06_cpx clade took place in a geographic region much better connected than the region of dissemination of the CRF11_cpx clade, which may have contributed to the faster epidemic growth rate of the CRF06_cpx when compared to the CRF11_cpx.
This study shows that HIV-1 CRFs_cpx clades were already circulating in Central Africa before the late 1960s and probably arose at around the same time than other more prevalent HIV-1 group M lineages. Cameroon was traced as the most probable epicenter of CRF11_cpx dissemination in Central Africa and the demographic history of this CRF was roughly comparable to that described for other central African HIV-1 group M lineages. These results support that the final prevalence of the different HIV-1 group M lineages circulating in human populations was mainly determined by stochastic and ecological factors, rather than by differences in the precise onset date of viral lineages. This study offers important insights toward an understanding of the epidemic potential and current dissemination pattern of some rare HIV-1 group M clades.
Worobey M, Gemmel M, Teuwen DE, Haselkorn T, Kunstman K, Bunce M, et al. Direct evidence of extensive diversity of HIV-1 in Kinshasa by 1960. Nature. 2008;455:661–4.
Faria NR, Rambaut A, Suchard MA, Baele G, Bedford T, Ward MJ, et al. The early spread and epidemic ignition of HIV-1 in human populations. Science (80-). 2014;56:56–61.
Robertson DL, Anderson JP, Bradac JA, Carr JK, Foley B, Funkhouser RK, et al. HIV-1 nomenclature proposal. Science (80-). 2000;288:55–6.
Hemelaar J, Gouws E, Ghys PD, Osmanov S. Global trends in molecular epidemiology of HIV-1 during 2000–2007. Aids. 2011;25:679–89.
Gao F, Robertson DL, Carruthers CD, Li Y, Bailes E, Kostrikis LG, et al. An isolate of human immunodeficiency virus type 1 originally classified as subtype I represents a complex mosaic comprising three different group M subtypes (A, G, and I). J Virol. 1998;72:10234–41.
Montavon C, Toure-Kane C, Nkengasong JN, Vergne L, Hertogs K, Mboup S, et al. CRF06-cpx: a new circulating recombinant form of HIV-1 in West Africa involving subtypes A, G, K, and J. J Acquir Immune Defic Syndr. 2002;29:522–30.
McCutchan FE, Sankale J-L, M’Boup S, Kim B, Tovanabutra S, Hamel DJ, et al. HIV type 1 circulating recombinant form CRF09_cpx from west Africa combines subtypes A, F, G, and may share ancestors with CRF02_AG and Z321. AIDS Res Hum Retroviruses. 2004;20:819–26.
Montavon C, Vergne L, Bourgeois A, Mpoudi-Ngole E, Malonga-Mouellet G, Butel C, et al. Identification of a new circulating recombinant form of HIV type 1, CRF11-cpx, involving subtypes A, G, J, and CRF01-AE, in Central Africa. AIDS Res Hum Retroviruses. 2002;18:231–6.
Wilbe K, Casper C, Albert J, Leitner T. Identification of two CRF11-cpx genomes and two preliminary representatives of a new circulating recombinant form (CRF13-cpx) of HIV type 1 in Cameroon. AIDS Res Hum Retroviruses. 2002;18:849–56.
Thomson MM, Casado G, Posada D, Sierra M, Nájera R. Identification of a novel HIV-1 complex circulating recombinant form (CRF18_cpx) of Central African origin in Cuba. AIDS. 2005;19:1155–63.
Luk K-C, Holzmayer V, Ndembi N, Swanson P, Brennan CA, Ngansop C, et al. Near full-length genome characterization of an HIV type 1 CRF25_cpx strain from Cameroon. AIDS Res Hum Retroviruses. 2008;24:1309–14.
Vidal N, Frange P, Chaix M-L, Mulanga C, Lepira F, Bazepeo SE, et al. Characterization of an old complex circulating recombinant form, CRF27_cpx, originating from the Democratic Republic of Congo (DRC) and circulating in France. AIDS Res Hum Retroviruses. 2008;24:315–21.
Powell RLR, Zhao J, Konings FAJ, Tang S, Ewane L, Burda S, et al. Circulating recombinant form (CRF) 37_cpx: an old strain in Cameroon composed of diverse, genetically distant lineages of subtypes A and G. AIDS Res Hum Retroviruses. 2007;23:923–33.
Niama FR, Vidal N, Bazepeo SE, Mpoudi E, Toure-Kane C, Parra HJ, et al. CRF45_AKU, a circulating recombinant from Central Africa, is probably the common ancestor of HIV type 1 MAL and HIV type 1 NOGIL. AIDS Res Hum Retroviruses. 2009;25:1345–53.
de Silva TI, Turner R, Hué S, Trikha R, van Tienen C, Onyango C, et al. HIV-1 subtype distribution in the Gambia and the significant presence of CRF49_cpx, a novel circulating recombinant form. Retrovirology. 2010;7:82. BioMed Central Ltd.
Tongo M, Dorfman JR, Martin DP. High Degree of HIV-1 group M Genetic Diversity within Circulating Recombinant Forms: Insight into the Early Events of HIV-1M Evolution. J Virol. 2015;90:JVI.02302-15.
Tebit DM, Sangaré L, Tiba F, Saydou Y, Makamtse A, Somlare H, et al. Analysis of the diversity of the HIV-1 pol gene and drug resistance associated changes among drug-naïve patients in Burkina Faso. J Med Virol. 2009;81:1691–701.
Marechal V, Jauvin V, Selekon B, Leal J, Pelembi P, Fikouma V, et al. Increasing HIV type 1 polymorphic diversity but no resistance to antiretroviral drugs in untreated patients from Central African Republic: a 2005 study. AIDS Res Hum Retroviruses. 2006;22:1036–44.
d’Aquin TT, Masquelier B, Minga A, Anglaret X, Danel C, Coulibaly A, et al. HIV-1 antiretroviral drug resistance in recently infected patients in Abidjan, Côte d’Ivoire: A 4-year survey, 2002-2006. AIDS Res Hum Retroviruses. 2007;23:1155–60.
Delgado E, Ampofo WK, Sierra M, Torpey K, Pérez-Alvarez L, Bonney EY, et al. High prevalence of unique recombinant forms of HIV-1 in Ghana: molecular epidemiology from an antiretroviral resistance study. J Acquir Immune Defic Syndr. 2008;48:599–606.
Charpentier C, Bellecave P, Cisse M, Mamadou S, Diakite M, Peytavin G, et al. High prevalence of antiretroviral drug resistance among HIV-1-untreated patients in Guinea-Conakry and in Niger. Antivir Ther. 2011;16:429–33.
Djoko CF, Rimoin AW, Vidal N, Tamoufe U, Wolfe ND, Butel C, et al. High HIV type 1 group M pol diversity and low rate of antiretroviral resistance mutations among the uniformed services in Kinshasa, Democratic Republic of the Congo. AIDS Res Hum Retroviruses. 2011;27:323–9.
Castelbranco EPAF, da Silva SE, Cavalcanti AMS, Martins AN, de Alencar LCA, Tanuri A. Frequency of primary resistance to antiretroviral drugs and genetic variability of HIV-1 among infected pregnant women recently diagnosed in Luanda-Angola. AIDS Res Hum Retroviruses. 2010;26:1313–6.
Afonso JM, Bello G, Guimarães ML, Sojka M, Morgado MG. HIV-1 genetic diversity and transmitted drug resistance mutations among patients from the North, Central and South regions of Angola. PLoS One. 2012;7:e42996.
Pircher M, Diafouka M, Papuchon J, Recordon-Pinson P, Mahambou DN, Akolbout M, et al. Molecular Characterization of HIV Type 1 in Brazzaville, Republic of Congo, and First Data on Resistance to Antiretroviral Drugs. AIDS Res Hum Retroviruses. 2012;28:120717082110004.
Kousiappa I, Van De Vijver DA, Kostrikis LG. Near full-length genetic analysis of HIV sequences derived from Cyprus: evidence of a highly polyphyletic and evolving infection. AIDS Res Hum Retroviruses. 2009;25:727–40.
Antoniadou Z-A, Kousiappa I, Skoura L, Pilalas D, Metallidis S, Nicolaidis P, et al. Short communication: molecular epidemiology of HIV type 1 infection in northern Greece (2009-2010): evidence of a transmission cluster of HIV type 1 subtype A1 drug-resistant strains among men who have sex with men. AIDS Res Hum Retroviruses. 2014;30:225–32.
Delatorre E, Bello G. Spatiotemporal dynamics of the HIV-1 CRF06_cpx epidemic in Western Africa. AIDS. 2013;27:1313–20.
Delatorre E, Bello G. Phylodynamics of the HIV-1 Epidemic in Cuba. PLoS One. 2013;8:e72448.
Larkin MA, Blackshields G, Brown NP, Chenna R, Mcgettigan PA, McWilliam H, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–8.
Pineda-Peña AC, Faria NR, Imbrechts S, Libin P, Abecasis AB, Deforche K, et al. Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: Performance evaluation of the new REGA version 3 and seven other tools. Infect Genet Evol. 2013;19:337–48. Elsevier B.V.
Struck D, Lawyer G, Ternes A-M, Schmit J-C, Bercoff DP. COMET: adaptive context-based modeling forultrafast HIV-1 subtype identification. Nucleic Acids Res. 2014;42:e144.
Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307–21.
Guindon S, Lethiec F, Duroux P, Gascuel O. PHYML Online--a web server for fast maximum likelihood-based phylogenetic inference. Nucleic Acids Res. 2005;33:W557–9.
Anisimova M, Gascuel O. Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative. Syst Biol. 2006;55:539–52.
Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, Novak NG, et al. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J Virol. 1999;73:152–60.
Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29:1969–73.
Drummond AJ, Nicholls GK, Rodrigo AG, Solomon W. Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data. Genetics. 2002;161:1307–20.
Suchard MA, Rambaut A. Many-core algorithms for statistical phylogenetics. Bioinformatics. 2009;25:1370–6.
Drummond AJ, Ho SYW, Phillips MJ, Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006;4:e88.
Abecasis AB, Vandamme A-M, Lemey P. Quantifying differences in the tempo of human immunodeficiency virus type 1 subtype evolution. J Virol. 2009;83:12917–24.
Drummond AJ, Rambaut A, Shapiro B, Pybus OG. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol. 2005;22:1185–92.
Rambaut A, Suchard M, Drummond A. 2013. Available from: http://tree.bio.ed.ac.uk/software/tracer/. Accessed 10 July 2016.
Rambaut A. FigTree v1.4 [Internet]. 2014. Available from: http://tree.bio.ed.ac.uk/software/figtree/. Accessed 10 July 2016.
Lemey P, Rambaut A, Drummond AJ, Suchard MA. Bayesian phylogeography finds its roots. PLoS Comput Biol. 2009;5:e1000520.
Ferreira M a R, Suchard M a. Bayesian analysis of elapsed times in continuous-time Markov chains. Can J Stat. 2008;36:355–68.
Bielejec F, Rambaut A, Suchard M, Lemey P. SPREAD: spatial phylogenetic reconstruction of evolutionary dynamics. Bioinformatics. 2011;27:2910–2.
Baele G, Lemey P, Bedford T, Rambaut A, Suchard M a, Alekseyenko AV. Improving the accuracy of demographic and molecular clock model comparison while accommodating phylogenetic uncertainty. Mol Biol Evol. 2012;29:2157–67.
Kalish ML, Robbins KE, Pieniazek D, Schaefer A, Nzilambi N, Quinn TC, et al. Recombinant viruses and early global HIV-1 epidemic. Emerg Infect Dis. 2004;10:1227–34.
Rousseau CM, Learn GH, Bhattacharya T, Nickle DC, Heckerman D, Chetty S, et al. Extensive intrasubtype recombination in South African human immunodeficiency virus type 1 subtype C infections. J Virol. 2007;81:4492–500.
Travers SAA, Clewley JP, Glynn JR, Fine PEM, Crampin AC, Sibande F, et al. Timing and reconstruction of the most recent common ancestor of the subtype C clade of human immunodeficiency virus type 1. J Virol. 2004;78:10501–6.
Mehta SR, Wertheim JO, Delport W, Ene L, Tardei G, Duiculescu D, et al. Using phylogeography to characterize the origins of the HIV-1 subtype F epidemic in Romania. Infect Genet Evol. 2011;11:975–9. Elsevier B.V.
Guimarães ML, Vicente ACP, Otsuki K, da Silva RFFC, Francisco M, da Silva FG, et al. Close phylogenetic relationship between Angolan and Romanian HIV-1 subtype F1 isolates. Retrovirology. 2009;6:39.
Delatorre E, Mir D, Bello G. Spatiotemporal dynamics of the HIV-1 subtype G epidemic in West and Central Africa. PLoS One. 2014;9:e98908.
Zeng H, Sun B, Li L, Li Y, Liu Y, Xiao Y, et al. Reconstituting the epidemic history of mono lineage of HIV-1 CRF01_AE in Guizhou province, Southern China. Infect Genet Evol. 2014;26:139–45. Elsevier B.V.
An M, Han X, Xu J, Chu Z, Jia M, Wu H, et al. Reconstituting the epidemic history of HIV strain CRF01_AE among men who have sex with men (MSM) in Liaoning, northeastern China: implications for the expanding epidemic among MSM in China. J Virol. 2012;86:12402–6.
Faria NR, Suchard MA, Abecasis A, Sousa JD, Ndembi N, Bonfim I, et al. Phylodynamics of the HIV-1 CRF02_AG clade in Cameroon. Infect Genet Evol. 2012;12:453–60. Elsevier B.V.
Mir D, Jung M, Delatorre E, Vidal N, Peeters M, Bello G. Phylodynamics of the major HIV-1 CRF02_AG African lineages and its global dissemination. Infect Genet Evol. 2016;S1567–1348:30189-7.
Rome M, Zehender G, Ebranati E, Lai A, Santoro MM, Alteri C, et al. Population Dynamics of HIV-1 Subtype B in a Cohort of. J Acquir Inmune Defic Syndr. 2010;55:156–60.
Hue S, Pillay D, Clewley JP, Pybus OG, Hué S, Pillay D, et al. Genetic analysis reveals the complex structure of HIV-1 transmission within defined risk groups. Proc Natl Acad Sci U S A. 2005;102:4425–9.
Bello G, Eyer-Silva WA, Couto-Fernandez JC, Guimarães ML, Chequer-Fernandez SL, Teixeira SLM, et al. Demographic history of HIV-1 subtypes B and F in Brazil. Infect Genet Evol. 2007;7:263–70.
Walker PR, Pybus OG, Rambaut A, Holmes EC. Comparative population dynamics of HIV-1 subtypes B and C: subtype-specific differences in patterns of epidemic growth. Infect Genet Evol. 2005;5:199–208.
Gräf T, Fritsch HM, de Medeiros RM, Junqueira DM, Almeida SE de M, Pinto AR. Comprehensivecharacterization of the HIV-1 molecular epidemiology and demographic history in the Brazilian region most heavily affected by AIDS. J Virol. 2016;90:8160–8.
Bello G, Guimarães ML, Passaes CPB, Matos Almeida SE, Veloso VG, Morgado MG. Short communication: Evidences of recent decline in the expansion rate of the HIV type 1 subtype C and CRF31_BC epidemics in southern Brazil. AIDS Res Hum Retroviruses. 2009;25:1065–9.
Tatem AJ, Hemelaar J, Gray RR, Salemi M. Spatial accessibility and the spread of HIV-1 subtypes and recombinants. AIDS. 2012;26:2351–60.
We thank the researchers whose publicly available data made this work possible.
E.D. is funded by a fellowship from “Programa Nacional de Pós-Doutorado” (CAPES-Brazil).
Availability of data and material
The sequences analyzed during the current study are available from the Los Alamos HIV Sequence Database and the datasets are available from the corresponding author upon request.
GB & ED conceived the study, performed the bioinformatics analyses and wrote the final manuscript. Both authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Figure S1. Recombination pattern of the circulant recombinant forms analyzed in this study. Box representing pol (HXB2: 2253–3272) and env (HXB2: 7041–7346) gene fragments used in this study were superimposed on the graphical illustrations of the CRFs09\11\13\45_cpx genomes based on breakpoint data available in Los Alamos HIV database and colored according to the legend at bottom. (TIF 343 kb)
Figure S2. Maximum likelihood phylogenetic tree based on the CRF09/11/13/45_cpx pol fragment sequences for HIV-1 subtype (re)classification. Branches were colored according to HIV-1 subtype classification provided by the Los Alamos HIV database and indicated at the legend. Black dots represent the reference genomes of each CRF. For visual clarity, other subtypes not directly related to the CRF09/11/13/45_cpx were collapsed into triangles. The branch support values are indicated as * (SH-aLRT > 0.80 and < 0.90) or ** (SH-aLRT > 0.90) at key nodes. Horizontal branch lengths are drawn to scale with the bar at the bottom indicating nucleotide substitutions per site. (TIF 287 kb)
Figure S3. Maximum likelihood phylogenetic tree based on the CRF09/11/13/45_cpx env fragment sequences for HIV-1 subtype (re)classification. Branches were colored according to HIV-1 subtype classification provided by the Los Alamos HIV database and indicated at the legend. Black dots represent the reference genomes of each CRF. For visual clarity, other subtypes not directly related to the CRF09/11/13/45_cpx and some clades that comprised mostly sequences of subtype A were collapsed into triangles. The branch support values are indicated as * (SH-aLRT > 0.80 and < 0.90) or ** (SH-aLRT > 0.90) at key nodes. Horizontal branch lengths are drawn to scale with the bar at the bottom indicating nucleotide substitutions per site. (TIF 490 kb)
Table S1. Sequences reclassified in this study. (PDF 75 kb)
Table S2. HIV-1 CRFs_cpx pol dataset. (PDF 69 kb)
Table S3. HIV-1 CRFs_cpx env dataset. (PDF 70 kb)
Figure S4. Geographic distribution of the HIV-1 CRFs09/11/13/45_cpx pol (a) and env (b) gene fragments. Tips colors indicate the country of isolation of each sequence, according to the map. Country names are indicated using a two-letter code in accordance with ISO 3166. The external circular segments highlight the position of each specific clade as indicated at the line. Asterisks point to key nodes with a high (>0.90) PP support. Branch lengths are drawn to scale with the concentric circles indicating years. The trees were automatically rooted under the assumption of a relaxed molecular clock. (TIF 2346 kb)
Table S4. Best-fit demographic models for HIV-1 CRF11_cpx pol and env datasets. (PDF 64 kb)
About this article
Cite this article
Delatorre, E., Bello, G. Time-scale of minor HIV-1 complex circulating recombinant forms from Central and West Africa. BMC Evol Biol 16, 249 (2016). https://doi.org/10.1186/s12862-016-0824-8
- Complex CRFs