Journal of Clinical Microbiology, June 2004, p. 2742-2751, Vol. 42, No. 6
0095-1137/04/$08.00+0 DOI: 10.1128/JCM.42.6.2742-2751.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Molecular Virology Laboratory, Molecular Biology and Genetics Unit, Jawaharlal Nehru Centre for Advanced Scientific Research,1 Department of Neuropathology,3 Department of Neurology,10 Department of Neurovirology,2 National Institute of Mental Health and Neurosciences, Microtest Innovations Pvt. Ltd.,4 Seva Free Clinic, Bangalore, Karnataka,6 Department of Infectious Diseases, Medical College Hospital, Thiruvananthapuram, Kerala,7 Department of Virology, Chittaranjan National Cancer Institute, Kolkata, West Bengal,8 Government General Chest Hospital, Erragadda, Osmania Medical College, Hyderabad, Andhra Pradesh, India,9 Center for Genomic Sciences, Allegheny-Singer Research Institute,5 Thomas E. Starzl Transplantation Institute, Department of Surgery,11 Department of Infectious Diseases and Microbiology, University of Pittsburgh, Pittsburgh, Pennsylvania,12
Received 25 September 2003/ Returned for modification 29 October 2003/ Accepted 21 February 2004
NF-κB sites encoded in the subtype C-specific fragment. We implemented this method to screen 256 HIV-1-infected individuals from 35 towns and cities in four states in the south and a city in the east. With the exception of single samples of subtypes A and B and a B/C recombinant, we found all to be infected with subtype C viruses, and the subtype assignments were confirmed in a subset by using heteroduplex mobility assays and phylogenetic analysis of sequences. We propose the use of C-PCR to facilitate rapid molecular epidemiologic characterization to aid vaccine and therapeutic strategies.
Most studies in India have involved a small number of samples and/or were restricted to city or local levels and hence do not offer reliable estimates of subtype prevalence. Furthermore, these studies have also left out the newly emerging high-prevalence areas. For example, characterization of HIV subtypes from the southern states has not been reported, even though a high population prevalence of HIV infection has been predicted on the basis of >1% prevalence among antenatal clinic subjects and a high prevalence among high-risk groups (http://www.naco.nic.in). Recent reports of the potential spread of subtype B infections (15) illustrate a need to distinguish whether this is an artifact of the examination of an isolated cluster or the nucleus of a change in the epidemic.
Among the strategies used for subtyping viruses, sequencing viral genomes followed by phylogenetic analyses is the de facto standard but is an expensive and labor-intensive option. Heteroduplex mobility assays (HMA) (6) are the most often used cost-effective alternative and have excellent concordance with phylogeny-based subtyping (1). In spite of HMA being a technically simpler technique, processing a large number of samples requires running parallel electrophoresis gels with multiple subtypes for each sample and is labor-intensive. Therefore, approaches and methods that allow rapid answers to straightforward questions, such as scoring a given sample for subtype assignment, will facilitate processing large numbers of samples and provide greater coverage of the infected population. Subtype sequence-specific PCR is one such method that targets sequences differentially conserved among subtypes. Analogous to sequence-specific (or allele-specific) PCR that has been used extensively in genomic studies (9, 34, 35), this method exploits genetic differences at the level of primer sequences to differentially amplify fragments specific to a given subtype and has been used with other subtypes of HIV-1 (4, 39). Here, we document a PCR strategy (C-PCR) that exploits differences in the long terminal repeat (LTR) region between subtype C and non-subtype C sequences to generate common HIV-1 and subtype C-specific fragments with mobilities distinguishable in an agarose gel. We have optimized the method and implemented it to screen 256 HIV-infected individuals from 35 cities and towns in five states in the southern and eastern parts of India. We found a preponderance of these individuals to be infected with subtype C viruses.
Primer design. We targeted a genomic region extending from the LTR into gag, containing sequences highly conserved among multiple subtypes, as well as stretches of sequences differentially conserved between subtype C and other subtypes. We designed three sets of primers (Table 1). One set of outer primers was designed to amplify a 975-bp LTR-gag fragment from multiple subtypes. A second set of internal primers was designed to amplify a 138-bp LTR fragment specific to subtype C sequences while being refractory to amplification from non-subtype C sequences. A third set of highly conserved internal primers was designed to amplify a 232-bp LTR leader-gag region fragment from all subtypes (Fig. 1).
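The expected second-round products lend themselves to a simple lane-scoring rule. The sketch below is illustrative only: the function name `call_lane`, the 10-bp sizing tolerance, and the 60-bp allowance for upshifted subtype C bands are assumptions made for this example, not values from this study. Only the 232-bp and 138-bp fragment sizes are taken from the primer design described above.

```python
COMMON_BP = 232      # LTR leader-gag fragment expected from all subtypes
SUBTYPE_C_BP = 138   # LTR fragment expected only from subtype C templates

def call_lane(band_sizes_bp, tol=10, max_upshift=60):
    """Score one gel lane from its estimated band sizes (in bp).

    tol          -- assumed sizing error of the gel, in bp (hypothetical)
    max_upshift  -- assumed maximum size increase of the subtype C band
                    caused by inserted sequences (hypothetical)
    """
    has_common = any(abs(b - COMMON_BP) <= tol for b in band_sizes_bp)
    has_c = any(
        SUBTYPE_C_BP - tol <= b <= SUBTYPE_C_BP + max_upshift
        for b in band_sizes_bp
    )
    if has_common and has_c:
        return "subtype C"
    if has_common:
        return "HIV-1, non-subtype C"
    if has_c:
        return "indeterminate"   # subtype C band without the common band
    return "no amplification"

if __name__ == "__main__":
    for lane in ([232, 138], [230], [235, 153], []):
        print(lane, "->", call_lane(lane))
```

A lane with only the common band is scored as non-subtype C rather than negative, which mirrors the role of the conserved fragment as an internal amplification control.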
TABLE 1. Sequences for primers used in C-PCR
FIG. 1. C-PCR strategy, genomic position, and variability in sequences corresponding to subtype C-specific primers. (Top) LTR/gag region corresponding to primers used in this study and their relative positions (not drawn to scale). The upstream subtype C-specific primer, N415F, was anchored on the upstream stimulatory factor site in the LTR, while the downstream primer was anchored on the NF-κB site. (Bottom) Levels of variability in sequences corresponding to these primers among various subtypes. For each subtype, using the number of sequences indicated, the proportion of different nucleotides at each position was calculated and plotted as a percentage. Subtype C sequences from India (CIN) are shown separately. The numbers below the subtypes indicate the average uncorrected nucleotide distances between the primer and the corresponding subtype. The asterisks indicate the four positions that accounted for much of the variability within subtype C sequences. International Union of Pure and Applied Chemistry codes were used for positions with redundant nucleotides.
20% or more divergent, while N417R was divergent in excess of 14%.
As is evident in Fig. 1, N415F and N417R were specifically designed to contain mismatches at the 3' end to support specific amplification of subtype C sequences while being refractory to amplification from non-subtype C sequences. N415F was anchored on the 3-base sequence in the upstream stimulatory factor motif within the LTR-U3 region, while N417R was anchored on the NF-κB motif that is proximal to the Sp1 sites. As expected from the high levels of heterogeneity observed in HIV-1, we could not find sequences that encoded the exact complement of subtype C-specific primers in the 3' end. However, this does not appear to be a hurdle, since none of the subtype C samples tested thus far have failed to amplify and the few that did not amplify were found to be non-subtype C. This is the first subtype-specific PCR report involving a comprehensive characterization of the regions encoding the primer sequences. The lack of similar characterizations may account for the less than widespread use of previously published primers.
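The two quantities this design relies on, the uncorrected distance between a primer and its target region and the number of mismatches at the primer's 3' end, can be computed as sketched below. This is a minimal illustration, not the analysis pipeline used in the study: the function names and the short sequences are invented for the example (the actual primer sequences are listed in Table 1), and IUPAC ambiguity codes are treated as ordinary characters.

```python
def p_distance(a, b):
    """Uncorrected distance: fraction of aligned, ungapped positions that differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper())
             if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no ungapped positions to compare")
    return sum(x != y for x, y in pairs) / len(pairs)

def three_prime_mismatches(primer, site, window=3):
    """Count mismatches within the last `window` bases of the primer.

    Both strings are written 5'->3' in the primer's orientation and
    must already be aligned to the same length.
    """
    if len(primer) != len(site):
        raise ValueError("primer and site must have equal length")
    tail_p = primer[-window:].upper()
    tail_s = site[-window:].upper()
    return sum(p != s for p, s in zip(tail_p, tail_s))

if __name__ == "__main__":
    primer = "ACGTTGCAGGCT"      # hypothetical primer, not N415F or N417R
    non_c_site = "ACGTTGCAGACA"  # hypothetical non-subtype C target
    print(p_distance(primer, non_c_site))
    print(three_prime_mismatches(primer, non_c_site))
```

Mismatches near the 3' end matter more than the overall distance because the polymerase extends from that end, which is why the two measures are reported separately here.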
PCR and cloning. PCR amplifications were performed using 500 ng of genomic DNA in a 25-µl volume containing 3 mM MgCl2, 200 µM (each) deoxynucleoside triphosphates, 25 pmol of each primer, and 0.625 U of Taq DNA polymerase (Amersham, Piscataway, N.J.). The first-round PCR conditions were as follows: 94°C for 1 min, 60°C for 1 min, and 72°C for 1 min for a total of 25 cycles on a thermocycler (Minicycler; M.J. Research). Two microliters of first-round PCR products was transferred to a second-round PCR that contained the four internal primers. The PCR conditions were as follows: 94°C for 1 min, 65°C for 40 s, and 72°C for 40 s for 35 cycles. After the second round, the PCR products were resolved on a 1.5% agarose gel or a 3% NuSieve agarose gel (FMC Bioproducts, Rockland, Maine), and the ethidium bromide-stained DNA bands were captured using a gel documentation system (Alpha Innotech, San Leandro, Calif.). The quality control measures included the negative controls containing template DNAs from uninfected individuals and blind analyses of samples from different known subtypes.
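As a quick sanity check on the protocol above, the following sketch derives the final primer concentration and the programmed hold time of each round from the stated conditions. The dictionary layout and function names are assumptions made for this example, and ramp times, any initial denaturation, and any final extension are not included because they are not specified in the text.

```python
# Step durations in seconds, taken from the conditions described above.
ROUND_1 = {"cycles": 25, "steps_s": (60, 60, 60)}   # 94 °C, 60 °C, 72 °C
ROUND_2 = {"cycles": 35, "steps_s": (60, 40, 40)}   # 94 °C, 65 °C, 72 °C

def hold_time_min(program):
    """Total programmed hold time in minutes (ramping excluded)."""
    return program["cycles"] * sum(program["steps_s"]) / 60

def primer_conc_um(pmol, volume_ul):
    """Final primer concentration in µM; 1 pmol/µl equals 1 µM."""
    return pmol / volume_ul

if __name__ == "__main__":
    print("round 1 (min):", hold_time_min(ROUND_1))
    print("round 2 (min):", round(hold_time_min(ROUND_2), 1))
    print("primer (µM):", primer_conc_um(25, 25))
```

Under these assumptions the two rounds together account for a little over two and a half hours of block time, before gel electrophoresis.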
Annealing temperatures and Mg concentrations were optimized for sensitivity and specificity, individually and in multiplex PCR amplifications using DNA templates from two unrelated molecular clones of subtype C (pINDIE [30] and pMJ4 [33]) and one of subtype B (pYU-2 [25]). Defined concentrations of plasmid DNA were serially diluted in the presence of 500 ng of salmon sperm DNA to derive precise template copy numbers ranging from <1 to 10,000 copies in each reaction. We found the subtype C-specific and common HIV primers to consistently amplify 1 to 10 copies of the template DNA in uniplex PCR experiments (Fig. 2). Conditions optimized in uniplex PCR amplifications were found to work in multiplex PCR with no decrease in sensitivity and/or specificity. These findings suggest consistent and reliable detection of small numbers of copies of HIV using the primer pairs and amplification conditions outlined here.
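The relationship between plasmid mass and template copy number used in such dilution series can be sketched as follows. This is an illustration under stated assumptions: the 12,000-bp plasmid length is hypothetical (the sizes of pINDIE, pMJ4, and pYU-2 are not given here), 650 g/mol per base pair is the usual approximation for double-stranded DNA, and the Poisson step is a standard way to interpret a mean of fewer than one copy per reaction rather than something reported in this study.

```python
import math

AVOGADRO = 6.02214076e23   # molecules per mole
BP_MASS = 650.0            # approximate g/mol per base pair of dsDNA

def copies_from_mass_ng(mass_ng, length_bp):
    """Number of plasmid molecules in `mass_ng` nanograms of DNA."""
    return mass_ng * 1e-9 * AVOGADRO / (length_bp * BP_MASS)

def mass_ng_for_copies(copies, length_bp):
    """Mass in nanograms that contains `copies` plasmid molecules."""
    return copies * length_bp * BP_MASS / AVOGADRO * 1e9

def tenfold_series(start_copies, steps):
    """Mean copies per reaction along a 10-fold dilution series."""
    return [start_copies / 10 ** i for i in range(steps)]

def p_at_least_one(mean_copies):
    """Poisson probability that a reaction receives one or more templates."""
    return 1.0 - math.exp(-mean_copies)

if __name__ == "__main__":
    plasmid_bp = 12_000   # hypothetical plasmid length
    print("ng for 10,000 copies:", mass_ng_for_copies(10_000, plasmid_bp))
    for mean in tenfold_series(10_000, 6):
        print(mean, round(p_at_least_one(mean), 3))
```

The last rows of the series show why the lowest dilutions are expected to amplify only intermittently: at a mean of one copy per reaction, roughly a third of reactions receive no template at all.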
FIG. 2. Specificity and sensitivity of C-PCR. (A) Specificity of C-PCR was assessed using molecular clones derived from different subtypes of HIV-1 in the following order (from left): A (p92UG037.1), A (p90CF402.1.8), B (pBH10), B (pYU2), C2 (pMJ4), C3 (pINDIE), C4 (p92BR025.8), D (p94UG114.1), D (p94UG114.1.6), D (p84ZR085.1), E (p90CF402.1), F (p93BR020.1), G (p92NG003.1), G (p92NG083.2), and H (p90CF056.1). Lane M, DNA size standard; , control lane. The extra NF-κB site present in the subtype C4 molecular clone can be seen to lead to increased size of the subtype C-specific fragment. (B) Sensitivity of C-PCR. The subtype C molecular clone pINDIE was used at the copy numbers shown above the lanes. Optimization of uniplex PCR conditions and their subsequent use in multiplex PCR with no loss of sensitivity are seen. , control lane.
HMA and sequence analysis. HMA was performed according to methods outlined by Delwart et al. (6), using the HMA-subtyping kit supplied by the NIH AIDS Reference and Reagent Program. An approximately 700-bp fragment of the env gene, spanning the C2 to V5 domains, was amplified using the primers ES7 and ES8. Heteroduplexes were formed by mixing equal amounts of amplified DNA (approximately 500 ng of each) in a volume of 35 µl and boiling the mixture in the presence of 100 mM NaCl and 2 mM EDTA for 5 min, followed by a quick chill and incubation on ice for 90 min. The heteroduplexes were separated on 5% polyacrylamide gels, stained with ethidium bromide, and scored for subtype assignment. Forty-eight of the 256 clinical samples were tested for concordance among HMA, C-PCR, and phylogenetic analysis.
Sequences corresponding to the LTR and env C2V5 regions used for phylogenetic reconstructions were collected from multiple subtypes from the Los Alamos database (22), aligned at the codon level, and gap stripped. The rates of nucleotide substitution and gamma distribution parameters were estimated using Paup* (51) as described earlier (47). Subtype assignments and depiction of phylogenetic relationships were accomplished using the codeml (with discrete NS site model) and baseml programs for the env and LTR sequences, respectively, in PAML (56).
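Gap stripping of a codon-level alignment, as mentioned above, amounts to removing every codon column in which at least one sequence carries a gap. The sketch below shows one way to do this; it is not the tooling used in the study (which is not described), and the function name and toy alignment are invented for the example.

```python
def gap_strip_codons(alignment):
    """Drop every codon column in which any sequence contains a gap.

    alignment -- dict mapping sequence name to an aligned, in-frame
                 nucleotide string; all strings must share one length
                 that is a multiple of three.
    """
    seqs = list(alignment.values())
    if not seqs:
        return {}
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all sequences must have the same aligned length")
    if length % 3:
        raise ValueError("aligned length must be a multiple of three")
    # Start positions of codons that are gap-free in every sequence.
    keep = [i for i in range(0, length, 3)
            if all("-" not in s[i:i + 3] for s in seqs)]
    return {name: "".join(s[i:i + 3] for i in keep)
            for name, s in alignment.items()}

if __name__ == "__main__":
    toy = {                      # hypothetical sequences
        "sample_1": "ATGAAA---GGG",
        "sample_2": "ATG---CCCGGG",
        "reference": "ATGAAACCCGGG",
    }
    for name, seq in gap_strip_codons(toy).items():
        print(name, seq)
```

Removing whole codons rather than single columns keeps the reading frame intact, which codon-based substitution models require.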
Nucleotide sequence accession numbers. The GenBank accession numbers for the sequences in this study are AY567495 to AY567539 and AY567474 to AY567486.
NF-κB site in clone C4 was reflected in the increased size of the subtype C-specific fragment. In addition to these prototype clones, C-PCR accurately subtyped DNAs extracted from primary cultures of subtype C2, C3, A1, and B2 viruses (provided by the National AIDS Research Institute, Pune, India). Consistent amplification of common HIV and subtype C-specific fragments using template copies ranging from 10,000 to <1 in uniplex and multiplex PCRs is illustrated in Fig. 2B. The overall subtype specificity and the ability of each subtype C variant to support PCR amplification suggested that these primers are highly specific to subtype C viruses and that they amplify fragments of predicted sizes from different representative subtype C viruses.
Subtyping clinical samples using C-PCR. Subtype-specific C-PCR was implemented to screen DNAs isolated from 256 samples obtained from 35 towns or cities spread over four states in the south and one city in the east of India. These samples were derived from individuals previously diagnosed as having been HIV infected using serologic assays. Among the five states, Karnataka, Andhra Pradesh, and Tamil Nadu have been considered high-prevalence states based on >1% prevalence among women attending antenatal clinics and a high prevalence among sexually transmitted disease patients. Kerala is considered a low-prevalence state based on low prevalence among sexually transmitted disease patients and <0.1% prevalence among antenatal-clinic patients. Subtype C viruses have been identified as the major subtype circulating in West Bengal State in earlier studies (28, 29, 47), and these samples were included to allow comparison with previous findings.
Each of the clinical samples tested in this study generated fragments of the predicted sizes suggestive of the presence of three or four NF-κB sites (Fig. 3A). The heteroduplex mobility patterns of selected representative samples are presented below the corresponding C-PCR lanes. With the exception of one sample from Bangalore, all of the samples identified as subtype C in C-PCR were also determined to be subtype C by HMA (Fig. 3B). All 177 samples from Karnataka screened by C-PCR and 32 env sequences obtained showed the presence of subtype C. One sample was discordant between C-PCR and HMA analyses, suggesting potential infection by a recombinant virus (Fig. 3D). Analysis of nucleotide sequences over the LTR and env regions of this sample confirmed the presence of a B/C recombinant. This is the first report identifying a subtype B/C recombinant virus in India. Each of the 13 samples from Kerala and 11 from Tamil Nadu State was also identified as subtype C by C-PCR. Subtype designation of seven of these samples by using HMA found all of them to be concordant with C-PCR results. Of the 15 samples from Andhra Pradesh that were analyzed, 14 were found to be subtype C by C-PCR and one sample failed to amplify a subtype C-specific fragment and was subsequently identified as subtype B by HMA (Fig. 3C). Similarly, one sample out of the 40 from West Bengal State failed to amplify a subtype C-specific fragment and was identified as subtype A by analysis of the env sequence (results not shown). Overall, C-PCR screening of 256 samples from different states indicated an absence of subtype C sequences in only two samples, both of which were validated using HMA and sequencing.
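The per-state counts reported above can be cross-checked with a short tally. The counts are taken directly from the text; the data structure and variable names are chosen for this example only. Note that the Karnataka total includes the B/C recombinant, which carries a subtype C LTR and is therefore positive by C-PCR.

```python
# state: (samples screened, samples positive for the subtype C fragment)
RESULTS = {
    "Karnataka": (177, 177),
    "Kerala": (13, 13),
    "Tamil Nadu": (11, 11),
    "Andhra Pradesh": (15, 14),
    "West Bengal": (40, 39),
}

total = sum(screened for screened, _ in RESULTS.values())
c_positive = sum(positive for _, positive in RESULTS.values())
percent_c = 100 * c_positive / total

if __name__ == "__main__":
    print(f"{c_positive} of {total} samples positive ({percent_c:.1f}%)")
```

The tally reproduces the 256 samples screened and the two samples lacking a subtype C-specific fragment.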
FIG. 3. Subtyping clinical samples with C-PCR, HMA, and sequencing. (A) Illustration of typical gel profiles of subtype C sequences obtained using C-PCR. Each lane represents a different clinical sample. , control reaction mixture that contained DNA from an individual not infected with HIV-1. (B) HMA analysis of a subset of samples identified as subtype C by C-PCR. These representative samples demonstrate the typical HMA profiles of subtype C viruses. The labeling above the lanes indicates the subtype standards used. Although additional subtype standards were used in the analysis, only profiles with subtype A and C standards are shown. , control lanes that did not include subtype standards. (C) A non-C virus identified as subtype B by HMA. C-PCR failed to amplify the LTR fragment from this sample. (D) A single recombinant virus identified in this study was subtype C in the LTR and subtype B in env. (E) Insertion of additional sequences resembling the NF-κB motif is common in subtype C strains and results in bands with lower mobility than expected (Fig. 1A). The insertion of a 15-bp sequence between two authentic NF-κB motifs of one such clinical sample has been confirmed by sequencing the LTR. An NF-κB-like motif is highlighted.
NF-κB-like sequences in these cases (Fig. 3E). Similar to that observed for a subtype C4 clone (Fig. 2A) and clinical samples (Fig. 3A), of the 254 subtype C-specific fragments, 35 contained larger fragments, suggesting the presence of four NF-κB sites. These 35 samples were distributed over 17 towns among the states studied here (Table 2).
TABLE 2. Subtype assignments of HIV-1 using three methods on samples obtained from Indian states
We analyzed env C2V5 and LTR sequences from subsets of 45 and 13 samples, respectively (Table 2 and Fig. 4). In phylogenetic reconstructions, env sequences sampled from each of the 45 individuals drawn from the four southern states clustered unambiguously with other subtype C sequences from India and other parts of the world. Similarly, all 13 LTR sequences sampled in this study also clustered with other subtype C sequences. These analyses showed that subtyping viruses based on subtype-specific PCR of the LTR region is consistent with the subtype assignments for the env genes and LTRs in these samples. As illustrated in Fig. 5, the study subjects were from different areas in the southern part of the country. The places where non-subtype C viruses were identified are indicated.
FIG. 4. Phylograms depicting the relationship between env and LTR sequences obtained in this study (solid circles) and sequences obtained from the HIV database. Single non-subtype C env sequences obtained from the database are identified by the final digits of their accession numbers. Subtype C sequences from eight other countries were included in the analysis and are identified by open triangles along with their two-letter country codes and the accession numbers. The open circles identify subtype C sequences from India available in the database. The phylograms were obtained from gap-stripped alignments and edited. The log likelihood (lnL) scores for the phylograms and the scale bars corresponding to substitutions per codon are indicated.
FIG. 5. Geographic spread in the assessment of subtype prevalence in India. The four states in the south and the one state in the east of India from which the clinical samples for this study were collected are shaded. The urban centers of the states are indicated by stars, and the towns and cities are indicated by triangles. Subtype C viruses were found in all of these places, and the three places where non-subtype C viruses or recombinants were detected are circled.
Even though we designed both of our primers to contain subtype-specific mismatches, it is possible to discriminate subtypes by using a single subtype-specific primer. With the exception of subtype A viruses, which show high variability in the 3' end of N415F (Fig. 1), the region we have targeted appears to be highly suited for designing primers corresponding to each of the other subtypes. However, we focused this study solely on identifying the presence of subtype C sequences because no information on molecular epidemiology is available from the southern parts of India. These regions represent major epicenters of viral infection, and in view of recent reports documenting the presence of non-subtype C viruses, the need to identify subtype profiles in the emerging epidemics in the southern parts of India is of utmost importance and urgency.
In contrast to most earlier studies that have characterized HIV from major urban epicenters, this study was specifically targeted to explore the nature of the viruses circulating in rural parts of India that are geographically distant from major epicenters in each state. While demographic and other information from nonurban areas of the country is slowly becoming available, the proper design of vaccine and antiretroviral therapy programs also requires molecular characterization. Even though we have examined a small and disproportionate number of samples, the study design incorporating a wide geographic area makes up for this deficiency. Regardless of the small number of non-subtype C viruses identified, this study indicates a strong association between subtype C viruses and the epidemic in the southern parts of India. Documentation of the presence of intersubtype recombinants in this study and in others makes it necessary to design strategies specifically targeted to identify them. This can be accomplished by implementing a strategy analogous to C-PCR in a more distant region of the genome. Previous studies have documented recombinants using a similar strategy but involving testing for discordance between gag and env subtype assignments by using HMA (3, 16).
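The discordance test described here reduces to comparing subtype calls made on two distant genomic regions of the same sample. The following sketch shows the idea; the function name, the sample identifiers, and the calls are hypothetical, and a flagged sample is only a candidate recombinant that would still need confirmation by sequencing.

```python
def flag_discordant(calls, region_a="LTR", region_b="env"):
    """Return sample IDs whose subtype calls differ between two regions.

    calls -- dict mapping sample ID to a dict of region -> subtype call;
             samples missing either call are skipped, not flagged.
    """
    flagged = []
    for sample, by_region in calls.items():
        a = by_region.get(region_a)
        b = by_region.get(region_b)
        if a and b and a != b:
            flagged.append(sample)
    return sorted(flagged)

if __name__ == "__main__":
    example_calls = {                       # hypothetical data
        "S001": {"LTR": "C", "env": "C"},
        "S002": {"LTR": "C", "env": "B"},   # candidate B/C recombinant
        "S003": {"LTR": "C"},               # env not typed
    }
    print(flag_discordant(example_calls))
```

Skipping samples with a missing call avoids mistaking an amplification failure for evidence of recombination.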
The region of the viral genome targeted for the development of C-PCR has a unique significance among subtype C viruses, which was the basis for our choice. Primer N417R was specifically designed to exploit the features conserved within and flanking the NF-κB motif. While most subtype C viruses contain three NF-κB sites in their LTRs (17, 20, 21, 32), a number of them have been reported to possess an additional fourth motif that resembles an NF-κB site (18, 31, 46). Although the functional significance of these additional NF-κB sites is not clear, their presence in the subtype C LTR is believed to enhance the transactivation property of the viral promoter (31, 43, 44). The link between variable numbers of NF-κB sites and potential biological significance, and the conservation of sequences in the targeted region, supports a wider use of C-PCR to identify subtype C viruses in other parts of the world. In addition to identifying the presence of subtype C viruses, C-PCR can also help to identify and score the number of NF-κB sites.
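Where an LTR sequence is available, the number of NF-κB-like motifs can also be counted directly. The sketch below is illustrative and rests on assumptions: the degenerate pattern GGGRNNYYCC is a deliberately loose stand-in chosen for this example rather than a definition used in the study, the test sequences are invented, and only the given strand is scanned.

```python
import re

# Loose, illustrative pattern: GGG, a purine, any two bases, two
# pyrimidines, then CC. The lookahead allows overlapping matches.
KAPPA_B_LIKE = re.compile(r"(?=(GGG[AG][ACGT]{2}[CT]{2}CC))")

def find_kappa_b_like(sequence):
    """Return (start, motif) pairs for each NF-kB-like match, 0-based."""
    seq = sequence.upper()
    return [(m.start(), m.group(1)) for m in KAPPA_B_LIKE.finditer(seq)]

def count_kappa_b_like(sequence):
    """Number of NF-kB-like motifs on the given strand."""
    return len(find_kappa_b_like(sequence))

if __name__ == "__main__":
    # Hypothetical enhancer-like sequences, not taken from any isolate.
    three_sites = "TTAAGGGACTTTCCGCTGGGACTTTCCAGAGGGGCGTTCCAGG"
    four_sites = "TTAAGGGACTTTCCGCT" + three_sites
    print(count_kappa_b_like(three_sites), find_kappa_b_like(three_sites))
    print(count_kappa_b_like(four_sites))
```

A sequence-based count of this kind complements the gel-based estimate, since inserted sequences that merely resemble the motif also shift the band.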
With the C-PCR strategy, it is important to note that the enhanced sensitivity of nested PCR is invariably accompanied by higher susceptibility to carryover contamination. To make sure that such contamination did not influence any of the results outlined here, we implemented strict procedural controls that included template preparation, PCR setup, and gel electrophoresis in physically separated laboratories. In addition, each PCR experiment included replicates containing DNAs from individuals who were not infected with HIV. Therefore, it is necessary to take similar uncompromising precautions to prevent artifactual interpretations. The possibility of contamination, however, is not a unique problem for C-PCR but also applies to HMA analysis that is also designed with a nested-PCR format.
C-PCR is highly specific, sensitive, and discriminatory for subtype C strains. Unlike HMA, C-PCR is amenable to automation, and detection is simpler and more economical. C-PCR, therefore, could be of use for large-scale molecular epidemiology studies. Especially in a country like India, where subtype C viruses cause a large proportion of infections, C-PCR offers the advantage of rapid and less expensive screening at the primary level. Importantly, C-PCR in combination with HMA could detect recombinant viruses, as the two techniques target different regions of the virus (Fig. 3D). C-PCR, by design, multiplexes two individual regions of the subtype C virus, thus offering the advantage of reducing false-negative results due to genetic diversity. The leader/gag region is one of the most conserved regions of the virus. C-PCR, therefore, is highly sensitive in detecting all HIV-1 infections regardless of the subtype. In our experience with hundreds of primary clinical samples, amplification of the LTR/leader/gag region failed only on very rare occasions. This is in contrast to the frequent failures encountered in the amplification of the env region for HMA. LTR/leader/gag amplification occurred even in a few cases where we could not amplify the env region for HMA analysis. In earlier studies, a large number of samples could not be genotyped using HMA due to ambiguous band patterns (18, 45). In one study, 17 out of 52 samples were untypeable by HMA (28). We also encountered several samples that could not be characterized by HMA; however, C-PCR identified all of these samples without ambiguity.
Identifying subtype prevalence, especially in regions where multiple subtypes circulate, is necessary for a number of reasons. Subtype characterization impacts strongly on our understanding of the epidemic and approaches to tackle it. No clear evidence linking genetic subtypes with differences in disease progression is available, but the link cannot be ruled out. Subtype differences are also expected to impact strongly on the efficacy of a vaccine and are most relevant to resource-poor countries. The presence of differences in the pattern of cytotoxic-T-lymphocyte epitope recognition (36, 37) suggests that the high level of sequence differences among subtypes is an important variable in the design of a vaccine. Pending the availability of a potent vaccine, emphasis is being placed on making antiretroviral therapies available in resource-poor settings (42). However, the impact of subtype differences on the outcome following therapy is not clear, since nearly all the available antiretroviral agents were designed in a subtype B setting. Even though most of these agents appear to be effective in suppressing non-subtype B viruses to similar extents, significant differences in enzyme kinetics and mutations associated with reduced sensitivity have been documented (50). For example, subtype C and A polymerases appear to exhibit reduced affinities to a panel of antiretroviral agents (54) and to amplify the effect of mutations associated with reduced susceptibility (55). In addition, Diallo et al. and Loemba et al. have documented distinct differences between subtypes in the rates at which mutations associated with reduced sensitivity to antiretroviral agents evolve and the differences in mutations within subtypes C and B that are associated with reduced sensitivity to antiretroviral agents (7, 26). These studies suggest that identifying subtypes at individual and population levels is an important step in monitoring and managing the epidemic.
In summary, we have optimized the conditions for specific amplification of subtype C sequences using C-PCR and implemented it in the analysis of 256 samples derived from a wide geographic area in the southern part of India. With the exception of single samples containing subtypes A and B and a B/C recombinant, the rest contained subtype C viruses. This is the first report of subtype prevalence in the emerging areas of high prevalence in the southern part of India. This study addresses the need for identifying the proportion of infections caused by different subtypes in the design and implementation of vaccine and antiretroviral therapy programs.
The HMA kit and other reagents were received from the NIH AIDS Research and Reference Reagent Program and the Centralised Facility for AIDS Reagents, National Institute for Biological Standards and Control, UNAIDS. Help from the following individuals in collecting samples from HIV-seropositive individuals is gratefully acknowledged: A. O. Saroja, Consultant Neurologist, KLES Hospital, Belgaum, Karnataka, India; M. N. Balamurugan, Consultant Neurologist, Salem, Tamil Nadu, India; E. Srikanth Reddy, Consultant Neurologist, Vijayawada, Andhra Pradesh, India; James Joseph, Seva Free Clinic, Bangalore, India; and Phalguni Gupta, Department of Infectious Diseases and Microbiology, University of Pittsburgh, Pittsburgh, Pa. R.S. acknowledges G. Ehrlich and C. Post for support and encouragement. The Human Brain Tissue Repository for Neurobiological Studies at NIMHANS, Bangalore, India, is acknowledged for providing samples from the Body Fluid Bank.
NF-κB-binding motifs in the long terminal repeat region of South African HIV type 1 subtype C isolates. AIDS Res. Hum. Retrovir. 16:305-306.
NF-κB and the Tat transactivator. Virology 296:77-83.