Previous Article | Next Article ![]()
Journal of Clinical Microbiology, August 2006, p. 2792-2800, Vol. 44, No. 8
0095-1137/06/$08.00+0 doi:10.1128/JCM.00295-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
bioMerieux, Emerging Pathogens Department, Marcy-l'Etoile, France,1 Institut National de la Santé et de la Recherche Médicale, INSERM Unit 271, Lyon, France,2 Virus Reference Division, Health Protection Agency, London, United Kingdom,3 Azienda Ospedaliera, Pisa, Italy,4 Department of Biotechnology, Intercollegiate Faculty of Biotechnology, Gdansk, Poland5
Received 10 February 2006/ Returned for modification 25 April 2006/ Accepted 14 May 2006
|
|
|---|
|
|
|---|
The virologic monitoring of HBV infection and its treatment is critical for efficiently controlling the disease and improving patient management. Several diagnostic tools can be used to evaluate the immune and virological parameters of the patient: hepatitis B s antigen (HBsAg), antibodies to HBsAg, hepatitis B e antigen (HBeAg), antibodies to HBeAg, antibodies (immunoglobulin M [IgM] and IgG) to hepatitis B core antigen, and viral load and genotypic assays.
Currently, genotypic assays focus on the major mutations in the region of the Pol gene coding for the virus reverse transcriptase (RT) that confer resistance to antiviral treatment (4). However, many other polymorphisms have been shown to have effects on the level of viral replication and the pathogenesis of liver disease and on vaccine and interferon treatment failure. Mutations in the viral polymerase confer resistance to lamivudine, entecavir, and adefovir (reviewed in reference 30), and new mutations will probably also affect the efficiency of molecules currently in clinical trials. As is the case for human immunodeficiency virus (15), resistance testing will probably be recommended to monitor HBV-infected patients in the different settings where the presence of resistant variants is suspected. Both genotypic and phenotypic resistance tests currently exist (16). Mutations in the S region that encode the viral envelope proteins have been associated with vaccine, immune treatment, and diagnostic escape (5, 13). Mutations in the a determinant of the S antigen do not seem to increase over time and to seriously affect vaccination efficiency, as shown by a 15-year survey in Taiwan following nationwide vaccination (19). However, they have been detected in patients from other geographic origins, and these variant virus strains may escape current diagnostic tests (12, 17, 18). Polymorphisms in the precore-, core-, pre-S1-, and pre-S2-coding regions and in the basal core promoter (BCP) may affect virus replication and impact disease outcome. Mutations in the pre-core gene, especially the stop codon at position 28, which results in HBeAg seronegativity, are more often seen in patients with persistent viremia and active liver disease (8-12). Mutations in the BCP, especially at positions 1762 and 1764, down-regulate HBeAg synthesis. They have been associated with fulminant hepatitis (28), more-severe liver damage (23, 31), and HCC (3). The X gene has been associated with evolution of HBV infection toward HCC. Polymorphisms monitoring the X-coding region may be of interest (32, 36). There are eight HBV genotypes, from A to H, that have different geographical distributions. As for hepatitis C virus, different genotypes may have different disease outcomes and sensitivities to treatments. Patients infected with genotypes C and D have a faster progression to liver fibrosis and HCC than those infected by genotype A or B (11, 14, 21, 25, 33, 34, 35). The duration of the HBeAg-positive replicative phase varies for the different genotypes, being shorter for A and D, which dominate in Africa, the Mediterranean region, the Middle East, and India, than for B and C, which dominate in Southeast and East Asia. This may explain why horizontal transmission is favored in the former countries, while transmission is mostly vertical in the latter countries (27). Resistance to lamivudine seems to appear more frequently and rapidly in genotype A than in genotype D (37, 38).
The small size of the HBV genome (3.2 kilobases) makes genotyping and polymorphism analysis possible for all viral genes. This may provide clinically relevant information to adapt patient management and adjust treatment strategies. Currently, genotypic analysis is performed mainly by either sequencing or the reverse-hybridization assay. Sequencing of the complete HBV genome is expensive and time consuming and requires highly skilled personal. Reverse-hybridization assays are limited to a few viral genome nucleotide positions.
We have designed a prototype genotypic assay based on PCR and DNA microarrays (the HBV DNA chip, or "the Chip") for the determination from patient serum of HBV genotype and the detection of polymorphisms at 151 positions in the Pol-, S-, precore-, core-, pre-S1-, pre-S2-, and X-coding regions and the BCP region. This assay has been evaluated for genotype determination and mutation detection in four genomic lociBCP, precore, S, and Polusing a panel of plasma samples collected through the HepBvar network from patients originating from four European countries and which were characterized by sequencing. This study provides detailed information on the performance of the HBV DNA chip as a tool for patient monitoring and for research. It also highlights technical aspects to be considered for the optimization of a chip-based reagent for the characterization of infectious agents.
|
|
|---|
HBV DNA extraction. Viral DNA was extracted from HBV-infected plasma using a QIAamp UltraSens virus kit from QIAGEN (QIAGEN, Hilden, Germany) according to the manufacturer's protocol.
Amplification of the HBV DNA by PCR. After being extracted, the whole DNA genome was amplified by duplex PCR to generate amplicons 1,509- and 1,721-bp long. HBV DNA was amplified with primers A1, A2, and A3 for the 1.5-kb amplicon and primers A4 and A5 for the 1.7-kb amplicon (see primer sequences below). The PCR mixture consisted of 20 µl of extracted DNA, 0.2 µM (each) primer, 0.3 mM (each) deoxyribonucleotide triphosphate, and 2 U of Fast Start Taq DNA polymerase (Roche, Basel, Switzerland) in an amplification buffer containing 2 mM MgCl2. Amplification was performed with the following steps: denaturation for 4 min at 94°C; 40 cycles of 20 s at 93°C, 45 s at 52.5°C, and 2 min 30 s at 68°C; and an elongation step of 7 min at 68°C. Then 5 µl of PCR product was subjected to agarose gel electrophoresis, using a 1% agarose standard (Eurobio, Cortaboeuf, France) run in 0.5x Tris borate EDTA (Eurobio). The following primers were used for the 1.5-kb amplicon: TCCCCAACCTCCAATCAC (forward primer A1), TCCCAAATCTCCAGTCAC (forward primer A2, genotype B-specific), and AAAAAGTTGCATGGTGCTGGT (reverse primer A3). For the 1.72-kb amplicon, the following primers were used: CTTTTTCACCTCTGCCTAATCA (forward primer A4) and GGGACTGCGAATTTTGGC (reverse primer A5).
Labeling and cleavage. Amplicons were biotinylated as previously described (6). Briefly, a 45-µl aliquot of each positive PCR product was biotin labeled by mixing it with 83 µl of RNase- and DNase-free water (Sigma) and 100 µl of 0.1 M meta-biotin-phenylmethyl-diazomethyl (m-bioPMdam; bioMérieux, Marcy LEtoile, France) at 95°C for 25 min in a dry bath. Depurination-mediated cleavage was performed in 8.8 mM HCl (22 µl of 0.1 N HCl per assay) in a final volume of 250 µl at 95°C for 5 min. Fragmented, labeled DNA was purified with a QIAquick 8 PCR purification kit according to the manufacturer's protocol (QIAGEN, Hilden, Germany) and eluted in 120 µl of elution buffer.
DNA chip hybridization and reading. A mixture of 100 µl of the purified biotin-labeled products was hybridized on a high-density DNA chip (designed by bioMérieux and manufactured by Affymetrix, Santa Clara, Calif.) in a final volume of 500 µl containing 400 µl of hybridization buffer (12x SSPE [1x SSPE is 0.18 M NaCl, 10mM NaH2PO4, and 1 mM EDTA, pH 7.7], 0.06% antifoam, 0.09% sodium azide, and 0.5 pmol/assay of control oligonucleotide), denatured at 95°C for 10 min and hybridized onto the array, using the automated Affymetrix GeneChip Fluidic Station 400, at 45°C for 1 h. After the hybridization step, the DNA chip was washed at 30°C in wash buffer type A containing 2x SSPE, 0.03% Triton X-100, 0.006% antifoam, and 0.027% sodium azide, then in wash buffer type B containing 6x SSPE, 0.03% Triton X-100, 0.006% antifoam, and 0.027% sodium azide at the same temperature. The DNA chip was stained for 15 min at 35°C in 510 µl of a solution containing 500 µl of staining buffer (100 mM Tris, 1 M NaCl, 0.05% Tween-20, 0.005% antifoam, and 0.09% sodium azide, [pH 7.2]), 5 µl of bovine serum albumin (0.5 mg/ml; Gibco-BRL, Gaithersburg, Md.), and 5 µl of streptavidin-R-phycoerythrin conjugate (3 µg/ml; Dako, Glostrup, Denmark). A final wash was then performed in wash buffer type B at 25°C. A Gene Array scanner (a confocal laser reader from Agilent, Palo Alto, Calif.) with a pixel resolution of 3 µm and a wavelength of 570 nm was used to detect the fluorescence signal emitted by the target bound to a given probe of the chip.
DNA chip design and results analysis. Each base was identified using a set of four 20-mer oligonucleotide probes. All probes have the same sequence, matching the target sequence except at position 12, corresponding to the interrogated position, where they have one of the four possible nucleotides. The correct base was selected if the intensity of the fluorescence signal generated by the corresponding probe was at least 1.2 times higher than the intensity of any of the three other probes. To detect the presence of a mutant codon, at least 5 consecutive bases, including the codon bases, were resequenced. The probe sets for a given amino acid constitute a tile. Tiles of five bases have been used to identify wild-type codons: the three codon bases plus one upstream base and one downstream base. Tiles of seven bases are used to identify mutant codons: the mutated base, three upstream bases, and three downstream bases. Because natural polymorphisms and other mutations can affect the hybridization of probes, comprehensive sequence databases were built and different tiles containing polymorphisms significantly represented in these databases as well as described mutations were used to identify a wild-type or mutant codon. There are 83,048 probes on the surface of the chip. A percentage of the homology between the experimental sequence and the different reference sequences tiled on the array was assigned to each interrogated position (the base call). The ratio between the median intensity fluorescence signals of probes in a tile that correctly identify a base and the median background in the area of the chip where these probes are located (It/b) was also calculated. This parameter provides confidence in the robustness of tile selection. Probe array cell intensities, base calls, the It/b ratios, and reports were generated by GeneChip version 3.2 software developed by Affymetrix and Test Platform software developed by bioMerieux. The report sheet included the codon positions, amino acid substitutions, and corresponding nucleotide substitutions identified as different from those of the wild-type reference HBV sequence (genotype A; GenBank accession number X02763) in the region of interest.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Positions and mutations for which probes have been included on the chip
|
Analytical sensitivity. One hundred seventy plasma samples, collected in four European countries through the HepBvar network, were analyzed. The viral load and genotype of all samples were known. The distribution of genotypes as determined by sequencing was representative of a European origin of patients, with a large majority of genotypes A (61 samples) and D (74 samples). No genotype G, only 1 genotype F (from Italy), and 2 genotypes H (from Poland) were included in the study.
Among the 170 samples, 153 (90%) could be amplified by the duplex PCR protocol and subsequently analyzed using the chip; i.e., an amino acid could be identified in the majority of the positions and the genotype of the virus could be determined. Failure to amplify samples is likely to be due to low viral load (Fig. 1). All the samples which could not be analyzed had a viral load equal to or lower than 4.3 x 104 genome copies/ml and 9 of 17 (52.9%) had a viral load below 103 copies/ml. It is likely that difficulties in amplifying low-viral-load samples are due to the long lengths of the two amplicons (1.5 kb and 1.7 kb) that must be generated according to the PCR protocol. Amplification efficiency was not significantly associated with viral genotypes (data not shown).
![]() View larger version (24K): [in a new window] |
FIG. 1. Analytical sensitivity of the chip. Viral load (VL) is expressed as the number of HBV genome copies/ml for a serum sample. Data were considered interpretable when an amino acid could be identified in the majority of the positions and the genotype of the virus could be determined.
|
|
View this table: [in a new window] |
TABLE 2. Reproducibility of the chipa
|
Mixture detection. To demonstrate the capacity of the chip reagent to detect mixtures of variants, we prepared mixtures at different ratios of two cloned HBV genomes in HBV-negative plasma. One viral genome harbored the wild-type sequence, and the other has a mutation at position rt180 (L180M). The final virus concentration ranged from 104 to 107 copies of the HBV genome/ml. Figure 2 shows the results obtained at position rt180 for the It/b values of L180 and L180M. The values are those obtained for the first tile and for the second tile corresponding to the expected alternative amino acid. The minor population can generally be detected when it represents 30% to 40% of the mixture, except at the lowest virus concentration tested for which L180M could be detected when present as 50% of the whole viral population.
![]() View larger version (17K): [in a new window] |
FIG. 2. Detection of L180 and L180M mixtures. Artificial mixtures of an HBV genome with a wild-type sequence and another one with a mutation at position rt180 (L180M). (A) Total HBV genome copy number for 107/ml of serum. (B) Total HBV genome copy number for 106/ml of serum. (C) Total HBV genome copy number for 105/ml of serum. (D) Total HBV genome copy for 104/ml of serum. It/b is the ratio between the median intensity fluorescence signals of probes in a tile that correctly identify a base and the median background in the area of the chip where these probes are located.
|
Detection of mutations. All 153 samples which could be amplified and tested with the chip had a reference sequence in the Pol and S open reading frames (ORFs), 75 in the precore gene, and 46 in the BCP. Only 89.4% of the 12,681 codons analyzed in that panel had a sequence reference result. For the remaining 1,346 codons, sequencing gave an indeterminate result. A total of 11,335 codons were thus used for the comparison between the two techniques. Table 3 shows that the overall concordance between results found by sequencing and by the chip is 92.8%, with no significant variation according to the ORF. A large majority of the 815 discrepant results are due to the fact that no tile met the criteria of the selection algorithm described above for the chip because of weak hybridization.
|
View this table: [in a new window] |
TABLE 3. Overall concordance between the chip and sequencing
|
|
View this table: [in a new window] |
TABLE 4. Comparison of results for the chip and sequencing in determining the major nucleotide positions in RT, S, pre-core and BCP
|
Twenty-four positions in the BCP, 12 in the precore region, and 11 in the core gene can be analyzed by the chip, corresponding respectively to 42, 28, and 29 mutations (Table 1). In the precore ORF, mutations at two amino acid positions (28 and 29) are known to affect HBV pathogenesis, especially the stop codon mutation at position 28. Results obtained at these positions are shown in Table 4.
At positions 1762 and 1764 of the BCP, 24 specimens were identified to be the wild type (A1762 and G1764), as determined by sequencing. In two other samples, a mixture of wild-type and mutant bases (G and A) were detected by sequencing. Twenty specimens were determined to have the double mutation A1762T and G1764A. The chip correctly identified only three of these samples, due to weak hybridization.
Apart from codons rt204 and s145 and positions 1762 and 1764, for which hybridization problems due to unexpected polymorphism or to the simultaneous presence of two mutations were encountered, the number of indeterminate results was higher for sequencing (n = 29) than for the chip (n = 13). Twelve mixtures were detected by sequencing only and 11 by the chip only.
|
|
|---|
As a consequence, HBV genome polymorphism analysis may be a major tool both for clinicians to monitor infection and treatment and for researchers to better understand viral genome variability and its consequences for disease outcome. The small size of the HBV genome is compatible with a strategy using whole-genome PCR amplification for the detection of sequence polymorphisms. We demonstrate in this study that a PCR protocol can be optimized and a genotyping tool designed to identify polymorphisms in 151 positions along the HBV genome, consisting of 245 mutations, 20 deletions, and 2 insertions, in a single test. Considering the high quantity of information that can be provided by a single test from one patient specimen, the short turnaround time, and limited hands-on time, there are strong advantages conferred by DNA chip technology. Obtaining the same information by sequencing would require the use of approximately 16 electrophoresis gels, substantially longer hands-on time at the bench, and analysis of electrophoregrams. Similarly, the number of nitrocellulose strips required to analyze 245 mutations would preclude the use of reverse-hybridization techniques for the extensive genotyping of viruses.
To efficiently amplify the whole 3.2-kb HBV genome, we designed two primer sets to generate two subgenomic fragments, one 1,509 bp and the other 1,721 bp in length. Using this protocol, we were able to amplify HBV DNA and subsequently obtain an interpretable chip result in 93.8% of samples with a detectable viral load. Ninety-five percent of samples with >1,000 copies/ml could be amplified, and only two samples with >10,000 copies/ml failed to be amplified. This analytical sensitivity is similar to the sensitivity generally admitted for sequencing and to that reported for INNO-LiPa (Innogenetics, Belgium) (1). In this study, only those samples that could be amplified using the PCR strategy and whose amplicons were sequenced were selected for chip testing. Nevertheless, 10.6% of the codons of viral genomes amplified from these samples could not be determined by sequencing. Given the extent of viral-load reduction achieved following efficient treatment, a PCR protocol with a sensitivity of 103 HBV genome copies/ml may be considered sufficient for practical purposes in the clinical context.
Based on a comprehensive HBV sequence database, amplification primers were selected from regions that showed high conservation among genotypes. An additional forward primer has also been included to accommodate genotype B-specific polymorphisms and more precisely amplify the 1,509-bp amplicon from this subtype. We did not detect any significant difference in amplification efficiency among genotypes A to F, although the non-European genotypes (B, C, E, F, G, and H) were poorly represented in our panel of samples.
Only 3 samples among 151 gave a genotype result that was discordant between sequencing and the chip. Among them was one that had a viral load of <200 copies of the HBV genome/ml. Thus, the performance of the chip reagent for HBV genotype determination can be considered similar to that by sequencing, which is so far the most widely used and is considered the gold standard approach to polymorphism analysis of viral genomes.
Overall, 11,335 codons have been determined by both sequencing and the chip. The correlation between the two techniques is 92.8%, with no significant difference for different regions of the genome. This result is slightly higher than the 88.9% concordance between INNO-LiPa and sequencing described by Lok et al. (26). The degree of concordance between the chip and sequencing would probably have been even higher upon retesting, which was not possible due to the distribution of discrepant codons among all samples. Although we have no data on the reproducibility of the sequencing protocols used in the different participating laboratories, our experiments have shown the chip approach to be highly reproducible.
As shown in this work, the chip is able to correctly identify wild-type and mutant codons. The photolithography technology developed by Affymetrix to produce high-density chips used in this study requires the manufacturing of dedicated masks, which are used for the synthesis of probes needed for the detection of specific target sequences. This manufacturing process is complex and does not allow frequent upgrades of the chip. Therefore, a prototype chip that could be compatible with PCR products generated from the optimized PCR protocol and that could identify sequence polymorphisms in positions of the amplicons that are difficult to resequence had to be synthesized. We were able to identify positions where difficulties were encountered, using this prototype chip. The genomic sequence of HBV, like that of human immunodeficiency virus, is highly polymorphic. The strategy chosen to consider natural polymorphism was to design probes with all the bases that are significantly represented in sequence databases, providing they are not distant from the central base of a wild-type codon or from a base leading to a mutation of interest by more than four nucleotides. However, at some positions, mismatches that are under-represented in the sequence database may affect hybridization, as exemplified by results obtained at position s145, for which weak hybridization efficiency was observed. This weakness is attributed to a polymorphism at the third base of codon rt144 (T to C), which is present in a significant proportion of viruses but was not retained for probe design, resulting in a systematic mismatch between the probes and target DNA. Adjunction of tiles containing probes with the T to C polymorphism at the third base of codon rt144 in a future version of the chip should correct this problem. Polymorphisms located more than four bases away from the resequenced base may also affect hybridization efficiency, as seen at position rt204, where weak hybridization is attributed to the presence of a natural polymorphism at the third base of codon rt202. Genotypes A and G contain a C (cytosine) at that position, whereas genotypes B, C, D, E, and F contain a T. This polymorphism is located more than five bases away from the central base of the wild-type codon ATG, specifying methionine, and more than six bases away from the third base of codon ATT, specifying isoleucine. Thus, probes that have been designed to resequence those two codons contain only a C as the third base of codon 202, introducing a mismatch with amplicons amplified from viruses with a T at that position. Although a polymorphism more than four bases away from the base of interest usually does not affect hybridization, in the case of position 204 of RT, it results in reduced hybridization efficiency and potentially prevents the chip from identifying an amino acid. All but one of the 20 discrepant samples did not belong to genotype A. In the next version of the chip, the region where polymorphisms are considered should be extended beyond four bases. We have also shown that linked mutations at close positions, like those occurring at positions 1762 and 1764 of the BCP, are difficult to detect if probes that hybridize to each of them are not simultaneously present on the chip.
Our study has shown that the chip reagent is able to identify samples with mixed viral populations. Using artificial mixtures, we have shown that for mixtures that carry >10,000 copies of the HBV genome/ml, the chip can detect L180M variants when they represent at least 30% to 40% of the whole viral population. This is better than the performances usually admitted for sequencing techniques (which can identify minority variants, if present, at about 50% of the total population). Other hybridization-based techniques (INNO-LiPa or low-density chips) can detect minority variants representing <10% of the viral population (22, 29). However, these reagents are able to analyze resistance mutations at positions rt180, rt204, and rt207 only, and the number of probes for each positions is limited, which makes them sensitive to hybridization problems due to polymorphisms.
The technique described in this study is intended to be used as a diagnostic as well as a research tool. Its protocol has been designed to be as simple as possible, and thus, amplification is performed on DNA directly extracted from serum samples. As a consequence, it detects individual polymorphisms only and does not provide any information on the linkage between multiple mutations that may be borne by the same HBV genome. It is nonetheless possible to obtain such linkage information using the chip if PCR cloning of constituents of the viral quasispecies present in the specimen is applied before submission for hybridization to the chip.
Finally, we had access only to samples characterized by sequencing in the S, Pol, precore, and core genes and the BCP. Although detection of polymorphisms in these coding regions is of major interest for the clinical management of patients, further studies are required to analyze the performance of the chip in probing for sequence polymorphisms in the X, pre-S1, pre-S2, and core genes.
To obtain maximal hybridization efficiency, it is very important to build and update high-quality sequence databases containing all naturally occurring polymorphisms. Similarly, the list of variants of interest for antiviral resistance or vaccine or diagnostic escape and for their impact on the course of the disease needs to be as complete as possible, including variants that have been found in vitro only. High-density chips produced by photolithography, like those used in this study, can accommodate the large number of probes required to cover all these polymorphisms. Improvement of hybridization at difficult positions, as well as the introduction of newly described polymorphisms, requires updates of chip design. The frequency with which chips are updated would depend on how well the updated chips performed and on the cost of each upgrade.
Although hybridization at difficult positions needs to be improved and newly described mutations have to be included in a new version of the assay, our data suggest that this prototype chip can already be used for clinical and research purposes. It can detect simultaneously in a single test a large number of clinically relevant mutations located along the whole genome of HBV and therefore constitutes a novel and valuable diagnostic tool for the management of chronic hepatitis B.
This work was supported by grants from the European Community (HepBvar, contract QLRT-2001-00977, and ViRgil, contract LSHM-CT-2004-503359).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»