Previous Article | Next Article ![]()
Journal of Clinical Microbiology, November 2006, p. 4149-4156, Vol. 44, No. 11
0095-1137/06/$08.00+0 doi:10.1128/JCM.01230-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Laboratory Medicine, University of Washington Medical Center, Seattle, Washington 98195,1 Program in Infectious Diseases, Fred Hutchinson Cancer Research Center, Seattle, Washington 981092
Received 15 June 2006/ Returned for modification 29 August 2006/ Accepted 6 September 2006
|
|
|---|
|
|
|---|
Initially, genotypes 1 and 2 were identified by the sequencing of multiple samples in the HCV core region (6, 16, 23). Analysis from samples worldwide and sequencing of additional genome areas including the 5' untranslated region (UTR), envelope, and NS5 regions identified genotypes 3, 4, 5, and 6. Sequence variation between genotypes, subtypes, and individual strains is greatest in NS5, less in the envelope and core, and least in the 5' UTR.
Direct sequencing is the most accurate method for HCV genotyping. However, many other methods have been used because of the expense and technical difficulties of direct sequencing. The most frequently used methods in clinical laboratories are LIPA (line probe assay) and sequencing of the 5' UTR (both from Bayer Diagnostics). Both assays require two steps: generation of the PCR amplicon and then evaluation of the amplicon by either hybridization or sequencing. An additional disadvantage is the use of the 5' UTR, which is less informative for genotyping than are other, more variable regions of HCV.
In contrast, type-specific PCR assays require only a single amplification step and thus should be technically simpler. The first type-specific genotyping assay was described by Okamoto et al. (24). The assay utilized a first-round amplification of a large section of the core region, followed by a set of second nested amplification reactions each using an identical 5' primer but one of four subtype-specific primers (for genotypes 1a, 1b, 2a, and 2b). The method of Okamoto et al. detected the four different-sized amplicons by gel electrophoresis. A similar method was described by Chayama et al. (4) in the NS5 region. Larger-scale studies by Okamoto and others with their method demonstrated a significant lack of specificity between the type I (1a) and II (1b) reactions. Improved versions of the assays were described by Okamoto et al. (25), Holland et al. (15), Forns et al. (11), and Ohno et al. (22), which improved specificity and expanded the primer-probe sets to include the newly identified genotypes 4, 5, and 6. Several studies compared these type-specific assays to a variety of other genotyping methods (11, 13, 17, 19). In each of these studies, results from the type-specific assays had good agreement with other methods but had somewhat high rates of mixed samples and samples that did not amplify. Recently, several methods have been described that utilize single-step real-time reverse transcription-PCR (RT-PCR) with target-specific TaqMan (20, 28) or LightCycler (3, 31) probes or SYBR green detection (12), further streamlining the method. Some of these real-time methods were designed only to distinguish between genotype 1 and non-1 types, and none have been evaluated with large numbers of samples.
Reagents for a multicolor three-tube real-time RT-PCR assay for HCV genotyping have recently become commercially available (Abbott HCV Genotype ASR). We describe here the results from a large study designed to evaluate the performance of the real-time RT-PCR HCV genotype assay compared to our existing restriction fragment length polymorphism (RFLP) and core sequencing methods.
|
|
|---|
Sample extraction and RNA purification. Serum or EDTA plasma samples were extracted using the Roche MagNA Pure LC instrument with the MagNA Pure LC Total Nucleic Acid Isolation kit (large volume), according to the manufacturer's instructions. The initial sample volume was 1.0 ml, and the elution volume was 80 µl (12.5x concentrated). For the real-time assay sensitivity experiment, serial 1:4 dilutions of serum in diethyl pyrocarbonate-treated water were made and then each dilution was extracted individually.
RFLP and Abbott real-time assay analysis. The RFLP assay was performed on amplicon product from the 5' UTR (8). The real-time assay consisted of three reactions, each of which contained three probes. The first reaction contained internal control primers-probes designed to detect all HCV genotypes (total HCV quantitation) and primers-probes for genotypes 1a and 1b. The second reaction contained primers-probes for genotypes 2a, 2b, and 3. The third reaction contained primers-probes for genotypes 4, 5, and 6. The genotype 1a and 1b reactions amplified sequences within the HCV NS5 region, while the other reactions amplified sequences within the 5' UTR. RT-PCR amplification was done for 50 cycles on an ABI 7000 instrument, and the raw data file was analyzed using Sequence Genotyping Software, v2.0 (Celera Diagnostics, Alameda, CA). Initially, the presence of two or more genotype reactions having cycle thresholds (CTs) within three cycles of each other was taken as evidence of mixed infection. Samples positive with the total HCV reaction but negative for all genotype-specific reactions were considered indeterminate. Raw amplification data were exported into Excel spreadsheets and analyzed using SAS statistical software (SAS Institute Inc.) and GraphPad Prism statistical/graphical software, v3.03 (GraphPad Software, Inc.). Genotype-specific amplification efficiencies (E) were calculated from the CT values of serial dilutions (E = 10slope 1) (29). Theoretically perfect replication efficiency for a PCR using this calculation would be 100%. For some highly efficient reactions the calculated amplification efficiency may slightly exceed 100%, presumably due to minor variations in serial dilutions.
HCV sequencing. Sequencing was performed on most samples by amplifying the core region by nested RT-PCR using primer sets described by Bukh et al. (2). The resulting 350-bp amplicon was sequenced using the ABI Big Dye terminator kit and an ABI 3730 sequencer. Sequence analysis was done with Seqscape software, and the genotype was assigned by matching with a library of 186 HCV core region sequences. The genotype assignment for each sequence in the library was confirmed by comparison with the Los Alamos database and by the generation of a phylogenetic tree using PHYLIP. A small subset of the samples were sent to Abbott Molecular's Research and Development group and sequenced in the NS5b region.
Statistical methods. Generalized estimating equations were used to test whether mean differences between CTs for the total HCV quantitation reaction (quantitative CT) and the individual genotype-specific reactions (genotype CT) were significantly different from zero. These models account for any correlation among samples belonging to the same laboratory run. This method was also used to compare the mean differences for the genotype 1b test to the mean differences for the other genotype-specific tests. We adjusted for multiple comparisons using the Bonferroni correction (10). The relative efficiencies of each genotype were calculated by subtracting the genotype-specific CT from the total HCV CT for samples with a single genotype amplification only. In an effort to compensate for differing relative efficiencies when comparing genotype-specific CT values, adjustments to the genotype CT based on these relative efficiencies were considered. Depending on the genotype, either a constant adjustment, no adjustment, or an adjustment based on the relationship between the relative efficiency and the genotype CT (using generalized estimating equations) was explored. Only samples with a single genotype amplified were used in developing these modifications. Sufficient numbers of samples were present for the calculations to be valid for the genotype 1a, 1b, and 3 groups across the entire range of CT values seen. However, an insufficient number of samples of low quantity of genotypes 2a, 2b, and 4 were available, so adjustments could not be accurately calculated in this range and thus were restricted to samples with greater quantities. Adjustments were then applied to samples with multiple genotype amplifications but only for CT data within the range of the data used to generate the adjustments mentioned above. This last restriction impacted genotype 2a and 4 samples most heavily, where more than half of the signals were out of range and thus could not be adjusted.
|
|
|---|
To evaluate the reproducibility of the assay, a pooled positive-control serum, containing a mixture of genotypes 1a, 1b, 2a, 2b, and 3, was tested in 64 sequential runs. All 64 runs had detectable amplification signals for the total HCV, 1a, 2b, and 1b reactions. The mean CTs for the total HCV, 1a, 2b, and 1b reactions were 27.2, 30.1, 33.5, and 37.6, respectively (coefficients of variation for the CTs were 3.9%, 5.2%, 4.2%, and 6.4%, respectively). In contrast, genotype 3 was amplified in only 48/64 runs and genotype 2a in only 24/64 runs, presumably because of the lower quantities of these genotypes in the pooled serum. The mean CTs for genotypes 3 and 2a in the pooled control serum were 45.3 and 45.8, respectively (coefficients of variation for the CTs were 5.7% and 5.9%, respectively).
(ii) Amplification cross-reactivity. To assess the possible cross-reactivity of the genotype-specific primer-probe sets, we evaluated the raw data from all samples submitted to the University of Washington molecular virology laboratory for routine testing over a 9-month period, plus the group 1 and group 2 samples described in Materials and Methods. Of the 1,509 total samples with positive genotype amplification (defined as a CT of <45), 987 (65.4%) had amplification with a single genotype-specific primer-probe set, 492 (32.6%) had amplification with two genotype-specific primer-probe sets, 29 (1.9%) had amplification with three genotype-specific primer-probe sets, and 1 sample had amplification with four specific primer-probe sets. Samples with two or more amplifications had a variety of genotype combinations (Table 1). Of samples categorized as genotype 1a, 44.2% also had amplification with the genotype 1b primer-probe set at a CT of <45 (Fig. 1A). However, the 1b amplification was much weaker than the 1a amplification in these samples; in less than 1% were the CTs for the 1a and 1b amplifications within three cycles of each other. Conversely, of samples categorized as genotype 1b, 15.1% had a weaker 1a signal (Fig. 1b). Similar cross-reactivity was seen with the genotype 4 primer-probe set, with 27.7% of the 2b samples, 9.5% of the 2a samples, and 1 of the 12 genotype 6 samples (8.3%) showing weak amplification of genotype 4. If these weaker amplification reactions were due to the presence of a second virus strain with a different genotype, the overall frequency of mixed samples would far exceed that reported in the literature. Thus, the more likely interpretation is that multiple signals can result from false priming or the unexpected presence of matching sequence, especially for the 1b, 1a, and 4 primer-probe sets. This may significantly confound the interpretation of possible mixed-sequence samples.
|
View this table: [in a new window] |
TABLE 1. Frequency of multiple genotype signals
|
![]() View larger version (39K): [in a new window] |
FIG. 1. Scatter distribution of CTs for samples with multiple amplification signals. Each sample is represented along the x axis, ordered from lowest to highest total HCV CT value. At least three data points are shown for each sample: the CT of the total HCV reaction, the CT of the major genotype reaction, and the CT of any additional amplification signal(s) present. (A) Data from samples initially assigned to genotype 1a (n = 514); (B) data from samples from non-1a genotypes (n = 231); (C) corrected CT data from samples initially classified as genotype 1a (n = 296).
|
|
View this table: [in a new window] |
TABLE 2. Comparison of genotype methods: clinical resultsa
|
|
View this table: [in a new window] |
TABLE 3. Characterization of RFLP 2a genotype samples
|
Detection of mixed infections. (i) Possible mixed genotypes as determined by real-time RT-PCR versus sequencing. As noted above, over 30% of specimens showed some degree of amplification with more than one of the genotype-specific primer-probe sets. Although the manufacturer does not provide specific criteria for classifying samples as mixed infections, it has been suggested that the presence of two amplification signals within 3 CTs of each other may indicate the presence of multiple genotypes. During the initial validation and subsequent routine testing, 23 samples (1.4%) were categorized as mixed infections by these criteria (Table 4). Core sequencing did not show the presence of a second sequence for any of the 23 samples, although our sequencing method can detect a second genotype when present at 10 to 20% of total HCV (unpublished). Surprisingly, the genotype amplification with the fastest CT (primary signal) did not always match the genotype determined by sequencing. The most common additional signal not seen by sequencing was 1a (n = 9), followed by 4 (n = 8) and 1b (n = 3).
|
View this table: [in a new window] |
TABLE 4. Mixed-genotype sample results
|
CT = CTtotal HCV reaction CTgenotype-specific reaction). Thus, if the efficiency of the total HCV reaction approximated that of the genotype-specific reaction, the mean for the
CT should approximate zero. Only data from the genotype reaction used to assign the sample genotype were included in this analysis, while mixed-genotype samples, samples with indeterminate genotype, and negative samples were excluded.
The genotype-specific reactions varied widely in their
CTs (Fig. 2). The mean
CT for genotype 4 was 2.09, indicating that this amplification occurred more efficiently than the total HCV reaction. The genotype 2a and 2b reactions did not differ significantly from 0 (mean
CT of 0.68 and 0.04, respectively). Interestingly, the genotype 2b data had a bimodal distribution (mean
CT of 1.8 and 2.3). Several reactions were significantly less efficient than the total HCV reaction: genotype 1a (mean
CT of 1.73), genotype 3 (mean
CT of 3.37), and genotype 1b (mean
CT of 7.78). Genotype 6 had a mean
CT of 3.79 but due to low sample numbers could not be tested for a statistically significant difference.
![]() View larger version (21K): [in a new window] |
FIG. 2. Scatter distribution of CT (CTtotal HCV reaction CTgenotype-specific reaction) for each sample result with a single genotype amplification, displayed by sample genotype; each point represents one sample. The short horizontal lines represent the mean CT for each group. Means for genotypes 1a, 1b, 3, and 4 were all significantly different from zero (P < 0.01), and the genotype 1b mean CT was significantly different from all the other means (P < 0.01). Means for genotype 2a and 2b were not significantly different from zero, while there were insufficient data for testing genotype group 5 and 6 means.
|
CTs for the genotype-specific reactions varied widely, we considered the possibility that the criterion of two amplification signals within 3 CTs might not be appropriate for mixtures of genotypes detected by reactions with greatly different amplification efficiencies. We therefore evaluated whether it might be possible to correct the observed CTs based on the relative efficiencies of each genotype-specific reaction. Among samples with a single amplification of genotype 1b only, an association was noted between the genotype 1b CT and the
CT such that later genotype CT values corresponded to lower
CT values (Fig. 3). Similar associations were observed for genotypes 1a and 3 (data not shown). For these genotypes, the regression line defining the relationship between the genotype CT and
CT was used to calculate a "corrected CT" for samples with multiple signals that met the analysis criteria (Table 5). For genotypes 2a, 2b, and 4, the
CT remained relatively constant regardless of the genotype CT, so simple shifts based on the mean
CT were used for genotype CT adjustments. Genotypes 5 and 6 were not evaluated due to low sample numbers.
![]() View larger version (19K): [in a new window] |
FIG. 3. Difference between total HCV reaction CT and genotype 1b reaction CT plotted against genotype 1b CT, for samples having no other genotype-specific reactions with a CT of <45 (n = 159). The calculated regression line is shown.
|
|
View this table: [in a new window] |
TABLE 5. Adjustments to genotype CTs
|
|
View this table: [in a new window] |
TABLE 6. Comparison of genotype results from two classification schemesa
|
|
|
|---|
We used the
CT calculation from a large number of samples to measure the efficiency of each of the genotype-specific amplification reactions. The substantial spread of the
CT for some of the reactions leads to the hypothesis that within a given genotype there may be sequence polymorphisms leading to better or worse matching with the primers and probes, and thus genotypes with wide spreads in
CT data may have more base pair variations in the amplification region than those with
CTs tightly clustered around a single mean. This is especially true for the 1b primer-probe set, where both the spread and overall assay efficiency may reflect multiple mutations in the primer-probe binding sites located within the NS5 region.
A second type of pattern seen with the genotype 2b data was a bimodal distribution (mean
CT of 1.8 and 2.3). This may indicate two different subpopulations of genotype 2b samples with slightly different sequences within either the 2b or the total HCV amplicon region, leading to different amplification efficiencies within the group. Sequencing studies are currently under way to determine the explanation for these two subgroups within the 2b group.
The amplification reactions were also used to evaluate possible "cross-reactive" amplification by a primer-probe set not specific for the genotype present in the sample. False amplification was frequently observed, especially with the 1b, 1a, and 4 primer-probe sets. Similar "cross-reactive" patterns have been reported with other type-specific genotype amplification methods (11, 13, 15, 17, 19). In the majority of samples, the correct genotype amplification was sufficiently stronger than the cross-reactive amplification so the correct genotype could be determined. Some samples had two amplifications of approximately equal quantity (CT within 3), and these may demonstrate an increased sensitivity of the real-time method to mixed infections.
Mixed-genotype infections in small numbers of samples have been reported in many studies, with higher mixed rates in multiple-exposure groups such as hemophiliacs, patients on chronic hemodialysis, and injection drug users (1, 27, 32). Detection rates also vary depending on the method used, and in studies where multiple methods are used, mixes seen with one method may not be confirmed by a second method (14, 32). Direct sequencing is a relatively insensitive method to detect minor populations, capable of detecting mixes only if the smaller population is at least 10 to 20% of the total, and has been shown to have significant variation in detection in different laboratories (18). The early Line-Probe assay versions fail to distinguish subtype 1a from 1b, thus making them incapable of detecting the most common mixed sample.
Using the real-time assay, we identified a rate of mixed infections somewhat lower than that reported using other genotype-specific PCR assays (21, 32), but unlike our study, these previous studies focused on multiply exposed groups. None of the mixes identified in our study were confirmed by the core sequence assay. However, since the real-time method is quantitatively linear over 6 to 7 logs, it presumably has much greater sensitivity to detect mixes. For example, it might be possible to detect 10 IU/ml of one genotype while detecting 1 x 106 IU/ml of a second genotype, thus detecting a mix at the extremely low 1:100,000 ratio. This contrasts with the current "gold standard" in detection of mixes, the cloning of individual virus sequences, where more than 100,000 clones would have to be sequenced to detect a similar ratio. For the real-time method to fulfill this potential, however, significant improvements will need to be made in the primer-probe specificity.
One unique advantage of the real-time method is the ability to evaluate the efficiency of each genotype-specific amplification. The analysis that we performed clearly demonstrated unequal efficiencies for the different genotype-specific and total HCV amplifications, and this variability creates a major difficulty in the effective use of the CT values. We attempted to overcome this with a mathematical adjustment of the efficiency for inefficient reactions. Unfortunately, adjusting the CT values in this way was not informative, due to the extensive cross-reactivity of certain genotype-specific reactions. Clearly, the real-time assay can be improved by creating primer-probe sets with better specificity and more equal amplification efficiencies.
Published ahead of print on 20 September 2006. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»