Previous Article | Next Article ![]()
Journal of Clinical Microbiology, March 2003, p. 1080-1086, Vol. 41, No. 3
0095-1137/03/$08.00+0 DOI: 10.1128/JCM.41.3.1080-1086.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Roche Molecular Systems Inc., Alameda, California,1 Département de Microbiologie et Infectiologie, Hôpital Notre-Dame du Centre Hospitalier de l'Université de Montréal,2 Département de Microbiologie et Immunologie, Université de Montréal, Montréal, Québec, Canada;,3 BMI, London, United Kingdom,4 Department of Pathology, CSU Bellvitge/Lab. Recerca Translacional, Institut Catala d'Oncologia, Barcelona, Spain5
Received 24 May 2002/ Returned for modification 26 August 2002/ Accepted 9 December 2002
|
|
|---|
|
|
|---|
Because of the genetic polymorphism of HPV, consensus PCR assays have been utilized to amplify in one reaction the majority of known, as well as novel, anogenital HPV genotypes. The most widely used primer sets, the MY09/MY11/HMB01, GP5+/GP6+, PGMY09/PGMY11, and SPF1/SPF2 consensus primers, target conserved sequences in the HPV L1 gene (1, 7, 12, 15, 18, 20, 23). To render the consensus PCR assays more feasible for large-scale testing and clinical application, convenient assays for the detection and typing of HPV have been developed for all primer sets. HPV amplicons generated by PGMY primers can be easily detected and typed by a nonisotopic reverse hybridization assay, the line blot (LB) assay (4, 13). Recently, a colorimetric microtiter plate-based enzyme immunoassay was also reported for screening of the broad spectrum of HPV amplified by the PGMY primers using a generic probe mix (21).
The proficiency of microbiology laboratory testing is generally monitored by proficiency-testing programs that also allow the determination of test variability between laboratories (32). Implementation of proficiency testing panels is essential for unregulated molecular diagnostic tests. Proficiency panels have been developed for the molecular diagnosis of several infectious agents (25, 32) and in research settings to monitor the performance of molecular virology laboratories (17). Thus far, no proficiency panel has been constructed and made available to the general research community for HPV testing. The validity of HPV DNA detection and typing with PCR assays has not been thoroughly assessed.
Epidemiological studies and vaccine clinical trials require the reliable and reproducible identification of genital HPV infection. Several studies have evaluated the intermethod variation of HPV DNA detection (2, 11, 14, 19, 22, 26, 28, 30, 33). However, few studies have evaluated the intralaboratory and interlaboratory reproducibility of L1 consensus PCR assays although these assays have been widely used (6, 16, 19). The latter studies were conducted with in-house reagents and protocols. Although the consensus L1 PCR assays are not commercially available, standardized reagents and protocols for the PGMY-LB assay are available from Roche Molecular Systems for research purposes. The use of standardized and reproducible protocols for HPV detection and typing could facilitate the comparison of results between studies on HPV infection.
In order to assess the accuracy and reproducibility of the PGMY-LB assay, three laboratories were invited to participate in a collaborative study to compare their abilities to detect the presence of and to genotype HPV DNA in blinded specimens. We report here the estimate of intralaboratory and interlaboratory reproducibility of the PGMY-LB assay for HPV DNA detection and typing on 109 specimens tested in triplicate by three independent laboratories under standardized conditions. This information is useful in view of the wider use of the PGMY-LB assay by numerous research groups, as well as its potential application in diagnostic laboratories for HPV detection and typing.
|
|
|---|
Of the 109 samples included in the proficiency-testing panel, 29 were HPV-negative specimens (10 buffer controls and 19 genital samples) and 80 samples contained HPV DNA. The samples containing HPV DNA included 47 genital samples with one HPV type, 10 synthetic samples with low HPV DNA copy numbers, 10 synthetic samples with high HPV DNA copy numbers, 3 synthetic samples containing multiple HPV types, and 10 genital samples containing, in addition to the genotype(s) consistently detected in repeated PGMY-LB assay runs, at least one type that was not consistently detected. The latter 10 samples were tested four times by Roche Molecular Systems: an additional type was detected in one out of four runs (two samples), two out of four runs (seven samples), or three out of four runs (two samples). One sample contained two HPV types that were inconsistently detected in the four runs. HPV genotypes that were inconsistently detected in these 10 samples included types 42 (in three samples), 31 and 58 (in two samples each), and 16, 35, 33, and 84 (in one sample each).
In addition to the 10 samples containing HPV-16 plasmids and 10 samples containing HPV-45 plasmids, there were a variety of HPV genotypes represented among the 60 HPV-positive samples. They included types 16 (in 10 samples), 18 and 51 (in 5 samples each), 31, 45, 52, 59, and 66 (in 4 samples each), 33, 39, 42, and 54 (in 3 samples each), 56, 68, 83, and 6 (in 2 samples each), and 35, 53, 58, 82, 84, and 73 (in 1 sample each). Of the three samples with multiple types, one contained types 16 and 31, one contained types 16, 18, 52, and 73, and another one contained types 16, 35, 42, and 82. The results from the detection of ß-globin were not considered and compared in this work.
Study design. Three laboratories with different levels of experience with molecular diagnostic methods participated in the study. Laboratory A had experience in human genetic testing but was unfamiliar with molecular virology tests. Laboratory B was a diagnostic molecular microbiology laboratory specifically trained to perform the PGMY-LB assay. Laboratory C was a diagnostic cytopathology laboratory familiar with commercialized molecular diagnostic techniques but unfamiliar with the PGMY-LB assay. Standardized reagents comprising the PCR master mix, LB strips, and reagents were sent to each participating center along with a written standard operating procedure for the PGMY-LB assay. Each participating center was asked to follow the protocol without modification.
Three 20-µl aliquots from each panel sample were coded and distributed on dry ice by Roche Molecular Systems to each of the three independent laboratories. In the test panel, the HPV-negative or buffer controls were distributed randomly among the HPV-positive specimens. Five microliters of each sample was used for HPV testing with the PGMY-LB assay. Each laboratory tested, in different PCR runs, each of the 109 samples three times, blinded to the results obtained from Roche Molecular Systems, from the other laboratories, and from previous runs. Laboratories A and B also tested PGMY amplicons with the generic probe microplate assay. The study testing was conducted between June and August 2000.
PGMY-LB assay. HPV DNA was amplified in each center under standard conditions with the L1 consensus HPV PGMY09/PGMY11 primer set, as previously described (12). The amplification mixture contained 4 mM MgCl2, 50 mM KCl, 7.5 U of AmpliTaq Gold DNA polymerase (Applied Biosystems), 200 µM concentrations each of dATP, dCTP, and dGTP, 600 µM dUTP, 100 pmol of each biotinylated PGMY primer pool, and 5 pmol each of the 5'-biotinylated ß-globin primers GH20 and PC04. HPV was amplified with the ultrasensitive profile that consisted of activation of AmpliTaq Gold at 95°C for 9 min, denaturation for 1 min at 95°C, annealing for 1 min at 55°C, and extension at 72°C for 1 min for a total of 40 cycles. Amplification was followed by a 5-min terminal extension step at 72°C. Laboratory A used a Biometra Uno II thermocycler, laboratory B used a TC 9600 thermocycler, and laboratory C used an MJ Research PTC-1 thermocycler. Measures to avoid false-positive reactions due to contamination were strictly adhered to. HPV genotyping was performed with the reverse LB detection system as previously described (13). PCR products were denatured in 0.4 N NaOH and hybridized to an immobilized probe array containing probes for 27 HPV genotypes (types 6, 11, 16, 18, 26, 31, 33, 35, 39, 40, 42, 45, 51, 52, 53, 54, 55, 56, 57, 58, 59, 66, 68, 73, 82, 83, and 84). Positive hybridization was detected by streptavidin-horseradish peroxidase-mediated color precipitation on the membrane at the probe line.
Generic probe microplate assay. Generic probe detection reactions were performed by using reagents from a PCR-ELISA DIG detection kit (Roche Molecular Biochemicals) as described previously (21). Twenty microliters of denatured PCR products was added to the streptavidin-coated microtiter wells, followed by the addition of 200 µl of hybridization buffer provided by Roche and 20 µl of the denatured generic probe pool. Roche synthesized the digoxigenin-labeled generic probe pool by amplification of DNA from HPV types 11, 16, 18, and 51 as described previously (21). Hybridization was performed at 37°C for 1 h. Following color development, absorbance was measured at 405 nm and the background, defined as the average value of blank cells containing no PCR product, was subtracted from all values. A specimen was considered positive if the corrected A405 was greater than 0.5, negative if the value was less than 0.2, and indeterminate in the range between 0.2 and 0.499.
Statistical methods. To ensure an independent evaluation of the reproducibility of the assay, statistical analyses were performed by a scientist (F.C.) without financial interests in Roche Molecular Systems. Results from the three laboratories were imported into a common database for comparison (Microsoft Excel). Results obtained by Roche Molecular Systems (the reference laboratory) were considered the "gold standard" (HPV DNA reference standard). Agreement for overall HPV positivity (HPV DNA positive irrespective of types identified) and for type-specific positivity was calculated as the percentage of runs with identical results for the presence or absence of HPV. Cohen's unweighted kappa statistic was calculated to adjust for chance agreement between sites or between sites and HPV DNA reference standard results (9). In general, a kappa value of >0.75 indicates excellent agreement beyond chance, a kappa value between 0.40 and 0.75 indicates fair to good agreement, and a kappa value of <0.40 represents poor agreement.
The reproducibility of repeated PCR assays (intralaboratory reproducibility) was calculated for each site by comparing results from triplicate runs and calculating the crude percent agreement and the percent agreement considering only HPV-positive results. The intralaboratory reproducibility for the presence or absence of each of 27 genotypes was first calculated by using crude typing results. Since 10 samples contained HPV types that were not consistently detected between runs by the reference laboratory, agreement was recalculated by not considering the presence or absence of these additional types (see above).
Interlaboratory agreement (interlaboratory reproducibility) was assessed by pairwise comparisons of test results from the three laboratories calculating the crude percent agreement and the kappa statistic. Results obtained by the reference laboratory were not considered in these comparisons. To avoid combining intralaboratory and interlaboratory variability in the comparisons of the three laboratories, two strategies of analysis were considered. Laboratories were first compared by considering the HPV DNA and typing results obtained on the initial run for each sample in each laboratory. This strategy allows evaluation of interlaboratory reproducibility when samples are only tested once, as is often the case. Laboratories were also compared by using a consensus definition of HPV results: the final HPV result for a specimen was that obtained from at least two out of three runs for the presence of HPV DNA and HPV typing. This definition favors a higher degree of concordance between laboratories but takes into account triplicate results for each sample.
Accuracy of HPV DNA detection and HPV typing was assessed by pairwise comparisons in contingency tables of each laboratory with the HPV DNA reference standard results, using the two-sided McNemar chi-square analysis for matched-pair data. Agreement was determined first by considering results of the first aliquot for each sample and then by considering the consensus HPV results for each sample as explained above. Since the reference laboratory did not consistently detect a type in 10 samples, agreement was recalculated by not considering the presence or absence of these additional types as a discordant result. Results from laboratories A and B for the generic probe microplate assay were compared for HPV DNA positivity with results obtained with the LB assay. Indeterminate results were excluded from the calculation of agreement.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Intralaboratory reproducibility of PGMY-LB for HPV DNA detection and HPV typing on triplicate testing of 109 samples by three laboratories
|
In order to estimate the intralaboratory agreement for HPV DNA typing, we compared the results of triplicate testing of 109 samples for 27 genotypes (2,943 type-specific results per center) (Table 1). Although the number of discordant triplicates was greater for typing results than HPV DNA detection in all laboratories, agreement reached at least 99% when all specimens were considered (Table 1). The high number of HPV type-specific negative results explains the greater intralaboratory agreement obtained with HPV typing results than with generic HPV DNA detection despite a greater number of discordant triplicates. When concordant negative triplicates were excluded from the analysis, the level of agreement decreased as shown in Table 1. We then considered in our evaluation the fact that 10 samples contained HPV types that were not consistently detected between runs by the reference laboratory. In laboratory C, 7 (50%) of the 14 discordant triplicates contained HPV types not uniformly detected by the reference laboratory, thus leaving only 7 truly discordant triplicates for HPV typing (values in parentheses in Table 1). In laboratory B, four (50%) of the eight discordant triplicates contained HPV types not uniformly detected by the reference laboratory, leaving only four truly discordant triplicates for HPV typing. In laboratory A, 4 (17.3%) of the 23 discordant triplicates contained HPV types not uniformly detected by the reference laboratory, leaving 19 truly discordant triplicates for HPV typing. When only truly discordant triplicates were considered, the intralaboratory agreement for HPV typing increased slightly when all results were included in the calculation of agreement and ranged from 78.4 to 95.8% when only HPV-positive results were used (Table 1).
When the numbers of truly discordant triplicates for HPV DNA typing were compared between participating sites, significant differences were found, after the Bonferroni correction for a total of three comparisons, between laboratories A and B and between laboratories A and C (P = 0.006 and 0.054, respectively; Pearson chi-square test). Considering typing results from laboratory A that showed the greatest intralaboratory variability, 8 of 10 samples with low copy numbers of HPV DNA generated discordant triplicates, as opposed to 15 of 70 samples with clear signals or high HPV copy numbers (P = 0.005; Fisher's exact test). This relationship was not found for laboratories B and C, which had a higher level of agreement between triplicates (data not shown). Excluding specimens with low copy numbers of HPV types 45 and 16, discordant triplicates were distributed equally across several types. Reproducibility was perfect for the three samples with multiple HPV type infections for two laboratories (data not shown). One laboratory failed to detect an HPV type in one of the triplicates for two of the latter samples (data not shown).
Interlaboratory reproducibility of the PGMY-LB assay was first assessed by considering only the first PCR run in each laboratory. Laboratory pairwise comparisons are shown in Table 2 for HPV DNA detection and typing. Agreement for HPV DNA positivity between laboratory pairs A and B, B and C, and A and C reached 94% (102 of 109 samples, kappa value of 0.85), 99% (108 of 109 samples, kappa value of 0.98), and 93% (101 of 109 samples, kappa value of 0.83), respectively. Agreement for HPV typing results between laboratory pairs A and B, B and C, and A and C reached 94% (102 of 109 samples, kappa value of 0.81), 97% (106 of 109 samples, kappa value of 0.88), and 93% (101 of 109 samples, kappa value of 0.80), respectively.
|
View this table: [in a new window] |
TABLE 2. Interlaboratory agreement of PGMY-LB results for HPV DNA detection and HPV typing obtained on initial testing of 109 samples by three laboratories
|
|
View this table: [in a new window] |
TABLE 3. Interlaboratory agreement of PGMY-LB results for HPV DNA detection and HPV typing obtained in three laboratories on triplicate testings of 109 samples using the consensus definition for HPV results
|
|
View this table: [in a new window] |
TABLE 4. Accuracy of HPV DNA detection and typing results obtained with PGMY-LB for 109 samples by three laboratories
|
In laboratories A and B, amplicons generated by PGMY primers were tested with the LB assay and also with the generic probe microplate assay. Since two laboratories tested 109 samples in triplicate, 654 assay results could be compared. Indeterminate results for the generic probe microplate assay were obtained for 59 samples (30 HPV-positive samples and 29 HPV-negative samples). When these indeterminate results were excluded, 420 assays were HPV positive by both tests, 149 were negative by both tests, 11 were HPV positive with the generic probe microplate assay only, and 15 were HPV-positive with the LB assay only. The agreement between the tests for detection of HPV amplicons was 95.6% (569 of 595 results), for an excellent kappa value of 0.89.
|
|
|---|
We demonstrated on synthetic samples that specimens with small amounts of HPV DNA were prone to generate discordant results between triplicates. Since synthetic samples do not contain the amplification modifier found in clinical specimens, we also included in the proficiency panel anonymous cytobrush samples. Ten clinical samples contained at least one HPV type not consistently detected by the reference laboratory. For all of these samples, the type detected intermittently by the reference laboratory was also detected by at least one of the participating laboratories. This phenomenon could result from the presence of a low copy number of HPV DNA or inhibitors and illustrates the difficulty of using clinical samples in a proficiency test.
The intralaboratory agreement was excellent for all three laboratories. It increased when we did not consider variable HPV results obtained in the certification panel by the reference laboratory. Some groups have reported greater reproducibility of PCR for HPV testing (22, 27). In those studies, duplicates instead of triplicates were tested, only one laboratory was evaluated, and the panel included more HPV-negative samples than did our study. Our level of intralaboratory agreement for HPV DNA detection and typing corresponds to a previous evaluation by Daniel et al. (6). They also reported that the reproducibility of HPV typing with MY09/MY11 increased in the presence of greater HPV loads. We found that for the laboratory with several discordant triplicates, low-viral-load synthetic samples were prone to generating discordant results. As reported by others, we found a lower agreement for HPV typing than for HPV DNA detection (6, 19). We calculated not only overall agreement but also agreement with regard only to HPV-positive results because of the large number of concordant HPV-negative assays that artificially increased the level of agreement for HPV typing. As previously reported, agreement was strong for samples with multiple HPV types, although we only analyzed three such specimens (6, 19).
Two studies reported the interlaboratory agreement of PCR for HPV detection using the MY09/MY11 primer pair (16, 22). In one study, 33 samples were tested in two laboratories, while in the other study, 70 anal samples were analyzed in two laboratories. Our evaluation of agreement was consistent with the estimates from these previous reports for HPV DNA positivity (83 and 96%) and HPV typing (90 and 88%).
The accuracy of the PGMY-LB assay was excellent. However, specimens may better agree with a reference standard when the same test is used to validate and test the panel. The low rate of false-positive results in the 29 HPV-negative samples suggests a low rate of contamination in our study. One of the false-positive specimens was preceded by an HPV-51-positive sample. Contamination may have occurred after amplification during the hybridization of amplicons with the LBs, since the same amplicons hybridized with the generic probe tested negative. Another study evaluated the accuracy of the GP5+/GP6+ L1 consensus primer pair and reported excellent accuracy for 50 samples tested in four laboratories, ranging from 92 to 100% for detection of HPV DNA, a rate similar to our results (19). That study also reported that most discrepant results were false-negative results from HPV-positive samples (19). The reproducibility of the PGMY-LB assay in our study was also similar to that of the FDA-approved hybrid capture assay from Digene (29). The accuracy of hybrid capture in detecting high-risk HPV types ranged for three laboratories from 88 to 92%, and interlaboratory agreement reached 90%.
Nonetheless, our results should be interpreted cautiously, with the following caveats. Few samples with multiple type infections were included in our evaluation. These samples have been demonstrated to generate intralaboratory variability in one study (6). Future evaluations of the PGMY-LB assay should include multiple type infections with some types in small amounts. Our clinical samples were obtained without knowledge of cytopathological diagnoses. Assay reproducibility and accuracy may differ when samples from different patient populations are selected, especially since viral load is related to cervical disease status and assay reproducibility (31).
We demonstrated here that there is a very strong interlaboratory agreement when the PGMY-LB assay is used. HPV testing can thus be reliably accomplished with PCR by using a validated and clearly written protocol with quality-controlled reagents. Although the assay procedure is well standardized, an initial certification panel to ensure the reliability of results would be desirable before a laboratory performs the PGMY-LB assay on a regular basis. Our study stresses the importance of using standardized reagents and protocols and the establishment of proficiency panels for the comparability of results between studies. More studies on performance standards that include samples with multiple type infections, with inhibitors and low viral load infections, need to be performed.
F.C. and M.R. hold a clinical career award supported by the Fonds de Recherche en Santé du Québec.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»