Previous Article | Next Article ![]()
Journal of Clinical Microbiology, February 2007, p. 443-452, Vol. 45, No. 2
0095-1137/07/$08.00+0 doi:10.1128/JCM.01870-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Center for Bio/Molecular Science and Engineering, Code 6900, Naval Research Laboratory, Washington, DC 20375,1 NOVA Research Incorporated, Alexandria, Virginia 22308,2 Department of Defense Center for Deployment Health Research, Naval Health Research Center, San Diego, California 921863
Received 8 September 2006/ Returned for modification 27 October 2006/ Accepted 15 November 2006
|
|
|---|
|
|
|---|
A resequencing microarray approach is advantageous in addressing these issues. This platform can maintain specificity, provide information on mutation hot spots and strain variants, and monitor all these aspects in a few long sections of DNA/RNA sequence. Our previous work has demonstrated the potential of short-oligonucleotide resequencing arrays to simultaneously provide both species-level and strain-level identification of amplicons from respiratory pathogens (19, 39). This approach has been used to efficiently and simultaneously detect, type, and genetically characterize geographically diverse influenza viruses (39). The sequences produced by the resequencing arrays were identical to the results from conventional sequencing methods with the exception of ambiguous base calls (N's) (19, 39). The microarray results were analyzed using a new approach that compared the sequence of bases determined to all previously sequenced results. Thus, it is possible both to correctly identify the organism and to determine the difference in sequence between the organism detected and previously sequenced organisms (21). In the initial studies, random amplification methods were used as the enrichment method to provide the sensitivity necessary for this potential diagnostic assay. This broadly targeted enrichment method was initially selected for its potential benefits in reducing biased amplification of one organism over another and in amplifying mutated organisms that a more specific amplification method might miss.
While the initial results using random amplification and a resequencing microarray are very promising, several issues must be addressed before this technology can be considered ready for use in a surveillance or diagnostic application. Specifically, the generic amplification methods, which performed well with analytic test samples, did not consistently detect organisms in complex clinical samples with lower titers. In addition, the analysis method was complex and time-consuming and required the expertise of highly trained individuals. In this study, in order to overcome the sensitivity issue related to random target amplification, we developed an alternate amplification strategy, optimized multiplex PCR, which provides greater sensitivity for clinical samples while still being capable of identifying the presence of close genetic neighbors (e.g., various adenovirus serotypes). Furthermore, to demonstrate that this approach is effective for the simultaneous detection of multiple pathogens, we tested the system's capability for detecting mixed multiple pathogens in analytic control samples and complex backgrounds (mimicking real-world situations). The results demonstrate that this approach allows unambiguous and reproducible sequence-based strain identification of the mixed pathogens. Further testing, using clinical specimens (throat swabs) obtained from patients presenting febrile illness, showed that we can achieve high species-level concordance with standard reference assays while at the same time producing correct species and strain-level identification via direct sequence reads in an assay time of around 8.5 h. These results show that this approach is amenable to the development of a robust microarray-based platform that offers comprehensive coverage of the significant respiratory pathogens for both diagnostic and surveillance purposes.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Analytic sensitivity of microarray-based detection for prototype control strains
|
Clinical samples. Archived throat swabs were collected from patients with symptoms of ARI at various military recruit training centers, at U.S.-Mexico border sites, and on deployed naval ships from 1999 to 2005. These were immediately placed in 2-ml cryogenic vials containing 1.5 ml of viral transport medium (MICROTEST, Multi-Microbe Media; Remel Inc., Lenexa, KS), frozen, and stored at or below 80°C to maintain the viral particles during transport. Samples were then shipped to the Naval Health Research Center (NHRC; San Diego, CA), a molecular diagnostics laboratory certified by the College of American Pathologists (CAP), where they were thawed, aliquoted, and tested for human adenoviruses and influenza virus using CAP-approved diagnostic RT-PCR/PCR and culture tests. Frozen aliquots were then submitted for microarray-based detection in a blinded fashion. Informed consent was obtained from all participants after the nature and possible consequences of the studies were explained.
These samples were collected, and this research has been conducted, in compliance with all applicable federal and international regulations governing the protection of human subjects in research, under Naval Health Research Center (NHRC) work unit number 60501, Department of Defense protocols NHRC.1999.0002.31271 and NHRC.2003.0002.
Chip processing protocol. Figure 1 is a schematic diagram of the processing protocol. The details of each processing step are described below.
![]() View larger version (23K): [in a new window] |
FIG. 1. Schematic of the processing protocol. Clinical samples (nasal swab or nasal wash specimens) were collected from patients presenting ARI symptoms. Nucleic acids were extracted from these samples, followed by reverse transcription. The products of reverse transcription were split into two multiplex PCR mixtures for amplification. PCR products from the two mixtures were combined after amplification, cleaned up, fragmented, and labeled. The labeled products were then hybridized to RPM v.1 chips. After the chips were washed and stained, the sequences generated from RPM v.1 were exported as FASTA-formatted files and further analyzed for pathogen identification using the automatic pathogen identification algorithm.
|
Internal controls.
Two Arabidopsis thaliana plant genes, corresponding to NAC1 and triosphosphate isomerase (TIM), were chosen as internal controls for RT and PCRs, since they would be unlikely to occur naturally in clinical samples. Two plasmids, pSP64poly(A)-NAC1 and pSP64poly(A)-TIM, containing
500 bp of the two genes, were kindly provided by Norman H. Lee of The Institute for Genome Research (Rockville, MD) (38). NAC1 was amplified by PCR with SP6 and M13R primers, and the PCR products were purified using a QIAquick PCR purification kit (QIAGEN, Valencia, CA). To generate RNA from pSP64poly(A)-TIM, the plasmids were linearized with EcoRI and in vitro transcribed from the SP6 promoter using the MEGAscript high-yield transciption kit (Ambion, Austin, TX). Sixty femtograms each of NAC1 and TIM were used as internal controls for checking the amplification efficiency and the presence of inhibitors in the specimens.
Primer design and multiplex RT-PCR amplification. The gene-specific primer pairs for all targets on the RPM v.1 chip (listed in Table S1 in the supplemental material) were designed to meet the minimum amplification efficiency requirement (to provide a detection sensitivity of 101 to 103 genome copies per target) for multiplex PCR. All primers were designed to have similar annealing temperatures and were checked to ensure uniqueness using a full search of the GenBank database with the BLAST program for known sequences. All primers were checked for potential hybridization to other primers in order to reduce the potential of primer-dimer formation. In addition, we adapted a method developed by Shuber et al. and Brownie et al. (3, 32) to further suppress primer-dimer formation by adding a linker sequence of 22 bp (primer L) to the 5' ends of primers used in this study. To further minimize the possibility of intraprimer interactions, the primers were divided into two independent reactions to simplify primer design and optimization. Fine-tuning adjustments to both mixtures (replacing primers that amplified poorly with new ones) were carried out to ensure that all target genes from the 26 targeted pathogens (West Nile virus is included on the array but not in this amplification scheme) would amplify sufficiently to generate detectable hybridization.
RT reactions were performed in 20-µl volumes containing 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgCl2, 500 µM each dATP, dCTP, dGTP, and dTTP, 40 U of RNaseOUT, 10 mM dithiothreitol, 2 µM primer LN, 200 U of Superscript III (Invitrogen Life Technologies, Carlsbad, CA), 60 fg of each of the two internal controls (NAC1 and TIM), and 5 to 8 µl of the extracted clinical specimen or laboratory control. Reactions were carried out in a Peltier thermal cycler-PTC240 DNA Engine Tetrad 2 (MJ Research Inc., Reno, NV) using the manufacturer's recommended protocol.
The RT reaction products were split up into two 10-µl volumes to be subjected to two different multiplex PCRs. Primer mix A contains 19 primer pairs and amplifies 18 gene targets from three different influenza A viruses, one influenza B virus, three HAdV serotypes, and one internal control (TIM). Primer mix B contains 38 primer pairs and amplifies the remaining 37 gene targets and the other internal control (NAC1). PCRs were performed in 50-µl volumes containing 20 mM Tris-HCl (pH 8.4), 50 mM KCl, 2 mM MgCl2, 400 µM each dATP, dCTP, dGTP, and dUTP, 1 U of uracil-DNA glycosylase, heat-labile (USB Corporation, Cleveland, OH), 2 µM primer L, 40 nM each primer from mix A or 50 nM each primer from mix B, 10 U of Platinum Taq DNA polymerase (Invitrogen Life Technologies, Carlsbad, CA), and 10 µl of the RT product. The amplification reaction was carried out in a Peltier thermal cycler-PTC240 DNA Engine Tetrad 2 (MJ Research Inc., Reno, NV) with initial incubation at 25°C for 10 min; preliminary denaturation at 94°C for 3 min, followed by 5 cycles of 94°C for 30 s, 50°C for 90 s, and 72°C for 120 s; 35 cycles of 94°C for 30 s and 64°C for 120 s; and a final extension at 72°C for 5 min. The amplified products from the two PCRs were combined into a single volume and subjected to purification and processing prior to hybridization to the RPM v.1 chips (see below).
Microarray hybridization and processing. Microarray hybridization and processing were carried out according to the manufacturer's recommended protocol (Affymetrix Inc., Santa Clara, CA) using a GeneChip resequencing assay kit (Affymetrix Inc.) with the following modification. Purified PCR products were fragmented for 5 min and then labeled for 30 min. Hybridization was carried out at 45°C for 2 h. The images were scanned and processed as previously described to produce FASTA output files (19).
Automatic pathogen identification algorithm. A new software program, the computer-implemented biological sequence-based identifier system, version 2 (CIBSI 2.0), was used to automate the pathogen identification process for the RPM v.1 array. CIBSI 2.0 incorporates the general concept of the resequencing pathogen identification (REPI) program (19) and in addition analyzes the result and makes decisions, steps that were previously carried out manually. A broader discussion of this protocol, including an improved REPI algorithm, is described in detail elsewhere (21).
Quantification of pathogens.
For sensitivity assessments, real-time PCR assays were conducted on an iCycler or MyiQ instrument (Bio-Rad Laboratories, Hercules, CA) to determine the number of pathogen genomes in each sample. The findings for the samples were compared to those for 10-fold serial dilutions of prototype genomic DNA templates with known copy numbers (101 to 106 copies) by using specific primers and RT-PCR/PCR conditions as previously described in the literature (see Table S2 in the supplemental material) (11, 17, 24, 34, 35). The genomic copy number of the pathogen was calculated by measuring the DNA/RNA concentration from purified genomic DNA/RNA and using conversion factors as follows. Molecular mass (y) is calculated as the number of base pairs x 660 Da (average molecular mass for 1 bp at 330 Da for each nucleotide). Grams of DNA per genome copy is calculated as y daltons x 1.67 x 1024 g (e.g., a single adenovirus genome of
35 kb has a molecular mass of 2.31 x 107 Da, which is equivalent to 3.86 x 1017 g; 1 ng of purified adenovirus genomic DNA is equivalent to 2.6 x 107 genome copies) (31). Real-time PCRs were carried out in 25-µl reaction volumes containing 2.5 µl FastStart reaction mix SYBR Green I (Roche Applied Science, Indianapolis, IN), 20 mM Tris-HCl (pH 8.4), 50 mM KCl, 3 mM MgCl2, 200 µM each dATP, dTTP, dGTP, and dCTP, 200 nM primers, and genomic DNA (1 to 4 µl of clinical specimen or DNA extracts).
|
|
|---|
The analytical sensitivity of the RPM v.1 assay was then evaluated using serial 10-fold dilutions of the nucleic acid templates of the prototype strains. Table 1 shows the lowest detectable dilution of each pathogen. The results revealed an individual sensitivity that ranged from 101 to 103 genomic copies per reaction for the prototype strains, which is comparable to the sensitivity of standard multiplex RT-PCR/PCR methods. The genome copy number should not be equated to the CFU or PFU but was used so that comparisons could be made between pathogens from different sources. The number of genome copies represented by the CFU or PFU of a patient sample can vary from one to several orders of magnitude greater for various respiratory pathogens.
The capability of RPM v.1 to identify and discriminate between near genetic neighbors is dependent not only on the capabilities of the microarray but also on the amplification strategy. The capability of RPM v.1 was first demonstrated with random amplification protocols and has been reproduced with this multiplexed amplification protocol. This assay distinguished between 11 different serotypes of ARI-associated HAdV's and also differentiated four different strains of HAdV-4, three strains of HAdV-7, and two strains of HAdV-3. These results demonstrated that the newly developed multiplexed amplification would detect a range of variants comparably to random amplification methods (Table 2).
|
View this table: [in a new window] |
TABLE 2. Differentiation by RPM v.1 of various HAdVs causing febrile respiratory infections
|
The effectiveness of simultaneous multiple pathogen detection was also tested with more-complex mixtures. Three to seven available cultured organisms were spiked at different titers (102 to 105 CFU or PFU/ml) into pooled nasal wash samples collected from volunteers, and 150 µl of the prepared samples was used for testing. Initial results revealed that this approach allowed unambiguous detection of seven pathogensHAdV-4, HAdV-7, influenza A virus (H1N1), parainfluenza virus 1, respiratory syncytial virus A (RSV-A), M. pneumoniae, and S. pyogenessimultaneously at the lowest titer, 100 CFU (or PFU)/ml (see Table S4A in the supplemental material). Further assessment with a set of six pathogens showed that HAdV-4, Bacillus anthracis, influenza A virus (H1N1), RSV-A, and M. pneumoniae were detected at the lowest titer tested, 100 CFU (PFU)/ml, and S. pyogenes was detected at 1,000 CFU/ml (see Table S4B in the supplemental material). For further confirmation, RPM v.1 was tested using eight cultured organisms combined to form three different sets of three pathogens (see Table S4C in the supplemental material). In all sets tested, the assay could reproducibly detect HAdV-4, M. pneumoniae, B. anthracis, influenza A virus (H1N1), human coronavirus 229E, or RSV-A at titers as low as 100 CFU (PFU)/ml and S. pyogenes to only 1,000 CFU/ml. These results indicate that the resequencing array-based approach is an effective means of detecting and typing various pathogens directly from nasal wash samples with the benefit of high sensitivity and specificity, even when as many as seven pathogens are present.
Assessment of clinical specimens. After the capability of the RPM v.1 assay for pathogen detection was successfully demonstrated, it was used for prospective and retrospective diagnoses of infections causing ARI. Clinical specimens, collected primarily from military recruits presenting with ARI, were used to compare the utility of the microarray-based diagnostic to more-established methods of respiratory pathogen detection. The samples (n = 101) consisted of throat swabs, in viral transport medium, from patients with clinically documented respiratory illness. Samples were chosen randomly from sets that had tested positive for HAdV or influenza virus by assays performed at NHRC, a CAP-certified molecular diagnostic laboratory (cell culture and/or PCR), but that were not tested for any other pathogen. As controls, samples that had tested negative for HAdV or influenza virus were also included for testing. These were blinded (randomly renumbered and separated from the associated clinical records) and sent to the Naval Research Laboratory for RPM v.1 testing, and the sample identities were revealed only after the results had been finalized. For pathogens detected in at least two samples each (HAdV, influenza A virus, S. pneumoniae, and M. pneumoniae) by RPM v.1, published species-specific (11, 24) or selected in-house specific PCR primers (see Table S2 in the supplemental material) were used to perform quantitative real-time PCR for a subset of the clinical samples (limited to only 40 due to the availability of the samples) (Table 3). The lack of samples makes it difficult to properly estimate the clinical sensitivity and specificity for some of the pathogens detected by RPM v.1, and so data for these pathogens, such as Neisseria meningitidis, are not reported.
|
View this table: [in a new window] |
TABLE 3. Quantitative real-time PCR results for 40 clinical samples
|
104 genome copies/µl) (Table 3). The high titers of bacteria present in these clinical samples were possibly due to virally induced bacterial superinfection, consistent with the findings of Madhi and Klugman (20) and Peltola and McCullers (26). |
View this table: [in a new window] |
TABLE 4. Evaluation of RPM v.1 for adenovirus, influenza A virus, and negative-control detection in clinical samples
|
|
View this table: [in a new window] |
TABLE 5. Comparison of the culture method with a real-time PCR assay for detection of influenza A virus-positive and -negative controls in 40 clinical samples
|
|
View this table: [in a new window] |
TABLE 6. Influenza virus strain and lineage identification using RPM v.1
|
|
|
|---|
Commensal microflora has been viewed as a source of specimen contamination and an occasional opportunistic pathogen, but it may play a more important role in health and disease than was once thought (29). Madhi and Klugman (20) and Peltola and McCullers (26) have reported that
30% of respiratory viral infections predispose patients, both adults and children, to bacterial superinfection. These superinfections are thought to be the source of many influenza-related pneumonias and subsequent deaths (27). It is not surprising that RPM v.1 methods detected S. pneumoniae and N. meningitidis in several clinical samples, since it is well known that both are commensal bacteria present in the upper respiratory tract. However, it is noteworthy that while most of these bacteria appeared to be present at low titers in the samples, high titers of these pathogens were regularly seen in influenza A virus-positive samples. Although the RPM v.1 assay does not determine the titers of the pathogens detected, the assay can effectively detect coinfection. This type of assay would be highly effective in studies to establish more firmly whether a clinical correlation of disease exists in cases of coinfection. If situations where this occurs are established, a diagnostic test based on a resequencing array will be valuable for the effective treatment of the primary agent and secondary coinfecting organisms, with prompt treatment using appropriate antibiotics.
Unlike traditional methods, the optimized RPM v.1 assay not only identifies pathogens but also provides sequence information, allowing a large number of pathogens to be detected and phylogenetically categorized for genetic variation analysis in the same assay (19, 39). This utility was clearly demonstrated for the influenza A virus-positive clinical samples, where the array using the HA gene tracked the lineage changes from A/Panama/2007/99-like strains (prior to the 2003 influenza season) to A/Fujian/411/2002-like strains in the 2003-to-2004 influenza season and then to A/California/7/2004-like strains in the 2004-to-2005 influenza season. While the specific HA genes allowed detailed tracking of changes in a specific subtype, one M gene (H1N1) sequence that is relatively conserved among influenza A viruses tiled on the RPM v.1 was still able to detect homologous regions of disparate subtypes, allowing correct differentiation (Table 6). This M gene ProSeq would theoretically allow detection of any other type of influenza virus for which specific antigenic HA and NA sequences were not tiled on the array. In the future, this method of mixing specific and conserved targeting will be useful for enhancing clinical management and epidemic outbreak responses by permitting accurate fingerprinting, antibiotic resistance profiling, genetic drift/shift analysis, forensics, and many other parameters of important pathogens while maintaining coverage of a large number of pathogens. This capability will be invaluable for rapid detection of emerging diseases, such as avian influenza virus (H5N1), and biological terrorism events. In addition, with the capability for simultaneous resequencing of dozens of gene targets from multiple pathogens in a single assay, the technology is an excellent tool for identifying minor variants within a population that may emerge and become dominant when selection pressure changes, without the need to isolate before proceeding with the sequencing reaction.
While the RPM v.1 chips with multiplex amplification demonstrated remarkable diagnostic and surveillance capabilities, the following factors must be taken into account before introducing this technique in the diagnostic laboratory. One major hurdle that limits the use of a resequencing array to broad-spectrum pathogen diagnostics has been the selection of the primers for amplification of chosen target species and near neighbors prior to microarray hybridization. In this study, we showed that multiplex PCR can reach the desired clinical sensitivity and detect the presence of close genetic neighbors; however, the current system remains somewhat vulnerable to the rapid mutation of the RNA virus, and each new resequencing array design would require recalibrating the mix of multiplex primers. Future work will attempt to simplify the redesign method and also try to find alternative amplification methods that provide the necessary sensitivity with more-comprehensive coverage.
Another issue, in the current array format, is that a limited number of probe sets can be placed on the microarray, and so very detailed information cannot be obtained for every pathogen. Currently the targets must be carefully chosen, or the microarray will not provide the coverage or detailed information desired. As the number of probe sets that can be placed on the microarray continues to increase, the trade-off between specific and conserved targets in the effort to provide sufficient detailed information while maintaining coverage will become less of an issue. Indeed, our newer chip design has improved upon the content of the detectable pathogens, which will further broaden the detection capability of the resequencing array.
In its current format, this assay can be performed at centralized laboratories in 8.5 h with a completely automated pathogen identification process. However, this is not suitable for a point-of-care diagnostic tool. The simplicity of the assay should allow it to be adapted to a fully automated process, increasing throughput, further decreasing the assay time, and reducing the error rate caused by human handling. With these alterations, it should be possible to implement this assay as a point-of-care analysis system. The cost of this assay compared to that of rapid single assays is high, although factoring the cost of all the rapid assays required to give the same information indicates that the relative cost difference for the two methods is not as significant. Considering costs on a per-test basis, the advancements occurring in microarray production technology should provide even higher density microarrays with a reduced chip cost, and steps taken to produce a point-of-care system would also result in a reduction in the overall assay cost, which leads us to believe that newer-generation chips providing more-comprehensive coverage will be cost-effective in the near future. The benefits of this approach, with clear paths to overcome or reduce its limitations, make this an attractive assay method to develop for detecting respiratory pathogens. Future development will include addressing the most important limitations: designing a new resequencing array with more-comprehensive coverage of respiratory pathogens, improving primer selection, and integrating microarrays and microfluidics into a portable device, allowing the possibility of transferring this reference lab method into the point-of-care field.
We thank Margaret Ryan and Christopher Barrozo at NHRC, Linda Canas and Luke Daum at AFIOH, and Ted Hadfield at AFIP for kindly providing samples used in this study. We also thank Carolyn E. Meador for providing technical support and advice. Constructive advice from Frances Ligler, Gary Vora, Zheng Wang, and Chris Myers is gratefully appreciated.
The opinions and assertions contained herein are those of the authors and are not to be construed as official or as reflecting the views of the Department of Defense or the U.S. Government.
Published ahead of print on 29 November 2006. ![]()
Supplemental material for this article may be found at http://jcm.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»