Previous Article | Next Article ![]()
Journal of Clinical Microbiology, April 2009, p. 988-993, Vol. 47, No. 4
0095-1137/09/$08.00+0 doi:10.1128/JCM.01346-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
Clark Tibbetts,5 and
David A. Stenger1
Center for Bio/Molecular Science & Engineering, Code 6900, Naval Research Laboratory, Washington, D.C. 20375,1 Nova Research, Incorporated, Alexandria, Virginia 22308,2 Department of Respiratory Disease Research, Naval Health Research Center, San Diego, California 92106,3 Naval Medical Research Unit 3, Cairo, Egypt,4 TessArae, LLC, Potomac Falls, Virginia 201655
Received 15 July 2008/ Returned for modification 9 January 2009/ Accepted 17 February 2009
|
|
|---|
|
|
|---|
Detection and discrimination of all potential influenza A virus subtypes is needed to identify the introduction of zoonotic strains to humans, monitor the status of these pathogens in their natural hosts, and minimize epidemic spread if transmissible human infections occur. An effective surveillance assay could rapidly detect and identify all subtypes of avian influenza virus and provide useful secondary information related to specific functional mutations which alter pathogenicity or drug resistance. For example, the low-pathogenicity H5N1 viruses should be differentiated from the highly pathogenic strains by a mutation in an HA cleavage site (a multibasic cleavage motif, PQRERRRKKRG), a deletion of 20 amino acids in the NA protein, and a signature amino acid substitution, E627K, in the PB2 protein (1, 23).
Viral culture paired with serological HA typing is the current standard method for detecting and typing influenza A viruses. These procedures are time-consuming, taking days or even weeks to provide specific results. Several molecular diagnostic approaches including reverse transcription (RT)-PCR, real-time PCR, PCR-enzyme-linked immunosorbent assay, and spotted oligonucleotide microarrays provide fast and sensitive alternatives to viral culture (5, 8, 12-15, 20, 23, 27, 31, 33, 34, 38). While promising, these methods either are limited to detecting only a few subtypes or provide a very limited range of genetic resolution. Additional time-consuming characterization, such as direct sequencing, is required for analysis of strain variations and specific mutations that contribute to or predict influenza virus pathogenicity, host range, drug resistance, and vaccine efficacy.
Alternative strategies, such as the use of RT-PCR coupled with pyrosequencing (7, 28), RT-PCR-electrospray ionization (ESI)-mass spectrometry (MS) (30), or resequencing pathogen microarrays (RPM) (6, 16, 17, 19, 36), allow tracking of genetic changes and supply subspecies identification. The pyrosequencing technique is currently limited to short fragments and is applied to detection of H5N1 or selected drug resistance markers (3, 4, 7, 28). The RT-PCR-ESI-MS method, developed for detection of all avian serotypes, has, to date, only demonstrated tracking of genetic changes in human influenza virus samples. The RPM technology is the only one of these technologies currently under development for simultaneous detection and identification of influenza A virus variants together with a large number of other viral and bacterial pathogens that may elicit similar flulike illnesses. Furthermore, the RPM technology separates and partially decouples the amplification of limiting templates by multiplex RT-PCR from the selection of microarray contents and detection capability, which alleviates constraints on primer selection while still providing the required specificity.
Herein, we investigate the performance of new versions of the respiratory pathogen microarray (TessArray RPM-Flu 3.0 and 3.1, subsequently designated RPM-Flu (see Table S1 in the supplemental material) for detection and differential identification of all subtypes of the influenza A virus HA and NA genes in a single-pass assay. Previous studies demonstrated the ability of RPM technology to detect targeted pathogens with analytical and clinical sensitivities and specificities that are similar to (or improved over) those for existing technologies, while simultaneously providing sequence information for strain resolution (6, 16, 17, 19, 22, 36). The RPM-Flu arrays are designed and constructed to allow thorough coverage of 86 bacterial and viral agents, including respiratory pathogens and zoonotic organisms considered to be significant risks for human health, e.g., severe acute respiratory syndrome virus.
About 30% of the RPM-Flu array is dedicated to targeting all 16 HA and 9 NA alleles of avian influenza A viruses. The HA and NA genes represented on the microarray are based on prevalent strains of influenza A viruses circulating in the avian population within the last few years. Previous studies have shown that a single sequence on a resequencing microarray could reliably detect and serotype strains with as much as 10 to 15% variation (16, 18, 19, 36). Thus, this technology has the potential to detect all possible influenza A virus HA and NA subtype combinations. The data presented here demonstrate that the RPM platform is an effective means for universal detection and identification of all subtypes of avian influenza A viruses and provides useful secondary information related to pathogenicity and drug resistance.
(This work was presented in part at the 107th general meeting of the American Society for Microbiology, 2007.)
|
|
|---|
Specimen collection and processing. Cloacal swabs from migratory birds (mostly waterfowl), cloacal and/or tracheal swabs from commercial poultry, tracheal and lung tissue samples from dead birds, human throat swabs, and one human lung sample taken from a deceased patient were collected by Naval Medical Research Unit 3 (NAMRU3; Cairo, Egypt). These samples were cultured using chicken eggs and/or MDCK cells. Sample collection and viral culture techniques were performed as described in the World Health Organization Manual On Animal Influenza Diagnosis and Surveillance, Version 2002.5, revision 1 (39). Total RNA was extracted from culture isolates; 1:100 to 1:1,000 dilutions were used for molecular diagnosis.
Microarray design. The details of the design and selection process for the gene targets for RPM-Flu arrays were described in a previous publication (37). Briefly, after target genes were chosen based on the association between sequence diversity and clinically relevant phenotypic diversity, in silico modeling (21) was employed to select a set of probes that specifically recognize only the targeted clade of organisms while accounting for all or most of the recognized variants (i.e., strains or subtypes) within that clade. The RPM-Flu arrays (TessArray RPM-Flu 3.0 and 3.1; TessArae, LLC, Potomac Falls, VA) were designed to maximize coverage of respiratory pathogens, targeting all 16 HA and 9 NA alleles of avian influenza A viruses as well as three matrix (M) alleles, one nonstructural protein (NS1) allele, and one polymerase basic subunit 2 (PB2) region (see Table S1 in the supplemental material). About 30% of the RPM-Flu arrays are dedicated to the detection of influenza viruses. Because numerous respiratory pathogens needed to be detected using a limited amount of microarray space, most HA and NA genes were represented by a single sequence optimized for prevalent strains of influenza A viruses circulating in the avian population within the last few years (see Table S2 in the supplemental material). We have focused on ensuring the typing of the most common variants, while ensuring identification (with potentially less detailed typing results) for less common variants. There are instances where less common types will have more than 15% variation for some targets. This will not prevent detection of the influenza virus but may prevent subtyping to a single type and instead result in a list of several equally likely subtype identifications. Definite subtyping then relies on other targets on the array. The H10N7 samples discussed in Results serve as an example of this. The NA7 tile is designed based on the H7N7 subtype, and there is approximately 20% difference in the NA sequences of the H7N7 and the H10N7 subtypes. For a sample of H10N7, the sequence generated from the NA7 tile provides detection but indicates both H10N7 and H7N7 as being equally possible. The information from the HA10 (positive) and the HA7 (negative) subtypes allow a final determination that H10N7 has been detected. For each of the HA3, HA6, HA7, NA1, and NA8 genes, two different reference sequences were used because multiple sequence alignments revealed two major prevalent groups that could not be represented by a single reference sequence.
Primer design. Four independent multiplex primer mixes, separated to simplify primer design and multiplex optimization, were used to amplify a total of 187 targeted sequences represented on RPM-Flu arrays (298 primers). The gene-specific primer pairs for all targets on the RPM-Flu chips (see Table S1 in the supplemental material) were designed according to the criteria described previously (16, 17). To accommodate the genetic variation of RNA viruses and ensure amplification of closely related variants, a software script automated a primer selection algorithm developed by our group (unpublished data, available upon request) to select primers from defined primer regions of each target. Briefly, target sequences were used to search against GenBank to find all sequences sharing at least 80% sequence identity. Consensus sequences that covered the primer regions flanking the target sequences were generated from the sequences found. Primers were chosen to be between 18 and 25 bp in length, with predicted annealing temperatures between 55 and 60°C and at most one wobble nucleotide (M, R, S, W, Y, or K). When necessary, two primer pairs instead of one were used to amplify one target sequence.
Of the four multiplex primer mixes, one was dedicated to the influenza virus targets, including the aforementioned HA and NA alleles and the M genes from H3N2 and H1N1, which served as general markers of the presence of influenza A viruses. The M, NS1, and PB2 genes from the H5N1 subtype were included to allow more-detailed analysis of critical regions subject to functional mutation (see Table S1 in the supplemental material).
Multiplex RT-PCR amplification. The multiplex RT-PCRs were performed as previously described (16), with the following modifications. For the RT step, primer LN was replaced by primer NLN (a random 9-mer with a linker sequence). One picogram each of two internal controls (NAC1 and triosephosphate isomerase) and 4 µl of the total nucleic acids extracted from either clinical specimens or laboratory controls were used. The RT reaction products were split into four 5-µl volumes for four different multiplex PCRs. Platinum Taq DNA polymerase (Invitrogen Life Technologies, Carlsbad, CA) was replaced by GoTaq DNA polymerase (Promega Corporation, Madison, WI) in the PCR. Linker primer NL instead of primer L was used with 50 to 150 nM each of primers from mixes 1 to 4 in the multiplex PCR. The amplification reaction was carried out with an initial incubation at 25°C for 10 min and then preliminary denaturation at 94°C for 2 min; followed by 16 cycles at 94°C for 30 s, 45 to 60°C for 30 s (with an incremental increase of 1°C per cycle), and 72°C for 90 s; and then 24 cycles at 94°C for 30 s and 60°C for 120 s. The amplified products from all four PCRs were combined into a single volume and subjected to purification and processing prior to hybridization to the RPM-Flu chips.
Microarray hybridization and analysis. The overall procedure of microarray processing protocol was based on a previously published procedure (see Fig. S1 in the supplemental material). Microarray hybridization and processing and image scanning and processing for production of FASTA output files were performed as previously described (16). Final pathogen identification was performed using Computer-Implemented Biological Sequence Identifier version 2.0 software (22), an automatic pathogen identification algorithm based on nucleic acid sequence alignment, which was developed and tested in detail in previous studies (16, 17).
Reference assays. Hemagglutination inhibition assays (HI) were performed at NAMRU3, following the World Health Organization Manual On Animal Influenza Diagnosis and Surveillance, version 2002.5, revision 1 (39). RT-PCR-ESI-MS was performed as previously described (30) at the Naval Health Research Center (NHRC; San Diego, CA). Additional assays, including conventional and/or real-time RT-PCR, were also performed as further confirmation at the NHRC and/or the Naval Research Laboratory (NRL; Washington, DC). Real-time RT-PCR was conducted to quantify the number of avian influenza virus genomes present in each sample (32). Furthermore, additional subtype-specific RT-PCR amplification assays were performed on discordant samples on the basis of both RPM-Flu and RT-PCR-ESI-MS results.
Full-length amplifications of M and NS1 genes, using the universal primer set described by Hoffmann et al. (11), were also performed on discordant samples to obtain unbiased de novo sequence results. When the universal primer set did not produce sufficient material for conventional sequencing, subtype-specific RT-PCR amplifications were performed based on both RPM-Flu and RT-PCR-ESI-MS results. Amplified products from HA, M, NA, and NS1 genes were purified and sent to Macrogen USA (Gaithersburg, MD) for Sanger/electrophoresis-based sequencing using either correspondent universal or specific primers.
Nucleotide sequence accession numbers. All nucleotide sequences reported in this study are available at GenBank (http://www.ncbi.nlm.nih.gov/GenBank/index.html) under accession numbers EU599273 to EU599315.
|
|
|---|
Identification of influenza A virus isolates. Having demonstrated the capability of RPM-Flu for detecting and differentiating all HA and NA subtypes, a study was undertaken to characterize 37 cultured avian influenza virus specimens collected in the Middle East. This study effectively compared the utility of the RPM platform with respect to the traditional HI for avian influenza virus detection. The samples were additionally tested using alternate PCR methods available at the time (RT-PCR-ESI-MS and PCR assays, carried out at the NHRC). These samples were blinded with respect to all prior results and tested at the NRL by using the RPM-Flu platform. Sample identities were revealed only after the RPM results were finalized (Table 1).
|
View this table: [in a new window] |
TABLE 1. Identification of influenza A virus by RPM, HI, and RT-PCR-ESI-MS
|
Confirmation of influenza virus isolates by sequence analysis. Conventional sequencing of HA, M, NA, and NS1 by use of universal primers (11) was attempted on the 11 discordant samples and 2 H5N1-positive samples as controls. When the universal amplification procedure failed, subtype-specific primers were employed based on the typing identifications made by both the RPM-Flu and the RT-PCR-ESI-MS platforms. In the event that conventional sequencing was successful, sequences generated for M and NS1 genes did not provide conclusive information for subtype identification, but direct HA and NA gene sequencing agreed with the results from the RPM-Flu and HI platforms. In cases where HI failed to provide conclusive identification but the RPM-Flu provided detection, the identification was concordant with the de novo sequencing analysis (Table 1).
Identification of highly pathogenic influenza A strains. Translation of RPM-Flu resequencing data for the HA and PB2 sequences of the 20 avian H5N1-positive samples confirmed that 13 samples contained the HA cleavage site and 15 samples indicated key amino acid substitutions in the PB2 protein. Eight samples had base calls before and after the location of the NA deletion. In these samples, this is a strong indication that a deletion is present. The remaining samples, unfortunately, had base calls only after the location of the NA deletion, so, while suggestive, it is possible that the absence of base calls before this location is associated with the low titer of the sample and not with the presence of a deletion. The overall results indicated that 75% of the H5N1 samples had at least one highly pathogenic strain marker and no conflicting indications. The remaining samples failed to provide base calls in all the marker locations; therefore, no clear determination of low- or high-pathogenicity strains could be made. The HA peptide translated from the two H7N7 samples indicated low virulence strains (data not shown).
Drug resistance information. In addition to tracking the mutations that were indicative of HPAI, RPM results provided the ability to track drug resistance markers. Previous work has shown that the NA protein may be characterized by four critical amino acid substitutions, E119V, R152K, H274Y, and R292K, associated with resistance against neuraminidase inhibitors. The amino acid substitution R292K was of particular significance, as this mutation results in complete resistance to oseltamivir and zanamavir (2). The results obtained from translation of the NA gene from RPM-Flu resequencing analysis of the H5N1-positive samples confirmed that 21 avian H5N1 samples with good base call rates harbored the E119V mutation. Furthermore, the R152K mutation was not observed in 17 out of 21 samples, the R292K mutation in 20 out of 21 samples, and the H274Y mutation in 9 out of 21 samples (data not shown). These data indicated that most of the avian H5N1 samples identified were presumably sensitive to neuraminidase inhibitors. Specific single-amino-acid substitutions in any one of four critical amino acids of the M2 protein, L26F, V27A/T, A30T/V, and S31N/R, have been associated with amantadine resistance (10, 29). Unfortunately, the results obtained from translation of the M2 genes did not give conclusive results for amantadine resistance (data not shown).
|
|
|---|
The use of two additional PCR-based methods illustrates that other assay approaches can provide similar detection of avian influenza virus but that additional tests are required for the number of pathogens tested for by RPM-Flu to be matched. While detections of the presence of avian influenza virus were equivalent, different results were obtained from the methods for serotyping 11 of the 37 samples. Specific PCR remains one of the most inexpensive molecular diagnostic methods, making it useful for identifying the presence of current circulating strains. However, due to its limited scope, it is not easily applied for identifying reassortments or new variants or for broad-spectrum serotyping of diverse samples, as is described here for the non-H5N1 avian influenza virus samples tested.
RT-PCR-ESI-MS requires a high initial capital investment but is operationally a cost-effective method if the high-throughout capabilities of the platform can be utilized. Furthermore, this approach is able to provide more-detailed information beyond mere identification, such as its capacity to track the evolution of circulating human influenza virus. The previously reported capability of this method for serotype identification is largely confirmed in this study (30). RT-PCR-ESI-MS inferentially determines the identity of influenza viruses through base composition analysis of PCR segments of polymerase (PB1, PB2, and PA), M, NS1, and nucleoprotein (NP) genes (30). This inferential method may lead to the discrepancies seen in samples representing less prevalent subtypes with limited sequencing results in the database. Although this system is capable of reporting additional information, the tests and analysis applied to the samples in this study only provide serotyping information.
While the RPM-Flu chips demonstrate a very broad, high-confidence detection and analysis capability, the technology can be improved upon. For example, the current arrays, designed to provide comprehensive coverage of primarily human bacterial and viral pathogens, may not effectively provide detailed mutation and pathogenicity information for all subtypes of influenza virus. This impediment reflects the limits of microarray real estate, which demand that compromises between the depth and scope of coverage must be made. The limitation will become less of a factor as the capacity and cost of resequencing microarrays improve. In addition, no conclusive results could be obtained from the critical amino acids associated with amantadine drug resistance in the M2 protein. This is likely due to the fact that several nucleotide polymorphisms occur within the same 18-base-pair region with respect to the sequence used on the microarray. More specifically, clustered single nucleotide polymorphisms have a higher probability of not being identified on a resequencing microarray (21). Therefore, only ambiguous calls are made within the region, and no positive evidence of the mutation is provided.
Future iterations of RPM design can alleviate detection of bases in clustered mutation hotspots by including additional probes. Another issue that can be overcome in future iterations by modifying design strategies and analysis algorithms is the current difficulty in confirming sequence deletions. Finally, it should be noted that the absolute analytical sensitivity of the human influenza virus primers is slightly higher than that of the avian influenza virus primers, suggesting that adjustments in the primers may improve base call rates by providing more amplicon. To maximize the benefits of using a resequencing microarray for avian samples, an array that detects many pathogens of concern in bird samples will provide more information and could be placed on a smaller, less costly microarray. Efforts are currently ongoing to develop such an array.
Funding for this research was provided in part by the Office of Naval Research via the NRL base program. Partial support for this project came from TessArae, LLC (Potomac Falls, VA), through a cooperative research and development agreement. There are two pending patent applications, one for an amplification method and one for an analysis algorithm, that are related to this article. Clark Tibbetts is chief technology officer of TessArae, LLC, which is a for-profit organization. None of the authors from the NHRC or NAMRU3 have a financial conflict of interest.
We applied the sequence-determines-credit approach for determining the sequence of authors (35).
The opinions and assertions contained herein are those of the authors and are not to be construed as those of the U.S. Navy or military service at large.
Published ahead of print on 11 March 2009. ![]()
Supplemental material for this article may be found at http://jcm.asm.org/. ![]()
Current address: College of Science, George Mason University, Fairfax, VA 22030. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»