Previous Article | Next Article ![]()
Journal of Clinical Microbiology, December 2004, p. 5793-5801, Vol. 42, No. 12
0095-1137/04/$08.00+0 DOI: 10.1128/JCM.42.12.5793-5801.2004
Center for Biologics Evaluation and Research, Food and Drug Administration, Rockville,1 SRA International Inc., Bethesda, Maryland2
Received 4 May 2004/ Returned for modification 6 July 2004/ Accepted 30 July 2004
|
|
|---|
|
|
|---|
Gradual antigenic drift and intermittent occurrences of more substantial antigenic shifts make the creation of new prospective seed virus strains for both inactivated and live attenuated influenza vaccines a continuous process that must be completed within a short period before each epidemic season. To be effective, prospective vaccine strains must not only possess the antigenic properties of the currently circulating strains but also replicate well in the substrate used for their manufacture. Some naturally occurring influenza virus isolates do not grow well in substrates used for vaccine production (e.g., embryonated chicken eggs). Adaptation of influenza viruses for optimal growth by passaging them in a target substrate is a time-consuming process that may also allow the accumulation of random mutations that can alter antigenic specificity. An alternative strategy is to create reassortants between the currently circulating strains and another strain previously adapted for high growth in vitro (2, 21). The reassortants should contain genes coding for the protective antigens (hemagglutinin [HA] and neuraminidase [NA]), as well as genes determining the optimal replicative properties of the virus. Selection of appropriate reassortants is aided by reverse transcription (RT)-PCR and restriction fragment length polymorphism (RFLP) tests of multiple candidate clones (13, 18) and multiplex RT-PCR with single-strand conformation polymorphism analysis (1). However, none of these methods is fully adequate, and the creation of new rapid, highly specific, sensitive, and reproducible segment-mapping techniques remains highly desirable.
Hybridization with microarrays of immobilized oligonucleotides is a rapid alternative to DNA sequence analysis (6, 12) and has been used for the genotyping of viruses, bacteria, and genes of higher organisms (7, 9, 10, 20). Recently, a microarray method has been used for the successful genotyping of the VP7 gene of rotaviruses (8) and for analysis of intermolecular recombinants of poliovirus (5), as well as for detection and discrimination of orthopoxviruses (14). Recently, the potential of microarray hybridization for detection and discrimination between types and subtypes of influenza A viruses has been shown by using 24 long oligoprobes (average, 500 nucleotides [nt] [16]). Shorter oligoprobe microarrays (between 4 and 128 nt) have been developed for the detection of influenza viruses (22). However, in the latter study, the intensities of probe spots completely complementary to target sequences lacked definitive boundaries between significant hybridization and nonspecific background binding, and most of the monospecific NA and HA subtype probes have not been discriminated by the array.
For better discrimination of closely related segmented RNA viruses, we report a new high-throughput oligonucleotide microarray-based method to map the origins of all segments of two closely related influenza B viruses, B/Beijing/184/93 and B/Shangdong/7/97, which were used as the models for development of the method. The two influenza B viruses were chosen for the study because B/Shangdong/7/97 is a naturally occurring reassortant with three segments (PA, NP, and M) derived from a precursor virus phylogenetically related to B/Beijing/184/93 and therefore presented a challenge by requiring the ability to distinguish between single-base substitutions. In addition, the strains possess different growth characteristics in embryonated chicken eggs (23) and may serve as a model for the reassortment approach to the creation of influenza B vaccines. The proposed microarray approach enabled us to successfully discriminate all eight segments of the Beijing and Shangdong strains despite minimal differences between them.
|
|
|---|
(ii) RNA preparation and reverse transcription. Viral RNA was extracted from allantoic fluid using the Viral RNA Mini kit (QIAGEN, Valencia, Calif.), and cDNA was synthesized using Thermoscript RT polymerase (ThermoScript RT-PCR System; Invitrogen, Carlsbad, Calif.) with an AGCAGAAGC primer complementary to the conserved 3' ends of all influenza B virus genome segments, either alone or in a mixture with a random hexanucleotide (dN6). The resulting cDNA was used for multiplex PCR amplification.
(iii) Nucleic acid sequencing and analysis. The nucleotide sequences of all eight segments of B/Beijing/184/93 were obtained from GenBank. The sequences of genome segments of B/Shangdong/7/97 were determined in our laboratory (23).
cDNA samples were prepared by 30 cycles of PCR amplification (HotStarTaq Master Mix; QIAGEN) using primers specific for partial sequences of eight genome segments. Amplification products were separated by electrophoresis in 1% agarose gel, extracted with the QIAquick Gel Extraction kit (QIAGEN), and sequenced using a dRhodamine Terminator Cycle Sequencing kit (PE Biosystems, Warrington, United Kingdom) and an ABI Prizm model 377 (Applied Biosystems, Foster City, Calif.).
(iv) Multiplex PCR and synthesis of Cy5-labeled hybridization samples. Amplicons of all eight influenza B virus genome segments were prepared in two multiplex PCRs; the primer pair for the PB2, PB1, PA, and HA segments in one multiplex PCR was B1-S1, and the primer pair for the NP, NA, M, and NS segments the other multiplex PCR was B2-S2 (B and S refer to Beijing and Shangdong, respectively). The sequences of the PCR primers are listed in Table 1. One primer in each pair was 5' biotinylated for strand separation and preparation of Cy5 fluorescently labeled single-stranded DNA samples as previously described (8). Briefly, 30 µl of a reaction mixture containing 1x AmpliTaq PCR buffer with 1.5 mM MgCl2; a 200 nM concentration of each primer; 20 µM Cy5-dCTP; 20 µM dCTP; 200 µM (each) dATP, dGTP, and dTTP; and 0.8 U of HotStartTaq DNA Polymerase (QIAGEN) was used. Amplification was performed by 35 PCR cycles consisting of 30 s at 94°C, 30 s at 45°C, and 60 s at 72°C. Single-stranded DNA was prepared by the separation of DNA strands using streptavidin-coated magnetic beads (Dynal ASA, Oslo, Norway) according to the manufacturer's protocol. Fluorescently labeled single-stranded DNA was eluted from the magnetic beads by washing them with 50 µl of 0.1 M NaOH purified through CentriSep spin columns (Princeton Separation, Adelphia, N.J.), concentrated by vacuum drying, and finally resuspended in water.
|
View this table: [in a new window] |
TABLE 1. PCR primers used for amplification of genomic segments of influenza B virus
|
|
View this table: [in a new window] |
TABLE 2. Oligoprobes used for discrimination between Beijing and Shangdong strains of influenza B virusa
|
![]() View larger version (19K): [in a new window] |
FIG. 1. Influenza B virus array design. The array consists of four identical subarrays. A solid circle indicates a spot containing printed oligoprobe; an open circle indicates a control spot containing no printed oligoprobe to assess background sample binding. Each subarray contained 40 oligoprobes (Table 1). The left part of each subarray contained 20 oligoprobes complementary to the Beijing strain; the right part contained 20 oligoprobes complementary to the Shangdong strain. The pairs of homologous oligoprobes of the Beijing and Shangdong strains were printed symmetrically across the vertical axis of a subarray. Each subarray consisted of four areas. Each area contained oligoprobes complementary to one of the four fluorescently labeled cDNA samples: B1, B2, S1, or S2 (see "Multiplex PCR and synthesis of Cy-5-labeled hybridization samples"). To assist in the visual presentation, the areas of oligoprobes complementary to the B1 and S1 samples are separated from the B2 and S2 samples.
|
(vi) Array design. Figure 1 shows the layout of oligoprobes on the microarray. Each array consisted of four identical subarrays, and each subarray contained 20 oligoprobes redundantly representing eight segments of the B/Beijing/184/93 strain and 20 oligoprobes representing eight segments of the B/Shangdong/7/97 strain. Each subarray consisted of four areas containing oligoprobes complementary to one of four fluorescently labeled cDNA samples designated B1, B2, S1, and S2 (see "Multiplex PCR and synthesis of Cy-5-labeled hybridization samples" above).
Eight identical arrays were printed on each slide. Therefore, each of the four fluorescently labeled cDNA samples could be hybridized twice on each slide. This design enabled us to produce eight replicates of the homotypic hybridization signal and eight replicates of the heterotypic hybridization signal (used as the negative control) for each of the 20 unique oligoprobes spotted on a slide. For example, the Beijing sample B1 could hybridize to eight homotypic Beijing-specific oligoprobes (PB2-B-591) and to eight heterotypic Shangdong-specific oligoprobes (PB2-S-591).
(vii) Microchip scanning and data analysis. Microchip images were captured using a ScanArray 5000 confocal fluorescent scanner (Perkin-Elmer, Boston, Mass.) with two HeNe lasers (632 nm for excitation of Cy5 and 543 nm for Cy3). The fluorescent images were analyzed using QuantArray software (Perkin-Elmer).
(viii) Restriction enzyme digestion. Three micrograms of each NS segment amplicon was digested at 30°C for 65 min in 20 µl of reaction mixture containing 2 U of the HgaI restriction enzyme (New England BioLabs Inc., Beverly Mass.). The products of digestion were separated by electrophoresis in 9.5% polyacrylamide gels.
Statistical methods: background subtraction and signal normalization. Let A represent the value of the intensity of the hybridization signal of a given fluorescently labeled cDNA sample hybridized with the homotypic oligoprobe on the array; let B represent the value of the intensity of the hybridization signal of the sample hybridized with the heterotypic oligoprobe on the same array. Let K be the value of the background intensity of the signal measured in mock spots on this array (printed without oligoprobe) (Fig. 1). We measured K values in 64 mock spots on each array.
To reduce the effect of background (nonspecific binding of the fluorescently labeled cDNA sample to the array) and to compare the homotypic hybridization signals across arrays and different slides, we estimated the relative level of homotypic hybridization signal for a given oligoprobe by the following formula:
![]() | (1) |
)/(
) and IB = (B
)/(
).
is the mean value of background intensities averaged over the 64 signals measured in mock spots on the array (printed without oligoprobe [Fig. 1]).
is the mean value of the homotypic signal averaged over four subarrays on the array. Because the oligoprobes of the array were hybridized to only one of the four labeled cDNA samples, the local (on the array) normalization of the hybridization signal on factor
allowed us to compare the interarray and interslide variabilities of the specific signals for a given oligoprobe. The four pairs of IA and IB values on each array were compared by Wilcoxon's signed-rank pair test. We also used the Kruskal-Wallis test for multiple comparisons of the arrays in the replicated experiments and a paired t test and a one-way analysis of variance test to compare the IA and IB values in the replicated experiments. Estimation of the number of replicates of arrays and the confidence probabilities of the false-negative rate for hybridizations is described in the appendix.
The MLAB (Civilized Software, Inc., Silver Spring, Md.), Statistica-5 (StatSoft, Inc., Tulsa, Okla.), and JMP (SAS Institute Inc.) programs were used for statistical analysis.
Nucleotide sequence accession numbers. The sequences of genome segments of B/Shangdong/7/97 determined in our laboratory (23) were deposited in GenBank under accession numbers AF 484967, AY044173, AF486835, AF486836, AY044169, AY044172, AY044171, AY044170, AF101974, AF101992, AF102009, AF050061, AF100367, AF100381, and AF100399.
|
|
|---|
![]() View larger version (63K): [in a new window] |
FIG. 2. Typical hybridization for reference virus strains B/Beijing/184/93 and B/Shangdong/7/97 on a subarray. For purposes of visual presentation, hybridization patterns obtained for multiplex PCR products containing PB2, PB1, PA, and HA segments and NP, NA, M, and NS segments were combined in one image. (A) Hybridization spots for labeled single-stranded DNA samples of Beijing virus homotypically hybridized with 20 Beijing-complementary oligoprobes and heterotypically hybridized with 20 Shangdong-complementary oligoprobes (control). (B) Hybridization spots for labeled single-stranded DNA samples of Shangdong virus heterotypically hybridized with 20 Beijing-complementary oligoprobes (control) and homotypically hybridized with 20 Shangdong-complementary oligoprobes. The hybridization spots in the rows of a subarray represent hybridizations of a Cy-5 fluorescently labeled single-stranded DNA sample with different oligoprobes printed in the row. For purposes of visual presentation and inspection, the vertical line going through the subarray separates three columns containing the Beijing-complementary oligoprobes (on the left side of the subarray) and the three columns containing the Shangdong-complementary oligoprobes (on the right side of the subarray). Each row on a subarray represents oligoprobes of one genome segment of influenza virus. The oligoprobes were printed in a mirror symmetry manner relative to this line (see details in Fig. 1). The identifiers of genome segments are showed on the left in panels A and B. The PB2, PA, NP, and NS genome segments of the Beijing and Shangdong strains are represented by three different oligoprobes and by three observed hybridization spots. The PB1, NA, HA, and M genome segments of these strains are represented by two different oligoprobes and by the two hybridization spots. In the last case, two "empty" spots on the rows of the subarray are observed.
|
![]() View larger version (110K): [in a new window] |
FIG. 3. Discrimination of hybridization patterns of four different reassortants on a subarray. See details in the text. Microarray elements specific to the prevailing RNA segment are boxed.
|
|
View this table: [in a new window] |
TABLE 3. Significance of homotypic hybridizations for oligonucleotides of influenza B virus chipa
|
|
View this table: [in a new window] |
TABLE 4. Characteristics of reliability of influenza B virus chipa
|
|
View this table: [in a new window] |
TABLE 5. Comparison of sequencing and microarray methods by mapping of influenza B virus reassortants on all influenza B virus genomic segmentsa
|
![]() View larger version (62K): [in a new window] |
FIG. 4. RFLP of NS segments digested by HgaI. Lane 2, reassortant (the size of the band undigested by HgaI is 1,030 bp, and the sizes of the digested bands are 730 and 300 bp); lane 3, Beijing (the size of the band undigested by HgaI is 1,030 bp); lane 4, Shangdong (the sizes of the bands digested by HgaI are 730 and 300 bp); lanes 1 and 5, DNA ladders.
|
![]() View larger version (13K): [in a new window] |
FIG. 5. Hybridization of mixed samples with NS segment oligoprobes. The initial Shangdong sample (175 mM) was diluted to final concentrations of 10 and 5% in a Beijing sample (175 mM) and then hybridized with Shangdong-specific oligonucleotides. The heterotypic hybridization signal of the Beijing sample to Shangdong-specific oligoprobes was set as the control. The hybridization was repeated five times. The error bars represent standard deviations.
|
|
|
|---|
The rational design of vaccines against influenza includes the generation of reassortants between a recent field isolate and a reference strain adapted for efficient growth in the substrate used for vaccine production. After a variety of reassortant strains is generated in vitro by growing two parental strains together, a prospective vaccine strain is selected by genotyping all relevant genomic segments to identify the right combination of segments coding for immunogenic proteins and those determining high-growth characteristics. Traditionally, this time-critical task of selection of prospective vaccine strains has been accomplished by RFLP, sequencing of cDNA, or other approaches (1, 13, 18). The recently developed high-throughput microarray analysis method appears to be ideally suited for one-step mapping of segmented viruses.
Microarray hybridization was previously used to genotype individual segments of influenza A virus (16, 22). In our work, we have developed and used an improved oligonucleotide microarray method to discriminate viruses with high levels of genomic homology through identification of minor genetic differences, including single-nucleotide variations, simultaneously in all segments of influenza B viruses. We observed unambiguous discrimination between the B/Beijing/184/93 and B/Shangdong/7/97 influenza strains. These strains were discriminated with statistical certainty by using different oligoprobes, including PA-1979, PA-2034, NP-188, and M-1004, differing at only 1 nucleotide and representing the highly homologous segments PA, NP, and M of both viruses. The high level of nucleotide homology between the PA, NP, and M segments of B/Shangdong/7/97 and B/Beijing/184/93 has been an obstacle to the genotyping of reassortants derived from the two strains by RT-PCR and RFLP because of the lack of suitable restriction sites (23).
Microarray hybridization, coupled with statistical interpretation of the results, allowed us to estimate the effects of cross-hybridization and to account for nonspecific sample binding (background). Our statistical analysis demonstrated high levels of sensitivity and reproducibility for all individual oligoprobes to all influenza B virus segments; P values for specific hybridization signals for all of the 40 selected oligoprobes were highly significant (P < 0.01).
The statistical planning of the number of hybridization experiments developed in this work can be used in the future in the design of similar oligonucleotide microchip experiments to assess the discriminating power of individual oligoprobes and to select only those that produce the clearest results. Identification of the origin of each segment based on hybridization with two or more specific oligoprobes significantly increased the reliability of identification. To this end, we estimated the maximum probability of false-negative identification of each segment based on multiple results for individual oligoprobes in 56 hybridization experiments. The P values for segments of different origin were calculated by multiplication of P values for homotypic hybridization with oligoprobes specific to each segment. They were statistically significant and demonstrated high overall discriminating power of the microarray for all eight segments of influenza B virus.
Note that our approaches for estimating the minimum number of replicate experiments and for estimating the false-negative error rate for a given gene segment seem novel for microarray genotyping. However, depending on the specific designs of microarray experiments, other statistical approached can also be useful (22a).
Interestingly, the influenza B virus microarray can detect the presence in mixed viral stocks of as little as 5% of a second closely related influenza B virus strain (Fig. 4). The greater ability to detect mixed viral populations is an important advantage of the method over conventional methods based on nucleotide sequencing, since natural isolates often contain more than one genotype. Although the restriction pattern of DNA shown in Fig. 4 indicates that the NS segments of both the Beijing and Shangdong strains were present, similar analyses of other segments and other reassortant strains prepared as vaccine candidates need to be studied in the future. Our preliminary experiments presented here showed that the method could not be used to quantify the components of a mixture without further refinement. The ability to detect as little as 5% of a second strain in a mixture on one hand and the lack of statistically significant hybridization between the values obtained for 5% and increasing spike concentrations on the other suggest that the dose-response curve may be nonlinear. There may a number of reasons for this, including saturation of immobilized probes during hybridization and nonlinear response of the fluorescence detector. Further experiments are needed to make this method more quantitative and to accurately assess the detection limit for mixtures.
The data presented in this paper suggest that the microarray method can be used for high-throughput screening of influenza B virus reassortants, substantially reducing the time needed for vaccine development. The influenza B virus array described in this paper contains only a few oligoprobes specific to each segment of two influenza B virus strains, Beijing and Shangdong. The microarray method can be expanded by adding new oligoprobes, including those specific to other strains, possibly covering the entire repertoire of currently circulating influenza B virus strains. Such extension of the array might allow us to use a more quantitative assay to determine the origins of individual genomic segments in a series of reassortant strains prepared as vaccine candidates or mixed virus populations.
Our preliminary analysis of the repertoire of vaccine reassortants suggests a highly biased (nonrandom) generation of certain segment combinations. Extensive studies of a large number of reassortant strains may help to reveal the preferred combination of segments and to examine the reason for this nonrandomness of the reassortment process.
It is known that rates of homologous recombination in negative-sense RNA viruses, including influenza virus, are substantially lower than their point mutation rates (24). However, patterns of sequence variations compatible with the action of recombination were recently observed in several RNA viruses, including influenza A virus and influenza B virus (3). A search for the evidence of within-segment recombination in influenza viruses could be done by a microarray method similar to the one developed to analyze recombinants in poliovirus (5).
The ability to perform rapid and comprehensive genomic analysis of multiple field isolates can help in efforts to recognize and respond to the emergence of new pandemic strains of influenza virus by providing a better understanding of the processes of viral evolution in populations of susceptible hosts. Similar approaches can also be developed for analysis of influenza A viruses and for other viruses with segmented genomes and can substantially advance our understanding of their evolution and help to create new effective vaccines.
|
|
|---|
5% of homotypic hybridizations.
Let us plan to draw random samples X1, X2,..., Xn of the n hybridizations for a given homotypic oligoprobe. Let A represent the homotypic hybridizations showing a positive value of I (true event). Let B represent the homotypic hybridizations showing a negative or zero value of I (false-negative homotypic hybridization). We assume that the events B are rare or unlikely events. Let p represent the probabilities of event B, and let (1 p) represent the probability of event A. These A and B events can be considered mutually independent and identically distributed events. Let
represent the level of significance for maximum probability of rejection of a null hypothesis, which is ascribed to a confidence probability of the events B after n true binomial experiments having probability (1 p) in each of the n experiments. Let pB represent a maximum value of a probability of the event B for a given oligoprobe. Then, using a binomial probability distribution function BIN(0;n,pB) (11), we have P (events B in n true homotypic hybridizations) = (1 pB)n.
Then, assuming that P =
= 1 ß, we obtain:
![]() | (2) |
![]() | (3) |
= 1 ß = 1 0.95 = 0.05; then, using equation 3 we obtain n = log0.05/log(0.95)
58. Thus, we expect that
58 replicated hybridization experiments for each homotypic oligoprobe should be sufficient to get an expected reliability for the purpose of planning experiments. In actuality, we analyzed seven slides containing 56 replicates of each of the 40 distinct homotypic oligonucleotides. Estimating the reliability of homotypic hybridizations for genomic segments. When no error is observed in all 56 of the replicated experiments, we can estimate the maximum probability of false-negative events (homotypic hybridizations), called B, by equation 2: pB = 1 (1 ß)1/n. Then, pB = 1 (1 0.95)1/56 = 0.05209.
When I, 2, ..., k,... errors are found, the maximum probability of false-negative events, pB, can be estimated by the binomial probability distribution BIN(k;n,pB) (11):
![]() | (4) |
We selected from two to four homotypic oligoprobes for each of the eight genomic segments to use in hybridization experiments. Let pB1, pB2,... represent the observed frequencies of false-negative events B1, B2,...associated with homotypic hybridization on the first, second, etc. distinct oligoprobes representing a given genome segment. Analysis of hybridization with two or more distinct oligoprobes covering a given segment in the same microarray experiment allowed us to increase the reliability of genomic-segment identification by estimating the joint probability function of false-negative events for a given genomic segment by the formula Ps = pB1pB2. ... For example, when two oligoprobes used for the identification of a given segment had pB1 = 0.0781 and pB2 = 0.0108, respectively, the maximum probability of false-negative identification of the given segment was estimated to be Ps = pB1pB2 = 0.0079.
The work was supported by grants from the Defense Advanced Research Projects Agency (DARPA) and the National Vaccine Program Office (NVPO).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»