Previous Article | Next Article ![]()
Journal of Clinical Microbiology, January 2006, p. 244-250, Vol. 44, No. 1
0095-1137/06/$08.00+0 doi:10.1128/JCM.44.1.244-250.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Argonne National Laboratory, Argonne, Illinois 60439,1 Pacific Northwest National Laboratory, Richland, Washington 99352,2 Brigham Young University, Provo, Utah 846023
Received 9 May 2005/ Returned for modification 3 July 2005/ Accepted 19 October 2005
|
|
|---|
|
|
|---|
Because diagnostic nucleic acid signatures are not and may not be known a priori for all organisms of interest, gel-based DNA fingerprinting techniques continue to dominate microbial epidemiology studies (see, e.g., references 12, 16, and 20). However, it is well recognized that current genotyping methods frequently do not discriminate between isolates. Gel-to-gel positional variations in internal standards and the test sample is particularly troubling, for example, because it necessarily leads to increased bin sizes and decreased resolving power in cross-gel comparisons (see, e.g., reference 21). The positional variations in gels, however, also begs the following questions: what are the objective criteria for including or excluding data from a gel-based DNA fingerprint and how does one generate error bars and statistical confidence to test the hypothesis of profile equivalence?
DNA microarrays provide physically fixed data features, are readily amenable to replication, and provide an alternative technology base for developing quantitative DNA fingerprinting methods. We are particularly interested in developing a simple, low-cost, diagnostic genotyping product and method for microbial epidemiology, while retaining sufficient resolving power to discriminate between strains that may be indistinguishable by conventional techniques. Inherent in this objective is a need to develop standard operating protocols and normalization controls that will (ultimately) allow for quantifiable and objective comparisons across days, users, or laboratories.
Bacterial isolates used for this study are listed in Table 1. Bacillus near-neighbor isolates were grown in nutrient broth (Difco, Sparks, MD) at 29°C and 450 rpm for
48 h. American Type Culture Collection (ATCC) isolates (e.g., outliers) were purchased as genomic DNA preparations from the vendor. B. anthracis isolates were cultivated, and genomic DNA was isolated under appropriate biosafety level 3 controls as described in reference 11. Nucleic acid integrity from all genomic DNA preparations was analyzed by gel electrophoresis on 0.8% single-comb E-gels (Invitrogen, Carlsbad, CA). DNA concentrations were determined in solution by UV absorbance (UV/visual-light spectrophotometer Lambda Bio 10; Perkin Elmer, Boston, MA) and in-gel by ethidium bromide staining of chromosomal DNA. Only intact genomic DNA of >10 kbp in length was utilized for subsequent PCR and microarray analysis.
|
View this table: [in a new window] |
TABLE 1. Bacterial isolates used in this study
|
Figure 1A shows that the REP-PCR is quite reproducible by conventional microbiological standards, with the same (qualitative) level of PCR reproducibility observed for all other isolates in the study (not shown). Thus, variations in microarray fingerprints (below) are not due to PCR bias or error during the sample-processing steps. From this simple gel analysis, however, it is readily apparent that the two B. anthracis isolates are indistinguishable based on a conventional REP-PCR test. In the same way, the REP-PCR gels could not differentiate between B. thuringiensis strains HD-571 and Al Hakum and between B. cereus strains 3A and S2-8 (Fig. 1B). The positional variation in gel bands, background smears, and related gel artifacts underscore the qualitative nature of gel-based genotype comparisons.
![]() View larger version (46K): [in a new window] |
FIG. 1. Reproducibility of B. anthracis REP-PCR. (A) Agarose gel electrophoresis of REP-PCR products amplified with genomic DNA from two Bacillus anthracis isolates, shown in sequential order (A0392 followed by A0362 for all 4 days). One hundred nanograms of each genomic DNA template was amplified in replicate tubes each day for four separate days, as described in the text, and 10% of the PCR amplification products were analyzed on a 2% agarose gel containing ethidium bromide. Gel images were stretched and adjusted in Adobe Photoshop 7.0 to account for different electrophoresis results and exposure times in the creation of this image. While the fingerprints are qualitatively reproducible, the two B. anthracis isolates cannot be differentiated (but see results in Fig. 3 and 4). Similar patterns of PCR reproducibility were obtained for all other isolates used in this study (not shown). (B) REP-PCR fingerprints for 12 of the 13 isolates used in this study.
|
|
View this table: [in a new window] |
TABLE 2. Mix-10 control targets
|
The target DNA for the 12 hybridizations for each bacterial isolate was derived from six independent amplification reactions, split evenly between two arrays. REP-PCR fragments from three different organisms were hybridized to three arrays on the same slide according to a balanced incomplete block design, where each slide is treated as a block (24) so that paired strains were directly compared on the same slide exactly twice. The fourth array on each slide was hybridized with the Mix-10 standard targets, all diluted in an equimolar ratio to a final concentration of 1.53 nM (each) in the hybridization solution. Each bacterial species or strain was hybridized to 12 replicate arrays, and paired strains were directly compared on the same slide exactly twice, with no triple repeated. Hybridization and washing proceeded as described elsewhere (25).
Microarray images were acquired on a custom-built, temperature-controlled fluorescence microscope operating at room temperature. Briefly, slides were illuminated with a 100-W mercury lamp through a D525/50 bandpass filter and Cy3 emissions were collected through a 590DF35 filter. Microarrays were illuminated for 20 s, and images were captured through a custom lens (LINOS Photonics, Inc., Milford, MA) as img files with a 12-bit SenSys charge-coupled-device camera (Photometrics, Tucson, AZ) at a resolution of 1,536 by 1,024 pixels. Image analysis was performed using the freely available Automated Microarray Image Analysis Toolbox for Matlab (23; http://www.pnl.gov/statistics/amia). Spot identification routines are implemented using a seeded-region-growing method adapted from Hojjatoleslami and Kittler (10). To ensure accurate spot finding against a potentially distorted printing grid, an analyst manually identified several spots and reran the automated algorithm until all spots in the alignment were accurately identified. For each spot, the average pixel intensity value and the average (local) background intensity value were exported to a spreadsheet for statistical analysis as described in detail elsewhere (24). In brief, we calculated the log(mean spot pixel intensity) minus the log(mean background pixel intensity) for each probe over all replicate arrays (n = 12). The only explicit normalization performed in the analysis is to center low-end probe histogram modes for each array. Summary statistics for each probe were computed using a mixed-effects linear analysis of variance (ANOVA) model that parallels the incomplete block design. F-statistics from the ANOVA calculations were used to identify discriminating probes significant at
= 0.01. For the discriminating probes, the cell means parameterization of the linear model was used to obtain estimates of relative hybridization, which were grouped using a finite mixture procedure for grouping treatment means. The number or proportion of microarray probes and their relative hybridization intensities therefore provide a quantitative measure of difference between the isolates that can be tested for significant differences with any number of multivariate statistical procedures.
It is clear from the literature that microarray results can be highly varied (see, e.g., reference 15). The Mix-10 control targets were therefore used to understand underlying microarray variability, independent of the genetic variation between organisms or method level variability associated with nucleic acid extraction and/or PCR amplification prior to microarray hybridization. For the microarray design reported here, each array consisted of 400 spots (391 probes with nine controls) printed with a 4-pin print head (i.e., 100 spots per pin, wherein 1 pin defines a subarray). To determine if calculated probe intensities varied between pins during microarray manufacture, the average hybridization intensities in each subarray (over all 100 probe spots) were normalized to mean 0 and standard deviation 1 and compared for each array using ANOVA. Among the 52 Mix-10 control arrays, at least 1 subarray was statistically distinct from the other subarrays in approximately 31% of the slides (at
= 0.05). Similarly, we utilized a linear ANOVA to analyze the hybridization intensity of each of the perfectly matched Mix-10 capture probes across days (or print lots). Table 3 shows that for all but one of the Mix-10 probes and control targets (probe/target 119), there is a significant print day effect at
= 0.05. Hence, there are clearly significant pin and day effects during the manufacture of printed arrays, effects that must be managed in a standard operating protocol through biological replication across print lots (and pins).
|
View this table: [in a new window] |
TABLE 3. Average log signal/background values and standard deviations for perfectly matched Mix-10 control probes and targets based on print lot (or day)
|
Scatter plots were generated to compare actual hybridization intensity for each positive probe on each array with the median hybridization intensity for each probe over all replicate arrays (n = 12 for bacterial isolates; n = 52 for the Mix-10 standard). Both Fig. 2 and Table 4 show that the Mix-10 standard and REP-PCR hybridizations for bacterial isolates are equally varied and reproducible, even in the face of unpredictable cross-hybridization and false positives. The average R2 for the Mix-10 data are somewhat lower than those for the bacterial isolates, most likely due to the few number of positive capture probes (66 reproducibly positive signals) relative to the number from a typical bacterial REP-PCR hybridization pattern (>200 reproducible signals). As such, the Mix-10 itself may not be an ideal or perfectly representative standard as presently configured, but it nonetheless accurately reflects hybridization behavior and microarray probe responses to bacterial REP-PCR products. Hence, the fingerprinting method described here is making conservative, unbiased conclusions relative to underlying biological differences between isolates, rather than displaying differences due to measurement noise or error. From these data (Fig. 2; Table 4), we are cautiously optimistic about using a Mix-10 (or similar) synthetic standard as part of a quality control and normalization procedure and recommending 12 replicate hybridizations as an upper boundary on any standard experimental procedure. The extent to which the total number of replicates can be reduced while still providing statistically significant and quantifiable DNA fingerprints will be determined in future studies with larger collections of isolates.
![]() View larger version (26K): [in a new window] |
FIG. 2. Fingerprinting array method level reproducibility. Scatter plots for hybridization (A) and method level (B-D) reproducibility. Panel A shows results for a randomly selected Mix-10 array relative to the median intensity for 66 probes (52 replicate arrays), reflecting the behavior and reproducibility of the microarray itself during hybridization. The R2 value is for the single array and regression line shown in the panel. Average R2 values are in Table 4. Panels B-D show results for B. thuringiensis 97-27, B. anthracis A0392, and Campylobacter jejuni 11168 (12 replicate arrays), respectively, which are inclusive of PCR amplification and labeling steps. Results were similar for all other isolates, as shown in Table 4.
|
|
View this table: [in a new window] |
TABLE 4. Average R2 values for bacterial REP-PCR products and the Mix-10 standard
|
= 0.01, 212 of the 391 probes (54%) are differentially detected and/or hybridized across the 13 isolates. That is, for the 212 probes, the intensity differences between isolates are significantly greater than the intensity differences between replicate hybridizations for the same isolate. Discriminatory fingerprints were constructed from the ANOVA output using a specially designed finite mixture procedure described in reference 24, which provides a discrete fingerprint (analogous to gel-based fingerprints) in which isolates are assigned to a small number of probe level groups according to relative intensity. Figure 3 shows the discriminatory fingerprints for the 13 isolates, where the probes have been reordered (clustered) according to their similarity across isolates, and bands are shaded according to relative hybridization intensity at each discriminating probe.
![]() View larger version (27K): [in a new window] |
FIG. 3. Simultaneous clustering of probes and isolates. Only the 212 discriminatory probes are plotted here. Probes are clustered and reordered according to their correlation across isolates. Jaccard's distance was used to create the dendrogram.
|
= 0.01) one from the other, even amid the microarray manufacturing and hybridization variability described above and despite the fact that three isolate pairs could not be differentiated based on a typical gel electrophoresis pattern (Fig. 1B).
![]() View larger version (81K): [in a new window] |
FIG. 4. Numbers of fingerprint differences between pairs of isolates. There are 212 discriminatory probes (out of 391) for the set of 13 isolates tested in this study and analyzed with the balanced incomplete block statistical design. Cells in the image plot are color coded according to the number of discriminating probes, from fewest (black) to most (white). The actual number of discriminating probes between isolate pairs is shown within the color-coded cells. B thur, Bacillus thuringiensis.
|
It can certainly be argued that multiple PCR amplifications and 12 replicate arrays per isolate are impractical for routine forensic or diagnostic purposes, but any (present) judgment should be balanced against the quality and statistical rigor of the resulting information (microbial fingerprint). That is, a natural inclination or assumption underlying the development of universal genotyping microarrays is that more probes are "better" and that large data sets are a suitable substitute or proxy for method level reproducibility (e.g., see arguments and rationale in reference 3). For a complete data set or experimental design, these assumptions may hold and generate a robust microarray pattern for qualitative comparison (depending upon how one analytically defines "robust"). On the other hand, we argue that data are not equivalent to information. Hence, for the diagnostic problems of library construction, quantitative comparisons against libraries and reference databases, and dealing with the practicality of routine analyses, we argue that method level reproducibility and information quality are more important than data volume. In this context, a question worth asking is whether high-density arrays are even necessary or helpful for the end user. What we have shown here is that a very simple (391 probe) nonamer array, combined with appropriate (method level) replication and quantitative statistical techniques, can easily and reproducibly differentiate between strains of Bacillus anthracis and near neighbors, a group of organisms that are notoriously monomorphic and difficult to differentiate by many classical molecular taxonomy techniques.
We thank Anne Gemmell for assistance in the cultivation and extraction of nucleic acids from bacterial isolates.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»