Previous Article | Next Article ![]()
Journal of Clinical Microbiology, July 2003, p. 3035-3042, Vol. 41, No. 7
0095-1137/03/$08.00+0 DOI: 10.1128/JCM.41.7.3035-3042.2003
Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, Georgia 30322,1 Division of Healthcare Quality Promotion, Centers for Disease Control and Prevention, Atlanta, Georgia 303332
Received 13 January 2003/ Returned for modification 18 February 2003/ Accepted 19 April 2003
|
|
|---|
|
|
|---|
Computer-assisted analysis of PFGE data allows investigators to create searchable databases of fragment patterns where similarity calculations and cluster analyses can easily be performed (18). Comparisons of software packages to date have given some insight into their ability to identify outbreak-related strains (7) but have not directly addressed the question of compensating for the intra- and intergel variation within PFGE libraries. The present study assessed the impact of three potential sources of intra- and intergel variation of fragment banding patterns: (i) the location of the standards in the gel, (ii) the age of the agarose gel plugs containing the sample DNA, and (iii) the normalization of fragment position by Rf versus MW methods. In addition, the impact of varying matching tolerance algorithms was assessed by using Enterococcus faecium and Staphylococcus aureus PFGE libraries.
|
|
|---|
PFGE protocol for E. faecalis and E. faecium isolates. The isolates were inoculated into 1 ml of Trypticase soy broth (Remel, Lenexa, Kans.) and incubated at 37°C overnight. The turbidity of the broth was adjusted to an optical density of 1.00 at 540 nm with TE buffer (10 mM Tris-HCl [Sigma, St. Louis, Mo.] and 1 mM EDTA [Sigma] at pH 7.5). Cells were washed twice in TE buffer and resuspended in 300 µl of buffer before the addition of 300 µl of melted 2% SeaPlaque agarose (BioWhittaker Molecular Applications, Rockland, Maine; final agarose concentration, 1%). Plugs were prepared in reusable plug molds (Bio-Rad Laboratories, Hercules, Calif.). The bacterial cells in the plugs were lysed overnight in 20 µl of mutanolysin (stock concentration, 10,000 U/ml; Sigma) per 1 ml of EC lysis buffer (6 mM Tris-HCl, 1 M NaCl, 100 mM EDTA, 0.5% Brij-58 [Sigma], 0.2% sodium deoxycholate [Sigma], 0.5% Sarkosyl [Sigma]). Deproteination of the samples was performed by incubating each plug in 0.1 mg of proteinase K (Life Technologies, Inc., Rockville, Md.)/ml of Sarkosyl-EDTA-Tris buffer (10 mM Tris-HCl, 0.1 M EDTA, 1% Sarkosyl) at pH 6.8 for 4 to 6 h in a 55°C water bath. After deproteination, the plugs were washed three times in TE buffer. Slices of the plugs were cut and incubated with the restriction endonuclease SmaI (New England BioLabs, Inc., Beverly, Mass.) for 2 to 4 h in a dry shaker at 25°C. Digested plug slices were inserted into 20-well 1% chromosomal grade agarose (Bio-Rad Laboratories) gels prepared with 0.5x Tris-borate-EDTA buffer (Life Technologies, Inc.). Gels were run on a CHEF DR-III apparatus (Bio-Rad Laboratories) at 14°C for 20 h at 6 V/cm with an initial pulse time of 1 s and a final pulse time of 20 s. Gels were stained in ethidium bromide, destained in fresh distilled water, and photographed. Digital images of the gels were saved as TIFF files by using the Foto/Analyst Archiver (Fotodyne, Inc., Hartland, Wis.) for subsequent gel analysis. Manual banding pattern interpretation was based upon published criteria (20). Computer-assisted analysis was performed with the Advanced Analysis (version 4.01) and Database (version 1.12) modules of Phoretix gel analysis software (Nonlinear USA, Inc., Durham, N.C.). (The modules together are currently named Phoretix 1D Pro.)
PFGE protocol for S. aureus isolates. S. aureus isolates were inoculated into 5 ml of brain heart infusion broth (BD BioSciences, Sparks, Md.) and incubated without shaking at 37°C overnight. The cultures were adjusted with sterile brain heart infusion broth to an optical density of 1.3 by using a Microscan turbidity meter (Dade MicroScan, Sacramento, Calif.). A total of 200 µl of each cell suspension was transferred to microcentrifuge tubes and centrifuged at 13,000 rpm for 5 min. The supernatant was aspirated and cells were resuspended in 300 µl of 0.01 M TE buffer (pH 8.0). To each tube, 4 µl of conventional lysostaphin (stock solution of 1 mg/ml in 20 mM sodium acetate; Sigma) and 300 µl of 1.8% SeaKem Gold agarose (BioWhittaker Molecular Applications) were added, with mixing after the addition of each reagent. Plugs were prepared in nondisposable plug molds. After they solidified, the plugs were transferred to tubes containing 3 ml of EC lysis buffer and then incubated for 4 h at 37°C. After lysis, the plugs were washed four times for 30 min with 4 ml of TE buffer. Slices of the plugs were cut and incubated with SmaI for 4 h at 25°C. Digested plug slices were placed directly onto a 30-well comb that was subsequently placed securely into the gel-casting platform. One percent SeaKem Gold agarose (equilibrated to 55°C) made with 0.5x Tris-borate-EDTA buffer was poured into the gel-casting platform around the comb and allowed to solidify at room temperature for 45 min. Gels were run on a CHEF DR-II (Bio-Rad Laboratories) apparatus at 14°C for 21 h at 200 V with an initial pulse time of 5 s and a final pulse time of 40 s. Gels were stained with an ethidium bromide solution, destained in fresh distilled water, and photographed. Polaroid pictures of the gels were saved digitally as TIFF files for subsequent gel analysis. Manual banding pattern interpretation and computer-assisted analysis were performed as previously described for the E. faecium isolates (20).
Sources of intra- and intergel variation investigated.
The following sources of variability were evaluated to determine their influence on the ability of PFGE analysis software packages to accurately match 32 identical, non-standard-lane pulsed-field patterns on two PFGE gels: (i) variation of standard lane position, (ii) gel plugs made on different days, and (iii) normalization of fragment position by MW and Rf methods. In the software, MW normalization of a fragment position occurred by estimating unknown fragment values from the E. faecalis OG1RF MW standard fragment values by using a log curve-fitting algorithm. The log regression method was used because it appeared to provide the best fit of the E. faecalis OG1RF strain fragment positions run by PFGE compared to other curve type choices in the software (i.e., cubic spline, first-order Lagrange, linear, linear log, and quadratic). Conversely, Rf normalization of fragment position was a linear function of the pixel position that fragments occupied within a lane. Thus, for the Rf method, unknown fragment positions were not estimated based on the "fit" of a log curve; rather, they represented the location of the peak pixel intensity associated with each identified fragment. All statistical comparisons of MW and Rf normalization results were performed by using SAS version 6.12 (SAS Institute, Inc., Cary, N.C.). For each fragment present in the 32 lanes examined, the mean, range, variance, and coefficient of variation (CV; standard deviation divided by the mean) (17) were calculated. CVs, the within-fragment variability normalized by MW (CVMW) and Rf (
), were compared for each fragment by using the Z-test. A P value of
0.05 defined significant associations.
Optimization of matching tolerance studies. The effects of varying the matching tolerance algorithm were assessed by using PFGE libraries of E. faecium (n = 62; 57 isolates were vancomycin resistant) and S. aureus (n = 89; 88 isolates were oxacillin resistant) macrorestriction patterns analyzed by Rf. Optimization of matching tolerance algorithms was achieved by manipulation of the vector box setting (VBS), which is the maximum difference in Rf values between two fragments that the PFGE analysis software defined to match. For both the E. faecium and S. aureus matching tolerance optimization studies, the visually similar group (VSG) was defined as fragment patterns that were within six fragment differences from an arbitrarily chosen reference fragment pattern that was visually similar to more than one fragment pattern in each respective PFGE library. At different VBS settings, unweighted pair group method using arithmetic averages (UPGMA) dendrograms were generated from Dice coefficients for each library. At each VBS, the match similarity on the dendrograms that included all members of the VSG was determined. All isolates included in this subset were defined as the dendrogram group. Within each dendrogram group, the total number of isolates, the number of non-VSG isolates, and the specificity (%) were determined. The sensitivity was 100% by definition since all VSG isolates were included in each dendrogram group. The specificity was defined according to the following relationship: (true negatives/[false positives + true negatives] x 100). False positives were non-VSG isolates located within the dendrogram group. True negatives were non-VSG isolates located outside of the dendrogram group. The dendrogram group with the highest specificity was designated the "optimized" VBS. In a similar manner, the optimized matching tolerance for the E. faecium library analyzed by MW was determined, and results were compared to the Rf analysis.
|
|
|---|
![]() View larger version (123K): [in a new window] |
FIG. 1. Gel 2: lanes containing SmaI-digested E. faecalis OG1RF DNA. Some lane-to-lane variations, or distortions, exist in gel 2. Distortions are pronounced in lanes 4 to 6.
|
Age of agarose plug production. The impact of using agarose plug preparations made on different days on the minimum VBS for both gels was determined. Lot 1 plugs were prepared on a different day than lot 2 plugs. The standard and nonstandard plug combinations used were as follows: lot 1 in both the standard and the nonstandard lanes; lot 1 in the standard lanes and lot 2 in the nonstandard lanes; lot 2 in both the standard lanes and nonstandard lanes; and lot 2 in the standard lanes with lot 1 in the nonstandard lanes. The lanes of each combination were matched together by Rf normalization with the range of minimum VBSs varying between 0.525 and 0.875% (gel 1) and between 0.650 and 1.2% (gel 2). The lowest minimum VBS for gel 1 (0.525%) was achieved by using lot 2 in the standard lanes and lot 1 in the nonstandard lanes. Conversely, the lowest minimum VBS for gel 2 (0.650%) was found by using lot 2 in both the standard and the nonstandard lanes. For all lot 1 and lot 2 standard and nonstandard lanes matched together for both gels, the minimum VBSs were 1.2% and 0.750%, respectively. When gels 1 and 2 were matched to duplicate images of themselves, the minimum VBSs for gels 1 and 2 were 0.975 and 1.0%, respectively.
MW versus Rf normalization. The PFGE fragment position for each lane of E. faecalis OG1RF DNA in gels 1 and 2 was normalized by MW and Rf methods by using Phoretix software. After the automated band finding feature of the software was used, all of the DNA fragment positions of the 38 E. faecalis OG1RF isolates were confirmed visually to ensure that no artifacts were present on the gel prior to the normalization. During the MW and Rf normalization process, the first, middle, and last lanes of each gel were considered standard lanes and were excluded from further normalization analysis, leaving 32 nonstandard lanes for analysis. For normalization of the MW values, the MWs of fragments in the lanes not designated as standard were calculated by the software using the known MWs of E. faecalis OG1RF DNA fragments (13, 20) in the standard lane positions. The Phoretix software fitted a log curve between the E. faecalis OG1RF MW standard fragment values and used this curve to estimate the MWs of the fragments in the nonstandard lanes. The Rf values of fragments, calculated by the software, represented the migratory distance in pixels that fragments traveled relative to the length of each lane on a gel (Phoretix 1D Advanced manual). For Rf analysis, the middle of the loading well in each lane was standardized as the top of the lane. The software calculated Rf values by assigning Rf lines to each fragment position within the leftmost standard lane position. Once these Rf lines were assigned, linear interpolation of pixel position in the software generated Rf values for fragment positions in nonstandard lane positions with "0" for the top and "1" for the bottom of each lane.
Table 1 summarizes statistics calculated from the intrafragment MW values from all 11 fragments in the 32 nonstandard lanes of gels 1 and 2. The actual MWs are different from the calculated MW mean due to the variation associated with how well the log curve "fit" the known E. faecalis OG1RF MWs. The calculated MW range and variance generally decreased from fragments 1 to 11. CVs ranged between 0.0096 and 0.0230 for fragments 1 to 6 and fragments 10 to 11, but they were
0.0070 for fragments 7 to 9.
|
View this table: [in a new window] |
TABLE 1. Intrafragment MW normalization analysis of 32 lanes containing E. faecalis OG1RF
|
and CVMW values were close to each other and, when compared, were not found to be significantly different (P > 0.1 for fragments 1 to 6 and fragments 10 to 11, P = 0.07 for fragments 7 to 8, and P = 0.08 for fragment 9). |
View this table: [in a new window] |
TABLE 2. Intrafragment Rf normalization analysis of 32 lanes containing E. faecalis OG1RF
|
![]() View larger version (43K): [in a new window] |
FIG. 2. Comparison of fragment-to-fragment matching at two vector box sizes of 0.500 and 0.925%. At a VBS of 0.500%, fragment 5 of lane X will match fragment 5 of lane Y, but fragment 7 of lane X does not match fragment 6 of lane Y. At a VBS of 0.925%, fragment 7 of lane X does match fragment 6 of lane Y due to fragment overlap caused by increasing VBS.
|
For the E. faecium PFGE library, 5 of the 11 VBSs, representing the range of matching tolerance settings evaluated, are shown in Table 3. For each VBS, total isolates and non-VSG isolates within the dendrogram group decreased until the VBS of 0.925% and then increased with the VBS of 1.25%. Sensitivity was 100% by definition across all VBSs (all VSG isolates were included in the dendrogram group). The specificity of the dendrogram group increased until the 0.925% VBS and then decreased at 1.25%. Of all VBSs tested, the dendrogram group at 0.925% yielded the highest match similarity (74%), the highest specificity, while maintaining 100% sensitivity, and the lowest number of dissimilar isolates. Therefore, the optimal VBS for the E. faecium PFGE library was 0.925%. Figure 3 shows the dendrogram at a VBS of 0.925%.
|
View this table: [in a new window] |
TABLE 3. Range of VBSs tested to determine optimum matching tolerance for E. faecium isolates
|
![]() View larger version (28K): [in a new window] |
FIG. 3. UPGMA dendrogram generated from Dice coefficients that contains E. faecium PFGE library isolates compared at a VBS of 0.925%. Isolates marked with an asterisk indicate those in the VSG. The single isolate indicated by a "" symbol (designated 1435 in the middle of the dendrogram group) is the reference (index) isolate. Of 31 isolates in the dendrogram group (marked with a bracket), 26 were visually similar, and 5 were dissimilar. The match similarity for the dendrogram group was 74%.
|
0.300%. |
View this table: [in a new window] |
TABLE 4. Range of VBSs tested to determine optimum matching tolerance for S. aureus isolates
|
![]() View larger version (18K): [in a new window] |
FIG. 4. UPGMA dendrogram generated from Dice coefficients that contains S. aureus PFGE library isolates compared at a VBS of 0.500%. Isolates marked with an asterisk indicate those in the visually similar group (VSG). The single isolate indicated by a "" symbol (designated lane 19 toward the top of the dendrogram group) is the reference (index) isolate. Of 35 isolates in the dendrogram group (marked with a bracket), 33 were visually similar, and 2 were dissimilar. The match similarity for the dendrogram group was 79%.
|
For MW, the optimal vector percentage of the E. faecium library was 18% (data not shown). Upon comparison of the optimal VBS (0.925%) and the vector percentage (18%), the dendrogram group normalized by Rf (VBS) as opposed to MW (vector percentage) contained fewer total (31 versus 43) and dissimilar (5 versus 17) isolates while it maintained a higher match similarity (74% versus 58%) and specificity (86% versus 51%).
|
|
|---|
No pattern of gel plug lot-to-lot variation was detected in gels 1 and 2; thus, plug lot did not contribute to the intragel variation found within either gel. However, for intergel variation, lot 2 lanes yielded a much lower minimum VBS than did lot 1 lanes. This phenomenon could be related to the fact that lot 1 lanes contained more lane distortions. Baseline VBSs required to match gels 1 and 2 to their duplicate image at the highest match similarity were higher than some of the VBSs produced in lot-specific, intergel analyses. The inability of software-based algorithms to match identical isolates at 100% is an indication that algorithms of mathematic rigidity cannot yet replace visual interpretation of PFGE fragment patterns. Although software programs aid in the analysis of PFGE patterns across multiple gels, user intervention is still required during the steps of computerized analysis to produce accurate results (7, 16). Typing techniques, such as amplified fragment length polymorphism, that use automated sequence equipment for gel reading and analysis potentially have less variability in band finding and greater run-to-run reproducibility than PFGE.
The generation of mean Rf values provided a single set of "gold standard" values that could be entered manually into all E. faecalis OG1RF standard Rf values to normalize fragment position and minimize intra- and intergel variation within a PFGE library. This was how the E. faecium and S. aureus PFGE libraries were analyzed in the present study. Mean Rf values generated from a PFGE library capture dynamic variations in fragment position, whereas predetermined MW standards (with known fragment MWs) are not representative of the unique fragment pattern variations that reside within a particular PFGE library. However, generating the mean Rf values required the additional effort of running the same strain 38 times on two gels to determine minimum matching tolerance settings. An alternate method of accomplishing the same task would be to place an additional standard isolate (e.g., E. faecalis OG1RF in the present study) in a nonstandard lane on each gel of a PFGE library (11).
CV was a better indicator of variability between the same fragment on different lanes for MW and Rf because it measured the variability relative to the mean (for each fragment, the mean was higher for MW than for Rf due to the units of measure involved). Comparisons of CVMW and
for each fragment indicated that the same variability existed no matter what normalization method was used. However, P values for fragments 7 to 9 were smaller and closer to the significance level (0.05) than P values for the other fragments. Interestingly, fragments 7 to 9 were closer to the actual MW log curve used to estimate the MWs of all of the other fragments on the gel. This suggests that the statistical variation between the estimated MWs and the actual MWs decreases as the estimated MWs approach the real MW values used to generate the log curve.
Even though the between-lane variability was basically the same for both Rf and MW normalized values, MW normalization showed less specificity than did Rf normalization for the E. faecium PFGE library analyzed in the present study. Thus, software users should be aware of the differences between Rf and MW normalization before performing matching operations on any given PFGE library. In our study at optimized matching tolerance settings, the results favored the use of Rf over MW normalization methods for our E. faecium PFGE library analyzed with Phoretix software. Comparison of MW and Rf was possible by using one software package in the present study because Phoretix software allows the user a choice of normalization method (MW or Rf), although different algorithms are used to calculate matching tolerances for each normalization method. Other software packages typically use either the MW or the Rf normalization method. Phoretix is similar to other software packages in most other respects, offering similar choices of regression method, similarity coefficient (e.g., Dice, Jaccard, and Pearson), and dendrogram calculation method (e.g., UPGMA and neighbor joining).
VBS matching tolerance algorithms remain constant for all fragments at any given VBS. This can result in unnecessary fragment overlap when the VBS is not optimized prior to analyzing a PFGE library. In the current study, suboptimal matching tolerance settings decreased the specificity with which visually similar isolates were grouped together. The manufacturer's recommended setting of 0.500% decreased the specificity of the organism clustering for the E. faecium PFGE library but yielded acceptable results for the S. aureus PFGE library. The degree of alteration of the matching tolerance from its recommended or default setting is dependent on the magnitude of intra- and intergel variation due to organism-specific fragment or lane distortions. Thus, optimizing the VBS setting should be performed before subsequent matching analysis takes place. The E. faecium PFGE library contained a high degree of fragment pattern variation and made computer-assisted analysis more difficult compared to S. aureus. Heterogeneity of restriction fragment patterns in E. faecium has been discussed by other researchers (10, 12) and may have played a role in the apparent lower match similarity of the dendrogram group in the E. faecium PFGE library compared to the S. aureus library. However, the influence of other factors on the match similarities, such as the variety of patterns in each library and the choice of reference pattern for the VSG, cannot be determined in the present study.
Our data showed that, when optimized, a gel analysis software package could compensate for the various sources of intra- and intergel variation in PFGE libraries. However, in agreement with the findings of Cardinali et al. (3), there is a clear need for further development of software packages and algorithms used for gel analysis. Improvements, combined with optimization of the software algorithms prior to use, will further facilitate fragment pattern comparisons on multiple gels.
Phase IV of Project ICARE is supported in part by unrestricted grants to the Rollins School of Public Health of Emory University by Astra-Zeneca Pharmaceuticals, Wilmington, Del.; Bayer Corporation, Pharmaceuticals Division, West Haven, Conn.; Cubist Pharmaceuticals, Inc., Lexington, Mass.; Elan Pharmaceuticals, San Diego, Calif.; Pharmacia Corporation, Peapeck, N.J.; and Roche Laboratories, Nutley, N.J.
The use of trade names is for identification purposes only and does not constitute endorsement by the Public Health Service or the U.S. Department of Health and Human Services.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»