Previous Article | Next Article ![]()
Journal of Clinical Microbiology, August 2002, p. 2795-2800, Vol. 40, No. 8
0095-1137/02/$04.00+0 DOI: 10.1128/JCM.40.8.2795-2800.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
and Royston Goodacre1*
Institute of Biological Sciences, University of Wales, Aberystwyth, Ceredigion, SY23 3DD,1 Bronglais General Hospital, Aberystwyth, Ceredigion, SY23 1ER, United Kingdom2
Received 4 February 2002/ Returned for modification 24 March 2002/ Accepted 12 May 2002
|
|
|---|
|
|
|---|
Despite such clinical importance, there are considerable variations in the approaches of laboratories and physicians towards UTIs. Routine diagnosis includes a quantitative microbial count of urine. Counts of >105 organisms/ml are regarded as showing significant bacteriuria (25). Diagnosis is followed by characterization of the causal agent and sometimes antibiotic sensitivity tests to determine appropriate courses of treatment, because the empirical choice of an effective treatment is becoming more difficult as urinary pathogens become increasingly resistant to commonly used antibiotics (15).
Further identification of the isolates may not be undertaken except in complex and acute cases (27), because conventional methods are expensive, time-consuming, and labor-intensive (11). However, recurrent infections are common, particularly in children, and these may lead to further complications, such as renal scarring and hypertension (26). Typing of urine isolates from recurrent cases will distinguish between reinfection with a new causative organism and relapses perhaps due to treatment failures (21). Furthermore, the dynamics of infection within the population can be established.
For routine purposes, the ideal method for UTI characterization would be rapid and automatable and would permit the probabilistic identification of an unknown causal agent against a stable database. Recently, there has been great interest in the amplified fragment length polymorphism (AFLP) technique (35) for the genetic fingerprinting of microorganisms (16, 22, 28).
In the fluorescent AFLP (FAFLP) method, genomic DNA is digested with two different restriction enzymes (one is a frequent cutter, e.g., MseI with a 4-bp recognition site, while the other is a more infrequent cutter, e.g., EcoRI recognizing a 6-bp sequence), creating fragments of suitable size to be resolved by polyacrylamide gel electrophoresis. Oligonucleotide linkers (12 to 20 bp long) are then ligated onto the cohesive ends of the resulting restriction fragments, and these serve as primer-binding sites for PCR. The PCR product is fluorescently labeled, and the resultant PCR products are separated and visualized on a polyacrylamide gel by laser excitation. The primers recognize the linker and restriction site and usually contain selective nucleotides to reduce the number of products amplified by PCR and therefore detected.
AFLP has been used to type strains belonging to a wide range of genera, including Bacillus anthracis (19), Campylobacter fetus (36), Corynebacterium diphtheriae (8), E. coli O157 (30), Neisseria meningitidis (12), Salmonella enterica (7), and Staphylococcus aureus (14). In order to develop the AFLP technique for each of these different groups, various restriction enzymes are investigated and primers with selective nucleotides are optimized for each bacterial species, often with recourse to whole-genome sequences for in silico AFLP predictions (2, 13).
By contrast to the reports mentioned above, the present study employed FAFLP using a dual restriction digest and a single primer combination set, optimized to characterize UTI isolates belonging to eight different genera. Identification of isolates from patients with bacteriuria was then possible by matching them against a probabilistic database generated for UTI using Bayes' theorem.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Summary of 69 UTI isolates analyzed by AFLP
|
FAFLP analysis. All AFLP reactions were performed in a Progene thermocycler [Techne (Cambridge) Ltd., Cambridge, United Kingdom]. Restriction of genomic DNA and ligation of adapters were performed using GIBCO (Life Technologies, Paisley, United Kingdom) AFLP core reagent kits. A total of 250 ng was simultaneously digested with EcoRI and MseI restriction enzymes, 2.5 U each, in a final reaction volume of 25 µl. The reaction mixtures were incubated at 37°C for 2 h and then at 70°C for 15 min to heat inactivate the enzymes. EcoRI and MseI adapters were added at final concentrations of 0.2 and 2.2 µM, respectively, and ligated for 2 h at 20°C using 1 U of T4 DNA ligase. The final ligation mixture volume was 50 µl.
PCR amplification of the adapter-ligated fragments was performed with 5' IRD800-labeled MseI plus CT and unlabeled EcoRI plus 0 primers (MWG-Biotech UK Ltd., Milton Keynes, United Kingdom). The PCR mixtures (20 µl) constituted 5 µl of ligated DNA, 40 ng of labeled MseI primer, 4 ng of EcoRI primer, 0.2 mM (each) deoxynucleoside triphosphates, 0.4 U of Taq polymerase (Promega UK, Southampton, United Kingdom), 10 mM Tris-HCl (pH 8.3), 1.5 mM MgCl2, and 50 mM KCl. Twenty-five amplification cycles were performed with the following cycle profile: cycle 1, 60 s at 94°C, 30 s at 65°C, 60 s at 72°C; cycles 2 to 12, 30 s at 94°C, 30 s at an annealing temperature 0.7°C lower than for each previous cycle (starting at 64.3°C), 60 s at 72°C; cycles 13 to 24, 30 s at 94°C, 30 s at 56°C, 60 s at 72°C. This "touchdown" PCR protocol was used to minimize PCR artifacts (2).
Electrophoresis of PCR products. The PCR-amplified products were separated on 5% denaturing polyacrylamide gels (25 cm2 by 0.2 mm thick) on a LICOR 4200 automated sequencer (MWG-Biotech). The gels were prepared with 20 ml of monomer solution and 5 ml of complete buffer from the SequaGel XR premixed gel system [National Diagnostics (UK) Ltd., East Riding, Yorkshire, United Kingdom]. These mixtures were briefly degassed, and 1 ml of 10% ammonium persulfate was added prior to casting.
The amplified products were mixed with 10 µl of sequencing loading dye (Sigma) and heated to 95°C for 3 min. These denatured mixtures were rapidly placed in ice to prevent the DNAs from reverting to their secondary structures. Each sample (0.5 µl) was spotted on a CAM48 membrane comb (National Diagnostics) together with an IRD800-labeled molecular size marker of 50 to 350 bp (MWG-Biotech) after every six samples. Electrophoresis was performed at 1,500 V, 25 W, and 45°C using Tris-borate-EDTA buffer.
Fragment analysis. Fragments were sized with RFLPScan version 3.56 software (Scanalytics, Inc., Billeric, Mass.). Electropherograms of all AFLP profiles were visually inspected for polymorphisms, with the presence (1) or absence (0) of fragments from 60 to 350 bp scored in a binary matrix and stored in Microsoft Excel 2000.
From the literature, it was apparent that the majority of AFLP analyses of the same species studied here used MseI plus C and EcoRI plus one or two selective nucleotides (2, 23, 30, 40). Therefore, we initially used MseI plus C and EcoRI plus 0, but too many bands were produced. Experiments using a second selective nucleotide on the MseI binding site led us to a common primer combination of MseI plus CT and EcoRI plus 0, which produced between 40 and 60 bands that could be accurately sized and scored.
Cluster analysis. Cluster analysis used the simple matching genetic distance estimation and unweighted pair group method using arithmetic averages (UPGMA) (31). All calculations and displays were performed using TREECON for Windows version 1.3 (33), which runs under Microsoft Windows NT on an IBM-compatible personal computer.
Probabilistic identification. The approach adopted here uses Bayes' theorem, which is an established method that has been used widely for the identification of bacteria against an identification matrix (9, 20, 39), particularly when the characters are binary in nature (4), such as those in this study. Note that the theoretical requirement for using Bayes' theorem is that all taxa are exclusive and exhaustive, and while this may not always be the case for AFLP (or any other typing method, viz., API and Vitek) in that two different taxa (organisms) may give rise by chance to identical AFLP patterns, the probability of this happening is very small, and it was not encountered in this study (see below). Therefore, the practical implementation using Bayes' theorem for the construction of a probabilistic database from FAFLP data can be considered valid. The probabilistic database comprised two parts, (i) the construction of the probabilistic identification matrix and (ii) the identification of an unknown isolate against the identification matrix. All calculations were performed using Matlab version 5 (The MathWorks, Inc., Natick, Mass.), which runs under Microsoft Windows NT on an IBM-compatible personal computer.
The first stage was the construction of an identification matrix which consisted of 291 columns (1 for each possible FAFLP fragment from 60 to 350 bp), while there were either (i) 8 rows, 1 for each genus, or (ii) 10 rows, 1 for each species. The responses used in each column were the probabilities of observing an FAFLP fragment of 60 to 350 bp for that taxon. The probabilities were calculated as the arithmetic mean of the presence (1) or absence (0) of each fragment for each taxon and were scaled to range between 0.01 and 0.99. Entries of 0.0 and 1.0 were not employed, since an isolate with an atypical or erroneous result would generate an identification score of zero for a strain similar to it (5). Forty-five of the FAFLP patterns were used to construct these matrices, and the rest (24 FAFLP profiles) were used as an independent test set; these patterns were chosen randomly and are detailed in Table 1.
The second stage was the identification of an unknown isolate against the probability identification matrix. The identification was based on the implementation by Willcox et al. of Bayes' theorem for use with bacteria (38, 39) and has been adapted by Lapage et al. (20) to be Li* = Li/
Li, where Li is the likelihood of an isolate belonging to taxon i. The identification scores for each taxon are normalized so that
Li* = 1; thus, identification of an unknown isolate is achieved when
Li* for one taxon exceeds 0.999 (5, 20).
|
|
|---|
The common bacterial causal agents of UTI are very varied, and most of the common ones, including E. coli, P. mirabilis, K. pneumoniae, K. oxytoca, C. freundii, M. morganii, E. agglomerans, E. cloacae, P. rettgeri, and Enterococcus spp., were encountered and analyzed in this study. For these eight genera, encompassing both gram-positive and gram-negative bacteria, it was found after extensive investigation of different primer combinations that a dual restriction digest with EcoRI and MseI followed by MseI plus CT and EcoRI plus 0 as a common primer combination was an excellent approach for generating information-rich FAFLP patterns from all of these bacterial isolates. Moreover, all FAFLP reactions were run in triplicate over a period of 4 months, and the same FAFLP profiles for each set of replicates were observed; this highlights the excellent reproducibility of FAFLP, a phenomenon observed by other workers, who showed the robustness of AFLP between different laboratories (18).
The 69 bacterial isolates that were examined by FAFLP showed distinctive AFLP profiles at the genus, species, and subspecies levels, and these are shown digitally in Fig. 1 and recorded in Table 1. These FAFLP patterns typically comprised between 40 and 60 fragments in the selected size range of 50 to 350 bp (Fig. 1); these limits were set by the molecular size marker. All FAFLP profiles were sized using RFLPScan software, and a matrix of ones and zeros representing the presence or absence of amplified fragments, respectively, was generated for all of the UTI isolates. The genetic relatedness of the samples was determined by cluster analysis using the UPGMA algorithm, and the resulting dendrogram of the 69 FAFLP profiles revealed eight main clusters (Fig. 1) corresponding to the eight bacterial genera.
![]() View larger version (64K): [in a new window] |
FIG. 1. UPGMA dendrogram derived from FAFLP data showing all 69 isolates analyzed. The scale represents the genetic difference. Also shown are the digitized AFLP patterns with the scale in base pairs.
|
25%, the first branch in this tree clearly discriminates between the gram-positive enterococci and enteric gram-negative isolates. These discriminations strongly demonstrate the potential of AFLP, contrary to popular opinion (32), for describing a wider diversity range within microbial populations where identification rather than subspecies typing is required. Clinical cases of UTIs will often require accurate typing of the causative organism in order to distinguish relapses, detect reinfection with different organisms, and identify outbreaks of cross-infection. This higher level of discrimination was obtained from the combined FAFLP data. The Klebsiella and Enterobacter isolates both show two well-defined subclusters at a genetic-difference threshold of 10% (Fig. 1 and Table 1), corresponding to K. oxytoca and K. pneumoniae and to E. agglomerans and E. cloacae, respectively. Furthermore, when the threshold of 5% genetic difference is used, subspecies discrimination among the UTI isolates is also observed (Fig. 1 and Table 1). Finally, to test whether subspecies discrimination is dependent on very different microorganisms being included in tree production, cluster analysis was performed on the 25 E. coli isolates alone. The resultant dendrogram in Fig. 2 shows the same discrimination observed when all the data were combined (Fig. 1), indicating that having very different FAFLP profiles in the cluster analysis does not skew the discrimination observed. In combination, the above-mentioned results clearly suggest that the single primer combination EcoRI plus 0 and MseI plus CT was sufficient to characterize and type bacteria from these different genera to the species level and further. It is hoped that this will also be true for other genera and species therein, but this will be a subject for future investigation.
![]() View larger version (19K): [in a new window] |
FIG. 2. UPGMA dendrogram derived from FAFLP data showing the relationships among the 25 E. coli isolates.
|
The method that we have used is based on Bayes' theorem for the identification of bacteria against an identification matrix (9, 20, 39). Forty-five of the FAFLP patterns were chosen randomly (Table 1 shows the details) and used to construct (i.e., supervise) two probabilistic identification matrices; the first contained 8 taxa, 1 for each genus, and the second comprised 10 taxa, 1 for each species. These identification matrices comprised 291 columns, 1 for each possible FAFLP fragment from 60 to 350 bp, and each cell was the arithmetic mean for each taxon for each of the possible 291 fragments, scaled between 0.01 and 0.99. The 24 FAFLP profiles that were not used for construction of these identification matrices (Table 1) were employed to test whether this method could be used to identify an unknown hospital UTI isolate. Probabilistic identification was performed as detailed above, and likelihood scores for each taxon were computed. If the likelihood was > 0.999 for one taxon, this was taken as its identity. Each of the 24 unknown isolates was correctly identified at the genus and species levels with a score of 1.0000 (as were the 45 isolates used to construct the databases). It was therefore evident that this approach does indeed work for an independent test set and that it produced results that no longer needed the interpretation of a dendrogram.
In cases of UTI relapse, it is necessary to identify the causal organism to the subspecies level. It is known that AFLP has sufficient resolving power to allow this (6, 8, 10, 12, 17, 27, 32); therefore, the question arises as to whether the use of an FAFLP probabilistic database could identify the infecting organism to the subspecies level. Hence, we constructed a database of only the first FAFLP replicate set from the 25 E. coli isolates and calibrated it to identify the 11 different E. coli FAFLP types (A10a to -k) observed in the dendrogram at 0.05 genetic difference (Fig. 1; detailed in Table 1). The second and third replicate FAFLP patterns were subsequently matched against this, and all 50 isolates were correctly identified with scores of 1.0000. This suggests that the approach has sufficient resolving power to identify UTI relapse to the subspecies level.
In summary, this study clearly demonstrates that FAFLP can be used to identify the causal agents of UTI. A single primer combination was discovered that gave information-rich AFLP profiles from gram-negative and gram-positive bacteria encompassing eight genera. Furthermore, the identification of bacteria against an FAFLP identification matrix using Bayes' theorem was illustrated with unknown UTI isolates that were not used in the construction of the database. In conclusion, we believe that the approach we have developed using an AFLP probabilistic database constructed with FAFLP patterns from the common causal agents of UTI offers great potential as an objective UTI identification procedure in the clinical laboratory.
Present address: Belfast City Hospital, Belfast, BT9 7AB, United Kingdom. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»