Previous Article | Next Article ![]()
Journal of Clinical Microbiology, June 2004, p. 2388-2397, Vol. 42, No. 6
0095-1137/04/$08.00+0 DOI: 10.1128/JCM.42.6.2388-2397.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Division of Infectious Diseases, Massachusetts General Hospital, Boston, Massachusetts 02114,1 Department of Medicine,2 Department of Microbiology and Molecular Genetics, Harvard Medical School,6 Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts 02115,3 Department of Animal Health and Biomedical Sciences, University of WisconsinMadison, Madison, Wisconsin 53706,4 Foodborne and Diarrheal Diseases Branch, Centers for Disease Control, Atlanta, Georgia 303335
Received 6 October 2003/ Returned for modification 10 December 2003/ Accepted 8 March 2004
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Outbreaks of food-borne pathogens such as O157 occur nearly simultaneously in several different states or even different countries, with varied disease manifestations. In addition, this organism has been classified as a category B bioterrorism agent by the Centers for Disease Control and Prevention (Atlanta, Ga.). Rapid typing of O157 strains would play a key role in limiting widespread outbreaks by allowing identification and withdrawal of the common source of infection. Pulsed-field gel electrophoresis (PFGE) is used by diagnostic and public health laboratories to type O157 strains. At the time PFGE was designated the "gold standard" for bacterial strain typing, it was the only technique that could provide distinctive profiles for bacterial strains in a reasonably reproducible manner (29, 38). The other methodologies available at the time included phenotypic and genotypic methods, such as biotyping, antimicrobial susceptibility patterns, bacteriophage typing, bacteriocin typing, multilocus enzyme electrophoresis patterns, whole-cell fatty acid profiles, plasmid analysis, restriction mapping, restriction fragment length polymorphism analysis, and PCR amplification patterns of repetitive, insertion, and other consensus sequences (1, 14, 24).
However, over time, certain drawbacks of PFGE have become more evident, requiring researchers to implement several modifications of the technique in order to increase sample turnaround, overcome inhibition of restriction enzymes or degradation of DNA, and minimize changes in electrophoretic patterns of DNA between gels to avoid reliance on sophisticated pattern recognition computer software to interpret the patterns generated (3, 12, 15, 23, 29). The ability of PFGE to accurately assess the genetic relatedness of O157 isolates in outbreaks was recently assessed (6). It was observed that single-restriction-enzyme PFGE gave a poor measure of genetic relatedness, as it did not resolve all the DNA fragments generated following restriction digestion and could not resolve comigrating bands (6). Assessing genetic relatedness is crucial to any epidemiological survey, and the authors suggest the use of multiple enzymes to obtain an accurate result using PFGE (6).
In previous studies, researchers tried to decipher the genetic events underlying strain variation in O157 as seen by PFGE following digestion of genomic DNA by the restriction enzyme XbaI (18, 19). XbaI is the enzyme of choice in PFGE analysis of O157 isolates, as the enzyme is an effective cutter and gives a more decipherable profile than other infrequently cutting restriction enzymes (6, 29, 38). Our analysis showed that insertions and deletions (indels) of sequences containing XbaI sites were responsible for the majority of differences between O157 strains, and based on these indels, we proposed a novel, PCR-based strain-typing tool called polymorphic amplified typing sequences (PATS, or XbaI-based PATS) (18, 19) (U.S. utility patent pending; filed 1 November 2000; PCT International patent pending; filed 1 November 2001.). XbaI-based PATS was able to type every O157 isolate tested in a reproducible manner. In contrast to PFGE, XbaI-based PATS yielded straightforward, easy-to-interpret results that correctly matched related O157 isolates (19). However, XbaI-based PATS was less discriminatory than PFGE and was unable to differentiate fully between unrelated isolates. This may have been due to our inability to utilize two repeat regions in the O157 genome in the design of the XbaI-based PATS primer pairs, as well as the occurrence of insertions and deletions in regions between XbaI sites that would have been overlooked by this typing technique.
Hence, we decided to evaluate another infrequently occurring restriction enzyme site in the O157 genome to increase the discriminatory power of PATS. The restriction enzyme AvrII (BlnI) fulfilled the criteria we had set by cutting the O157 genome infrequently, having sites within the repeat regions not targeted by XbaI-based PATS, and having only one site overlapping a region previously amplified by XbaI-based PATS. In this study, we analyzed the genetic events underlying polymorphisms at or around AvrII restriction sites in the O157 genome and used the information to enhance the discriminatory power of PATS.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Design of primer pairs amplifying O157 AvrII sites. The sequenced EDL 933 genome (GenBank accession number AE005174) (33) was used as the prototype O157 to determine the total number of AvrII restriction sites and the DNA sequences of the regions flanking these sites. The sequences were used to design primer pairs that would yield distinct amplicons containing a single AvrII site from O157 strain EDL 933. These AvrII-based PATS primers were each assigned an IKNR prefix. We also used previously described primer pairs that amplified specific virulence genes, stx1, stx2, eae, and hlyA, and the previously identified eight polymorphic XbaI-containing sequences in the O157 genome when required (19, 30). PCR with these primer pairs was set up using previously described conditions (18, 19) that combined the hot-start and touchdown techniques (7, 8).
Primer pairs evaluating the presence of O islands 43 and 48. O islands 43 and 48 are identical O islands encoding tellurite resistance, integrase, urease, phage proteins, and an adhesin (33). They are also referred to as the "tellurite resistance- and adherence-conferring islands" (P. I. Tarr, S. S. Bilge, J. A. Vary, N. M. Tang, M. R. Baylor, K. Potter, T. E. Besser, and S. L. Moseley, Abstr. 31st US-Japan Cholera Rel. Diarrheal Dis. Conf., 1995, p. 119-124) and are 0.5 Mb apart on the EDL 933 genome, with O island 43 inserted close to the serX tRNA gene and O island 48 inserted close to the serW tRNA gene (33). The following primer pairs used to amplify the serW and serX tRNA genes and the 3' end of tellurite resistance- and adherence-conferring islands in a previous study (37) were included in this study: SerW (forward), 5'TCGGGGAAGGTAAGGAT3', and SerW (reverse), 5'TTGTGATATGTATGAAGT3'; SerX (forward), 5'TTTTTCTATTGTCGATTCCTCT3', and SerX (reverse), 5'GATCTACAAAGGCCACCAGCA3'; and Ter-island (forward), 5'GACAAACTCTCCGGGATAACTCA3', and Ter-island (reverse), 5'TGCGGGTGCTGGTGTGGGATAA3' (37). These primer pairs were used to determine if both or only one of the two identical O islands was present in the O157 isolates tested, as four of the AvrII-based PATS primer pairs (IKNR6A and -B, IKNR7A and -B, IKNR9A and -B, and IKNR10A and -B) mapped to these two regions. PCR was done under conditions described previously (18, 19).
Evaluation of amplicons. PCR were initially screened for the presence or absence of amplicons. Amplicons, when present, were purified using the QIAquick PCR purification kit (Qiagen, Valencia, Calif.) and digested with the AvrII restriction enzyme (New England Biolabs, Beverly, Mass.) to confirm the presence of an AvrII restriction site within the amplicon. Undigested and digested DNA fragments were resolved on a 4% agarose gel prepared with a combination of 3% Nusieve GTG agarose (FMC BioProducts, Rockland, Maine) and 1% agarose (Shelton Scientific Inc., Shelton, Conn.) and stained with ethidium bromide.
The absence of amplicons was further investigated by PCR, Southern blotting, and sequencing (see below). Additional primer pairs were designed based on sequences upstream and downstream of AvrII sites to investigate polymorphisms resulting from indels. In addition, amplicons that showed variant digestion patterns with the AvrII restriction enzyme were evaluated by sequencing for the presence of single-nucleotide polymorphisms (SNPs) at the restriction site.
DNA extraction, sequencing, and probe labeling. Genomic DNA was prepared using the Easy-DNA Isolation kit (Invitrogen Corp., Carlsbad, Calif.) according to the manufacturer's instructions. DNA sequencing was done at the DNA Sequencing Core Facility, Department of Molecular Biology, Massachusetts General Hospital. All DNA probes were labeled using the ECL direct nucleic acid labeling and detection system (Amersham Pharmacia Biotech, Inc., Piscataway, N.J.).
Southern blotting. Indels were evaluated using Southern blot hybridization. DNA was fractionated by agarose gel electrophoresis, transferred to Hybond-N+ membranes (Amersham Pharmacia Biotech, Inc.), UV cross-linked to the membrane using a Stratalinker (Stratagene, La Jolla, Calif.), and hybridized with the appropriate probes derived from O157 strain EDL 933 and labeled using the ECL direct nucleic acid labeling and detection system. Hybridization at 42°C and posthybridization washing of blots was done according to the ECL kit manual. Autoradiographs were prepared by exposure of processed blots to Scientific Imaging X-OMAT AR film (Eastman Kodak Company, Rochester, N.Y.).
Data analysis. Dendrograms were constructed by coding molecular data as follows. The presence or absence of each of the 19 amplicons (representing eight polymorphic XbaI sites, seven polymorphic AvrII sites, and four virulence genes) was coded as a dichotomous variable. Characters representing gain or loss of a restriction site recognized by AvrII were weighted to reflect the increased probability of losing a site over gaining one; however, such weighting had no impact on the resulting dendrogram. Trees were constructed using the unweighted-pair-group method with arithmetic means option in Phylogenetic Analysis Using Parsimony software (Sinauer Associates, Inc., Sunderland, Mass.).
| RESULTS |
|---|
|
|
|---|
300- to 600-bp segments of DNA containing each AvrII restriction site were designed flanking each of these sites. The presence or absence of an amplicon with each primer pair, as well as the presence or absence of an AvrII restriction site within each amplicon, was tested by PCR, AvrII digestion, and agarose gel electrophoresis. Thirty-three primer pairs were analyzed with the control O157 strains, EDL 933 and 86-24. With O157 strain EDL 933, 32 of the 33 primer pairs yielded the expected single amplicons containing the AvrII restriction sites. One of the primer pairs, IKNR29A and -B, consistently yielded two amplicons from both the control strains and other test O157 isolates. Sequence analysis showed that this primer pair was amplifying overlapping sequences from two repeat regions along the O157 genome. The repeat region from which the IKNR29 primers were designed was already targeted by a previously designed XbaI-based PATS primer pair, IK38A and -B (19). In addition, the second repeat region was being amplified by another AvrII-based primer pair, IKNR24A and -B, that yielded a single amplicon from that site. Hence, we decided that exclusion of IKNR29 would not compromise the typing system, and this primer pair was excluded from the final pool of 32 AvrII-based PATS primer pairs.
With O157 strain 86-24, two of the 32 AvrII-based PATS primer pairs either yielded an amplicon with an SNP (IKNR27) or failed to yield an amplicon (IKNR33) (Table 1). These polymorphic results demonstrated the ability of these primer pairs to discriminate between the two O157 control strains.
|
Of the 32 AvrII-based PATS primer pairs, 7 produced polymorphic results and 25 produced identical results across the 44 O157 isolates tested (Table 1). The polymorphic results produced by the seven primer pairs (Table 1) ranged from the absence of an amplicon (score, 0) to the presence of an amplicon with no (score, 1) or two (score, 3) AvrII sites. Based on the scores assigned to each amplicon obtained from every isolate-primer pair combination tested, the 44 O157 isolates were separated into 11 AvrII-based PATS types (Table 1). These results were successfully reproduced in three separate analyses of the 44 O157 isolates. In addition, the absence of an amplicon was confirmed by Southern blot analysis (data not shown). The absence of the restriction site or presence of an additional restriction site was confirmed by sequencing.
DNA sequences amplified by all 32 AvrII-based PATS primer pairs were analyzed for their location in the O157 strain EDL 933 genome using the GenBank database (BLAST search program; National Center for Biotechnology Information). Of the 32 O157 AvrII-containing genome sequences, 22 occurred in sequences having homology to E. coli strain K-12 genome sequences (backbone), and 10 occurred in sequences not shared with K-12 (O islands). Among the conserved 25 AvrII-containing sequences, only 5 were in O islands. On the other hand, five of the seven polymorphic AvrII-containing sequences were in O islands, supporting a previous observation that major genomic differences between O157 strains occur in O-island sequences (Table 2) (18, 19).
|
|
|
Primer pairs IKNR12A and -B, IKNR12C and -D, and IKNR12E and -F yielded amplicons of expected sizes only from EDL 933. This indicated possible deletion of sequences within, at the start of, and at the end of O island 50 in isolates A, H, and VV. Primer pairs IKNR12G and -H, and IKNR12I and -J yielded amplicons from all isolates and EDL 933, indicating the presence of DNA sequences upstream and downstream of O island 50. Primer pair IKNR12G and -J yielded amplicons only from isolates A, H, and VV, confirming a deletion of the entire O island 50 in the genomes of these isolates; this amplicon lacked an AvrII site. No amplification was obtained from EDL 933 with this primer pair, as the PCR conditions were not set to compensate for the
50-kb distance between the binding sites for these primers in the absence of any deletions. Sequencing and Southern blot analysis further confirmed our PCR results and observations. A 47,696-bp deletion had occurred in the genomes of O157 isolates A, H, and VV that included the entire O island 50, part of the intergenic segment between potC and O island 50 (14 bp), the entire intergenic segment between O island 50 and potB (348 bp), and part of potB itself (24 bp) (Fig. 1C). Interestingly, O island 50 has a putative insertion sequence element and a putative integrase at both ends, and an overlap in sequence was observed at the site of cleavage. These may have contributed to the excision of O island 50 in these variant isolates.
IKNR33. Primer pairs IKNR33A and -B amplified an amplicon, IKNR33, from the control O157 strain EDL 933 and most of the tested O157 isolates except control strain 86-24 and isolates H, HH, Q, QQ, T, and TT (Table 1). This amplicon was mapped to O island 172 in O157 strain EDL933, encoding several unknown, integrase, and helicase proteins (Table 2) (GenBank accession number AE005174) (33). O island 172 is 44,433 bp in size and is flanked by the leuX (tRNA-leu) and yjhs (unknown) genes, as shown in Fig. 1D. We designed additional primer pairs, IKNR33 C and -D, IKNR33E and -F, IKNR33G and -H, IKNR33I and -J, and IKNR33K and -L (Fig. 1E), and tested the six variant isolates.
Primer pairs IKNR33A and -B, IKNR33C and -D, and IKNR33E and -F yielded amplicons of expected sizes only from EDL 933. This indicated possible deletion of sequences within and at the start of O island 172 in isolates H, HH, Q, QQ, T, and TT. Primer pairs IKNR33G and -H and IKNR33I and -J yielded amplicons from all isolates and EDL 933, indicating the conservation of DNA sequences upstream and downstream of O island 172. Primer pair IKNR33K and -L yielded amplicons only from isolates H, HH, Q, QQ, T, and TT, confirming a deletion of part of O island 172 in the genomes of these isolates; this amplicon lacked an AvrII site. No amplification was obtained from EDL 933 with this primer pair, as the PCR conditions were not set to compensate for the >10 kb distance between the binding sites for these primers in the absence of any deletions. Sequencing and Southern blot analysis confirmed that a 10,235-bp deletion had occurred in O island 172 of O157 isolates H, HH, Q, QQ, T, and TT (Fig. 1F). The deletion had occurred between two putative integrase-encoding regions, one occurring at the start of and the other within O island 172.
(ii) Polymorphism caused by SNPs. Polymorphisms in amplicons IKNR3, IKNR7, IKNR10, IKNR16, and IKNR27 were due to SNPs occurring either at the AvrII restriction site or upstream of the site, resulting in the loss or gain of an AvrII site (Table 1 and Fig. 2), respectively. Of these five amplicons with SNPs, three were mapped to O islands and two were mapped to backbone sequences (Table 2).
Primer pairs IKNR3A and -B, IKNR7A and -B, IKNR10A and -B, and IKNR16A and -B yielded amplicons from all control and test O157 isolates. However, amplicons from some of the isolates failed to digest with the restriction enzyme AvrII. Sequence analysis indicated the presence of an SNP in the recognition sequence for this enzyme, namely, CCTAGG. The SNPs are listed in Fig. 2 as they were seen on the coding strand and were primarily transitions (adenine to guanine), with only one transversion (adenine to cytosine) seen in the IKNR3 amplicon. Similarly, primer pair IKNR27A and -B yielded amplicons from all control and test O157 isolates. However, amplicons from some of the isolates yielded three fragments upon digestion with the restriction enzyme AvrII, clearly indicating the presence of an additional site for the enzyme. Sequence analysis confirmed this observation. A transition (cytosine to thymidine on the coding strand) had occurred in the sequence upstream of the original AvrII site, creating an additional AvrII site in that region (Fig. 2).
Evaluation of O157 isolates for the presence of O islands 43 and 48. Amplicons IKNR7 and IKNR10 mapped to the two well-characterized, identical O islands 43 and 48, which encode tellurite resistance, integrase, urease, adhesin, and other phage proteins (33, 37) (Table 2). Studies conducted by Taylor et al. have shown genomic variability in these O islands between O157 strains. Of the 16 O157 strains studied by these authors, 6 had both O islands, 6 had only one of the O islands, and 4 had neither (37). In our study, all of the O157 isolates analyzed yielded the IKNR7 and IKNR10 amplicons, clearly indicating that one or both of the O islands were always present (Table 1).
We evaluated the six O157 isolates that had SNPs in IKNR7 and IKNR10 for the presence of O islands 43 and 48. Our PCR analysis, using O157 strain EDL 933 (O island 43+ O island 48+) as a positive control, showed that all six O157 isolates contained only one O island (O island 43) inserted at serX (data not shown). Specifically, primer pairs SerW forward and reverse and Ter forward and reverse yielded amplicons of the expected sizes from all these isolates, while SerX forward and reverse failed to yield an amplicon (data not shown).
We also screened the 21 O157 isolates that grouped in the same PATS type following AvrII-based PATS analysis (Table 1) for the presence of O islands 43 and 48 to see if this would enable further differentiation of these isolates. However, the PCR analysis yielded results identical to those described above, indicating the presence of only O island 43 in these isolates as well. This eliminated the possibility of differences between the O157 isolates based on this factor.
Comparison of polymorphisms at AvrII and XbaI restriction sites. The seven AvrII-containing regions that were polymorphic between O157 isolates occurred in O islands or backbone sequences and were caused by indels and SNPs. In contrast, as shown previously, the eight XbaI-containing regions that were polymorphic between O157 isolates were all located in O islands, and all were caused by indels (18).
One of the AvrII-containing polymorphic regions (IKNR33) and an XbaI-containing polymorphic region (IK114) could be traced to the same O island, namely, O island 172. Polymorphisms at both IKNR33 and IK114 were due to indels, suggesting that perhaps O island 172 has regions of increased recombinational activity. No overlap was observed between the remaining polymorphic regions. We also observed that some O islands contained both conserved and polymorphic regions around one or both of these restriction sites. These included O islands 8 (IK111-conserved-XbaI; IKNR3-polymorphic-AvrII), 43 (IKNR6-conserved-AvrII; IKNR7-polymorphic-AvrII), 48 (IKNR9-conserved-AvrII; IKNR10-polymorphic-AvrII), 108 (IK118-conserved-XbaI; IKB15-polymorphic-XbaI), and 172 (IK39-conserved-XbaI; IK114-polymorphic-XbaI; IKNR33-polymorphic-AvrII) (18, 19) (Tables 1 and 2).
Combined screening for polymorphic XbaI-containing and AvrII-containing regions, along with virulence genes. In order to enhance the discriminatory ability of XbaI-based PATS, we had specifically chosen sites for the restriction enzyme AvrII, as the in silico analysis showed that some of these sites were in close proximity to some of the XbaI sites and others were fairly evenly distributed along the entire O157 genome between the XbaI sites (Genome Restriction Mapping Program; The Institute for Genomic Research Database). This would help us address two issues: (i) to be able to use the repeat regions that were excluded from XbaI-based PATS due to our inability to obtain distinct amplicons for those regions and (ii) to target polymorphisms that occurred between XbaI sites (19).
Four repeat regions, two direct repeats and two inverse repeats, were identified in the O157 genome, each of which contained a single XbaI and a single AvrII restriction enzyme site. Although the regions had >90% homology to each other, they contained a few nonoverlapping sequences. Using the nonoverlapping sequences, and the positions of the restriction enzyme sites within these repeat regions, we designed primer pairs that would yield a single amplicon from each of the repeat regions. Two of the repeat regions were successfully amplified by the XbaI-based PATS primer pairs IK38A and -B and IKB14A and -B (19), while three of the repeat regions were amplified by the AvrII-based PATS primer pairs IKNR18A and -B, IKNR24A and -B, and IKNR31A and -B (Table 1). The amplicons IKNR24 and IKNR31 amplified repeat regions that had not been targeted by XbaI-based PATS, fulfilling our goal of being able to use these regions in typing.
The distribution of the AvrII restriction enzyme sites in DNA segments between the XbaI restriction sites also helped to identify additional polymorphic regions along the O157 genome. Furthermore, the same set of 44 O157 isolates analyzed by XbaI-based PATS and AvrII-based PATS was analyzed by a combination of only those primer pairs amplifying polymorphic regions containing XbaI or AvrII restriction sites, along with the virulence gene primer pairs. The O157 isolates were differentiated into 19 groups, demonstrating that the combination of select primer pairs (combined PATS) discriminated better than the XbaI-based or AvrII-based PATS primer pairs alone (19) (Tables 1 and 3 and Fig. 3). The large clusters of isolates were split into smaller groups so that the overall distribution of the isolates was similar to that from PFGE (Fig. 3). The final PATS profiles identified the paired O157 isolates from 22 outbreaks as identical for 14 outbreaks, similar (1 amplicon difference; isolates A-AA, G-GG, H-HH, M-MM, N-NN, and P-PP) for 6 outbreaks, and different (>1 amplicon difference; isolates B-BB and V-VV) for 2 outbreaks (Table 3 and Fig. 3). The isolates recognized as different also had substantially different PFGE profiles.
|
|
| DISCUSSION |
|---|
|
|
|---|
PATS typed every O157 isolate tested in a reproducible manner, providing simple, accurate, and easy-to-interpret results. The consistent improvement achieved in this typing system by analyzing polymorphisms at different infrequently cutting restriction enzyme sites is reflected in the number of groups into which the 44 O157 isolates tested were distributed at each step. AvrII-based PATS was least discriminatory, dividing the O157 isolates into 11 groups. XbaI-based PATS divided the isolates into 14 groups, showing better discrimination, while combined PATS organized the isolates into 19 groups, reflecting increased discrimination (19) (Tables 1 and 3). Combined PATS was able to match the paired O157 isolates from 20 of the 22 outbreaks as identical (14 outbreaks) or closely related (6 outbreaks) while identifying the paired isolates from 2 outbreaks as different (Table 3 and Fig. 3). These O157 isolates (B-BB and V-VV), which were different by combined PATS, also had substantially different PFGE profiles (19). Likewise, most of the isolates from different outbreaks that shared the same combined PATS group also shared the same clade by PFGE (Table 3 and Fig. 3). In comparison, PFGE was able to match the paired isolates from only 13 of the 22 outbreaks as identical (6 outbreaks) or closely related (7 outbreaks); isolates from 6 outbreaks were grouped as more distantly related, while isolates from 3 outbreaks could not be typed (19) (Fig. 3). Current PFGE interpretive criteria may have overdiscriminated these isolates, as even those exhibiting a single band difference in their profiles were categorized as different. Spurious incompletely digested and/or comigrating DNA bands could have easily contributed to the incorrect assessment of isolates as being closely or more distantly related (6). In addition, we cannot rule out possible misclassification of O157 isolates to the outbreaks, as these organisms were collected at a time when subtyping was not available (19). On the other hand, PATS provided comparable, and in some instances better, discriminatory and relating abilities than PFGE. PATS, with its straightforward and less subjective results, related better to the epidemiological data than PFGE for this set of O157 isolates.
The difference between the distributions and natures of polymorphisms in XbaI and AvrII restriction sites may have contributed to the significant differences we saw in the PATS results based on these individual sites (18) (Table 1 and Fig. 3). Despite an AT-rich recognition site, only 40 XbaI restriction sites occur in the O157 strain EDL 933 genome with a 50% G+C content. Of these, 18 XbaI sites occur in backbone sequences and 22 XbaI sites occur in O-island sequences (18). All of the 18 backbone sequences and 14 of the O-island sequences containing an XbaI site are conserved among the O157 isolates tested. Polymorphic sequences containing an XbaI site are restricted to eight O islands and are all caused by indels (18). In contrast, AvrII, with a G+C-rich recognition site, has 33 sites in the O157 strain EDL 933 genome. Of these, 23 AvrII sites occur in backbone sequences and 10 occur in O-island sequences. Twenty-one backbone and five O-island sequences with AvrII sites are conserved among the O157 isolates tested, while the remaining two backbone and five O-island sequences with AvrII sites are polymorphic. These polymorphisms have been caused by both indels and SNPs. XbaI, with more sites on the O157 genome that are primarily distributed in O islands and exhibiting polymorphism caused by indels only in O islands, is probably best suited to differentiate O157 strains. Since most of the O islands are associated with phage or insertion elements capable of undergoing recombination, these may play a major role in causing polymorphisms between strains (5, 27, 36). AvrII, with fewer sites, most of which occur in backbone sequences, exhibited polymorphisms caused by SNPs in both backbone and O-island sequences that may accumulate over time, and this in turn may limit its ability to discriminate between strains (20, 26, 34).
Overall, analysis of restriction site polymorphisms in O157 indicates that twice as many indels as SNPs (10 indels and 5 SNPs) influence the process of diversification in the organism. Indels play a significant role in the evolutionary change and adaptation of bacteria, especially enterobacteria (5, 20, 26, 32, 34). Even the sequenced O157 genome contains 1.34 Mb of DNA that is not present in the K-12 genome and lacks 530 kb of DNA that is present in the K-12 genome (33). Similar indel-directed differences between the two sequenced O157 strains, EDL 933 and Sakai, can be observed (5, 13, 33). The probable existence of O157 in hypermutable states, due to defects in its methyl-directed mismatch repair system, may contribute to the large number of horizontal transfer events occurring in the genome, leading to genetic variation in the serotype and its strains (21, 22, 28). A recent study comparing closely related species and strains from six very distant systematic groups, including apes, sea urchins, bacteria, insects, nematodes, and plants, has shown that indels are responsible for many more unmatched nucleotides between closely related DNAs than are base substitutions (5).
Although most pathoadaptive mutations in bacteria that provide a selective advantage for the organism are due to indels occurring in structural and regulatory genes, SNPs, either singly or in numbers, can also influence this process (40). For instance, SNPs in the E. coli fimbrial gene can greatly influence the tropism of the organism to a variety of host cell and mucosal surfaces (40). Several SNPs have been identified in the O157 strain EDL 933 genome, especially in the backbone sequences, although the significance of each of these needs to be deciphered (33). These SNPs are primarily transitions (three transitions and one transversion). A bias toward T
C transitions over A
G transitions on the coding strand has been observed, with G
T being the most common transversion (33). SNPs, primarily transitions (four transitions and one transversion), also contributed to the restriction site polymorphisms we studied. While this observation matched the trend for the O157 genome, the nucleotide bases involved in the predominant transitions and transversions were different. Three of the transitions were A
G and one was C
T on the coding strand. The one transversion was A
C on the coding strand. Hence, the SNPs we observed belonged to the underrepresented group of SNPs found along the O157 genome.
PFGE is currently the standard strain-typing technique used by various public health and diagnostic laboratories to determine the relatedness of outbreak or nosocomial O157 and other bacterial isolates (2, 4, 16). Compared to the available strain-typing methodologies, PFGE provides relatively distinctive profiles for strains in several serotypes, making it a popular strain-typing tool. However, challenges in using PFGE include the fact that it is a labor-intensive and time-consuming technique that requires expensive instrumentation and software to interpret the complex electrophoretic patterns generated, and there are several factors inherent to the protocol itself that influence the patterns generated and lead to misclassification of strains (12, 15, 23). Some of the factors influencing the electrophoretic patterns of DNA in PFGE are improper storage and repeated subculturing of the isolates used to prepare the genomic DNA, which can cause spontaneous loss of plasmids or other recombinational events that alter DNA profiles; nuclease-related or electrophoresis-related degradation of DNA; methylation of DNA, causing incomplete restriction digestion of DNA; inactivation of restriction enzymes by reagents used to prepare DNA; improper resolution of smaller or larger DNA fragments, depending on the size limits of gel conditions; comigration of similar-sized DNA fragments; and nonhomologous DNA migrating as same-size bands, resulting in untypeable or incorrect profiles (3, 6, 23). In fact, PFGE fails to resolve all fragments generated after restriction digestion of the DNA on the gel under a single set of conditions, and the number of bands visualized does not match the number of sites for the restriction enzyme used to digest the DNA (3, 4, 6, 12). The choice of electrophoresis conditions, restriction enzymes, and software for the analysis of DNA patterns can alter PFGE profiles for the same strain, making it difficult to interpret and compare data (3, 6, 35). These drawbacks continue despite several measures to improve the technique by adding thiourea or HEPES to the running buffer to reduce degradation of DNA, the simplification of the protocol to make it more rapid, and the interpretation criteria designed by Gautom, Koort et al., and Tenover et al. to compensate for changes in profiles for the same strain under the conditions described above (9, 17, 38).
DNA sequence-based typing methods, such as multilocus sequence typing, are potential alternatives for PFGE, as they offer shorter assay times, less subjectivity in the interpretation of results, comparable and transferable data, and ease of automation (25). However, these techniques still rely on expensive instrumentation and software to interpret data (29). Most importantly, these typing systems are inefficient in global analysis of the genome, showing a bias toward a select set of sequences or genes that may or may not have sufficient diversity among strains of a pathogen. Also, sequencing errors may influence the results (25).
PATS provides a simple, user-friendly, straightforward, easy-to-perform and -interpret alternative to the existing techniques for bacterial strain typing. The currently standardized combined PATS is at least as good as PFGE in matching like and discriminating unlike O157 isolates. This is made possible by the ability of PATS to exploit both indels and SNPs in sequences along the bacterial genome. By doing so, the system addresses the issue of sequence variations at various sites along the bacterial genome without extensive instrumentation and helps to differentiate between homologous and nonhomologous regions. PATS is a PCR-based technique with high typeability and reproducibility, as it is not hindered by DNA methylation. The PCR format also makes it cost-effective, as it utilizes common laboratory instruments and reagents. The assay time, which is already shorter than that of PFGE or multilocus sequence typing, could be further reduced by automation if necessary. There is less subjectivity in the interpretation of results, and the data generated are easily comparable and transferable between laboratories. We are evaluating combined PATS against a new set of independent O157 isolates linked to various outbreaks to further validate our observations and to set up the interpretive criteria for the typing system. We are also working to automate the system using a DNA microarray platform and are in the process of expanding the application of PATS to other organisms.
| ACKNOWLEDGMENTS |
|---|
This study was supported by funding from the Centers for Disease Control and Prevention-Association of Public Health Laboratories to S.B.C. and I.T.K. This work was also supported in part by the Center for Integration of Medicine and Innovative Technology (CIMIT) through U.S. Army Medical Research and Materiel Command Cooperative Agreement DAMD17-02-2-0006 awarded to S.B.C., I.T.K., and M.J.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Antimicrob. Agents Chemother. | Clin. Microbiol. Rev. |
|---|---|
| Clin. Vaccine Immunol. | ALL ASM JOURNALS |
|---|