Previous Article | Next Article ![]()
Journal of Clinical Microbiology, May 2009, p. 1290-1299, Vol. 47, No. 5
0095-1137/09/$08.00+0 doi:10.1128/JCM.02095-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.

Washington State Department of Health, Public Health Laboratories, Shoreline, Washington,1 Bacterial Epidemiology and Antimicrobial Research Unit, U.S. Department of Agriculture, Athens, Georgia2
Received 31 October 2008/ Returned for modification 20 December 2008/ Accepted 28 February 2009
|
|
|---|
|
|
|---|
Salmonella bongori and S. enterica are the two species that comprise the genus Salmonella. S. enterica is further divided into six subspecies, namely, enterica (I), salamae (II), arizonae 5 (IIIa), diarizonae (IIIb), houtenae (IV), and indica (VI). S. enterica subsp. enterica strains are of the greatest clinical relevance and are typically isolated from humans and warm-blooded animals. Strains belonging to one of the other five S. enterica subspecies and S. bongori are associated with environmental or reptilian sources (10, 23a). Serologic classification of Salmonella strains based upon properties of various surface polysaccharide (O) and flagellar (H) antigens is the reference method for epidemiologic surveillance. This method involves the characterization of over 150 unique O and H antigens to produce an antigenic formula that can be scored using the Kauffman-White scheme to determine a serovar for an isolate (7, 23a). Currently, serotyping classifies over 2,500 serovars of Salmonella, of which over 1,400 belong to S. enterica subsp. enterica (7, 12). Although serotyping using the Kauffman-White scheme remains the standard for serovar determination through its longstanding and widespread use, it is not without significant deficiencies. Aside from being labor-intensive and expensive, serotyping is also time-consuming to perform, often taking three or more days after receipt of a specimen for a highly trained laboratory technician to produce a result. Incomplete or incorrect serologic classification may occur due to atypical expression of an isolate's surface O or H antigen as in the case of mucoid strains in which the O antigen is obscured or for nonmotile and/or monophasic isolates for which only one flagellar phase antigen can be determined. Recent comparative genomic studies of common clinical serotypes have also revealed evidence of a high level of intraserovar variation among isolates of some serovars (16, 29, 38). In other cases, some serovars have been shown to be highly similar genetically, which suggests that a "genovar" classification be adopted, based upon genetic relatedness. It is also proposed that genovar classification may be more appropriate than serovar classification (4, 39).
The deficiencies of conventional serotyping combined with the wealth of genomic information now available for Salmonella have led to the development of alternative molecular strategies to replace or complement conventional serotyping. A number of recent strategies have employed PCR-based approaches to determine different O and H antigens as a means to replace serologic identification of these antigens (14, 20, 25, 32). Others have proposed alternative strategies examining genetic differences as a means of identifying serovars, including ribotyping (17), pulsed-field gel electrophoresis (PFGE) (28), multiplex PCR (3, 4, 29), IS200 analysis (18, 46), random amplification of DNA polymorphisms (45), and DNA microarray analysis (38, 42).
For this study, a rapid, high-throughput multiplex PCR-based method was developed that allows for discrimination of the majority of common serotypes that are reported in the United States based upon their genetic differences. The gene targets are present in some serotypes but not others, as demonstrated by previous comparative genomic studies (19, 35, 38, 39). Primers for all 16 gene targets were designed for use in a single multiplex PCR, thus allowing for a reduction in costs, a decrease in turnaround time per test, and a lower rate of laboratory error. Additionally, the scoring of resulting amplicons was adapted for high-throughput testing by fluorescently labeling PCR products and employing capillary electrophoresis for separation and analysis. Aside from offering high sensitivity and accurate sizing of the PCR amplicons, DNA sequencing equipment is robust and already common to a large number of public health laboratories. The use of this typing method, especially in conjunction with serogrouping or PFGE, allows for serovar determination of Salmonella isolates at a level comparable to that of conventional serotyping, with considerable time and cost savings.
|
|
|---|
Primer design and PCR amplification. The method for selecting genomic targets used in the assay was described previously (30). Briefly, genetic loci were selected based upon their variability among isolates from different clinical S. enterica serovars but relative stability within isolates of the same serovar, according to the results of a previous study comparing genomic contents of different S. enterica serovars by use of a microarray containing the entire genomic complements of serovar Typhimurium LT2 (STM gene names) and serovar Typhi CT18 (STY gene names) (38). Primers were designed to amplify 12 discriminatory regions previously identified by Kim et al. (30). An additional gene target, STM3518, whose presence was also variable among different serovars by comparative microarray analysis, was added to the panel to improve the discriminatory power (38). Two universal Salmonella-specific genomic regions, STM1608 and STM0171, were identified in this study and used as internal DNA amplification controls. These targets were selected based upon their presence in all common Salmonella enterica serovars, as determined by comparative genomics using microarray analysis (38) and by BLAST sequence comparison of all available Salmonella-specific genome sequences (2). Salmonella-specific primers contained limited sequence similarity to non-Salmonella gene sequences by BLAST sequence analysis. Primers for amplification of the genomic region representing the phase 2 flagellin gene (fljB) were added later in the study as a means to discriminate between Salmonella serovar Typhimurium and Salmonella 4[5]12:i:– strains (1, 15).
muPlex primer software (40) was used to design primers amplifying all genomic targets in a single multiplex PCR (Table 1). For each primer pair, either the forward or reverse primer included a universal sequence complementary to a 6-carboxyfluorescein-linked primer used to label all amplicons (Table 1). A 10x primer master mix was prepared to contain 1 µM of each primer and the labeling probe. All PCRs were carried out in a final volume of 25 µl containing 12.5 µl of Immolase DNA polymerase 2x master mix (Bioline, USA Inc., Randolph, MA), 2.5 µl of 10x primer master mix, 3 mM MgCl2, and either 1 µl of template DNA from boiled culture preparations or 2.5 µl of DNA from heated agarose plugs. Thermocycling parameters were 94°C for 10 min; 25 cycles of 94°C for 30 s, 57°C for 90 s, and 72°C for 30 s; 72°C for 5 min; 15 cycles of 94°C for 30 s, 68°C for 90 s, and 72°C for 30 s; and 72°C for 5 min. Control reaction mixtures containing no template or genomic DNA from Salmonella serovar Typhimurium LT2, serovar Typhi CT18, or serovar Enteritidis PT4 were included with each sample run.
|
View this table: [in a new window] |
TABLE 1. Primers and probes used for high-throughput multiplex PCR determination of Salmonella enterica serotypes using capillary electrophoresis
|
SMART codes were determined for the top 50 Salmonella serovars, which represented all serovars contained in both the top 30 clinical and nonhuman Salmonella serovars reported in the United States (Table 2) (12). Four hundred eighty-nine isolates from WAPHL and USDA, previously serotyped by conventional methods, were assayed via the PCR method and assigned SMART codes. For each serovar, a minimum of five isolates were screened, with the exception of Salmonella serovar Cerro (four isolates). All isolates within a serovar that were used for the database were nonclonal by PFGE analysis whenever possible.
|
View this table: [in a new window] |
TABLE 2. Representative PCR amplicon codes for most common U.S. clinical and veterinary serovars of Salmonella enterica subsp. enterica
|
PFGE analysis. PFGE was used to subtype each isolate, using a standard method with XbaI digestion previously described for Escherichia coli (23). For isolates for which the PCR result matched an amplicon code generated by multiple serovars, BioNumerics v 4.1 (Applied Maths, Austin, TX) was used to compare the similarity of the PFGE pattern of the isolate with other patterns of isolates within those serovars. Relatedness was calculated using the Dice similarity coefficient, with optimization and band position tolerance each set to 1.5%. In each case, the PFGE pattern of the isolate with an amplicon code matching multiple serovars was used as the reference pattern for comparison against the banding pattern of isolates from each of the possible serovars. Using more stringent criteria for scoring relatedness, patterns with a similarity of 100% were considered indistinguishable, patterns with 99 to 90% similarity were considered probably related, those with 89 to 80% similarity were considered possibly related, and those with 79% or less similarity were considered unrelated (44).
|
|
|---|
Determination of multiplex PCR amplicon codes for common clinical human and animal Salmonella serovars. To develop the assay, 489 isolates identified by conventional serotyping were used to determine SMART codes for the top 50 most common Salmonella serovars (Table 2). For 43 of the 50 (86%) serovars tested, a single SMART code was produced for each serovar. Isolates of Salmonella serovars Agona, Paratyphi B [including biovar d-tartrate (+)], Stanley, and Javiana each produced two different SMART codes, three codes were generated from serovar Reading, and serovars Poona and Havana each had four different SMART codes.
Twenty-nine of the 50 (58%) serovars screened in the panel resulted in completely unique SMART code identifiers, including Salmonella serovars Typhimurium, Enteritidis, 4[5]12:i:–, Montevideo, St. Paul, Heidelberg, and Javiana, which are among the top 10 most common clinical serovars isolated in the United States (12). Two of the typhoid-causing serovars, Salmonella serovars Typhi and Paratyphi A, also produced unique SMART codes. Twelve of the remaining 21 serovars in the panel shared a SMART code with one additional serovar, and 3 serovars shared a common SMART code with two other serotypes (Table 2). The SMART code 156-11-13 was shared among four isolates (Salmonella serovars San Diego, Reading, Brandenburg, and Chester).
Blinded comparison of high-throughput multiplex PCR typing with conventional serotyping. A total of 751 serotyped isolates of Salmonella enterica were screened in a blinded study to determine the ability of the multiplex PCR assay to correctly determine their serotypes from the SMART code database (Table 2). Five hundred isolates (66.6%) generated SMART codes that corresponded to the codes listed in Table 2 and correctly matched the conventional serotyping results (Tables 3 and 4). A further 139 of the remaining 251 isolates (18.5%) were putatively identified to the correct serotype, but with another serotype or serotypes sharing the same SMART code. Within this shared-code data set, there were 83 isolates which were scored as being one of two serotypes and 56 isolates with a SMART code that was shared by three serotypes. Serotypes with shared SMART codes are described in Table 2.
|
View this table: [in a new window] |
TABLE 3. Comparison of serovar determinations using conventionally derived serotypes and SMART codes
|
|
View this table: [in a new window] |
TABLE 4. Summary of results for serovar determination of 2007 WAPHL Salmonella isolates by SMART
|
There were 54 isolates that corresponded to rare serotypes not screened in the original panel of serotypes that were analyzed for creation of the SMART code database used in the blinded study (Table 4). Twenty-six of these isolates (3.5% of the total tested) produced unique SMART codes, of which isolates of Salmonella serovar Wandsworth, Salmonella serovar Tennessee, and Salmonella serovar Manhattan were the most frequent, with six, four, and three isolates, respectively (Table 3). The remaining 28 of these isolates (3.7% of the total tested) produced SMART codes that matched codes already assigned to other more common serotypes in the database (Table 4). The most common code was 1256-11-13, which was initially scored for Salmonella serovar Hadar. A further six isolates representing five rare serotypes shared this code. It is notable that they also shared the same serogroup, O28 (M). Salmonella serovar Apapa also shared the same code but was from serogroup O45 (W). The last subset of blinded samples included 12 isolates (1.6%) that were S. enterica strains other than S. enterica subsp. I. Interestingly, all specimens produced one of two short codes, i.e., 17-11 and 1-11 (Table 4).
Although the overall sensitivity of the assay with the panel was 85.1%, the data suggest that a larger database encompassing the SMART codes for more isolates and rarer serotypes would be beneficial. The addition of the data from this blinded study will aid in the improvement of the overall sensitivity of the assay. In this study, 9% of the isolates screened had unique SMART codes relative to those in the current database, and these data will be added to further develop the database. Similarly, it was observed that all non-subgenus I isolates screened gave a unique two-digit or very rare three-digit SMART code. The control amplicons 1 and 11 were present in all of these codes, with amplicon 7 being the third amplicon. These data suggest that these can also be discriminated in future screening using the SMART assay. Only Salmonella serovar Lexington gave a 17-11 SMART code, one shared by 10 of the 12 non-subgenus I isolates screened in this study. The other two isolates that were subgenus III had the SMART code 1-11.
PFGE pattern analysis as a tool to further discriminate serotypes for isolates which share SMART codes. Serovars that share SMART codes include isolates of Salmonella serovars Newport, Senftenberg, Paratyphi B [including biovar d-tartrate (+)], Muenchen, Braenderup, and Thompson (Table 2). Isolates representing these six serovars accounted for 125 of 139 (89.9%) isolates in the blinded study that produced shared SMART codes (Table 3). To provide further discrimination, the XbaI PFGE pattern of each of these putatively typed isolates was compared to existing databases containing the PFGE patterns of each serovar. For instance, the PFGE patterns for the nine isolates that generated a SMART code of 1256-11 were compared to the PFGE patterns of isolates from Salmonella serovars Thompson and Havana. PFGE patterns of 129 isolates (92.8%) had >90% similarity to PFGE patterns of isolates representing the serovar identified by conventional serotyping. Eight of the remaining isolates had a similarity of at least 84% with patterns of the correct serovar, suggesting the possibility of relatedness. A PFGE result for one isolate of Salmonella serovar Choleraesuis showed a similarity of 77.4% to the patterns of other Salmonella serovar Choleraesuis isolates in the PulseNet database but was scored as unrelated, using the 80% similarity cutoff value. The similarity to Salmonella serovar Choleraesuis via PFGE was still higher than that to the other potential serovars, Braenderup and Hartford, which share a SMART code with Salmonella serovar Choleraesuis. In this study, the only isolates with shared SMART codes which could not be discriminated further by PFGE pattern analysis were Salmonella serovar Senftenberg and Salmonella serovar Westhampton, which share the SMART code 12578-10-11-13. An isolate of Salmonella serovar Bareilly produced only a smear (data not shown) after XbaI digestion, and thus a PFGE pattern could not be determined for comparison. In total, the use of PFGE pattern comparison allowed the determination of the serotype identified by conventional serotyping for 103 of these 139 isolates (74.1%).
Serovar determination using SMART and PFGE for isolates unidentified by conventional serotyping. The absence or masking of surface antigens is a limitation of conventional serotyping and is usually observed in single-phase, nonmotile, rough, or mucoid isolates. The serotypes of eight isolates could not be determined definitively using conventional methods. The XbaI PFGE pattern was available for all of these isolates. SMART codes were obtained for the isolates, and serovar designations were assigned to six of them. One isolate produced a SMART code that matched Salmonella serovar Mbandaka (antigenic formula 6,7:z10:z15) and had a PFGE pattern with 97% similarity to other Salmonella serovar Mbandaka patterns in the PFGE database. While serologic classification of the first flagellar phase matched that of Salmonella serovar Mbandaka, the serogroup of the isolate was O:42, which is different from that of Salmonella serovar Mbandaka (O:6,7). The strong molecular evidence along with the partial antigenic match suggests that the isolate may be a variant of serovar Mbandaka, although a definitive classification could not be made here due to the discrepancy with the serogroup classification. A second isolate was identified as Salmonella serovar Berta via its SMART code. However, it was serogrouped as O:47, while Salmonella serovar Berta belongs to serogroup O:9. While the possibility of incorrect classification of its O antigen by serotyping cannot be excluded, it seems plausible that the isolate represents a variant of Salmonella serovar Berta, given the 100% similarity of the PFGE pattern of this isolate to the patterns produced by multiple isolates of Salmonella serovar Berta.
Four of the isolates with partial antigenic formulae gave SMART codes that matched two or more serotypes (Table 3). Use of PFGE pattern comparison showed similarities of >90% for patterns of three of these isolates with the PFGE patterns of isolates from one of the putative serovars identified by the SMART code. The combined results of SMART assays and PFGE comparison along with the observation that only one phase of the H antigen could be identified, even with multiple attempts at serotyping, suggest these three isolates may represent monophasic variants of Salmonella serovar Newport, Salmonella serovar Mississippi, and Salmonella serovar Choleraesuis (Table 3). The fourth isolate produced the SMART code shared between Salmonella serovar Stanley and Salmonella serovar Schwarzengrund but did not produce a PFGE pattern of significant similarity (>80%) to the PFGE patterns of other isolates representing either of these serotypes. A unique SMART code, 1456-11, was generated for one isolate belonging to serogroup O:43, but a serovar could not be designated based upon SMART codes from the original panel in Table 2. Finally, a SMART code of 17-11 was generated for the one isolate for which a Salmonella subgenus classification could not be determined using conventional serotyping. This reduced SMART code was the predominant code observed among non-subgenus I isolates and one isolate of Salmonella serovar Lexington (Table 3).
|
|
|---|
Unlike other multiplex PCR-based methods for serotyping S. enterica subsp. enterica isolates, the relevant targets in this assay were amplified in a single reaction, and the assay was developed by testing 489 isolates representing the top 50 U.S. clinical serotypes of Salmonella. The use of fluorescently labeled PCR amplicons permits automatic sampling via a capillary electrophoresis machine that also automates data collection and data analysis. The use of capillary electrophoresis increases the throughput of the assay and also allows the introduction of a quality control regimen based on the predetermined correct amplicon sizes and a threshold fluorescence of amplicons to predict effective labeling of the amplified DNA targets. The transfer of large analyzed data sets is simple via computer, and the creation of binary scores is rapid and reduces errors in scoring, using software such as Excel (Microsoft, Redmond, WA). The principle behind this technology could easily be applied to other bacterial species, and other forms of binary scoring have been applied already to Salmonella, E. coli, and group B Streptococcus (4, 43, 51).
In this study, we demonstrated that the incorporation of a sensitive detection system permits the pooling and expansion of discriminatory targets for a single multiplex PCR targeting 16 genomic targets. A previous study by Kim et al. demonstrated a series of multiplex reactions and single PCRs that could discriminate the common serotypes of Salmonella enterica subsp. I (30). The versatility of the assay was tested by adding a new target to the assay, with the belated addition of fljB after a 15-plex assay had been designed. This helped to identify an emerging subgroup of monophasic isolates, including some members of Salmonella serovar 4[5],12:i:–, that may contain deletions of fljB (1, 15). The current work was based upon advances in Salmonella enterica genomics by several groups in the field, most notably from the laboratory of Michael McClelland (9, 13, 16, 38, 39). Recent publications by Arrach et al. and Scaria et al. have described new typing techniques based on analysis of these comparative genomic hybridizations (4, 30, 42). Arrach et al. designed an array of 384 PCR assays based on the most informative genes found in Salmonella enterica subsp. I. From this data set, they designed an assay consisting of 12 multiplex PCRs, each with eight targeted amplicons, that was capable of identifying 11 serotypes, with at least two discriminatory amplicons to confirm each result. However, the assay requires 12 eight-plex PCRs, analysis by agarose electrophoresis, and result scoring by hand. The complexity of each of these processes is very prone to operator error during reaction setup, sample analysis, and data interpretation. By designing a single multiplex PCR with built-in quality controls and an automated system for sample analysis, data preparation, and scoring, we have developed a rapid assay that is amenable for high-throughput screening of large numbers of specimens, which can be experienced by a reference laboratory during peak testing months for enteric diseases.
To create the SMART database, 489 isolates representing the top 50 clinical and veterinary serovars were screened by the assay, and the majority of the serotypes gave a unique code. The blinded testing of a further 751 isolates revealed that 66.6% were correctly identified from the database and that isolates representing serotypes which share SMART codes (18.5%) were also partially identified. The use of a partially identified isolate's SMART code to direct which serotypes to use to conduct initial PFGE pattern comparison permitted the correct identification of 74.1% of isolates with shared codes. This finding reflects the current need to utilize a parallel subtyping system in conjunction with the SMART method to maximize overall sensitivity. However, the assay was originally designed for use in large local, regional, or national reference laboratories where PFGE is a routine procedure. The PulseNet USA guidelines require state laboratories to subtype each Salmonella enterica isolate submitted by PFGE with XbaI-digested DNA. The resulting pattern is then compared with state and national databases for matching patterns for the monitoring of local and national outbreaks of food-borne diseases (41). Consequently, the use of PFGE complements the confirmation of the serotype and also allows mistyped isolates to be identified when very dissimilar patterns are observed. A further benefit of using this assay is that unique SMART codes can be identified prior to PFGE, and therefore outbreak-related serotypes are identified earlier, allowing for preferential PFGE analysis of these isolates in order to expedite ongoing epidemiologic investigations.
The isolates that were not identified by the assay were comprised mainly of serotypes that had not been screened by the SMART assay, and this reflects a need for a more comprehensive screening with a greater number of rare serotypes. A limiting factor in the development of the SMART database was the number of isolates in the WAPHL strain collection. However, the USDA has over 70,000 archived isolates of Salmonella enterica, and the low cost, simplicity, and speed of the assay will allow for further mass screening to expand the existing database. The assay has demonstrated its potential to identify some monophasic or rough isolates of Salmonella enterica. With monophasic isolates, the assay can identify fljB deletion phenotypes, but other factors known to prevent flagellar antigen expression are not detected by this target. One observation from this study was that Salmonella serovars other than those in subgenus I were discriminated from those in subgenus I, with the sole exception of a single isolate of Salmonella serovar Lexington, of which only 20 isolates have been reported in the United States in the preceding 14 years (12).
Other molecular methods for serovar determination, such as DNA microarrays and liquid microsphere arrays, have great potential for molecular serotyping of Salmonella enterica subsp. enterica due to their sensitivity and specificity (20, 38, 42). However, these methods require extra time for processing due to additional hybridization and washing steps following target amplification or labeling, and microarrays are not well suited for high throughput testing. Currently, these methods are prohibitively expensive for routine testing due to the high costs associated with the required acquisition and maintenance of new instrumentation, consumables, and reagents; in particular, fluorescent dyes and array chips for microarrays and fluorescently labeled microspheres for liquid arrays are very expensive. Most reference laboratories have access to capillary electrophoresis machines due to their widespread use in DNA sequencing-based identification and subtyping of microorganisms (6, 8, 21, 24, 27, 33, 34, 49). A basic cost analysis of the SMART method, including the current costs of reagents, primers, probes, and consumables used for DNA preparation, PCR, and capillary electrophoresis, showed an estimated cost reduction as high as 95% per test versus conventional serotyping ($1.50 per test versus $40.00 per test for serotyping).
The further incorporation of new target sequences with the aim to increase the specificity of the assay is under development. This will reduce the need for PFGE patterns as discriminators and thus increase the suitability of the method for wider use by noncentralized reference laboratories involved with serotyping of Salmonella isolates. A notable feature is the development of other multiplex PCR assays which incorporate a range of different fluorescent dyes into the reaction products. For example, the ABI 3130 system has the capability to simultaneously detect five spectra from different fluorescent dyes, and thus it may be possible to screen up to four separate multiplex PCRs pooled into the same analysis and run with standard size markers. This approach may generate very large amounts of epidemiologic data while keeping the cost per test comparatively low. Examples of this future potential include developing further targets to discriminate serotypes that currently share the same SMART code (30), using molecular markers to replace phage typing of Salmonella serovar Typhimurium (31), subtyping by variable-number tandem repeats (8), and screening for the presence/emergence of clinically significant extended-spectrum beta-lactamases and/or R plasmid markers (11, 36, 37). The cost-effectiveness of this approach not only is of great benefit to public health laboratories involved in food-borne disease surveillance but also has the potential to aid other clinical research studies by screening large sample sets of untyped Salmonella enterica isolates.
We also acknowledge an Emerging Infectious Diseases Fellowship to B.T.L. and an Epidemiology and Laboratory Capacity grant, administered by the Association of Public Health Laboratories and the Centers for Disease Control and Prevention, for funding this work.
The mention of trade names or commercial products in this report is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the Washington State Department of Health or by the U.S. Department of Agriculture.
Published ahead of print on 4 March 2009. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»