Previous Article | Next Article ![]()
Journal of Clinical Microbiology, October 2006, p. 3712-3719, Vol. 44, No. 10
0095-1137/06/$08.00+0 doi:10.1128/JCM.00843-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
John Inman-Bamber,1,
Graeme R. Nimmo,2
Wendy Munckhof,3
Jacqueline Schooneveldt,2
Bruce Harrison,4
Jennifer A. McMahon,4 and
Philip M. Giffard1*
Cooperative Research Centre for Diagnostics, Institute of Health and Biomedical Innovation, QUT, Brisbane, Australia,1 Queensland Health Pathology Service, Brisbane, Australia,2 Princess Alexandra Hospital and District Health Service, Brisbane, Australia,3 Corbett Life Science, Eight Mile Plains, Brisbane, Australia4
Received 21 April 2006/ Returned for modification 29 June 2006/ Accepted 29 July 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
We have devised a systematic approach to the development of new bacterial genotyping methods. The central hypothesis is that, given a defined data set of genetic diversity within a bacterial species, an appropriate means of analyzing those data, and a numerical description of the required resolving power of a genotyping method, it is in principal always possible to identify a set of polymorphic sites that, if interrogated, will provide the required resolving power in an efficient manner. Robertson et al. (25) described the "Minimum SNPs" software that can derive highly informative sets of single-nucleotide polymorphisms (SNPs) from DNA sequence alignments and the application of this to the development of SNP-based genotyping methods for Neisseria meningitidis and Staphylococcus aureus. The SNPs used were derived from MLST databases. The most generally applicable SNP sets were optimized through maximization of Simpson's index of diversity (D). This work was extended by Stephens et al. (28), who showed that, in the case of S. aureus, the genotypes defined by a seven-member high-D SNP set were concordant with the population structure of the species, and that it is possible to identify a small set of binary markers (genes present in some isolates but not others) that efficiently add genotyping resolution.
Real-time PCR is a single-step closed tube method that is amenable to automated setup and data analysis. Real-time PCR devices are increasingly cost-effective to purchase and run, standardized real-time PCR-based procedures can incorporate the interrogation of different classes of polymorphisms (e.g., SNPs plus genes that exhibit binary variability), and real-time PCR-based procedures have the potential to incorporate primary diagnosis and target quantification. An inherent disadvantage of real-time PCR is that it is not very amenable to multiplexing, so interrogating multiple genetic targets normally requires multiple reactions. In our previous reports of the application of real-time PCR to SNP-based bacterial genotyping, the technique used was allele-specific PCR in the real-time format (kinetic PCR) (10, 25, 28). The advantage of kinetic PCR is that it is simple and robust and requires only generic mastermix and unlabeled primers. The disadvantage is that it requires at least two reactions per SNP, making the interrogation of multiple SNPs somewhat unwieldy and expensive. Therefore, an objective of this study was to develop variants of kinetic PCR that preserve the advantages but reduce the cost and the number of reactions required.
Here we report the development of two novel formats for allele-specific real-time PCR, the application of these methods to the genotyping of a large number of S. aureus isolates, and automated methods for DNA preparation, reaction setup, and data analysis. The utilities of these methods have been compared using S. aureus as a model system. This study has yielded a robust, cost-effective, and automated procedure for S. aureus genotyping. The small number of genetic targets interrogated means that this approach is especially amenable to adaptation to emerging technologies such as "lab-on-a-chip" devices and dedicated, fully automated real-time PCR machines.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Bacterial isolates. All S. aureus isolates used for the SNP genotyping were collected between June 2004 and November 2004 in southeast Queensland, Australia. They were obtained from 391 cases of inpatient and outpatient infections at the Princess Alexandra, Ipswich, Queen Elizabeth II, Logan, Redland, Boonah, and Beaudesert Hospitals. The identities of isolates were confirmed by nuc PCR, and methicillin resistance was confirmed by mecA PCR. A list of all isolates, together with genotyping results, is available upon request.
DNA extractions. Cultivation of all S. aureus isolates and DNA extractions from a small subset of the isolates were as described by Stephens et al. (28). As part of the current study, an automated method for S. aureus DNA extraction based on the Corbett X-tractor Gene automated DNA extraction system (Corbett Robotics, Brisbane, Australia) was developed. The optimal protocol arrived at was as follows. Twelve colonies from overnight cultures on blood agar plates were suspended in 100 µl lysostaphin buffer (40 µg/ml) and incubated at room temperature for 30 min. Lysates were transferred to the X-tractor Gene (Corbett Robotics, Brisbane, Australia) and 200 µl liquid sample digest buffer (catalog no. B5810) using X-tractor Gene reagent pack (catalog no. XTR-1; Corbett Life Science, Brisbane, Australia) was added, and incubation at room temperature continued for a further 30 min. The program from this point on was identical to the whole-blood DNA extraction protocol in the X-tractor Gene product literature (http://www.corbettlifescience.com/control.cfm?page=X-tractor_Gene).
SNPs and binary markers. The SNPs used in this study were derived from the S. aureus MLST database available at http://www.mlst.net. The SNPs were identified using the software package Minimum SNPs, v.2.0415. This program is as described by Robertson et al. (25) but with the added feature that the user can specify that certain SNPs are excluded from or included in the SNP set (24). This greatly facilitates the assembly of SNP sets with reference to primer design constraints. The eight-member SNP set ultimately used in this study provides a Simpson's index of diversity (D) of 0.95 with respect to the entire S. aureus MLST database. The SNPs are acrC210, tpi241, tpi243, arcC162, gmk318, pta294, tpi36, and pta383. The binary markers interrogated in this study were pvl, cna, sdrE, pUB110, and pT181.
Primer sequences. The sequences of all primers used in this study are shown in Table 1. Light upon extension (LUX) primers were obtained from Invitrogen Life Technologies, while the unlabeled primers were obtained from Proligo (Lismore, NSW, Australia). Primer design was carried out using Primer Express 2.0.0 (Applied Biosystems) and the LUX primer design software (Invitrogen).
|
Kinetic PCR for SNP genotyping of S. aureus using SYBR green was conducted for the most part as per Stephens et al. (28), with format modifications as described in Results and Discussion. The reactions were performed on a Corbett Rotor-Gene 3000 or an Applied Biosystems ABI7000 sequence detection system. One microliter of extracted DNA (1 to 3 ng) was added to 19 µl of reaction mastermix containing 10 µl of the 2x SYBR green PCR Mastermix (Applied Biosystems) and 8 pmol of each primer. Temperature cycling for these reactions was as follows: 50°C for 2 min, 95°C for 10 min, followed by 40 cycles of 95°C for 15 s, 56°C for 20 s, and 72°C for 35 s. The SNPs interrogated and primer sequences for these reactions are shown in Tables 2 and 3.
|
|
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
The relationship between SNP profiles and the S. aureus population structure as defined by eBURST analysis (5) is available at http://www.ihbi.qut.edu.au/research/cells_tissue/phil_giffard/. These data depict the relationship between SNP profiles and eBURST clonal complexes for the 403 STs included in the eBURST analysis and the relationship between SNP profiles and STs for all STs in the database. Consistent with the findings of Stephens et al. (28), the major clonal complexes and a subset of singletons are discriminated, and there is little or no discrimination within clonal complexes. It was concluded that these SNPs are suitable for assigning S. aureus isolates to major clonal complexes. Interrogation of these SNPs can also be combined with methods that interrogate rapidly evolving markers in "phylogenetic hierarchical resolving assays using nucleic acids"-like genotyping procedures, as described by Keim et al. (12).
The eight SNPs define 61 extant S. aureus genotypes. This is a consequence of the selection of the SNPs on the basis of their states being as uncorrelated as possible and therefore having maximal informative power in combination with each other. We have previously speculated that these SNPs are very ancient in origin and so have had time to be reassorted by horizontal gene transfer and recombination (28). This is consistent with other studies that have shown that, in clonal organisms, a SNP set defines about as many genotypes as there are SNPs; conversely, in nonclonal organisms, the number of genotypes can be much more than the number of SNPs (6, 9, 11, 12, 15). With S. aureus, the SNPs that discriminate clonal complex founders from single-locus variants are likely of recent origin, have not been reassorted by recombination, and in consequence, are polymorphic in a tiny fraction of the species' diversity and have little informative power when interrogated.
Automation of genome DNA extraction from S. aureus isolates. An automated procedure for extracting and purifying S. aureus DNA using the Corbett Robotics X-tractor Gene was developed, optimized, and ultimately used to extract DNA from all isolates used in this study. The procedure was based around the method for DNA extraction from whole blood (Buffy Coat DNA extraction protocol, Corbett Life Sciences, Australia), so the optimization focused on cell lysis. Lysostaphin is the most costly reagent in the extraction procedure, so the amount of lysostaphin required has been minimized. Additionally, it was shown that the incubations can all be conducted at room temperature, which means that all manipulations subsequent to suspension of the colonies in the lysis buffer can be carried out by the X-tractor Gene. The combinations of lysostaphin, proteinase K, and incubation temperatures that were tested, together with the yields and quality of nucleic acid are available (see Materials and Methods for optimal protocol). Agarose gel electrophoresis revealed that the purified DNA is intact and free of RNA (data not shown). The X-tractor Gene program is available from us. The cost and time of the extraction procedure were calculated as $1.60 and 3.5 min per isolate, including all reagent costs.
Novel real-time PCR-based approach to SNP interrogation based upon fluorescent primers. A central aim of this study was to develop new kinetic PCR formats that will allow the streamlining of SNP-based bacterial genotyping.
LUX real-time PCR primers increase in fluorescence when incorporated into PCR products (Invitrogen product literature) and so allow amplimer detection and quantification. They are available labeled with 6-carboxyfluorescein (FAM) and 6-carboxy-4',5'-dichloro-2',7'-dimethoxyfluorescein (JOE) and so can be used in biplexed real-time PCR procedures. The most straightforward way of using these primers for kinetic PCR is to make allele-specific LUX primers and carry out both allele-specific reactions in the same tube. However, this strategy is flawed for two reasons. First, carrying out both the allele-specific reactions for a particular SNP in a single tube is competitive priming rather than standard allele-specific PCR. In competitive priming, the increase in amplimer as the reaction proceeds provides more opportunities for mispriming by the mismatched primer and so reduces the reaction specificity (8). Second, we have observed on many occasions that adding labels to allele-specific primers changes their priming efficiencies unpredictably, making complete reoptimization of the assay necessary and often difficult or impossible (data not shown). Accordingly, we devised a biplexing approach in which the allele-specific reactions for a particular SNP are carried out in separate tubes and the allele-specific primers are unlabeled. In this method, a given reaction tube contains single allele-specific reactions for two different SNPs, the LUX primers are the non-allele-specific "common" primers that are in effect used to identify the target, and the allele-specific primers are unlabeled and essentially identical to the primers used in our prototype SYBR green-based procedures. We have termed this format "distributed multiplexed kinetic PCR" (DMKP), and it is depicted in Fig. 1A.
|
|
The LUX primer-based genotyping with automated setup was tested extensively for reproducibility using isolates of known genotype as determined by full sequence typing and/or by an adaptation of the SYBR green-based SNP plus binary gene method described by Stephens et al. (28) and also by carrying out duplicate assays on unknown isolates. In total, 391 isolates were genotyped, of which 44 were initially of known genotype, and 260 were done in duplicate. Significant difficulties were experienced in achieving high levels of reproducibility. It was necessary to empirically develop quite complex criteria for calling SNPs and the presence or absence of binary genes. The criteria were different for each SNP, reflecting differences in the relative priming efficiencies of the allele-specific primers. In general, criteria were developed to determine cycle time cutoff and
CT values. Extensive examination of the data has led us to conclude that the very small fluorescence increases that take place when LUX primers are incorporated into PCR products (<0.2-fold) result in unreliability in the signal normalization. It is also possible that the multiplexing and/or variation in automated reagent quantities were impacting the results, but it was not possible to unambiguously discriminate these effects from signal normalization issues. Ultimately, it was found to be necessary to analyze the data with the "slope correct" and "ignore first 10 cycles" functions inactivated to avoid large artifacts in the normalization of the SNP interrogation reactions, while the binary gene interrogation was most reliable with these functions switched on. The criteria for SNP calling are available at http://www.ihbi.qut.edu.au/research/cells_tissue/phil_giffard/. When these were used, the accuracy of SNP calling was 98% when full MLST determination or SYBR green-based SNP interrogation were used as the gold standard. This figure of 98% is for single SNPs, so the incidence of correct genotypes was less than that.
These criteria for SNP calling were incorporated into a Microsoft Excel macro. Raw CT values may be pasted into the macro, which outputs the SNP profile and also the clonal complex if the SNP profile matches one of the major clonal complexes. This macro is available from us. A similar degree of accuracy was found with the binary markers, with 98% of the LUX data points in accordance with the SYBR green method. A cycle time value of <27 was used as the criterion to call the presence or absence of binary markers using LUX.
It was concluded that the DMKP using LUX primers is a method that shows considerable promise. However, it would be significantly more attractive if the LUX primers displayed a greater fluorescence increase when incorporated into PCR products. As a consequence, we set out to develop an alternative strategy to streamlining SNP-based genotyping.
A novel approach to streamlining SYBR green-based kinetic PCR.
Despite intense efforts, we were unable to achieve the accuracy of DMKP using LUX primers higher than 98%. Therefore, an alternative strategy was devised. We have previously observed that kinetic PCR using SYBR green is very robust (25, 28). Also, we have observed that the relationships between the kinetics of reactions interrogating different SNPs are stable from experiment to experiment (data not shown). Therefore, it was reasoned that it should be possible to call the SNPs using just one reaction per SNP, provided that each reaction was compared with a single "universal" control reaction (Fig. 1B.) This was initially tested by reanalyzing our collection of SYBR green kinetic PCR data to determine if the SNPs could be called using just one of the allele-specific reactions and comparing the CTs to a single arbitrarily chosen reaction from within the assay. It was found that, in essentially all cases, the SNP could indeed be called accurately (data not shown). Accordingly, a standardized format and data analysis protocol were developed. In this format, one of the eight SNPs (tpi36) is interrogated using the conventional "two-reaction" method. The reaction that gives the lowest CT (and so has the perfectly matched allele-specific primer) is used as the control. This avoided the necessity of finding a new invariant target to use as the control, and it also allowed one of the eight SNPs to be interrogated using the exceptionally robust "two-tube" method. The conventional "two-tube" interrogation of tpi36 gives the smallest
CT (i.e., weakest SNP signal) of any of the "two-tube" reactions (data not shown), so this was used as the control. We have named this approach "single-tube kinetic PCR" (STKP). The primers used in the current STKP method for interrogating the eight S. aureus high-D SNPs are shown in Table 3.
The robustness of the STKP reaction was determined by genotyping in duplicate 44 S. aureus isolates. All of these isolates had been previously genotyped using the conventional "two-tube" kinetic PCR and/or complete ST determination. This constituted 704 SNP interrogations. Just one of these 704 reactions gave a result that was inconsistent with the other genotyping methods. Therefore, this procedure is robust to a degree that makes duplicates virtually unnecessary. The reproducibility of the CT values is shown in Fig. 2. It can be seen that the different alleles of a given SNP give rise to signals that are separated by >3 standard deviations.
|
As previously stated (28), our view is that the SNP-based genotyping is of greatest utility when combined with interrogation of rapidly evolving markers. This STKP reaction could easily be combined with reactions designed to interrogate the presence or absence of binary markers. This could be done either by SYBR green (28), DMKP using LUX primers (this article), or indeed, other chemistries, such as TaqMan. Given our current experience, the best compromise between robustness and economy of time and reagents would be obtained by using STKP to interrogate the SNPs and a multiplexed fluorescent primer- or probe-based method to interrogate the binary markers. This is because our data do not currently indicate that a combined DMKP/STKP method in which there are two SNPs interrogated per reaction tube would be sufficiently robust. Therefore, the SYBR green STKP remains the best choice, pending improvements in chemistry or data analysis software. In the case of the binary markers, the adoption of an STKP strategy is not a meaningful concept because there is only one reaction per polymorphic site. Therefore, multiplexing remains the only available strategy for reducing the number of "binary marker" reaction tubes. Determining the presence or absence of binary markers is an inherently simple task. Although we only achieved 98% accuracy using the LUX primers, the great bulk of our optimization efforts were focused on the SNPs, so that figure could probably be improved, particularly if the primers fluoresced more intensely. In addition, there is a wide choice of highly effective real-time PCR chemistries for readily determining the presence or absence of genes in multiplexed reactions, and any of these could be used (16, 19, 23).
The reagent cost of the real-time PCR procedures was calculated at A$19.53 per isolate for the LUX-based method (includes binary markers) and A$13.36 per isolate for the SYBR green-based method (SNPs only). The great majority of the cost is in the mastermixes, so there is scope for considerable cost reductions through volume reductions and identification of cheaper mastermix products.
Development of an assay for the identifications of S. aureus ST-93: the "Queensland clone." It is now clear that community acquired MRSA (CA-MRSA) is caused by strains belonging to a variety of S. aureus clonal complexes, with the phage-borne pvl gene being an important pathogenicity determinant (27). In Australia, the major CA-MRSA clones belong to ST-1, ST-30, and ST-93. The ST-1 and ST-30 clones are found over large parts of the world, but ST-93 is a singleton (not closely related to other STs) that, to date, has been found only in Australia (2, 20). This, in combination with the fact that it appears to be highly virulent and responsible for a large proportion of CA-MRSA infections in Australia, makes it of particular interest to clinical and public health microbiologists. Accordingly, we set out to develop an identification procedure for this clone.
It was found that ST-93 gives the same SNP genotype as the ST-59 and ST-121 clonal complexes and a number of other singletons. This is not optimal, as the unrelated CA-MRSA SF25:P clone that is mainly found on the U.S. West Coast is ST-59 (7). However, as has been reported by Stephens et al. (28), we were able to use Minimum SNPs to reveal that a G at position 252 of the MLST locus aroE discriminates ST-93 from all other known STs. A kinetic PCR method for interrogating the aroE252G SNP was developed and shown to be robust using isolates of known ST. This assay was used to screen 49 isolates from our collection of 391 that gave the SNP profile characteristic of ST-59 and ST-93. Forty-six of these gave a result characteristic of ST-93. Of these, 30 were methicillin resistant and pvl positive and thus consistent with the Queensland clone. The remaining 16 were methicillin susceptible, and 12 of these were also pvl positive. Only one of the isolates that were negative for the ST-93-specific SNP was pvl positive.
It was concluded that this procedure is useful for screening for the ST-93 "Queensland clone" either on its own or in combination with the high-D SNPs and/or the pvl gene.
Genotyping results. The genotypes of all isolates were established on the basis of either completely consistent duplicate DMKP results or from STKP results. The complete data set is available at http://www.ihbi.qut.edu.au/research/cells_tissue/phil_giffard/, and a summary is shown in Table 5. The results are consistent with earlier reports. For example, clonal complex 239 (Aus-1/Aus-2 strain) is known to be the most abundant hospital-acquired MRSA strain in southeast Queensland. As expected, this was the most abundant SNP profile. It is of interest that the majority of the isolates are MRSA, which suggests that the clonal complex 239 isolates in this region are purely of epidemic origin. Consistent with this, clonal complex 239 MRSA are very common worldwide (1-3), and its evolutionary origin has been hypothesized to have resulted from recombination of ST-8 and ST-30 (26). The two major community acquired clones in southeast Queensland are the Southwest Pacific/Western Samoan phage pattern/Oceania strain (clonal complex 30) and the Queensland clone (ST-93) (2, 17, 20, 21). The SNP profile corresponding to these clones was abundant, and as expected, the pvl gene was present in a large proportion of these isolates. The somewhat unexpected abundance of MSSA isolates with the same SNP and pvl genotypes is significant because it suggests that the establishment of these lineages in this region predates their acquisition of SCC-mec. ST-93 is worthy of continued focus. It is a singleton according to eBURST analysis and, to date, appears to have been found only in Australia. It very commonly harbors the pvl gene and has been associated with fatal necrotizing pneumonia (22). The other abundant genotypes that include significant numbers of MRSA isolates correspond to clonal complexes 1, 5, and 8, and this is consistent with the previously reported abundance of these clonal complexes worldwide. Clonal complexes 5 and 8 are not generally associated with community-acquired infections, and as expected, the isolates with these SNP profiles are largely pvl negative. Clonal complex 1 is associated with community-acquired infections in Australia and the United States (18). The Australian representatives of this lineage are assigned to the Western Australia clone. It has previously been reported that the pvl gene is uncommon in the Western Australia clone (20), and our results are consistent with this. Finally, this study provided data that justifies ongoing surveillance of S. aureus. The presence of the pvl gene in representatives of clonal complex 5 and 239 is cause for concern.
|
In conclusion, we have developed an efficient means of genotyping S. aureus. The SNP profiles were able to discriminate the major clones extant in southeast Queensland. Additional resolution may be obtained by interrogating binary markers. Even more resolution could potentially be obtained by measuring the melting temperatures of tandem repeat loci in conjunction with SNP and/or binary marker interrogation. We are currently determining the informative power of this approach, which is, in effect, a three-stage "phylogenetic hierarchical resolving assays using nucleic acids"-like procedure.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
F.H. and J.I.-B. contributed equally to this study. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Antimicrob. Agents Chemother. | Clin. Microbiol. Rev. |
|---|---|
| Clin. Vaccine Immunol. | ALL ASM JOURNALS |
|---|