Previous Article | Next Article ![]()
Journal of Clinical Microbiology, May 2005, p. 2407-2417, Vol. 43, No. 5
0095-1137/05/$08.00+0 doi:10.1128/JCM.43.5.2407-2417.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Food Science,1 Quality Milk Production Services, Cornell University, Ithaca, New York 148532
Received 6 May 2004/ Returned for modification 7 November 2004/ Accepted 14 January 2005
|
|
|---|
|
|
|---|
Many phenotypic and DNA banding pattern-based typing methods have been used to characterize S. uberis, including bacteriocin typing (18), restriction endonuclease fingerprinting (18), rapidly amplified polymorphic DNA (RAPD) typing (37, 53, 56), repetitive element polymorphism-PCR (53), and pulsed-field gel electrophoresis (2, 38). While these methods have contributed to insights into the epidemiology of S. uberis mastitis, they have limitations in terms of typeability, discriminatory power, and reproducibility. For several streptococcal species, including S. agalactiae, S. pneumoniae, S. pyogenes, and S. suis, multilocus sequence typing (MLST) schemes have been developed (4, 6, 21, 26), which will likely supersede banding pattern-based typing methods in the near future. In contrast to banding patterns, DNA sequence data, such as those generated in MLST, are easy to standardize, store, and exchange electronically, facilitating analysis, interpretation, and diagnostic use of typing results for isolates from multiple sources, regions, and countries (4). In addition, while it is difficult to correlate band differences to genetic relatedness of strains (45), DNA sequence data can be used for phylogenetic and evolutionary analyses (7, 22, 41). Thus, MLST schemes provide more standardized and informative strain typing data than banding pattern-based methods. While the traditional MLST schemes are based on sequencing of multiple housekeeping genes (4), inclusion of virulence genes (41) or other hypervariable genes (35) in an MLST scheme can enhance discriminatory power. Because genetic variation accumulates slowly in housekeeping genes, MLST based on housekeeping genes is particularly suited for global epidemiological studies (4). Virulence genes may show more short-term variability, and sequencing of virulence genes may thus be more suited to study local epidemiology (57). Typing schemes that encompass housekeeping genes as well as virulence genes also allow for detection of parallel evolution and acquisition or loss of virulence genes (22, 41).
In order to develop a standardized DNA sequencing-based subtyping method for S. uberis, we designed an MLST scheme that includes housekeeping genes as well as virulence genes. A strain set that included a collection of 30 diverse S. uberis isolates from the United States as well as 20 S. uberis isolates from The Netherlands with well-defined epidemiological relatedness (56) was used for ribotyping as well as MLST typing to compare the discriminatory power of these subtyping methods as well as their abilities to reveal epidemiologically meaningful relationships. Phylogenetic analyses were conducted to provide insight into the population structure and evolution of S. uberis. These data will ultimately help us to better understand the evolution of strain-specific transmission and virulence characteristics in this important bovine pathogen and may help us to understand mechanisms behind emergence of new strains or shifts in mastitis epidemiology in response to control measures, including antibiotic treatment and vaccination.
|
|
|---|
BTM samples were collected and processed by Cornell University's Quality Milk Production Services following standard recommendations (36), and quarter milk samples from The Netherlands were collected and processed as described previously (56). Briefly, upon culture from milk, preliminary identification of isolates was based on colony morphology on blood-esculin agar, Gram stain, and catalase testing (36) followed by speciation with the API20STREP system as recommended by the manufacturer (BioMérieux, Inc., Hazelwood, MO). Isolates were stored at 80°C in brain heart infusion (BHI) broth with 15% glycerol. For ribotyping and additional analyses, streptococci were grown on BHI agar. Bacterial lysates for PCR were prepared as described by Furrer et al. (11). Species identity of all 50 S. uberis isolates and the S. parauberis isolate was confirmed by PCR using S. uberis or S. parauberis primers targeting species-specific parts of the 16S rRNA gene (13).
DNA banding pattern-based strain typing. Ribotyping of 50 S. uberis isolates and one S. parauberis isolate was performed twice, once with restriction enzyme EcoRI and once with PvuII, using the RiboPrinter microbial characterization system (Qualicon, Wilmington, DE). RAPD typing of isolates from The Netherlands had been performed previously (56), using primer OPE-04 (5'-GTGACATGCC-3') (37, 53). Simpson's index of discrimination (SID) (15) was calculated for ribotyping but not for RAPD typing, because RAPD results were not generated for BTM isolates.
MLST. Six genes, including housekeeping genes, vaccine targets, and virulence genes (Table 1) were chosen for inclusion in the MLST scheme described here. For gapC and pauA, DNA sequence data from GenBank were used to design primers in Primer Select (DNAstar; Lasergene). Primers for amplification of cpn60, oppF, sodA, and tuf were taken from the literature (12, 24, 40, 43). Primer sequences, sources of sequence information, and reaction conditions for DNA amplification by PCR are listed in Table 2.
|
View this table: [in a new window] |
TABLE 1. Target genes in multilocus sequence typing scheme for S. uberis
|
|
View this table: [in a new window] |
TABLE 2. Primer sequences and source information, amplicon size, and thermocycling parameters for MLST of S. uberis
|
Sequence data analysis. Sequence data were proofread in SeqMan, and high-quality, double-stranded sequence data were used for further analysis. Sequences for each gene were aligned in MegAlign (DNAstar; Lasergene). Data for the full coding sequence were used for pauA, while partial coding sequences, read in frame, were used for housekeeping genes. DnaSP version 4.0 (42) was used for descriptive analyses, including allele assignment. Different alleles were assigned to gene sequences that differed from each other by one or more polymorphisms. Each unique combination of alleles was considered a sequence type (ST), and BURST analysis (based upon related STs) was used to identify single-locus variants (SLV), double-locus variants (DLV), and satellites (SAT) for each sequence type and to detect the presence of clonal complexes (http://fi-srvmlst1-ide.sm.med.ic.ac.uk/burst/ [last accessed 03/15/2004]).
DnaSP version 4.0 (42) was also used to calculate the G+C content of the sequenced genes as well as the number of polymorphic sites, average number of pair-wise differences between alleles, and average nonysynonymous/synonymous rate ratio (dN/dS). Calculations were performed twice: once for the set of all 50 S. uberis isolates, and once for a "core set" of 49 S. uberis isolates that did not include isolate FSL Z1-015. Isolate FSL Z1-015 had a highly divergent DNA sequence compared to the remaining S. uberis isolates (see Results) and strongly skewed the results of some analyses.
To asses whether the evolution of genes could be adequately represented by the classical (bi)furcating tree model, data were tested for reticulate evolution within genes (17, 44). Methods for detection of reticulate evolution within genes included (i) Rm, the minimum number of recombination events based on the four-gamete test (14) as implemented in DnaSP version 4.0; (ii) construction of split decomposition trees with 1,000 bootstrap replicates based on parsimony splits as implemented in SplitsTree 4.0 BETA 4 (16; http://www-ab.informatik.uni-tuebingen.de/software/splits/ [last accessed 12/06/2004]); and (iii) calculation of compatibility of informative sites within genes as implemented in Reticulate (17). Again, analyses were performed including and excluding the strongly divergent isolate FSL Z1-015.
The model of evolution best describing each gene was determined with ModelTest version 3.06 (39), using hierarchical likelihood ratio tests as the criterion. Settings for the best-fitting model were subsequently used in PAUP version 4.0 beta 10 to determine whether evolution under the specified model followed a molecular clock and to generate maximum likelihood (ML) trees with 100 bootstrap replicates. In these trees, S. parauberis was used as an outgroup. Finally, compatibility between genes was assessed through visual comparison of bootstrapped ML trees for individual genes and through analysis of concatenated sequence data for all genes in Reticulate (17).
Detection of positive selection. Using codon substitution models that account for heterogeneous selection pressure at amino acid sites, positively selected sites can be specifically identified and the posterior probability of positive selection for each site can be calculated (55). The fit of nested models with and without positive selection can be compared using likelihood ratio tests. For genes with indications of positive selection based on average dN/dS, the codon substitution models 0, 1, 2, 3, 7, and 8, which are described in detail elsewhere (55), were used to test for evidence of amino acid sites under positive selection. Briefly, model 0 assumes that there is one dN/dS ratio for all sites. Model 1 assumes that a proportion of sites has dN/dS = 0 (purifying selection, conserved sites), while the remainder of sites in the gene have dN/dS = 1 (neutral selection). Model 2 assumes that dN/dS can take on one of three values, i.e., 0 (purifying selection), 1 (neutral selection), or a third value that is estimated by the model and that can either account for so-called "slightly deleterious" mutations or for positive selection but not for both at the same time. Model 3 is similar to model 2 but allows for multiple fixed levels of dN/dS. In model 7, levels for dN/dS are not fixed at discrete values but follow a beta distribution. Positive selection is not covered in model 7, because the beta distribution is restricted between dN/dS = 0 and dN/dS = 1. Model 8 combines the beta distribution from model 7 with an extra category for dN/dS, which does allow for positive selection (dN/dS > 1). Analyses were run in the codeml program of PAML version 3.13 (phylogenetic analysis using maximum likelihood), and likelihood ratio tests were performed for model 2 versus 1 (three versus two fixed levels of dN/dS), model 3 versus 0 (multiple versus one fixed level of dN/dS), and model 8 versus 7 (beta distribution of dN/dS with or without positive selection) (55).
In addition to PAML, a likelihood-based method for detection of positively selected sites, the parsimony-based ADAPSITE program was used to identify positively selected sites (47). The location of residues in the ß-domain of the pauA-encoded plasminogen activator that were shown to have nonsynonymous mutations was visualized in Cn3D version 4.1 (http://www.ncbi.nih.gov/ [last accessed 04/28/2004]) using alignments with the homologous streptokinase sequence of Streptococcus dysgalactiae subsp. equisimilis (49).
|
|
|---|
|
View this table: [in a new window] |
TABLE 3. Examples of RiboPrint patterns of S. parauberis (FSL S3-563) and S. uberis (all other isolates) generated with the restriction enzymes EcoRI and PvuII
|
|
View this table: [in a new window] |
TABLE 4. Source information, strain assignment, and MLST results for 50 S. uberis isolates and one S. parauberis isolate from bovine milk
|
|
View this table: [in a new window] |
TABLE 5. Compositional characteristics of genes in MLST scheme for S. uberis
|
Concordance of ribotyping, RAPD, and MLST. RAPD data available for the 20 S. uberis isolates from The Netherlands (56), representing 15 RAPD types, were used to compare discriminatory power and epidemiological relevance of RAPD, ribotyping, and MLST. Three RAPD types, A, B and G, were represented by more than one isolate in the data set (Table 4). The one isolate with RAPD-A obtained from herd 32 differed in both its PvuII ribotype and its ST (Table 4) from the other three isolates with this RAPD type, which were from herd 33 (Table 4). Two of three RAPD-A isolates in herd 33 (Z1-063 and Z1-095), both collected in December 1997, shared the same ribotype (799-5) and ST (37). The third RAPD-A isolate (Z1-124) from herd 33, which had been collected in August 1998, had a different ribotype (801-7) and ST (40). Thus, ribotyping and MLST differentiated within RAPD-A in accordance with herd of origin and time of sample collection. For RAPD-B, two isolates (Z1-075 and Z1-082) were collected from two cows in herd 32 in May and June 1998 during an outbreak of S. uberis mastitis and were considered to be epidemiologically related (49). STs for these isolates were identical, while ribotype patterns differed by one band (Table 4). For RAPD-G, the two isolates collected from herd 32 and 33 (Z1-005 and Z1-118) differed in ribotype and ST, in accordance with the difference in herd of origin.
Ribotype 116-783-3 was represented by seven isolates, which originated from seven herds on two continents and were thus considered epidemiologically unrelated; these isolates were grouped into seven different STs. Four ribotypes (116-795-2, 116-783-6, 116-790-2, and 116-790-6) were represented by two isolates each, and within each of these ribotypes the isolates originated from different herds and belonged to different STs. Two other ribotypes were also represented by two isolates each. Isolates originated from one herd and two herds for ribotypes 116-795-5 and 116-783-4, respectively. Within each of these two ribotypes, isolates were assigned the same ST. Ribotype 116-799-5 was represented by four isolates from one herd. Among the four isolates, three RAPD types had been identified, and three STs were assigned in accordance with RAPD patterns. In summary, comparison of ribotyping, RAPD, and MLST results showed that MLST provided subtype discrimination within RAPD types or ribotypes, which is in accordance with the epidemiological origin of isolates.
Descriptive analysis of sequence data. Sequence alignments and phylogenetic trees showed that the individual gene sequences for S. uberis isolate FSL Z1-015 were highly divergent from the consensus sequence for the other 49 S. uberis isolates (Fig. 1). For all housekeeping genes, sequences of FSL Z1-015 were more similar to the sequences for S. parauberis isolate FSL S3-563 and for other S. parauberis isolates obtained from GenBank (AF4855798, AF485799, and AF485800 for cpn60; AF421901 for gapC; AJ544723 for sodA; and AY267004 for tuf) than to the sequences for the other 49 S. uberis isolates. In bootstrap analysis of ML trees for housekeeping genes, FSL Z1-015 was separated from the remaining isolates with a bootstrap value of 100, while it was not separated from S. parauberis, which was included in the analysis as an outgroup (Fig. 1). To confirm the species identity of FSL Z1-015, partial sequencing of rnpB (encoding the RNA subunit of endoribonuclease P) and rpoB (encoding the beta subunit of RNA polymerase), two genes previously used to probe the phylogeny of streptococcal species, was performed as described elsewhere (3, 48). rnpB and rpoB sequences for FSL Z1-015 were identical to those obtained for six S. uberis isolates representing six EcoRI RiboPrinter patterns (Table 3, groups 1 to 5 and 7) and different from those for S. parauberis isolate FSL S3-563, confirming the classification of FSL Z1-015 as S. uberis. Analysis of sequence data after removal of FSL Z1-015, leaving a core set of MLST data for 49 S. uberis isolates, reduced the number of polymorphic sites per gene dramatically but had little impact on the number of parsimony-informative sites (Table 5). G+C content was approximately 39% for all genes, with the exception of pauA (34%) (Table 5). The average dN/dS was low (0.00 through 0.12) for all genes, again with the exception of pauA (dN/dS = 1.2). Exclusion of isolate FSL Z1-015 strongly reduced the number of synonymous and nonsynonymous substitutions, but the effect on dN/dS ratios was limited (Table 5).
![]() View larger version (23K): [in a new window] |
FIG. 1. Maximum-likelihood trees for gapC, sodA, and tuf based on sequence data for 50 S. uberis isolates. Sequences used represent all combinations of alleles for the three genes. Bootstrap values of >50 are shown. S. parauberis (S3-563) was used as an outgroup (italicized). Isolates from a major branch of the gapC tree are labeled (bold), and distribution of the isolates from this branch over sodA and tuf trees is shown. Bootstrap analysis clustered isolate FSL Z1-015 (star) together with S. parauberis and separate from other S. uberis isolates for all three genes.
|
|
View this table: [in a new window] |
TABLE 6. Evolutionary characteristics of genes in the MLST scheme for S. uberis
|
0.05 for each gene, with the exception of sodA in the analysis including FSL Z1-015). High compatibility of sites within genes (range, 89.1 to 100%; average, 95.9%) relative to compatibility of sites between genes (range, 35.4 to 72.9%; average, 55.9%) indicates that different genes in the core set of 49 isolates experienced distinct evolutionary histories. Visually, this is indicated by a large proportion of black squares for between-gene comparisons in the compatibility matrix (Fig. 2) with no black squares within genes. When all 50 isolates were included in the analysis, within-gene compatibility (range, 78.6 to 94.4%; average, 88.6%) was still much higher than compatibility between genes (43.6% to 76.4%; average, 59.9%), but incompatibilities were visible within each gene due to the inclusion of highly divergent isolate FSL Z1-015.
![]() View larger version (25K): [in a new window] |
FIG. 2. Compatibility matrix for concatenated sequence data of six genes of S. uberis. White squares and black squares indicate compatibility and incompatibility of sequence polymorphisms, respectively. The number of parsimony-informative sites for each gene is shown in parentheses after the gene designation. The right panel shows results for housekeeping genes of 50 isolates, while the left panel shows results for the core set of 49 isolates obtained after exclusion of highly divergent isolate FSL Z1-015. The core set shows perfect within-gene compatibility for gapC, sodA, and tuf as indicated by the white triangles. Results for oppF and pauA did not differ between panels because FSL Z1-015 did not yield amplicons for those genes, and sequence data were thus not included in either reticulogram.
|
Amino acids encoded by nonsynonymous sites in pauA are shown in Table 7. Codon 211 was highly variable and contained synonymous as well as nonsynonymous mutations. It encodes Arg, Leu, or Ser in the three most common alleles of pauA, detected 15, 13, and 6 times, respectively, among the 49 S. uberis isolates, with each allele detected among isolates from the United States as well as Europe (Table 7). Residue 211 is located at the tip of an
-helix in the ß-domain of streptokinase (Fig. 3). Codon 240 encodes a residue located at the tip of a loop between ß5 and ß6 strands which connects two ß-folded sheets (49) (Fig. 3). In the three most common alleles of pauA, codon 240 codes for glycine, a small, hydrophilic amino acid that allows for flexibility in stereochemical conformation, or proline, a hydrophobic amino acid that limits structural flexibility (Table 7).
|
View this table: [in a new window] |
TABLE 7. Amino acids encoded by codons with nonsynonymous mutations in plasminogen activator gene pauA of S. uberis
|
![]() View larger version (41K): [in a new window] |
FIG. 3. ß-Domain of streptokinase from S. dysgalactiae subsp. equisimilis (49), showing a single -helix (tube-shaped arrow) and ß-folded sheets consisting of multiple strands (flat gray arrows). White arrows indicate the localization of the codons homologous to codons 211 and 240 in the S. uberis PauA sequence.
|
|
|
|---|
MLST provides for highly discriminatory and epidemiologically relevant subtyping of S. uberis. While selection of genes to be included in an MLST scheme is facilitated by the availability of partial and/or complete genome sequences for the organism of interest, no genome sequences for S. uberis were available at the initiation of our project. Target genes were thus selected based on availability of sequence data and/or PCR primer sequences. Housekeeping genes, virulence genes, and genes encoding vaccine targets were included in our MLST scheme to allow studies on population genetics as well as the evolution of virulence and vaccine target genes in S. uberis. The housekeeping genes cpn60 (chaperonin or heat shock protein gene), sodA (superoxide dismutase), and tuf (elongation factor Tu) were chosen because these genes are conserved across bacterial species, allowing us to use previously described degenerate primers for amplification of these genes (12, 24, 40). These primers can thus be used for DNA sequence-based bacterial speciation and for subtyping schemes, such as MLST. For example, primers for these genes have been successfully used in our laboratory for species identification and characterization of the mastitis pathogens S. agalactiae and S. dysgalactiae subsp. dysgalactiae and for enterococcal species isolated from milk and dairy farm environments. In addition to the three housekeeping genes described above, we also included two genes whose products have been the target of vaccine studies, i.e., gapC (glyceraldehyde-3-phosphate dehydrogenase), a housekeeping gene (9), and pauA (plasminogen activator A), a virulence gene (31), in our MLST scheme. Finally, oppF, encoding an oligopeptide permease homolog, was included because this gene is essential for the growth of S. uberis in milk (43). Due to its potential role in infection, this gene may also be classified as a putative virulence gene. Analysis of dN/dS ratios showed that all genes, except pauA, appeared under purifying selection, indicating that the five genes other than pauA provide appropriate information for probing the population genetics of S. uberis, although oppF cannot be amplified from all S. uberis isolates. While the relative location of the selected MLST genes on the S. uberis genome could not be mapped, the distinct similarity patterns for different genes suggest that the target genes represent sufficiently diverse locations to be appropriate for MLST.
Our results show that MLST provides for more discriminatory subtype differentiation than banding pattern-based methods (ribotyping and RAPD). While similar results have previously been obtained for other human and animal streptococcal pathogens (4, 6, 26), our data further show that development of a sensitive MLST-based subtyping scheme is possible even with limited a priori sequence information (i.e., without the availability of a genome sequence). Our data thus also indicate the feasibility of developing MLST-based subtyping methods for economically less important streptococcal pathogens, such as S. dysgalactiae subsp. dysgalactiae and S. parauberis, for which complete genome sequences may not be available in the near future. The fact that three of the primers used in the MLST scheme described here also allow for amplification of genes from these species further supports the feasibility of developing a core MLST scheme for all streptococcal mastitis pathogens, which could be supplemented with sequencing of species-specific virulence genes. In addition to the discriminatory capability of the MLST scheme described here, we also showed that MLST provides epidemiologically meaningful subtype information, e.g., by differentiating epidemiologically unrelated isolates from different herds, which share identical ribotypes or RAPD patterns. MLST will thus provide a better tool to detect farm-to-farm S. uberis transmission and pathogen reemergence on a given farm. Finally, as discussed in more detail below, MLST, unlike banding pattern-based subtyping methods, allows for meaningful phylogenetic and evolutionary analyses (5), which can provide important insight into S. uberis biology, population genetics, and transmission patterns, providing information that could help in the design of improved mastitis control strategies.
S. uberis has a recombinatorial population structure. Our data show that while within-gene recombination is limited, considerable reticulate evolution occurs between genes. Thus, a species tree or core phylogeny for S. uberis based on concatenated sequence data could not be constructed (44). Our results are consistent with previous reports, which showed that most streptococcal species generally show a highly recombinatorial population structure. For example, in S. pyogenes, group C streptococci (GCS) and group G streptococci (GGS) reticulate evolution between genes has been shown to play an important role in overall sequence diversity, while only a limited number of housekeeping genes show evidence of within-gene reticulate evolution (23). In some species, for example, in the highly pathogenic species S. pyogenes, clonal groups are preserved despite widespread occurrence of reticulate evolution. These clonal complexes may be niche specific, as shown for S. pyogenes (22) and S. suis (26). In other streptococcal species, such as GCS and GGS, clonal groups have not been detected. Interestingly, for streptococcal species (23) as well as for other pathogens, e.g., Mycobacterium spp. (10) and Listeria monocytogenes (35), increasing evidence is emerging that lineages or species with a commensal lifestyle show higher levels of recombination, possibly facilitating rapid adaptation to different environments, while host-adapted lineages or species often show highly clonal population structures. Results from our current analysis did not identify any clones of S. uberis that could be associated with increased pathogenic potential or targeted through specific mastitis control programs. Rather, a nonclonal population structure was detected, similar to results for commensal streptococcal species (GCS, GGS). This is in accordance with the concept that S. uberis is predominantly an opportunistic environmental pathogen and not a host- or organ-adapted pathogen with specific virulence characteristics. Future studies using a larger number of S. uberis isolates from a variety of sources, including from fecal samples, environmental sources, and animals with clinical S. uberis mastitis, will be required, though, to comprehensively probe the population genetics of S. uberis.
Phylogenetic analysis of MLST data reveals an S. uberis subtaxon genetically resembling S. parauberis. Advantages of MLST over banding pattern-based methods include the ability to use MLST data to infer phylogenetic relationships and thus reliably define bacterial species and species-like subgroups. Recently, Lan and Reeves (28) specifically suggested that MLST data provide a better approach for defining bacterial species than 16S rRNA analysis or DNA-DNA hybridization. In our study, MLST data for four housekeeping genes allowed us to specifically define S. uberis isolate FSL Z1-015 as a unique subtaxon which is only distantly related to the remaining S. uberis isolates. While ribotyping and RAPD fingerprinting assigned unique banding patterns to this isolate, they failed to reveal the extent of the differences between FSL Z1-015 and other S. uberis isolates, some of which also had unique banding patterns. Interestingly, isolate FSL Z1-015 also lacks the virulence gene pauA as well as oppF, encoding an oligopeptide permease that plays a role for bacterial growth in milk, two genes also absent in S. parauberis. Since S. parauberis cannot be differentiated from S. uberis based on cultural morphology or biochemical properties, it was originally considered a subtype of S. uberis, and only 16S rRNA sequence data allowed for separation of the species S. uberis and S. parauberis (54). Previously described species-specific 16S rRNA PCR assays (13) and rnpB and rpoB, two genes previously used to study the phylogeny of streptococcal species (3, 48), clearly suggest that FSL Z1-015 belongs to the species S. uberis, but sequencing data from housekeeping genes showed that FSL Z1-015 represents a subtaxon distinct from both S. uberis and S. parauberis. Isolate FSL Z1-015 originated from a milk sample in The Netherlands. Meanwhile, ongoing work in our laboratory has identified at least four additional isolates originating from environmental samples in the United States that also belong to this subtaxon, as determined by analysis of sequence data for cpn60, gapC, sodA, and tuf. Like FSL Z1-015, these isolates did not yield amplicons for oppF and pauA (R. N. Zadoks and M. Wiedmann, unpublished results).
The evolution of pauA differs from the evolution of housekeeping genes. Absence of a pauA amplicon was observed in 2 of 50 isolates in our study (4.0%) and in 4 of 130 S. uberis isolates (3.1%) in a study from Germany (25). Diagnostic methods such as multiplex PCR that target pauA for detection of S. uberis will thus fail to diagnose a small proportion of S. uberis-positive samples. In addition to lower prevalence, pauA displayed a lower GC content than the other five genes sequenced. These findings suggest that S. uberis may have acquired this virulence gene by horizontal transfer. PauA, or plasminogen activator A, is a streptokinase, and transfer of streptokinase genes between streptococcal species has been described before, specifically, between S. pyogenes in humans and GCS or GGS (22).
pauA also represented the only gene that showed strong positive selection, as evidenced by an average dN/dS above 1. Our observation of positive selection in pauA is noteworthy, because PauA has been studied as a potential vaccine target (8, 31) and epitopes that contain positively selected sites may be less suitable as vaccine targets than epitopes that consist exclusively of negatively selected amino acid sites (46). Although expression of PauA is not essential for infection of the mammary gland, as indicated by the isolation of pauA-negative isolates from mastitic cows and by experimental studies (50), PauA is still considered to play a critical role in the pathogenesis of mastitis (52). Some substitutions in pauA are inconsequential. For example, substitutions for alanine by glycine (residue 15) or valine by isoleucine (residue 23) are unlikely to have a functional impact because the amino acids substituting each other have very similar sizes and properties. In addition, they are located in a signal peptide that is not part of the functional domains of streptokinase (20). Other nonsynonymous substitutions on the other hand may have an effect on the conformation and/or functionality of the protein, such as the substitution for arginine by leucine or serine (residue 211), or substitution for glycine by proline (residue 240). The specified residues are located in the ß-domain of streptokinase and differ considerably in stereochemical properties, which could potentially affect protein folding and activity. Differences between S. uberis isolates in plasminogen activity have been reported (20) but not correlated to specific amino acid polymorphisms. A recent study on regions of PauA critical to recruitment and activation of plasminogen did not identify any of the codons listed in Table 7 as such (51). Thus, while positive selection clearly plays a role in the evolution of pauA, its impact on pathogenesis is as yet unclear.
In S. pyogenes both positive selection and horizontal transfer of streptokinase genes contribute to streptokinase gene diversity (22). Transfer of streptokinase genes is not just a historic event but can take place between S. pyogenes and S. dysgalactiae subsp. equisimilis when they occupy the same niche (22). The origin of streptokinase genes in S. uberis is unknown in terms of time or donor species. Observations for S. pyogenes and S. uberis, both of which show evidence for horizontal gene transfer and diversifying selection, suggest that there is potential for further evolution of S. uberis streptokinases in response to selective pressures such as vaccination. Vaccine candidates other than PauA are currently being investigated (32). For genes encoding novel vaccine targets, as for streptokinase genes, continued monitoring of gene diversity and evolution, particularly in vaccine studies or applications, may be advised.
Conclusion and outlook. We developed an MLST scheme for the bovine udder pathogen S. uberis based on four housekeeping genes, a putative virulence gene (oppF), and an established virulence gene (pauA). While previous reports on development of MLST-based subtyping methods have used various numbers of isolates from as few as 28 (57) to as many as 294 (26), we show that use of a well-defined smaller collection of isolates can allow for initial development and validation of an MLST scheme, particularly in the case of an organism like S. uberis, which was previously known to be highly diverse (29, 34, 53). MLST-based subtyping of S. uberis was superior to banding pattern-based methods in terms of discriminatory ability, concordance with epidemiological data, and quantitative information regarding relatedness of isolates. Inclusion of three housekeeping genes that can be amplified from a variety of streptococcal and enterococcal species provides a first step towards development of a "multispecies" mastitis pathogen MLST scheme. pauA-negative isolates, which go undetected by pauA-based diagnostic methods, constitute a small proportion of S. uberis isolates from milk and may represent a new subtaxon of S. uberis that is genetically closely related to S. parauberis. Reticulate evolution contributes to a limited extent to genetic variability within genes but plays a major role in overall sequence variability. The nonclonal structure of the S. uberis population is in accordance with the notion that S. uberis is an environmental opportunist rather than a host-adapted pathogen. Evolution of virulence genes, specifically pauA, differed from the evolution of housekeeping genes, and routine inclusion of housekeeping as well as (additional) virulence genes in MLST schemes for S. uberis should be considered. Horizontal gene transfer and positive selection may contribute to acquisition or evolution of new allelic types for pauA or alternative plasminogen activators and could affect the long-term efficacy of a vaccine based on such virulence genes. In the development and evaluation of S. uberis vaccines, creation of subunit vaccines targeting conserved epitopes and monitoring of target gene diversity and evolution should be considered.
This publication was developed with help of the Cornell University Center for Biotechnology, a New York State Center for Advanced Technology supported by New York State and industrial partners. The project was funded in part by the United States Department of Agriculture Cooperative State Research, Education, and Extension Service (USDA-CSREES) award 2004-35204-14220.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»