Previous Article | Next Article ![]()
Journal of Clinical Microbiology, August 2007, p. 2616-2625, Vol. 45, No. 8
0095-1137/07/$08.00+0 doi:10.1128/JCM.02343-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Departments of Pediatric Dentistry,1 Oral and Molecular Microbiology, Osaka University Graduate School of Dentistry, 1-8 Yamada-oka, Suita, Osaka 565-0871, Japan,2 Department of Pediatric and Preventive Dentistry, Institute of Dentistry, University of Helsinki, Helsinki, Finland,3 Department of Oral and Maxillofacial Diseases,4 Division of Clinical Microbiology, Helsinki University Central Hospital, Helsinki, Finland,5 Department of Life Science, Nihon University Advanced Research Institute for the Sciences and Humanities, Tokyo 102-0073, Japan,6 Division of Bacteriology, Department of Infectious Disease Control, International Research Center for Infectious Diseases, Institute of Medical Science, The University of Tokyo, Tokyo 108-8639, Japan,7 PRESTO, Japan Science and Technology Agency, Kawaguchi 332-0012, Japan8
Received 19 November 2006/ Returned for modification 9 February 2007/ Accepted 6 June 2007
|
|
|---|
|
|
|---|
In a previous study, we showed that some blood isolates of S. mutans that cannot be classified into the c, e, or f serotype have negligible amounts of glucose side chains, despite the presence of the rhamnose backbone in serotype-specific polysaccharides (21). We designated the novel serotype as serotype "k" and showed that serotype k strains are less susceptible to phagocytosis by human polymorphonuclear leukocytes. Most oral isolates are of serotype c (approximately 70% to 80%), followed by serotype e (approximately 20%) and serotype f (less than 5%) isolates. Serotype k was defined in 2004, and its distribution frequency in the oral cavity is estimated to be 2% to 5% (21, 22). S. mutans strains of serotype k were isolated from the blood of patients in the early 1990s. However, no serotype k strains were identified from 1,326 stock strains isolated between 1982 and 1990. These results suggest that serotype k strains have acquired or lost some genetic elements in recent years. However, no information is available about the evolution of this unique serotype.
Although serotyping has been widely used to differentiate S. mutans strains, this method has limited power to discriminate the genetic relationships of strains within the same serotype. Several genotypic methodologies have been used to subtype S. mutans, including multilocus enzyme electrophoresis, ribotyping, and random amplification of polymorphic DNA (8, 16, 28). Another discriminatory method for the subtyping of S. mutans is pulsed-field gel electrophoresis (19). However, these methods differ in their discriminatory powers and reproducibilities. In general, DNA-based typing approaches display better discriminatory power than do phenotypic approaches in investigating S. mutans infections. In this study, we designed a multilocus sequence typing (MLST) method, using housekeeping loci, to evaluate the evolution of the three classical serotypes and the newly identified serotype k from epidemiological samples.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. S. mutans strains used in this study
|
Housekeeping loci used for MLST. Eight housekeeping gene loci were used in this study. The following sequences were obtained from the genome sequence of S. mutans UA159 (NC_004350): tkt, which encodes transketolase (30); glnA, which encodes glutamine synthetase subunit 1a (30); gltA, which encodes glutamate synthase (23); glk, which encodes glucose kinase (23); aroE, which encodes shikimate 5-dehydrogenase (15); gyrA, which encodes DNA gyrase subunit A (7); murI, which encodes glutamate racemase (31); and lepC, which encodes signal peptidase I (39) (Table 1).
DNA isolation and sequencing. Chromosomal DNA was prepared from all isolates as described previously (5). Internal fragments of the MLST loci were PCR amplified (AmpliTaq Gold, 10 x PCR buffer, and 2 mM GeneAmp deoxynucleotide triphosphate [dNTP] mix; Applied Biosystems) with the following cycling parameters: an initial denaturation at 94°C for 5 min and then 30 cycles consisting of 94°C for 30 s, 55°C for 30 s, and 72°C for 30 s, with a final extension at 72°C for 7 min. The amplified fragments were visualized after agarose gel electrophoresis on 1.5% agarose in the presence of 1 µg/ml ethidium bromide. They were then extracted from the agarose gel and cloned into the pGEM-T Easy vector (Promega). These plasmids were sequenced on both strands using vector-specific primers and BigDye Terminator version 3.1 (Applied Biosystems) by use of an ABI Prism 3110 genetic analyzer. In-frame internal fragments of the genes were selected for use in the MLST scheme.
Confirmation of serotypes. Serotyping was performed with the immunodiffusion method using rabbit antisera specific for serotypes c, e, f, and k, and with PCR methods using serotype-specific sets of primers, as described previously (22, 33). To determine serotypes c, e, and f, PCR amplification (Takara ExTaq, 10 x buffer, and 2.5 mM dNTPs; Takara) was performed with the following cycling parameters: initial denaturation at 96°C for 2 min and then 25 PCR cycles of 15 s of denaturation at 96°C, 30 s of annealing at 61°C, and 1 min of extension at 72°C. To identify serotype k, PCR amplification (AmpliTaq Gold, 10 x PCR buffer, and 2 mM GeneAmp dNTP mix; Applied Biosystems) was performed with the following cycling parameters: an initial denaturation at 94°C for 5 min and then 30 cycles of 94°C for 30 s, 60°C for 30 s, and 72°C for 30 s, with a final extension at 72°C for 7 min. The amplified fragments were separated by agarose gel electrophoresis on 1.5% agarose.
Alleles and ST assignments. For each locus, distinct allele sequences were assigned arbitrary allele numbers, with no weighting given to the degree of sequence divergence among the alleles. For each isolate, the alleles at each of the eight loci, defining the allelic profile or sequence type (ST), were analyzed with the nonredundant databases (http://linux.mlst.net/nrdb/). The STs were assigned arbitrary numbers in the order of their description. STs were grouped into lineages or clonal complexes with the program eBURST version 3, written and developed by E. Feil (http://eburst.mlst.net/), and the START version 2 package of programs, developed by K. Jolley (http://pubmlst.org/software/analysis/start2/) (11). Two or more independent isolates that shared identical alleles at six or more loci were defined as the members of a lineage. Each lineage was called a "group," designated according to the ST identified as the putative ancestral type by use of eBURST.
Computational analyses. The degree of clonality within the data set was estimated by calculating the index of association (IA) (34) between all of the STs in the data set by use of LIAN, version 3.5, written by B. Haubold and R. R. Hudson (http://adenine.biz.fh-weihenstephan.de/cgi-bin/lian/lian.cgi.pl) (10). We determined the number of polymorphic nucleotide sites and calculated the dN/dS ratio (where dN values represent nonsynonymous base substitutions and dS values represent synonymous base substitutions). We also performed phylogenic analyses of both individual genes and concatenated 3351-bp sequences by use of the DNAML, SeqBoot, and drawtree programs in the PHYLIP software package, written by J. Felsenstein (version 3.66; http://evolution.genetics.washington.edu/phylip.html), with the unweighted pair group method using arithmetic means.
Analysis of cnm genes of S. mutans strains. The prevalence of the cnm gene was evaluated by Southern blot analysis. Total genomic DNA from each strain was digested with HindIII (New England Biolabs, Beverly, MA). The DNA fragments were separated by 0.7% agarose gel electrophoresis and transferred to a nylon membrane (Hybond-N; Amersham Pharmacia Biotech). A 1,617-bp cnm gene fragment containing the whole open reading frame was amplified by PCR from the genomic DNA of S. mutans NN2072 with the primers cnm-F (5'-ATGAAAAGAAAAGGTTTACGAAGAC-3') and cnm-R (5'-TCAGCTATGATATTTACGGTAAAC-3'), designed based on the sequence of strain Z1 (GenBank accession no. AB102689) (29). The gene fragment was then labeled using a digoxigenin High-Prime DNA labeling and detection starter kit (Roche) according to the manufacturer's instructions. The blotted membrane was prehybridized and then hybridized according to the protocol described by the manufacturer. After hybridization, the membrane was washed twice in 2x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate) containing 0.1% sodium dodecyl sulfate at room temperature and then washed twice in 0.5x SSC containing 0.1% sodium dodecyl sulfate for 15 min at 65°C. The washed membrane was visualized with the immunological method described by the manufacturer (Roche).
|
|
|---|
|
View this table: [in a new window] |
TABLE 2. Primer sequences and characteristics of housekeeping gene loci included in the S. mutans MLST scheme
|
![]() View larger version (12K): [in a new window] |
FIG. 1. Dendrogram of 102 isolates of S. mutans based on an MLST scheme with cluster analysis by the unweighted pair group method using arithmetic means. Clonal complex groups, STs, serotypes, and distribution of the collagen-binding protein gene (cnm) were determined as described in Materials and Methods.
|
Evidence of recombination. The extent of recombination within the S. mutans population was assessed by determining the standardized IA (34) with LIAN. The IA for the complete data set was 0.0931 when randomized data sets (1,000 trials) were used. This value is significantly greater than zero, which is the value expected for a population in linkage equilibrium. However, in populations in which recombination is sufficient to randomize the alleles at different loci over long time periods, the appearance of multiple isolates with similar genotypes can result from the recent expansion of clones. Therefore, the SplitsTree program was used to detect recombination among the various STs. Allelic profile data were converted into distance matrix values by use of START, and the resulting nexus file was analyzed with the split decomposition method. SplitsTree analysis of the 102 S. mutans strains yielded a very low fit value (fit = 22.2), which may have resulted from the program's inability to analyze this large amount of information (14). We also analyzed several small subsets of randomly selected strains, which considerably improved the fit values. One example of a SplitsTree analysis displaying 10 parallelograms and a high fit value of ca. 55 was based on 18 randomly selected S. mutans isolates (data not shown).
Relationships between STs and serotypes. We then determined the serotypes of all the isolates by immunodiffusion and PCR-based typing (Table 3) . Serotype c was dominant (43 isolates), followed by e (31 isolates), f (16 isolates), and k (12 isolates). In general, the serotypes appeared to be associated with the overall genotype defined by the STs, and only ST2 contained two serotypes (c and f). As stated above, closely related STs constituted clonal complexes. However, groups 1, 2, 3, and 4 contained isolates of more than one serotype, whereas the other groups contained only one serotype. Serotype c, the major serotype of S. mutans clinical isolates, was widely distributed among the clonal complex groups, such as in groups 1, 2, 3, 4, 6, 7, 8, 10, and 12, and in many singletons, indicating that serotype c is the ancestral serotype of S. mutans. Group 2 contained 15 of the 31 serotype e isolates (ST3, 5, 6, 22, 42, 71, 72, 73, 74, 75, 77, 83, and 84), and groups 3 and 4 contained three and six serotype e isolates, respectively. Serotype f isolates were found mainly in three groups (groups 5, 9, and 11), and approximately half the serotype f isolates occurred in these groups. Another three STs (ST48, 57, and 91) were not categorized within the same clonal complex but had closely related sequence homologies (Fig. 1). Only three STs (ST2, 21, and 85) that were serotype f were located separately from the other serotype f groups. Serotype k, newly identified in the last decade, was distributed among closely related STs (ST16, 36, 38, 66, 67, 68, and 79). These results indicate that the ancestral strain of S. mutans was serotype c and that the other serotypes have branched from the serotype c groups continuously during evolution up until the present.
|
View this table: [in a new window] |
TABLE 3. Allelic profiles of isolates belonging to the 92 STs identified in this study
|
We next determined the relationship between serotype and systemic infection, because S. mutans frequently causes bacteremia and infective endocarditis. However, the blood-derived isolates were widely distributed between the various STs and clonal complex groups (Fig. 1), and no significant relationship between STs and the blood-derived isolates was observed. Therefore, we conclude that the S. mutans blood isolates are not a specific strain and that each isolate has the potential to induce systemic infections such as bacteremia or infective endocarditis.
Strain-specific collagen-binding adhesin (encoded by cnm) is a recently identified wall-anchored protein. However, its prevalence among S. mutans isolates has not been determined. To clarify the prevalence of the cnm gene, we analyzed the distribution of cnm-positive isolates by PCR and Southern blot analysis. Of the 102 strains examined, 22 were cnm positive (21.6%; see also data in Fig. 1). The cnm-positive isolates were distributed in ST85; in ST45 and ST46 (group 11); in ST40; in ST14, 15, and 86 (group 5); in ST55; in ST34, 37, and 35 (group 9); in ST69; in ST26 and 27 (group 7); in ST57; in ST48; in ST80; in ST51; in ST88; in ST87; and in ST79 (Fig. 1). The predominant cnm-positive isolates were of serotype f (13 cnm positive of 18 total [81.3%]), followed by serotype k (5 cnm-positive of 12 total [41.7%]). Three isolates of serotype c (3 [ST26, 27, and 51] of 43 total [7.0%]) and 1 isolate of serotype e (1 [ST40] of 31 total [3.2%]) had the cnm gene. The prevalence of cnm gene in each serotype is statistically significant (between serotype c and serotype f, a P value of <0.0001; between serotypes c and k, a P value of 0.087; between serotypes e and f, a P value of <0.0001; between serotypes e and k, a P value of 0.0087; and between serotypes e and k, a P value of 0.0042 [all by Fisher's exact probability test]). These isolates were closely related to the cnm-positive serotype f and k strains on phylogenetic analysis. Interestingly, the distribution of cnm-positive strains was limited to closely related groups. These observations indicate that some clinical isolates have acquired the cnm gene, probably by horizontal gene transfer, during evolution.
Streptococcus mutans transmission from mother to child. In the six mother-child pairs of S. mutans isolates (Table 4), the STs of two pairs (LJ11 and 12 and LJ30 and 31) were completely identical, indicating that these S. mutans isolates were vertically transmitted from mother to child. Of the other four pairs, the LJ3-LJ4 and LJ16-LJ17 pairs had three and four matches, respectively, in their ST profiles. However, on the MLST-based dendrogram and sequence-based phylogenetic tree, these two pairs were grouped onto different branches. The LJ24-LJ25 and LJ26-LJ27 pairs had only one allele match, and the LJ24-LJ25 members showed different serotypes. Although only a limited number of mother-child pairs were analyzed in this study, the MLST method allowed the strict discrimination of the strains from both individuals in each mother-child pair.
|
View this table: [in a new window] |
TABLE 4. Mother-child transmission of S. mutans by MLST analysisa
|
|
|
|---|
Seven genes have usually been used in recent MLST schemes, because it is important that the selection of the genes to be sequenced in an MLST scheme developed for a particular bacterial species should be optimized to save time and cost (40). However, the discrimination of each serotype, especially serotypes f and k, was not definitive in our MLST when seven genes were used in the analysis of S. mutans. In addition, the combined analysis of housekeeping genes and some virulence genes has been also used to identify virulent or pandemic clones (35, 40). In this study, we used eight housekeeping loci because the virulence genes of S. mutans are not as genetically diverse as those of other organisms. For example, glucosyltransferases synthesize water-soluble and -insoluble glucans from a sucrose substrate, and these activities are important in the colonization and biofilm formation of S. mutans (27). Chia et al. (2) reported the existence of DNA polymorphisms in the 5' regions of the gtfB and gtfC genes. However, only four or five variants have been observed in this region (2). This observation has been confirmed by use of restriction fragment length polymorphisms of the gtfB gene (36). Therefore, these putative virulence genes of S. mutans are not suitable for the analysis of the evolution or population biology of the species because their heterogeneity is low. The mean number of alleles identified per locus was 18.5, providing the theoretical potential to distinguish more than 1.2 x 1010 different genotypes. In fact, as shown in Fig. 1 and Table 3, our MLST method divided the 102 strains into 92 STs, indicating that this MLST scheme has high discriminatory power.
The cell wall antigens of S. mutans are rhamnose-glucose polysaccharides, which are composed of
1,2- and
1,3-linked rhamnan backbones and glucose side chains. Bacterial cell wall polysaccharides decorate the cell surface and often play an important role in the colonization of the bacterial ecological niche. Each serotype-specific polysaccharide has a unique linkage to its glucose side chains. Serotype c has an
1,2 linkage, serotype e has a ß1,2 linkage, and serotype f has an
1,3 linkage (17). Serotype k strains have a unique "untypeable" phenotype in terms of known serotype antibodies because they lack glucose side chains linked to their rhamnose backbones (21). The genes related to serotype specificity have been determined, and each serotype has a specific and heterogenic region downstream from the conserved rgpA-rgpF operon (33). On the basis of these specific regions, Shibata et al. suggested that none of the three serotypes could be defined as the ancestral strain (33). However, in our MLST analysis, most of the serotype e and serotype f strains occur on their own branches on the dendrogram. Most serotype e strains are included in clonal complex group 2, and the root of this branch derives from serotype c (Fig. 1). The ST85 isolate in this group is serotype f, but the restriction fragment length pattern of the rgpA-rgpF region in the ST85 isolate seems to be that of serotype c (data not shown). Why these strains appear phenotypically serotype f rather than serotype c is now under investigation by complete nucleotide sequence analysis. The distribution of serotype f isolates is also restricted to limited branches derived from serotype c. These observations suggest that a serotype c strain, the dominant serotype among S. mutans isolates, is the ancestral phenotype of this organism and that serotype e and f strains acquired their strain-specific genes during evolution. It is still unclear whether these phenotypic changes are due to genetic transfer or genetic exchange. Therefore, these issues require further analysis.
Serotype k, recently identified as a new serotype of S. mutans, also has an interesting distribution in our MLST analysis. All genes for the biosynthesis of serotype-specific polysaccharides are highly conserved relative to those of serotype c, and only the mRNA expression of rgpE is diminished in serotype k strains (25). Therefore, we hypothesized that serotype k might be derived from serotype c. As we expected, many serotype k isolates were derived from the serotype c branch (Fig. 1). However, some serotype k strains seem to have been derived from serotype f (ST55 and ST80). Polysaccharide synthesis is a sequential reaction involving several gene products. Therefore, a defect in a gene or a loss of function of a gene product related to the biosynthesis of polysaccharides results in the formation of a serotype k strain. In fact, a gluA-destructive mutant of a serotype c strain has the "serotype k" phenotype (21). Therefore, it is reasonable to infer that S. mutans strains that are phenotypically "serotype k" have arisen from the genetic dysfunction of serotype c or f strains in this era. It is possible that all S. mutans strains may have the potential to become serotype k in the future.
In this study, we compared the distribution and lineage differences of Japanese isolates with those of Finnish isolates because Japan is geographically and ethnically distinct from Finland. However, we found no regional bias in the distribution of the S. mutans isolates (Fig. 1). Several lines of epidemiological evidence indicate that serotype c is predominant worldwide (70% to 80%), with serotype e the next commonest (10% to 15%) and serotype f occurring rarely (1% to 5%) (9). On the basis of these findings, we speculate that the serological differentiation of S. mutans occurred earlier than we had previously inferred. To address this question further, we are continuing to analyze worldwide serotype distributions using our MLST scheme.
Oral bacteria are considered to cause transient bacteremia, countered by professional dental treatment and daily oral care practices such as tooth brushing and flossing (32). However, the incidence of infective endocarditis estimated by a review of reports published between 1993 and 2003 was only 3.6 per 100,000 head of population per year (20). These observations suggest that some infective-endocarditis-specific or virulent strains exist and that these strains necessarily induce systemic infections. However, the specific distribution of S. mutans strains related to systemic infections was not determined in our MLST analysis.
In contrast, the distribution of the collagen-binding protein gene (cnm) was clearly predominant in the clonal complex groups of serotype f strains (Fig. 1). The gene product of cnm was first identified in the cold-agglutination phenotype of S. mutans, and thereafter the binding of collagen-binding adhesin to collagen and laminin was reported (29). The binding of collagen-binding adhesin to specific extracellular matrix molecules may be important for the initial bacterial attachment to blood vessels. However, the distribution of cnm-positive isolates does not exactly match that of isolates derived from bacteremia or infective endocarditis patients, so the contribution of collagen-binding adhesin to the progression of systemic infections is not clearly explained.
The vertical transmission of S. mutans from mother to child is thought to be the major route for early acquisition (16). In contrast, the detection of genotypes in children that are not found in their mothers or other family members indicates that S. mutans may also be acquired from other sources. In our MLST analysis, two of six mother-child pairs showed complete identity of ST profiles for the mother and the child, whereas the other pairs showed differing ST profiles (Table 4). This result indicates that our MLST scheme is useful for epidemiological studies and is suitable for the long-term monitoring of S. mutans transmission. Mattos-Graner et al. reported at least two to five different genotypes in 30% of the children tested (18). In this study, we used only one isolate each from the mother and the child. Further prospective studies involving greater numbers of S. mutans isolates are required to explore the frequencies of vertical and horizontal transmission.
In conclusion, we have devised the first clear typing system for S. mutans. In this study, we have provided new insight into the lineages of the four serotypes, their regional specificities, and the distribution of a newly identified virulence gene and have demonstrated the applicability of the scheme to epidemiological studies. The superior discriminatory capacity of MLST, compared with that of classical serotyping or DNA-based genotyping methods, may have important practical implications. We hope that this MLST scheme will now be expanded to include isolates from other countries.
Published ahead of print on 13 June 2007. ![]()
|
|
|---|
-proteobacteria inferred from nucleotide sequence comparisons of the house-keeping genes adk, aroE and gdh: comparisons with phylogeny inferred from 16S rRNA gene sequences. J. Gen. Appl. Microbiol. 52:147-158.[CrossRef][Medline]This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»