Department of Epidemiology and Preventive Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201,1 University of Louvain, Faculty of Medicine, Microbiology Unit, B-1200 Brussels, Belgium2
Received 8 January 2005/ Returned for modification 9 February 2005/ Accepted 24 February 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Initial studies examining the relatedness among Yersinia species used serotyping and other phenotypic characteristics, such as biochemical properties (biotyping), susceptibility to antibiotics (13), and phage typing (7, 44). Brenner et al. (11) subsequently introduced DNA hybridization techniques to classify Y. enterocolitica-like species and to determine the genetic relatedness among them. However, the methodology was limited because it did not provide information required to determine evolutionary relationships among yersiniae, and it was prone to yield potentially misleading results influenced by the level of gene acquisition and loss (39). Therefore, several molecular typing methodologies, such as plasmid profile analysis (22, 50), restriction fragment length polymorphism of chromosomal DNA (9), ribotyping (40, 42), sequence analysis of the 16S RNA gene (31), pulsed-field gel electrophoresis (PFGE) (43), and variable-number tandem repeat analysis (2) were subsequently applied for typing of yersiniae during epidemiological investigations and for determining genetic relatedness among Yersinia strains. Most of these approaches have good or excellent discriminatory power (e.g., PFGE was proposed as the "gold standard" for typing of Yersinia strains [34]) and are well suited for short-term epidemiological studies. However, they are less suited for long-term epidemiological studies and for determining evolutionary traits of, and phylogenetic relationships among, various strains or species (18). A relatively recently developed approach called multilocus enzyme electrophoresis (MLEE) addressed some of the shortcomings of the above-mentioned methodologies by combining excellent discriminatory power with relative clonal stability (49). Several studies (12, 17, 25, 26) have utilized MLEE to characterize yersiniae. However, problems with band resolution, and the ability of environmental conditions to alter phenotypic expression of the enzyme under study, can adversely affect the reproducibility of MLEE typing and complicate data analysis.
Nucleotide sequence-based approaches alleviate all of the above-mentioned drawbacks of MLEE. Indeed, sequencing of the entire genome of a bacterium, with subsequent in silico analysis of the data, constitutes the ultimate approach for delineating the genetic relatedness of various bacteria, and this approach is likely to become a common strategy during future studies of bacterial evolution and virulence traits (59). However, sequencing an entire bacterial genome is a major undertaking, and therefore it is usually limited to very few strains per species. Sequencing a single geneusually the 16S RNA gene, which has been used for the molecular analysis of many bacterial pathogens, including Yersinia (31, 32)also is not optimal. For example, interspecies recombinations in 16S RNA genes (initially thought to be very rare) have recently been inferred (52) to occur in at least some bacterial species, which underscores the importance of not focusing on a single gene during studies to determine the phylogeny/taxonomy of a bacterial species (39). In this context, using a technique that allows sequence-based analysis of multiple loci simultaneously will provide an ideal balance between sequence-based resolution and technical feasibility. Such an approach, called multilocus sequence typing (MLST), was developed by Maiden et al. (41) to characterize meningococci, and it has rapidly been applied to the study of many other bacterial species as well, including selected species of Yersinia (1). However, MLST data pertaining to genetic relationship, phylogenetic traits, and clustering of all known species within the genus Yersinia are not currently available, and the present study is the first attempt to use MLST to examine the intra- and interspecies relationships among all of the currently described yersiniae.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
|
Nucleotide sequence accession numbers. The DNA sequences of the 16S RNA, recA, Y-HSP60, gyrB, and glnA gene loci have been deposited in GenBank under accession numbers AY332802 though AY333069.
| RESULTS |
|---|
|
|
|---|
|
Some of the Yersinia strains had identical 16S RNA sequences, e.g., Y. aldovae CNY 7112 and 35236 and Y. kristensenii WE 414/93, Y. intermedia WS 58/02 and Y. kristensenii 490, and Y. pestis KIM and Y. pseudotuberculosis CDC801-84 (Fig. 1). Eight strains did not cluster with their conspecifics: Y. frederiksenii WE 83/02, Y. intermedia WA 11/94, Y. kristensenii WA 948, Y. mollaretii WA 309 and WS 43/94, Y. bercovieri ATCC 43970, and Y. kristensenii WS 052 and ATCC 33638.
Y-HSP60, glnA, recA, and gyrB sequence variations in Yersinia. All of the four loci were polymorphic: glnA, gyrB, recA, and Y-HSP60 had 109, 105, 92, and 74 polymorphic sites, respectively (Table 2). The calculated dN/dS ratios for those housekeeping genes ranged from 0.0052 for the recA locus to 0.0378 for the gyrB locus. The G+C contents of the loci ranged from 46.81% for the gyrB locus to 57.84% for the 16S RNA locus (Table 2).
Two different allelic designations were assigned to each sequence, one based on the nucleotide sequence (NT) and the other on the translated amino acid sequence (AT). Thirty-six, 35, 37, and 32 NTs were identified for the glnA, gyrB, recA, and Y-HSP60 genes, respectively (Table 1). With the exception of Y. pestis and Y. pseudotuberculosis, identical NTs were observed only in strains within the same species, and strains possessing identical NTs for all of the loci were very rare (e.g., Y. bercovieri WA 17/96 and WS 19/96 had identical alleles for all four loci). One exception, where the strains of two different species had the same NTs, was Y. pestis KIM and the two Y. pseudotuberculosis isolates: these three strains shared identical ATs for each housekeeping gene locus examined. Translation of the nucleotide sequences into amino acid sequences resulted in fewer alleles (ATs) than NTs, which suggests that most of the nucleotide substitutions were silent (Table 1). For example, the 37 NTs for the recA locus translated into only 4 ATs. Overall, there were 9, 15, 4, and 14 ATs for glnA, gyrB, recA, and Y-HSP60, respectively.
Some isolates of several Yersinia species shared identical ATs at one or more of the four housekeeping loci. As noted above, isolates of Y. pestis and Y. pseudotuberculosis shared ATs at all four loci. For glnA, Y. aldovae and Y. intermedia shared AT2, Y. enterocolitica and Y. kristensenii shared AT3, and Y. frederiksenii and Y. bercovieri shared AT3. For gyrB, Y. bercovieri and Y. mollaretii shared AT3, and Y. enterocolitica, Y. pseudotuberculosis, and Y. pestis shared AT8. For Y-HSP60, Y. pseudotuberculosis, Y. pestis, and Y. enterocolitica shared AT2, Y. intermedia and Y. frederiksenii shared AT4, Y. kristensenii and Y. frederiksenii shared AT1, and one strain each of Y. kristensenii and Y. mollaretii shared AT7, which was commonly found in the Y. bercovieri strains. The smallest number of ATs was observed for recA. The number of recA-based ATs identified was less than the number of currently known species of Yersinia; therefore, not surprisingly, the recA-based ATs were commonly shared across the species. For example, 28 isolates representing six Yersinia species possessed recA AT2, and 12 strains of Y. frederiksenii and Y. mollaretii had recA AT3. In contrast, with the exception of one possible match with Y. aldovae CNY 7112, Y. ruckeri's recA AT4 was not found in any other species. Moreover, the data obtained during our analysis of the other three housekeeping genes of Y. ruckeri indicated that their ATs also were unique to Y. ruckeri, i.e., they were not shared with any strains of the other Yersinia species.
Genetic relationships among yersiniae. Inter- and intraspecies relationships among various yersiniae were examined by analyzing a 16S RNA-based maximum-likelihood tree (Fig. 1) and an unrooted phylogram (Fig. 2) constructed from the 1,407-bp concatenated sequences of all four housekeeping MLST loci. The 16S RNA sequences were not included in the concatenated locus, because multiple copies of the gene exist in various Yersinia strains (e.g., 6 and 7 copies in Y. pestis strains CO92 and KIM, respectively). The availability of the two trees enabled us to compare the genetic relatedness data obtained by 16S RNA sequencing with those obtained by MLST analysis of a concatenated tree containing four housekeeping gene loci.
|
The genetic distances between species ranged from 0.062 to 0.134, with the exception of Y. pestis and Y. pseudotuberculosis, for which it was <0.01. Other species that were closely related were Y. aldovae and Y. intermedia (pairwise genetic distances between isolates ranged from 0.066 to 0.068) and Y. bercovieri and Y. mollaretii (pairwise genetic distances ranged from 0.062 to 0.066). The Y. ruckeri isolates exhibited the largest genetic distances (0.116 to 0.134) from the other Yersinia species.
Comparison of the 16S RNA and MLST data. The data obtained by 16S RNA analysis (Fig. 1) and MLST (Fig. 2) of 39 Yersinia strains placed 35 of the isolates in their nominal species-specific group. The four discordant strains were Y. kristensenii WS 052, Y. kristensenii WA 948, Y. frederiksenii WA 935, and Y. bercovieri ATCC 43970. Y. kristensenii WS 052 appeared to be more closely related to the Y. enterocolitica group (based on the 16S RNA analysis [Fig. 1]) and to the Y. enterocolitica, Y. frederiksenii, and Y. rohdei groups (based on the MLST analysis [Fig. 2]) than to other Y. kristensenii strains. In addition, according to the 16S RNA analysis (Fig. 1), Y. kristensenii WA 948 discordantly clustered with Y. intermedia WA 11/94another "outsider" strain. The former strain also was observed, by MLST, to be discordantly but not tightly clustered with Y. bercovieri (Fig. 2). Based on the 16S RNA analysis (Fig. 1), Y. frederiksenii WA 935 was genetically distinct and not closely related to other Y. frederiksenii strains; however, it clustered with two other Y. frederiksenii strains that formed one of the two Y. frederiksenii clusters on the concatenated tree (Fig. 2). Similarly, based on the 16S RNA analysis (Fig. 1), Y. bercovieri ATCC 43970 did not cluster with the other Y. bercovieri isolates; however, it clustered tightly with four other Y. bercovieri strains on the MLST tree (Fig. 2).
Evidence for O-antigen switching and genetic recombination in Yersinia. Several strains with identical or closely related genetic backgrounds had distinct O antigens; e.g., although Y. intermedia WS 11/01 and WE 24/02 were very closely related (Fig. 1 and 2), they had distinct O antigens (O:48 and O:14, respectively) (Table 1). Also, in a few cases, O antigens were shared by strains of species that were not closely related; e.g., Y. bercovieri W601 shared the O:8 antigen with the genetically distant Y. enterocolitica 8081 and YE 37 (Table 1; Fig. 1 and 2).
The Splitstree analysis revealed parallelograms, for each individual locus and for the concatenated sequences, among strains within and among various Yersinia species. Parallelograms in the splitsgraph were accepted as evidence for parallel changes only if (i) the fit values were above 95% (indicating that most of the variation was accounted for in the analysis) and (ii) the bootstrap values on adjacent sides were greater than 90%. The criteria were never met for data sets containing the sequences for all the strains, presumably because the data from the genetically diverse species were too complex. However, the criteria were routinely met when small subsets of the sequences (i.e., simpler data sets) were analyzed. An example of such an analysis for a subset of Y. enterocolitica, Y. frederiksenii, Y. intermedia, Y. kristensenii, and Y. mollaretii strains is presented in Fig. 3. As shown in Fig. 3, the parallelograms often had adjacent sides with very high bootstrap values (e.g., all bootstrap values for the gyrB and recA loci had a maximum possible value of 100), and the fit values for all loci were at their maximum possible value of 100.
|
| DISCUSSION |
|---|
|
|
|---|
In addition to Y. pestis and Y. pseudotuberculosis, five Yersinia species (Y. aldovae, Y. bercovieri, Y. intermedia, Y. rohdei, and Y. ruckeri) were genetically fairly homogeneous. For example, based on the sequence analysis of the 16S RNA gene loci, all four Y. ruckeri strains and both of the Y. aldovae strains we examined were identical for each species, even though some of the strains were isolated in different years (e.g., Y. ruckeri WS 34/93 and WS 10/95 were isolated in Belgium in 1993 and 1995, respectively) or had different serotypes (e.g., the serotypes of Y. aldovae 35236 and CNY 7112 were O:17 and O:21, respectively). The Y. intermedia and Y. rohdei strains were also genetically homogeneous. The distances between conspecific isolates were <0.008, and some strains isolated ca. 6 years apart on different continents (e.g., Y. rohdei WA 9/90 and WA 14/96, which had the same serotype, O:76, were isolated in Germany in 1990 and in New Zealand in 1996, respectively) were identical based on their 16S RNA sequences. The two identical strains of Y. bercovieri, WA 17/96 and WS 19/96, were isolated from humans in 1996. They had identical alleles for all five loci (Table 1), and they shared the O:58,73 antigen; however, they were isolated from geographically distinct locations (New Zealand and Belgium, respectively).
The genetic homogeneity of Y. aldovae, Y. intermedia, Y. rohdei, and Y. ruckeri has been suggested by other investigators (22, 48), and our observations are in agreement with those earlier reports. For example, our 16S RNA and MLST data regarding the genetic homogeneity of Y. ruckeri are in agreement with a previous report (22) in which Y. ruckeri strains isolated from a wide variety of sources were shown to be clonal, based on their plasmid profile and ribotyping pattern analysis, and one plasmid type and ribotype predominated. In addition, MLEE analysis (48) of Y. ruckeri strains indicated that they were primarily of one electrophoretic type. In contrast, Y. enterocolitica, Y. frederiksenii, Y. kristensenii, and Y. mollaretii were genetically more heterogeneous, which is also in agreement with previous reports (12, 17, 57).
Comparison of 16S RNA analysis and MLST data regarding genetic relatedness among Yersinia species. In most, but not all, instances, the MLST and 16S RNA data correlated well with each other, and they agreed with the species' designations determined by biochemical tests. The MLST and 16S RNA data also correlated well in a few cases (e.g., Y. kristensenii WS 052 and WA 948) where the results of both analyses did not agree with the biochemical designation of the species; i.e., according to the 16S RNA and MLST analyses, the two strains mentioned above were clear outsiders in relation to other strains of the same species. Therefore, as discussed below, their taxonomic designations should be reexamined.
In instances where the 16S RNA and MLST data did not correlate with each other, MLST was more consistent with the species' biochemical designations than was 16S RNA analysis. For example, with the exception of one isolate (Y. bercovieri strain ATCC 43970), all of the Y. bercovieri strains were either identical or very closely related, based on their 16S RNA sequence analysis (Fig. 1). The ATCC 43970 strain (also known as CNY 7506) was a clear outsider, and it appeared to be more closely related to other Yersinia species (e.g., Y. frederiksenii) than to the other Y. bercovieri strains we analyzed. Similarly, according to the 16S RNA data (Fig. 1), Y. kristensenii ATCC 33638 (also known as CIP 80-30) did not cluster with the other Y. kristensenii strains. Both ATCC strains (Y. bercovieri ATCC 43970 and Y. kristensenii ATCC 33638) are the type isolates of their respective species; therefore, our observation that they were genetically distinct (according to the 16S RNA analysis) from other strains of their respective species was puzzling. However, when five of the nine Y. bercovieri strains (including ATCC 43970) and seven of the eight Y. kristensenii strains (including ATCC 33638) were analyzed by MLST using concatenated trees based on Y-HSP60, glnA, gyrB, and recA loci, both type strains correctly clustered with isolates of their respective species (Fig. 2). Another example of MLST being better than 16S RNA sequencing for determining phylogenetic identity is Y. frederiksenii WA 935. According to 16S RNA analysis, that strain was genetically distinct from other Y. frederiksenii strains (Fig. 1), but it correctly clustered with other Y. frederiksenii strains on the concatenated MLST tree (Fig. 2).
16S RNA sequencing has been and still is commonly used to identify various species, and its ability to yield reproducible and accurate information is commonly accepted. However, an increasing body of data suggests that 16S RNA does not always ideally identify/resolve strains on a species level. For example, a recent study (15) of Y. frederiksenii reported that gyrB was a better phylogenetic marker than that species' 16S RNA sequence. Also, as noted above, interspecies recombinations in 16S RNA genes, which initially were thought to be rare, have recently been suggested (52) to occur in at least some bacterial species, and such genetic exchanges may affect the determination of the phylogeny/taxonomy of bacterial species in which the changes have occurred (39). Our data support these observations, and they suggest that (i) although 16S RNA sequences are reasonably good phylogenetic markers for differentiating Yersinia species, genetic relatedness data obtained at least by the partial sequencing of this locus may not clearly distinguish between some members of this genus, and (ii) MLST is better suited for studying genetic relatedness among yersiniae than is 16S RNA-based analysis. On a more general level, our data support the idea that MLST can be an invaluable tool during continued refinement of the phylogenetic relationships among enteric bacteria (3).
16S RNA analysis, MLST, and Yersinia taxonomy. In the genus Yersinia (as in other bacteria), species have been identified mainly according to certain biochemical tests. The introduction of DNA-based approaches has revolutionized bacterial taxonomy and has helped assign species designations to numerous bacteria, including yersiniae (11). 16S RNA analysis has been particularly useful in that regard, and it has become a molecular standard for identification of many bacterial species, including uncultivable bacteria (29). However, the question about the minimal percent difference in 16S RNA sequences needed to differentiate bacterial species has not yet been resolved, and the issue of whether a 2% or larger difference is required to assign strains to different species is still a matter of debate (6, 14). The results of our present analysis of various Yersinia species suggest that MLST can be useful in that regard and that MLST is better than 16S RNA analysis for determining the taxonomic designations of the yersiniae.
The MLST data obtained during our study were based on four housekeeping gene loci that are generally considered to be conserved; therefore, a selection of conserved loci would be expected to represent the minimum differences between homologous loci in two genomes. Relatively few strains of the various Yersinia species were analyzed during our study, and therefore it is difficult to state with confidence whether there are "rules of thumb" for identifying the genetic distances between isolates of different Yersinia species. However, our data provide some preliminary insight into this issue. For example, based on our MLST data for seven Yersinia species (Y. bercovieri, Y. enterocolitica, Y. intermedia, Y. mollaretii, Y. pseudotuberculosis, Y. rohdei, and Y. ruckeri), the intraspecies genetic distances were <0.02, while the interspecies distances were >0.06. From these data, we observed a bimodal distribution in which intraspecies distances were in the first mode and interspecies distances were in the second mode. Thus, there is the potential to separate species using a bimodal distribution and a suitable statistic such as Giacomelli's dip statistic (23).
Several strains could not be distinguished on a species level by the partial sequencing of the 16S RNA gene but were identified by MLST, which suggests that MLST, but not 16S RNA analysis, has sufficient resolving ability to unambiguously differentiate the isolates of the closely related Yersinia species. Additional studies are required to fine-tune and validate the cutoff values for interspecies and intraspecies genetic distances, and their results may have a profound impact on clarifying the taxonomic classification of various bacterial species (53). Indeed, MLST already has been proposed (3) as a useful tool for refining the taxonomy and phylogenetic relationships among enteric bacteria.
We observed four exceptions to the above-mentioned bimodal distribution of the interspecies and intraspecies genetic distances between Yersinia isolates. The first two exceptions are Y. pestis and Y. pseudotuberculosis strains, which had genetic distances similar to those of intraspecies genetic distances. In addition, the same strains also (i) had the same 16S RNA allele (NT25), (ii) had identical ATs for all four housekeeping genes we analyzed (Table 1), and (iii) clustered tightly, as determined by 16S RNA sequencing (Fig. 1) and MLST (Fig. 2). The genomes of Y. pestis and Y. pseudotuberculosis are known to be very similar, based on the results of (i) DNA hybridization studies (8), (ii) 16S RNA analyses (56), and (iii) MLST studies (1), which has led to the hypothesis that the two species are actually two subgroups of the same species. Our results are in agreement with those previous reports, and they support the idea that Y. pestis and Y. pseudotuberculosis are two lineages within the same species rather than two distinct species. The third and fourth exceptions are Y. kristensenii and Y. frederiksenii, respectively. Some strains of those species (e.g., Y. kristensenii WA 948 and WS 052, and Y. frederiksenii WA 935) were separated from other strains of their respective species by genetic distances larger than those observed for other strains within the same species (i.e., the genetic distances were >0.06). Furthermore, on the maximum-likelihood trees, the same strains were not closely related to other strains of their respective species, based on the 16S RNA (Fig. 1) and concatenated MLST (Fig. 2) analyses. Y. frederiksenii has been shown (15-17, 33), by the results of comparative 16S RNA and gyrB sequence analyses, DNA hybridization studies, and MLEE analysis, to include strains that may represent a new subline within the genus Yersinia which is most closely related to Y. frederiksenii hybridization group 2 (unnamed genomospecies 2) and is biochemically indistinguishable from typical Y. frederiksenii (Y. frederiksenii genomospecies 1). Based on our 16S RNA sequencing and MLST data (Fig. 1 and 2), the Y. frederiksenii strains in our collection grouped in distinct clusters, which provides additional support for the idea that the species contains multiple subgroups, some of which may represent unknown species in Yersinia. Also, our MLST analyses (Fig. 2) identified at least one genetically atypical Y. frederiksenii isolate (WA 935) that was a clear outsider with regard to the other Y. frederiksenii isolates we examined.
Dolina and Peduzzi (17) found that some of the Y. kristensenii strains in their collection were genetically more closely related to Y. mollaretii than to other strains of Y. kristensenii, as determined by MLEE analysis. Our MLST data obtained during our study of the atypical Y. kristensenii strains (e.g., Y. kristensenii WA 948 and WS 052) in our collection are consistent with their report. The number of Y. kristensenii and Y. frederiksenii strains analyzed during our study was relatively small, and we detected only a few atypical strains; thus, it is not clear how common those strains are in nature. However, our data support the idea that such strains do exist, and they emphasize the need for additional MLST studies to elucidate the prevalence of such strains and to delineate their taxonomic classification, since they may represent a new species within the genus Yersinia.
Y. ruckeri (the etiologic agent of red mouth disease in fish) was a clear outsider with regard to all other yersiniae, based on (i) 16S RNA analysis (Fig. 1) and (ii) analysis of the concatenated sequences (Fig. 2). Moreover, the ATs of the four Y. ruckeri strains in our collection were unique to that species; i.e., AT7 (for glnA), AT15 and AT9 (for gyrB), AT4 (for recA), and AT11 (for Y-HSP60) were not found in any strains of any other Yersinia species we examined. The taxonomic status of Y. ruckeri has been controversial since its initial classification in 1978 (21), and data reported by several investigators (5, 17, 38, 48) suggest that Y. ruckeri should not be classified in the genus Yersinia. Our 16S RNA and MLST data suggest that Y. ruckeri is the genetically most distant species within the genus and that the taxonomic standing of Y. ruckeri may need to be reevaluated.
Correlation between the genetic backgrounds of various Yersinia species and their O antigens. Many O antigens are shared among various species of Yersinia; e.g., Y. frederiksenii shares several O antigens with Y. enterocolitica (57), and many Y. kristensenii strains are agglutinable in the Wauters scheme (58). However, the genetic relatedness of Yersinia strains with common O antigens has never been rigorously examined, and the mechanisms responsible for this phenomenon are currently not well understood. During the present study we identified two Y. mollaretii strains (WA 134/92 and WE 302/92) and one Y. bercovieri strain (W601) that shared the O:3 and O:8 antigens (51, 61), respectively, with Y. enterocolitica (Table 1). Y. mollaretii strains with Y. enterocolitica O antigens have been described previously (4), and our data provide additional evidence for the existence of such strains. Genetically, the two Y. mollaretii strains with the O:3 antigen were closely related to one another, according to their 16S RNA sequences (Fig. 1) and their shared NTs (Table 1). However, both strains were genetically distinct from all of the Y. enterocolitica strains we examined (Table 1 and Fig. 1). Similarly, Y. bercovieri W601 was not closely related to the Y. enterocolitica strains, including the two strains (8081 and YE 371) with which it shared the O:8 antigen (Fig. 1 and 2). These observations suggest that O-antigen switching, which has been proposed (54) to have occurred among several other bacteria, may have also occurred between at least some Yersinia species.
Evidence of intraspecies and interspecies genetic recombination in Yersinia. The START analysis of our MLST data revealed a noticeable difference in the G+C contents of the genes we analyzed (Table 2). The G+C content differed by as much as 11% among the various loci; e.g., the mean G+C contents for the gyrB and 16S RNA loci were 47% and 58%, respectively. The backbone G+C content of the Yersinia genome is estimated to range from 46 to 48% (5), and with the exception of the 16S RNA loci, which had a higher G+C content, the remaining loci we analyzed were within this typical G+C range for Yersinia. Furthermore, species-specific differences in the G+C content were not observed for any of the loci we examined; i.e., there was no apparent evidence that the loci were acquired from distinct ancestors.
As an additional approach to detect recombinational changes, we used the Splitstree analysis, utilizing small subsets of randomly selected strains of various Yersinia species (simultaneously analyzing many strains overwhelmed the program, in agreement with what has been reported previously for other bacteria [37, 47]). The observation of parallelograms during Splitstree analysis indicates that recombination played a role in the evolution of the gene under examination, and we observed such parallelograms during our analyses (one example of such analyses for each of the four housekeeping genes is presented in Fig. 3). The bootstrapping values were greater than 90% (Fig. 3)in many cases, the bootstrapping values were at their maximum of 100% and the fit values were the maximum possible value of 100indicating robust results based on multiple genetic changes. We consider the possibility that these multiple changes resulted from multiple, independent, recurrent mutations to be very small, particularly since nearly all of the nucleotide changes failed to result in amino acid changes (Table 1). Thus, the parallel changes are likely to be the result of recombination rather than independent recurrent mutations. Also, the dN/dS ratios for all loci were <1 (Table 2), suggesting that there is no selective pressure on nucleotide changes in these genes.
Parallelograms were also formed during our Splitstree analysis of the 16S RNA loci (data not shown) in strains within the same species and in strains of various Yersinia species. Interspecies recombinations in 16S RNA genes have recently been proposed (52) to occur in some non-Yersinia species, and our data suggest that the same phenomenon also may occur among the yersiniae. However, caution is required in interpreting these data, because a relatively small subset of strains was analyzed, and the number of nucleotide differences between the sequences is very small. Nevertheless, our data suggest that recombination in the 16S RNA genes of various Yersinia species is a possibility, and they indicate that additional studies in that direction are warranted. Such interspecies recombinations among the yersiniae could have important practical implications; e.g., they may complicate the development of 16S RNA-targeted tools to detect and identify various Yersinia species. These observations also support the idea (28) that gene trees based on a single gene (including conserved genes, such as those for 16S RNA) are not likely to permit accurate determination of the evolutionary history of the species. Therefore, analysis of several loci of the bacterial chromosome (e.g., by using MLST) is required for a rigorous understanding of the evolutionary history of yersiniae and for the development of advanced tools capable of accurately identifying and differentiating Yersinia species.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||