Previous Article | Next Article ![]()
Journal of Clinical Microbiology, February 2009, p. 300-310, Vol. 47, No. 2
0095-1137/09/$08.00+0 doi:10.1128/JCM.01916-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
Genotyping of Pathogens and Public Health,1 Biodiversity of Emerging Bacterial Pathogens,2 Institut Pasteur, Université Pierre et Marie Curie (Paris VI), UPRES EA 2392 Antibiotiques et Flore Digestive,3 Service de Microbiologie, Assistance Publique-Hôpitaux de Paris, Hôpital Saint-Antoine,4 Service de Bactériologie, Assistance Publique-Hôpitaux de Paris, Hôpital Tenon, Paris, France5
Received 3 October 2008/ Returned for modification 15 November 2008/ Accepted 22 November 2008
|
|
|---|
|
|
|---|
Seven Pantoea species are currently distinguished: P. agglomerans, the type species of the genus; Pantoea ananatis; Pantoea stewartii (divided into the two subspecies Pantoea stewartii subsp. stewartii, the agent of Stewart's vascular wilt in maize and sweet corn, and Pantoea stewartii subsp. indologenes); Pantoea dispersa; Pantoea citrea; Pantoea punctata; and Pantoea terrea (18, 20, 28, 37). The Pantoea agglomerans complex was previously designated Erwinia herbicola or Enterobacter agglomerans (18). The biochemical heterogeneity of P. agglomerans and related strains and species renders identification difficult, even if several biochemical or nutritional characteristics distinguish the "Japanese group" (20) of Pantoea species (P. citrea, P. punctata, and P. terrea). Currently, confident identification is not achieved routinely.
Precise knowledge of the phylogenetic relationships and the degree of genetic distinctness among Pantoea species is a prerequisite for their correct identification. Phylogenetic relationships among Pantoea species were initially based on 16S rRNA analysis (24, 39, 53), which showed that P. agglomerans, P. ananatis, and P. stewartii were closely related. The same result was obtained based on the three protein-coding genes atpD, carA, and recA (53). However, only one or a few strains per species were analyzed, and the phylogenetic relationships of the three "Japanese" species with the other Pantoea species have, to our knowledge, never been described. Hence, it is not clear whether Pantoea species are clearly demarcated and if the genus Pantoea forms only one phylogenetic branch.
Defining bacterial species remains a challenge, especially given the fact that homologous recombination or lateral gene transfer can disturb species boundaries. Currently, the approach to defining bacterial species uses both genomic and phenotypic characteristics. The pragmatic values of 70% DNA-DNA reassociation and a difference of <5°C in the melting temperature have been proposed as a cutoff for species definition (50) but are technically challenging to obtain. Multilocus sequence analysis (MLSA) provides an alternative way to define species and to explore sequence discontinuities among them (19). Phylogenetic analysis of concatenated multiple protein-coding genes sometimes allow one to clearly separate species into sequence clusters that can be used to define species, even if borders may be fuzzy for highly recombinogenic species (17, 23). Moreover, the MLSA approach can be used to estimate homologous recombination among species, which is important for determining the reliability of molecular identification based on one or a few genes and for estimating the impact of homologous recombination or lateral gene transfer on the speciation process. So far, no MLSA approach has been reported for Pantoea species.
Strain typing and population genetics studies are necessary for epidemiological purposes and to identify strains with important phenotypes such as virulence to plants or humans (11, 47). For example, it is important to determine if P. agglomerans strains differ in their abilities to infect humans or to cause specific diseases in plants. Currently, P. agglomerans strains can be differentiated using fluorescent amplified fragment length polymorphism (5) or pulsed-field gel electrophoresis (51). However, these methods do not provide unambiguous definition of clones and clonal families, and the results are generally difficult to compare among laboratories. Currently, the amount of diversity within P. agglomerans and its population structure are unknown. A widely accepted method for studying strain relationships is multilocus sequence typing (MLST) (33). It consists of sequencing internal portions of several protein-coding genes. This method provides unambiguous and portable data and allows one to compare data worldwide, which is necessary in order to achieve a comprehensive overview of strain diversity and epidemiological distribution (32). In addition, this method is suitable for studying strain phylogeny and allows one to address evolutionary questions at the strain level within species (11). In contrast to MLSA, MLST relies on the comparison of allelic profiles of strains within species, whereas MLSA uses concatenation of gene sequences to define boundaries and phylogenetic relationships between species.
There are few data concerning the susceptibility of P. agglomerans to antimicrobial agents. In 1986, Muytjens et al. reported in vitro susceptibility data for eight species of Enterobacter, including 27 strains of E. agglomerans (38). The strains exhibited very variable susceptibilities to β-lactams, aminoglycosides, and quinolones. Hieng et al. (25) described a case of septicemia due to an Erwinia herbicola strain that was resistant to ampicillin, carbenicillin, and cephalothin (cefalotin) and susceptible to other antibiotics usually active on gram-negative bacilli. Cruz et al. (7) reported similar results. No β-lactamase was found among the Pantoea species. In 2000, a clinical isolate of P. agglomerans recovered from a patient with septic arthritis was reported to be highly resistant to fosfomycin (8).
The objectives of this study were (i) to define the phylogenetic relationships among P. agglomerans strains, or strains that may be misidentified as P. agglomerans, and other species of Enterobacteriaceae; (ii) to characterize Pantoea species or phylogenetic clusters biochemically and for their susceptibilities to antimicrobial agents; and (iii) to develop and evaluate MLST for strain discrimination and determination of the population structure of P. agglomerans.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Characteristics of the strains used in this study
|
PCR amplifications and sequencing. PCR amplification and sequencing of internal portions of the six housekeeping genes fusA, gyrB, leuS, pyrG, rplB, and rpoB were performed with the oligonucleotide primers given in Table 2, using the protocol described previously for Plesiomonas shigelloides (44). These genes were selected because they are single-copy-number genes, essential, and present in many bacterial lineages; therefore, they were expected to be present in all members of the Enterobacteriaceae. Primers for fusA, leuS, and pyrG were retrieved from Salerno et al. (44), and primers for gyrB, rplB, and rpoB were designed to amplify all species of the Enterobacteriaceae. The PCR cycles used for amplification are shown in Table 2. PCR products were directly sequenced after purification by ultrafiltration (Millipore, France) on both strands using the BigDye Terminator ready reaction kit (version 3.1; Perkin-Elmer). Purification was performed by ethanol precipitation. Sequence reaction products were analyzed using an ABI 3730XL automated DNA sequencer. 16S rRNA was amplified and sequenced under the same conditions using the primers and PCR cycles detailed in Table 2. The repA gene was amplified by PCR with the previously described primers Rep23220_5 and Rep25101_3 (51). This PCR was carried out using 30 cycles (30 s at 94°C, 30 s at 45°C, and 1 min at 72°C).
|
View this table: [in a new window] |
TABLE 2. Primers and conditions used for amplification and sequencing
|
Antimicrobial susceptibility testing. Antimicrobial susceptibility was determined by the disk diffusion method on Mueller-Hinton agar (Bio-Rad, Marnes la Coquette, France) according to the recommendations of the Comité de l'Antibiogramme de la Société Française de Microbiologie (CA-SFM) (http://www.sfm.asso.fr). The antibiotic disks (Bio-Rad) were as follows: amoxicillin, amoxicillin-clavulanic acid, ticarcillin, ticarcillin-clavulanic acid, piperacillin, piperacillin-tazobactam, cephalothin, cefoxitin, cefotaxime, ceftazidime, aztreonam, imipenem, gentamicin, tobramycin, netilmicin, amikacin, nalidixic acid, ofloxacin, ciprofloxacin, trimethoprim-sulfamethoxazole, fosfomycin, and colistin. MICs of amoxicillin, amoxicillin-clavulanic acid, cephalothin, cefotaxime, and fosfomycin were determined by the standard agar dilution method according to the recommendations of the CA-SFM.
Data analysis. Chromatogram traces were edited and analyzed using BioNumerics, version 5.10 (Applied Maths, Sint-Martens-Latem, Belgium). Each base of the selected template region was confirmed by at least two chromatograms (forward and reverse); if there were ambiguities for a sequence, an additional sequence reaction was performed. Sequences were aligned with BioNumerics. The best model of nucleotide substitution was determined using the MODELTEST Web server (42). Maximum-likelihood trees and bootstrap values were calculated using PhyML (21) with the appropriate model of nucleotide substitution, and the proportion of invariable sites was estimated. Nucleotide diversity and level of polymorphism were calculated using DNAsp, version 4 (43). MLST data were analyzed by the standard MLST approach: for each gene, an allele number was attributed to each allelic variant, and the sequence type (ST) of a strain corresponded to the combination of the allele numbers of the six genes.
Nucleotide sequence accession numbers. All sequences of MLST genes are available on our publicly accessible MLST Web server (http://www.pasteur.fr/mlst). The rrs gene (coding for 16S rRNA) sequences were deposited in the GenBank/EMBL/DDBJ databases under accession numbers FJ357809 to FJ357836.
|
|
|---|
![]() View larger version (19K): [in a new window] |
FIG. 1. Maximum-likelihood tree of 34 species of Enterobacteriaceae (each represented by its taxonomic type strain), the 36 Pantoea study strains, and 3 other gammaproteobacteria, constructed using the concatenated sequences of six loci (fusA, gyrB, leuS, pyrG, rplB, and rpoB). Sequences of the following strains were retrieved from the GenBank database: Yersinia pestis CO92 (accession no. NC_003143), Pasteurella multocida subsp. multocida Pm70 (NC_002663), Shewanella oneidensis MR-1 (NC_004347), and Haemophilus influenzae Rd KW20 (NC_000907). The numbers at the nodes are bootstrap values higher than 80%, obtained after 1,000 replicates.
|
![]() View larger version (21K): [in a new window] |
FIG. 2. (A and B) Maximum-likelihood trees of 28 strains of Pantoea agglomerans and closely related species, constructed using concatenated sequences of six protein-coding genes (fusA, gyrB, leuS, pyrG, rplB, and rpoB) (A) and 16S rRNA sequences (B) based on the general time-reversible model of nucleotide substitution, which was inferred to be the most likely using MODELTEST. (C) Split decomposition analysis tree of the 20 strains of P. agglomerans and 2 P. agglomerans-like strains. Circles and triangles represent P. agglomerans groups I and II, respectively. The two P. agglomerans-like strains, PA5 and SB547, are in the rectangle. The numbers at the nodes are bootstrap values higher than 80%, obtained after 1,000 replicates.
|
![]() View larger version (39K): [in a new window] |
FIG. 3. Biotype-100 results for the 36 strains used in the study. The tree on the left corresponds to a dendrogram, obtained by the unweighted-pair group method using average linkages, based on the concatenated sequences of six protein-coding genes (fusA, gyrB, leuS, pyrG, rplB, and rpoB) for the 36 strains. The tree at the top corresponds to the dendrogram, obtained by the unweighted-pair group method using average linkages, of 99 carbon sources clustered based on Dice similarity. A gray square means that the strain is able to utilize the substrate, whereas an empty square means that the strain is not able to grow on this carbon source after 48 h of incubation.
|
|
View this table: [in a new window] |
TABLE 3. Phenotypic characteristics of Pantoea agglomerans strains and related strains
|
Antibiotic susceptibility. All strains of P. agglomerans were uniformly susceptible to aminoglycosides (gentamicin, tobramycin, amikacin), fluoroquinolones (ofloxacin, ciprofloxacin), and trimethoprim-sulfamethoxazole. All strains were fully susceptible to broad-spectrum cephalosporins and imipenem. Testing for susceptibility to other β-lactams showed various results (Table 4): nine P. agglomerans strains were susceptible to all β-lactams; six strains showed intermediate susceptibility or resistance to amoxicillin; and four strains showed intermediate susceptibility or resistance to cephalothin. Isolates SB547 and PA5 were susceptible to all antibiotics. Strain PA16 and the type strains of P. dispersa and P. stewartii were resistant to cephalothin, and the type strain of P. ananatis was resistant to amoxicillin.
|
View this table: [in a new window] |
TABLE 4. MICs of several β-lactams
|
MLST discriminates P. agglomerans isolates. For the 20 P. agglomerans strains and isolates SB547 and PA5, the number of distinct alleles ranged from 4 for rplB to 17 for leuS (Table 5). Remarkably, each strain had a unique ST, showing that MLST is very discriminatory among P. agglomerans strains.
|
View this table: [in a new window] |
TABLE 5. Diversity of six housekeeping genes among 20 Pantoea agglomerans and 2 P. agglomerans-like strainsa
|
) ranged from 0.3% to 2.6% for rplB and leuS, respectively. Thus, P. agglomerans contains relatively large numbers of nucleotide polymorphisms among orthologous genes, even based on this limited strain sample. No nonsynonymous changes were found in pyrG and rpoB, and the ratio of synonymous to nonsynonymous changes (Ks/Ka) was higher than 11 for the other genes, consistent with selection against amino acid changes in housekeeping genes. For microevolutionary studies, phylogenetic relationships among strains using MLST data are typically deduced based on allelic profiles rather than nucleotide sequences, since the former approach is less sensitive to recombination (11). Among P. agglomerans isolates, most STs were distant by at least three allelic mismatches, with only two pairs of STs (ST11-ST12 and ST17-ST19) differing by two genes. Thus, it was difficult, based on this restricted sample, to identify groups of closely related STs that could correspond to clonal families.
The phylogenetic structure of P. agglomerans was investigated by analysis of nucleotide sequences using SplitsTree (Fig. 2C). This analysis revealed the existence of a cluster of four strains (group II), which was demarcated from the remaining 16 P. agglomerans strains, including the type strain (group I). This cluster was also recovered by maximum-likelihood (Fig. 2A) and neighbor-joining (data not shown) phylogenetic reconstructions, as well as from most individual gene phylogenies (see Fig. S1 in the supplemental material). Therefore, two phylogenetic groups can be distinguished within P. agglomerans. Three of four isolates of group II did not utilize D-lyxose, whereas all group I strains except one were positive for this carbon source. Of note, the 16S rRNA gene sequence did not allow separation between the two groups due to its limited amount of polymorphism (Fig. 2B).
Plasmid pPATH is associated with the virulence of P. agglomerans for plants (35), and P. agglomerans strains differ by the presence of this plasmid (51). Currently, the link between plant strains and human clinical isolates is not known. In order to determine the presence or absence of plasmid pPATH among our strains, we performed a PCR assay (51) that targets the repA gene, which codes for the replicase protein A of plasmid pPATH. It appeared that all P. agglomerans strains were positive by repA PCR, except for strain SB545 of group II. Of the two P. agglomerans-like strains SB547 and PA5, the former was negative and the latter was positive. All non-P. agglomerans strains were negative except for strain PA16 (identified as P. dispersa), which was positive and may have acquired plasmid pPATH horizontally. These results provide no evidence that P. agglomerans clinical strains represent a population distinct from plant-pathogenic strains.
|
|
|---|
The six genes fusA, gyrB, leuS, pyrG, rplB, and rpoB were successfully PCR amplified and sequenced for a collection of 36 strains initially identified as belonging to the Pantoea genus. The phylogenetic tree based on the concatenated sequences of the six housekeeping genes confirms that the genus Pantoea is heterogeneous. Pantoea species did not group in a single branch but instead were divided into two clusters. One cluster contained the type species of the genus (P. agglomerans), together with P. ananatis, P. dispersa, and P. stewartii. The second cluster contained the species P. citrea, P. terrea, and P. punctata, described by Kageyama et al. (28), which were strongly associated with the type strain of Tatumella ptyseos. These results indicate a necessity to revise the taxonomic status of these three species, as was suggested by Grimont and Grimont based on the distinct phenotypic characteristics of the "Japanese" group of Pantoea species (20), which possibly should be reclassified as belonging to the genus Tatumella.
Within the first cluster, P. agglomerans was clearly demarcated from the other species. Indeed, the 19 isolates and the type strain formed a tight cluster with less than 1.5% divergence on average for the six protein-coding genes, while these strains were separated by more than 7% divergence from P. ananatis and P. stewartii. Therefore, there appears to be no ambiguity in the distinctness of these species based on the present set of strains. In contrast, the two strains SB547 and PA5 showed an intermediate average distance from P. agglomerans strains. However, individual gene phylogenies of the six genes do not show the same picture, especially for fusA, on the basis of which the two strains clustered inside P. agglomerans (group II) (see Fig. S1 in the supplemental material). Some strains can have intermediate positions on phylogenetic trees deduced from concatenated sequences, in particular due to horizontal transfer occurring independently on individual genes, providing conflicting phylogenetic signals. These strains result in unclear demarcation of species, and this difficulty in separating species has led to the concept of "fuzzy species" (23). SB547 and PA5 potentially represent examples of strains that have received DNA sequences from external species, leading to their apparent separation from P. agglomerans. Alternatively, they may represent a distinct species that has received some genes (in particular, fusA) from P. agglomerans. This phenomenon makes it difficult to decide, based on the present data, whether the two strains SB547 and PA5 belong to P. agglomerans or to a closely related species. In contrast, PA2 clearly represents a species distinct from all currently described Pantoea species, since its uniqueness is supported by all individual gene phylogenies.
It may be noted that the position of P. ananatis relative to P. agglomerans and P. stewartii in the concatenate tree (Fig. 2A) was different from that in the 16S rRNA tree (Fig. 2B). This difference was also apparent in a previous study where 16S rRNA and protein-coding genes were analyzed (53). The plot of the distance based on 16S rRNA versus the distance based on the concatenated sequence showed a strong correlation, with most comparisons fitting on a single line, but P. ananatis stood as an outlier, with an atypically high 16S rRNA similarity (data not shown). Therefore, one may suspect that the 16S rRNA gene underwent horizontal transfer between P. agglomerans and P. ananatis. Based on individual gene phylogenies (see Fig. S1 in the supplemental material), five of our six genes, as well as the atpD and carA genes (53), grouped P. ananatis with P. stewartii, while fusA grouped P. ananatis with P. agglomerans. Thus, fusA may also have been imported from P. agglomerans into P. ananatis. Finally, we noted that for both subspecies of P. stewartii, the 5' portion of the 16S rRNA sequence was divergent from those of P. agglomerans and P. ananatis, with 8 single-nucleotide polymorphisms within 92 nucleotide positions (8.7%), whereas the average divergence over the entire length of the rRNA gene was 2.7%. This observation suggests that the 16S rRNA gene of P. stewartii may have a mosaic origin.
Overall, these results illustrate the importance of using multiple independent gene sequences, since they provide much-improved phylogenetic information over that obtained with the 16S rRNA gene. In addition, as proposed in the MLSA approach (23), the use of multiple genes allows one to buffer the effect of horizontal transfer on phylogenies and to be better informed on the validity of assigning strains to species.
Identification tools. Biochemical tests are broadly used for strain identification in clinical laboratories. Our results show that strains identified as P. agglomerans by widely used biochemical methods represent a diverse set of strains encompassing several unrelated phylogenetic branches. A majority of isolates initially identified as P. agglomerans clustered in a compact group together with the type strain and can be considered as belonging to the species P. agglomerans. Nevertheless, the API 20E method led to the misidentification of a few strains (SB546, PA2, PA8, PA10, PA13, PA15, and PA16), since phylogenetic analysis revealed that they did not belong to P. agglomerans. Thus, multilocus sequencing of protein-coding genes stands as a useful reference tool for the identification of P. agglomerans and for the characterization of atypical strains.
Even though biochemical identification used routinely can be imprecise, the Biotype-100 results were consistent with the phylogeny, and some characteristics appear to be potentially useful and can provide an orientation for strain identification, since they are specific to particular phylogenetic clusters. For example, D-tartrate was used only by P. agglomerans, whereas myo-inositol and meso-tartrate were used only by P. agglomerans and closely related Pantoea species; these results are consistent with previous data (20). Likewise, resistance to fosfomycin was found to be very useful for the identification of P. agglomerans.
Toward a universal MLST scheme for Enterobacteriaceae? Strain typing is an important tool for detecting the source of infections in epidemiological investigations (48). In addition, the possible association between ecology or virulence and particular P. agglomerans clones is currently unknown, and future research into these important questions would benefit greatly from a standard definition of strains. MLST is now widely recognized as a powerful approach for defining groups of related strains (clones), for determining the geographic and temporal distribution of clones, and for revealing the internal genetic structure of species. Until now, MLST developments have been restricted to a few species of the Enterobacteriaceae (1, 9, 29, 52), because it remains difficult to design successful PCR primers for sequencing of protein-coding genes, especially when no genome sequence is available. In addition, because different gene sets are used for different species, and because distinct genes have distinct evolutionary rates, direct comparison of amounts of diversity and population genetics parameters among species is not possible.
We successfully developed an MLST scheme for Pantoea species in the absence of a complete genome sequence for Pantoea by designing broad-range primers based on universally conserved protein-coding genes. Our data show that MLST is a powerful typing tool for P. agglomerans, since each of the 20 isolates had a distinct multilocus genotype (allelic profile). This result was not necessarily expected, since the use of broadly distributed genes could have implied a low evolutionary rate and hence low discrimination. Together with previous results for Plesiomonas shigelloides (44) and Enterobacter cloacae (41), which also showed high discrimination among strains, the present study therefore indicates that our set of broad-range primers could represent a universal MLST scheme applicable to most, if not all, species of the Enterobacteriaceae.
Population structure of P. agglomerans. Bacterial species differ widely in the rate of homologous recombination among strains (13, 45). Determining the frequency of recombination is important for understanding the evolution of bacterial pathogens and for the interpretation of typing data (46). Given the relatively small sample and the restricted degree of nucleotide polymorphism, it was not possible to determine the rate of recombination precisely and to demonstrate its occurrence statistically. However, two lines of evidence indicate that recombination is not rare among P. agglomerans strains. First, the distribution of nucleotide polymorphisms along the individual phylogenies of the three most informative genes (gyrB, leuS, and rpoB) showed homoplasy, which is likely to result from the occurrence of intragenic recombination in these genes. Second, split decomposition analysis of the concatenated sequences of the six genes showed a network-like structure (Fig. 2C). Typically, networks disclosed upon split decomposition analysis are interpreted as evidence for recombination (26), since they incorporate the effects of both intragenic and intergenic recombination. This analysis, therefore, suggests relatively frequent homologous recombination among P. agglomerans strains. This species, therefore, represents another example of a recombining species within the family Enterobacteriacae, following the previous demonstration of high rates of recombination in P. shigelloides (44) and moderate recombination in Escherichia coli (12, 52) and Salmonella enterica (10).
Despite evidence for homologous recombination among housekeeping genes, an internal phylogenetic structure could be revealed for P. agglomerans. Indeed, we consistently found two groups of strains that were recovered by different analyses and genes. The existence of an internal phylogenetic structure in the presence of recombination is reminiscent of the situation of E. coli, where at least six clearly separated groups are distinguished (27). In addition, utilization of D-lyxose appeared to distinguish P. agglomerans groups I and II, although this finding should be confirmed with additional strains. There was no association in our limited sample between the groups and the source of isolates, but the study of larger strain collections of diverse origins, including plant-pathogenic isolates and environmental isolates, may reveal associations between phenotypes and multilocus genotypes. In order to provide a common language for P. agglomerans strain characterization and evolution, a publicly available MLST website was set up at http://www.pasteur.fr/mlst.
A.D. was partially financed by a generous gift from the Conny-Maeva Charitable Foundation.
Published ahead of print on 3 December 2008. ![]()
Supplemental material for this article may be found at http://jcm.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2010 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»