Previous Article | Next Article ![]()
Journal of Clinical Microbiology, July 2008, p. 2280-2290, Vol. 46, No. 7
0095-1137/08/$08.00+0 doi:10.1128/JCM.01752-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Department of Laboratory Medicine, Children's and Women's Health,1 Department of Cancer Research and Molecular Medicine, Faculty of Medicine, Norwegian University of Science and Technology,2 Department of Medical Microbiology, St. Olavs University Hospital, Trondheim, Norway,5 Groupe de Recherche sur les Maladies Infectieuses du Porc, Faculté de Médecine Vétérinaire, Université de Montréal, Saint-Hyacinthe, Québec J2S 7C2, Canada,3 Institut für Mikrobiologie und Tierseuchen, Freie Universität Berlin, P.O. Box 040225, D-10061 Berlin, Germany4
Received 3 September 2007/ Returned for modification 26 October 2007/ Accepted 30 April 2008
|
|
|---|
|
|
|---|
The E. coli species was subdivided into four phylogenetic groups—A, B1, B2, and D--based on multilocus enzyme electrophoresis (52). It has not been possible to establish reliable ancestral relationships between the phylogenetic groups due either to different speed of evolution among the groups or to frequent recombination (27, 61). Typical EPEC strains from different parts of the world have been shown to belong to four main clonal groups (33). In contrast, it has been argued that atypical EPEC strains may comprise a heterogeneous group of strains from different pathotypes that have acquired the LEE by horizontal transfer or are typical EPEC that have lost the EAF plasmid (58). Some atypical EPEC strains have shown a higher degree of genetic similarity to EHEC (O157:H7) than to typical EPEC (55). It has also been reported that strains classified as atypical EPEC may have very heterogeneous virulence profiles (2, 58).
Whereas typical EPEC is well recognized as a leading cause of severe childhood diarrhea in low-income countries (39), the role of atypical EPEC as a diarrheagenic agent has been controversial. Atypical EPEC has been shown to be prevalent both in children with diarrhea and in healthy children (14, 16, 20, 23, 31, 38, 40-44, 47, 50, 57). However, several investigators have reported a statistical association between atypical EPEC and diarrhea (4, 15, 41, 49, 51, 58).
In a recent case-control study from Norway, we found a high prevalence of atypical EPEC in children with or without diarrhea (1). When the strains were analyzed with respect to virulence gene content using DNA microarray and PCR, several genes were significantly associated with diarrhea (2). Among these, the association was highly significant for genes belonging to the pathogenicity island OI-122 (efa1/lifA, nleB, nleE, and set/ent), and for long polar fimbriae (LPF) when three variants of the lpfA gene were analyzed together. In contrast, the yjaA gene, which is often used as a phylogenetic marker, was negatively associated with diarrhea. This finding indicated a possible association between pathogenic potential of the atypical EPEC strains and their phylogenetic relationships.
Multilocus sequence typing (MLST), by which a set of multiple housekeeping gene loci are compared using sequence analysis, is a powerful tool for long-term epidemiological and phylogenetic analysis (35, 56). In the present study we used MLST to characterize the phylogenetic relationships between the atypical EPEC strains from the above-mentioned case-control study. In addition, we classified the strains in phylogenetic groups using PCR and compared the phylogenetic ancestry of the strains with their virulence characteristics based on microarray analysis and with the results of serotyping and pulsed-field gel electrophoresis (PFGE). Finally, we searched for a possible association between phylogenetic ancestry and diarrhea.
|
|
|---|
Phylogenetic group determination. Each atypical EPEC strain was assigned to one of the E. coli phylogenetic groups according to the presence or absence of the genes chuA, yjaA, and tspE4C2 as proposed by Clermont et al. (13).
MLST. For each strain the seven housekeeping genes adk, fumC, gyrB, icd, mdh, purA, and recA were amplified and sequenced according to the protocol of the Escherichia coli MLST database (http://web.mpiib-berlin.mpg.de). After overnight culture, bacterial DNA for PCR was obtained by heat lysis. Amplification was carried out with primers as previously published (61) in a total volume of 50 µl with 50 µM concentrations (each) of dATP, dCTP, dGTP, and dTTP; 0.5 µM concentrations of each primer (MedProbe, Oslo, Norway); 10x PCR buffer (Applied Biosystems, Branchburg, NL); 1.5 mM MgCl2; 1 U of AmpliTaq Gold polymerase (Applied Biosystems); and 2 µl of bacterial DNA extract as a template. The reaction conditions used were 15 min at 95°C, followed by 30 cycles of 1 min at 95°C, 1 min at the annealing temperature specified for each gene (see reference 61 or the E. coli MLST website), and a 2-min extension at 72°C, followed finally by 5 min at 72°C in a thermocycler, either an MJ Research PTC-200 (Bio-Rad, Hercules, CA) or a GeneAmp PCR System 9700 (Applied Biosystems). The PCR products were purified for sequencing with a QIAquick PCR purification kit (Qiagen, Valencia, CA). Both DNA strands were sequenced with the PCR primer set, with primers published by Tartof et al. (54), or with primers designed in the present study—gyrB(Trh)F, 5'-AAGTGATCATGACCGTTCTG-3'; icd(Trh)F, 5'-GATGGAATCGGTGTAGATGT-3'; icd(Trh)R 5'-GTAGCCCCAGTCTTTAAACG-3'; and purA(Trh)R, 5'-TGCTTGCAGAGGAACTCGC-3'. Sequencing was performed by using either the CEQ DTCS-Quick Star kit (Beckman Coulter, Fullerton, CA) or a BigDye terminator cycle sequencing kit (v3.1; Applied Biosystems) with subsequent capillary electrophoresis, respectively, on a Beckman Coulter CEQ 8800 or an ABI 3130x genetic analyzer according to the protocol of the manufacturers.
Sequence analysis. Raw sequence traces were reviewed by visual inspection using Sequencher software version 4.2 (Gene Code Corp.). Forward and reverse sequences were aligned, and consensus sequences corresponding to the allele templates were compared to known variants of the corresponding gene at the E. coli MLST database. Ambiguities in consensus sequences were resolved by resequencing. Each of the seven gene loci was assigned an allele number by submission of the sequences to the E. coli MLST database. Allelic sequences previously not reported were given new allele numbers by the curator of the database after independent review of sequence traces. Each isolate was assigned a sequence type (ST) according to its allelic profile.
Phylogenetic analysis.
For each bacterial strain the sequences from all seven gene loci were concatenated for phylogenetic analysis. Concatenated sequences were then aligned by using the CLUSTAL W algorithm of the MEGA3 software (32). A rooted neighbor-joining tree was constructed by using a Kimura two-parameter model of nucleotide substitution and the phylogenetically divergent E. coli strain Z205 from the E. coli MLST database as an outgroup (61). Tree stability was assessed by bootstrap analysis with 1,000 iterations. Using SplitsTree 4 software (26), phylogenetic network analysis was done with the neighbor-net algorithm and untransformed distances (p distance). The SplitsTree
w recombination test was applied to concatenated sequences and for each of the seven gene loci individually to distinguish recombination from recurrent mutation.
The phylogenetic relationships between different STs were analyzed by using the MSTree application of Bionumerics version 4.6 (Applied Maths, Sint-Martens-Latem, Belgium) to identify closely related genotypes. In this analysis atypical EPEC strains are compared based on similarity in allelic profiles. Strains of different STs sharing six of seven alleles were interpreted as belonging to the same clonal lineage but were assigned to an ST complex by the curator of E. coli MLST database only when that lineage included at least three different STs.
The phylogenetic diversity of atypical EPEC strains isolated from children with diarrhea and healthy children was compared by using Simpson's index of diversity: D = 1 – [
n(n – 1)/N(N – 1)], where n is the number of subjects with atypical EPEC strains belonging to each ST, and N is the total number of subjects in each of the two groups (25, 53).
Virulence gene profile. The atypical EPEC strains were compared with respect to virulence gene content based on the previously reported results of DNA microarray experiments and PCR (2). The microarray used in that study, derived from an E. coli virulence and antimicrobial resistance microarray (8), was composed of 70-mer oligonucleotide probes specific for 182 virulence genes or markers found in various intestinal and extraintestinal E. coli strains of all known pathotypes. The oligonucleotide microarray allowed the detection of genetic variants of the eae gene (17 variants), and three variants each of the espA, espB, and tir genes, all located on the LEE pathogenicity island. Oligonucleotides specific for three variants [lpfA(O113), lpfA1, and lpfA(R141)] of the LPF-encoding gene lpfA were also included. The overall virulence profiles of the atypical EPEC strains were compared by principal-component analysis (R, version 2.6 [www.r.project.org]) of the 95 virulence genes detected in one or more of the strains (2). The strains were assigned to virulence clusters based on their distribution in the principal component analysis.
PFGE. Macrorestriction analysis (PFGE) of chromosomal DNA was done using XbaI with the following electrophoretic conditions: 14°C, linear ramp of 5 to 60 s over 24 h, 120° switch angle, and a gradient of 6.0 V cm–1. Analysis was done by using Bionumerics software. Similarities of fragments between strains were compared by using a Dice coefficient at 1.0% tolerance and 0.5% optimization, and a dendrogram was constructed with the UPGMA (for unweighted pair-group method with arithmetic averages) clustering method. Significant clusters were determined by calculating the cutoff value that produced the highest point-bisectional correlation (Bionumerics manual, version 4.6).
Serotyping. Serotyping of somatic (O) antigens (serogroups O1 to O177) and flagellar (H) antigens was done by using standard methods (24) at the Escherichia, Shigella, Yersinia, and Vibrio Reference Unit, Laboratory for Enteric Pathogens at the Health Protection Agency (United Kingdom).
Statistical analyses. Fisher's exact test was used for the statistical analyses of categorical variables, and the Mann-Whitney U-test was applied to the analysis of differences between quantitative data which were not normally distributed. P values <0.05 were considered significant.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Comparison of phylogenetic classification, virulence profile, serotype, PFGE clustering, and clinical information for each of the 56 atypical EPEC strains identified in a case-control study among children <5 years old in Norway
|
Phylogenetic analysis of concatenated sequences of the seven MLST loci showed that STs within the phylogenetic group B2 were grouped together and were clearly distinct from STs of the three other phylogenetic groups (Fig. 1). SplitsTree analysis revealed several parallel paths indicating phylogenetic incompatibility in the divergence of atypical EPEC clones (Fig. 2). By using the
w test, statistical significant evidence of recombination was demonstrated (P = 6.8 x 10–6). When each gene was analyzed separately, evidence for recombination was found for the genes fumC (P = 0.002) and gyrB (P = 0.025) but not for the five other genes.
![]() View larger version (35K): [in a new window] |
FIG. 1. Phylogenetic relationships between 56 atypical EPEC strains from Norwegian children. A rooted phylogenetic tree was constructed by the neighbor-joining algorithm based on the Kimura two-parameter model of nucleotide substitution. The ST with the number of isolates (in brackets) is given at each branch tip. Bootstrap values greater than 50% based on 1,000 replications are shown at the internal nodes. Clonally related STs are shown by shaded boxes. The classification in phylogenetic groups, distribution of LEE gene variants, the presence of OI-122, lpfA, and EHEC-related genes, and the sources of the strains are shown for each ST. For combinations of LEE gene variants, see Table 3.
|
![]() View larger version (34K): [in a new window] |
FIG. 2. Phylogenetic relationships between 56 atypical EPEC strains from Norwegian children are presented as a phylogenetic splits network based on the neighbor-net algorithm using a p distance matrix. STs are indicated at the branch tips. The sources of the strains are listed in parentheses (patients/controls). STs belonging to the same phylogenetic group are enclosed by an ellipse.
|
Phylogenetic background compared to virulence characteristics. The atypical EPEC strains were assigned to three different clusters by principal-component analysis of the 94 virulence genes or markers identified in the study (Fig. 3). Two of the clusters were highly compact, indicating a high degree of similarity in virulence profiles between the strains. Both of these clusters consisted exclusively of phylogenetic group B2 strains and were labeled clusters B2-A and B2-B. The third cluster was much wider, indicating more heterogeneity in virulence genes between these strains. This cluster included all of the phylogenetic group A, B1, and D strains and was accordingly labeled cluster A-B1-D. Phylogenetic group B and especially group A strains were widely dispersed within this cluster. Phylogenetic group D strains, in contrast, were located at the periphery and may represent a separate cluster.
![]() View larger version (6K): [in a new window] |
FIG. 3. Principal component analysis of the distribution of 94 putative virulence genes or gene variants in 56 atypical EPEC strains isolated from Norwegian children. Scores (A) and loadings (B) for the first two principal components; the phylogenetic group of each strain is indicated by the design of the marker, while the color shows whether the strain was isolated from children with (red) or without (blue) diarrhea. The loadings show genes previously shown to be statistically linked with diarrhea in red (2).
|
|
View this table: [in a new window] |
TABLE 2. Genes or gene variants restricted to one virulence cluster or phylogenetic group in 56 atypical EPEC strains
|
Ten combinations of different variants of the LEE genes espA, espB, tir, and eae were present among the 56 atypical EPEC strains (Table 3). Each of the four phylogenetic groups, and even some of the STs, included strains with different combinations of LEE genes (Fig. 1). Some of these combinations were observed only within one phylogenetic group, but two of the most frequent combinations were detected in strains from more than one group. Although most of the eae variants were associated with only one specific combination of the espA, espB, and tir gene variants, the gamma and the iota variants were linked with two different combinations of the other LEE genes (Table 3).
|
View this table: [in a new window] |
TABLE 3. Distribution of different variants of four LEE genes compared to phylogenetic group and the presence of OI-122 genes in 56 atypical EPEC strains isolated from children <5 years old from Norway with or without diarrheaa
|
Genes linked to the EHEC pathotype, the pO157 plasmid genes katP, espP, etpD, ehxA, and L7095 and the EHEC urease gene ureD were detected in 14 (25%) strains. These genes were present in strains of nine different STs within the phylogenetic groups A, B1, and D (Fig. 1). Eight different combinations between the EHEC-related genes were observed (data not shown). The 15 strains that contained an incomplete stxB1 sequence belonged to seven different STs within the phylogenetic groups A, B1, and D.
PFGE. When the 56 atypical EPEC strains were analyzed by PFGE, all but two ST10 strains were typeable. Among the 54 typeable strains, the majority displayed unique genotypic patterns (Fig. 4). Except for two strains with identical PFGE restriction patterns isolated from siblings, there was no known epidemiological connection between other strains with closely related PFGE profiles. Five distinct clusters were identified by using the cluster cutoff method, while 42 different clusters identified if a similarity cutoff value of 90% was applied.
![]() View larger version (60K): [in a new window] |
FIG. 4. PFGE cluster analysis compared to phylogenetic ancestry, virulence profile, phylogenetic group, and source of atypical EPEC strains from Norwegian children with or without diarrhea. Clusters identified by the cluster cutoff method (see the text) are visualized by dense lines in the dendrogram. Two strains (Trh7 and Trh10) were not typeable by the PFGE method used. The STs 35 and 526 share six of seven alleles but have not been assigned to a ST complex. See Table 1 for more information.
|
Eleven different H-types were detected among the atypical EPEC strains. Most common were the flagellar types H34 (seven strains), H8 (five strains), and H7 (four strains). Seven strains were nonmotile (H–), and the H-type was not identified in 14 strains.
Phylogenetic ancestry and diarrhea. An association between phylogenetic decent and diarrhea could indicate the presence of virulence traits linked to phylogenetic ancestry. In the present study none of the phylogenetic groups A, B1, and D were significantly associated with diarrhea (Table 4). However, a trend toward significance was observed for the groups B1 (P = 0.06) and D (P = 0.09). In contrast, strains belonging to phylogenetic group B2 were more common in healthy controls than in children with diarrhea. Phylogenetic group A strains were equally common in patients and controls.
|
View this table: [in a new window] |
TABLE 4. Distribution in phylogenetic groups of 56 atypical EPEC strains from children with or without diarrhea with respect to the source of the strains
|
There was a higher degree of diversity in phylogenetic ancestry between strains isolated from children with diarrhea than in strains from healthy children. All but one of the 20 different clonal lineages detected in the study were present among the 37 strains from symptomatic children (Simpson's diversity index of 0.95). In comparison, no more than six clonal lineages were present among the 19 strains from healthy children (Simpson's diversity index of 0.83).
|
|
|---|
Comparison between phylogenetic background and virulence characteristics has usually been done only for a limited number of virulence factors (15, 22, 42, 45, 48, 58) or for strains belonging to specific EPEC serotypes (6). The use of data from DNA microarray experiments in the present study permitted an extensive characterization of the atypical EPEC strains with respect to overall virulence gene content, as well as a comparison between virulence profile and phylogenetic ancestry (Fig. 3). The main division between group B2 strains and the three other groups—A, B1, and D—observed in the phylogenetic analysis (Fig. 1) was supported by differences in virulence profiles (Fig. 3). There was also agreement between the two different types of analyses in the separation of phylogenetic group B2 strains in two clusters, while phylogenetic group A and B1 strains were not reliably differentiated by any of the two methods. Group D strains, on the other hand, were narrowly scattered at the periphery of this cluster (Fig. 3) and were also shown to contain significantly more virulence genes than strains belonging to the other virulence groups. The link observed in the present study between phylogenetic ancestry and virulence profile (Table 2 and Fig. 3) may be explained by the requirement of a specific genetic background for the acquisition of certain virulence factors (17).
One possible reason for poor separation between strains belonging to different groups in the phylogenetic analysis might be the presence of hybrid strains carrying ancestry from more than one source (61). Such hybrid strains were not identified by the PCR method used for the analysis of phylogenetic groups in the present study. However, at least four of the STs identified in the study (ST 28, 32, 154, and 206) were previously observed to contain phylogenetic ancestry from more than one source (61).
Typing of LEE genes may be of importance in the characterization of A/E pathogens since different variants of these genes have been associated with tropism to different locations in the human intestine (19, 46) and may be used for epidemiological characterization of LEE-containing strains. The LEE genes espA, espB, tir, and eae are all genes that may be exposed to high selection pressure since they encode adhesins and effector proteins that interact directly with the host. In the present study 11 different combinations of these four LEE gene variants were identified. In contrast to previous reports (12, 21), several of the espA, espB, tir, and eae variants could be observed in combination with more than just one variant of the other LEE genes. This finding is consistent, as recently suggested, with horizontal exchange between different strains not only of entire LEE sequences but also of smaller gene elements within the LEE (11). The link between certain combinations of LEE gene variants and OI-122 genes shown in the present study (Table 3) may be due to close proximity of the genomic islands in the chromosome of these atypical EPEC strains, a finding similar to what has been shown for O103:H2 EHEC strains (28).
The finding of genes usually linked to the EHEC pathotype in a considerable proportion of the atypical EPEC strains (Table 1) is consistent with evidence from epidemiological and experimental studies showing that atypical EPEC may convert to, or be a conversion from, the EHEC pathotype through the acquisition or loss of stx genes (7, 58, 61). Such a relationship is further supported by the presence of STs belonging to the phylogenetic lineages EHEC1 (ST335 and ST587) and EHEC2 (ST29) (59; http://www.shigatox.net) among the atypical EPEC strains. The variability in the content of plasmid genes between different strains is in agreement with reports of extensive heterogeneity of large plasmids in STEC (9) and A/E E. coli of animal origin (3).
The main impression from the results of the PFGE analysis is that of extensive heterogeneity between the atypical EPEC strains in the study. Differences in the PFGE banding pattern of several fragments were observed even between strains belonging to the same clonal lineage or virulence group (Fig. 4). The PFGE results also confirm a pattern of endemic infection among children during the study period and demonstrate, as expected, that the PFGE method has greater discriminatory power than MLST in differentiating between epidemiologically unrelated atypical EPEC strains.
The finding that an O serogroup was identified in less than half of the strains in the study (Table 1) supports the view that O serogrouping is not useful in the diagnosis of atypical EPEC infections, at least with endemic atypical EPEC strains as reported here. This view is also supported by the observation that OI-122-positive atypical EPEC strains, which we have previously shown to be significantly associated with diarrhea (2), belonged to many different serogroups or were nontypeable (Table 1).
Phylogenetic ancestry was a less useful indicator of diarrheagenic potential in this collection of atypical EPEC strains (Table 4) than specific virulence genes reported previously (2). This is most likely explained by the considerable heterogeneity in virulence factors within each of the phylogenetic groups (Fig. 3 and Table 2). The OI-122 gene efa1/lifA most strongly associated with diarrhea in our previous study was present only in some of the strains within the phylogenetic groups A and B1 (Fig. 1). On the other hand, the negative statistical association with diarrhea shown for the phylogenetic group B2 seems to indicate a link between phylogenetic descent and lack of diarrheagenic potential.
The lack of significant association observed between phylogenetic groups and diarrhea in the present study may in part be explained by the limited number of strains analyzed. In addition, the high number of STs among the strains made a statistical analysis for the relationship with diarrhea for each ST impossible. A considerably larger study than the present will be needed to clarify this issue further. However, the finding that some STs, present in more than two strains, were represented in only one of the groups of children (patients or controls) may indicate a difference in virulence potential between these STs (Fig. 1). Interestingly, the three STs that were detected exclusively in strains from children with diarrhea all contained the efa1/lifA gene.
A limitation of the comparison between phylogeny and virulence genes used in the present study is that only a small part of the genome of each strain was included in the analysis. It is therefore possible that as-yet-unrecognized virulence genes were missed. An alternative could be to do a comparison at the whole-genome level, for instance, by comparative genomic (18) or subtractive (60) hybridization. However, the strength of the present study is the analysis of virulence genes of all pathotypes. In contrast, an analysis at the whole-genome level is usually based on hybridization of the test strain against a sequenced reference strain. In such an analysis, genes which are not present in the genome of the reference strains will not be detected.
In conclusion, we have shown that atypical EPEC strains from Norwegian children with or without diarrhea belonged to all four phylogenetic groups, to 26 different STs, and to 20 different clonal groups. The strains were separated into three clusters by overall virulence gene profile. One large cluster included all phylogenetic group A, B1, and D strains, and two clusters consisted exclusively of strains belonging to group B2. Almost one-third of the virulence genes were detected in only one virulence cluster or phylogenetic group. There was considerable heterogeneity in PFGE profiles and serotypes, and almost half of the strains were O nontypeable. The efa1/lifA and other OI-122 genes were detected in many different STs, but only within the phylogenetic groups A, B1, and D, and were linked with certain combinations of LEE gene variants. EHEC pathotype-related genes were present in one-fourth of the strains. There was borderline significant association with diarrhea for the phylogenetic groups B1 and D, but phylogenetic ancestry was less useful as a predictor of diarrhea than the specific virulence genes shown previously.
J.E.A. was supported by a Ph.D. grant from the Central Norway Regional Health Authority.
Published ahead of print on 7 May 2008. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»