Journal of Clinical Microbiology, August 2005, p. 3734-3742, Vol. 43, No. 8
0095-1137/05/$08.00+0 doi:10.1128/JCM.43.8.3734-3742.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Meningitis and Special Pathogens Branch, National Center for Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia 30333,1 University of Modena and Reggio Emilia, Medicine Faculty, School of Microbiology and Virology, via Campi 287 41100, Modena, Italy,2 Emory University, Division of Infectious Diseases, Atlanta, Georgia 303333
Received 8 March 2005/ Returned for modification 9 April 2005/ Accepted 17 April 2005
70% of the total number of cases (Centers for Disease Control and Prevention; Active Bacterial Core Surveillance reports, Emerging Infectious Program Network, Neisseria meningitidis; available at http://www.cdc.gov/ncidod/dbmd/abcs/survreports.htm, accessed 8 November 2004). Currently, rare cases of invasive serotype b disease continue to occur, most often identified in unvaccinated or undervaccinated populations with high carriage rates (12, 13, 22). Methods for molecular characterization of bacterial pathogens are commonly used to establish genetic relatedness or similarities between individual strains, which is useful for understanding disease transmission, tracking the spread of virulent or antibiotic-resistant strains, and monitoring the evolution of bacterial populations. H. influenzae isolates have been characterized by different molecular methods, including pulsed-field gel electrophoresis and multilocus enzyme electrophoresis (MEE) (25, 26). To date, MEE has been the most discriminatory method, showing that H. influenzae serotypes c, d, e, and f form monophyletic clusters, unlike H. influenzae serotype a or serotype b isolates, which are more diverse (24, 26, 31). MEE also showed that NT H. influenzae isolates are more heterogeneous than serotypable isolates due to frequent genetic recombination (24, 31). In the last few years, though, multilocus sequence typing (MLST) has been replacing MEE as a molecular subtyping tool for a number of bacterial pathogens, including H. influenzae (7, 19, 23, 37), since it allows much better intra- and interlaboratory comparison and portability of data.
Recently, 16S rRNA gene sequencing (16S typing) has been used to characterize Neisseria meningitidis isolates (33), and the epidemiological benefit of this approach became apparent in investigations of disease outbreaks caused by N. meningitidis serogroup C and W135 isolates. 16S typing provided the highest sensitivity, specificity, and positive and negative predictive values for defining outbreak-related versus sporadic isolates when compared with pulsed-field gel electrophoresis, MEE, and MLST (21, 33).
Because of the increased contribution of non-serotype b H. influenzae in invasive disease in the United States after the introduction of the serotype b vaccine, the lack of an ideal molecular subtyping method, especially for NT H. influenzae isolates, and the successes of 16S typing for N. meningitidis in outbreak settings, we analyzed a representative and diverse collection of H. influenzae isolates by 16S typing and MLST to do the following: (i) investigate the diversity of 16S rRNA genes in H. influenzae isolates of different serotypes and in nonserotypable isolates, (ii) examine any association of 16S type with MLST sequence types (ST) or H. influenzae serotype, and (iii) evaluate the abilities of both methods to differentiate NT H. influenzae isolates.
H. influenzae surveillance isolates. Of the 330 H. influenzae isolates in this study, 293 were surveillance isolates. Two hundred twenty-seven H. influenzae case isolates were recovered from 2000 to 2003 by the Active Bacterial Core Surveillance/Emerging Infectious Program for H. influenzae invasive disease (part of the ongoing multistate active laboratory-based, population-based surveillance program coordinated by the Centers for Disease Control and Prevention [CDC]). They were selected for this study to provide representation for the U.S. H. influenzae isolates as follows: all isolates of serotypes a, b, c, and d (n = 12, 42, 1, and 3, respectively) and a random selection from serotypes e (43/80, 54%) and f (36/267, 13%) and NT H. influenzae (90/894, 10%).
H. influenzae community survey isolates. Twenty-six H. influenzae isolates (25 serotype b isolates and 1 NT isolate) were recovered from 1999 to 2000 during an investigation of serotype b transmission among Amish children in Pennsylvania (12). Eight of these isolates were from patients with invasive disease; the other 18 isolates were from asymptomatic carriers in two of the communities in which there were cases of serotype b disease. These isolates were selected to assess usefulness of 16S typing in an outbreak-like setting. An additional 66 serotype b isolates were collected from a study in Alaska during 1992 to 1997; 24 isolates were from cases of invasive serotype b disease, and the remaining 42 isolates were cultured from the throats of healthy Alaska Native children. They were selected for this study due to the slightly different epidemiology of H. influenzae disease in Alaska and its remote geographic location (18).
H. influenzae reference strains. Eleven H. influenzae strains were used as controls for serotype-specific PCR, including six CDC control isolates (serotype a, M4741; serotype b, M5216; serotype c, M6542; serotype d, M6548; serotype e, M5153; and serotype f, M6297) and five ATCC strains (serotype b, ATCC 9795; serotype c, ATCC 9007; serotype d, ATCC 9332; serotype e, ATCC 8142; and serotype f, ATCC 9833).
Identification, standard slide agglutination serotyping, and serotype-specific PCR. Identification of all H. influenzae isolates was performed in the state public health laboratories according to standard microbiological methods. H. influenzae serotypes were confirmed at the CDC by standard slide agglutination for all 330 isolates. PCR specific for the capsule transport gene (bexA) and serotype-specific PCRs for molecular characterization of H. influenzae capsular types (a to f) were performed on all 330 isolates as previously described (9), except that the final MgCl2 concentration was increased to 2.1 mM and the annealing temperature was 55°C when serotype e-specific primers were used.
Whole-cell suspensions. Whole-cell suspensions were used as templates for all PCRs. Cells were harvested from overnight growth on chocolate II agar plates (BD Bioscience, Baltimore, MD), transferred into 1.0 ml of 10 mM Tris buffer (pH 8), and vortexed to homogeneity. Suspensions were boiled at 100°C for 10 min and then stored at -20°C.
16S rRNA gene sequence. The 16S rRNA genes of all H. influenzae isolates were amplified by PCR using external primers to the 16S rRNA gene, primers F15 and R1594, as previously described (33). The amplified product of 1,595 bp was sequenced using 16 different primers: primers 357, 530, 790, and 1390 in forward and reverse orientation; primers 15, 17, 24, 981, and 1230 in forward orientation; and primers 1492, 1583, and 1585 in reverse orientation. A total of six novel primers were used in this study: F15 (5' TAAGCAGTTTATTGAGCGAT 3'), F17 (5' AGCAGTTTATTGAGCGATTG 3'), F24 (5' TATTGAGCGATTGAACTTGA 3'), R1583 (5' CTCGCTGTCTCTCGTCTTCA 3'), R1585 (5' CTCGCTGTCTCTCGTCTT 3'), and R1594 (5' GTGAGCACTCGCTGTC 3'). Primers were designed based on the 16S rRNA gene consensus sequence of the six operons of the serotype d strain Rd W20 (GenBank L42023) (11), using Oligo V6 software (Molecular Biology Insights, Inc., Cascade, CO). Primers 357, 530, 790, 981, 1230, 1390, and 1492 were described previously (8, 32, 35). Sequencing was performed using the BigDye terminator cycle sequencing kit version 2.0 (Applied BioSystems, Foster City, CA). Sequencing products were purified using Centri-Sep (Princeton Separations, Adelphia, NJ) and were resolved in an Applied BioSystems model 3100 automated DNA sequencing system (Applied BioSystems).
16S rRNA type determination. The 16S rRNA gene sequence (1,538 bp) was obtained and used for analysis and comparison by the GCG (Wisconsin) package, version 10.1 (Accelrys, San Diego, CA). A type number was assigned for each different 16S sequence; a sequence with even a single base difference, including a mixed base (more than one nucleotide identified at a single position), was considered a novel 16S type. Because H. influenzae has six rRNA operons (11), the PCR products could have different nucleotides at any single position for a given operon, which could result in a heterologous base call or "mixed base" in the consensus sequence. When a unique 16S type was obtained, the 16S rRNA gene amplification and sequencing of the entire gene or parts containing the novel region were repeated.
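The type-assignment rule described above can be sketched in a few lines. This is only an illustration, not the GCG workflow used in the study: the function name `assign_16s_types` and the toy sequences are hypothetical, types are numbered in order of first appearance (which need not match the published type numbers), and mixed bases are assumed to be written as IUPAC ambiguity codes (for example `R` for A/G), so that a mixed base call changes the sequence string and therefore yields a new type.

```python
def assign_16s_types(sequences):
    """Assign a type number to each distinct sequence, in order of first appearance.

    sequences: dict mapping isolate name -> 16S sequence string.
    Returns a dict mapping isolate name -> integer type number (1-based).
    """
    type_of_sequence = {}  # sequence string -> type number
    isolate_types = {}
    for isolate, seq in sequences.items():
        key = seq.upper()
        if key not in type_of_sequence:
            # Any difference, including a mixed-base code, creates a new type.
            type_of_sequence[key] = len(type_of_sequence) + 1
        isolate_types[isolate] = type_of_sequence[key]
    return isolate_types


# Toy data (hypothetical, far shorter than the real 1,538-bp sequences).
toy = {
    "isolate_A": "ACGTACGT",
    "isolate_B": "ACGTACGT",  # identical to A -> same type
    "isolate_C": "ACGTACGA",  # one base differs -> new type
    "isolate_D": "ACGTRCGT",  # mixed base (R = A/G) -> new type
}
types = assign_16s_types(toy)
```

Exact string comparison is the simplest way to honor the "single base difference" rule; it presupposes that all sequences cover the same region and are trimmed identically.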
MLST. MLST was performed by sequencing gene fragments of adk, atpG, frdB, fucK, mdh, pgi, and recA as previously described (23) (http://www.mlst.net). MLST was performed on 304/330 H. influenzae isolates in this study. MLST testing of the 26 isolates from Pennsylvania was conducted previously (23). The detailed MLST results and sources of the isolates determined in this study have been deposited in the H. influenzae public database (http://www.mlst.net). The eBurst algorithm and the H. influenzae database at http://eburst.mlst.net were used to examine relationships among isolates (10).
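As an illustration of how an ST follows from the seven allele numbers, the sketch below looks up an allelic profile in a table of known profiles and registers a new ST when the combination has not been seen. The allele and ST numbers are made up, the helper names `profile_key` and `assign_st` are hypothetical, and numbering a novel ST locally is a simplification: in practice new alleles and STs are assigned through the public MLST database.

```python
LOCI = ("adk", "atpG", "frdB", "fucK", "mdh", "pgi", "recA")


def profile_key(alleles):
    """Turn a dict locus -> allele number into a tuple in fixed locus order."""
    return tuple(alleles[locus] for locus in LOCI)


def assign_st(alleles, known_profiles):
    """Return (st_number, is_novel) for an allelic profile.

    known_profiles: dict mapping profile tuple -> ST number; a novel
    profile is added to it in place with the next free number.
    """
    key = profile_key(alleles)
    if key in known_profiles:
        return known_profiles[key], False
    new_st = max(known_profiles.values(), default=0) + 1
    known_profiles[key] = new_st
    return new_st, True


# Hypothetical profile table and isolates (not real database entries).
known = {(1, 1, 1, 1, 1, 1, 1): 1, (1, 2, 1, 1, 1, 1, 1): 2}
isolate_x = dict(zip(LOCI, (1, 2, 1, 1, 1, 1, 1)))
isolate_y = dict(zip(LOCI, (1, 2, 1, 1, 1, 1, 5)))
result_x = assign_st(isolate_x, known)
result_y = assign_st(isolate_y, known)
```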
Dendrogram. Two dendrograms were generated: one for 16S and one for MLST data. Each dendrogram contained a single representative sequence for each of the 65 different 16S types or 89 different STs. For the MLST dendrogram, DNA sequences for each of the seven genes for each ST were concatenated and used as an input file. Evolutionary distance correlation was predicted by the method of Jukes and Cantor, and the phylogenetic dendrograms were generated using the unweighted pair group method with arithmetic mean (UPGMA) (16).
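The two steps named above, a Jukes-Cantor distance followed by UPGMA clustering, can be sketched as follows. This is a minimal illustration under stated assumptions rather than the software used in the study: the function names and the toy sequences are hypothetical, the sequences are assumed to be already aligned, and the sketch returns only the tree topology (as nested tuples) without branch lengths. The Jukes-Cantor distance is d = -(3/4) ln(1 - 4p/3), where p is the proportion of differing sites.

```python
import math
from itertools import combinations


def jukes_cantor(seq_a, seq_b):
    """Jukes-Cantor distance between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    diffs = sum(1 for a, b in zip(seq_a, seq_b) if a != b)
    p = diffs / len(seq_a)
    if p >= 0.75:
        return math.inf  # the correction is undefined at saturation
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def upgma(names, dist):
    """Cluster labels with UPGMA and return the topology as nested tuples.

    names: list of labels; dist: dict frozenset({a, b}) -> distance.
    """
    clusters = {name: 1 for name in names}  # cluster -> number of leaves
    d = dict(dist)
    while len(clusters) > 1:
        # Merge the closest pair of current clusters.
        a, b = min(combinations(clusters, 2), key=lambda pair: d[frozenset(pair)])
        merged = (a, b)
        size_a, size_b = clusters.pop(a), clusters.pop(b)
        for other in clusters:
            # Size-weighted mean, i.e. the average over all leaf pairs.
            d[frozenset((merged, other))] = (
                size_a * d[frozenset((a, other))]
                + size_b * d[frozenset((b, other))]
            ) / (size_a + size_b)
        clusters[merged] = size_a + size_b
    return next(iter(clusters))


# Toy aligned sequences (hypothetical).
seqs = {"t1": "ACGTACGTAC", "t2": "ACGTACGTAT", "t3": "TCGAACGGAT"}
dist = {
    frozenset((a, b)): jukes_cantor(seqs[a], seqs[b])
    for a, b in combinations(seqs, 2)
}
tree = upgma(list(seqs), dist)
```

For the MLST dendrogram the same functions would be applied to the concatenated seven-gene sequence of each ST instead of the 16S sequence.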
Nucleotide sequence accession numbers. The 330 16S rRNA gene sequences determined in this study have been deposited in the GenBank database under the following accession numbers: AY613445 to AY613775.
16S rRNA gene sequencing. (i) Sequence diversity. In total, 65 different 16S types were identified (Table 1). Sequence differences among these 65 16S types ranged from 0.06% to 2.73%, i.e., from a single base to small regions of up to seven consecutive base changes. The 16S rRNA gene sequence for serotype f isolates (16S types 13, 51, and 54) is shorter by one base due to a base missing at position 211 (1,537-bp length). When the nucleotide sequences of the 16S rRNA gene fragments (1,538 bp) from all 330 isolates were aligned and compared, 78 positions of difference throughout the gene were found. In 32 of these 78 positions (41%), mixed bases were detected. There were two hypervariable areas (46 bp and 34 bp) in the 16S rRNA gene responsible for 33 (42%) of the 78 positions of difference. These areas are between positions 170 and 216 (18 positions) and 1002 and 1036 (15 positions) in the 16S rRNA gene; these positions correspond to variable regions V2 and V6 (27). The remaining 45 positions were distributed across the entire length of the 16S rRNA gene sequence.
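The quantities reported in this paragraph (percent difference between two sequences, positions of difference, and positions with mixed bases) can be computed from an alignment as in the sketch below. It is an illustration only: the function names and the toy alignment are hypothetical, gaps are not treated specially, and mixed bases are assumed to be encoded as IUPAC ambiguity codes.

```python
MIXED_BASE_CODES = set("RYSWKMBDHVN")  # IUPAC ambiguity codes


def percent_difference(seq_a, seq_b):
    """Percentage of aligned positions at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    diffs = sum(1 for a, b in zip(seq_a, seq_b) if a != b)
    return 100.0 * diffs / len(seq_a)


def variable_positions(alignment):
    """1-based positions at which the aligned sequences do not all agree."""
    return [
        i
        for i, column in enumerate(zip(*alignment), start=1)
        if len(set(column)) > 1
    ]


def mixed_positions(alignment):
    """1-based positions at which at least one sequence has a mixed base."""
    return [
        i
        for i, column in enumerate(zip(*alignment), start=1)
        if MIXED_BASE_CODES & set(column)
    ]


# Toy alignment (hypothetical, 8 bp instead of 1,538 bp).
alignment = ["ACGTACGT", "ACGTACGA", "ACGTRCGT"]
var_pos = variable_positions(alignment)
mix_pos = mixed_positions(alignment)
pct = percent_difference(alignment[0], alignment[1])
```

As a rough check on scale, one difference in 1,538 bp is about 0.065%, and 42 differences in 1,538 bp is about 2.73%.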
TABLE 1. Distribution of 65 16S types and 89 STs and their correlation
(iii) 16S rRNA types among NT H. influenzae isolates. Forty 16S types were found among the 91 NT H. influenzae isolates, and none of these 16S types was found among serotypable isolates (Table 1). Thirty-seven (92.5%) of these 40 16S types were identified in only 1 to 4 isolates each. Only three 16S types, types 3, 4, and 39, are represented by more than five isolates each. Together these account for 33 (36%) of the NT H. influenzae isolates: 16S type 3 (n = 16; 18%), type 4 (n = 10; 11%), and type 39 (n = 7; 8%).
MLST. (i) MLST diversity. The nucleotide sequences of the seven MLST genes were aligned and compared with those on the MLST website. We found between 22 and 43 different alleles for each gene: 29 for adk, 22 for atpG, 30 for frdB, 26 for fucK, 43 for mdh, 36 for pgi, and 29 for recA. In total, 62 novel alleles were found and added to the MLST database: 8 for adk, 5 for atpG, 10 for frdB, 6 for fucK, 12 for mdh, 14 for pgi, and 7 for recA. When the alleles for the 7 genes for each of the 330 isolates were combined, we obtained 89 different STs, of which 47 were novel: 9, 4, 4, and 30 new STs were found among serotype b, serotype e, serotype f, and NT H. influenzae isolates, respectively. The novel STs also were added to the MLST database. For each of the 89 different STs, a concatenated DNA sequence of 3,057 bp was generated using the individual sequences for each of the 7 genes. The concatenated sequences were aligned and used to generate a dendrogram (see Fig. 2).
FIG. 2. Consensus UPGMA phylogenetic tree constructed with 89 concatenated DNA sequences obtained from seven housekeeping genes of H. influenzae. The number in the branches indicates the ST, and the respective serotypes are represented. The symbols by STs 44 and 157 (circle), 117 (open square), and 4 (filled square) indicate STs of H. influenzae serotypes that correlate with 16S types in Fig. 1; they are indicated by the same symbols. The scale bar represents an expected substitution rate of 0.1 nucleotide substitution per base position. See the legend to Fig. 1 for abbreviations.
(iii) STs among NT H. influenzae isolates. Fifty-four STs were found among and were exclusive to the 91 NT H. influenzae isolates (Table 1). Forty-seven (87%) of these STs are represented by 1 (n = 29; 54%) or 2 (n = 18; 33%) isolates only. The remaining 13% are STs represented by 3 (n = 3; 6%) or 4 (n = 4; 7%) isolates.
Correlation among H. influenzae 16S types, STs, and serotypes. The correlation between 16S types and STs is presented in Table 1. Thirty-five of the 65 (54%) 16S types correlated exclusively with a particular ST; 27 of the 35 (77%) were among NT H. influenzae. For the remaining 30 16S types (46%) that did not correlate exclusively with a particular ST, 13 (43%) were also NT H. influenzae. Twenty-nine of the 89 (33%) STs correlated exclusively with a particular 16S type; 24 of the 29 (83%) were among NT H. influenzae. For the remaining 60 STs (67%) that did not correlate exclusively with a particular 16S type, 30 (50%) were also NT H. influenzae. There was a specific correlation between 16S types and STs with serotypes; not a single 16S type or ST was found in two or more different H. influenzae serotypes. Six major clones defined by a combination of ST/16S types represent 69% of serotypable isolates (Table 2).
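One way to tabulate this kind of correlation is sketched below. It rests on an assumption about the wording above: a 16S type is taken to "correlate exclusively" with an ST when every isolate of that 16S type has the same ST, and likewise for STs. The function name `exclusive_counts` and the numbers in the toy data are hypothetical and are not taken from Table 1.

```python
from collections import defaultdict


def exclusive_counts(pairs):
    """Count exclusive associations between 16S types and STs.

    pairs: iterable of (type_16s, st) tuples, one per isolate.
    Returns (n_16s_exclusive, n_st_exclusive): the number of 16S types
    seen with exactly one ST, and the number of STs seen with exactly
    one 16S type.
    """
    sts_by_type = defaultdict(set)
    types_by_st = defaultdict(set)
    for type_16s, st in pairs:
        sts_by_type[type_16s].add(st)
        types_by_st[st].add(type_16s)
    n_16s = sum(1 for sts in sts_by_type.values() if len(sts) == 1)
    n_st = sum(1 for types in types_by_st.values() if len(types) == 1)
    return n_16s, n_st


# Toy data: 16S types 1-3 all share ST 10; 16S type 4 spans STs 11 and 12.
pairs = [(1, 10), (2, 10), (3, 10), (4, 11), (4, 12)]
counts = exclusive_counts(pairs)
```

Under this definition the two counts need not be equal, which matches the asymmetry reported above (35 16S types versus 29 STs).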
TABLE 2. The six most prevalent combinations of MLST and 16S rRNA types among serotypable H. influenzae isolates
Relationships among 16S rRNA or MLST sequences. Dendrograms generated with 16S rRNA or MLST gene sequences show that isolates of a particular serotype are grouped as one unique, serotype-specific cluster, with the exception of serotype b isolates, which were grouped into two serotype b clusters (b-I and b-II), and that NT H. influenzae isolates are mainly grouped into three unclearly delineated clusters (NT-I, NT-II, and NT-III) (Fig. 1 and 2). However, the 16S rRNA dendrogram shows three exceptions. First, a single serotype a isolate (16S type 47) was grouped into cluster NT-II. The 16S type 47 sequence is similar to 16S type 34 (serotype a), with differences at four positions (positions 263, 468, 1131, and 1133) and a region of 19 bp (between positions 831 and 850). At these positions, the sequence is identical to that of 16S types 20 and 39, both part of the cluster NT-II. This suggests that 16S type 47 arose by recombination between 16S type 34 (serotype a) and 16S type 20 or 39 (NT H. influenzae). Second, serotype b isolates of 16S types 27 (1 isolate) and 45 (20 isolates) were not in cluster b-I or b-II but clustered with NT H. influenzae isolates. Additional sequence examination showed that 16S types 27 and 45 are highly related to 16S type 6 (cluster b-II); the 16S type 27 sequence is the same as 16S type 6, except for 12 bp (between positions 180 and 191) that are identical to 16S type 12. The 16S type 45 sequence is also like 16S type 6, with a substitution of three consecutive bases (between positions 1131 and 1133) identical to those of 16S types 55 and 67. It appears that 16S types 27 and 45 arose by recombination between 16S types 6 and 12 to produce type 27 and between 16S types 6 and either 55 or 67 to produce type 45. MLST confirmed that these unusual serotype b isolates are within the MLST b-I cluster including the ST6 complex (Fig. 2).
According to eBurst analysis, ST117 (found in isolates of 16S type 27), ST44, and ST157 (both 16S type 45) are all related to ST6 with one, two, or three locus variations from ST6, showing these isolates to be members of the ST6 clonal complex.
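The notion of "one, two, or three locus variations" can be made concrete by comparing allelic profiles locus by locus, as in this sketch. It is not the eBURST algorithm itself, only the pairwise comparison that underlies it; the function name `locus_differences` and the allele numbers are hypothetical and do not represent the real profiles of ST6, ST44, ST117, or ST157.

```python
LOCI = ("adk", "atpG", "frdB", "fucK", "mdh", "pgi", "recA")


def locus_differences(profile_a, profile_b):
    """Names of the loci at which two seven-locus allelic profiles differ."""
    if len(profile_a) != len(LOCI) or len(profile_b) != len(LOCI):
        raise ValueError("each profile needs one allele number per locus")
    return [locus for locus, a, b in zip(LOCI, profile_a, profile_b) if a != b]


# Hypothetical profiles: a reference ST and two variants of it.
reference = (1, 1, 1, 1, 1, 1, 1)
variant_one = (1, 1, 1, 1, 1, 1, 2)  # differs at one locus
variant_two = (1, 3, 1, 1, 4, 1, 1)  # differs at two loci
diff_one = locus_differences(reference, variant_one)
diff_two = locus_differences(reference, variant_two)
```

The length of the returned list gives the number of locus variations between the two STs.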
FIG. 1. Consensus UPGMA phylogenetic tree constructed with 65 identified H. influenzae 16S types. The numbers on the branches indicate the 16S type, and the respective serotypes are represented. The symbols denoting 16S types 45 (circle), 27 (open square), and 47 (filled square) indicate H. influenzae isolates with 16S types and serotypes that do not cluster with other 16S types in each respective serotype. These 16S types correlate with STs in Fig. 2, as indicated by the same symbols. The scale bar represents an expected substitution rate of 0.1 nucleotide substitution per base position. Hi-NT, nontypeable H. influenzae; Hib, H. influenzae serotype b; Hia, H. influenzae serotype a; Hic, H. influenzae serotype c; Hid, H. influenzae serotype d; Hie, H. influenzae serotype e; Hif, H. influenzae serotype f.
1.0% to 2.0% 16S rRNA gene sequence differences cutoff threshold generally used for species identification (17). It is also interesting that the 16S rRNA gene sequences of all 38 serotype f isolates are one base shorter than those of all other H. influenzae 16S rRNA genes analyzed so far. Greater divergence in 16S rRNA genes has been found previously in other bacteria, e.g., bacteria having multiple operons within one genome, differing from each other from 6.7% to 11.6% (1, 4, 6, 20). Therefore, recombination between 16S rRNA genes of different strains or species can occur at a much higher frequency than originally suspected. In contrast to the high level of diversity of 16S rRNA genes among N. meningitidis and H. influenzae isolates, we have previously found that for some species, the diversity of 16S rRNA genes can be limited or even absent (14, 15, 32), suggesting that the level of 16S rRNA gene diversity is also species related. Some of the reasons that would affect the level of diversity among different bacteria would be natural competency, differences in ecology, the presence of DNA uptake sequences, and restriction modification systems (3, 5, 28, 29, 34). The 16S rRNA gene sequence diversity of H. influenzae isolates described here is greater than in any other human pathogenic bacterial species described so far. However, we still observed a direct correlation between 16S types and serotypes (a to f); no 16S type was found in 2 or more different serotypes. An identical correlation was also observed between STs and serotypes.
Previous studies using a variety of subtyping methods including MEE and MLST have shown that though serotypable H. influenzae behave as highly clonal populations, NT H. influenzae are not closely related to serotypable isolates, and they also appear to undergo more frequent recombination (24, 26, 31). Our 16S typing and MLST results are in agreement with these studies and even emphasize the lack of clonality of NT H. influenzae in that the 2 methods cluster isolates differently.
The results of this study demonstrate that 16S typing and MLST allow relationships among isolates to be defined when used on predominantly clonally structured bacteria such as serotypable H. influenzae. This was further confirmed in the analysis of serotype b isolates collected in Pennsylvania, as well as clinical and carrier serotype b isolates from Alaska, both areas with persistent disease.
This is the first study using 16S typing to characterize an extensive collection of H. influenzae isolates. It demonstrates an unexpectedly high level of DNA sequence heterogeneity (up to 2.73%) and emphasizes that species identification using 16S rRNA gene sequence cutoffs of 1% to 2% is not universally applicable to all bacterial species. Although there is not a 1:1 correlation between 16S types and STs, both methods similarly clustered nearly all H. influenzae serotypable isolates, but not NT H. influenzae. Despite the diversity found by 16S typing among these isolates, the evidence of clonal structure and association of a particular 16S type with a specific serotype remain. Because the 16S rRNA gene sequence alone provides a level of discrimination similar to that obtained with the analysis of 7 genes for MLST, 16S rRNA sequencing is a potentially useful molecular typing method for characterizing H. influenzae isolates.