Previous Article | Next Article ![]()
Journal of Clinical Microbiology, January 2003, p. 197-204, Vol. 41, No. 1
0095-1137/03/$08.00+0 DOI: 10.1128/JCM.41.1.197-204.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Stanley Division of Developmental Neurovirology, Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21287
Received 10 June 2002/ Returned for modification 26 July 2002/ Accepted 16 October 2002
|
|
|---|
500- to 600-bp gene fragments of seven housekeeping gene loci, would include the pilA, abcZ, serC, glnA, gdh, gnd, and pip loci. |
|
|---|
A number of typing methods have been developed for N. gonorrhoeae based on growth requirements for specific nutrients and cofactors (1), antibiotic sensitivity (3, 20), differences in electrophoretic mobility of bacterial enzymes (17), and serological reactivity against surface antigens (12). The introduction of PCR and high-throughput nucleotide sequencing has led to the development of several DNA-based typing methods. Two methods that are highly discriminating are the opa gene (OPA) typing method and sequencing of the porB gene (POR sequencing), which we as well as others have employed (11, 16, 18, 22). A disadvantage of the above two methods is that variation at these loci is evolving rapidly and for reasons that are not fully understood, although immune selection plays an important role. An alternative approach is to use variation that is accumulating slowly and that is likely to be selectively neutral. Although only limited variation may be observed at a single locus, high levels of discrimination can be achieved by analyzing many loci. Brian Spratt and colleagues recently developed a molecular typing method, multilocus sequence typing (MLST), which fulfills the above criteria (14). MLST is based on the principles of multilocus enzyme electrophoresis, but characterizes the alleles present at multiple housekeeping genes directly by nucleotide sequencing rather than indirectly by the electrophoretic mobility of their gene products. The advantages of MLST compared to typing methods that involve comparison of DNA fragments in gels are that the data are unambiguous, can be shared between laboratories, and lend themselves to analyses of the genetic relationships among isolates. MLST schemes have been developed for several bacterial species (4-7, 13, 15), demonstrating the broad applicability of the method, and MLST-generated data have been used to address aspects of the population genetics and evolutionary biology of bacterial species (2, 8, 9). As an investigation preliminary to the development of an MLST scheme for N. gonorrhoeae, we have characterized the genetic diversity at 18 gonococcal housekeeping gene loci.
|
|
|---|
Preparation of chromosomal DNA. For DNA analysis, strains were plated from frozen stocks onto chocolate II agar plates (Becton Dickinson Microbiology, Franklin Lakes, N.J.) and incubated overnight at 37°C in 5% CO2. Genomic DNA was prepared from N. gonorrhoeae cells scraped off an agar plate culture by using the Clontech Nucleopsin tissue kit (BD Biosciences, Clontech, Palo Alto, Calif.) according to the manufacturer's instructions.
Amplification and nucleotide sequence determination of housekeeping genes.
PCR products were amplified with oligonucleotide primer pairs designed from the genome sequence of N. gonorrhoeae strain FA1090 (www.genome.ou.edu/gono.html). The primers are shown in Table 1. Each 50-µl amplification reaction mixture contained
10 ng of gonococcal chromosomal DNA, 0.5 µM each PCR primer, 1x Expand PCR buffer (Boehringer Mannheim Biochemicals, Indianapolis, Ind.), 0.2 mM deoxynucleotide triphospates, and 2 U of Expand High Fidelity (Boehringer Mannheim Biochemicals). The reaction conditions were 94°C for 1 min, primer annealing at 55°C for 30 s, and extension at 72°C for 30 s for 30 cycles and a final extension reaction for 10 min at 72°C. The amplification products were purified with the QIAaquick PCR purification kit (Qiagen, Valencia, Calif.), and their nucleotide sequences were determined on each DNA strand by using the amplification primers or sequencing primers (not shown) and half the amount of the manufacturer's recommended BigDye Ready Reaction mix (PE Biosystems, Foster City, Calif.). Unincorporated dye terminators were removed by the RapXtract II kit (Prolinx, Bothell, Wash.), and the reaction products were separated and detected with an ABI Prism 3700 automated DNA sequencer from PE Biosystems (Synthesis and Sequencing Facility, Department of Biological Chemistry, Johns Hopkins University School of Medicine). Trace data were edited, and nucleotide sequences were assembled with the SeqMan software program (DNASTAR, Inc., Madison, Wis.).
|
View this table: [in a new window] |
TABLE 1. PCR primers and sizes of gene fragments for 18 N. gonorrhoeae housekeeping genes
|
Nucleotide sequence accession numbers. Novel sequences determined in this study have been deposited in GenBank under accession no. AF520224 to AF520355.
|
|
|---|
600 bp to
2 kb (Table 1). The resulting segments included more than 90% of the putative open reading frame (ORF) of 9 loci and from 44 to 89% of the ORF of the remaining loci. The locations of the 18 genes on a physical map of the N. gonorrhoeae genome (strain FA 1090) are shown in Fig. 1. For strain FA 1090, the median distance between loci was 99,194 bp (interquartile range, 27,207 to 147,739 bp). The most closely spaced genes, glnA and aroE, were separated by 310 bp.
![]() View larger version (18K): [in a new window] |
FIG. 1. Location of 18 housekeeping genes and porB on chromosomal map of N. gonorrhoeae strain FA 1090
|
17,500 nucleotides spanning 18 loci were sequenced from 24 isolates. For each isolate, the sequences obtained at each of the 18 loci were compared with those of every other isolate, and sequences were assigned as distinct alleles if they differed at one or more nucleotide sites. A search of the GenBank database revealed three unique N. gonorrhoeae glnA sequences. No N. gonorrhoeae sequences were found in GenBank that matched the other 17 loci. The allele from strain FA 1090, whose genome has been completely sequenced, was arbitrarily assigned as allele 1. The number of alleles identified at the 18 loci ranged from 2 to 18 (Table 2). The mean number of alleles per locus was 8.3, which would potentially allow >3.7 x 1016 sequence types to be distinguished. The number of polymorphic (variable) sites at the 18 loci varied between 2 and 75. The positions of the polymorphic sites for all loci are shown in Fig. 2. Visual inspection of the sequences suggested that, with two exceptions, the distribution of polymorphic sites along each gene was random. An identical cluster of 13 polymorphic sites occurred in three alleles of glnA between positions 216 and 273. Three other alleles of glnA shared eight identical polymorphic sites over a region spanning positions 125 to 216. Two alleles of serC contained clusters of polymorphic sites; allele 5 had 12 polymorphic sites between positions 480 and 507, and allele 9 had 51 polymorphic sites distributed over the region from positions 639 to 1049. The visual impression was confirmed by the results of Sawyer's run test, which provided strong support for recombination (P < 0.0001 and P = 0.0003 for glnA and serC, respectively). Allele 1 of pilA contained a 12-bp in-frame insertion at position 250, and allele 12 had a 45-bp in-frame deletion from positions 286 to 330 (data not shown in Fig. 2). The average percent nucleotide difference between pairs of alleles was
2.0% at 17 of the 18 loci (Table 2). For serC, the interallelic differences ranged from 0.1 to 6.1%. The high value was due to the presence of a large number of unique polymorphic sites in allele 9. The mean percentage of nucleotide differences between alleles of N. gonorrhaoeae and the most closely related N. meningitidis allele at the 18 loci was 5.3%, with a range of 1.55% to 9.50% (Table 2). The dn/ds ratio was calculated as a measure of the degree of selection in the population of sequences (Table 2). As expected for evolutionarily conserved genes, the ratio was less than 1.0 for 14 housekeeping gene loci. For abcZ and gpdh, the dn/ds ratios were 1.3132 and 1.5239, respectively. |
View this table: [in a new window] |
TABLE 2. Nucleotide sequence variation in 18 N. gonorrhoeae housekeeping gene fragments
|
![]() ![]() View larger version (87K): [in a new window] |
FIG. 2. The variable sites in each of the unique sequences (alleles) of 18 gonococcal housekeeping genes. The nucleotide present at each variable site among the 25 isolates is shown for strain FA 1090 (allele 1). For the other alleles, only those sites that differ are shown. Sites that are the same as those in allele 1 are shown by dots. Nucleotide sites that are the same in all alleles are not shown. The sites are numbered above in vertical format based on the nucleotide numbering of N. gonorrhoeae strain FA 1090. The polymorphisms that are synonymous (s) and nonsynonymous (n) are shown below.
|
|
View this table: [in a new window] |
TABLE 3. Allelic profile of 25 N. gonorrhoeae strains
|
![]() View larger version (25K): [in a new window] |
FIG. 3. Dendrogram of 25 N. gonorrhoeae strains. For each strain, the allele at 18 housekeeping gene loci defined an allelic profile. The dendrogram was constructed by the UPGMA method from a matrix of pairwise differences in the allelic profiles of the 25 strains.
|
|
|
|---|
100,000 bp. Ideally, the loci should be spaced far enough apart that pairs of alleles are unlikely to be inherited together by recombinational events, because this would diminish the independent discriminatory power of each locus. Approximately 300 bp separated aroE from glnA, and
3,000 bp separated gdh and pgi1. Thus, the use of both loci of these pairs would be inadvisable in designing an MLST for N. gonorrhoeae. One caveat to this recommendation is that, for other strains, the genomic locations of the loci may differ.
Although only 25 gonococcal strains were studied, and 18 were obtained from a temporally and geographically restricted population, eight or more alleles were identified at 10 of the 18 loci examined, and from two to six alleles were identified at the remaining 8 loci. It is difficult to compare the diversity of gonococcal housekeeping genes to that of other bacterial species, because the loci and the numbers and sources of the strains studied vary. The distribution of polymorphic sites along each gene fragment was generally random and characterized by single nucleotide replacements, consistent with evolution of DNA sequences by point mutation. However, clusters of polymorphic sites were observed in some alleles of glnA and serC. Clustered polymorphisms may be indicative of a recombinational event. Two alleles of pilA contained either an insertion or deletion. Insertions and deletions may arise through recombination or replication slippage. In general, pairwise differences in nucleotide substitutions between alleles of N. gonorrhoeae were lower (average,
2.0%) than those between gonococcal and meningococcal alleles (average, 5.3%). However, for allele 9 of serC, the average intraspecies nucleotide difference (6.1%) was greater than the average nucleotide difference between gonococcal and meningococcal alleles (5.2%). This may be an example of an interspecies recombinational event at the serC locus. Housekeeping genes are believed to be either selectively neutral or subject to purifying selection. Thus, the rate of synonymous substitutions (ds) should be equal to or slightly greater than the rate of nonsynonymous substitutions (dn), giving a dn/ds ratio of
1. For 14 of the 16 housekeeping gene loci, for which sufficient data were available to calculate dn and ds, the ratio was <1. However the ratios were
1.3 and
1.5 for abcZ and gpdh, respectively, suggesting that these genes may be subject to positive Darwinian selection. If ratios >1 are confirmed with a larger data set, possible explanations for positive selection acting on these loci, such as differences in pathogenic potential, transmissibility or tissue tropism, should be investigated.
How many and which loci should be used in an MLST scheme depend on the cost and level of discrimination desired. The MLST schemes for N. meningitidis, Streptococcus pneumoniae, Streptococcus pyogenes, Staphylococcus aureus, and Campylobacter jejuni are all based on seven loci (http://www.mlst.net/). Knowledge of the number and position of polymorphic sites at a locus is essential for selecting the most informative set of loci. Another consideration is the size of the gene fragment. Because of the technical limitations of PCR product sequencing and the need to sequence both strands for maximal accuracy, gene fragments should be no longer than
600 bp. The data from our study can be used to help design an MLST scheme for N. gonorrhoeae. The 25 strains we studied were completely resolved by the allelic profiles of three housekeeping genes: pilA, abcZ, and pip or pgi2. The pgi2 locus is not ideal for an MLST scheme, because the polymorphic sites were distributed over
1,500 bp. As additional strains of N. gonorrhoeae are typed, it is likely that the list of loci will need to expand in order to achieve a high level of discrimination among strains. Other potentially informative loci include glnA from positions 125 to 651, gdh from positions 15 to 560, gnd from positions 669 to 1234, and pip from positions 81 to 573.
Typing of bacterial isolates can be used for clinical purposes to establish strain-specific correlates of microbial pathogenesis, for epidemiological purposes to investigate the endemic and epidemic spread of organisms, and for genetic purposes to better understand bacterial evolution. The choice of typing method will depend on the question that needs to be addressed. For clinical studies, it may be sufficient to study a single candidate virulence gene. MLST is the preferred method for evolutionary studies, because the variation detected is amenable to quantitative analysis and is believed to be selectively neutral. For epidemiological studies, the sometimes-conflicting demands of discriminatory power and clonal stability must be balanced. Short-term epidemiological studies require methods with high discriminatory power in order to identify recent transmission events. For long-term epidemiological studies, a genetic marker must exhibit sufficient clonal stability that ancestral relationships among strains can be discerned. N. gonorrhoeae poses a particular challenge, because the extent of recombination is very high. MLST is based on slowly accumulating genetic variation and thus is ideal for addressing long-term epidemiological questions. Previous investigations using MLST have been concerned primarily with the global molecular epidemiology of various bacterial species (4, 6,7, 14). However, by combining data from several genetic loci into an allelic profile, MLST can achieve high levels of discrimination. We studied strains obtained from a single sexually transmitted diseases clinic in Baltimore over a 2-year period and found that MLST could resolve the isolates better than either POR sequencing or OPA typing. The discriminatory power of MLST did not preclude the ability to identify the same strain in individuals who were known sexual contacts. Although the data set was small, the results suggest that MLST for N. gonorrhoeae may be useful for both short-term and long-term epidemiological studies. In future studies, we plan to characterize strains isolated in different years and from different subpopulations in Baltimore in order to assess changes in allele frequencies with time and in relation to the distribution of strains within core and peripheral transmission groups. In addition, we will examine strains recovered in association with local genital mucosal infection or systemic disseminated disease.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»