Previous Article | Next Article ![]()
Journal of Clinical Microbiology, July 2004, p. 2926-2934, Vol. 42, No. 7
0095-1137/04/$08.00+0 DOI: 10.1128/JCM.42.7.2926-2934.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
National Reference Laboratory for Pathogenic Neisseria,1 Unit for Infectious Disease Control, Department of Clinical Microbiology, Örebro University Hospital, Örebro,3 Department of Strategic Development, Laboratory Medicine Östergötland, Linköping University Hospital, Linköping, Sweden2
Received 28 December 2003/ Returned for modification 15 February 2004/ Accepted 25 March 2004
|
|
|---|
|
|
|---|
The present study investigated whether DNA sequence analysis by pyrosequencing technology, strictly limited to highly polymorphic segments of the porB gene, could be used as a rapid and high-throughput method for molecular epidemiological characterization of N. gonorrhoeae isolates. Pyrosequencing is a recently described, fast real-time DNA sequence analysis by means of synthesis of short DNA stretches (21). It is a primer-directed polymerase extension assay, where the release of pyrophosphate upon nucleotide incorporation is measured in real time.
The rationale of the present study was that N. gonorrhoeae outer membrane protein PorB is universally present and does not undergo high-frequency variation during the course of infection in smaller groups of sexual contacts (33). Individual strains express only one of the two PorB protein groups, either PorB1a or PorB1b (4, 6, 11, 29), which are encoded by the mutually exclusive porB gene alleles, porB1a and porB1b, respectively. Naturally but rarely occurring strains that express PorB1a-PorB1b hybrids have been identified (4, 9). Antigenic diversities of PorB between strains form the basis for the widely used serogroup and serovar determination with monoclonal antibodies (MAbs) (14, 22), and attempts at identifying the antigenic epitopes of PorB of the serovar-specific MAbs have previously been published (1, 3, 4, 5, 16, 27). However, the exact amino acid sequences and structures of many of the epitopes are still unknown.
Previous genetic studies have concluded that porB gene sequencing can be used to trace the transmission of individual N. gonorrhoeae strains within the community, discriminate suspected clusters of gonorrhea cases, and confirm epidemiological connections of patients (5, 12, 28, 31). The segments of the porB gene encoding the eight surface-exposed amino acid loops of PorB (29) exhibit the highest level of polymorphism (1, 4, 5, 7, 12, 27, 28, 29, 31). Sequence analysis of four to six of these short, highly polymorphic segments of the porB gene generates discrimination between isolates that is almost identical to that obtained by sequencing of the entire porB allele (27). Therefore, pyrosequencing technology could be ideal for rapid, large scale sequencing of such carefully selected short, highly polymorphic segments of the porB gene.
The results of the present study show that pyrosequencing technology can be used for genetic variant (genovar) determination of N. gonorrhoeae isolates, a concept that may complement or even replace current internationally established major phenotypic antigen (serovar) determinations in routine use for epidemiological typing.
|
|
|---|
|
View this table: [in a new window] |
TABLE 2. Genetic variants (genovars) of N. gonorrhoeae isolates (n = 87) comprising 28 different serovars and one nonserotypeable serovar according to a previous study (27)a
|
porB gene real-time PCR. The entire porB gene was amplified by using the previously documented primers PorBU and PorBL (28). The amplifications were performed in a LightCycler system (Roche Molecular Biochemicals, Mannheim, Germany), using SYBR Green I fluorescence melting curve analysis for specific identification of porB1a or porB1b amplicons. Each PCR mixture (20 µl) contained 2 µl of LightCycler-FastStart DNA Master SYBR Green I (Roche Diagnostics GmbH, Mannheim, Germany), 3 mM MgCl2 (Roche Diagnostics), a 0.5 µM concentration of each primer, and 2 µl of DNA template. In each PCR, one porB1a reference strain (ATCC 43069), one porB1b reference strain (CCUG 15821), and one negative control (distilled water instead of DNA template) were included. The cycling parameters of the amplification were as follows: a FastStart enzyme activation step at 95°C for 10 min, followed by 30 sequential cycles of heating up to 95°C, 60°C for 10 s, and 72°C for 42 s. The cycling parameters of the subsequent melting curve analysis were as follows: heating the PCR products up to 95°C, cooling at 65°C for 45 s, and finally slowly heating (0.1°C/s) up to 95°C. For identification of porB1a or porB1b amplicons, the melting temperature (Tm) of the product, which is related to the size of the amplicons, was determined by fluorescence, which was continuously measured during the finishing slow-heating phase. The mean Tms with standard deviations (SDs) for the porB1a and porB1b alleles of all isolates were calculated. Subsequently, the individual isolates were designated as belonging to one or the other genogroup (porB1a or porB1b allele). The PCR products were stored at 4°C.
Real-time PCR of six short, highly polymorphic porB gene segments. The four and six most polymorphic segments, encoding surface-exposed amino acid loops of the mature PorB (29), of the porB1a and porB1b alleles, respectively, were included (Fig. 1 and 2). This was due to the fact that phylogenetic analysis of these segments and the entire porB alleles shows practically identical discriminatory ability between isolates (27). In the present study, the sequences encoding surface-exposed loop 3 and loop 6 of mature PorB1a protein were also included, since these contain suggested epitopes for the MAbs used in the serovar determination (16) (Fig. 1). The segments of the porB gene were amplified by using primers complementary to conserved regions of the porB gene, according to results in previous studies (27, 28) as well as sequences in GenBank (Table 1). The primers were located some nucleotides before, overlapping the beginning of, or within the segments encoding the loops of PorB (Table 1). All known polymorphic nucleotide sites in these segments were, however, located within the amplicons and clear of the primers used for the pyrosequencing. In each PCR, one of the primers for each segment was labeled with a biotin molecule at the 5' end for purposes of easy capture. The amplifications were performed in a LightCycler system (Roche Molecular Biochemicals) with SYBR Green I fluorescence melting curve analysis for identification of specific amplicons. In a pilot study, the PCR products were also analyzed by electrophoresis through a 2% agarose gel and ethidium bromide staining in order to ensure the specificity of the amplification. Each PCR mixture (20 µl) contained 2 µl of LightCycler-FastStart DNA Master SYBR Green I (Roche Diagnostics), 3 mM MgCl2 (Roche Diagnostics), a 0.5 µM concentration of each primer, and 2 µl of DNA template. In each PCR, one porB1a reference strain (ATCC 43069) and/or one porB1b reference strain (CCUG 15821) and one negative control (distilled water instead of DNA template) were included. The cycling parameters of the amplification were as follows: a FastStart enzyme activation step at 95°C for 10 min, followed by 40 sequential cycles of heating up to 95°C, 45°C for 10 s, and 72°C for 3 s. The cycling parameters of the subsequent melting curve analysis were as follows: heating the PCR products up to 95°C, cooling at 50°C for 30 s, and finally slowly heating (0.1°C/s) up to 95°C. For identification of a specific amplicon, the Tm of the product was given by the fluorescence that was continuously measured during the finishing slow heating phase. The PCR products were stored at 4°C prior to pyrosequencing.
![]() View larger version (20K): [in a new window] |
FIG. 1. Sequence variants and polymorphic nucleotide sites in each of the highly polymorphic segments of the porB1a alleles, encoding amino acid loops of the mature PorB1a protein (29), of N. gonorrhoeae isolates (n = 22). The nucleotide present at each polymorphic site among all of the sequence variants is shown for sequence variant 1 in each loop segment. For the other sequence variants, those sites that differ are shown. Dots indicate identity with sequence variant 1, and underlining indicates synonymous mutations. Nucleotide sites that are conserved in all sequence variants are excluded. The length of each loop segment is included, and the sites are numbered above in vertical format based on the nucleotide numbering of a porB1a gene (GenBank accession no. J03029).
|
![]() View larger version (35K): [in a new window] |
FIG. 2. Sequence variants and polymorphic nucleotide sites in each of the highly polymorphic segments of the porB1b alleles, encoding amino acid loops of the mature PorB1b protein (29), of N. gonorrhoeae isolates (n = 65). The nucleotide present at each polymorphic site among all of
the sequence variants is shown for sequence variant 1 in each loop segment. For the other sequence variants, those sites that differ are shown. Dots indicate identity with sequence variant 1, dashes represent alignment gaps (due to indels in some of the variants), and underlining indicates synonymous mutations. Nucleotide sites that are conserved in all sequence variants are excluded. The length of each loop segment, according to the multiple-sequence alignment, is included, and the sites are numbered above in vertical format based on the nucleotide numbering of a porB1b gene (GenBank accession no. J03017).
|
|
View this table: [in a new window] |
TABLE 1. Primers used in real-time PCR and pyrosequencing of the six highly polymorphic segments of the porB1a and porB1b alleles of N. gonorrhoeae
|
S, dCTP, dGTP, dTTP, enzyme mixture [DNA polymerase, ATP-sulfurylase, luciferase, and apyrase], and substrate mixture [luciferin and adenosine 5'-phosphosulfate]) was used according to the instructions of the manufacturer (Pyrosequencing AB) in a PSQ 96 MA system with PSQ SQA (version 2.0) software. The different types of deoxynucleoside triphosphates were separately dispensed in a sequence-directed and/or cyclic (dispensation order, 20 x CTGA or 25 x CTGA) manner. The sequences were automatically interpreted from the pyrograms by the PSQ SQA (version 2.0) software and manually edited after visual inspection. All sequencing results were compared to previous results obtained by using conventional Sanger sequencing (27, 28). Sequence comparisons and genovar designation. Different multiple-sequence alignments of the six highly polymorphic porB gene segments were performed with BioEdit (version 5.0.9) software and by manual adjustment. The nucleotide sequences in each of the segments of all of the isolates were compared, and each unique sequence was assigned a sequence variant number (Fig. 1 and 2). The most prevalent sequence variant in each segment was designated 1, the second most prevalent was designated 2, and so on. Subsequently, the isolates were designated as different genovars due to different sequence variants present in the polymorphic segments (Table 2). The sequence variants as well as the genovars of all of the isolates were entered into an Excel database.
|
|
|---|
Pyrosequencing. Real-time PCR amplification followed by sequence analysis by using pyrosequencing technology for the six highly polymorphic segments of the porB gene was performed without technical obstacles. Normally, pyrosequencing allowed rapid (approximately 1.5 h) determination of 96 sequences with 20 to 65 correct nucleotides each by means of automatic pyrogram analysis included in the software. Occasionally, manual editing of the obtained sequence was necessary. For porB gene segments encoding loops 1, 4, 6, and 8 and loops 6, 7, and 8 of the PorB1a and PorB1b proteins, respectively, the entire sequences could be determined by using only one of the DNA strands and, consequently, one sequencing primer in the pyrosequencing. The software-interpreted results of the pyrosequencing were in concordance with the conventional Sanger sequencing, with the exception of some uncertainties concerning the correct number of identical nucleotides in homopolymeric segments and some other minor discrepancies presumably due to nonsynchronized extension of some sequences, which limited the length of obtained reliable and correct sequences. The pyrogram of each analysis was reproducible, and a pyrogram of the sequence analysis of the loop 8 segment of the porB1b allele, which also illustrates the need of manual editing for homopolymeric segments in some cases, is shown in Fig. 3.
![]() View larger version (19K): [in a new window] |
FIG. 3. Pyrogram of the sequence analysis of the loop 8 segment of the porB1b allele (sequence variant 1, reverse strand), encoding surface-exposed amino acid loop 8 of the mature PorB1b protein (29), in N. gonorrhoeae. The nucleotide sequences of the entire loop 8 segment, as interpreted by the PSQ SQA (version 2.0) software and after manual editing are shown above the pyrogram. The discrepancy is indicated by underlining in the manually edited sequence and by an arrow in the pyrogram. The cyclic dispensation order of the deoxynucleoside triphosphates is indicated below the pyrogram.
|
The numbers of sequence variants identified in the six highly polymorphic segments of the porB1a allele, based on conventional sequencing (27, 28) and mostly confirmed by pyrosequencing, were 11 (segment encoding loop 1), 7 (loop 2), 6 (loop 3), 5 (loop 4), 5 (loop 6), and 6 (loop 8) (Fig. 1). Consequently, the number of sequence variants ranged from 5 to 11, with a mean of 6.7 per segment (Fig. 1). Thus, theoretically, the number of sequence variants in the highly polymorphic segments of the porB1a allele would allow >6.9 x 104 different porB1a genovars to be distinguished.
Among the porB1a isolates (n = 22) comprising nine different serovars and one nonserotypeable serovar, 22 unique genovars were assigned (Table 2).
Pyrosequencing of six highly polymorphic segments of the porB1b alleles from 65 N. gonorrhoeae strains and subsequent genovar determination. The 65 porB1b isolates comprised 65 unique porB1b gene sequences, with 145 polymorphic nucleotide sites (of 999) in the unambiguous multiple-sequence alignment, in accordance with previous studies using conventional Sanger sequencing (27, 28). The number of polymorphic sites in the six highly polymorphic segments varied from 8 to 31 (ratios of 20.5 to 38.3 per 100 sites), and the locations of the sites for all sequence variants are shown in Fig. 2. Most of the polymorphic sites were distributed in the sequences encoding the apexes of the amino acid loops of PorB1b (29). The polymorphism was due to single-nucleotide substitutions, clustered substitutions, and indels.
The numbers of sequence variants identified in the six highly polymorphic segments of the porB1b allele, based on conventional sequencing (27, 28) and mostly confirmed by pyrosequencing, were 18 (segment encoding loop 1), 22 (loop 3), 39 (loop 5), 17 (loop 6), 13 (loop 7), and 8 (loop 8) (Fig. 2). Accordingly, the number of sequence variants ranged from 8 to 39, with a mean of 19.5 per segment (Fig. 2). Thus, theoretically, the number of sequence variants in the highly polymorphic segments of the porB1b allele would allow >2.7 x 107 different porB1b genovars to be discriminated.
Among the porB1b isolates (n = 65) comprising 19 different serovars, 64 distinguishable genovars were identified (Table 2).
|
|
|---|
The present sequencing results (Fig. 1 and 2) provide evidence of a high discriminatory ability, practically the same as for sequencing the entire porB gene (27), and excellent typeability (all isolates were typeable) (Table 2). By using real-time PCR and pyrosequencing technology for DNA sequencing of polymorphic segments of the porB gene, rapid sequence data that are objective, portable for comparison between laboratories, and reproducible for epidemiological characterization of N. gonorrhoeae isolates are generated. This molecular strategy also continuously identifies new sequence variants in the highly polymorphic segments of the porB gene, reveals new genovars, and can be optimized to amplify the porB gene from clinical samples without culturing of the bacteria. By creating a database that comprises all sequence variants in each polymorphic segment, sequence variant numbers can be derived and isolates can be automatically designated as being different genovars.
Depending on the level of discrimination desired, the number of highly polymorphic segments of the porB gene analyzed can most likely be reduced in the future, and the nomenclature of the genovars can be simplified. Laboratories lacking access to pyrosequencing technology and using conventional Sanger sequencing could also adopt the same nomenclature of sequence variants and genovars. These laboratories have the option of analyzing the entire porB gene, but perhaps the most appealing alternative would be to sequence a fragment of approximately 400 to 600 bp of the porB gene, spanning, for instance, loops 1 to 4 of the porB1a allele and loops 3 to 7 of the porB1b allele, by using only one sequencing primer. Subsequently, sequence variants of some of the different highly polymorphic segments and genovars could be identified. However, pyrosequencing would seem to be the more convenient alternative.
Overall, sequence analysis by using high-throughput pyrosequencing technology is accurate and reproducible and can be almost fully automated. The technology is also less expensive, time-consuming, and labor-intensive, as well as easier to perform, than conventional Sanger sequencing. Disadvantages of the pyrosequencing technology are short read lengths (which is partly compensated for by the possibility of reading from the first base), the uncertainty in determining the correct number of incorporated identical nucleotides in homopolymeric segments, and the occasional occurrence of nonsynchronized extension of some sequences due to minus or plus frameshifting, which limits the length of obtained reliable and correct sequences (19, 20, 21). These areas need to be further developed. In the present study, homopolymers of 3 to 7 nucleotides in several of the loop segments were identified for some of the isolates, making it difficult for the software to interpret the correct number of incorporated identical nucleotides. This inherent problem in pyrosequencing technology is due to the nonlinear light response, which is not completely compensated for by the software algorithms, following incorporation of more than three or four identical nucleotides. However, the effect is less pronounced for G and C homopolymeric regions. The homopolymers may also cause nonsynchronized extension of the sequences due to insufficient completion of the homopolymers in some DNA strands, i.e., minus frameshifting, that can subsequently result in uncertain or erroneous interpretation of the pyrograms (19, 20, 21). Nevertheless, a sequence-directed dispensation order applied to conserved nucleotides preceding the polymorphic nucleotide positions, in combination with cyclic dispensation of the deoxynucleoside triphosphates (20 x CTGA) thereafter, use of reference peaks, and manual editing after visual inspection of the pyrograms, enabled the correct number of nucleotides in homopolymers to be revealed in most cases. A pyrogram of the sequence analysis of the loop 8 segment of the porB1b allele before and after manual editing is shown in Fig. 3. The method may also be further optimized by using sequencing primers with 3' ends that partly overlap the homopolymers of interest.
The widely used serovar determination for epidemiological characterization of N. gonorrhoeae has important limitations (10, 12, 13, 17, 24, 27, 28). In addition, for several of the MAbs used in the serovar determination, the precise amino acid residues of PorB that are critical for single-MAb reactivity have been difficult to identify (1, 3, 4, 5, 16, 27). Consequently, a rapid and stable molecular genetic method for routine use in the characterization of N. gonorrhoeae, and which overcomes the limitations of serovar determination, is definitely needed. However, the prospects of developing a genetic typing system comprising a completely congruent translation of the serovar determination seem to be limited.
In conclusion, the present study illustrates for the first time how pyrosequencing technology, strictly applied to highly polymorphic segments of the porB gene, can be used for molecular typing of N. gonorrhoeae. This paves the way for rapid molecular epidemiological characterization of N. gonorrhoeae isolates based on the porB gene, coding for the PorB protein, which is internationally established and routinely used as a major phenotypic antigen (serovar determination). The genetic variant (genovar) determination of N. gonorrhoeae isolates is a concept which, because of its simplicity, high discriminatory ability, portability, and reproducibility, complements and might eventually replace current internationally established major phenotypic antigen (serovar) determinations in routine use for epidemiological typing.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»