Previous Article | Next Article ![]()
Journal of Clinical Microbiology, October 2005, p. 5034-5043, Vol. 43, No. 10
0095-1137/05/$08.00+0 doi:10.1128/JCM.43.10.5034-5043.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Prateep Thanakijcharoen,3
Opart Karnkawinpong,3 and
Prasit Palittapongarnpim1,2*
National Center for Genetic Engineering and Biotechnology, Phatumthani,1 Department of Microbiology, Faculty of Science, Mahidol University, Bangkok,2 Amnat Charoen Hospital, Amnat Charoen, Thailand3
Received 22 March 2005/ Returned for modification 13 May 2005/ Accepted 24 July 2005
|
|
|---|
0.6) and subsequently were used as the representative VNTR typing set for comparison to IS6110-based restriction fragment length polymorphism (RFLP) typing. The 10-locus VNTR set, potentially providing >2 x 109 allele combinations, obviously showed discriminating capacity over the IS6110 RFLP method for M. tuberculosis isolates with fewer than six IS6110-hybridized bands, whereas it had a slightly better resolution than IS6110 RFLP for the isolates having more than five IS6110-hybridized bands. Allelic diversity of many VNTR loci varied in each IS6110 RFLP type. Genetic relationships inferred from the 10-VNTR set supported the notion that M. tuberculosis may have evolved from two different lineages (high and low IS6110 copy number). In addition, we found that the lengths of many VNTR loci had statistically significant relationships to each other. These relationships could cause a restriction of the VNTR typing discriminating capability to some extent. Our results suggest that VNTR-PCR typing is practically useful for application to molecular epidemiological and phylogenetic studies of M. tuberculosis. The discriminating power of the VNTR typing system can still be enhanced by the supplementation of more VNTR loci. |
|
|---|
Variable-number tandem repeats (VNTR), often referred to as micro- or minisatellite DNA, are ubiquitous in eukaryotes and humans. They have been extensively studied in humans, and the fingerprinting methods based on them are successfully used for paternity and genetic linkage tests between human individuals (13). VNTR can be directly amplified by PCR and analyzed by agarose gel electrophoresis systems commonly available in general laboratories. Therefore, VNTR-based PCR analysis is easy, rapid, and highly specific and can be conducted by investigators worldwide. In addition, VNTR typing generates portable digit-based data, unlike the analog information obtained from RFLP-based fingerprinting methods. In this regard, investigators can easily compare the genotypic data of independent studies between different laboratories. With all of these advantages, the use of VNTR for investigating M. tuberculosis has attracted great attention from investigators in recent years.
VNTR of M. tuberculosis have been initially identified by many investigators (6, 9, 12, 20, 30). Using a homology search (27) we found 49 potential VNTR at 48 loci in the M. tuberculosis genome. Some of these repeats were equivalent to the major interspersed repeated units (MIRU) (31). In that study, we also performed in silico genome comparison and confirmed that at least 22 loci of the repeats in the two sequenced strains of M. tuberculosis, H37Rv and CDC1551, were polymorphic. In the meanwhile, a number of VNTR loci were introduced into epidemiological studies (7). At first, compared to the standard IS6110 RFLP typing method, VNTR typing was less discriminating (16), probably because too few VNTR loci were used. However, many investigators attempted to perform high-resolution VNTR typing as the number of new informative VNTR increased (17, 23, 26, 28, 31). Nowadays, the discrimination capacity of VNTR typing has been much improved, to a level comparable to that of IS6110 RFLP (18). Moreover, VNTR typing has a greater discriminating power for M. tuberculosis with low-copy-number IS6110 than RFLP typing (18).
Herewith, we report the allelic polymorphisms of all 48 VNTR loci previously identified by our group, including 13 novel ones, among M. tuberculosis strains in Thailand. Many of the novel loci were informative and could be useful for future epidemiological studies. We affirmed that VNTR typing had greater discrimination than IS6110 RFLP typing. We also found that the allelic diversity of VNTR loci could greatly vary in the different RFLP types of M. tuberculosis. In addition, this study also demonstrates correlations of the lengths of many VNTR loci.
|
|
|---|
The hospital is a 400-bed provincial hospital of the Amnat Charoen province, which borders the southern part of Laos and is about 570 km from Bangkok, the capital of Thailand. The population in the province is about 330,000, with the majority working in the agricultural sector. There are no trains or flights to the province, but the province is readily accessible by car. The prevalence of tuberculosis in the province is about 105/100,000 population. The human immunodeficiency virus infection rate among pregnant women is 0.6%.
All of the samples were isolated by cultivation on Lowenstein-Jensen medium. M. tuberculosis H37Rv and Mt14323 were originally obtained from the National Reference Center for Tuberculosis, Canada, and Jan D. A. van Embden, respectively.
DNA extraction and IS6110 RFLP fingerprinting. The bacteria were cultured in Lowenstein-Jensen medium for 3 weeks. The cells were then harvested, and chromosomal DNA was extracted by an enzymatic lysis method (22). Two micrograms of DNA of each isolate, and the Mt14323 strain, which was used as a marker, was then digested with PvuII and Southern blotted to a nylon filter. The filter was then hybridized with digoxigenin-labeled plasmid pDC73, which contained a segment of IS6110. The details of the methods were previously described (22).
The IS6110 hybridization patterns were analyzed by Gelcompar II version 1.5 (Applied Maths, Kortrijk, Belgium). Isolates were classified as members of the Beijing family (n = 21) based on 78% or more similarity to the previously described isolates (8). The rest of the bacteria had a single band (n = 18), a few (two to five) bands (n = 28), or more than five hybridized bands with heterogeneous banding patterns (n = 24).
VNTR polymorphism analysis.
PCR was performed for each of the 48 loci of the tandem and direct repeats that were previously identified by our group (27). The primers (Table 1) were designed from the genome sequence of M. tuberculosis H37Rv. In general, the final PCR mixture was composed of 10 mM Tris-HCl, pH 9.0, 50 mM KCl, 0.1% Triton X-100, 1.5 mM MgCl2, 200 µM concentrations of each deoxynucleoside triphosphate, a 0.5 µM concentration of each primer, 2.5 U of Thermus aquaticus (Taq) DNA polymerase enzyme, and 40 ng of DNA template in a total volume of 50 µl. Thermocycling conditions included a denaturation step at 95°C for 1 min, an annealing step at 55 to 65°C for 1 min, and an extension step at 72°C for 2 min. PCR was done for 30 cycles in a PCR gradient machine (Eppendorf, Germany). In addition, the reaction mixtures of some primer pairs were supplemented with 4% dimethyl sulfoxide for improving amplification. The amplified products were analyzed by electrophoresis in 1 to 2% agarose gels at 100 V. Visualization was done on a UV light illuminator after ethidium bromide staining. To estimate the lengths of the amplified products, the migratory distances of the amplified products were used to compare with those of DNA fragments of the
DNA cut by PstI. The copy number of the amplified products was inferred from the difference between the molecular weights of the amplified products of the samples and those of the H37Rv strain. If there were doubts regarding the number of copies of VNTR, the amplified DNA was purified by QIAquick PCR purification kit or gel extraction kit (QIAGEN, Germany) and sequenced.
|
View this table: [in a new window] |
TABLE 1. Sequences of primers used for amplifying VNTR in this studya
|
(allele frequencies)2 (15). Genetic relationships among the isolates were estimated by the unweighted pair group method with arithmetic averages in PAUP version 4.0 b1 software using the distance measurement according to total character differences of the copy numbers of VNTR. The correlation coefficient (r) between the numbers of copies of VNTR was calculated for every pair of VNTR loci. A statistical significance of
= 0.01 was used. |
|
|---|
-isopropylmalate synthase gene (leuA). The latter was shown to code for the polymorphic
-isopropylmalate synthase in M. tuberculosis (2). |
View this table: [in a new window] |
TABLE 2. PICs of the 48 potential VNTR locia
|
![]() View larger version (82K): [in a new window] |
FIG. 1. PCR products of various clinical isolates using primers that amplify VNTR3820. VNTR3820 has from 32 copies (lane 1) to 3 copies (lane 16) of the repeat unit. Lane M denotes the DNA cut by PstI.
|
![]() View larger version (82K): [in a new window] |
FIG. 2. Comparison of the sequences of M. tuberculosis H37Rv and the isolates containing a partial deletion of the VNTR sequence. (a) A 54-bp-long fragment of VNTR4052 and its flanking sequence are absent from isolate 667. There are two perfect 18-bp direct repeats (bold) at the border of the deleted fragment. VNTR4052 resides in a possible coding region (Rv3611). The start and stop codons of Rv3611 are double underlined. (b) A 24-bp-long fragment is absent from VNTR0580. Two pentaguanine runs are found at the border of the deleted sequence. VNTR regions are marked by thick single underlines. Boxed nucleotides and dashes denote the first nucleotide of a VNTR copy and the deleted regions, respectively.
|
Allelic diversity of VNTR loci.
The PIC of each VNTR locus was calculated (Table 2). We categorized the polymorphism arbitrarily into three levels: high (
0.6), moderate (0.4 to <0.6), and low (<0.4). Ten loci (three novel: VNTR3820, VNTR4120, and VNTR0569) were considered high discriminants. Thirteen loci (one novel: VNTR2372) were moderate discriminants, and the remaining loci (five novel) were low discriminants. The PIC values of some VNTR loci of various IS6110 RFLP types were different. For examples, 3 of the 10 most polymorphic loci were not polymorphic at all among the Beijing strains, while the other 3 loci were lowly polymorphic. Similarly, among the single-banded isolates, the polymorphisms of 4 of the 10 most polymorphic loci were much lower than average.
Phylogenetic analysis of M. tuberculosis. In assessing the genetic relationships between M. tuberculosis strains and the discriminating power of VNTR typing for M. tuberculosis, the 10 VNTR loci with the highest average PICs, including the three new VNTR loci, were used as the representative VNTR typing set. The set could potentially provide >2 x 109 allele combinations.
Two major separations on the phylogenetic tree were obtained. The larger group (64 isolates) did not contain any Beijing strains. Virtually all of the isolates in this group had one to eight copies of IS6110, with the exceptions of the three heterogeneous strains, which had more than eight copies. The number of copies of IS6110 appeared not to associate with any segregation in the phylogenetic tree of this group. In contrast, the smaller group contained all of the 21 Beijing strains as well as the 4 other isolates possessing >10 copies of IS6110, an 8-IS6110-copy isolate, a single-IS6110-copy isolate, and the H37Rv strain. All Beijing isolates were closely formed into a subgroup (Fig. 3).
![]() View larger version (83K): [in a new window] |
FIG. 3. Dendrogram of genetic relationships of 92 isolates of M. tuberculosis (denoted by numbers) based on the 10 VNTR loci. The consensus tree is built using the unweighted pair group method with arithmetic averages algorithm in the PAUP program. The isolates are classified into two separate lineages: high IS6110 copy number (more than eight IS6110 copies, except for the single-banded isolate S.609 and the eight-banded isolate H.101) and low IS6110 copy number (one to eight IS6110 copies, except for the three heterogeneous isolates H.664, H.2664, and H.2719, which have more than eight IS6110 copies). Letters B, F, H, and S denote the different IS6110 types: Beijing, few-banded, heterogeneous, and single-banded isolates, respectively. The VNTR profiles are shown in the 10-digit column, representing the number of copies of the repeat units in VNTR0569, VNTR0580, VNTR0595, VNTR0960, VNTR1955, VNTR2461, VNTR3690, VNTR3820, VNTR4052, and VNTR4120, respectively. Asterisks at VNTR0580 and VNTR4052 indicate partial deletions in the VNTR sequences. The IS6110 RFLP profiles are shown, with high to low molecular weight oriented from left to right. The top and bottom rows represent the IS6110 RFLP of the Mt14323 strain.
|
Excluding the isolates with five copies of IS6110 or less, VNTR typing gave a slightly better discrimination than IS6110 RFLP typing (of 41 distinct VNTR and 40 RFLP patterns) among the 46 isolates, whereas both methods could differentiate the isolates into 44 distinct patterns. There were 18 unique RFLP and 17 distinct VNTR patterns among the 21 Beijing strains. The isolates in the Beijing group were differentiated into 20 distinct patterns by both methods. In the heterogeneous group, VNTR typing showed a slightly better discrimination than the IS6110 method, with 24 and 22 unique patterns, respectively. VNTR typing for the 28 few-banded isolates also rendered a slightly better resolution than IS6110 RFLP typing (25 and 23 patterns, respectively). The few-banded isolates were dissolved into 26 patterns by both methods. For the 18 single-banded strains, VNTR typing clearly generated a greater number (17) of patterns than IS6110 typing (3 patterns; 1.4-, 5.0-, and 5.5-kb-long bands). Two isolates (one heterogeneous and one few-banded strain) had identical VNTR patterns.
Relationships of results of VNTR typing at different loci.
Based on the lengths of all 39 VNTR loci, a correlation coefficient of each pair of VNTR loci was calculated. Among all possible relationships, 125 pairs of correlated VNTR loci were found across 32 VNTR loci, 65 being positive and 60 being negative (for
= 0.01). Among these 32 VNTR loci, 4 correlating pairs did not correlate with any other VNTR loci; VNTR0514 and VNTR2703, VNTR0917 and VNTR1907, VNTR1305 and VNTR2531, and VNTR3007 and VNTR3336. Twenty-three loci had different degrees of either positive or negative correlations to more than one other locus. It was interesting that there were 17 loci forming two major correlating groups. Each group was composed of several VNTR loci that had positive correlations to the others in the same group (Fig. 4). However, members of these two groups were negatively correlated. It was interesting that 7 of the 10 highest discriminants were within these two groups as well. The last locus (VNTR2074) had a correlation with only VNTR0960, although the latter correlated with others. Seven loci did not correlate to any other loci at all: VNTR0577, VNTR1955, VNTR3155, VNTR3171, VNTR3192, VNTR3239, and VNTR4155.
![]() View larger version (34K): [in a new window] |
FIG. 4. The lengths of 17 VNTR loci appear to correlate, and these loci can be classified into two groups. Correlation coefficients (r; in boxes) were calculated for each pair of VNTR loci based on the lengths of the alleles. Negative and positive r values indicate negative and positive correlations, respectively. Zeros denote statistically insignificant correlations. Two major groups of VNTR loci, one marked with asterisks and the other not marked, represent the loci having positive correlations to one another in the same group (shaded cells). However, VNTR loci of these two groups are negatively correlated to each other (unshaded cells).
|
|
|
|---|
Short mutations in the different copies of the VNTR sequence, which consist of few substitutions, insertions, and deletions (indels), could be seen in many VNTR loci of M. tuberculosis (23, 31). In contrast, long indels in the VNTR sequence were rarely found. To our knowledge, up to now, only one VNTR locus (VNTR0580 or MIRU4) has been reported to have such a long mutation, which was confirmed in this study (27, 31). This study revealed an additional locus (VNTR4052) having a 54-nucleotide deletion in the VNTR and the nearby flanking sequence. Sequence analysis revealed that the deletion at VNTR4052 was most likely caused by the homologous recombination of the perfect 18-bp direct repeats lying exactly at the boundaries of the deleted sequence (Fig. 2). The proportion of the isolates containing the deletion in the allele of VNTR4052 was relatively low and had no correlation with IS6110 patterns, suggesting that deletion in this region may not have resulted from a single mutational event. Although VNTR4052 resided in a possible coding unit (Rv3611), such internal deletion did not affect the translational frame of the remaining coding sequence. Internal deletion was also observed in alleles of VNTR0580 in this study, possibly associating with the notable pentaguanine (G5) direct repeats lying at the border of the deleted sequence (Fig. 2). However, the underlying cause of this deletion could not be explained by the general recombination process.
The deletion in VNTR4052 also suggests that homologous recombination between different repeat units of the VNTR sequence can occur and contribute to the allelic variation of VNTR. The recombination can either reduce or increase the copy number. The latter case occurs if a double crossover between two DNA strands occurs after the chromosome replication fork has passed the site but before the two daughter chromosomes have been partitioned into the two daughter cells. Alternatively, the addition and deletion of VNTR copies could be generated from the DNA polymerase slippage process, as also suggested from a previous study (32). This mechanism is typically characterized by the stepwise alteration of the complete repeat unit in the variable alleles. In this study, many alleles of VNTR loci had only the incomplete copy and lacked a complete repeat unit. The isolates without the complete repeat unit should not be able to regain the variability property, and therefore VNTR in these loci are practically lost.
The polymorphism of each tandem repeat locus was found to be different. The degree of polymorphism may relate inversely with the selective pressure acting on each locus. However, there appeared to be no differences between the degrees of polymorphism of VNTR residing inside and outside coding sequences. In the absence of selective constraints, one may assume that the diversity of a particular VNTR is correlated with its evolutionary rate. Therefore, a VNTR locus with a high level of polymorphism would be assumed to have a faster evolutionary rate than a locus with a lower level. If so, the rate seemed not to relate to the length of the repeat unit or the position in the genome.
Many VNTR loci in this study had moderate or high allelic diversity (PIC
0.4). These VNTR loci are useful for the differentiation of M. tuberculosis strains. It was shown in this study that the 10-VNTR set had a resolution comparable to standard IS6110 RFLP typing when tested with the panel of M. tuberculosis isolates containing high copy numbers (more than five) of IS6110; meanwhile, this VNTR set could remarkably differentiate the isolates having only one IS6110 copy. These findings are similar to those of other studies using different VNTR typing sets and M. tuberculosis isolates from various geographical origins (18, 29), which also suggests that VNTR typing is suitable for use in the global epidemiological study of tuberculosis.
Molecular genotyping based on VNTR-PCR analysis has several advantages over standard IS6110 RFLP and other typing methods (16, 19). Also, an individual VNTR locus can be examined independently, giving the investigators the flexibility of using it to modify and improve their own typing format. We found that many VNTR had biased diversity for each IS6110 RFLP type; for instance, VNTR3232 gave the maximal diversity for Beijing strains, whereas this locus showed very low or no discriminating value for the remaining RFLP types. This strain-dependent property of VNTR can be beneficial to the investigation of an outbreak caused by a particular IS6110 type of M. tuberculosis.
VNTR-based analysis can be used to infer the phylogenetic relationships of M. tuberculosis strains. A dendrogram constructed from the 10-VNTR set agrees with the hypothesis that M. tuberculosis may have evolved from two separate lineages, the high- and low-IS6110-copy-number isolates. In this study, the high-IS6110-copy-number group was mostly composed of the isolates having >10 IS6110 copies (Fig. 3). All Beijing isolates were included in this high-IS6110-copy-number group, which conforms to the notion that the Beijing strains constitute a homogeneous M. tuberculosis family. At the same time, the isolates in the low-copy group comprised those possessing one to four copies of IS6110 as well as those having five to eight IS6110 copies. The latter were classified in the high-copy lineage in a previous analysis (18).
In this study, many VNTR loci exhibited significant copy number correlations. These correlations may be caused by several factors. We observed that correlations of VNTR loci could occur between loci with different degrees of allelic diversity. It is probable that the correlations between VNTR loci having low allelic diversity may represent false correlations occurring from the combined effect of the small number of allele combinations between the VNTR loci and the biased allele frequencies of the VNTR loci. In contrast, significant relationships of the VNTR loci having relatively high allelic diversity were unlikely to occur artificially due to a large number of allele combinations between the VNTR loci and should represent the actual correlations. This may happen because the bacterial populations in the study were of multiple clonal origins and M. tuberculosis possesses few mechanisms for horizontal genetic exchanges. If the number of copies of the repeats varies gradually, the correlation of the lengths between some sites may just be the result of the fact that the ancestors originally had VNTR of different lengths at those sites. Alternatively, the correlated VNTR may be subject to the same biological constraint, such as binding to the same proteins, and therefore tend to evolve in the same way. Similar associations between different VNTR loci were also previously recognized in M. tuberculosis (32).
Correlations of VNTR could limit the number of possible patterns among M. tuberculosis isolates. As a consequence, these relationships would reduce to some extent the potential discriminating power of the VNTR typing system composed of the correlating VNTR loci.
The present VNTR typing systems could not define all unique isolates and still require the complementation of other typing methods, such as IS6110 RFLP. The power of the VNTR typing system can be improved by the supplementation of an extra number of VNTR loci. It was shown in this study that our 10-VNTR set was powerful for distinguishing M. tuberculosis isolates. This VNTR set has already been developed in our laboratory for application in the molecular epidemiological study of M. tuberculosis. We adopted multiplex PCR with three separate sets of primers to target those 10 VNTR loci, in conjunction with analysis by agarose gel electrophoresis. Without the requirement of the automated gel analysis equipment, our system is inexpensive and so can be exploited by general researchers. We are now applying this 10-VNTR typing format to examining different M. tuberculosis populations in Thailand.
This work was supported by the National Science and Technology Development Agency (NSTDA), Thailand.
Supplemental material for this article may be found at http://jcm.asm.org/. ![]()
Present address: National Institute for Medical Research, Mwanza Medical Research Center, Mwanza, Tanzania. ![]()
|
|
|---|
-isopropylmalate synthase in Mycobacterium tuberculosis. Tuberculosis 81:1-6.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»