Previous Article | Next Article ![]()
Journal of Clinical Microbiology, December 2006, p. 4498-4510, Vol. 44, No. 12
0095-1137/06/$08.00+0 doi:10.1128/JCM.01392-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
,
Sabine Rüsch-Gerdes,4
Eve Willery,1,2
Evgueni Savine,1,2
Petra de Haas,5
Henk van Deutekom,6
Solvig Roring,7
Pablo Bifani,1,2,3,8
Natalia Kurepina,8
Barry Kreiswirth,8
Christophe Sola,9
Nalin Rastogi,9
Vincent Vatin,2,10
Maria Cristina Gutierrez,11
Maryse Fauville,3
Stefan Niemann,4
Robin Skuce,7
Kristin Kremer,4
Camille Locht,1,2 and
Dick van Soolingen4
INSERM, U629,1 Institut Pasteur de Lille, Lille, France,2 Institut Pasteur de Bruxelles, Laboratoire Tuberculose et Mycobactéries, Brussels, Belgium,3 Forschungszentrum Borstel, National Reference Center for Mycobacteria, Borstel, Germany,4 Diagnostic Laboratory for Infectious Diseases and Perinatal Screening, National Institute for Public Health and the Environment, 3720 Bilthoven,5 Department of Tuberculosis Control, Municipal Health Service, Amsterdam, The Netherlands,6 Veterinary Sciences Division, Department of Agriculture and Rural Development, Belfast, Northern Ireland,7 TB Center, Public Health Research Institute, Newark, New Jersey,8 Unité de la Tuberculose et des Mycobactéries, Institut Pasteur de Guadeloupe, Guadeloupe,9 CNRS, UMR 8090, Lille, France,10 Laboratoire de Référence des Mycobactéries, Institut Pasteur, Paris, France,11
Received 6 July 2006/ Returned for modification 18 August 2006/ Accepted 12 September 2006
|
|
|---|
|
|
|---|
Genotyping based on variable numbers of tandem repeats (VNTRs) of different classes of interspersed genetic elements named mycobacterial interspersed repetitive units (MIRUs) (12, 25, 32, 36, 40, 43, 44) is increasingly used to solve these problems. This method relies on PCR amplification of multiple loci using primers specific for the flanking regions of each repeat locus and on the determination of the sizes of the amplicons, which reflect the numbers of the targeted MIRU-VNTR copies. MIRU-VNTR typing is technically flexible, as sizing can be done using capillary (1, 24) or gel (28) electrophoresis or nondenaturing high-performance liquid chromatography (8). It is considerably faster than IS6110-RFLP typing, is applicable to crude DNA extracts from early mycobacterial cultures, and has been adapted to high-throughput conditions (1, 24, 42). Moreover, the results are expressed as numerical codes and are therefore very easy to compare and exchange.
Among different sets of MIRU-VNTR loci described for typing M. tuberculosis isolates (12, 14, 18, 21, 25-27, 32, 33, 36, 39, 45), a system based on 12 loci (28, 44) is currently the most widely used and has been integrated in TB control systems on a national scale in, e.g., the United States (1, 6). Based on pilot studies with limited numbers of isolates, the discriminatory power of this set approached that of IS6110-RFLP typing to discriminate epidemiologically unrelated cases (28, 42), while the genotypes based on this set are stable among isolates from epidemiologically linked cases (4, 16, 24, 34). A recent population-based study indicated that the use of this system as a first-line method in combination with spoligotyping provides adequate discrimination in most cases for large-scale, prospective genotyping of M. tuberculosis in the United States. However, IS6110 fingerprinting is still required as an additional method to type the clustered isolates in a number of cases, when contact investigation or demographic or epidemiological data do not provide independent clues to the existence or the absence of links between patients (6).
Alternative sets of MIRU-VNTR loci have been suggested to further improve the discrimination of unrelated isolates, compared to that provided by this 12-locus system (18, 21, 25, 32, 33, 36, 39, 45, 49). However, the collections of isolates studied were restricted to small samples of local origin and/or included only Mycobacterium bovis or representatives of only one or two of the defined M. tuberculosis lineages. The overall technical robustness and the clonal stability of the individual MIRU-VNTR loci in the sets tested were not assessed. Furthermore, none of these studies were nonselected, population-based studies, and contact tracing data were not available, making it impossible to establish the predictive value of the various sets for studying ongoing M. tuberculosis transmission at the population level.
Here, we have investigated the resolution power and the clonal stability and technical applicability for molecular epidemiological typing of 29 MIRU-VNTR loci, comprising our original 12 loci and most of the other exploitable loci disclosed so far. These parameters were tested on 824 tubercle bacillus isolates, including worldwide representatives of the main M. tuberculosis lineages identified by the use of a diversity of genetic markers, as well as multiple groups of epidemiologically linked or clonal isolates. On this basis, an optimized set of 24 loci was defined, including a highly discriminatory subset of 15 loci for specific first-line epidemiological investigations. The predictive value of this optimized set for evaluating M. tuberculosis transmission was evaluated in a companion population-based study (M. Cardoso-Oelemann, R. Diel, V. Vatin, W. Haas, S. Rüsch-Gerdes, C. Locht, S. Niemann, and P. Supply, submitted for publication) and compared to that of IS6110-RFLP typing.
|
|
|---|
A second set assembled at the National Institute for Public Health and the Environment, The Netherlands, included 61 single-colony cultures obtained from 13 different M. tuberculosis complex strains, which were previously used in the study described in reference 7, as well as a collection of 110 isolates, representing 52 different groups of related strains from different countries. The latter collection comprised pairs or groups of serial M. tuberculosis isolates obtained from 42 individual patients from Belgium (n = 1), Denmark (n = 12), Estonia (n = 4), The Netherlands (n = 17), the United States (n = 7), and Vietnam (n = 1), as well as 10 groups of clustered isolates obtained from different patients from The Netherlands linked by contact tracing, with one or two human-to-human transmissions (K. Kremer, R. Warren, K. Brudey, R. Skuce, C. Gutierrez, T. Lillebaek, E. Fair, C. Arnold, G. Saunders, and D. van Soolingen, Abstr. EU Concerted Action Project Meet., Prague, Czech Republic, p. 58, 2003).
A third set included isolates from 125 patients in 42 IS6110-polymorphic GC-rich sequence (PGRS) RFLP strain clusters, assigned to different transmission groups (TGs) according to the likelihood of epidemiological linkage between patients after thorough contact tracing and epidemiological analysis. These clusters were identified in a population-based study conducted in the province of North Holland, The Netherlands, from July 1998 to July 2000 (46, 47).
MIRU-VNTR genotyping. The isolates were genotyped by PCR amplification of the original 12 MIRU-VNTR loci as described in reference 44 and 17 other loci containing VNTRs of other interspersed sequences (25, 32, 36, 40, 44). They are collectively referred to as MIRU-VNTR in this study. The primers and the conditions for their amplification, their standardized designation, and correspondence with alias designations are described in Tables 1 and 2. Most analyses were performed using multiplex PCR, Rox-labeled MapMarker 1000 size standard (Bioventures), and gel (ABI 377) or capillary electrophoresis-based (ABI 3730-XL) sequencers at the Institut Pasteur de Lille, except for PCR fragments with sizes above 1,000 bp, which were also analyzed by electrophoresis at 150 V for 6 h 30 min using 25-cm gels and 1.2% Ultra Pure electrophoresis-grade agarose (Gibco BRL). Subsets of isolates or loci were analyzed at the Institut Pasteur de Bruxelles and at the National Reference Center for Mycobacteria, Germany, or at Queen's University of Belfast, Northern Ireland, using 3100-Avant capillary electrophoresis-based sequencers or agarose gel electrophoresis, respectively. Sizing of the PCR fragments and assignment of the various VNTR alleles were done using customized GeneScan and Genotyper or Genemapper software packages (PE Applied Biosystems). The reproducibility and accuracy of sizing and the size offsets, which correct differences in relative migration between the size standard and the amplicons depending on the locus and the polymer used for capillary electrophoresis, were checked and standardized among the different laboratories by analyzing selected PCR fragments amplified from M. tuberculosis H37Rv and other reference isolates, as described previously (1, 42).
|
View this table: [in a new window] |
TABLE 1. Locus designations and PCR primer sequences used in this study for the 24-locus set
|
|
View this table: [in a new window] |
TABLE 2. Locus designations and PCR conditions for excluded locia
|
Spoligotyping. Spoligotyping was performed according to the previously described method (19). Spoligotype families were assigned as described in reference 10.
Allelic diversity, genetic distance, and clustering analysis.
The MIRU-VNTR allelic diversity (h) at a given locus was calculated as h = 1
xi2 [(n/n 1)], where xi is the frequency of the ith allele at the locus and n is the number of isolates. Minimum spanning tree analysis was performed using Bionumerics (Applied Maths, Belgium) and a categorical coefficient. The priority rule was to first link types that had the highest number of single-locus variants (SLVs). To enable easier detection of locus variations between existing types, creation of hypothetical types was not allowed. The frequencies of SLVs, double-locus variants (DLVs), and triple-locus variants (TLVs) for the different MIRU-VNTR loci were calculated using alleles detected in the different complexes defined by MIRU-VNTR relationships and corroborated by consistent spoligotype family designations. The phylogenetic consistency of the Ghana and Uganda designation (with two subfamilies in the latter case) was supported by independent analysis of genomic deletions (T. Wirth et al., unpublished). The clustering rate was defined as (nc c)/n, where n is the total number of cases in the sample, c is the number of genotypes represented by at least two cases, and nc is the total number of cases in clusters of two or more patients (38).
|
|
|---|
The problems of these five loci consisted of the absence of PCR products, PCR products that were difficult to interpret, and amplification of multiple alleles, as detailed below. No allele could be amplified from locus 2163a of four isolates despite several attempts. Repeated PCR assays were regularly required to obtain interpretable single amplicons from large alleles of locus QUB-18, even after simplex PCR. For loci 3232 and 3336 two distinct alleles were simultaneously detected in a total of 10 isolates and for locus QUB-1895 in one isolate. These patterns were clearly different from the stutter peaks frequently observed with some VNTRs, as described previously (42). These double alleles were also detected when different DNA extracts, PCR conditions, and primers were used but were not observed for any of the other loci using the same DNA from these isolates, ruling out contamination by foreign DNA or mixed infections. In addition, for locus 3336 a range of intermediate alleles, corresponding to inclusion of repeats with apparent sizes of 5 to 25 bp instead of the 56 bp expected for a full-size repeat, were detected on sequencers, complicating the interpretation of the respective VNTR patterns. When loci 3232, 3336, and 1895 were analyzed by two different laboratories by using the automated and/or the manual techniques, 35 allelic discordances were found for the 90 isolates, of which 17 were for locus 3232, 13 for 3336, and 4 for 1895. In contrast, a single discordance and no discordance were found for MIRU 26 and ETR B, respectively, added as controls. The discordances resulted from differential amplification or interpretation of dual alleles, in addition to simpler sizing problems (see Note S1 in the supplemental material for details).
Finally, QUB-3232, 3336, and 2163a appeared hypervariable compared to the other loci, as seen from the respective numbers of alleles detected (21, 18, and 14), maximal numbers of repeats (25, 21, and 29), and allelic diversities (0.92, 0.91, and 0.87) (Table 3), which can explain part of the problems encountered.
|
View this table: [in a new window] |
TABLE 3. Parameters of locus variability among geographically diverse isolates
|
By using the 24 remaining MIRU-VNTR loci, 89 distinct genotypes were distinguished for the 90 isolates included in this standardized collection. The only remaining cluster was composed of the clonal M. tuberculosis strains H37Rv and H37Ra, which were also undistinguishable by spoligotyping. Even when the five problematic loci were considered, these two strains remained clustered. By testing multiple combinations of markers within the set of 24 robust loci, we found that the maximal resolution of 89 types in this collection could already be achieved using a minimal group of nine markers (MIRU 04, 10, 16, 26, and 40 and VNTR 0577, 2163b, 2165, and 4052).
Stability of MIRU-VNTRs among epidemiologically linked or serial patient isolates and single-colony cultures. The clonal stability of the individual MIRU-VNTR loci was evaluated using a total of 171 isolates, including a standardized set of 52 different groups of epidemiologically linked or serial isolates from six different countries and 13 groups of 2 to 10 single-colony cultures obtained from different M. tuberculosis complex strains (Table 4).
|
View this table: [in a new window] |
TABLE 4. MIRU-VNTR stability among 65 groups of clonal M. tuberculosis complex isolates
|
In contrast, at least eight changes were assigned in total for the five excluded loci among the epidemiologically linked or serial isolates and one among the single-colony cultures. Two laboratories analyzed independently loci VNTR 3232 and 3336 from the 52 epidemiologically linked or serial isolate groups and scored for these loci three and four, and seven and zero, changes, respectively. Some changes were scored as uncertain, as the patterns were irreproducible or repeatedly complex with two or even three alleles, and there was no correspondence between any changes scored as certain or possible by the two laboratories for these two loci. A total of 17 discordant results were noticed among the alleles assigned by the two laboratories in the 52 groups. Finally, noninterpretable long amplicons were obtained for locus 1895 of nine epidemiologically linked, serial isolate, and single-colony culture groups (see Note S2 in the supplemental material for details), and locus QUB-18 could not be amplified from one isolate.
MIRU-VNTR variability in an extended collection of isolates from cosmopolitan origins. In order to analyze the relative evolutionary rate and the resolution power of various marker sets in a sample that is more representative of the worldwide diversity of M. tuberculosis, the above-described standardized collection of 90 M. tuberculosis complex isolates was extended with 404 additional isolates to include a total of 494 isolates from widespread origins, representing the main M. tuberculosis spoligotype prototypes identified so far and the other, more rarely encountered members of the complex. The analysis was restricted to the 24 robust loci in combination with spoligotyping.
The 24 loci resolved 446 types among the 494 isolates (Fig. 1 and 2); 420 isolates had a unique profile, while the remaining 74 isolates were in 26 clusters. All but one of the MIRU-VNTR clusters were composed of two or three isolates of the same spoligotype family. The single larger cluster included 10 epidemiologically related multidrug-resistant (MDR) isolates from the Beijing/W family. Only eight additional types among the 494 isolates were identified when spoligotyping was combined with the 24-locus MIRU-VNTR typing, bringing the total number of types up to 454 by combining the two methods. The values and the overall hierarchies of the allelic diversities among the 24 loci were similar between the small standardized collection and the extended one (Table 3), including 404 additional isolates.
![]() View larger version (15K): [in a new window] |
FIG. 1. Discrimination by MIRU-VNTR typing alone or in combination with spoligotyping in a collection of isolates from cosmopolitan origins. The number of types distinguished among 494 isolates of the M. tuberculosis complex (A) and the corresponding clustering rates (B) obtained by different combinations of markers are shown on the y axes. Because of their particular population structure characterized by homogeneous clonal groups with horizontal gene transfers among them (15), M. prototuberculosis isolates were not considered for this analysis. Diamonds and squares correspond to values obtained using MIRU-VNTR (MV) alone and in combination with spoligotyping (spol), respectively. The composition of the different MIRU-VNTR sets is given in the inset table; 24, full set of 24 robust loci; 15, 15-locus discriminatory subset; Old12, original 12 MIRU-VNTR loci; 9, minimal set resulting in maximal resolution in the standardized collection of 90 isolates (see text).
|
![]() View larger version (20K): [in a new window] |
FIG. 2. Minimum spanning tree based on MIRU-VNTR relationships among tubercle bacilli. Circles correspond to the different types identified by the set of 24 loci among 494 M. tuberculosis complex isolates from cosmopolitan origins and 35 M. prototuberculosis isolates (as reference) and are proportional to the number of clustered isolates with an identical MIRU-VNTR type. The corresponding species names and spoligotype family designations (except T types) are indicated. Linkages by a single, double, or triple locus variation are boldfaced. EAI, East African-Indian; LAM, Latin American-Mediterranean; CAS, Central Asian; S, S spoligotype family; X, X spoligotype family.
|
![]() View larger version (67K): [in a new window] |
FIG. 3. Distribution of single, double, or triple-locus variations in 24 MIRU-VNTR loci among isolates from cosmopolitan origins. Events detected among 494 isolates from widespread geographic origins of the M. tuberculosis complex are shown. Because of their particular population structure, characterized by homogeneous clonal groups with horizontal gene transfers among them (15), M. prototuberculosis isolates were not considered for this analysis. eai, East African-Indian; lam, Latin American-Mediterranean; ory, M. bovis from oryxes; S, S spoligotype family; X, X spoligotype family; bcg, M. bovis BCG; h, Haarlem; mic, Microbacterium microti; sea, M. bovis from seals; cas, Central Asian; bov, classical M. bovis; T, T spoligotypes; bj, Beijing (including W); ug, Uganda; afr, M. africanum; gha, Ghana; cam, Cameroon; cap, Mycobacterium caprae.
|
Analysis of IS6110-RFLP clusters from a population-based study with the discriminatory subset of 15 MIRU-VNTR loci.
The few changes observed in some of the 24 robust loci in the serial isolates and epidemiologically linked and single-colony culture groups described above were all confined to markers composing the 15-locus discriminatory subset. The epidemiological relevance of these changes was therefore assessed by examining isolates from 40 high-copy-number (
6) and two low-copy-number IS6110-PGRS RFLP clusters of a previously published population-based study (46, 47). Based on thorough epidemiological analysis these clusters had been classified into groups with proven epidemiological links (TG1) and proven contacts after DNA fingerprint data had become available (TG2), likely contacts (TG3), or no epidemiological links (TG4).
Among the 24 IS6110-PGRS RFLP clusters of epidemiologically linked patients, comprising 57 isolates from TG1 and TG2, only two (clusters 8 and 13) were subdivided by changes in the 15-locus subset (Fig. 4). In both cases, the changes consisted of an SLV in one clustered isolate, affecting locus ETR C or QUB-2163b, respectively.
![]() View larger version (64K): [in a new window] |
FIG. 4. IS6110 RFLP, spoligotype, and MIRU-VNTR patterns of the M. tuberculosis isolates from 125 patients in 42 IS6110-PGRS clusters, as assigned to four transmission groups (for explanation of transmission groups, see text). Designations of MIRU-VNTR loci are given according to the position (in kilobase pairs) on the M. tuberculosis H37Rv chromosome. Alias designations are in parentheses. Spoligo, spoligotyping. Results of IS6110-RFLP, spoligotyping, and MIRU-VNTR loci from VNTR 424 to 4156 were taken from reference 47; results for VNTR 2165 (ETR A), VNTR 1955, 2163b (QUB-11b), and 4052 (QUB-26) are from this study. Differences in MIRU-VNTR patterns among IS6110-PGRS RFLP clustered isolates are boxed. For IS6110-PGRS RFLP cluster 37, the isolate of the patient from TG3 is compared with the isolate of the first patient from TG4, and they thus differ by an SLV in MIRU-VNTR locus 4052. Likewise, for IS6110-PGRS RFLP cluster 38, the isolates of the two patients of category TG3 differ by a four-locus variation (in loci 2165, 1955, 2163b, and 4052).
|
Finally, of the seven IS6110-PGRS clusters of isolates from TG4, six (85.7%) were split by differences in two to six loci of the 15-locus MIRU-VNTR subset.
|
|
|---|
In a standardized collection (23) including 90 isolates from 38 countries, the set of the 24 selected MIRU-VNTR loci reached nearly the maximal resolution, with 89 distinct types. This represents the highest discrimination achieved among all typing methods tested so far in this collection and a significant gain compared to the 78, 81, 82, and 84 types obtained with the 12 original loci, the 12 original locus markers combined with three ETR loci, five QUB loci, and IS6110-RFLP, respectively (for details see references 21, 23, and 42). The only clustered strains were H37Rv and H37Ra, which were derived from a single M. tuberculosis strain in the 1930s and kept thereafter as laboratory strains.
In an extended collection of 494 isolates from cosmopolitan origins, the use of the 24-locus MIRU-VNTR set increased the number of types by 40%, compared to those obtained with the original 12-locus set, respectively. When spoligotyping was used as an additional method, the gain in the number of types was of 23% with the set of 24 loci compared to the original 12-locus set. This lower relative gain in resolution power observed after the addition of spoligotyping consistently reflects the much lower frequency of subdivision by spoligotyping of clusters defined by the 24 loci, compared to subdivision by spoligotyping of clusters defined by the original 12 loci. The clustering rates were consequently decreased by fourfoldand by threefold in combination with spoligotypingwhen the 24-locus MIRU-VNTR set was used instead of the original 12-locus set. Virtually all of the remaining clusters were composed of two to three isolates which were quite evenly distributed within the different spoligotype families, indicating that this 24-locus set discriminates quite evenly and well in the primary M. tuberculosis lineages. The single larger cluster included 10 MDR isolates from the Beijing/W family with subtle differences in complex IS6110 fingerprints. However, the isolates of this cluster also share the same distinctive mutations in rpoB, rpsL, embB, and katG and are known to be part of consecutive epidemic waves in New York City, N.Y., originating from a single MDR clone (29). Some other, smaller clusters were also known to correspond to outbreaks and/or to include isolates with identical IS6110 fingerprints (data not shown). The values and the overall hierarchies of the allelic diversities among the 24 loci were broadly similar between the smaller standardized collection of 90 M. tuberculosis complex isolates (23) and the extended one including 404 additional isolates from widespread origins, despite marked changes in composition and sizes. This observation indicates a good preservation of the relative performances of the individual loci in these different strain populations, although clear differences of diversities were obvious among different lineages for some loci (e.g., allelic diversity of ETR B was 0.74 in East African-Indian isolates, compared to 0 in Central Asian and Beijing isolates).
Therefore, we have analyzed the relative evolutionary rates of the 24 loci by measuring the frequency of their involvements in single- to triple-locus variations among well-characterized isolates. Such an approach, similar to the BURST approach (9), is less sensitive to sampling bias and to problems of saturation (i.e., ancient changes being obscured by more recent ones) than simple analyses of allelic diversities or reductive analysis among diverse strains. If the combined MIRU-VNTR markers are not subject to convergence (independent evolution to the same state), SLVs, DLVs, or TLVs should inherently occur among closely related isolates and thus, on a probabilistic basis, involve loci with the highest evolutionary rates. This assumption was confirmed by the observation that virtually all of the SLVs, DLVs, and TLVs were found among isolates within phylogenetically consistent spoligotype families (i.e., excluding the T family and apparently rare outliers in families such as the Latin American-Mediterranean family; see reference 11). In general a very good concordance between MIRU-VNTR groupings and these spoligotype families was observed (Fig. 2), which also indicates that the selected 24-locus set is highly informative and, not surprisingly, better than the smaller set of the 12 original loci (11, 13, 42) for phylogenetic identification of M. tuberculosis complex lineages. As expected, the loci the most frequently involved in SLVs, DLVs, or TLVs generally showed the highest overall allelic diversities. The converse situation did not necessarily occur, as found for some loci (e.g., MIRU 23), indicating a lower evolutionary rate but rather separate distributions of distinct alleles among different families.
Consistently, a discriminatory subset gathering the 15 robust MIRU-VNTR markers most frequently involved in SLVs, DLVs, or TLVs distributed across the different families concentrated almost all of the resolution (96%, or 98% in combination with spoligotyping) provided by the full set of 24 loci. The nine less-variable remaining robust loci thus provide auxiliary discrimination, consistent with their exceptional involvement in SLVs in spite of sometimes relatively high allelic diversities in some lineages (e.g., ETR B, see above). The usefulness of these less-variable ancillary loci thus mostly resides in their collective contribution to more accurate phylogenetic identification of clones from the different lineages, when used in addition to the other 15 loci (Fig. 2). When some loci of the 15-locus subset were removed from the analysis, the marginal gain obtained with secondary use of spoligotyping gradually increased, but this gain gradually compensated less for the deficit in resolution, as expected. In agreement with higher evolutionary rates, the few clonal changes observed among the 24 robust loci in the epidemiologically linked, serial isolate, and single-colony culture groups were all confined to some markers composing this 15-locus discriminatory subset. Further in accordance with higher evolutionary rates, this subset of 15 MIRU-VNTR loci was able to split six out of the seven clusters of patients from a population-based study for which no epidemiological link could be established and which had isolates with identical IS6110 high-copy-number RFLP fingerprints. This situation likely corresponds to the most stringent conditions to analyze the resolution power in M. tuberculosis. It is noteworthy that cases of subdivision of false clusters based on high-copy-number IS6110 RFLP fingerprints will in general probably remain a minority, as another population-based study showed that in most cases the clustering on the basis of IS6110 RFLP typing and that on the basis of advanced MIRU-VNTR typing are in agreement (Cardoso-Oelemann et al., submitted). However, our data suggest that optimized MIRU-VNTR typing, especially when combined with spoligotyping, may be overall more accurate for cluster analysis than IS6110 RFLP typing (47; Cardoso-Oelemann et al., submitted).
Among the seven changes observed in this 15-locus subset over 123 possible independent events among clones within the epidemiologically linked, serial isolate, and single-colony culture groups, six differences (4.9%) involved only an SLV, and a single one involved a DLV (0.8%). The changes in these SLV cases most likely reflect rare and stochastic MIRU-VNTR mutation events and inherent genetic drift in clonal populations originating from recent transmission (34, 47). However, in the single DLV case, a single-band difference was also observed between the IS6110 RFLP fingerprints of the two serial isolates from the elderly, possibly foreign-born patient involved (data not shown); this distinction concordantly seen with IS6110 RFLP and two independent MIRU-VNTR loci casts some suspicion on the clonal origin of these two isolates (i.e., this patient might have been infected independently by two closely related strains). Likewise, only two differences (6.1%) involving only an SLV were observed over 33 possible independent events in the population-based clusters including patients with a proven epidemiological link (TG1 and TG2). As expected, a slightly higher proportion of differences were observed in the clusters with only a likely link (TG3). In that case, four changes (12.9% of possible independent events) involved an SLV, while two involved a DLV and a four-locus change, respectively. However, the molecular definition of the two latter clusters was based on RFLP profiles with relatively low IS6110 copy numbers, known to be poorly discriminated in spite of the additional use of PGRS fingerprinting and spoligotyping. Therefore, the MIRU-VNTR results cast doubt on the likelihood of the links and thus of the clonal transmission suggested by contact tracing for these two TG3 clusters. Hence, if cautiously only the groups with proven epidemiological links are considered, the chances of erroneous exclusion from a cluster of an isolate showing an SLV or DLV in the 15-locus subset are estimated to be only 5 to 6% or less than 1% (if any; see above), respectively. The probability of incorrect exclusion when mutations involve more than two loci or a locus from the rest of the 24-locus set is close to zero, since no such event was observed among clonal isolates. In contrast to a suggestion from a recent report (35) but in accordance with others studying the 12 original loci (4, 6, 16, 24, 34), these results thus fully support the stability of MIRU-VNTR types among epidemiologically related isolates and thus the use of MIRU-VNTR changes for reliable exclusion of a link (termed "sensitivity of typing" in reference 35) when ongoing transmission is tracked.
Acceptable technical problems were occasionally encountered with some of the 24 markers, including relatively frequentbut still interpretablestutter peaks (e.g., with QUB-26) and exceptional PCR failures for some infrequent genotypes (i.e., with locus 2163b in some Mycobacterium africanum isolates or isolates from oryxes). In such a case, the genotypes can still reliably be compared based on the 23 (or 14) remaining loci.
In contrast to the 24-locus MIRU-VNTR set, five other loci out of a total of 29 tested failed to acceptably fulfill the criteria of sufficient variability yet required stability and robustness as defined above. Three of them, i.e., QUB-3232, 3336, and 2163a, were hypervariable with respect to both allelic diversity among unrelated isolates and frequency of changes among epidemiologically linked and serial patient isolates. Some repeatedly yielded double alleles in several isolates, regardless of the experimental conditions and/or were subject to frequent PCR failures or uninterpretable PCR patterns, resulting in numerous discordances or uncertainties between results obtained in independent laboratories or within the same laboratory (see Note S3 in the supplemental material for details) (20). We believe that even hypothetical technical improvements will at best only partially solve these problems, and the inclusion of these markers in a routine panel for general screening such as proposed elsewhere (18, 45) would reduce the overall reproducibility to an unacceptable rate. Future studies will determine if such hypervariable markers could nevertheless retain some utility in very well controlled conditions for dissection of long transmission chains. It is noteworthy that these markers excluded for M. tuberculosis were not subject to such problems when specifically applied to M. bovis (except QUB-3336), probably because of generally lower repeat numbers in M. bovis isolates. Therefore, panels including some of these markers might be specifically proposed for bovine tubercle bacilli, in order to cope with the particularly high genetic homogeneity reported among these isolates in previous studies (2, 37).
A few additional M. tuberculosis VNTR loci (25, 39) were not included in this study. Among them, VNTR 3820, proposed in another panel of markers (39), displayed features characteristic of hypervariable loci (large repeat and allele numbers and high allelic diversity), which we predicted to lead to the problems described above. As far as the other loci we suspect them to be unlikely to have a significant impact on the discriminatory power, as judged from the limited data available.
In conclusion, we have defined an optimized set of 24 MIRU-VNTR loci, including a discriminatory subset of 15 loci to be used as first-line molecular typing tools. These sets significantly and reliably improve the resolution for molecular epidemiological studies compared to the original set of 12 MIRU-VNTR loci. However, not all 15 or 24 loci are necessarily required to define all the unique isolates in any given situation. Depending on the cases, such as for the standardized collection of 90 isolates (23), nine loci from the 15-locus set can be already sufficient. As a general guideline, loci with the highest evolutionary rates within this set may be applied in the first place, possibly depending on the lineage(s) known to be prevalent in the area of interest (5) (Fig. 3 shows the loci that are the most frequently involved in SLVs, DLVs, or TLVs in the different lineages). In other cases, the SLV distribution and the resolution of the 15-locus set compared with that of the 24-locus set in the worldwide collection indicate that secondary typing with a few ancillary loci may provide additional discrimination of particular genotypes in some specific lineages. Spoligotyping also yielded additional albeit marginal resolution in this study. Therefore, its combined use with MIRU-VNTR typing may be useful, especially as a quick and convenient independent control. Overall, however, the optimized MIRU-VNTR set described here is predicted to be generally effective for M. tuberculosis typing in many settings, as it has been defined based on numerous representatives of the principal worldwide M. tuberculosis lineages. Its predictive value for evaluating M. tuberculosis transmission was found to be equal to that of IS6110-RFLP typing in a companion population-based study. Therefore, we propose the 15-locus (for epidemiological studies) and 24-locus (more for phylogenetic studies) sets as a basis for standardized MIRU-VNTR typing of M. tuberculosis. The introduction of this more advanced method provides the possibility of maintaining and reinforcing international communication in the field of TB molecular epidemiology, already facilitated by the recognition of IS6110-RFLP as the gold standard since the early 1990s. This situation is almost unique among the infectious diseases.
The work was supported by INSERM, the Institut Pasteur de Lille, a grant from the Lille Génopole, and the European Community (grant QLK2-CT-2000-00630). Parts of this work were also supported by the German Ministry of Health and the Robert Koch Institute, Berlin, Germany. E.S. held a Poste Vert from the INSERM. M.C.-O. held a fellowship from the CAPES; P.S. is a Chercheur du Centre National de Recherche Scientifique.
Published ahead of print on 27 September 2006. ![]()
Supplemental material for this article may be found at http://jcm.asm.org/. ![]()
Present affiliation: Laboratório de Biologia Molecular Aplicada a Micobactérias, Departamento de Micobacterioses, Instituto Oswaldo Cruz-Fiocruz, Rio de Janeiro, RJ, Brazil. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»