Previous Article | Next Article ![]()
Journal of Clinical Microbiology, April 2008, p. 1398-1406, Vol. 46, No. 4
0095-1137/08/$08.00+0 doi:10.1128/JCM.02089-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Maryse Fauville-Dufaux,1,
and
Philip Supply2,3,
*
Scientific Institute of Public Health, Department Institut Pasteur de Bruxelles, Laboratoire Tuberculose et Mycobacteries, rue Engeland 642, 1180 Bruxelles, Belgium,1 Institut Pasteur de Lille,2 Laboratoire des Mécanismes Moléculaires de la Pathogenèse Bactérienne, INSERM U629-1, rue du Professeur Calmette, BP 245, 59019 Lille Cedex, France3
Received 29 October 2007/ Returned for modification 10 December 2007/ Accepted 22 January 2008
|
|
|---|
|
|
|---|
So far, IS6110-restriction fragment length polymorphism (RFLP) has been the gold standard method for genotyping M. tuberculosis (43). IS6110 fingerprinting has proven useful for conducting population-based studies of TB transmission (2, 32). However, this technique is laborious and time-consuming with this slow-growing mycobacterium. In addition, it has insufficient capacity to discriminate M. tuberculosis strains with low copy numbers of IS6110 and is not straightforward enough for the identification of genetic lineages (10, 22, 23).
Several alternative PCR-based methods have been developed in order to overcome these problems. Spoligotyping often is used as a secondary method for typing low-copy-number IS6110 isolates, but it does little to improve strain differentiation in population-based studies (11). The most promising PCR-based methods are based on the analysis of multiple loci containing variable numbers of tandem repeats (VNTR) of different families of interspersed genetic elements, collectively called mycobacterial interspersed repetitive units (MIRU) (15, 24, 28, 31, 34, 35, 39, 40). Currently, the most used version of this method (designated MIRU-VNTR) is based on the analysis of 12 loci (25, 38). Especially when applied as a first-line typing test in combination with spoligotyping, this 12-locus-based method can provide adequate discrimination in many cases, including in large-scale studies (6, 9). Nevertheless, this 12-locus set still fails to discriminate a significant number of different strains, as revealed by more extensive MIRU-VNTR sets or IS6110-RFLP combined with contact tracing data (9, 27, 37).
A standardized MIRU-VNTR format with significantly improved discriminatory power has been proposed on the basis of the analysis of the clonal stability and evolutionary rates of MIRU-VNTR markers in primary genetic lineages of tubercle bacilli from around the world (37). This format is comprised of 24 loci, 15 of which were defined as composing a discriminatory subset based on higher variability within the different clonal complexes studied. So far, the predictive value of these standardized sets for estimating M. tuberculosis transmission has been evaluated only retrospectively, in a single population-based study conducted during 1 year in Hamburg, Germany (27).
Here, we evaluated this standardized format prospectively in a fivefold-larger population-based study with a different patient population, conducted during 39 months in the Brussels capital region of Belgium. The discrimination and cluster identification by these new MIRU-VNTR sets were compared to those obtained using the previous set of 12 loci, spoligotyping, and IS6110-RFLP and to available epidemiological information. Genetic lineages of M. tuberculosis were determined based on the congruence of the different markers and on the newly available MIRU-VNTRplus identification database (45). This database permits clonal identification based on comparisons to the reference lineages as defined by a comprehensive marker set, which is made up of large sequence polymorphisms (LSPs) and single-nucleotide polymorphisms (SNPs) in addition to MIRU-VNTR and spoligotyping. These data then were analyzed in terms of the geographic origin of the patients.
|
|
|---|
Molecular typing method. IS6110-RFLP analyses were performed on 258 isolates according to an internationally agreed-upon standard method (43). Spoligotyping was used as previously described by Kamerbeek et al. (22). Twenty-four-locus-based MIRU-VNTR typing was routinely applied using a four-capillary-based ABI 3100-Avant genetic analyzer as described by Allix et al. (3) and Supply et al. (37).
Computer-assisted analysis of patterns. IS6110-RFLP patterns, spoligotyping, and MIRU-VNTR profiles were analyzed using the Bionumerics package, version 4.5 (Applied Maths, St-Martin-Latem, Belgium). Dendrograms based on IS6110 fingerprints were generated using the dice coefficient, the unweighted-pair group method using average linkages, and a position tolerance of 1.8%. Dendrograms based on MIRU-VNTR patterns were generated using the categorical coefficient and neighbor-joining method and were rooted using a "Mycobacterium prototuberculosis" C/D genotype (also called "Mycobacterium canetti") (20). A strain cluster was defined as two or more patients infected by isolates having identical genotypes depending on the typing method(s) used. Assuming that one patient from each strain cluster corresponded to the index case at the origin of infection, the strain-clustering rate (or recent transmission index) was calculated with the following equation: strain-clustering raten – 1 = (nc – c)/n, where nc is the total number of strain-clustered cases, c is the number of strain clusters, and n is the total number of cases in the sample (33). For cluster analysis, isolates with mixed populations, identified by double alleles, in two or more MIRU-VNTR loci were excluded, and for isolates with clonal variants, the single locus displaying a double allele was not considered.
Statistical analysis.
Pearson
2 or the Fisher exact test (depending on the number of subjects) was used to test pairwise differences in strain genetic lineages between patients of various geographic origins.
|
|
|---|
The remaining patients (343/1,150) were not typed due to their producing culture-negative samples (n = 210), contamination by other microorganisms (n = 3), and culture unavailability (n = 130). As expected, young patients (less than 15 years of age; n = 48) and patients with extrapulmonary TB (n = 129), notoriously known to give paucibacillary samples, were overrepresented among the culture-negative patients (data not shown).
Strain typeability, clonal variants, and mixed populations. The 807 isolates were fully typeable for the 24 MIRU-VNTR loci. Ninety-seven percent of the alleles were obtained after the first round of multiplex PCR. The remaining 3% were obtained after an additional round(s) of injection on the DNA analyzer or of multiplex or simplex PCR amplification. For one isolate, the spoligotype could not be obtained despite repeated attempts. IS6110-RFLP was not systematically applied to the full collection (see below).
Of the 807 isolates, only 8 reproducibly displayed a double allele in a single MIRU-VNTR locus, thus identifying the simultaneous presence of two closely related clonal variants, as defined previously (3, 18, 30, 37). The corresponding loci were 2163b (n = 3), 4052 (n = 2), 2165 (n = 1), 2461 (n = 1), and MIRU 27 (n = 1). For each of these isolates, the corresponding locus initially was treated as missing data for cluster identification (i.e., only the 23 other loci of these isolates were considered). The double allele subsequently was considered for the detection of possible overlaps of clusters identified (see Discussion). Only five isolates reproducibly displayed double alleles in two (n = 1) or more (n = 4) loci, thereby identifying the simultaneous presence of independent clones in accordance with previous studies (3, 18, 30, 37). These isolates were excluded from subsequent analyses.
Discriminatory power and cluster identification in a test panel. As an initial step, the resolution power of MIRU-VNTR typing alone or in combination with spoligotyping was compared to that of IS6110-RFLP by analyzing 258 isolates (Table 1). This panel represented 77% (258/334) of all genotyped isolates from 1 September 2002 to 31 December 2003.
|
View this table: [in a new window] |
TABLE 1. Discriminatory power and cluster identification in a test panel (77% of all genotyped isolates from Brussels during a 16-month period; n = 258)a
|
Not surprisingly, the five IS6110-RFLP clusters displaying fingerprints with one to five bands all were subdivided by MIRU-VNTR typing. Most (19/21) of the isolates grouped within these clusters were identified as unique by MIRU-VNTR.
Of the 23 IS6110-RFLP clusters with high copy numbers and grouping a total of 70 isolates, 20 clusters (including 61 isolates) were found to be completely identical by MIRU-VNTR typing. Of the three remaining IS6110-RFLP clusters, two (including two Haarlem and three LAM isolates with 9 and 11 IS6110 bands, respectively) were fully subdivided both by four to seven MIRU-VNTR loci and by spoligotyping, whereas one cluster (including four isolates) was distinguished into two pairs by a single-locus change by MIRU-VNTR typing.
Conversely, only five clusters defined by MIRU-VNTR typing based on the 15 and the 24 loci and grouping a total of 12 isolates were subdivided by IS6110-RFLP. Four of these MIRU-VNTR clusters included two isolates, whereas one, containing four isolates, was distinguished into two pairs by IS6110-RFLP. In all cases, the IS6110-RFLP differences consisted of a single band difference.
Discriminatory power of the PCR-based methods in the entire population-based collection. Given the higher-resolution power of MIRU-VNTR typing compared to that of IS6110-RFLP and the excellent correlation for the strain cluster definition (excluding nonrelevant IS6110-RFLP clusters with low numbers of bands) in the initial test panel, standardized MIRU-VNTR typing was applied in combination with spoligotyping for the screening of the M. tuberculosis isolates subsequently collected from 2004 and 2005. The analysis of the discriminatory power and cluster definition by the PCR-based methods thus was extended to a total of 802 isolates. The isolates were obtained during the full study period; five isolates that corresponded to mixtures of two strains were excluded (see above).
The discriminatory power of MIRU-VNTR typing was evaluated by comparing results obtained using the discriminatory subset of 15 loci, the full set of 24 loci, and the old set of 12 MIRU-VNTR loci (Table 2). The discriminatory subset of 15 loci distinguished 596 different profiles, resulting in a strain-clustering rate of 25.8%. The number of additional profiles obtained with the full set of 24 loci was limited to 14, resulting in only a slight decrease in the strain-clustering rate (23.9%). In comparison, the number of profiles and the strain-clustering rate obtained with the old set of 12 loci were 418 and 47.9%, respectively.
|
View this table: [in a new window] |
TABLE 2. Discriminatory power of the PCR-based methods in the entire population-based collection (n = 802)a
|
Phylogeographic analysis. The Brussels capital region is a cosmopolitan area, and 76% (n = 612) of TB patients in this study were born in 69 different countries, with the majority being from Africa (n = 381). Thus, this population offers an opportunity for the study of global M. tuberculosis phylogeography, especially for Africa, which is understudied in this regard. The strain lineage distribution was determined for the 629 different genotypes, firstly by analyzing the congruence between spoligotyping and 24-MIRU-VNTR-locus typing within the study collection. Among them, 12 major spoligotype families, with profiles matching the signatures of classical prototypes (7, 13) and each containing 9 to 124 isolates, were identified (Fig. 1). Using a tree based on the 24-MIRU-VNTR-locus data (Fig. 1), excellent concordance was observed between MIRU-VNTR groupings and these spoligotype assignations, except for isolates with undefined or T spoligotypes, which is in keeping with results indicating that the latter actually conceal genetically heterogeneous strains (14).
![]() View larger version (35K): [in a new window] |
FIG. 1. Concordance observed between MIRU-VNTR groupings and spoligotype assignations among the 446 isolates from well-defined lineages. MIRU-VNTR groupings are visualized by a dendrogram calculated using a neighbor-joining algorithm and rooted using an "M. prototuberculosis" C/D genotype (20). Asterisks indicate examples in which the MIRU-VNTR groupings confirmed the genetic classifications that initially were presumed on the basis of spoligotypes sharing only partial spoligoprototype signatures.
|
The association between a patient's strain lineage and that patient's region of birth has been shown on the basis of LSP use (16). This analysis was performed after regrouping, based on a shared sequence deletion in pks15/1, all of the branches corresponding to SNP-based principal genetic group 2 (PGG2) and PGG3 (36) into a superlineage called Euro-American. This superlineage comprises lineages of Cameroon, Haarlem, LAM, Ural, S, and Uganda spoligotypes plus T spoligotypes (17 and T. Wirth, F. Hildebrand, C. Allix, F. Wölbeling, T. Kubica, K. Kremer, D. van Soolingen, S. Rüsch-Gerdes, C. Locht, A. Meyer, P. Supply, and S. Niemann, unpublished data). We first similarly grouped our lineages, as identified by 24-MIRU-VNTR-locus types and confirmed by spoligotyping and MIRU-VNTRplus analysis, and analyzed their correlations with a patient's region of birth (Fig. 2). Significant associations (P < 0.001 in all cases) were consistently found between CAS and EAI lineages and patients from the Indian subcontinent, between the Beijing lineage and those from central and east Asia, between Mycobacterium africanum (West African 1 and 2) and Mycobacterium bovis lineages and those from western and northern Africa, and between the Euro-American superlineage and those from Europe, Africa, and North and South America.
![]() View larger version (15K): [in a new window] |
FIG. 2. Distribution of major M. tuberculosis lineages according to patients' regions of origin. Asterisks indicate statistically significant associations (P < 0.001).
|
![]() View larger version (22K): [in a new window] |
FIG. 3. Distribution of branches of the Euro-American M. tuberculosis superlineage according to patients' regions of origin. Asterisks indicate statistically significant associations, and respective P values are indicated.
|
|
|
|---|
A high correlation was found between unique isolates or strain clusters defined by MIRU-VNTR and IS6110-RFLP (i.e., more than five IS6110 bands) in our test panel, representing 77% of all genotyped isolates during the 16-month period. Of the 23 IS6110-RFLP clusters with high copy numbers, 20 were found to be completely identical to each other by MIRU-VNTR typing. Of the three remaining IS6110-RFLP clusters, two were fully subdivided both by four to seven MIRU-VNTR loci and by spoligotyping. As shown by previous studies (27, 42), such discrimination by multiple MIRU-VNTR loci, a fortiori when independently corroborated by spoligotype differences, is predictive of the absence of a link between the patients involved and, thus, indicates that these IS6110 clusters are epidemiologically irrelevant. In contrast, only one IS6110-RFLP spoligotype cluster, containing four patient isolates, was subdivided into two pairs by a single-locus variation (SLV) by MIRU-VNTR typing. Interestingly, one of these pairs involved two Rwandese patients, while the other one grouped together a Belgian and a Somali patient who were identified as acquaintances by subsequent contact tracing. These two patient groups might be unrelated, since MIRU-VNTR SLVs have been observed in several instances among isolates with fully matching IS6110-RFLP results and spoligotypes originating from epidemiologically unlinked patients (9). However, it cannot be completely excluded that the SLV between our two patient isolate groups reflects a rare MIRU-VNTR mutation and genetic drift in the clonal population originating from recent transmission (29, 37).
Not surprisingly, the five IS6110-RFLP clusters displaying fingerprints with one to five bands all were subdivided by MIRU-VNTR typing. Conversely, only five clusters defined by 24-MIRU-VNTR-locus typing were subdivided by IS6110-RFLP. In all cases, the IS6110-RFLP differences consisted of a single-band change among multibanded profiles, the epidemiological interpretation of which is often questioned (46). Regardless, the discriminatory power and the accuracy for the cluster analysis of MIRU-VNTR typing appeared to be slightly greater than that of IS6110-RFLP in this test panel. In these conditions, the maximal resolution power was achieved with the discriminatory subset of 15 loci without the influence of the additional inclusion of the 9 ancillary loci and/or spoligotyping.
The benefit of the 15- and 24-locus formats was further evaluated in the entire population-based collection by comparing them to the 12-locus format currently used alone or in combination with spoligotyping for universal genotyping in the United States and elsewhere (6, 9). The use of the 15- and 24-locus formats resulted in a 50% drop in the strain-clustering rate compared to that obtained with the old set of 12 loci (25.8, 23.9, and 47.9%, respectively). The small 2% difference in strain-clustering rates between the 15- and 24-locus formats is consistent with the design of the discriminatory subset of 15 loci, which comprises the most variable markers across representative M. tuberculosis genetic lineages (37). It is noteworthy that the secondary use of spoligotyping slightly improved the resolution obtained with both the 15- and 24-locus formats and, interestingly, rendered the difference between these two formats negligible (22.2 and 21.6%, respectively). In contrast, the combined use of the old 12-locus set and spoligotyping generated a significantly higher strain-clustering rate (32.2%) than did the 15- and 24-locus sets, even without considering spoligotyping results.
The epidemiological interpretation of the molecular results depends not only upon the discriminatory power of the markers but also upon their adequate clonal stability during infection and transmission. A total of 59 serial isolates, obtained from 27 different patients during the study period, were genotyped by using the 24 MIRU-VNTR loci and spoligotyping. Consistently with the low frequency of exogenous reinfection expected in this type of setting (8) and other analyses on MIRU-VNTR stability (29, 37), the genotypes were conserved across the full set of markers within the 27 groups.
The epidemiological significance of the clusters defined by 24 MIRU-VNTR loci and spoligotyping combined was further evaluated in a parallel study by analyzing epidemiological, demographic, and contact tracing data available for TB cases registered from 1 January 2003 to 31 December 2004 (C. Allix-Béguec, P. Supply, M. Wanlin, P. Bifani, and M. Fauville-Dufaux, unpublished data). These analyses confirmed familial or social links for at least 51/157 (32%) strain-clustered patients, including those defining a large ongoing outbreak. Patients were found to live in close proximity in 25/157 (16%) additional strain-clustered cases of unknown direct epidemiological links. The classical risk factors for TB transmission (e.g., being young and underprivileged) likewise were identified based on the clusters defined by MIRU-VNTR and spoligotyping. Furthermore, the strain-clustering rate (20% during the period considered) defined on this basis was remarkably identical to that established based on IS6110-RFLP in a similar study that involved a comparable population during the same 2-year period (41). Note that similar results would have been obtained with the 15 MIRU-VNTR loci and spoligotyping, as the strain clusters obtained in this case were virtually the same as those obtained with 24 MIRU-VNTR loci and spoligotyping (see above).
Interestingly, one isolate with a double allele in a single locus constituted the overlap of two clusters by sharing allele 3 with one isolate and allele 4 with another one in locus 2165, while the 23 other loci were fully identical for the three isolates. However, the plausibility of the possible corresponding epidemiological connection could not be confirmed by epidemiological or demographic data.
We additionally investigated the capacity of standardized MIRU-VNTR typing to identify different well-defined M. tuberculosis strain lineages. Because M. tuberculosis has a clonal population associated with patient geographic origins (16), strain lineage information is useful, as it can, for instance, provide indications regarding the source of the TB case (ongoing transmission versus the reactivation or the importation of an infection acquired abroad). Phylogeographical studies also have implications for the development of new tools for TB control (17). The phylogenetic information conveyed by MIRU-VNTR markers has been questioned (14, 17). Here, we found an excellent congruence between groupings based on 24 MIRU-VNTR loci and well-defined spoligotype signatures within our population-based collection. In fact, in many cases the MIRU-VNTR groupings even confirmed the genetic classifications that initially were only presumed on the basis of the spoligotypes sharing only partial spoligoprototype signatures (see, for instance, the Haarlem and LAM profiles in Fig. 1). Moreover, the strain lineage designations defined on this basis were tested by searching for best matches of their respective 24-MIRU-VNTR-locus genotypes among external reference strain lineages included in the database at www.miru-vntrplus.org. In this database, these references were identified by combining extensive marker sets, including SNPs and LSPs, which are presented as the most robust markers for phylogenetic analysis (17). As detailed in another study (Allix et al., unpublished), the internal designations were confirmed by consistent best matches in 88.7% of the cases when 24-MIRU-VNTR-locus types were used alone for analysis. While the absence of a match was found in only 8.1% of the cases, conflicts were detected in no more than 3.2% of the cases. Finally, when grouping our strain lineages, which were identified by 24-MIRU-VNTR-locus types and confirmed by spoligotyping and MIRU-VNTRplus, similarly to previously published data (16) and taking into account the correspondence of lineage nomenclatures, we found an association between patients' strain lineages and the patients' regions of birth, as was seen with the use of LSPs or SNPs (16, 19).
In conclusion, our results extend inferences, which initially were based on a smaller study on a different patient population in Germany (42% foreign-born patients, mostly from Turkey and eastern countries) (1, 12, 27), regarding the wide applicability of standardized MIRU-VNTR typing optionally combined with spoligotyping for the analysis of TB transmission, at least in the numerous settings with epidemiological features similar to those of these two studies. It remains to be studied how useful the standardized MIRU-VNTR set of 15 or 24 loci would be in a stable rural area with clusters existing for a long time period and displaying much less strain heterogeneity, but it can be predicted that, even in such conditions, these sets most likely will outperform the old 12-locus format for strain discrimination. As W/Beijing strains were infrequently found in these two studies, the possible relevance of a few additional MIRU-VNTR loci to this standardized format in specific populations in which such strains are predominant awaits a population-based evaluation, including detailed epidemiological data and refined genetic analyses (21, 26, 44). However, we already notice that some additionally proposed loci clearly do not meet the conditions of robustness and stability for standardized screening in and among laboratories (37). In contrast, with its 807 fully typeable isolates and the consistent epidemiological analysis results, this study demonstrates the excellent operability and efficiency of the standardized MIRU-VNTR formats for molecular screening in routine laboratory conditions. Lastly, we show that this standardized method permits the accurate and high-resolution analysis of M. tuberculosis phylogeography, which is now further facilitated by the online accessibility of the multifunctional MIRU-VNTRplus database. Taking into account its speed and portability, standardized MIRU-VNTR typing represents a powerful tool with diverse applications.
C.A.-B. was a fellow of the Brussels capital region.
P.S. is a researcher of the Centre National de la Recherche Scientifique (CNRS).
Published ahead of print on 30 January 2008. ![]()
Present address: Genoscreen, Campus de l'Institut Pasteur de Lille, 1 rue du Professeur Calmette, 59000 Lille, France. ![]()
These authors contributed equally to this work. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»