Previous Article | Next Article ![]()
Journal of Clinical Microbiology, July 2004, p. 3248-3255, Vol. 42, No. 7
0095-1137/04/$08.00+0 DOI: 10.1128/JCM.42.7.3248-3255.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Vincent Hervé,1 Gilles Vergnaud,3 and Christine Pourcel5,6*
Laboratoire de Mycobactériologie, HIA Percy, 92140 Clamart,1 HIA Val de Grâce, 75005 Paris,2 Centre d'Etude du Bouchet, 91710 Vert le Petit,3 Génome, Polymorphisme et Minisatellites (GPMS), Institut de Génétique et Microbiologie, Université Paris XI, 91405 Orsay Cedex,5 Institut Pasteur, 75015 Paris, France,6 CHA Bouffard, Djibouti, Republic of Djibouti4
Received 2 February 2004/ Returned for modification 15 March 2004/ Accepted 6 April 2004
|
|
|---|
|
|
|---|
The first such strain was isolated by Georges Canetti in 1969. About 20 years later, two additional strains were isolated from patients with tuberculosis (TB) lymphadenitis (16, 25). In 2002 Miltgen et al. described the characteristics of two new strains of "M. canettii" isolated from two French patients with pulmonary TB (14). Considering the very small number of "M. canettii" infections reported to date, no difference with the classical M. tuberculosis infection can be identified. Nonetheless, some authors suggest that "M. canettii" could induce milder pneumonia than classical TB (12, 14).
Studies by Brosch et al. and Marmiesse et al. of 20 regions where insertion-deletion events took place in the genome of M. tuberculosis suggested that "M. canettii" diverged first from the rest of the MTBC (1, 13). The "M. canettii" taxon can also be easily differentiated from the other members of the MTBC on the basis of a restriction site polymorphism in the hsp65 gene (8).
All the described "M. canettii" strains show only two bands by spoligotyping (25), corresponding to spacers 30 and 36, which indicates that the direct repeat (DR) locus in "M. canettii" is very different from that of the other members of the MTBC. By sequencing, van Embden et al. showed the existence of at least 26 spacers that appear to be unique to "M. canettii" (24).
Frothingham et al. (7) and Supply et al. (21) were the first to describe the use of polymorphic tandem repeat loci for genotyping MTBC strains. Variable-number tandem repeat (VNTR) analysis is a very promising approach, since it successfully discriminates low-copy-number IS6110 strains (2), which are poorly resolved by the otherwise quite efficient IS6110 typing (23). In addition, tandem repeat typing can be standardized, and common databases are easily set up (3, 10). However, typing by multiple methods is still required to attain maximum specificity and more importantly to correlate multiple-locus VNTR analysis (MLVA)-derived data sets with data based on previous methods. One reason for this is that mutation rates and evolution of tandem repeat loci in bacteria are not yet fully understood, so that MLVA data interpretation is still to some extent a research area.
Based upon the available genomic sequences for M. tuberculosis, we previously selected and analyzed a larger collection of VNTRs, giving rise to a high-resolution MLVA typing assay (10). Using a selection of 21 VNTRs we have typed 43 smooth variants of M. tuberculosis and compared their genotype with one of the previously described "M. canettii" strains and with the collection of MTBC strains described by Le Flèche et al. (10).
|
|
|---|
Most patients were Djiboutian, and the others were expatriated patients living in Djibouti. No epidemiological link could be established between the patients. The detailed description of the clinical features of the cases will be published elsewhere (J.-L. Koeck et al., unpublished data). DNA from "M. canettii" strain CIPT140010060 (referred to in this work as CIPT060) was kindly provided by Véronique Vincent. percy8 (see Fig. 1) is the same as strain 990161 in reference 13. The reference strain M. tuberculosis H37Rv was used as a control in the MLVA (10).
![]() View larger version (44K): [in a new window] |
FIG. 1. MLVA cluster analysis. Clustering analysis was done using the categorical and unweighted pair group method with arithmetic averages options. From left to right, the columns designate the strains names, the genotype number (geno), the result obtained for the PCR amplification of part of the DR locus using "canettii-specific" spacers 73 to 76 or 80 to 83 (24), and the hsp65 profile. The asterisks indicate that sequencing data have been produced. The hsp65 fragment has been sequenced in at least one strain from each genotype (SNPs 2 and 3 [Fig. 5] are not detected by PCR-restriction fragment length polymorphism analysis). Strains H37Rv and CIPT060 are underlined.
|
Data management and analyses. Gel images were analyzed using a bionumerics software package (version 3.5; Applied-Maths, Sint-Martens-Latem, Belgium) as previously described (10). The number of repeats in each allele was deduced from the amplicon size. The resulting data were analyzed with bionumerics as a character data set. Clustering analysis was done using the categorical parameter and the unweighted pair group method with arithmetic averages coefficient. The minimum spanning tree (5) was constructed with the following options: (i) in case of equivalent solutions in terms of calculated distances, the selected tree was the one containing the highest number of links between genotypes differing at only one locus ("Highest number of single locus variants" option); (ii) the creation of hypothetical types (missing links) reducing the total length of the tree was allowed.
Analysis of the DR locus by PCR amplification and characterization of new spacers. PCR amplification was performed using primers selected in the spacers described by van Embden et al. (24): primer pair Mcan80For (5'-TAGCGAGCTGTGCGGCAGTA) and Mcan83Rev (5'-TAAGCACACCAGCACCTCCC) and primer pair Mcan73For (5'-TCGGTGCTGACCCCATGGAT) and Mcan76rev (5'-GATTCGCCCGTCGCTGCAAT). These amplifications are predicted to produce, respectively, a 202- and a 213-bp fragment in the sequenced "M. canettii" strain (see Fig. 3A).
![]() View larger version (60K): [in a new window] |
FIG. 3. Structure of the DR locus. PCR using primers Mcan80For and Mcan83Rev (A) or Mcan73For and Mcan76Rev (B) on 24 smooth strains and H37Rv. The 100-bp ruler is used as size marker (lanes 1, 11, 19, and 28). No amplicon is observed in H37Rv (lane 2) and in seven smooth strains (lanes 9, 12, 21, 23, 24, 25, and 27). (B) Strains percy32 and CIPT060 have four additional new spacers as shown by the presence of a 505-bp amplicon and by sequencing.
|
hsp65 polymorphism and sequencing of PCR products. PCR amplification of part of the hsp65 gene was performed as described previously (8) using primers Tb11 (5'-ACCAACGATGGTGTGTCCAT) and Tb12 (5'-CTTGTCGAACCGCATACCCT). The 441-bp fragment was digested by HhaI or DdeI (Roche, Meylan, France) and run on a 3% Nusieve 3:1 agarose gel (FMC Bioproducts, TEBU, Le Perray en Yvelines, France).
For sequencing, a PCR with 45-µl mixture was performed and the product was purified by polyethylene glycol precipitation as described previously (4). Sequencing was done by MWG Biotech (Courtaboeuf, France).
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Genotyping data obtained by MLVA in this study
|
![]() View larger version (28K): [in a new window] |
FIG. 2. Minimum spanning tree analysis and comparison with the other approaches. A minimum spanning tree was constructed using the genotyping data provided in Table 1. The parameters used for the construction of the tree authorized the introduction of hypothetical (missing) links which appear as open circles (data predicted by the software as intermediates, not present in the available collection). The genotype numbers listed in Fig. 1 are indicated in the circles. The three group A genotypes comprising 31 strains are shown in blue. The two closely related strains, including the "M. canettii" strain CIPT060 (genotype 12), and in which four additional spacers in the DR locus have been found are shown in yellow. Genotypes with the hsp65 SNP 3 variant are presented in green. The exceptional strain percy65 (genotype 14; hsp65 SNPs 1, 2, and 5) is colored in red. The two groups of strains devoid of DR regions are circled. Strain percy79 (genotype 4, absence of the "M. canettii"-specific set of spacers, presence of new spacers) is indicated. Dotted lines are used to link genotypes separated by seven or more differences in the MLVA genotype.
|
To check whether a DR locus with unknown spacers might be present in some of the strains, we performed a PCR amplification with primers DRa and DRb localized in the constant region of the DR. No amplification was observed in 10 strains, confirming the complete absence of DR structure in these strains (Fig. 1). In contrast an amplification product in the form of a ladder was obtained with strain percy79 (Fig. 4A). We purified amplicons of a size ranging between 100 and 600 bp and cloned them into a plasmid vector (Fig. 4B and C). Sequencing of the insert from eight clones led to the identification of 20 spacers, all of them previously unknown.
![]() View larger version (55K): [in a new window] |
FIG. 4. Cloning of new spacers from strain percy79. (A) PCR amplification was performed using primers DRa and DRb on 16 "M. canettii" strains. Amplification product from strain percy79 was purified and cloned into the pGEM-T Easy vector (B), and inserts were analyzed by PCR amplification (C). Lanes 1, 10, and 19 in panels A and C contain 100-bp ruler DNA size markers.
|
![]() View larger version (60K): [in a new window] |
FIG. 5. PCR-restriction fragment length polymorphism analysis of the hsp65 locus. (A) hsp65 PRA patterns of representative strains from the MTBC after digestion with HhaI (left) or DdeI (right). The representative "M. canettii" strain CIPT060 shows a different pattern with HhaI as expected (lane 7) but not with DdeI (lane 14). (B) Fifteen smooth strains and H37Rv upon digestion of the 441-bp amplicon with HhaI. (C) Same as panel B, using DdeI instead of HhaI. A 100-bp ruler is used as size marker. Strain percy65 has a unique digestion pattern as exemplified. (D) Multiple alignments using the CLUSTALW algorithm of the 180-bp portion of the hsp65 gene in H37Rv and smooth strains showing five single nucleotide changes, numbered 1 to 5. The SNP described by Goh et al. (8) is SNP 4.
|
|
|
|---|
The 43 smooth variants of M. tuberculosis have characteristics in common with the representative "M. canettii" strain analyzed, such as a specific polymorphism of the hsp65 gene, but can be separated into several groups on the basis of the other molecular tests: (i) clustering by MLVA genotyping (Table 1; Fig. 1 and 2), (ii) the presence of a DR region in one group containing a collection of unique spacers (Fig. 3 and 4), (iii) the absence of DR in the second group (Fig. 2 to 4), and (iv) additional polymorphism in the hsp65 gene (Fig. 5).
MLVA places the 44 strains together in a separate group compared to almost 200 strains of the MTBC similarly analyzed (10; unpublished data). For several markers, some alleles are unique to "M. canettii," such as ETR-A (allele 10), ETR-C (alleles 6 and 10), MIRU-02 (allele3), MIRU-40 (allele 8), and Mtub29 (allele 5). Two markers are monomorphic in this collection, ETR-A and Mtub01. In contrast, ETR-A was found to be highly polymorphic in the MTBC (19). Here a unique "M. canettii"-specific allele is observed.
The strains percy32 and CIPT060, which possess a DR locus with additional spacers compared to those described by van Embden et al. (24), constitute a branch more closely related to group A strains than to the rest. Ten strains lacking a DR locus are loosely connected (Fig. 1 and 2). Analysis of the PCR amplification product obtained with primers corresponding to the conserved repeat of the DR locus shows that strain percy79 possesses a not previously described set of spacers. Previous work (24) suggested that a common ancestor bearing a large number of units has evolved by the interstitial deletion of motifs to explain the hundreds of different combinations seen in contemporary MTBC strains (6). The fact that strains which lack a DR locus have highly diverged genotypes (here, for instance, strain percy65, [Fig. 2] and the genotypes 5 to 10 and 13) suggests that the deletion occurred at least twice independently in the different strains. Further molecular work such as spoligotyping and sequencing across the "missing DR" might improve the understanding of the origin and history of the DR locus. Considering the genome microdeletions that occurred during the MTBC evolution and separated M. bovis from M. tuberculosis, Brosch et al. (1) concluded that M canettii is the species closest to the common ancestor. This analysis was based on the investigation of five "M. canettii" strains, which were shown to be devoid of the set of deletions which differentiate the other members of the MTBC. The relatively very high variability of the MTBC in terms of microdeletion events is quite interesting and probably reflects a sudden change of environment and selection pressure (some parts of the genome then becoming dispensable) rather than true evolutionary distance.
Our investigation on a relatively large collection of smooth strains, recruited in a very limited geographic area (the Republic of Djibouti), demonstrates that the genetic heterogeneity (not taking into account deletions) within "M. canettii" is much larger than within the whole MTBC. This is suggested first by the MLVA data (Fig. 1 and 2). In spite of the suspicion that tandem repeat loci may be under variable and specific evolutionary pressures, the clustering of strains by MLVA does make sense (15, 17). This is illustrated by a growing number of studies, not only of M. tuberculosis, in which MLVA clustering data have been compared to previous knowledge (3, 11) and which validate this approach as a clinical epidemiologic tool (19). In the present investigation, percy32 and CIPT060, for instance, which were predicted to be closely related by MLVA (Fig. 2, yellow circles), turned out to contain the same very rare four additional spacers in their DR regions. Similarly, strains carrying hsp65 SNP 3 are grouped together by the MLVA (Fig. 2). However, it is the hsp65 polymorphism data which provide the strongest evidence for the comparatively high "M. canettii" diversity. We have analyzed a portion of 441 bp from the hsp65 gene which was previously shown to harbor a single nucleotide difference specific for the few "M. canettii" strains previously assayed. This polymorphism is present in the 44 strains of this study, but in addition we describe four new nucleotide differences in 6 strains (synonymous substitutions), one of them being present in 5 strains. Interestingly strain percy65 possesses three unique SNPs. This important polymorphism in a small genomic region confirms that some strains are highly diverged. In contrast, no sequence difference at this locus is seen between the three sequenced genomes, M. bovis strain AF2122, M. tuberculosis strains H37Rv, and CDC1551. Not a single SNP was observed among either the 267 strains representing the MTBC diversity (and including M. microti, M. africanum, M. bovis) or the M. bovis BCG strains (20). In addition only one SNP separates any member of the MTBC investigated so far from the closest "M. canettii" sequence (Fig. 5D), whereas four different SNPs have been identified within the present collection of "M. canettii." This implies that the "M. canettii" group is much older than the MTBC. In other words, the available data strongly suggest that the MTBC is a recently emerged subspecies of "M. canettii." Furthermore, all the "M. canettii" strains investigated originate from the Republic of Djibouti, i.e., a very limited geographic area, and were collected over a short time period, so that the diversity detected probably underestimates the true diversity of the species.
This finding clearly raises a number of interesting questions. Smooth M. tuberculosis strains have not been detected elsewhere in spite of the fact that they show a very remarkable colonial morphology and spoligotype (25) and could not have been missed. It is tempting to speculate that the MTBC originates from East Africa. One successful clone spread, whereas the rest of the progenitor species, "M. canettii," remained in East Africa. It may be relevant in this view to recall that the group of ancestral M. tuberculosis strains without deletions of TbD1 (13) is associated with the East African Indian type of strains (19).
Finally, a reservoir must exist. In spite of the lack of epidemiological link between the different patients, we show here that one clone (genotype 2) is responsible for a large proportion of cases. Field investigations would be of great interest in trying to find this reservoir.
Work on the typing and molecular epidemiology of dangerous pathogens is supported by the French ministry of defense.
Present address: Service des maladies infectieuses et tropicales, HIA Laveran, 13013 Marseille, France. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»