Previous Article | Next Article 
Journal of Clinical Microbiology, May 1998, p. 1410-1413, Vol. 36, No. 5
0095-1137/98/$04.00+0
Copyright © 1998, American Society for Microbiology. All rights reserved.
Nonrandom Association of IS6110 and
Mycobacterium tuberculosis: Implications for Molecular
Epidemiological Studies
T. D.
McHugh and
S. H.
Gillespie*
Department of Microbiology, Royal Free
Hospital School of Medicine, London NW3 2PF, United Kingdom
Received 20 October 1997/Returned for modification 10 January
1998/Accepted 29 January 1998
 |
ABSTRACT |
IS6110 restriction fragment length polymorphism typing
is now established as the primary typing method for Mycobacterium
tuberculosis. It has been assumed that the position of bands is
random. Thus, the discrimination of the technique increases in
proportion to the copy number. Two collections of M. tuberculosis were investigated to test this hypothesis. We
identified 33 positions in isolates from a Tanzanian collection and 25 positions in isolates from a London, United Kingdom, collection where
bands were significantly more likely to be present than would be
expected by chance. These data suggest that band position is not
random, and this possibility may have an impact on the interpretation
of molecular epidemiological studies of M. tuberculosis.
 |
TEXT |
The insertion sequence
IS6110 is found in almost all isolates of
Mycobacterium tuberculosis and is present in between 1 and over 20 copies per isolate (2). The positions of these
copies have been used in an internationally standardized restriction fragment length polymorphism (RFLP) protocol to type M. tuberculosis (13), and the protocol is a powerful tool
for unravelling questions of tuberculosis epidemiology. It has been
particularly useful in investigating tuberculosis outbreaks in closed
communities such as hospitals and prisons (4, 9, 11) and in
detecting cross-infection by shared use of equipment or laboratory
cross-contamination (9, 12). More recently population-based
studies have been reported (3), and it has been possible to
follow the spread of drug-resistant isolates in wider communities
(1, 7, 14).
Mycobacterium paratuberculosis possesses an insertion
sequence, IS900, which has a defined 5-bp target (AGGAG) at
which it integrates into the bacterial chromosome (6). No
similar target sequence has been reported for IS6110,
although the direct repeat locus has been recognized as a hot spot for
integration, being the common site for single copy strains
(10). It has been speculated that this site represents the
original point of entry for IS6110 and other copies arise
from further transposition events (10). The sequence
adjacent to this site has been determined and given the name
IS6110 preferential locus (ipl) (8).
IS6110 typing is dependent on the premises that integration
into the genome is random and that the discrimination of the technique increases in proportion to the copy number. The aim of this study was
to test the hypothesis that integration of IS6110 into the genome of M. tuberculosis is random. If, alternatively,
there were integration hot spots this would confound the interpretation of the many population-based IS6110 typing studies currently
under way.
Two collections of M. tuberculosis isolates were
investigated to ensure that results would not be confounded by a bias
due to transmission of strains in a single population. One collection consisted of 207 strains isolated from patients with tuberculosis diagnosed in the Royal Free and St George's Hospitals, London, United
Kingdom, who had acquired their
infection from a wide range of countries including the Indian
subcontinent, Africa, Southern Europe, and the United Kingdom. The
second was a group of 154 strains collected from the Northern Zone of
Tanzania as part of a study of the interaction of human
immunodeficiency virus and tuberculosis.
Analysis of IS6110 RFLP was performed by the International
Standard Method (13). Fingerprints were compared by using
GelCompar computer analysis software; this process normalized the gel
image, dividing each track into 400 arbitrary divisions. The number of isolates with IS6110 bands located in each of the positions
defined by the GelCompar program was plotted.
The number of IS6110 bands in each position is illustrated
in Fig. 1 for the Tanzanian isolates, and
a similar pattern was found for the London strains (Table 1). If
IS6110 were randomly distributed in the M. tuberculosis genome, the graph line would be parallel to the
x axis and have a value equal to the total number of
IS6110 bands divided by the total number of positions and
would vary around that mean. Distinct peaks were found in a number of
positions.

View larger version (20K):
[in this window]
[in a new window]
|
FIG. 1.
Number of M. tuberculosis isolates with a
band at a given position. The lower line represents the predicted mean
number of bands per position (total number of bands divided by the
number of band positions available equaled 4.4). The upper line
represents the hot-spot cutoff line (see text for definition).
|
|
In Fig. 2 the number of positions with a
given number of IS6110 bands is plotted for the Tanzanian
and London isolates together with the expected number of positions with
a given number of IS6110 bands and its Poisson distribution.
A
2 test was performed by calculating the expected
number of positions with a given number of IS6110 bands and
comparing this with the observed number. The results of this test for
the Tanzanian isolates were
2 = 2,486, df = 11, P < 0.0005, and for the London isolates the results
were
2 = 8,133, df = 12, P = 0.0005. This
2 analysis demonstrates that the observed
frequency distribution is significantly different from the predicted,
random distribution for both populations, indicating that band
positions are not randomly associated but that there are favored
locations where more than the expected number of IS bands are found. By
using these data it was possible to define hot spots for each
collection of isolates as positions where the expected number of
positions with a given number of IS6110 bands was less than
0.16 (i.e., >99% confidence interval), giving cutoff values of 13 and
16 IS bands per position for the Tanzanian and London isolates,
respectively. These positions are listed in Table 1. For the Tanzanian
isolates a total of 33 hot spots were identified, although 17 peaks
were noted on the frequency distribution graphs as many hot spots are
found in adjacent positions (Table 1). The majority of the hot spots are shared by the two populations we studied. It should be noted that
these hot spots included the direct repeat locus which has already been
defined and is located at hot-spot position 254 (10).

View larger version (10K):
[in this window]
[in a new window]
|
FIG. 2.
Frequency distribution of the number of positions with a
given number of bands for isolates of M. tuberculosis
collected in Tanzania (top) and London, United Kingdom (bottom). The
dotted lines represent the predicted number of positions with a given
number of bands calculated by using a Poisson distribution. The
predicted mean value of the number of positions with a given number of
bands is derived on the assumption that IS6110 is randomly
associated. Therefore, the number is equal to the total number of bands
divided by the total number of available positions (400).
|
|
The number of IS6110 bands which were located in hot spots
was calculated and the data for the combined populations are tabulated in Table 2. These results demonstrate
that the majority of strains with five or fewer copies have
IS6110 bands which are located in hot spots and confirm the
need for such strains to be investigated by an alternative technique
such as polymorphic GC-rich RFLP or spoligotyping (2).
High-copy-number strains had IS6110 bands in the
low-copy-number hot-spot sites but the reverse was rarely the case
(data not shown). Thus, it appears that there may be two subpopulations
of M. tuberculosis with distinct collections of
IS6110 integration sites. The first subpopulation is defined by strains that have low copy numbers, i.e., less than five bands, and
the second is defined by strains that have high copy numbers. This
division agrees with the observation of Yang et al., who noted the
existence of an Asian subgroup of M. tuberculosis strains with low copy numbers (14). Alternatively, low-copy-number
sites may be transcriptionally inactive, making it less likely that the
transposase will be transcribed and trigger a transposition event.
Thus, when rare transposition events to more active sites occur copy
number will rise rapidly, leaving few strains with an intermediate
number of IS6110 bands (5).
In high-copy-number strains a significant proportion of
IS6110 bands are located in hot spots, and this is likely to
have an effect on the sensitivity of IS6110 RFLP typing for
discriminating among isolates. In strains which have many hot-spot
IS6110 bands, removal of these conserved IS6110
bands from the similarity calculation may enhance discrimination,
weighting the calculation to band positions which are randomly
associated.
Although it is unlikely that we will see a consensus sequence similar
to the target for IS900 integration, detailed sequence analysis of IS6110 hot-spot regions will identify the
characteristics of the genome which favor integration of this insertion
sequence.
 |
ACKNOWLEDGMENTS |
We gratefully acknowledge the help of Richard Morris with
statistical advice, Nicki Hutchison for permission to use her RFLP data, and Anne Dickens for her technical assistance.
This work was supported by funds from The Special Trustees of the Royal
Free Hospital (S.H.G.).
 |
FOOTNOTES |
*
Corresponding author. Mailing address: Department of
Microbiology, Royal Free Hospital School of Medicine, Rowland Hill St., London NW3 2PF, United Kingdom. Phone: 44-171-794-0500. Fax:
44-171-794-0433. E-mail: stepheng{at}rfhsm.ac.uk.
 |
REFERENCES |
| 1.
|
Barnes, P. F.,
H. el-Hajj,
S. Preston-Martin,
M. D. Cave,
B. E. Jones,
M. Otaya,
J. Pogoda, and K. D. Eisenach.
1996.
Transmission of tuberculosis among the urban homeless.
JAMA
275:305-307[Abstract].
|
| 2.
|
Butcher, P. D.,
N. A. Hutchinson,
T. J. Doran, and J. W. Dale.
1996.
The application of molecular techniques to the diagnosis and epidemiology of mycobacterial diseases.
J. Appl. Bacteriol.
81:53S-71S.
|
| 3.
|
Chevrel-Dellagi, D.,
A. Abderrahman,
R. Haltiti,
H. Koubaji,
B. Gicquel, and K. Dellagi.
1993.
Large-scale DNA fingerprinting of Mycobacterium tuberculosis strains as a tool for epidemiological studies of tuberculosis.
J. Clin. Microbiol.
31:2446-2450[Abstract/Free Full Text].
|
| 4.
|
Coronado, V. G.,
C. M. Beck-Sague,
M. D. Hutton,
B. J. Davis,
P. Nicholas,
C. Villareal,
C. L. Woodley,
J. O. Kilburn,
J. T. Crawford,
T. R. Freiden,
R. L. Sinkowitz, and W. R. Jarvis.
1993.
Transmission of multidrug-resistant Mycobacterium tuberculosis among persons with human immunodeficiency virus infection in an urban hospital-epidemiologic and restriction fragment length polymorphism analysis.
J. Infect. Dis.
168:1052-1055[Medline].
|
| 5.
|
Dale, J. W.
1995.
Mobile genetic elements in mycobacteria.
Eur. Resp. J.
20:633S-648S.
|
| 6.
|
Doran, T. J.,
M. Tizard,
D. Millar,
J. Ford,
N. Sumar,
M. Loughlin, and J. Hermon-Taylor.
1997.
IS900 targets translation signals in Mycobacterium avium subsp. paratuberculosis to facilitate the expression of its hed gene.
Microbiology
143:547-552[Abstract].
|
| 7.
|
Dwyer, B.,
K. Jackson,
K. Raios,
A. Sievers,
E. Wilshire, and B. Ross.
1993.
DNA restriction fragment analysis to define an extended cluster of tuberculosis in homeless men and their associates.
J. Infect. Dis.
167:490-494[Medline].
|
| 8.
|
Fang, Z., and K. J. Forbes.
1997.
A Mycobacterium tuberculosis IS6110 preferential locus for insertion into the genome.
J. Clin. Microbiol.
35:479-481[Abstract].
|
| 9.
|
Frieden, T. R.,
C. L. Woodley,
J. T. Crawford,
D. Lew, and S. M. Dooley.
1996.
The molecular epidemiology of tuberculosis in New York City: the importance of nosocomial transmission and laboratory error.
Tubercle Lung Dis.
77:407-413[Medline].
|
| 10.
|
Hermans, P. W. M.,
D. van Soolingen,
E. M. Bik,
P. E. W. de Haas,
J. W. Dale, and J. D. A. van Embden.
1991.
Insertion element IS987 from Mycobacterium bovis BCG is located in a hot-spot integration region for insertion elements in Mycobacterium tuberculosis complex strains.
Infect. Immun.
59:2695-2705[Abstract/Free Full Text].
|
| 11.
|
Jereb, J. A.,
D. R. Burwen,
S. W. Dooley,
W. H. Haas,
J. T. Crawford,
L. J. Geiter,
M. B. Edmond,
J. N. Dowling,
R. Shapiro,
A. W. Pasculle,
S. L. Shanahan, and W. R. Jarvis.
1993.
Nosocomial outbreak of tuberculosis in a renal transplant unit: application of a new technique for restriction fragment length polymorphism analysis of Mycobacterium tuberculosis isolates.
J. Infect. Dis.
168:1219-1224[Medline].
|
| 12.
|
Small, P. M.,
N. B. McClenny,
S. P. Singh,
G. K. Schoolnik,
L. S. Tompkins, and P. A. Mickelsen.
1993.
Molecular strain typing of Mycobacterium tuberculosis to confirm cross-contamination in the mycobacteriology laboratory and modification of procedures to minimize occurrence of false-positive cultures.
J. Clin. Microbiol.
31:1677-1682[Abstract/Free Full Text].
|
| 13.
|
van Embden, J. D. A.,
M. D. Cave,
J. T. Crawford,
J. W. Dale,
K. D. Eisenach,
B. Gicquel,
P. W. M. Hermans,
C. Martin,
R. McAdam,
T. Shinick, and P. M. Small.
1993.
Strain identification of Mycobacterium tuberculosis by DNA fingerprinting: recommendations for a standard methodology.
J. Clin. Microbiol.
31:406-409[Abstract/Free Full Text].
|
| 14.
|
Yang, Z. H.,
I. Mtoni,
M. Chonde,
M. Mwasekaga,
K. Fuursted,
D. S. Askgard,
J. Bennedsen,
P. E. W. de Haas,
D. van Soolingen,
J. D. A. van Embden, and A. B. Andersen.
1995.
DNA fingerprinting and phenotyping of Mycobacterium tuberculosis isolates from human immunodeficiency virus (HIV)-seropositive and HIV-seronegative patients in Tanzania.
J. Clin. Microbiol.
33:1064-1069[Abstract].
|
Journal of Clinical Microbiology, May 1998, p. 1410-1413, Vol. 36, No. 5
0095-1137/98/$04.00+0
Copyright © 1998, American Society for Microbiology. All rights reserved.
This article has been cited by other articles:
-
Vishnoi, A., Roy, R., Bhattacharya, A.
(2007). Comparative analysis of bacterial genomes: identification of divergent regions in mycobacterial strains using an anchor-based approach. Nucleic Acids Res
35: 3654-3667
[Abstract]
[Full Text]
-
Mathema, B., Kurepina, N. E., Bifani, P. J., Kreiswirth, B. N.
(2006). Molecular Epidemiology of Tuberculosis: Current Insights. Clin. Microbiol. Rev.
19: 658-685
[Abstract]
[Full Text]
-
Lari, N., Rindi, L., Sola, C., Bonanni, D., Rastogi, N., Tortoli, E., Garzelli, C.
(2005). Genetic Diversity, Determined on the Basis of katG463 and gyrA95 Polymorphisms, Spoligotyping, and IS6110 Typing, of Mycobacterium tuberculosis Complex Isolates from Italy. J. Clin. Microbiol.
43: 1617-1624
[Abstract]
[Full Text]
-
Alland, D., Whittam, T. S., Murray, M. B., Cave, M. D., Hazbon, M. H., Dix, K., Kokoris, M., Duesterhoeft, A., Eisen, J. A., Fraser, C. M., Fleischmann, R. D.
(2003). Modeling Bacterial Evolution with Comparative-Genome-Based Marker Systems: Application to Mycobacterium tuberculosis Evolution and Pathogenesis. J. Bacteriol.
185: 3392-3399
[Abstract]
[Full Text]
-
Dale, J. W., Al-Ghusein, H., Al-Hashmi, S., Butcher, P., Dickens, A. L., Drobniewski, F., Forbes, K. J., Gillespie, S. H., Lamprecht, D., McHugh, T. D., Pitman, R., Rastogi, N., Smith, A. T., Sola, C., Yesilkaya, H.
(2003). Evolutionary Relationships among Strains of Mycobacterium tuberculosis with Few Copies of IS6110. J. Bacteriol.
185: 2555-2562
[Abstract]
[Full Text]
-
Fleischmann, R. D., Alland, D., Eisen, J. A., Carpenter, L., White, O., Peterson, J., DeBoy, R., Dodson, R., Gwinn, M., Haft, D., Hickey, E., Kolonay, J. F., Nelson, W. C., Umayam, L. A., Ermolaeva, M., Salzberg, S. L., Delcher, A., Utterback, T., Weidman, J., Khouri, H., Gill, J., Mikula, A., Bishai, W., Jacobs, W. R. Jr., Venter, J. C., Fraser, C. M.
(2002). Whole-Genome Comparison of Mycobacterium tuberculosis Clinical and Laboratory Strains. J. Bacteriol.
184: 5479-5490
[Abstract]
[Full Text]
-
Kivi, M., Liu, X., Raychaudhuri, S., Altman, R. B., Small, P. M.
(2002). Determining the Genomic Locations of Repetitive DNA Sequences with a Whole-Genome Microarray: IS6110 in Mycobacterium tuberculosis. J. Clin. Microbiol.
40: 2192-2198
[Abstract]
[Full Text]
-
Supply, P., Lesjean, S., Savine, E., Kremer, K., van Soolingen, D., Locht, C.
(2001). Automated High-Throughput Genotyping for Study of Global Epidemiology of Mycobacterium tuberculosis Based on Mycobacterial Interspersed Repetitive Units. J. Clin. Microbiol.
39: 3563-3571
[Abstract]
[Full Text]
-
Fang, Z., Kenna, D. T., Doig, C., Smittipat, D. N., Palittapongarnpim, P., Watt, B., Forbes, K. J.
(2001). Molecular Evidence for Independent Occurrence of IS6110 Insertions at the Same Sites of the Genome of Mycobacterium tuberculosis in Different Clinical Isolates. J. Bacteriol.
183: 5279-5284
[Abstract]
[Full Text]
-
Barlow, R. E. L., Gascoyne-Binzi, D. M., Gillespie, S. H., Dickens, A., Qamer, S., Hawkey, P. M.
(2001). Comparison of Variable Number Tandem Repeat and IS6110-Restriction Fragment Length Polymorphism Analyses for Discrimination of High- and Low-Copy-Number IS6110 Mycobacterium tuberculosis Isolates. J. Clin. Microbiol.
39: 2453-2457
[Abstract]
[Full Text]
-
Sola, C., Ferdinand, S., Mammina, C., Nastasi, A., Rastogi, N.
(2001). Genetic Diversity of Mycobacterium tuberculosis in Sicily Based on Spoligotyping and Variable Number of Tandem DNA Repeats and Comparison with a Spoligotyping Database for Population-Based Analysis. J. Clin. Microbiol.
39: 1559-1565
[Abstract]
[Full Text]
-
Mazars, E., Lesjean, S., Banuls, A.-L., Gilbert, M., Vincent, V., Gicquel, B., Tibayrenc, M., Locht, C., Supply, P.
(2001). High-resolution minisatellite-based typing as a portable approach to global analysis of Mycobacterium tuberculosis molecular epidemiology. Proc. Natl. Acad. Sci. USA
98: 1901-1906
[Abstract]
[Full Text]
-
de Boer, A. S., Kremer, K., Borgdorff, M. W., de Haas, P. E. W., Heersma, H. F., van Soolingen, D.
(2000). Genetic Heterogeneity in Mycobacterium tuberculosis Isolates Reflected in IS6110 Restriction Fragment Length Polymorphism Patterns as Low-Intensity Bands. J. Clin. Microbiol.
38: 4478-4484
[Abstract]
[Full Text]
-
Gillespie, S. H., Dickens, A., McHugh, T. D.
(2000). False Molecular Clusters due to Nonrandom Association of IS6110 with Mycobacterium tuberculosis. J. Clin. Microbiol.
38: 2081-2086
[Abstract]
[Full Text]
-
Dale, J. W., Nor, R. M., Ramayah, S., Tang, T. H., Zainuddin, Z. F.
(1999). Molecular Epidemiology of Tuberculosis in Malaysia. J. Clin. Microbiol.
37: 1265-1268
[Abstract]
[Full Text]