Previous Article | Next Article ![]()
Journal of Clinical Microbiology, December 2004, p. 5658-5663, Vol. 42, No. 12
0095-1137/04/$08.00+0 DOI: 10.1128/JCM.42.12.5658-5663.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
,
Beth M. Mole,1,
,
Daiva Dailidiene,2
Issy Segal,3
Reid Ally,3
Rajesh Mistry,3
Ousman Secka,4
Richard A. Adegbola,4
Julian E. Thomas,5
Erik M. Lenarcic,1
Richard M. Peek Jr.,6,7
Douglas E. Berg,2* and
Mark H. Forsyth1
Department of Biology, College of William and Mary, Williamsburg, Virginia,1 Department of Molecular Microbiology, Washington University Medical School, St. Louis, Missouri,2 African Institute of Digestive Disease, Chris Hani Baragwanath Hospital, Soweto, Johannesburg, South Africa,3 Medical Research Council Laboratories, Fajara, The Gambia,4 Royal Victoria Infirmary, Newcastle upon Tyne, United Kingdom,5 Departments of Gastroenterology and Cancer Biology, Vanderbilt University Medical Center,6 Veterans Administration Medical Center, Nashville, Tennessee7
Received 19 July 2004/ Returned for modification 10 August 2004/ Accepted 30 August 2004
|
|
|---|
|
|
|---|
Extensive geographic differences have been found in DNA sequences from H. pylori strains from different human populations (11, 14, 20, 24, 41). This led to a proposal that historical patterns of human migration and of racial admixture can be predicted from DNA sequences of H. pylori housekeeping genes (14, 41). An ability to discern H. pylori lineages from distinct human populations should also have medical relevance, because disease outcomes associated with H. pylori infection vary geographically (22, 27, 28, 38, 39), and some of this might stem from genetic differences between H. pylori populations (25, 26, 37).
More than 80% of Africans are infected with H. pylori, but their gastric cancer rates are lower than might be predicted based on this high prevalence (6, 22, 25-28, 38, 39). Infection rates in North America and Europe are often less than half of those in Africa, but yet distal gastric carcinoma is more common in these regions (22, 38, 39), a discordance referred to as the "African Enigma" (22). Gastric cancer rates also vary by country in Asia (29). Those in Japan and Korea are remarkably high (male, 82.7/100,000, and female, 32.8/100,000), some 10- to 20-fold higher than in Thailand and 100-fold higher than in India (29), whereas H. pylori seroprevalences in these south Asian countries are higher than those of Japan and Korea. This discordance is referred to as the "Asian Enigma" (29). These discordances have been ascribed variously to societal, human, bacterial genetic, and environmental factors, including concurrent helminth infections (the frequency of which varies among societies) (18, 25, 26, 27).
Although overall rates of gastric adenocarcinoma have decreased in the United States over the last 60 years (2), those in African-American, Hispanic, and Native American populations remain two to three times those in the Caucasian population (15). Again, much of this difference may stem from environmental, societal, or human genetic factors, but we infer that H. pylori genetic differences could also be important, i.e., disease outcomes might be affected by population differences in gene content or gene regulation in predominant H. pylori strains. Here we characterize 180-bp and 480-bp sequences (ins180 and ins480, respectively) found just downstream of a highly conserved but function-unknown gene and near a promoter of two-component signal transduction genes in a subset of H. pylori strains, positions that might allow it to affect expression of either the upstream or downstream genes. The 180-bp insert, in particular, was more common in strains from Africans and African Americans than in those from other human populations. ins480 was found only in an unusual outlier subset of South African strains.
|
|
|---|
Among Lithuanian isolates (n = 20), 18 contained the cag pathogenicity island (PAI) and two did not. Twelve of the Lithuanian source patients were diagnosed with gastritis, six were diagnosed with duodenal ulcer, and two were diagnosed with gastric ulcer. Isolates from India (n = 20) each contained the cag PAI and were each from patients diagnosed with duodenal ulcer. All Japanese isolates (n = 20) contained the cag PAI and included strains from patients with gastric carcinoma (n = 4) and gastritis only (n = 4). All 29 African strains (9 Gambian and 20 South African) were from black Africans with gastric complaints. Four of nine Gambian patients had duodenal ulcer, and 2 of 20 South African patients had gastric or prepyloric ulcers. In addition, seven of nine Gambian and 14 of 20 South African strains contained the cag PAI and the other two Gambian and six South African strains did not. North American H. pylori isolates used in this study included 30 that contained and 24 that lacked the cag PAI. Included were 18 strains from African Americans. Clinical diagnoses of source patients included gastritis, gastric ulcer, gastric erosions, duodenal ulcer, esophagitis, and Barrett's esophagus. Spanish isolates (n = 40) were from patients with gastric complaints, and 24 strains contained the cag PAI and 16 did not.
Molecular biology techniques. Genomic DNA was extracted using cetyltrimethylammonium bromide (CTAB), a cationic detergent, essentially as described previously (5). Briefly, 24- to 48-h H. pylori cultures were harvested from solid medium into 0.9% NaCl solution, and cell lysis was accomplished by addition of sodium dodecyl sulfate to a final volume of 0.5% and incubation at 37°C for 30 min. A 5 M concentration of NaCl was added to a final concentration of 725 mM, followed by 0.144 volume of 10% CTAB-0.7 M NaCl detergent. Bacterial lysates were phenol-chloroform and chloroform extracted, and genomic DNA was isopropanol precipitated using 0.86 volume of isopropanol, washed with 70% ethanol, and resuspended in 10 mM Tris, pH 8.5. PCR was performed using Taq DNA polymerase (Promega) and manufacturer-provided reaction buffer. Reaction mixtures consisted of 1x Mg2+-free polymerase reaction buffer, 2 mM MgCl2, 0.8 mM deoxynucleoside triphosphates (200 nM dATP, 200 nM dTTP, 200 nM dGTP, and 200 nM dCTP), 400 ng of each oligonucleotide, 200 ng of template DNA, and 1.25 U of Taq DNA polymerase in a final reaction volume of 50 µl. Amplification conditions consisted of a 5-min hot start and 25 to 30 cycles as follows: 94°C for 30 s, 56°C for 30 s, and 72°C for 1 min. The oligonucleotide primers used to amplify ins180-ins480 were insFwd, 5'-GTGGCGCGTTTCTTGCAATACC-3', and insRev, 5'-AACTCGCTCAAAAACTCGGC-3'. Cloning of PCR amplicons was accomplished using pGEM-T Easy and T4 DNA ligase (Promega). Plasmids were prepared using Qiagen spin columns, and restriction enzyme digestions were performed according to standard protocols (4). PCR analysis of the intergenic region of jhp0153 to jhp0152 was performed on 164 different H. pylori clinical isolates to determine the prevalence of the 180-bp and 480-bp insertions. One hundred forty-nine of these strains were from a global collection, and the others were from age- and gender-matched African-American (n = 18) and Caucasian (n = 18) patients in Tennessee (Table 1). Before running these tests, the researcher carrying them out was unaware of the origin of each of these 36 isolates.
|
View this table: [in a new window] |
TABLE 1. Prevalence of ins180 and ins480 in the jhp0153-jhp0152 intergenic region among a panel of H. pylori clinical isolates from geographically and ethnically diverse source patients
|
Nucleotide sequence accession number. The sequences of ins180 and ins480 elements, determined here, have been deposited in GenBank (accession numbers AY731183, AY731182, and AY669071).
|
|
|---|
1 x 104) at 13 sites in the J99 genome (Fig. 1) and five sites in the 26695 genome (two of them at sites also identified in J99, downstream of jhp0643 and jhp1211 in Fig. 1). Every one of the 16 sequences so identified was between genes, not within them. Seven of the 13 in strain J99 were between 3' ends of convergently transcribed genes, five were between the 3' and 5' ends of neighboring genes transcribed in the same direction (as with canonical ins180 in the jhp0153-jhp0152 site), and one was between the 5' ends of divergently transcribed genes.
![]() View larger version (23K): [in a new window] |
FIG. 1. ins180 contains a highly repetitive element. A 99-bp subsequence of ins180 is shown. This sequence, internal to ins180, contains all subsequences found in 13 other intergenic regions of H. pylori J99 and which along with ins180 are represented as shaded boxes between adjacent open reading frames. The numbers above each box represent the length (in base pairs) of ins180 subsequence found at each location. Each ins180 subsequence region was identified by BLAST search, and each had an E value of 1 x 104. Typical percent homologies between these subsequences and ins180 were from 78 to 100. All genes with predicted functions are designated with three-letter gene designations within parentheses. Those genes without such designations were predicted to encode hypothetical proteins (1).
|
![]() View larger version (17K): [in a new window] |
FIG. 2. Sequence comparison of response regulator promoter regions from H. pylori strains 26695, J99, and Afr7. Matched shadings denote sequence similarities of predicted coding regions, and patterned regions illustrate extragenic sequence homology. Asterisks indicate unsequenced regions where nucleotide homology is unknown. Black arrows indicate transcription start points for the response regulator (RR)-histidine kinase (HK) operon that was previously determined in H. pylori strain 26695 (10, 17). Predicted transcription start points in H. pylori strains Afr7 and J99 are based upon sequence homology with the orthologous region in strain 26695. White half-arrows represent binding sites for primers insFwd and insRev. The amplicon without ins180 is 350 bp, and amplicons with ins180 and ins480 are 530 and 830 bp long, respectively.
|
![]() View larger version (49K): [in a new window] |
FIG. 3. ins180 is more frequent among H. pylori isolates from African Americans than from Caucasians. A PCR survey for ins180 in the response regulator jhp0152 promoter region from 36 H. pylori strains from both African Americans (n = 18) and Caucasians (n = 18) was performed. The jhp0153-jhp0152 intergenic region amplicon without ins180 is 350 bp, while the amplicons possessing ins180 are 530 bp. Intergenic regions possessing ins480 yield amplicons of 830 bp. (A) Lane 1, H. pylori strain Afr7 (containing ins480); lane 2, strain 26695 (no ins180 or ins480); lane 3, strain J99 (containing ins180); lane 4, 100-bp ladder. (B and C) H. pylori strains derived from18 African Americans and 18 U.S. Caucasians, respectively.
|
Sequence analysis of ins180 and ins480. Amplicons that putatively contained ins180 from one North American strain (B166) and one South African strain (Afr82) were sequenced. The jhp0152-jhp0153 intergenic regions of these strains were 90 to 95% identical to that of H. pylori strain J99 (GenBank accession numbers AY731182 and AY731183), indicating that the larger (530-bp) amplicons from North American and African strains do indeed reflect presence of the same 180-bp insertion. These ins180 elements seemed to be inserted at the same site as that occupied by ins180 in strain J99 (just downstream of the translation stop of hp0167) and contained a potential 41-codon open reading frame, which may not, however, be an authentic coding region, because it lacks a consensus Shine-Dalgarno sequence and would start translation with a UUG codon.
As noted above, 4 of 20 South African strains (each of an unusual outlier group) carried a 0.5-kb insertion between jhp0153 and jhp0152 that was not found in any of the strains from other regions that we tested. Sequence analysis of this region from one such strain, Afr7, revealed an insertion of a unique 480-bp segment in this strain's intergenic region, at the same site as ins180 (Fig. 2) (GenBank accession number AY669071). This ins480 segment is not found in either of the fully sequenced genomes (strain 26695 or J99), although an
80-bp internal sequence exhibits 70% identity with a segment of ins180 (data not shown). A search for open reading frames in ins480 revealed several small (160- to 190-bp) possible open reading frames with no significant DNA or protein-level homology to other current database entries.
ins180 is more frequent in H. pylori isolates from African Americans than in those from Caucasian Americans. Given the presence of ins180 in each of our nine West African strains and also its presence in nearly half of the South African strains, we tested for this or other characteristic segments in this region in H. pylori from infected African-American patients, relative to isolates from age- and gender-matched Caucasian-American patients (n = 18 in each group). Thirteen of 36 (36%) strains yielded amplicons of approximately 530 bp, indicating the presence of ins180, whereas the other 23 (64%) yielded amplicons of 350 bp, indicating a simple empty site (Fig. 3 and Table 1). According to sorting of these data by patient ethnicity, 10 of 18 (56%) strains from African Americans versus 3 of 18 (17%) strains from other patients carried ins180 (P < 0.05) (Fig. 3B and C and Table 1). There was no correlation between the presence or absence of ins180 and carriage of the cag PAI or patient disease outcome (data not shown).
|
|
|---|
A BLAST search with the ins180 sequence identified conserved subsequences at 5 and 13 other sites in the genomes of the two sequenced H. pylori strains. Others have reported that
78% of the H. pylori genome was well conserved in a set of 15 strains (37). This represents
1,300 genes, most specifying metabolic and cellular functions. Many of the other nonconserved genes encoded DNA restriction and modification enzymes, transposases, and cell surface proteins (37). H. pylori strains also exhibit marked differences in gene arrangement (mostly due to inversion) and extensive point mutation (1, 19). Given the repetitive nature of ins180 and related sequences, we suggest that these elements constitute substrates for homologous recombination, which occurs readily in short repetitive sequences in H. pylori (4).
The ins180 sequence was nonrandomly distributed geographically (Table 1). Each of nine strains from The Gambia in West Africa, nearly half of South African strains, and one-fourth of Spanish strains contained ins180, whereas only a few of the other isolates that we first surveyed (all North American) carried this element. Also of note was the larger ins480 element, which was inserted at the same site and was found in an additional 4 of 29 South African strains (14%). Each of these four was of an unusual outgroup that is distinct from mainstream H. pylori strains of Africa or other continents (9; Dailide and Berg, unpublished). Reinforcing the view that such polymorphisms can be useful in identifying geographic origins of different H. pylori strains, 10 of 18 H. pylori isolates from African-American patients but only 3 of another 18 isolates from a matched Caucasian-American patient cohort carried this ins180 sequence. This is interpreted as reflecting H. pylori's predominately intrafamilial transmission (3, 24, 36, 40) but is still most remarkable, given the multiple generations since the ancestors of most African Americans were taken from their West African homelands. In this context, it is tempting to imagine that ins180 in nearly one-fourth of Spanish strains might stem from the Moorish conquest and other movements of peoples between northwest Africa and the Iberian Peninsula (16).
The population correlations seen with ins180, in particular, make it attractive to consider that phylogenetic analyses of other loci in H. pylori strains from African Americans will also show evidence of West African origins. This may allow us to more clearly focus on bacterial versus other factors that contribute to the higher risk of gastric cancer in African Americans than in Africans. More generally, we suggest that easily scored markers such as ins180 and ins480 will be particularly useful in deciphering bacterial genetic versus other causes of particular disease associations, as epitomized by the African Enigma.
This work was supported by grants from the National Institutes of Health (AI 53062) and the Thomas F. and Kate Miller Jeffress Memorial Trust (J-602) to M.H.F. as well as grants DK 58587 and CA 77955 from the National Institutes of Health to R.M.P. and grants AI 38166, DK 53727, and DK 63041 from the National Institutes of Health to D.E.B. O.S. was the recipient of a UNESCO-ASM Travel Award. This research was also supported in part by a Howard Hughes Medical Institute grant through the Undergraduate Biological Sciences Education Program to the College of William and Mary.
S.L.M. and B.M.M. contributed equally to this work. ![]()
Present address: Department of Microbiology and Molecular Genetics, Emory University, Atlanta, Ga. ![]()
Present address: Department of Microbiology and Immunology, University of North Carolina, Chapel Hill, N.C. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»