Previous Article | Next Article ![]()
Journal of Clinical Microbiology, March 2003, p. 1316-1321, Vol. 41, No. 3
0095-1137/03/$08.00+0 DOI: 10.1128/JCM.41.3.1316-1321.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Mycotic Diseases Branch, Division of Bacterial and Mycotic Diseases, National Center for Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia,1 Johns Hopkins University School of Hygiene and Public Health, Baltimore, Maryland,2 University of Pittsburgh Graduate School of Public Health, Pittsburgh, Pennsylvania ,3 Yale University School of Medicine, New Haven, Connecticut4
Received 22 October 2002/ Returned for modification 27 November 2002/ Accepted 15 December 2002
| ABSTRACT |
|---|
|
|
|---|
| TEXT |
|---|
|
|
|---|
This species is diploid with a primarily clonal mode of reproduction. However, there is evidence for a low level of recombination (perhaps meiotic) between strains, and within a single strain, there are relatively high rates of nonhomologous mitotic recombination, translocation, and aneuploidy (3, 12). Nevertheless, population structure studies, based on cladistic analyses using randomly amplified DNA polymorphisms (25), single-nucleotide polymorphisms (31), and variable nucleotide repeats, such as microsatellites (4), have demonstrated a tripartite division among U.S. strains. In addition, there is recent evidence to suggest that, on a global level, there are additional strain types that are genetically distinct from these three previously described groups (7, 28).
Between 1992 and 1993, the Centers for Disease Control and Prevention (CDC) conducted a population-based surveillance for candidemia in the Atlanta, Ga., metropolitan area and in the San Francisco, Calif., area (13). The underlying illnesses varied, with approximately 8 to 10% of patients infected with human immunodeficiency virus (13). A total of 100 isolates of C. albicans collected during this surveillance study (one isolate per patient) were randomly selected for a substudy in which they were initially characterized by a limited number of genetic markers to identify possible genetic variation by location and/or time (15). From this set, 25 to 30 isolates from each of the two sites were selected to represent the observed genetic diversity in the larger population and further characterized using a larger number of microsatellite and single-nucleotide polymorphism markers (16).
The five microsatellites that have been employed are briefly described below. The CEF3 locus (named for Candida elongation factor 3) was originally described by Bretagne et al. (2) and is a compound microsatellite consisting of TTC/TTTC repeats. For all five loci, alleles are assigned by the size of the amplified product (in base pairs). Because of the compound nature of the repeat, CEF3 alleles can vary by 1 bp (2). The ERK locus (located in a protein kinase gene) has been analyzed by Field et al. (6) and is another compound repeat of the motif (CAGGCT)a(CAAGCT)b. Like CEF3, there are a relatively large number of alleles that vary by one nucleotide (6). ZNF1 is located in the coding region of the zinc finger (transcriptional factor) gene and is a simple CAA repeat (ZNF1 alleles vary by 3 bp) (14). Likewise, A3 and A4 are anonymous loci that were found by searching the C. albicans genome database (www.stanford.edu) and are simple repeats of TAA and GAA, respectively (16; Candida albicans genome sequencing project of the Stanford DNA Sequencing and Technology Center [http://www-sequence.stanford.edu/group/candida]).
Between October 1998 and September 2000, CDC conducted a second population-based active surveillance in Baltimore, Md., and in the state of Connecticut (R. A. Hajjeh, A. Sofair, L. Harrison, G. M. Lyon, M. E. Brandt, B. A. Arthington-Skaggs, J. Morgan, S. Mirza, M. Phelan, L. Thomson-Sanza, S. Huie, L. Yeo, M. Pass, and D. W. Warnock, Abstr. 39th Annu. Meet. Infect. Dis. Soc. Am., abstr. 641, 2001). The five microsatellite loci described above (16) were used to analyze 93 C. albicans isolates from this study. In this paper, we present a comparison of genetic diversity data for C. albicans isolates from two time periods and four geographical locations in the United States.
Isolates and storage. A total of 439 incident bloodstream isolates of C. albicans were collected between 1992 and 1993, while 429 were collected between 1998 and 2000. Species identification was confirmed at the CDC using standard phenotypic methods including API 20C biochemical profiles and morphological appearance on Dalmau cornmeal agar plates. Isolates were prescreened to exclude Candida dubliniensis by the presence of a nontruncated copy of IS1, using C. albicans IS1 as a size control (15). All strains were stored at -70°C as 30% glycerol stocks in 1x phosphate-buffered saline. For this study, 57 C. albicans isolates (29 from Atlanta, Ga., and 28 from San Francisco, Calif.; one isolate per patient) were randomly selected from the surveillance population collected from 1992 to 1993, and 93 (46 from Connecticut and 47 from Baltimore, Md.) were from the 1998 to 2000 survey.
DNA extractions. DNA isolation was performed from 10-ml yeast-peptone-dextrose (YPD) cultures as described previously (16). For IS1 sizing, PCR was performed in a volume of 25 µl and consisted of 35 cycles (1 cycle was 30 s at 95°C, 30 s at 55°C, and 30 s at 72°C), followed by a 4-min extension step at 72°C. The entire reaction mixture was loaded and run on 3:1 Nusieve agarose gels (FMC Bioproducts, Rockland, Maine).
Microsatellite analysis. PCR was performed in a 25-µl volume as previously described (16) using carboxyfluorescine (FAM)- or tetrachloro-6-carboxyfluorescine (TET)-labeled forward primers. All primer sequences have been described previously (16). PCRs were performed individually but pooled for sizing. All PCRs were run for 35 cycles, with 1 cycle consisting of 30 s at 95°C, 30 s at 55°C, and 30 s at 72°C, with a 4-min extension at 72°C. For FAM- and TET-labeled reaction mixtures, 0.5 µl each was placed in 25 µl of formamide (Applied Biosystems Inc. [ABI], Foster City, Calif.) with 0.5 µl of TAMRA 500-labeled size standard (ABI) and denatured prior to analysis. PCR products were sized by an ABI 310 genetic analyzer (ABI) using Genescan software (ABI) with no split-peak correction (16).
Population analysis.
The microsatellite results for 1998 to 2000 were compared to data obtained from the 1992 to 1993 surveillance population. Genetic analysis was performed using GenePop (GenePop website, wbiomed.curtin.edu.au/genepop) (27) and Popgene (33). Data input files were created for diploid populations; each allele was assigned a two-digit number. For genic and genotypic differentiation, algorithms developed for diploid populations undergoing meiotic recombination were used (9, 27). Although there is little direct evidence for mating in C. albicans, this approach was used, assuming that mating does occur and is more conservative in rejecting the null hypothesis of genetic homogeneity than standard
2 statistics (11).
Microsatellite allelic distributions and frequencies. Microsatellite allelic distributions and frequencies were determined for isolates from Connecticut and Baltimore, Md., and compared to data from the 1992 to 1993 surveillance collection (Atlanta, Ga., and San Francisco, Calif.) for a total of 150 isolates (nalleles = 300). The results are given in Table 1. For the CEF3 locus, we found one new allele at bp 120. To date, this is the smallest allele observed at this locus. For ERK, we found four new alleles at bp 222, 235, 237, and 246. For locus A4, we found three new alleles at bp 111, 126, and 132. No additional alleles were found for the A3 or ZNF locus. For each locus, we observed from one to three predominant alleles in the population as previously described (16). With the possible exception of the A4 locus, allelic frequencies did not appear normally distributed around a mean allelic size by a visual inspection. This was particularly evident in the compound loci CEF3 and ERK. In each case, we observed predominant allelic combinations (genotypes). As previously shown using coalescence (16), these combinations correspond to subgroups I (IA) and II, comprising the majority (approximately 75%) of the population. The remaining alleles were found in low numbers, and these genotypes were often represented by one strain each.
|
2 and G2 (G is log likelihood) likelihood ratio tests (11). In these cases, the null hypothesis is that the allelic frequencies are in equilibrium and is rejected when P
0.05 (27). We observed no differences in the significance levels for
2 from those previously published; all were significantly less than 0.05 (16). For the more-conservative G2 test, the ZNF and ERK values that were previously >0.05 (16) were found to be 0.008 and 0.01, respectively. Thus, there is no evidence for Hardy-Weinberg equilibrium in any of the five loci tested.
Likelihood tests (9, 27) were performed to determine if the allelic distribution is identical across populations (genic differentiation) and if the genotypic distribution is identical across populations (genotypic differentiation). These tests estimate the P value of a G-based exact test and in principle are the same as a probability test (or Fisher exact test) (9, 27). In each case, the null hypothesis is that the distribution is the same across populations. For genic differentiation, we observed that the A3, CEF3, and ERK loci exhibited P values of <0.05 (i.e., populations were significantly different), while the analysis of A4 and ZNF loci suggested that the populations were the same. Thus, from allelic frequency alone, we could not conclude that the populations were homogeneous. For genotypic differentiation, the results of a pairwise analysis across the four populations are shown in Fig. 1. For a total of 30 pairwise combinations, we observed that 23 of 30 (76%) of the comparisons suggested that the populations are the same (P
0.05). Using Bonferroni's correction for multiple tests on the same data, with P = 0.05/30 = 0.0016, all but two pairwise combinations were the same. For comparisons of geographical locations studied during the same time period (e.g., comparing Connecticut to Maryland data and comparing Georgia to San Francisco, Calif., data), we found that 100% of the comparisons could not reject the null hypothesis. Therefore, we conclude that there is no evidence for a shift in allelic frequencies between the two studies and that they effectively represent a single U.S. population of this type. The combined total frequencies (± standard deviations) are given in Table 1. In a calculation of the number of migrants (Nm) using private alleles (32), the calculated Nm is 13.9. This is a relatively large value, suggesting high rates of migration between the four cities sampled.
|
In this study, we found one new rare allele in the compound locus CEF3 and four new rare alleles in ERK in the 93 isolates from the 1998 to 2000 survey. In addition, CEF3 and ERK each have two predominant alleles per locus and predominant genotypes that do not vary across the four populations surveyed. Considering the large number of alleles per locus (15 for CEF3 and 13 for ERK) and the discovery of new alleles in this study, we suspect that additional rare alleles will be uncovered as sample size increases. At present, it is unknown whether alleles of identical size are also identical in sequence, and there is evidence in the case of ERK that this may not be true (18). In the case of locus A4, we observed three new alleles and therefore suspect that, like CEF3 and ERK, there might be additional rare alleles uncovered in larger populations. We found no new alleles for A3 and ZNF; because we genotyped 93 isolates from the 1998 to 2000 survey, we suspect that there will be few additional, very rare alleles discovered for A3 and ZNF as sample size increases. These new observations give additional support for a departure from Hardy-Weinberg equilibria for all five loci.
In this study, we found genetic homogeneity among these four populations. The homogeneity found may be caused by several factors: movement of alleles between different genotypes of C. albicans due to recombination factors intrinsic to C. albicans, movement of the humans who are the hosts for the yeast, or selection of genotypes suited to the bloodstream environment. Sexual reproduction would need to be extensive to be the sole factor accounting for the homogeneity observed in these populations. While fusion of different cells is possible (12), C. albicans populations appear to be predominantly clonal (7, 8, 31). C. albicans frequently colonizes the host at or near birth, and individuals typically carry one or a few strains (21), so there is limited opportunity for sexual recombination within any given human host. These strains can persist over time, sometimes lasting years before recolonization of the host by other strains. The species is not free-living, and there is little information on other natural reservoirs (19). In addition, there is no evidence for horizontal transmission of genes.
The majority of genetic differentiation in C. albicans populations may be due to migrations throughout the evolutionary past and recent history of the natural human host population and not through other natural reservoirs. Due to high mutation rates, microsatellites are thought to be more useful in detecting recent migration events. The Atlanta and San Francisco human host populations were previously compared to populations obtained from other continents and found to have a high migration rate between the two U.S. sites, but the populations were genetically different from those from other sites worldwide (8). One possible interpretation of this study is that there has been migration among the four U.S. host populations examined (supported by an Nm of 13.9 for C. albicans). As observed in other systems, such migration leads to genetic homogeneity (29).
Another possibility is that these five loci are in genetic equilibrium by selection-mutation balance and that any other defined bloodstream population in the United States will display the same frequencies. Despite high mutation rates and number of alleles per locus common to microsatellites, a limited number of genotypes were found for CEF3 and ERK. This is consistent with selection at these loci. It will be of interest to compare these data with those derived from commensal isolates obtained from healthy individuals to determine if a characteristic profile of isolates from bloodstream infections exists. Other population genetic studies suggest a difference between "normal" commensal strains and strains from individuals infected with human immunodeficiency virus (17; M. Tibayrenc, Letter, Trends Microbiol. 5:253-254, 1997; R. Vilgalys, Y. Graser, and T. G. Mitchell, Author's Reply, Trends Microbiol. 5:254-257, 1997). Significantly, our data suggest that the isolates obtained from these selected U.S. sites may be representative of the overall U.S. population of isolates from bloodstream infections.
| ACKNOWLEDGMENTS |
|---|
R.E.F. was supported through an American Society for Microbiology (ASM) postdoctoral research fellowship through the National Center for Infectious Diseases.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Antimicrob. Agents Chemother. | Clin. Microbiol. Rev. |
|---|---|
| Clin. Vaccine Immunol. | ALL ASM JOURNALS |
|---|