Previous Article | Next Article ![]()
Journal of Clinical Microbiology, November 2005, p. 5601-5613, Vol. 43, No. 11
0095-1137/05/$08.00+0 doi:10.1128/JCM.43.11.5601-5613.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Institute of Medical Sciences, University of Aberdeen, Aberdeen AB25 2ZD,1 The Peter Medawar Building for Pathogen Research and Department of Zoology, University of Oxford, OX1 3SY Oxford, United Kingdom2
Received 10 May 2005/ Returned for modification 24 July 2005/ Accepted 21 August 2005
|
|
|---|
|
|
|---|
Among other approaches to strain typing of C. albicans, DNA fingerprinting based on the moderately repetitive sequence Ca3 has been widely used. By this method, C. albicans populations were shown to comprise five major clades of closely related strain types (46), including clades enriched in isolates from Europe (36) and South Africa (2, 3). Resistance of C. albicans to flucytosine in vitro was found to be a property restricted almost entirely to isolates from a single C. albicans clade as determined by Ca3 fingerprinting (37), and the sole mechanism of resistance to flucytosine within this clade was the mutation of a single nucleotide in the FUR1 gene (14).
Typing by a combination of restriction fragment polymorphisms and randomly isolated polymorphic DNA also showed geographical enrichment of groups of related isolates (10), and "ABC typing," based on the size and number of internally transcribed spacer regions (ITS1) in DNA encoding rRNA showed a higher proportion of flucytosine-resistant isolates among type A strains (26).
We have now typed more than 400 isolates of C. albicans by MLST. Our data show how MLST reveals a population structure with four major clades of related strain types plus at least eight minor clades. It confirms the predominant association of flucytosine resistance with one major clade and reveals associations between clades and their geographical, anatomic, and temporal origins.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Summary of 416 C. albicans isolates tested by MLST and their originsa
|
Recent clinical isolates of C. albicans were provided by Elizabeth M. Johnson (Mycology Reference Laboratory, Bristol, United Kingdom) and Christopher C. Kibbler (Royal Free Hospital, London, United Kingdom) from their diagnostic mycology laboratories. Fresh oral commensal isolates of C. albicans were obtained from healthy undergraduate student volunteers in Aberdeen, United Kingdom. The remainder of the isolates typed came from the senior author's collection of C. albicans isolates, which was started in the 1970s. This collection, originally stored under distilled water (29), was transferred twice in its entirety, once for an investigation of the prevalence of the species C. dubliniensis in the collection (32). Some of these isolates had been previously described in epidemiological studies of C. albicans based on phenotypic typing tests (30, 31). The quality of the collection was confirmed by the near-identity of MLST data for several isolates from the same source (details not shown).
Isolates were maintained on Sabouraud agar (Oxoid, Basingstoke, United Kingdom) for the duration of this study.
MLST.
Sequences of bases in the gene fragments AAT1a, ACC1, ADP1, MPIb, SYA1, VPS13, and ZWF1b were determined as previously described (6, 48). Isolates were designated as genotype A, B, or C on the basis of PCR for rRNA genes (26), and the status of the mating-type-like locus (MTL) in each strain was assessed as heterozygous (a/
), homozygous (a/a), or homozygous (
/
) by PCR as previously described (48).
Sequencing of other genes. Genes FCY22, FCA1, and FUR1 encode, respectively, cytosine permease, cytosine deaminase, and uracil phosphoribosyltransferase in the pathway for conversion of flucytosine to the active antifungal 5-fluorouracil in C. albicans (14). For selected C. albicans isolates, the DNA sequences of these genes were determined by the same method as that used for MLST (above). For FCY22, the forward primers used were FCY22-1 (5'AACATAATCTGTAGTTATATCTC), FCY22-2 (ATTATTGGAAGCATTCCAGTG3'), FCY22-3 (5'TCCAACTGAAGCTGGTAATG3'), and FCY22-4 (TGCCATTCAATGTATATGGAG), and the reverse primer was 5'TAACAGGGCAGAATGAATCG3'. For FCA1, the forward primers were FCA1-1 (5'AATTCTCTTTCAATTCTTCTTG3') and FCA1-2 (5'TCACCATGTAGTATGTGTAC3'), with the reverse primer being 5'TTGAACAAGATGATGATGTTG3'. For FUR1, the forward primers were FUR1-1 (5'ATAATGGAGCATCTCGCAAC3') and FUR1-2 (5'TAGCTGAAGATATCAGTGAAC3'), with the reverse primer being 5'TGTTGTACATATAATCCTATAAG3'.
Susceptibility testing. MICs of fluconazole, flucytosine, and itraconazole were determined by microdilution tests by a previously described method and spectrophotometric endpoints (33) that give MIC results compatible with those obtained by the NCCLS reference method (27). Isolates with fluconazole MICs of 16 µg/ml or higher, flucytosine MICs of 8 µg/ml or higher, or itraconazole MICs of 0.25 µg/ml or higher were classed as resistant to these agents (these breakpoints indicate full susceptibility to the agents concerned [27]).
Data analysis. Sequence data were analyzed by scrutiny of capillary sequencer output chromatograms with DNASTAR Seqman software. Each unique (diploid) sequence for each of the seven DNA fragments was given a separate identifying number. Each unique combination of seven sequences was assigned a separate diploid sequence type (DST) number. The sequence and DST identifiers were compiled on a central database (http://test1.mlst.net/).
The eBURST package (http://eburst.mlst.net/ [15]) was used to determine putative relationships between isolates. With C. albicans data, this software scans allele sequences from pairs of isolates and records isolates as related when six of the seven sequences are identical between a pair. The eBURST algorithm places all related isolates into clonal complexes and, where possible, predicts the founding, or ancestral DST of each complex. The output is a display of the most parsimonious patterns of descent of each DST from the ancestral type.
Phylogenetic analyses by unweighted pair group method with arithmetic averages (UPGMA) and neighbor-joining algorithms were conducted with MEGA version 2.1 (22) applied to modified sequence data. The analyses were based only on the results for polymorphic bases to maximize their power to discriminate between isolates. The data set of only the variable bases was suitable for pairwise-difference analysis, which has been used previously with C. albicans MLST (5, 48). To obtain sequences that could be handled by the MEGA software, which is not programmed to analyze heterozygous code data, the following procedure was used. The results for the variable sites from the seven gene fragments sequenced were concatenated into a single sequence. For any pair of isolates, each with a diploid genome, the base at each polymorphic site could be homozygous and identical between the isolates, heterozygous and identical, homozygous and different, or heterozygous in one isolate and homozygous in another. For example, the sequencing result (a IUPAC single-letter code) for a given base across a set of strains might appear as A, T, or W (= A + T). Data from the polymorphic sites from the seven C. albicans alleles were therefore conjoined into a single sequence, and then each base in the sequence was rewritten twice for a homozygous (A, C, G, or T) datum or once each for the two component bases for a heterozygous (K, M, R, S, W, or Y) datum. These revised sequences could then be analyzed to generate dendrograms in MEGA 2.1. In this form, the analysis was the functional equivalent of scoring a pair of results as 1 for homozygous or heterozygous identical data, 0 for homozygous different data, and 0.5 when one polymorphic site had a heterozygous result and the other a homozygous result and then creating a difference matrix. Use of the modified sequences, rather than of difference matrices, enabled calculation of statistical significance for the cluster nodes by bootstrapping with 1,000 replications.
Distributions of properties between C. albicans clades were determined statistically by Fisher's exact test. Results were considered significant for a P of <0.05. A univariate analysis of variance (ANOVA) was used to determine associations between clades and A, B, or C type (with 1, 2, and 3 corresponding to A, B, and C, respectively); MTL status (with 1, 2, and 3 corresponding to a/
, a/a, and
/
, respectively); fluconazole, itraconazole, and flucytosine susceptibilities (with 1 indicating susceptibility and 2 indicating resistance); decades of isolation (1 being a year prior to 1990, 2 being a year in the 1990s, and 3 being a year in the 2000s); geographical origin (1 being the United Kingdom, 2 being another European country, 3 being North America, and 4 being another country); and anatomical origin (1 being blood or other sterile site, 2 being the oropharynx, 3 being the vagina, and 4 being another location). Isolates from animals were excluded from this analysis.
Discriminatory power was calculated according to Hunter (20).
|
|
|---|
Statistics of C. albicans MLST. The 416 isolates yielded 351 distinct DSTs, with a discriminatory power index of 0.9996. Among the seven fragments sequenced, ZWF1 gave the highest discriminatory ratio, yielding 84 different sequences from just 10 polymorphic sites, followed by SYA1 (70 alleles from 14 variable sites), VPS13 (103 alleles from 21 variable sites), AAT1a (57 alleles from 13 variable sites), ACC1 (37 alleles from 10 variable sites), ADP1 (46 alleles from 18 variable sites), and MPIb (36 alleles from 21 variable sites).
Details of the alleles and DSTs for each isolate are shown in the full version of Table S1 in the supplemental material, together with the geographical and anatomical sources of the isolates. Insufficient DNA for sample BougnCP10 precluded determination of ABC and MTL type for this isolate. Among the remaining 415 isolates, 281 (67.7%) were type A, 96 (23.1%) were type B, and 38 (9.2%) were type C. The great majority of isolates (371, or 89.4%) were heterozygous at the MTL, while 27 isolates (6.5%) were of homozygous MTL type
/
and 17 isolates (4.1%) were of type a/a.
Population structure of 416 unrelated C. albicans isolates. MLST data allow for different approaches to phylogenetic analysis of various degrees of sophistication. We utilized several approaches, with an emphasis on methods based on pair differences at polymorphic nucleotide sites and on gross allele sequence differences in an effort to determine robust subspecific clades that will allow easy assignment of future isolates to the population structure.
eBURST analysis of the set of 416 C. albicans isolates from unique sources revealed 18 clonal clusters, with 5 being based on six or more DSTs (Fig. 1). Cluster 1, the largest, comprised 105 isolates (25.2% of all isolates), with DST 69 as the putative ancestral type. Clonal cluster 2, the second largest, comprised 38 isolates and 27 DSTs and no putative founding DST (type 155 was represented by 7 isolates but did not emerge as a likely ancestor for the cluster). Clonal cluster 3 comprised 28 isolates and 17 DSTs, with type 124 being the predicted founding type. Cluster 4 comprised eight isolates and seven DSTs, with type 305 being the likely ancestral DST, and cluster 5 comprised seven isolates and six DSTs, with no putative ancestral type. The remaining clonal clusters consisted maximally of six isolates.
![]() View larger version (17K): [in a new window] |
FIG. 1. Modified eBURST "snapshot" for 416 C. albicans isolates from unique sources. The figure is redrawn from the original output of the eBURST analysis to eliminate overlapping lines and numbers. The diameter of each circular spot is proportional to the number of isolates with the same DST (indicated by the number adjacent to the spot). Lines join DSTs that differ in just one of the seven fragments sequenced for MLST and thus indicate a hierarchical relationship between DSTs. Where a putative ancestral DST was determined for a cluster, the DST number has been marked with an asterisk. Each clonal cluster of related isolates is numbered in boldface type. Lengths of lines joining DSTs are of no relevance.
|
Unlike the eBURST analysis, a UPGMA pairwise-difference dendrogram for the 416 separate-source isolates (Fig. 2) took no account of the gene fragment in which sequence polymorphisms arose and therefore provided a different perspective on relationships between isolates: isolates with small numbers of differences in the concatenated sequences for variable sites across all fragments sequenced were regarded as similar by UPGMA analysis, even if the polymorphic sites were located on separate fragments. Comparison of the clustering in the UPGMA dendrogram with the eBURST clonal clusters (Fig. 2) showed clonal clusters 11 and 17 and several eBURST singletons dispersed within the broad range of isolates in clonal cluster 1. Similarly, isolates in clonal clusters 5, 10, and 13 were mixed with each other and with several eBURST singletons by UPGMA analysis.
![]() ![]() View larger version (83K): [in a new window] |
FIG. 2. UPGMA pairwise-difference dendrogram for 416 C. albicans isolates from separate sources. The dendrogram is broken into three sections, a to c, to fit the printed page. The complete dendrogram is shown at a much reduced vertical scale at the bottom right, with indications of the points at which the tree has been split. For each isolate, its clonal cluster (or singleton status) as determined by eBURST analysis is indicated, as is the result of Ca3 typing for isolates provided by D. R. Soll (Cl is the Soll clade result by UPGMA analysis) or Schmid (Gp is the Schmid group result by neighbor-joining analysis). The assignment of isolates to clades, as indicated in the third data column, was made by consideration of the eBURST and Ca3 data in conjunction with UPGMA bootstrap values as described in the text.
|
All five of the isolates from clade I as determined by Ca3 typing (46) clustered within a large group of isolates closely related by UPGMA (Fig. 2); four of the five belonged to clonal cluster 1 by eBURST analysis. Similarly, 19 of the 21 isolates designated types A1 and A2 by Ca3 fingerprinting and designated by neighbor-joining statistics (42, 43) appeared in the same large UPGMA cluster that was broadly delimited by eBURST clonal cluster 1. All 155 isolates from AM2003/0046 to AM2004/0049 in Fig. 2 were therefore provisionally assigned to clade 1 as demarcated by MLST. The five Ca3 clade II isolates and both Ca3 group B isolates correlated with a well-delineated UPGMA cluster (Fig. 2); four of these seven isolates appeared in eBURST clonal cluster 2. Isolates from AM2004/0030 to IHEM17110 in Fig. 2 were therefore provisionally assigned to MLST clade 2. Similarly, the five isolates from Ca3 clade SA clustered together in the UPGMA dendrogram, with four of the five in clonal cluster 3. However, the UPGMA cluster designated MLST clade 4 in Fig. 2 (isolates HUN68 to IHEM16731), though appearing reasonably homogeneous, contained a subcluster comprising two of the Ca3 clade E isolates, and all of the members of clonal cluster 15. The five isolates from Ca3 clade III emerged in clonal clusters 9 or 10 or as singletons by eBURST analysis, but they clustered together in the UPGMA dendrogram in a group of isolates with the same overall level of similarity as those in MLST clades 1, 2, and 4. Isolates AM2003/0082 to b30998/5 were therefore provisionally considered to belong to MLST clade 3. It was noted that isolates previously designated as group C by neighbor-joining analysis of Ca3 typing (42, 43) did not show obvious affinities with eBURST- or UPGMA-defined clusters.
The provisional assignment of isolates to clades 1 to 4 indicated a maximum dissimilarity of 10 by pairwise-nucleotide difference analysis as calculated with the MEGA software. The following definition of a clade was therefore used: a cluster of at least five isolates with a dissimilarity not exceeding 10 by pairwise-difference analysis. By this definition, 348 (83.7%) of the 416 isolates could be assigned to 12 clades, as specified in Fig. 2. Reanalysis of the MLST variable-site data only for the 348 isolates that matched the clade definition confirmed the validity of the clade assignments in Fig. 2. The bootstrap values for the nodes defining clades were 85% (clade 1), 88% (clade 2), 87% (clade 3), 86% (clade 4), and 98 or 99% for the minor clades. The exception was clade 5, for which the node bootstrap value was only 64%, while the values for the node defining the pair of isolates 81/078 and AM2003/0084 and the node for the remaining four isolates nominated for clade 5 were 97% and 99%, respectively. These results suggest that clade 5 is a less robust grouping of isolates than the other clades and may need to be revised in future as further C. albicans isolates are added to the MLST database.
To examine the population structure of C. albicans clades by an alternative statistical approach the MLST data for the 416 isolates of separate origins were reanalyzed by determination of p-distance with the unrooted neighbor-joining algorithm in the MEGA2 software package. The result (Fig. 3) confirmed most of the isolate clusters determined by pairwise-distance UPGMA analysis (Fig. 2) and reiterated that the majority of the isolates studied could be assigned to one of four major clades. However, the neighbor-joining algorithm also produced some differences in the structure from the UPGMA analysis.
![]() View larger version (41K): [in a new window] |
FIG. 3. Population structure of 416 isolates of C. albicans from separate sources, as determined by p-distance (22) in a neighbor-joining analysis. The tree is displayed in radial format to clarify the relative positions of putative clades of related isolates, which are indicated with gray shading. The clade numbers correspond to those in Fig. 2; note that a minority of isolates in clades 3 and 5 were separated in the neighbor-joining tree labeled 3a and 5a, respectively. Branches to singleton isolates without clade affinities are shown with dashed lines.
|
Isolates and clusters of notable provenance. Three isolates, AM2003/0025, AM2003/0032, and MYA-2669, the type strain of the putative species C. africana, were indistinguishable by MLST but clustered with very low similarity to the other isolates in Fig. 2, confirming that MYA-2669 differs substantially from other C. albicans isolates. All three of these indistinguishable isolates came from genital swabs.
Isolate SC5314, which was used for the first whole-genome sequencing project with C. albicans (21), fell within clade 1. Isolate WO-1, which has been extensively used in research into phenotypic switching and mating in C. albicans (45), appeared as a singleton between clades 8 and 4 in Fig. 2.
Within clade 1, a group of four isolatesIHEM20416, J942149, IHEM20414, and IHEM20417that were extremely similar by MLST was notable because all four were resistant to fluconazole, and the first three were also resistant to itraconazole. These three were
/
at the MTL (Table 1). All four of the isolates in this subcluster came from the mouths of AIDS patients in Germany. A similar subcluster of four closely related isolates within cluster 5, already noted above by the high significance of the bootstrap value for the cluster node, included three isolates resistant to both azole antifungal agents and three isolates that were a/a at the MTL. All four isolates came from the mouths of AIDS patients in France.
C. albicans clades: properties and clinical relevance. Table 2 summarizes the properties of the 12 individual C. albicans clades and of singleton isolates. The numbers of isolates in each of clades 5 to 12 was too small to permit reliable statistical analysis; these isolates were therefore grouped together as "minor clades" for the calculations detailed below.
|
View this table: [in a new window] |
TABLE 2. Main properties of clades of C. albicans isolates
|
There were 44 (10.6%) of 415 typeable isolates with homozygous alleles at the MTL. Just over half of these isolates (29) were in clade 1. The distribution of isolates homozygous at the MTL differed significantly between clades (by Fisher's exact test, P was 0.016; the test was applied to numbers of isolates that were a/
, a/a, or
/
[Table 1]). Minor clades 8 to 11 contained no examples of MTL homozygous isolates, while a high proportion of isolates in clades 5 to 7 were homozygous at the MTL (Table 2).
Except for clade 9, the majority of isolates in each clade came from sources within Europe (information on geographical origin was available for 415 of the 416 isolates in Fig. 2). The 6 isolates in clade 5 were all of European origin, and there were no North American isolates among the 28 that made up clades 10 to 12. Table 3 breaks down the geographical-origin data more finely, but with minor clades treated as a single group. Statistical analysis of the distribution of isolate sources by clade as shown in Table 3 indicated a highly significant association of clade with geographical source (Fisher's exact test, P < 0.001). Even when the data were analyzed with French and other EU isolates combined into a single group and African isolates included with "other" geographical sources, the differential distribution of sources within each clade remained statistically significant (P < 0.001). It should particularly be noted that, among the 45 isolates in clade 4, equivalent to the "South African" clade based on Ca3 fingerprinting (2), only 3 isolates came from the African continent. The majority of African isolates in the present study belonged to clade 2.
|
View this table: [in a new window] |
TABLE 3. Geographical origins of C. albicans isolates in different MLST clades
|
|
View this table: [in a new window] |
TABLE 4. Anatomical origins of C. albicans isolates in different MLST clades
|
For most of the clades, between 10% and 20% of the isolates were first cultured before 1990 (Table 2). In clades 1 to 3, 6, 7, and 9, isolates first cultured before 1990 were represented and most of the isolates were first cultured before the year 2000, while clades 4, 5, 8 and 10 to 12 bore evidence of possibly more recent origins (Table 2). The distribution of isolates of different ages differed significantly between the clades (by Fisher's exact test, P was 0.003; analysis done with clades 5 to 12 treated isolates as a single group, and dates of isolation segregated as pre-1990, 1990s, and 2000s).
A high proportion of the five isolates in clade 5 was resistant to azole antifungal agents, while clades 7 to 11 contained no isolates resistant to any of the three agents tested (Table 2). However, the overall differences in clade distributions of the 378 isolates with susceptibility data available were not significant for those that were resistant to fluconazole (Fisher's exact test, P = 0.289), itraconazole (P = 0.94), or flucytosine (P = 0.105). Other significant associations were, however, apparent for the antifungal resistant isolates. Fifteen (65%) of the 23 flucytosine-resistant isolates were members of clade 1. In every one of these isolates, flucytosine resistance was associated with a homozygous allele pair encoding cysteine at position 101 in the FUR1 gene. In a random selection of 14 flucytosine-susceptible isolates from clade 1, the equivalent position was either homozygous for arginine or heterozygous (cysteine/arginine). The eight flucytosine-resistant isolates from other clades and seven randomly selected susceptible isolates from other clades were all homozygous, encoding arginine at position 101, except for one flucytosine-resistant isolate for which no FUR1 sequence was obtained despite three attempts. No common mutation was found in the FUR1, FCY22, or FCA1 gene sequences of these non-clade-1, flucytosine-resistant isolates that might account for their resistance to the drug. Among the 35 fluconazole-resistant isolates, 17 (48.6%) were homozygous at the MTL. The same was true for 10 (41.7%) of the 24 itraconazole-resistant isolates and for 7 (21.2%) of the 23 flucytosine-resistant isolates. The prevalence of MTL homozygosity among the corresponding susceptible isolates was 7.3, 9.0, and 9.9%, respectively (by Fisher's exact test, P was <0.001, <0.001, and equal to 0.004, respectively). There was no significant association between itraconazole or flucytosine resistance and ABC type, but an unusually high proportion (40%) of 35 isolates resistant to fluconazole were ABC type B (Fisher's exact test, P = 0.013).
Twenty-eight of the 35 fluconazole-resistant isolates came from the oropharynx; all of these were from AIDS patients. For 109 oropharyngeal isolates, the HIV status of the patient source was known. The distribution of oral isolates from HIV-positive versus HIV-negative patients differed significantly between clades, as follows. For 48 HIV-negative patients, 26.8% were in clade 1, 17.9% in clade 2, 0.0% in clade 3, 17.9% in clade 4, and 23.2% in minor clades, and 14.3% were singletons. For 61 HIV-positive patients, the corresponding data were 50.7%, 9.0%, 9.0%, 6.0%, 16.4%, and 9.0% (Fisher's exact test, P < 0.001).
The individual data comparisons made by Fisher's exact test can be vulnerable to biases arising from uneven distributions of contributing variables other than the variable under analysis. A univariate ANOVA was done to determine the associations between properties and sources of isolates and their clade distribution. Animal isolates were excluded. The ANOVA was repeated with individual clades and singletons as the dependent variable and also with the minor clades included as a single variable. With individual clades analyzed, ABC type emerged as the most significant variable (P < 0.001) and anatomical source as the next most significant variable (P = 0.027). With minor clades grouped, the corresponding P values were <0.001 for ABC type and 0.065 for anatomical source. MTL status, fluconazole, itraconazole, and flucytosine resistance, decade of isolation, and geographical source were not significant clade-related variables by ANOVA.
|
|
|---|
Assignation of population structures and clade boundaries to a set of isolates cannot be done entirely objectively. The clade assignments in Fig. 2 take account of three types of information: the UPGMA dendrogram, results of eBURST analysis, and the results of Ca3 oligonucleotide fingerprinting for several of the isolates. Use of neighbor-joining statistics in place of pairwise-difference analyses further redistributed some isolates between clades (Fig. 3). We intend to evaluate a wider range of statistical approaches to clade designation, including Bayesian statistics, when our database of MLST results is at least twice as large as that reported in this study. For the time being, we consider pairwise-difference analyses to be the simplest and most useful approach for clade assignments in the population of isolates at hand.
The eBURST results were helpful in demarcating potential clades, but not definitive: many examples occurred where isolates that coclustered by Ca3 typing or by MLST analyzed by pair differences were designated as singletons by the eBURST test. It may be that the nature of sequence variation in C. albicans and the diploid genome of the species make eBURST analyses less relevant than they are to haploid bacteria (50). In common with others (4), we found an excellent correlation between UPGMA clusters for MLST and Ca3 fingerprinting data. Clades I, II, III, and SA as currently delimited by Ca3 typing (46) matched clades 1 to 4, respectively, as defined by MLST in the present study. Only MLST clade E failed to emerge as a single group of isolates as determined by MLST; the 5 representatives of Ca3 clade E split into two clusters by UPGMA analysis of MLST data (Fig. 2).
The very clear demarcation of geographical origin of strains determined by Ca3 typing as described previously (46) was not confirmed in the present study. Our panel of isolates was dominated by isolates of European origin, particularly by isolates from the United Kingdom, while the panel of isolates studied by Ca3 typing (36, 46) was dominated by isolates from South Africa. In common with the Ca3 studies, we found significant interclade differences in proportions of various geographical origins, but the relationships between clade and origin were not the same, almost certainly because of the large differences in proportions of isolates from different sources. Most notably, MLST clade 4, which corresponded to Ca3 clade SA, contained only three isolates of African origin, while most of the African isolates in our study clustered in clade 2. We have therefore chosen to refer to clades only by numbers until the MLST database contains a sufficiently large representation of isolates from all parts of the globe to allow more objective determination of possible geographic evolutionary origins of clades. The statistical significance of geographical variations between MLST clades determined by Fisher's exact test was not confirmed in the ANOVA, perhaps showing the effect of biases from differences in anatomical origin, date of isolation, etc. A larger MLST database will allow for statistical analyses of geographical origin based on, for example, only blood isolates or only oral isolates. It is to be expected that individual C. albicans clades may be enriched with isolates from a particular geographical source, and several previous studies by other DNA typing methods have concluded that geographical enrichment of C. albicans clades is a real phenomenon (10, 23, 44), even to the extent of regional differences within the American continent (36, 46).
However, there seems also to have been sufficient mixing of isolates, possibly because of international migrations, to blur the geographical distinctions, and a microsatellite-based typing survey found genetic homogeneity of isolates in four different locations in the United States (24).
One unequivocal conclusion that we can draw is that C. albicans isolates from animals are distributed differently from human isolates in the MLST population structure. The differences in the present study, which are self-evident even without the high statistical support, confirm a similarly large difference in distribution of strains from birds compared with human isolates by MLST done with a slightly different gene set (4).
The species C. africana was erected to delineate C. albicans isolates with atypical carbohydrate assimilation patterns (49). All the isolates with the characteristics of C. africana were cultured from male or female genitalia, and most came from patients in Africa. The C. africana type strain MYA-2669 (from the penis of a German patient) clustered indistinguishably with two United Kingdom vaginal isolates by MLST, so this type continues to be associated exclusively with isolates of genital origin. Although the three isolates were highly distinct from the remainder of the panel we typed by MLST in the UPGMA dendrogram (Fig. 2), they were less distinct in an unrooted neighbor-joining tree (Fig. 3). The fact that all seven of the gene fragments that we used for MLST easily gave PCR products with MYA-2669 contrasts with the situation that we encountered when we attempted to type C. parapsilosis isolates by MLST and ultimately proposed two new species to account for isolates that gave PCR products with only a minority of the gene set tested (47). In common with others (17), we think the case for separate species status for C. africana is not confirmed by the results of the present study.
Our data concerning the antifungal susceptibility of our panel of isolates confirms and extends existing information. Like Pujol et al. (37), we found that a majority of flucytosine-resistant isolates belonged to clade 1. Since almost all clade 1 isolates are also ABC type A, this conclusion corroborates the work of McCullough et al. (26), who also found a significantly higher proportion of flucytosine-resistant isolates among type A isolates. All clade 1 isolates with flucytosine resistance encoded arginine at position 101 in the FUR1 gene product, as previously reported by others (14, 18). This mutation will presumably retard or prevent the conversion of flucytosine to fungicidal 5-fluorouracil within the fungal cells. Flucytosine-susceptible isolates in clade 1 did not encode arginine at position 101, nor did resistant and susceptible isolates from other clades. This finding of an apparent clade-specific molecular resistance mechanism to an antifungal agent opens the possibility for development of a molecular diagnostic test to predict flucytosine resistance and raises the expectation that further research may elucidate clade-consistent molecular bases for resistance to other agents. Although we did not uncover any self-evident associations between C. albicans clades and resistance to azole antifungal agents, the numbers of resistant isolates available to us was relatively small, and we have not investigated the molecular basis of resistance in these isolates. It should be noted that all of the isolates tested against fluconazole, itraconazole, and flucytosine in this study were also tested for caspofungin susceptibility, but none showed resistance to this agent, so this result was not detailed.
It is notable that most of the azole-resistant isolates of C. albicans in our panel were oral isolates from HIV-positive patients. A recent, very large, global surveillance study shows a very low prevalence of fluconazole resistance among C. albicans isolates since the introduction of highly active antiretroviral therapy (HAART) in 1996 (34). In the absence of a substantial future failure of efficacy of HAART, it is likely to prove difficult to obtain substantial numbers of azole-resistant clinical isolates of C. albicans for further investigation. The strong association that we saw between homozygosity at the C. albicans mating type locus and fluconazole resistance was described previously (40). Our data extend this association to itraconazole and even to flucytosine, which exerts its antifungal effects by a mechanism different from that of the azoles. Previous experimentation showed that engineering homozygosity at the C. albicans MTL does not lead per se to the development of fluconazole resistance (35). The association that we describe between MTL homozygosity and resistance to several antifungals therefore suggests that a common event, perhaps a change in expression of a transcription factor (11), is responsible both for resistance development and MTL homozygosity in wild-type isolates.
An important consideration to emerge from the determination of C. albicans population structure concerns the choice of strains used for experimental work with the fungus. To date, almost all molecular research with C. albicans has been done with the single clade 1 isolate SC5314 and its Ura- derivatives, in the absence of positive selectable markers for molecular genetic work. Now that at least one positive selectable marker has been developed for C. albicans (52), our delineation of the four major clades of the species, and a preliminary demonstration of potential clade-related differences in the propensity to cause infection, form a basis for rational selection of a wider diversity of strains for future research in the field.
We gratefully acknowledge the many colleagues who have supplied us over many years with the isolates we submitted to MLST. We are particularly grateful to E. M. Johnson, C. C. Kibbler, N. Nolard, F. Symoens (at the IHEM collection), J. B. Anderson, D. R. Soll, and J. Schmid for supplying us with well-pedigreed isolates for this MLST project and to Marie-Elisabeth Bougnoux and Christophe d'Enfert for DNA from their panel of isolates.
Supplemental material for this article can be found at http://jcm.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»