Previous Article | Next Article ![]()
Journal of Clinical Microbiology, January 2006, p. 77-84, Vol. 44, No. 1
0095-1137/06/$08.00+0 doi:10.1128/JCM.44.1.77-84.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
and
P. P. Bosshard*
Institut für Medizinische Mikrobiologie, Universität Zürich, 8006 Zürich, Switzerland
Received 28 June 2005/ Returned for modification 15 August 2005/ Accepted 12 October 2005
|
|
|---|
|
|
|---|
For molecular identification, we have chosen sequence analysis since this procedure is simple and can be fully automated. In addition, interpretation of nucleic acid sequences is straightforward and does not depend on too much expertise compared to morphological analyses. As target, we have chosen the internal transcribed spacer (ITS) region, which is located between the highly conserved genes coding for 18S and 28S rRNA. The ITS encompasses the two noncoding regions ITS1 and ITS2, which are separated by the highly conserved 5.8S rRNA gene (20). The ITS1 and ITS2 regions are more variable than the adjacent rRNA gene sequences and thus promise a better separation of closely related species. As the inspection of yeast ITS sequences which are available in the public database GenBank (NCBI) suggested that some entries are incorrect and because certain medically relevant species are not included, we decided to establish an in-house database. Since the number of known yeast species is enormous, we restricted our database to species occurring in the medical diagnostic laboratory.
In this study, we compared sequence-based identification with conventional identification. Based on these results, we established an algorithm for the effective identification of yeasts in the diagnostic laboratory.
|
|
|---|
DNA extraction. Yeast strains were cultivated on Sabouraud agar at room temperature. Two loops full of fungal culture were collected and digested at 37°C for 2 h with 30 U of Lyticase (Sigma-Aldrich Chemie GmbH, Schnelldorf, Germany) in 200 µl digestion buffer (50 mM Tris HCl, 1 mM EDTA, pH 8.0). Alkaline lysis was performed with the addition of 10 µl 1 M NaOH and 10 µl 10% sodium dodecyl sulfate and incubation for 10 min at 95°C. Following neutralization with 10 µl 1 M HCl, DNA was purified with the QIAamp DNA blood mini kit (QIAGEN, Basel, Switzerland) according to the instructions of the manufacturer. Briefly, 200 µl of AL buffer from the kit was added and incubated for 10 min at 70°C. Then, 200 µl of pure ethanol was added, and the whole mixture was loaded on a column. The column was washed twice with the washing buffers from the kit, and the DNA was finally eluted from the column in 100 µl H2O.
Amplification and sequencing. Amplification was performed in a LightCyler (Roche, Rotkreuz, Switzerland) in a volume of 20 µl containing 3 mM MgCl2, 0.5 µM each of primers ITS1 and ITS4 (20), 2 µl of SYBR green master mix, and 2 µl of eluted DNA. Cycling parameters included an initial denaturation for 10 min at 95°C and 50 cycles of 1 sec at 95°C, 5 sec at 53°C, and 40 sec at 72°C. The amplification products were purified with the QIAquick PCR purification kit (QIAGEN, Basel, Switzerland). For construction of the database, sequences were generated with forward primer ITS1 and with backward primer ITS4 and, if necessary, also with forward primer ITS3 and backward primer ITS2, both located in the 5.8S rRNA gene (20). For the clinical isolates, strains were sequenced with forward primer ITS1; in case the forward sequence was not readable, backward primer ITS4 was used for generating the sequence. The BigDye kit (Applied Biosystems, Rotkreuz, Switzerland) and an automated DNA sequencer (ABI Prism 310 genetic analyzer; Applied Biosystems) were used for sequencing.
Construction of the database. A list covering the majority of the yeasts occurring in the medical diagnostic laboratory was defined. This list covers 48 species from 8 genera and includes all medically important and the most frequent colonizing yeasts (Table 1). For a total of 90 strains (at least one isolate for each species), the ITS region was amplified and both strands were sequenced and assembled to compose the ITS region. The strains originated from quality controls (United Kingdom National External Quality Assessment Service, Sheffield, United Kingdom), patient isolates, and the Centraalbureau voor Schimmelcultures (Baarn, The Netherlands) strain collection. The sequences obtained were analyzed together with sequences available in GenBank (NCBI). The SeqWeb version 2.1.0 (web interface to a core set of sequence analysis programs in the GCG Wisconsin package; Accelrys, Cambridge, United Kingdom) evolution program was used to define intra- and interspecies homologies. Sequences that were confirmed by at least one additional independent sequence of the same species (i.e., sequence homology greater than 95%) and that fit well into the similarity tree calculated from the sequence alignment were included in the in-house database. The resulting yeast ITS database is stored in the SmartGene IDNS (Lausanne, Switzerland).
|
View this table: [in a new window] |
TABLE 1. Yeast species included in the ITS database, intraspecies homology and number of analyzed sequences
|
Between June and November 2003, all clinical isolates subjected to ID32C were also investigated by sequence determination. The resulting sequence was visually compared with the electropherogram, corrected, and analyzed using the SmartGene IDNS custom platform with the in-house yeast ITS database. Undetermined nucleotides (designated N's) in the reference sequence were counted as mismatches. The isolate was assigned to a species if the sequence revealed a homology of
97% (over the whole length of the sequence) to a reference sequence (depending on the intraspecies homology), and if the next species showed less than 95% homology over the whole length of the sequence. If the comparison with the in-house database did not allow species assignment, the sequence was compared with GenBank using the FASTA algorithm of the GCG Wisconsin package (Accelrys, Cambridge, United Kingdom). The results of the sequence analysis and the ID32C test were compared, and discrepant results were resolved by additional morphological criteria.
|
|
|---|
97%. Most species included in our database can be clearly distinguished from each other, as shown in Fig. 1 and in detail in Table 2. Candida stellatoidea is regarded as a synonym of C. albicans, since these two species cannot be discriminated from each other by the ITS sequence (10). Candida africana is included as a separate species, although it is difficult to discriminate from C. albicans with an interspecies sequence homology of 99.3 to 99.8%, which is almost identical to the intraspecies homology of C. albicans (99.3 to 100.0%). For identification of C. albicans and C. africana, the first reference sequence with no mismatch was accepted as the correct identification. Cryptococcus laurentii was split into three genogroups according to the homology of the ITS region; the genogroups showed high intra- but poor intergroup homologies (Table 1 and Fig. 2).
![]() View larger version (40K): [in a new window] |
FIG. 1. Similarity tree calculated from the alignment of yeast species included in the ITS database (one ITS sequence per species), using SeqWeb version 2.1.0.
|
|
View this table: [in a new window] |
TABLE 2. Intra- and interspecies homology of some Candida species
|
![]() View larger version (28K): [in a new window] |
FIG. 2. Similarity tree calculated from the alignment of Cryptococcus ITS sequences using SeqWeb version 2.1.0. The sequences of Cryptococcus laurentii cluster in three genogroups called group Ia, group Ib, and group II. Sequences with the prefix AB or AF were extracted from GenBank (NCBI); sequences with the prefix IMM and marked with a star were obtained from the strain collection in-house.
|
![]() View larger version (22K): [in a new window] |
FIG. 3. Similarity tree calculated from the alignment of Trichosporon ITS sequences using SeqWeb version 2.1.0. The sequences of Trichosporon asahii, T. asteroides, T. inkin and T. ovoides cannot be clearly distinguished from each other, thus, they are combined in the Trichosporon group A. Sequences with the prefix AB or AF were extracted from GenBank (NCBI); sequences with the prefix IMM and marked with a star were obtained from the strain collection in-house.
|
During the 6-month period, 113 isolates could not be identified by phenotypic criteria on CHROMagar Candida and rice agar and were therefore included in this study.
Identification with ID32C. Using the ID32C system, 70.8% (80 of 113) of the isolates studied were identified to the species level without additional tests. An additional 19.5% (22 of 113) of the isolates were identified by additional tests as recommended by the manufacturer. These additional tests included growth at different temperatures (e.g., 35°C or 40°C), presence of pseudohyphae, and positive reaction for hydrolysis of esculin or urea.
Identification by sequencing. By sequence determination and comparison with the in-house sequence database, 98% (111 of 113) of the strains were identified to the species level (97.0 to 100.0% homology to the best-matching reference sequence). Only two strains could not be identified due to missing reference sequences in the databases (less than 90% homology to the best match).
Comparison of ID32C and sequencing. By comparing the two methods, 85.8% (97 of 113) of the results were identical (Table 3). Of these 97 isolates, 96 strains, which were assigned to a species by ID32C, were confirmed by sequencing. One strain could not be identified with either method (Table 4).
|
View this table: [in a new window] |
TABLE 3. Comparison of ID32C and sequence analysis by identification level for 113 isolates
|
|
View this table: [in a new window] |
TABLE 4. Concordant results for ID32C and sequence analysis
|
|
View this table: [in a new window] |
TABLE 5. Discrepant results by identification type
|
|
View this table: [in a new window] |
TABLE 6. Final comparison of identification by ID32C versus nucleic acid sequencing
|
|
|
|---|
The reliability of identification by sequencing not only depends on the length of the sequence determined, but also on the quality and availability of reference sequences. In GenBank, some species cannot be distinguished from others of the same genus by the ITS sequence. It is not always obvious whether failure of discrimination is a result of mislabeling, of close relationship, or of erroneous taxonomic separation. The definition of a species in mycology is complicated (15). One species may have several names given by different mycologists or due to reassignment of a species based on sequence analysis. Today, morphological characteristics have to be supported by molecular analysis before definition of a new species is approved (e.g., see reference 16). High sequence similarity of the ITS regions may be evidence, but it does not provide definite proof for the identity of two taxa. However, for a large majority of species, the similarity tree of the ITS region is identical to the phylogenetic tree, allowing good identification (14).
To avoid the aforementioned problems of identification with GenBank, we generated an in-house ITS database covering medically relevant yeasts and including only confirmed sequences. To evaluate the quality of our database and the efficiency of phenotypic identification, we compared sequencing with biochemical identification by the ID32C system. In our study, correct species identification was achieved in 98% of the strains by sequence analysis and in 85% by ID32C. The ID32C system misidentified 7% of the isolates. Sequencing identified those isolates mainly as species not included in the ID32C database and biochemically similar to a species included in the ID32C database, e.g., C. rugosa and C. pararugosa or C. utilis and C. fabianii.
Most of the strains included in our study belong to the genus Candida sp. (94 of 113), 4 isolates belong to Trichosporon sp., 13 isolates belong to Saccharomyces cerevisiae, and 2 strains could not be identified. Other yeast genera such as Malassezia sp. or Rhodotorula sp. are rare in the diagnostic mycological laboratory (19); these species are, however, included in the in-house database. The vast majority of isolates in the clinical laboratory belong to the species Candida albicans, C. glabrata, C. krusei, and C. tropicalis, which can be reliably identified by a combination of morphology on CHROMagar Candida and rice agar, as confirmed by the sequencing of strains with typical phenotypic characteristics.
Results of sequence analysis can be expected within two working days. The results of the ID32C identification system are available after 24 h of incubation, but readings need to be confirmed after 48 h since some strains may show delayed growth. Weak growth with certain substrates makes the resulting identification dependent on the duration of incubation, on individual interpretation, and on the possibility of confirmation by morphology. Sequence analysis, on the other hand, leads to unambiguous identification, but comprises several steps (DNA extraction, amplification, and sequence analysis) which require more hands-on time than the preparation and reading of an ID32C test strip.
Based on the results of this study, we have implemented the following algorithm in our diagnostic mycology laboratory (Fig. 4). Yeast strains are first grown on chromogenic agar and on rice agar. Candida albicans, C. glabrata, C. krusei, and C. tropicalis, in general, can be reliably identified by their morphology (>90% of clinical isolates), although the misidentification of C. dubliniensis as C. albicans is possible. Strains with no specific morphology (5 to 10% of clinical isolates) are subjected to biochemical analysis. If no unambiguous species identification is obtained (about 1% of clinical isolates) and if a correct species identification is of concern, we proceed to sequence determination of the ITS region. This algorithm implements both the cost-effectiveness and precision of analysis that are required in the diagnostic laboratory.
![]() View larger version (27K): [in a new window] |
FIG. 4. Diagnostic algorithm. The numbers in bracket are based on applying this algorithm to the study isolates. a, combination of chromogenic agar and rice agar.
|
Present address: Bio-Analytica AG, 6000 Luzern 6, Switzerland. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»