| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Previous Article | Next Article ![]()
Journal of Clinical Microbiology, March 2009, p. 674-679, Vol. 47, No. 3
0095-1137/09/$08.00+0 doi:10.1128/JCM.01028-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
Institute of Diagnostic Virology, Friedrich-Loeffler-Institut, Südufer 10, Greifswald-Insel Riems 17493, Germany
Received 29 May 2008/ Returned for modification 2 September 2008/ Accepted 14 December 2008
|
|
|---|
|
|
|---|
In contrast, after proper sample preparation, which can be done within 3.5 h, followed by overnight clonal amplification of the library fragments, the new high-throughput pyrosequencing technique (5) available by use of the Genome Sequencer (GS) FLX instrument (Roche, Mannheim, Germany) generates large amounts of sequence information in a single run, thereby providing several repetitions of sequencing in a single experiment. The key features of this new technology are a sample preparation technique that does not rely on cloning for the generation of shotgun sequencing libraries; bead-bound clonal amplification of the library fragments in a single emulsion PCR (emPCR) run, which allows parallel amplification of the whole library; and massively parallel sequencing of the clonally amplified bead-bound shotgun library fragments in the wells of a PicoTiterPlate (PTP) that yields one sequencing read per bead. During preparation of the single-stranded template DNA (sstDNA) library, the target DNA is randomly fragmented and special adapters are ligated to the DNA fragments. The adapters serve as priming sites for PCR amplification of the library, thereby eliminating the need for sequence knowledge prior to sequencing. Via one of these adapters, the library fragments are bound to beads and are then clonally amplified in an emPCR. In the emPCR, the droplets of the water in an oil emulsion are microreactors containing the PCR reagents and one bead-bound DNA fragment, which thereby allows parallel clonal amplification of the complete sstDNA library in a single PCR. Subsequently, the beads carrying the amplified library are recovered from the emulsion, prepared for sequencing, and loaded into a PTP. This PTP has approximately 1.6 million wells, each of which holds only one bead. Thus, during loading, the beads carrying the amplified library are physically separated, which allows the sequencing of every bead-bound amplicon in a single sequencing read. Thus, up to 100 Mb of raw sequence data can be obtained in a single instrument run by sequencing hundreds of thousands of bead-bound DNA fragments in parallel. This enables sequencing of complete AIV genomes with a great deal of reliability within 3 days after sample receipt.
|
|
|---|
RNA extraction. Total RNA was isolated from allantoic fluid (140 µl) with a QIAamp viral RNA minikit (Qiagen, Hilden, Germany), according to the manufacturer's instructions.
PCR, library preparation, sequencing, and sequence assembly. Figure 1 provides an overview of the complete procedure from RNA extraction to sequencing and an approximate timeline. DNA was generated from genomic RNA diluted 10-fold in RNA-safe buffer (50 ng/µl carrier RNA, 0.05% Tween 20, 0.05% sodium azide in RNase-free water) (4) by reverse transcription-PCR (RT-PCR) with a SuperScript III one-step RT-PCR system with Platinum Taq high-fidelity polymerase (Invitrogen, Carlsbad, CA). For every H5N1 HPAIV genome, 11 PCRs were set up in duplicate with the primers listed in Table 1. Large segments 1, 2, and 3 were amplified as two fragments each. To allow for proper sequence assembly, the PCR products for each of these genome segments overlap approximately 320, 130, and 180 bp, respectively. Five microliters of the dilute template and 1 µl of each primer (10 µM; final concentration in the PCR mixture, 0.4 µM) were mixed and denatured for 2 min at 95°C. Immediately after denaturation, the samples were frozen in liquid nitrogen, and subsequently, 18 µl of the PCR master mixture (12.5 µl 2x reaction mixture, 1 µl Superscript III reverse transcriptase-Platinum Taq high-fidelity polymerase enzyme mixture, 4.5 µl RNase- and DNase-free distilled water) was added. PCR was performed in a model 2720 thermal cycler (Applied Biosystems, Foster City, CA). The cycling parameters were 30 min at 55°C (reverse transcription); 2 min at 94°C (inactivation of reverse transcriptase and activation Taq polymerase); 40 cycles of 15 s at 94°C (denaturation), 30 s at 55°C (annealing), and 3 min at 68°C (elongation); and 1 cycle of a hold for 5 min at 68°C (final synthesis). The PCR products were gel purified with a QIAquick gel extraction kit (Qiagen). The purified DNA was quantified with a model 1000 spectrophotometer (NanoDrop Technologies, Inc., Wilmington, DE) in the double-stranded DNA mode and was then pooled in equimolar amounts. This DNA pool (3 to 5 µg) was transformed to sstDNA libraries by using a GS DNA library preparation kit (Roche), according to the manufacturer's instructions, but the Ampure bead purification step was omitted. Instead, the fragmented DNA was cleaned with one column from a MinElute PCR purification kit (Qiagen). The purified fragmented DNA was filtered through a PCR Kleen spin column (Bio-Rad Laboratories, Munich, Germany) and was finally concentrated with a MinElute PCR purification kit. The sstDNA libraries were subjected to duplicate emPCRs with the GS emPCR kit I (Roche), according to the manufacturer's instructions, with 0.5 and 1 copy per bead. After bead recovery and enrichment, all beads available from the duplicate emPCRs (up to a maximum of 45,000 beads) were loaded and sequenced in one small lane (totaling two lanes per library with two emPCRs per lane) of the PTP by using a GS SR70 sequencing kit (Roche) and the appropriate instrument run protocol. The resulting sequencing reads were sorted according to the genome segments to which they related and were subsequently assembled into one contig (i.e., a set of overlapping sequencing reads) per segment by using the GS FLX sequence assembly software newbler (version 1.1.03.24; Roche). During the assembly, the primer sequences were subtracted from the raw data.
![]() View larger version (34K): [in a new window] |
FIG. 1. Work flow and timeline for the generation of sstDNA libraries from samples containing H5N1 AIV and for sequencing with the GS FLX instrument. The image displays the main steps for the sequencing of AIV genomes as separate filled arrows labeled with a short descriptor. Blue arrows, steps of the protocol established or modified in the present study; gray arrows, standard procedures of the sequencing protocol with the GS FLX instrument. Every step is accompanied by an open arrow labeled with the approximate duration of the respective experimental procedure. The vertical open arrow displays the overall timeline from sample receipt to retrieval of the final sequence and first-level data analyses. For details on the experimental procedures, refer to the text (blue steps) and the manufacturer's documentation (gray steps).
|
|
View this table: [in a new window] |
TABLE 1. Primers used for RT-PCR amplification of H5N1 HPAIV genomic segments
|
Nucleotide sequence accession numbers. The nucleotide sequences obtained in this study are available from the EMBL/GenBank/DDBJ database under accession numbers FM165519 to FM165526 (isolate R1372/2007) and FM165527 to FM165534 (isolate R1959/2007).
|
|
|---|
40 (i.e., a probability of
99.99% that the individual bases were correctly identified). Sequencing of isolate R747/2006 yielded 7,673 reads, with an average read length of 98.8 bases, totaling 0.76 Mb of raw sequence. The assembly resulted in 13,154 bases with a median depth of 45, and 99.57% of the bases had a quality score of
40. These assembled unrevised sequences covered 98.99% of the previously reported coding sequences and had 99.94% identity with the reference sequence. Table 2 summarizes all data regarding sequence quality. Table 3 shows details of the comparisons of the coding portions of the new and the previously published sequences. No differences between the coding sequences for both the M and the NS genes were detected for either isolate R65/2006 or isolate R747/2006. Moreover, no deletions in any of the raw sequences were found. All insertions traced were part of homopolymers that were overcalled, i.e., when the assembler software identified more bases than the number actually present in the sequence. In most cases, these inserted bases had a base quality score of <20, i.e., a probability of <99% that the individual bases were correctly identified. On the contrary, the majority of all bases had a quality score of
40, which corresponds to a probability of
99.99% that the individual bases were correctly identified (Table 2). Therefore, it must be assumed that the insertions are not true insertions but sequencing artifacts. In contrast to the insertions, the base quality score for the substitutions, with two exceptions, was 64 (a probability of 99.99996% that the individual bases were correctly identified, which is the highest possible score). Only half of the base substitutions detected caused amino acid substitutions; the remaining half were silent substitutions. One possible cause for these deviations of the new sequence data from the previously reported sequence data might be the fact that the virus had been passaged between the sequencing experiments. |
View this table: [in a new window] |
TABLE 2. Summary of data characterizing the quantity and quality of the sequences obtained
|
|
View this table: [in a new window] |
TABLE 3. Details of the comparison of the coding portion of the new GS FLX instrument-generated sequences for isolates R65/2006 and R747/2006 and the previously reported sequences for these isolates as a referencea
|
Determination of PCR sensitivity. In order to assess the sensitivity of our approach, template RNA from isolates R65/2006 and R747/2006 was serially diluted in RNA isolated from AIV-negative sample material. These template dilutions were used for both the generation of all PCR products for sequencing and quantitation of the approximate number of genome copies by real-time RT-PCR by use of a copy number standard. To this end, the dilute samples were examined by quantitative RT-PCR directed against the M segment (8) . Table 4 summarizes the data from the dilution series experiment. Here, the PCRs for fragments 1B and 3A turned out to be the least sensitive. Most dilute samples which allowed the production of sufficient amounts of fragments 1B and 3A had a threshold cycle (CT) value of approximately 26. This CT value corresponds to roughly 5.6 x 103 copies/µl of RNA extract. Because during the extraction RNA from 140 µl of allantoic fluid was eluted in 50 µl elution buffer, our procedure enables the sequencing of complete AIV genomes from 6 µl of total RNA isolated from allantoic fluid, in which the concentration is about 2 x 103 genome copies per µl. This would allow the sequencing of complete AIV genomes from 6 µl of RNA extracted from swabs containing approximately 3.0 x 105 genome copies when the RNA is finally eluted in 50 µl of buffer.
|
View this table: [in a new window] |
TABLE 4. Summary of results of dilution series experiment for the determination of PCR sensitivity
|
|
View this table: [in a new window] |
TABLE 5. Data characterizing for each segment the quantity and quality of the sequences obtained for isolates R1959/2007 and R1372/2007
|
![]() View larger version (23K): [in a new window] |
FIG. 2. Comparison of the sequence depth of segment 4 from both isolate R1372/2007 and isolate R1959/2007. The sequence depth (ordinate) plotted against the nucleotide position of the mRNA (abscissa) within the sequence of segment 4 is shown. The depth is the number of times that every nucleotide in the final sequence was sequenced, i.e., how many independent reads contributed information to the base called at the respective position.
|
This project was funded by the Federal Ministry of Food, Agriculture and Consumer Protection (BMELV), Germany (project no. FSI 1-1.1.2.).
Published ahead of print on 24 December 2008. ![]()
This paper is dedicated to the memory of Daniela Hase. ![]()
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»