Previous Article | Next Article ![]()
Journal of Clinical Microbiology, October 2005, p. 5179-5186, Vol. 43, No. 10
0095-1137/05/$08.00+0 doi:10.1128/JCM.43.10.5179-5186.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Laboratoire de Virologie, Reference Laboratory for Enteric Viruses, Centre Hospitalier Universitaire, 21079 Dijon,1 Microbiologie Médicale et Moléculaire, UFRs Médecine et Pharmacie, 21079 Dijon,2 Laboratoire de Microbiologie, IFREMER, BP 21105, 44311 Nantes Cedex 03, France3
Received 16 December 2004/ Returned for modification 31 January 2005/ Accepted 20 June 2005
|
|
|---|
|
|
|---|
Human calciviruses (HuCVs) have been shown to be the most common cause of viral gastroenteritis outbreaks and sporadic community cases in all age groups in industrialized countries (5-8, 31). HuCVs are single-stranded, positive-sense RNA viruses of the family Caliciviridae. Complete nucleotide sequencing of several caliciviruses has shown that their genome is organized in three open reading frames (ORFs) (20, 27). ORF1 encodes a large nonstructural polyprotein that is cleaved into several viral proteins, among which is an RNA-dependent RNA polymerase. ORF2 encodes the major capsid protein VP1, and ORF3 encodes a minor structural protein VP2 (13). Complete genomic sequences are available for only a few strains, but numerous partial sequencings of either part of the RNA polymerase gene or part of the capsid gene have been done independently.
HuCVs are phylogenetically divided into two genera, Norovirus and Sapovirus (13). Noroviruses are divided into five distinct genogroups, and each genogroup can be divided into several genotypes. The classification of caliciviruses in genogroups and genotypes is based on the sequence of the capsid, partial (2, 39) or complete (12). According to a classification adapted from Green et al. (12) and Ando et al. (2), genogroup I (GI), GII, and GIII are composed of seven, eight, and two genotypes, respectively. GIV is represented by one strain (41), as is genogroup V (23). This classification is in constant evolution with the discovery of new strains. Recently, Kageyama et al. proposed a classification with 31 genotypes (14 for GI and 17 for GII) (22). At the same time, Vinje et al. identified 15 genotypes in the GII (40). Also, Zheng et al. (44) defined 8 and 17 genotypes in the GI and GII, respectively.
For most strains, analysis of the ORF1 region, used by most laboratories for diagnostic reverse transcription (RT)-PCR, correlates with the clustering based on the ORF2. However, several studies have shown that strains group within different clusters, depending on whether the polymerase or capsid gene is sequenced, suggesting that recombination may have occurred. This is the case for the strains Snow Mountain (16) and Wortley and Seacroft (39). Jiang et al. (19) characterized a first naturally occurring recombinant, Arg320, by cloning and sequencing a long fragment of the genome from the RNA polymerase region to the 3' end of the genome. This recombinant would derive from two parent strains of genogroup II, Lordsdale virus for the RNA polymerase and Mexico virus for the capsid. Katayama et al. (24) described two recombinants, WUG1 of genogroup I and Saitama U1 of genogroup II. Recently, Snow Mountain virus was confirmed to be a recombinant norovirus (28). Also, Hansman et al. (15) characterized Mc37 as a new recombinant by complete genome sequencing.
We have isolated and characterized by cloning and sequencing several strains of noroviruses that correspond to different recombinants. GGIIb variants were detected for the first time in France in August 2000 (F. Bon, P. Pothier, C. Hemery, M. Cournot, C. Castor, H. de Valk, P. Gourier-Frery, P. Benhamida, R. Roques, L. Le Coustumier, F. Villeneuve, P. Megraud, P. Le Cann, E. Kohli, and A. Gallay, 21st Réunion Interdisciplinaire de Chimiothérapie Anti-infectieuse, Paris, France, 2001) and then in several European countries (4, 17, 33, 38). Their RNA polymerases do not correspond to any known genotype, but their capsid genes are close to the capsid gene of viruses of three different genotypes, Hawaii, Toronto, and Snow Mountain, indicating that recombinations occurred. This paper reports on the characterization of three other strains of HuCV that are probable recombinants. The sequencing of long genomic fragments containing a portion of the RNA polymerase gene and a portion of the capsid gene shows that these two portions of the genome present homologies with distinct genotypes.
|
|
|---|
RNA extraction. Viral RNA was extracted from 10% stool suspensions in phosphate-buffered saline (pH 7.5) with a QIAamp viral RNA kit (QIAGEN, Hilden, Germany), according to the manufacturer's instructions, and was either reverse transcribed immediately or stored at 40°C.
RT-PCR. RT-PCR was performed to amplify sequences of the capsid gene in the 55 stool samples by using murine leukemia virus reverse transcriptase and AmpliTaq DNA polymerase purchased from Applied Biosystems (Foster City, Calif.). Both RT and PCR were carried out with the reaction mixture preparations recommended by the manufacturer. Primer set SRI1 and SRI2 (14) and primer set Mon381 and Mon383 (32) were used to amplify the sequences of the capsid gene of noroviruses genogroup I and II, respectively, giving amplicons of 316 and 322 bp, respectively. The cycling conditions were as follows: one cycle of reverse transcription at 42°C for 15 min and, for PCR, denaturation for 2 min at 94°C; 40 amplification cycles with denaturation for 30 s at 94°C, annealing for 1 min at 50°C, and extension for 1 min at 72°C; and a final cycle of incubation at 72°C for 15 min.
For some strains, another RT-PCR was performed with primers NI (11) and Mon383 to amplify a long genomic fragment that included the junction between the RNA polymerase and the capsid genes and with the Superscript II RNase H- reverse transcriptase (Invitrogen Corporation, Carlsbad, Calif.) and the Expand High Fidelity PCR system (Roche Molecular Biochemicals, Mannheim, Germany), according to the manufacturer's instructions for both reaction mixture preparations. The PCR cycle conditions used were the same as those used for amplification of the capsid gene.
Cloning and sequencing of the RT-PCR products. RT-PCR-amplified DNA fragments of the expected sizes, as determined by agarose gel electrophoresis, were excised from the gel, extracted, and purified with a Qiaex II gel extraction kit (QIAGEN). Capsid gene fragments were sequenced directly after purification.
The PCR products obtained with primer set NI and Mon383 (1,189 bp) were cloned into the pGEM-T Easy Vector System (Promega Corporation, Madison, Wis.), according to the manufacturer's instructions.
Sequencing of the PCR products or plasmids was performed with the same primers used for amplification by using the ABI Prism Big Dye Terminator cycle sequencing ready reaction kit on an automated sequencer (model 373A DNA sequencing system) (both from Applera Corporation, Foster City, Calif.). For the long fragments cloned into the plasmid, if necessary the sequence was determined by using multiple steps: after determination of the end sequences of the cDNA with the primers NI and Mon383, the remaining sequences were determined by genomic walking with primers based upon newly obtained sequences adjacent to those regions. All sequences were confirmed by sequencing at least twice.
Consensus sequences were produced by overlapping the sequences of the RNA polymerase genes and the cloned genes.
Characterization and phylogenetic analyses. Alignments of the sequences with the sequences in the GenBank library were carried out by using Fasta version 3.3t06 and GCG software, available in the national service Infobiogen (http://www.infobiogen.fr). For the RNA polymerase, alignments were also performed with reference strains available in the database of the European Food-Borne Viruses Network (25, 30).
For the phylogenetic analyses, sequences were first aligned by using the CLUSTAL-W program in the PHYLIP format. The neighbor-joining method was used to create phylogenetic trees by using the different alignments.
Detection of recombination. Detection of recombination was achieved by using similarity plots in SimPlot, as described previously (29), as well as a statistical analysis based on Sawyer's test (S. Sawyer, 1999, http://www.math.wustl.edu/sawyer). SimPlot calculates and plots the percent identity of a query sequence to a panel of reference sequences versus the position in a sliding window, which is moved across the alignment in steps. Sawyer's test looks for aligned segments for which a pair of sequences is sufficiently similar to be suggestive of gene conversion.
Nucleotide sequence accession numbers. The nucleotide sequences determined in this study have been deposited in GenBank under accession numbers AY682550, AY682551, and AY682552 for samples S63, L23, and E3, respectively.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Strains presenting low percentages of nucleotide sequence identity with those of reference strains for the RNA polymerase genea
|
Phylogenetic analysis of partial RNA polymerase and capsid sequences. Figure 1 represents the phylogenetic trees for six strains, strains S63, L23, E3, E23, GEA471, and S55. The RNA polymerases of strains S63, L23, and E3 are related to Melksham, Hillingdon, and Bristol viruses, respectively, although they could not be clearly assigned to these genotypes; and their capsid genes can be classified into the Hillingdon, Hawaii, and Melksham genotypes, respectively. Also, for E23, GEA471, and S55, the phylogenetic trees do not give the same classification when the trees are based on the RNA polymerase and the capsid. These results confirm that for each of these strains the RNA polymerase gene and the capsid gene do not belong to the same genotype, thus suggesting the possibility of recombination.
![]() View larger version (16K): [in a new window] |
FIG. 1. Phylogenetic analysis of partial (A) RNA polymerase (145-bp) and (B) capsid (277-bp) sequences. The strains characterized in our laboratory are represented in boldface. Strains SH96 and E293 were characterized in another study; they present a good correlation between the RNA polymerase and the capsid genes, with SH96 being assigned to the Hawaii genotype and E293 being assigned to the Bristol genotype. The GenBank accession numbers for the calicivirus reference strains are Amsterdam/1998/NL, AF195848; Chiba407/1987/JP, AB042808; Hesse3/1997/GE, AF093797; Leeds/1990/UK, AJ277608; Musgrove/1989/UK, AJ277614; Norwalk/1968/US, M87661; Seacroft/1990/UK, AJ277620; Sindlesham/1995/UK, AJ277615; and Winchester/1994/UK, AJ277609. The others are given in Table 1. *, sequences not available in GenBank (the sequences were kindly provided by Marion Koopmans and Harry Vennema).
|
For strains S63, L23, and E3, the long sequences corresponding to ORF1-ORF2 fragments were analyzed with SimPlot software by comparing the sequences of strains representing each genotype of GII. Figure 2 shows the results obtained by comparison with the Melksham, Hawaii, Bristol, and Hillingdon sequences as the reference sequences.
![]() View larger version (20K): [in a new window] |
FIG. 2. SimPlot analysis of partial RNA polymerase and capsid gene sequences of recombinant strains. Window size, 200 bp; step, 20 bp. The query sequences for panels A, B, and C are S63, L23, and E3, respectively. The vertical axis indicates the nucleotide identities between the query strain and the four reference strains, expressed as percentages. The horizontal axis indicates the nucleotide positions (in base pairs; corresponding to base pairs 4233 to 5660, 4290 to 5660, and 4233 to 5648 of the Hawaii strain with GenBank accession number U07611 for S63, L23, and E3, respectively). Vertical lines indicate the beginning of ORF2 (continuous vertical line) and putative recombination breakpoints identified by Sawyer's test at the ORF1-ORF2 junction (dotted vertical line). Gray line, Hillingdon; dashed line, Melksham genotype; bold line, Bristol; dotted line, Hawaii.
|
For L23, the first part of the sequence, which corresponds to ORF1, between residues 1 and 815 (residues 4290 to 5104 in Hawaii virus) is closer to Hillingdon than to other reference strains but presents only 85% similarity, on average, which is not high enough for the strains to be considered of the same genotype; the second part corresponding to ORF2 and starting at position 796 (position 5085 in Hawaii virus) is closer to Hawaii strain essentially at the 5' end. It must be noted that the decrease in the percentage of similarity with Hawaii virus starting at about position 900 in Fig. 2B (position 5190 in Hawaii virus) is in accordance with the rather low percentage of nucleotide identity of the capsid fragment with that of Hawaii virus reported in Table 1 (82%). However, the amino acid identity percentage is high (94%).
For E3, the sequence between nucleotides 1 and 872 (ORF1) (nucleotides 4233 to 5104 in Hawaii) is closer to the Bristol sequence, but the sequence between nucleotides 853 and 1416 (ORF2) (nucleotides 5085 to 5648 in Hawaii virus) is close to Melksham sequence, which is also the case for the capsid gene analyzed separately. These three examples showing sudden shifts in similarities suggest that a recombination occurred near the junction of ORF1 and ORF2, which is localized between nucleotides 5085 and 5104 in the Hawaii genome.
Identification of recombination by Sawyer's test. The available sequences of strains E3, L23, and S63, including the ORF1-ORF2 junction, were aligned separately with the following reference strains: Hawaii, Bristol, Melksham, and Hillingdon. As controls, three confirmed recombinants were also tested: Arg320 (GenBank accession number AF190817), which has been previously published as a Lordsdale-Mexico recombinant (19), and two strains of the GGIIb group which were detected in our laboratory, strains L169 (GGIIb/Toronto) and E673 (GGIIb/Snow Mountain) (Table 1). The sequences of these three additional strains located in the same region as those of E3, L23, and S63 were aligned with the sequences of the same reference strains, as well as those of Mexico, Toronto, and Snow Mountain strains for Arg320, L169, and E673, respectively. The results allowed detection of different aligned segments with sequences sufficiently similar to be suggestive of gene conversion (Table 2): S63 and Hillingdon presented a 515-bp homologous fragment which corresponds to the totality of the ORF2 sequence that we obtained for S63; L23 shows a short fragment (122 bp) homologous to that of the Hawaii genotype at the 5' end of ORF2; E3 presents a 185-bp sequence homologous to that of the Bristol genotype at the 3' end of ORF1 and a 330-bp sequence homologous to that of the Melksham genotype at the 5' end of ORF2. This suggests the presence of recombination breakpoints upstream and downstream the beginning of ORF2 (Table 2; Fig. 3) located at positions 25, 45, and 6 or 9 (when it is considered that position 0 is the beginning of ORF2) for strains S63, L23, and E3, respectively. For the three confirmed recombinants, breakpoints were identified at positions 26 (strains Arg320 and L169) and 67 (strain E673).
|
View this table: [in a new window] |
TABLE 2. Homologous fragments suggestive of recombination according to Sawyer's test
|
![]() View larger version (76K): [in a new window] |
FIG. 3. Recombination breakpoints detected in the junction region between ORF1 and ORF2 of recombinant strains by Sawyer's test. Positions according to the beginning of ORF2 are 45, 6 or 9, and 25 for the three putative recombinants L23, E3, and S63, respectively, and 26, 26, and 67 for the confirmed recombinants Arg320, L169, and E673, respectively.
|
|
View this table: [in a new window] |
TABLE 3. Epidemiology of the three recombinant strains
|
By comparison with the sequences in GenBank, we found that the L23 sequence is very close (98 to 99% nucleotide identity) to eight sequences of strains characterized in the United States, but no data concerning their frequency or geographical distribution are available. It must be noted that the complete ORF2 sequences of these strains have been determined, but their ORF1 sequences are at least 550 bp shorter than the L23 ORF1 sequences, thus preventing comparison of the 5' end of L23 with the sequences of the 5' ends of these strains. The sequences of S63 and E3 did not correspond to any sequences submitted to GenBank so far.
|
|
|---|
Taken together, these results confirm that the junction region between ORF1 and ORF2 may be critical for recombination, as is the case for other recombinant noroviruses described in the literature. Sawyer's test allowed us to identify several recombination breakpoints located in this region, upstream and downstream of the beginning of ORF2, suggesting that there is not a unique breakpoint in this region. Of interest, positions 25 and 26 were identified as breakpoints for three different recombinant strains: S63 (Hillingdon genotype capsid, GII-5), Arg320 (Mexico genotype capsid, GII-3), and L169 (Toronto genotype capsid, GII-3), suggesting that this particular region may be more frequently involved in recombination. Although the exact mechanism of recombination has not been elucidated, it has been suggested that during coinfection, the RNA-dependent RNA polymerase could switch from one RNA molecule to another during viral replication because of the high sequence conservation that increases the interaction between RNA molecules. Our investigations of gastroenteritis outbreaks showed that coinfection of patients by several noroviruses has been observed frequently, especially in water- and oyster-related outbreaks, and has also been described several times in the literature (1, 10, 35).
The capsid of the Melksham strain that is part of the E3 strain has been found to recombine with the polymerase of the Bristol strain to give the recombinant Snow Mountain strain (28). The RNA polymerase of the Bristol strain is also found in recombinant strains Arg320 (19) and Wortley (39) in combination with the capsids of genotypes Mexico and Hawaii, respectively. Of interest, sample E3, considered a recombinant between Bristol and Melksham (pol and capsid, respectively), is different from the Snow Mountain agent, which is also a Bristol-Melksham recombinant (86% nucleotide sequence homology in ORF1 and 92% nucleotide sequence homology in ORF2). Thus, some genotypes may be more prone to recombination and the formation of new strains than others. However, the identification of a greater number of recombinants will be required to confirm this hypothesis.
All the recombinants described in this study, as well as the GGIIb variants, presented a low nucleotide identity percentage with known genotypes in the RNA polymerase, suggesting that the capsid sequences of such "odd" strains should be further sequenced. It is not known if the "odd" strains just mentioned really exist or if recombination between the polymerase and capsid genes of two different strains would induce mutations in the polymerase gene, producing a decrease in nucleotide similarities between the polymerases of the recombinant and parent strains. Some recombinant noroviruses described in the literature present high percentages of homology with two different genotypes in both ORFs, meaning that our observation is not a general characteristic of recombinants.
Unlike GGIIb and other norovirus recombinants, such as the Snow Mountain strain and Arg320, which are widely distributed, the three recombinants characterized in this study do not seem to have diffused widely. One was the cause of an outbreak in Crete in 1997; the other two were detected in two sporadic cases of infection in January and December 1999, respectively. Since then, they have never been identified among caliciviruses detected in France. Similar strains were found in other European countries but only their RNA polymerase genes and not their capsid genes were characterized, so they may be different. However, even if we consider, based on the RNA polymerase, that they are the same strains, it appears that they were not reported many times in other European countries. Also, L23 is similar to strains characterized in the United States, but we have no data about their geographical distribution or frequency. Of interest, it has recently been proposed (Second International Calicivirus Conference, Dijon, France, 6 to 10 November 2004) that these strains define a new genotype represented by strain Tiffin (GenBank accession number AY502010). Whatever the rate of recombination in noroviruses, it seems that some recombinants are able to diffuse in the human population and emerge as new strains, while others cause infection in only a few cases and do not spread. The reasons for this difference are not clear. All the recombinants described so far present known capsids, and since capsids are responsible for the immunogenicity, the mechanism that causes a selective advantage is not clear. Variations in antigen sites and/or receptor binding sites may be critical by causing immune escape and/or increasing the number of susceptible hosts.
In conclusion, the results presented here suggest that recombination is not a rare phenomenon among noroviruses and thus contributes to the genetic diversity of these viruses. Recombinant strains may be underestimated by the fact that characterization of noroviruses is usually based on the RNA polymerase gene sequence only, whereas it is necessary to know both the RNA polymerase and the capsid gene sequences to identify such viruses. Identification of recombination breakpoints in the region located at the junction of ORF1-ORF2 confirms the importance of this region in this phenomenon. Further work is needed to determine more precisely the exact implication of recombinant noroviruses in gastroenteritis as well as the molecular basis for the selective advantage observed for some of these strains.
This work was supported by the European Commission, under the Quality of Life and Management of Living Resources, 5th Framework (QLK1-CT-1999-00594).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»