Previous Article | Next Article ![]()
Journal of Clinical Microbiology, February 2002, p. 524-531, Vol. 40, No. 2
0095-1137/01/$04.00+0 DOI: 10.1128/JCM.40.2.524-531.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Molecular Diagnostics Unit, Cork Institute of Technology, Bishopstown,1 Department of Medical Microbiology, Cork University Hospital, Wilton, Cork, Ireland,2 Viral Gastroenteritis Section, MS G-04, Centers for Disease Control and Prevention, Atlanta, Georgia 303333
Received 3 August 2001/ Returned for modification 27 September 2001/ Accepted 18 November 2001
|
|
|---|
|
|
|---|
The rotavirus genome consists of 11 double-stranded RNA gene segments that encode five structural and six nonstructural proteins (28). Viral proteins VP4 and VP7 are responsible for the production of a neutralizing antibody response to rotavirus infection. These proteins have distinct antigenic specificities referred to as P and G types, respectively (10). Currently, 9 P serotypes and 10 G serotypes have been identified in humans (9, 23, 33, 36). Natural immunity to rotavirus infection is thought to be serotype specific (15). Antibodies derived from a primary infection are described as homotypic and not sufficiently cross-reactive to prevent infection by heterotypic strains. For any vaccine and subsequent vaccination program to be effective, vaccines must be designed so that they include the neutralization phenotypes of the epidemiologically important serotypes. Several investigators have described the predominant types circulating globally as G1P[8], G2P[4], G3P[8], and G4P[8] (3, 14, 25, 30, 38). However, unconventional G types and unusual combinations of G and P types are now being reported with increasing frequency (6, 19, 22, 26, 30, 31). In addition, the genetic complexity within individual serotypes is now regarded as being more extensive than was previously anticipated. Jin et al. (16) identified four distinct genetic lineages of viruses of the G1 serotype (designated G1-1, G1-2, G1-3, and G1-4), and Piec and Palombo (29) described the existence of genetic and antigenic subtypes of serotype G2. Similarly, subtypes of rotavirus serotype G4 have also been reported (7, 20). These genetic variants have the potential to give rise to antigenically distinct strains with similar serotypes. This feature has important implications for vaccine design.
It is now recognized that a greater understanding of the genetic and antigenic compositions of the wild-type virus populations in different geographical locations is required (25). Molecular biology-based techniques have contributed significantly to the characterization of rotaviral genomes. Methods including PCR typing and hybridization studies have facilitated the identification of several serotypes, and in addition, DNA sequence analysis has provided a complete characterization of many rotavirus gene segments. Previously, Gouvea et al. (12) proposed the use of restriction fragment length polymorphism (RFLP) protocols to uniquely identify and group rotavirus strains. Their studies suggested that this strategy may prove useful in monitoring of the extent of genetic variation among rotavirus strains within a population and may prove valuable in the examination of interspecies transmission (4, 12).
In the present study an RFLP assay was developed to identify and differentiate the G types of 100 Irish rotavirus isolates. The Irish isolates analyzed in the present study represented a collection of 330 strains previously isolated over a 3-year period from 1997 through 1999 (21). In addition, the assay was used to examine the extent of genetic relationships between Irish strains and global strains of the same serotype. As a consequence, novel strains circulating in the Irish population were identified.
|
|
|---|
RNA purification and RT-PCR. Double-stranded rotaviral RNA genomes were extracted by a phenol-chloroform method as described by Gouvea et al. (11). Full-length (1,062-bp) VP7 gene segments encoding the major neutralization protein, VP7, were amplified by reverse transcriptase (RT)-mediated PCR (RT-PCR) with primer pair Beg9 and End9 (11). The amplified products were resolved by conventional agarose gel (1.5%) electrophoresis and were visualized after ethidium bromide staining (0.1 mg/ml).
RFLP analysis. Each amplified DNA product (amplified as described above) was analyzed separately by direct digestion with 1 U of RsaI (GT"AC), AluI (AG"CT), or EcoRV (GAT"ATC). Following 1.5 h of incubation at 37°C, the digested products were analyzed by conventional agarose gel (2%) electrophoresis (as outlined above).
Database rotavirus sequences. Thirty strains whose G types were known (serotypes G1 through G4, G8, and G9) were selected from the current GenBank database to represent strains from various geographical locations (see Table 3). These sequences were analyzed for the corresponding RsaI, AluI, and EcoRV restriction sites with DNA Sequencher (version 4.01) sequencing software (Gene Codes Corp., Ann Arbor, Mich.).
|
View this table: [in a new window] |
TABLE 3. GenBank sources of VP7-encoding gene segments from group A human rotavirus strains
|
Nucleotide sequence accession numbers. The DNA sequences were directly submitted to GenBank and were assigned the following GenBank accession numbers: AF254137, AF254138, AF254139, AF254140, AF254141, and AF281044.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Profiles of VP7 gene segments from Irish strains obtained with restriction enzymes RsaI, AluI, and EcoRV
|
![]() View larger version (58K): [in a new window] |
FIG. 1. Enzyme profiles of VP7 segments digested with three restriction enzymes, RsaI, AluI, and EcoRV. All products were analyzed by agarose gel (2%) electrophoresis and visualized by staining with ethidium bromide (0.1 mg/ml). (a) RsaI digestion. Lanes M, molecular size marker, 100-bp ladder; lane 1, undigested VP7 fragment; lane 2, DNA control digested with RsaI; lane 3, CIT-54RV (G1); lane 4, CIT-124RV (G4); lane 5, CIT-179RV (G1); lane 6, CIT-202RV (G4); lane 7, CIT-222RV (G4); lane 8, CIT-70RV (G1); lane 9, CIT-76RV (G1); lane 10, CIT-78RV (G1); lane 11, CIT-79RV (G1); lane 12, CIT-95RV (G2); lane 13, CIT-155RV (G1 + G4); lane 14, CIT-109RV (G1); lane 15, CIT-112RV (G1); lane 16, CIT-313RV (G1). (b) AluI digestion. Lanes M, molecular size marker, 100-bp ladder, and molecular size marker, grade V; lane 1, undigested VP7 fragment; lane 2, DNA control digested with AluI; lane 3, CIT-68RV (G2); lane 4, CIT-245RV (G2); lane 5, CIT-128RV (G2); lane 6, CIT-152RV (G2); lane 7, CIT-123RV (G2); lane 8, CIT-132RV (G1); lane 9, CIT-176RV (G4); lane 10, CIT-178RV (G4); lane 11, CIT-224RV (G4); lane 12, CIT-273RV (G2); lane 13, CIT-278RV (G4); lane 14, CIT-289RV (G4); lane 15, CIT-293RV (G4); lane 16, CIT-298RV (G4); lane
|
![]() View larger version (38K): [in a new window] |
FIG. 2. Schematic representations of the restriction maps identified for VP7-encoding gene segment 9 of group A human rotaviruses. The approximate positions of the nine variable regions (VR1 through VR9) are represented by boxes (with the locations of each indicated beneath each panel); and major antigenic regions VR5, VR7, and VR8 are highlighted by hatched boxes. The corresponding restriction sites and nucleotide positions identified for RsaI (a), AluI (b), and EcoRV (c) are shown and are indicated by arrows.
|
|
View this table: [in a new window] |
TABLE 2. RFLP patterns identified for Irish isolates by enzyme profiles
|
Five profiles were obtained by digestion with AluI (profiles A1 through A5; Table 1). A brief analysis of Irish G2 viruses demonstrated that these strains were associated with a single enzyme profile, profile A4 (Fig. 1b, lane 4). Serotype G1 and G4 strains usually generated profiles A1 and A3, respectively, after resolution by gel electrophoresis (Fig. 1b, lanes 8 and 15). Furthermore, both serotypes were also associated with the A2 profile (Fig. 1b, lane 13, and Table 1). The majority of isolates that were part of mixed infections demonstrated single profiles, which suggested that one of the coinfecting G-type strains was dominant (Table 1). Some exceptions were noted, particularly in the case of two isolates that had mixed A profiles (data not shown), and again, strain CIT-254RV (as noted above) had a unique profile, profile A5 (Table 1). A particular feature observed after AluI digestion was the production of a predominant profile on a background of DNA fragments consistent with partial digestion (Fig. 1b, lanes 4, 11, and 14). Gouvea et al. (12) previously noted this feature when VP7 gene segments were similarly digested with BstYI. This feature did not hinder assignment of a profile to these isolates.
Similarly, the E profiles generated with EcoRV were also informative. Serotype G2 isolates were associated with the E3 profile alone (Fig. 1c, lane 3), and serotype G1 strains were associated only with the E1 profile (Fig. 1c, lane 7). A majority of serotype G4 viruses (n = 9; 53%) were also associated with the E1 profile; however, some strains (n = 8; 47%) also generated a profile designated E2 (Fig. 1c, lane 10). Isolates that were part of mixed infections again had either mixed E profiles or single E profiles indicative of one of the strains responsible for the infection. Isolate CIT-254RV, which previously demonstrated unique enzyme profiles with RsaI and AluI, was in this case associated with the E1 profile, similar to serotype G1 and G4 strains.
Computational analysis of the sequences of 30 randomly selected global rotavirus strains from the GenBank database permitted the construction of the corresponding R, A, and E profiles. An additional 14 R-profile banding patterns were noted after RsaI digestion, and these were designated R7 through R20 (data not shown). These were distinct from the R profiles associated with the Irish strains. Nineteen distinct A profiles (profiles A6 to A24) were associated with the database strains, producing a total of 24 profiles for the complete collection, and similarly, 12 additional E profiles, denoted profiles E4 through E15, were obtained after computational analysis of the preselected global strains (data not shown).
Combined RFLP profiles. The enzyme profiles obtained previously were then combined for each strain to generate individual RFLP patterns. The 130 human rotavirus VP7-encoding genes were classified into a total of 36 RFLP patterns. Only 9 patterns were observed among the Irish strains, and all of these are listed in Table 2.
For the serotype G1 viruses, 17 of the 25 (68%) strains analyzed had the pattern R1A1E1, Four strains (16%) had the pattern R1A2E1, and three strains (12%) had the pattern R2A2E1. A single G1 isolate had a unique pattern of R6A1E1. Seventeen of the serotype G4 viruses produced three RFLP patterns: R3A3E1 (n = 7; 41%), R3A3E2 (n = 8; 47%), and R3A2E1 (n = 2; 12%). Specifically, the R3A3 combination was evident in the majority of G4 Irish strains (n = 15; 88%). All of the Irish serotype G2 isolates had a single RFLP pattern, pattern R4A4E3 (Table 2). The RFLP profiles of the global strains showed marked genetic variations, with only a single isolate (isolate GBR-1) having an RFLP profile that corresponded to those of the Irish strains (Table 3).
Isolates that were part of mixed G-type infections could also be assigned to specific RFLP profiles. The majority of these corresponded to the RFLP profile for one of the apparently dominant coinfecting viruses. Some exceptions included strains with obvious single enzyme profiles, such as CIT-254RV (21). This isolate was identified by PCR typing as having a mixture of serotypes, serotypes G1 and G3 (21), and had a unique RFLP pattern, pattern R5A5E1 (Table 2). Examination of the electropherotype pattern for this isolate by polyacrylamide gel electrophoresis (PAGE) also suggested that this strain was unique, as the PAGE pattern identified was distinct from those of the other strains in this collection of Irish strains (data not shown).
DNA sequence analysis and genetic variation within the variable-region domains of VP7. Sequence analysis of VP7 DNA fragments cloned from six Irish isolates with defined G serotypes confirmed the location and subsequent enzyme profiles for RsaI, AluI, and EcoRV (Table 1 and Fig. 2). The isolates chosen for sequence analysis were selected on the basis of their corresponding RFLP arrays (i.e., R, A, and E profiles) and included the following: CIT-4RV (R1A1E1), CIT-6RV (R2A2E1), CIT-313RV (R6A1E1), CIT-220RV (R3A3E2), CIT-176RV (R4A4E3), and CIT-254RV (R5A5E1) (21). Open reading frames were identified in all cases, and the deduced amino acid sequences were compared to each other and the VP7 sequences of selected global isolates and were subsequently aligned by use of the CLUSTALW program (35).
Careful inspection of the restriction maps of the VP7-encoding genes of the Irish strains revealed several interesting features (Fig. 2). In this study some restriction sites identified were associated with a specific G type, including the RsaI site located at nucleotide position 318 bp (the asterisk labeled 318 in Fig. 2a), which was unique to the Irish serotype G1 viral population (Table 1). The RsaI site at nucleotide position 774 bp (the asterisk labeled 774 in Fig. 2a) was uniquely linked with G2 strains (Table 1). These sites exist within variable regions VR5 and VR9, respectively. Other restriction sites were located outside the variable regions, and these also appeared to be associated with specific G types. Some of these included the EcoRV site at nucleotide position 858 bp (the asterisk labeled 858 in Fig. 2c), which was identified with serotype G2 strains (Table 1), and the RsaI sites at nucleotide position 448 and 467 bp (the asterisks labeled 448 and 467, respectively, in Fig. 2a), which appeared to be conserved among serotype G4 and G1 strains, respectively (Table 1).
Comparison of the amino acid lineages among VP7 proteins from Irish and global strains whose G serotypes were determined. Rotavirus strains of the G1 serotype are the most prevalent strains circulating in the Irish population (22, 24). Of all Irish isolates studied in our laboratory, G1 strains produced the greatest RFLP-associated diversity (Table 2). It was therefore of interest to further characterize the VP7 proteins from these viruses. The deduced amino acid sequences of 3 Irish serotype G1 isolates selected on the basis of different RFLP profiles (R1A1E1 [isolate CIT-4RV], R2A2E1 [isolate CIT-6RV], and R6A1E1 [isolate CIT-313RV]) were compared to 11 global serotype G1 viruses from various geographical locations (Table 4).
|
View this table: [in a new window] |
TABLE 4. Amino acid alignments of 14 G1 VP7 protein sequencesa
|
The deduced protein sequences of representative G2 and G4 Irish strains were also examined and compared to those of global strains (data not shown). Comparison of these sequences demonstrated a high level of amino acid sequence similarity between Irish and global strains of similar serotypes. However, amino acid substitutions were identified within the variable regions of the Irish isolates only. These data, together with the results of RFLP analysis, further supported the view that the Irish population of serotype G2 and G4 strains was homogeneous and could be distinguished from global strains.
The amino acid sequence of the VP7 protein of isolate CIT-254RV was also examined and aligned with those of other VP7 proteins of isolates representing serotypes G1 to G4, G8, and G9 (data not shown). Sequence similarities to more than one serotype were identified in variable regions VR1 through VR9, including serotypes G1, G3, and G9. However, unique amino acid substitutions which may affect antigenic determinants were also identified in VR6 and VR7. This information, together with the unique RFLP and PAGE patterns associated with this isolate, suggested that CIT-254RV was a novel strain circulating in the population.
|
|
|---|
The most significant genetic variation among Irish strains occurred within the serotype G1 virus population, among which four different RFLP patterns were identified. This finding indicated that, compared to serotype G2 and G4 strains, the serotype G1 viruses were genetically more diverse. To investigate this further the nucleotide and deduced amino acid sequences of three G1 isolates associated with three different RFLP patterns were identified. These data revealed that at least two distinct G1 lineages were cocirculating in the Irish population, including G1-1 (isolate CIT-6RV) and G1-3 (isolate CIT-4RV). Both of these strains were isolated in the first year of the original study (21), indicating that both lineages were cocirculating at that time. Significantly, the RFLP pattern identified for the latter strain (strain CIT-4RV) was R1A1E1, which was associated with 68% of the Irish serotype G1 isolates examined. This may imply that the G1-3 lineage predominated. A single G1 isolate, CIT-313RV, was associated with the unique RFLP pattern R6A1E1. Examination of the deduced VP7 amino acid sequence of this strain revealed similarities to the VP7 amino acid sequences of both the G1-1 and the G1-3 lineages (data not shown). Significantly, strain CIT-313RV was isolated in 1999, near the end of the study, and thus may be the product of an interserotypic recombination event (34) that may have occurred between the VP7 genes of two distinct G1 lineages, G1-1 and G1-3. The RFLP data informed our choice of VP7 genes for sequencing, and subsequently, two lineages, G1-1 and G1-3, were detected. It would not be unreasonable to suggest that had all 25 serotype G1 strains been sequenced, additional lineages may have been identified.
A restriction enzyme's potential to identify the presence of mixed G-type infections appeared to be dependent on the dosage of the gene for VP7 from coinfecting strains and whether recombination events occurred between strains. The majority of isolates with mixed serotypes were associated with single R, A, and E profiles. It may be reasonable to suggest that one of the coexisting VP7-encoding gene segments was present at a higher copy number in one strain than in other strains. However, some isolates with mixed serotypes, including isolate CIT-254RV, were associated with unique restriction profiles. Further examination of the protein sequence confirmed that this strain was antigenically distinct from other Irish isolates. The two major mechanisms believed to be responsible for the production of antigenic variants of rotavirus strains are nucleotide substitution and gene reassortment (14, 34). However, the latter mechanism occurs in an all-or-none fashion, and the deduced protein sequence of CIT-254RV revealed similarities to the sequences of more than one serotype. Intragenic recombination is a third mechanism by which rotavirus strains escape the neutralizing effect of the host immune system. This involves the exchange of antigenic regions between strains of different serotypes and can potentially create antigenic variants with dramatic changes in their antigenicities (34). This mechanism may be more suited as an explanation for the evolution of strain CIT-254RV in the Irish population. In these settings the RFLP assay was capable of detecting novel antigenically distinct strains.
Comparative sequencing studies previously identified nine variable regions (VR1 to VR9) within the gene segment encoding the VP7 protein (13). These regions are known to be divergent among different serotypes but highly conserved within a given serotype and are thus useful for prediction of the serotype of any isolate by sequence comparison. In particular three of these regions, VR5 (including amino acid residues 87 to 101), VR7 (amino acid residues 142 to 152) and VR8 (amino acid residues 208 to 221) are designated the major antigenic regions within VP7 (Fig. 2, hatched boxes) (7, 8). As stated above, some restriction sites located within the variable regions were associated with a specific G type, supporting the hypothesis that the sequences of the variable regions are highly conserved within a given serotype and are divergent between serotypes. However, some restriction sites located outside the variable regions were G-type specific. These regions encoded the corresponding conserved amino acid residues that were not directly involved in antigenic specificity but, rather, that predicted a specific G type. In earlier studies, Gouvea et al. (12) suggested that these sites are probably reminiscent of a G-type ancestor, and as they do not appear to be influenced by direct selective pressure from neutralizing antibodies, they are stable over time and thus may be useful as potential epidemiological markers for Irish isolates.
Finally, recent reports identified the major circulating G serotypes in Ireland as G1 and G2, with a lower incidence of serotype G4 strains (22). It was previously noted that mixed infections occurred in large numbers, and this feature had the potential to lead to the emergence of novel strains. In the present study restriction enzyme analysis in conjunction with sequencing studies identified genetic and antigenic variants circulating in the population. Thus, RFLP analysis could be a useful epidemiological tool for monitoring the emergence of new strains, assessing the efficacy of vaccination, and identifying strains for ancestral evolution.
Financial support from Irish Government Scientific Funding Agencies is gratefully acknowledged (grants GTP 96/CR/028 and ARG 98/226). Wyeth Lederle (Ireland) is also acknowledged for financial support.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»