Previous Article | Next Article 
Journal of Clinical Microbiology, June 2005, p. 2750-2755, Vol. 43, No. 6
0095-1137/05/$08.00+0 doi:10.1128/JCM.43.6.2750-2755.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Molecular Epidemiology of a Hepatitis C Virus Outbreak in a Hemodialysis Unit
Maria Alma Bracho,1
María José Gosalbes,1
David Blasco,2
Andrés Moya,1 and
Fernando González-Candelas1*
Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, València, Spain,1
Direcció General de Salut Pública, Conselleria de Sanitat i Consum, Generalitat Valenciana, València, Spain2
Received 4 November 2004/
Returned for modification 1 February 2005/
Accepted 24 February 2005

ABSTRACT
We analyzed a hepatitis C virus (HCV) transmission case in the
hemodialysis unit of a private clinic by sequencing two genome
regions of virus isolates from a number of patients attending
this unit and some external controls. The analysis of 337 nucleotides
(nt) in the NS5B region did not provide enough resolution to
ascertain which patients were actually involved in the outbreak
and the potential source. Nevertheless, this region allowed
the exclusion of several patients as putative sources of the
transmission case based on their genotypes and phylogenetic
relationships. On the other hand, the analysis of several 472-nt-long
clone sequences per sample in a more rapidly evolving region
of the HCV genome, coding for the envelope proteins and encompassing
hypervariable region 1, allowed us to establish the existence
of at least two independent transmission events involving two
different source patients and three recipients. The direction
of the transmissions was further corroborated by different measures
of genetic variability within and among samples.

INTRODUCTION
During the spring of 2002, three patients who attended the hemodialysis
unit of a private clinic in Vinaròs (Castelló,
Spain) tested positive for hepatitis C virus (HCV) despite having
tested negative shortly before. Since the same unit was regularly
attended by other HCV-positive patients and given that the hemodialysis
procedure has a high risk for HCV transmission (
10,
13), it
was necessary to study whether these patients had been infected
in the clinic and, eventually, to determine the source of infection
in order to facilitate the adoption of further safety measures
to prevent any new HCV transmission.
Usually, molecular epidemiology analysis is based on the sequencing and comparison of a single product in a relatively conserved, i.e., slowly evolving, region of the target genome, such as NS5B or core in HCV (3, 12). This is a valid approach as long as there has been enough time for nucleotide differences to accumulate so that differentiation among ancestral and derived genomes has occurred. However, given the extraordinarily high evolutionary rate of HCV in some regions, such as hypervariable region 1 (HVR1), located in the amino terminus of the envelope 2 protein gene, it is possible to gain much better resolution of the evolutionary, and consequently epidemiological, relationships by using sequence information from these regions for cases in which a short time has elapsed between transmission from the source and sampling for analysis (2, 4, 6, 15).
In this report, we provide an illustrative case in which the analysis of a fast-evolving viral region allows better elucidation of a transmission case than the use of a single sequence from a more slowly evolving genome region.

MATERIALS AND METHODS
Samples.
Serum samples were obtained from nine HCV-infected patients
who attended the hemodialysis clinic and from four unrelated
HCV-positive patients (Table
1) from the nearest reference hospital
(Hospital General de Castelló) to be used as local controls.
The patients were diagnosed using an enzyme-linked immunosorbent
assay (ORTHO HCV 3.0 ELISA TestSystem with Enhanced SAVe; Ortho-Clinical
Diagnostics, Cambridge, United Kingdom) and further confirmed
by line immunoassay (INNO-LIA HCV II; Innogenetics N.V., Ghent,
Belgium). The HCV genotypes of the samples were assigned by
sequencing of an NS5B gene fragment and comparison with reference
sequences from GenBank. Additional unrelated sequences from
an independent molecular epidemiology study of HCV patients
in our region (
8,
20) were included in the analyses. Samples
were taken in May 2002 and were stored frozen at 70°C
until they were processed in July of the same year.
View this table:
[in this window]
[in a new window]
|
TABLE 1. Serum samples received for study from HCV-positive patients attending the hemodialysis unit at the clinic where the suspected transmission case was reported
|
RNA extraction and RT-PCR.
Viral RNA was obtained from 200 µl of serum for each sample
using a High Pure Viral RNA kit (Roche Diagnostics GmbH, Mannheim,
Germany). Reverse transcriptions (RT) were carried out in 20-µl
volumes containing 5 µl of eluted RNA, 500 µM of
deoxynucleoside triphosphate (dNTP), 1 µM of hexamers,
100 units of Moloney murine leukemia virus reverse transcriptase
(Promega Corp., Madison, WI), and 20 units of rRNasin RNase
inhibitor (Promega). Reaction mixtures were incubated at 42°C
for 60 min, followed by 2 min at 95°C.
Amplification and direct sequencing of NS5B.
Direct sequences of PCR products were obtained for a 337-nucleotide (nt)-long fragment of the NS5B gene. PCR was performed in a 50-µl volume containing 5 µl of RT product, 100 µM of dNTP, 200 nM of each primer, and 2.5 units of Taq polymerase (Amersham Biosciences, Piscataway, NJ). Amplified products were purified with a High Pure PCR Products Purification kit (Roche). Direct sequencing of purified PCR products was performed on an 8-µl volume, including 1.0 µl of the PCR-amplified DNA, with the ABI PRISM BigDye Terminator v3.0 Cycle Sequencing Ready Reaction kit in an ABI 3700 automated sequencer (Applied Biosystems, Foster City, CA). Sequences were verified, and both strands were assembled using the Staden package (1). The sense and antisense primers used for amplification and direct sequencing of this region were 5'-TATGATACYCGCTGYTTYGACTC-3' and 5'-GTACCTRGTCATAGCCTCCGTGAA-3'.
Cloning and sequencing of E1-E2 region.
A 472-nt fragment of the E1-E2 region containing HVR1 and HVR2 regions was amplified by nested PCR. The first amplification was performed in a 100-µl volume containing 10 µl of the RT product, 10 µl of 10x PCR buffer, 200 µM (each) dNTP, 400 nM (each) primer (sense, 5'-CGCATGGCYTGGGAYATGAT-3'; antisense, 5'-GGYGSGTARTGCCARCARTA-3'), and 2.5 U of Pfu DNA polymerase (Stratagene, La Jolla, CA). When necessary, a second PCR was performed with a nested sense primer (5'-GGGATATGATRATGAAYTGGTC-3') and the same antisense primer indicated above. In all cases, PCR was performed in a Perkin-Elmer 2400 thermal cycler with the following thermal profile: 94°C for 3 min; 5 cycles at 94°C for 30 s, 55°C for 30 s, and 72°C for 3 min; 35 cycles at 94°C for 30 s, 52°C for 30 s, and 72°C for 3 min; and a final extension at 72°C for 10 min. A single amplified product was observed after electrophoresis on a 1.4% agarose gel stained with ethidium bromide.
Amplification products were purified with a High Pure PCR Products Purification kit (Roche) and then directly cloned in EcoRV-digested pBluescript II SK(+) phagemid (Stratagene). Plasmid DNA was purified with a High Pure Plasmid Isolation kit (Roche). Recombinant clones were sequenced by the use of KS and SK primers (Stratagene) and the same procedure described above for the NS5B gene.
Sequence analysis.
NS5B sequences were analyzed with a panel of 51 additional HCV sequences from the same genome region, including genotypes 1a (10 sequences), 1b (38 sequences), 3a (2 sequences), and 5a (1 sequence). Similarly, the 122 E1-E2 region sequences derived from recombinant clones corresponding to the 10 patients infected with HCV genotype 1b were analyzed, along with 73 sequences of the same genotype and HCV genome region.
For both HCV genome regions, multiple sequence alignments were obtained using ClustalW (19). The model that accounted best for the observed evolutionary pattern was ascertained using Modeltest 3.5 (11) and PAUP* v4.0b10 (18). A phylogenetic tree was obtained by maximum likelihood using PHYML (7) and employing the evolutionary model derived in the previous step. Support for the phylogenetic tree was obtained by bootstrap resampling (5) with 2,000 pseudoreplicates and neighbor-joining clustering (17) using the same evolutionary model as above, as implemented in MEGA2 (9).
Genetic variability estimates for sequences derived from related samples (see below) were obtained with DnaSP 3.95 (16).
Nucleotide sequence accession numbers.
The HCV sequences obtained in this study have been deposited in GenBank under accession numbers AJ880694 to AJ880757 for Ns5b region sequences and AJ866035 to AJ866183 for clones of the E1-E2 region.

RESULTS
The HCV genotype (Table
1) was determined by sequencing of an
NS5B gene fragment and comparison with reference sequences of
known genotype (Fig.
1). Eight of the nine sequences derived
from patients attending the case clinic shared the same HCV
genotype 1b, and the other was genotyped as 3a. Among the four
control samples, two also harbored HCV-1b viruses. This information
allowed us to exclude genotypes other than 1b as a potential
source for the outbreak, and hence, the ensuing analyses were
restricted to samples of genotype 1b.
The evolutionary model that best accounted for the data in the
NS5B region corresponds to transversional model distance (
14),
with a gamma distribution accounting for heterogeneity in evolutionary
rates among sites (Shape parameter = 0.5910) and a proportion
of invariable sites (Pinvar = 0.3826). In the phylogenetic tree
for this region (Fig.
1), two different monophyletic groups,
each including three sequences from the case, can be observed.
One group includes sequences from two of the patients whose
seroconversion prompted the study (patients 40 and 41) and one
patient previously known to be HCV infected (patient 43). The
other group included the sequence from the third patient prompting
the study (patient 42) and the sequences from two other known
HCV-positive patients from the same clinic (patients 36 and
39). In this group, sequences from patients 39 and 42 were almost
identical in this region and formed a well-supported subgroup
(bootstrap support [BS] = 97%). Apart from the well-supported
nodes defining the HCV genotypes and subtypes included in the
study, the phylogenetic tree for the NS5B region presents only
four nodes with bootstrap support higher than 70% (Fig.
1).
Two of these correspond to two samples taken at different times
(separated by 6 months) from the same patients (G26, BS = 98%,
and C05, BS = 95%, respectively), one corresponds to two unrelated
sequences (C29 and A21; BS = 78%), and the last one is the already-mentioned
group comprising samples 39 and 42.
The analysis of the NS5B region does not provide a clear-cut answer to the relevant question of how many patients, if any, were infected with HCV at the case clinic. To obtain a better-resolved picture of the phylogenetic relationships among the involved patients, we proceeded by analyzing a more rapidly evolving region of the HCV genome. We determined the sequences of 122 cloned fragments derived from the 10 patients, 8 from the case clinic and 2 controls, in the E1-E2 region of HCV (Fig. 2). These sequences were analyzed with the sequences of 73 cloned fragments from the same region and genotype, a number of which were also derived from the same patient at different serial times (separated by 6 or 12 months). In this case, the best model of evolution corresponded to GTR (general time reversible) (14), with a gamma distribution accounting for heterogeneity in evolutionary rates among sites (Shape parameter = 1.0455) and a proportion of invariable sites (Pinvar = 0.3358). The resulting maximum-likelihood phylogenetic tree is shown in Fig. 2. As in the previous tree, most phylogenetic groups do not reach a BS of >70%, and those that do can be divided into two categories. One category includes all pairs of serial samples from the same patient, with BSs of >90% and >99% in most cases; the second category includes nodes with 70% < BS < 90% that encompass samples from epidemiologically unrelated patients.
Sequences from the studied samples appear in two forms in the
phylogenetic tree for the E1-E2 region. Samples 36, 37, 38,
90, and 92 show highly supported (BS > 99%) monophyletic
clusters that include only sequences derived from each individual
patient. However, sequences from samples 40, 41, and 43 on the
one hand and 39 and 42 on the other are grouped in two separate,
highly supported (BS = 100%) clusters, designated groups A and
B, respectively. Within them, sequences from one patient each,
43 and 39, do not group monophyletically, as opposed to the
sequences from the other patients in the corresponding groups
(Fig.
3). In group A (Fig.
3), all the sequences from samples
40 and 41 are almost identical and form monophyletic groups
with very high support (BS = 100 and 99%, respectively), whereas
sequences derived from sample 43 are distributed in two main
subgroups, one of them clearly related to sequences from the
other two samples (BS = 81%). Furthermore, sequences from sample
43 are much more diverse than those from the other two patients.
A similar pattern can be observed for the sequences of group
B (Fig.
3), with all sequences being identical and forming a
highly supported monophyletic group (BS = 98%) and with sample
39 presenting more diverse sequences that do not group in a
single cluster, some of which are clearly related (occupying
a basal position) to those from sample 42.
Table
2 presents a summary of the intrapatient genetic variability
derived from the comparison of clone sequences obtained for
the E1-E2 regions of the 10 samples with HCV genotype 1b included
in the study. Different measures of genetic variation are coincident
in separating two clearly distinct groups, one comprising samples
40, 41, and 42, those derived from the three patients suspected
to have been recently infected by HCV, and the other with the
remaining samples, including the two controls of genotype 1b.
The first group is characterized by very low genetic variability,
with at most two haplotypes and a single mutation among the
10 clone sequences derived from each sample. This is also reflected
in other genetic variability parameters that allow a better
comparison with other samples, for which the number of clone
sequences is occasionally different. The second group is characterized
by much higher values of genetic variation, both in number of
haplotypes (
h), total number of mutations (

), substitutions
per site (

), and differences among pairs of sequences (

). These
results are indicative of a recent origin of the viral population
infecting the patients in the first group.
View this table:
[in this window]
[in a new window]
|
TABLE 2. Summary of intrapatient genetic variability for the E1-E2 region of HCV genotype 1b samples analyzed in this study
|
We further compared the levels of differentiation within and
between two groups involved in the transmission, as detected
in the previous phylogenetic analyses. For comparison, we also
analyzed the differentiation between and within control samples
and the two transmission groups. Table
3 summarizes these analyses.
In the three comparisons between pairs of suspect source and
recipient patients (40-43, 41-43, and 39-42), the different
parameters evaluating genetic differentiation indicate a much
lower level of difference between the patients in these pairs
than in the comparison between unrelated samples (between groups
A and B, between controls and both groups, and between pairs
of control samples). This is also reflected in the almost complete
absence of polymorphic sites in the sequences derived from recently
infected patients that are monomorphic in the corresponding
sources and in the number of fixed differences, as opposed to
what happens in the comparison between sequences derived from
epidemiologically unrelated patients.

DISCUSSION
A detailed analysis of the phylogenetic relationships and genetic
variability and differentiation within and among sequences in
a rapidly evolving region of the HCV genome provided a different,
more accurate image of the relationships among infected patients
involved in a hepatitis C virus transmission that occurred at
a hemodialysis unit. The analysis revealed the occurrence of
two independent transmission events, one involving one source
(represented by sample 39) and one infected patient (patient
42), the other involving one common source (patient 43) and
two recipients (patients 40 and 41). Both transmission events
resulted in well-defined, highly supported monophyletic groups
when several cloned sequences of the E1-E2 region of the HCV
genome from the corresponding samples were analyzed using phylogenetic
methods. The support for each of these groups as a whole was
similar to that received by the equivalent monophyletic groups
in which all the clone sequences from the control and unrelated
samples appeared in the same analysis (Fig.
2). Similar levels
of support from bootstrap resampling were observed only in clades
formed by sequences from the same patient and, in consequence,
can be considered a defining mark of very closely related sequences,
derived either from the same individual at close time intervals
or from very closely related samples, such as those represented
by source and recipient individuals. On the other hand, sequences
from unrelated samples either do not receive significant bootstrap
support (lower than 70%) or the support is barely higher than
this value and always lower than 90%.
The analysis of a more conserved, slowly evolving region of the HCV genome such as NS5B does not provide a well enough resolved picture of the previously described relationships. First, high statistical support by bootstrap analysis is obtained only for genotype and subtype clades (Fig. 1), for serially sampled sequences from the same patient, and for one of the two groups related to a recent transmission event, the one represented by samples 39 and 42. However, the other group, composed of sequences derived from samples 40, 41, and 43, is present in the phylogenetic tree but does not receive enough bootstrap support. The analysis of this region also presents an example of a potentially more disturbing result. There appears to be a group (patients 36, 39, and 42) which, although without statistical support, could be mistakenly thought to represent a different, larger transmission case. Nevertheless, the analysis of this region does provide evidence for a lack of association between sequences, even from the same genotype, not related by transmission events. In turn, this allows one to focus on a more labor-intensive analysis involving the cloning and later sequencing of a more rapidly evolving region, such as E1-E2 in the HCV genome.
Another relevant consideration emerging from this study is the need to incorporate as many possible control samples from the local population(s) as are available in the study of outbreaks or transmission cases. The usual evidence for a close relatedness between viruses derived from different samples is their grouping into a more or less well-supported monophyletic group. However, this is not necessarily the result of direct transmission, as it can also be due to both samples being derived from a common population with an older, more divergent common ancestor. If this is the case, then other samples from the same source population and epidemiologically unrelated to the case in question will reveal the true nature of this relationship, but if no such additional sample is included in the study, which will certainly happen if sampling of controls is not sufficient, the relationship will not be noticed. In this study, the availability of a large number of local samples of the same genotype as those involved in the transmission case helped us to ascertain the existence of at least two different transmission chains with different origins. Nevertheless, the two sources are related, as they derive from a common local pool. Without incorporating a sufficient number of local control samples, both sources and the infected samples could be erroneously interpreted as deriving from a single source in the hemodialysis unit. As a consequence, we can conclude that whatever procedure or protocol was not handled properly and resulted in transmission, it was not a unique event and was repeated at least twice. This implies that a more thorough revision of the operating procedures in this hemodialysis unit is necessary.

ACKNOWLEDGMENTS
This work was supported by Conselleria de Sanitat i Consum,
Generalitat Valenciana, and project Grupos03/204 from Agència
Valenciana de Ciència i Tecnologia.

FOOTNOTES
* Corresponding author. Mailing address: Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Apartado Oficial 22085, 46071 Valencia, Spain. Phone: 34 963543653. Fax: 34 96 3543670. E-mail:
fernando.gonzalez{at}uv.es.


REFERENCES
1 - Andrews, P. 1992. Evolution and environment in the hominoidea. Nature 360:641-646.[CrossRef][Medline]
2 - Casino, C., J. McAllister, F. Davidson, J. Power, E. Lawlor, P. L. Yap, P. Simmonds, and D. B. Smith. 1999. Variation of hepatitis C virus following serial transmission: multiple mechanisms of diversification of the hypervariable region and evidence for convergent genome evolution. J. Gen. Virol. 80:717-725.[Abstract]
3 - Echevarria, J. M., P. León, C. J. Domingo, J. A. López, C. Elola, M. Madurga, F. Salmerón, P. L. Yap, J. Daub, and P. Simmonds. 1996. Laboratory diagnosis and molecular epidemiology of an outbreak of hepatitis C virus infection among recipients of human intravenous immunoglobulin in Spain. Transfusion 36:725-730.[CrossRef][Medline]
4 - Esteban, J. I., J. Gómez, M. Martell, B. Cabot, J. Quer, J. Camps, A. González, T. Otero, A. Moya, R. Esteban, and J. Guardia. 1996. Transmission of hepatitis C virus by a cardiac surgeon. N. Engl. J. Med. 334:555-560.[Abstract/Free Full Text]
5 - Felsenstein, J. 1985. Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39:783-791.[CrossRef]
6 - González-Candelas, F., M. A. Bracho, and A. Moya. 2003. Molecular epidemiology and forensic genetics: application to a hepatitis C virus transmission event at a hemodialysis unit. J. Infect. Dis. 187:352-358.[CrossRef][Medline]
7 - Guindon, S., and O. Gascuel. 2003. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52:696-704.[CrossRef][Medline]
8 - Jiménez, N. 2004. Evolución del virus de la hepatitis C en muestras hospitalarias de la Comunidad Valenciana. Ph.D. thesis. Universitat de Valencia, Valencia, Spain.
9 - Kumar, S., K. Tamura, I. B. Jakobsen, and M. Nei. 2001. MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17:1244-1245.[Abstract/Free Full Text]
10 - Memon, M. I., and M. A. Memon. 2002. Hepatitis C: an epidemiological review. J. Viral Hepat. 9:84-100.[CrossRef][Medline]
11 - Posada, D., and K. A. Crandall. 2001. Selecting the best-fit model of nucleotide substitution. Syst. Biol. 50:580-601.[Medline]
12 - Power, J. P., E. Lawlor, F. Davidson, E. C. Holmes, P. L. Yap, and P. Simmonds. 1995. Molecular epidemiology of an outbreak of infection with hepatitis C virus in recipients of anti-D immunoglobulin. Lancet 345:1211-1213.[CrossRef][Medline]
13 - Pradat, P., and C. Trépo. 2000. HCV: epidemiology, modes of transmission and prevention of spread. Baillieres Best Pract. Res. Clin. Gastroenterol. 14:201-210.[CrossRef][Medline]
14 - Rodríguez, F., J. L. Oliver, A. Marín, and J. R. Medina. 1990. The general stochastic model of nucleotide substitution. J. Theor. Biol. 142:485-501.[Medline]
15 - Ross, R. S., S. Viazov, and M. Roggendorf. 2002. Phylogenetic analysis indicates transmission of hepatitis C virus from an infected orthopedic surgeon to a patient. J. Med. Virol. 66:461-467.[CrossRef][Medline]
16 - Rozas, J., J. C. Sanchez-DelBarrio, X. Messeguer, and R. Rozas. 2003. DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19:2496-2497.[Abstract/Free Full Text]
17 - Saitou, N., and M. Nei. 1987. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406-425.[Abstract]
18 - Swofford, D. L. 2002. PAUP*. Phylogenetic analysis using parsimony (* and other methods). Sinauer Associates, Sunderland, Mass.
19 - Thompson, J. D., D. G. Higgins, and T. J. Gibson. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.[Abstract/Free Full Text]
20 - Torres-Puente, M. 2004. Variabilidad genética y respuesta al tratamiento antiviral en el virus de la hepatitis C (VHC). Ph.D. thesis. Universitat de Valencia, Valencia, Spain.
Journal of Clinical Microbiology, June 2005, p. 2750-2755, Vol. 43, No. 6
0095-1137/05/$08.00+0 doi:10.1128/JCM.43.6.2750-2755.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
This article has been cited by other articles:
-
Cottam, E. M, Thebaud, G., Wadsworth, J., Gloster, J., Mansley, L., Paton, D. J, King, D. P, Haydon, D. T
(2008). Integrating genetic and epidemiological data to determine transmission pathways of foot-and-mouth disease virus. Proc R Soc B
275: 887-895
[Abstract]
[Full Text]
-
Wrobel, B., Torres-Puente, M., Jimenez, N., Bracho, M. A., Garcia-Robles, I., Moya, A., Gonzalez-Candelas, F.
(2006). Analysis of the Overdispersed Clock in the Short-Term Evolution of Hepatitis C Virus: Using the E1/E2 Gene Sequences to Infer Infection Dates in a Single Source Outbreak. Mol Biol Evol
23: 1242-1253
[Abstract]
[Full Text]