|
|
||||||||
Short Communication |
1 Department of Virology, Erasmus Medical Center, Dr Molewaterplein 50, 3015 GE Rotterdam, The Netherlands
2 Center of Excellence in Viral Hepatitis Research, Department of Pediatrics, Chulalongkorn Hospital, Bangkok, Thailand
3 Department of Biological Analysis, Pasteur Institute Ho Chi Minh City, Ho Chi Minh City, Vietnam
Correspondence
Bart L. Haagmans
b.haagmans{at}erasmusmc.nl
| ABSTRACT |
|---|
|
|
|---|
The GenBank/EMBL/DDBJ sequence accession numbers for the sequences reported in this study are EU246930–EU246940.
Supplementary material is available with the online version of this paper.
| MAIN TEXT |
|---|
|
|
|---|
HCV genotype 6 viruses mainly circulate in South East Asia (SEA), including Vietnam (Tokita et al., 1994
), Indonesia (Tokita et al., 1996
), Thailand (Chinchai et al., 2003
; Thaikruea et al., 2004
), Cambodia (Caruana et al., 2005
) and Myanmar (Shinji et al., 2004
). Currently, genotype 6 viruses are classified into 17 subtypes (6a–6q) based on complete genome analysis (Lu et al., 2006
, 2007a
, b
; Tokita et al., 1996
, 1998
). Recently, the existence of subtype 6r and 6s viruses was proposed based on sequence analysis of core/E1 and NS5B from HCV strains isolated from Cambodian immigrants in Canada (Murphy et al., 2007
). In addition, a subtype of some viruses could not be assigned and these were classified as novel candidate HCV genotype 6 (Lu et al., 2006
; Shinji et al., 2004
).
High nucleotide sequence heterogeneity hampers complete genome characterization of new subtypes due to inefficient primer annealing. We developed techniques to obtain large RT-PCR fragments amplified from HCV-positive sera using degenerate primer sets and obtained entire HCV genome sequences from 100 µl HCV-positive sera (Noppornpanth et al., 2006a
). This led to the complete genome characterization of a Vietnamese HCV recombinant genotype 2i/6p strain (Noppornpanth et al., 2006a
). In this report, we selected several sera from HCV-positive blood donors from Vietnam and Thailand based on a preliminary screen to include rare or unclassified HCV subtypes 6 (Noppornpanth et al., 2006a
, b
) and performed full-genome sequencing to characterize further HCV subtypes 6.
Total RNA was isolated from 100 µl serum by proteinase K/phenol/chloroform extraction as described previously (Rispeter et al., 1997
). The entire HCV genome was amplified from three different cDNA reverse transcription reactions from positions 5460, 8625 and 9325 (numbering system according to prototype strain H77; GenBank accession no. AF009606
[GenBank]
). Three overlapping large PCR fragments were first-round amplified by degenerate primers, followed by nested PCR as described previously (Noppornpanth et al., 2006a
). Amplified products were gel-purified with the QIAquick gel extraction kit (Qiagen) and ligated directly into plasmid pCR2.1 (TA cloning kit; Invitrogen). For each amplicon, five positive clones were selected and sequenced. Multiple sequence alignment was performed with the BioEdit program (version 7.0.1) according to amino acid alignment. Phylogenetic trees were constructed from tree puzzle 5.2 (Schmidt et al., 2002
) and MrBayes version 3.0b4 (Ronquist & Huelsenbeck, 2003
) programs using genetic distance calculated by maximum-likelihood with rate heterogeneity and Bayesian methods, respectively. Transition/translation ratio, proportion of invariant sites and gamma distribution factors were estimated based on the real sequence datasets. Nucleotide and amino acid substitutions were estimated by using HKY and WAG models, respectively. Bootstrapping was calculated from 10 000 tree-puzzle steps or 5000 trees generated from the MrBayes program. Pairwise nucleotide comparison was estimated based upon p distances with MEGA3 software (Kumar et al., 2004
). Phylogenetic analysis and sequence similarity test were performed using reference sequences of HCV subtypes 6a–6q, published in the Los Alamos HCV database (http://hcv.lanl.gov/content/hcv-index). Genetic recombination events were examined by similarity plot and bootscanning analysis, using the SimPlot program (version 3.5.1 for Windows98/NT/2000/XP; available from the author, Stuart C. Ray, http://sray.med.som.jhmi.edu/SCRoftware/).
We characterized 11 complete HCV genotype 6 genomes, including seven isolates from Vietnam (D9, D33, D42, D49, D83, D85 and D88) and four isolates from Thailand (TH22, TH24, TH31 and TH52). These genomes varied in length from 9373 to 9423 nt, encoding 3016 to 3023 aa (Supplementary Table S1 available in JGV Online).
The whole genome of TH52 shared the main branch with samples C-0044 and C-0046, representative of HCV subtype 6f (Fig. 1
). There is a high sequence similarity (94.7 %) compared to the prototype strain and results from the short fragment analysis confirmed that TH52 is a member of HCV subtype 6f (Table 1
and Supplementary Fig. S2 available in JGV Online). Interestingly, 52/53 samples of subtype 6f classified in the Los Alamos database were obtained from Thailand (Kuiken et al., 2005
). Full-length genomes of HCV subtype 6i (Th602 and C-0159), a subtype found only in Thailand, have been sequenced recently (Lu et al., 2007b
). TH24 complete genome and core, E1 and NS5B phylogenetic trees showed 99–100 % bootstrap support related to the C-0159 sequence (Fig. 1
). We identified TH22 and TH31 strains as HCV subtype 6n based on phylogenetic analysis of full-length nucleotide and amino acid sequences (Fig. 1
and Supplementary Fig. S1), while whole genome sequence similarity was 93.9 and 95.8 %, respectively, when compared with subtype 6n reference strain D86/93 isolated from Thailand (Lu et al., 2007b
). Both strains clustered with KM42 isolated from China (Lu et al., 2005
) and D86/93 with high bootstrap support. HCV subtype 6n was first isolated from Thailand (Apichartpiyakul et al., 1994
), and then later reported to be isolated from India, China and Myanmar (Shinji et al., 2004
).
|
|
Phylogenetic analysis of short regions (core, E1 and NS5B) indicated that D33 clustered to the subtype 6l prototype strain 537796 (Supplementary Fig. S2). Complete genome phylogeny and pairwise similarity (95.1 %) confirmed classification of D33 as subtype 6l (Fig. 1
and Table 1
). Remarkably, subtypes 6l and 6k were isolated from Vietnamese individuals (Lu et al., 2006
, 2007a
). D85 on the other hand had 92.5 % sequence similarity to subtype 6o prototype strain QC227 isolated from an Asian immigrant in Canada (Murphy et al., 2007
), confirmed by phylogeny of whole genome and short fragment analysis (Fig. 1
and Supplementary Fig. S2). Subtype 6o is a rare subtype in SEA and we detected only one subtype 6o virus out of 95 HCV-positive samples collected from Vietnam (data not shown).
Based on full-length nucleotide and amino acid sequences D42, D88 and D83 are closely related to subtype 6e strain GX004, recently reported from China (Li et al., 2006
) (Fig. 1
and Supplementary Fig. S1). Sequence similarities compared with GX004 were 87.0, 90.8 and 83.6 % for D42, D88 and D83, respectively (Table 1
). However, phylogenetic analysis of complete genome, core and NS5B sequences revealed that D83 branched separately from other subtypes (Fig. 1
and Supplementary Fig. S2). D83 was about 18 % different from D42 and D88 at nucleotide level and about 10 % different at amino acid level (data not shown).
Based on nucleotide and amino acid sequences of the whole genome, D49 branched closely to subtype 6q (Fig. 1
and Supplementary Fig. S1). However, short genome fragment analysis using core and NS5B revealed branching with subtypes 6q and 6d, respectively, while E1 analysis showed a separate branch linked to subtype 6p (Supplementary Fig. S2). Sequence similarities compared with subtypes 6a–6q prototype sequences were 72.6–79.2 % (Table 1
).
Although complete genome sequences of HCV subtypes 6a–6q were used as reference strains, some genomes such as GZ52557 and KM41 isolated from China (Lu et al., 2005
), and at least 23 strains (provided in the Los Alamos HCV database) based on short fragment sequences of core/E1 and NS5B (Supplementary Table S2), could not be classified. Because we could not subtype D49 and D83, we reanalysed the phylogenetic tree including all subtypes (6a–6s) and unassigned subtype samples reported thus far. Now the core/E1 region of D49 showed close relationship with subtype 6p and clustered with QC240, QC131 and QC145, while NS5B clustered with the same viruses but separated more clearly from other subtypes (Fig. 2a and b
). Unexpectedly, both core/E1 and NS5B from D83 grouped with QC191 and QC323 viruses and clearly clustered separately from subtype 6e (Fig. 2a and b
). Thus, viruses D49, QC240, QC131 and QC145 were assigned to the novel HCV subtype 6t, while D83, QC191 and QC323 are candidate subtype 6u viruses.
|
Although phylogenetic analysis based on the full-length genome is the most accurate method for HCV classification, most HCVs have been classified based on short sequence analysis of the core, E1 and NS5B regions (Robertson et al., 1998
; Sandres-Saune et al., 2003
; Simmonds et al., 2005
). Historically, HCV classification based on short sequence phylogenetic analysis using a simple nucleotide substitution model has led to incorrect branching between closely related subtypes. HCV genotypes 7, 8 and 9 from SEA have been identified using the unweighted pair-group method with arithmetic mean (UPGMA) for phylogenetic tree construction (Tokita et al., 1994
). However, combining the complex nucleotide substitution model with neighbour-joining results in grouping of these sequences to HCV genotype 6 subtypes (Mizokami et al., 1996
; Simmonds et al., 1994
). In our study, we performed the classification based on the HKY nucleotide substitution model and maximum-likelihood method to increase the accuracy of nucleotide phylogenetic analysis. Moreover, we generated protein phylogenetic trees based on amino acid sequences to confirm our results.
Even though a range of HCV subtype 6a–6q whole genomes has been characterized, a number of viruses cannot be classified (Murphy et al., 2007
). These unassigned samples may represent novel candidate HCV subtypes. Assignment of subtypes 6t and 6u in our study is in line with the HCV genotyping consensus proposal (Simmonds et al., 2005
). NS5B and core/E1 sequences of at least three examples are required for a provisional designation of HCV new subtype and at least 15 % difference at the nucleotide level from other HCV subtypes should be present (Simmonds et al., 2005
). We have defined two full-length genome sequences that represent new subtypes of HCV genotype 6. Subtype 6t comprises D49, QC240, QC131 and QC145, all isolated from Vietnamese individuals, and subtype 6u includes D83, which grouped to QC191 and QC323 and shows a separate branch from subtype 6e. Sample QC191 and QC323 originated from Asian immigrants in Canada but remained unassigned with respect to subtype classification (Murphy et al., 2007
). Grouping of D83 sequence obtained from our study with these strains suggests that subtype 6u circulates in SEA. High sequence similarity of HCV subtypes 6e and 6u indicates that both may share the same origin but have shown genetic drift as a result of long-term circulation.
Molecular evolution of HCV genotype 6 based on phylogenetic relationship separates all subtypes into three groups. Subtypes 6a and 6b are related to each other, subtypes 6c, 6d, 6e, 6f, 6g, 6p, 6q and 6o share the same phylogenetic branch and the last group contains subtypes 6h, 6i, 6j, 6k, 6l, 6m and 6n. Epidemiological studies of HCV genotype 6 have revealed that certain subtypes, such as 6d, 6h and 6l, are found in Vietnam while 6c, 6f, 6i and 6j are found in Thailand, whereas subtype 6m and 6n were isolated mainly from Thailand and Myanmar. Although there may be some sampling bias and information about subtype distribution is lacking from some countries such as Laos, Cambodia and Myanmar, these results suggest that spread of HCV occurs in closed populations. These data are consistent with the idea that HCV genome diversity is related to geographical distribution and routes of viral transmission in certain areas. Whereas genotypes 1, 2 and 3 have spread worldwide mainly through contaminated blood and intravenous drug users (Alter, 1999
), the genotype 4, 5 and 6 viruses are more restricted to African and Asian countries (Simmonds et al., 2005
) and vary genetically as a result of long endemicity in certain areas (Pybus et al., 2001
).
| ACKNOWLEDGEMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
Apichartpiyakul, C., Chittivudikarn, C., Miyajima, H., Homma, M. & Hotta, H. (1994). Analysis of hepatitis C virus isolates among healthy blood donors and drug addicts in Chiang Mai, Thailand. J Clin Microbiol 32, 2276–2279.
Caruana, S. R., Kelly, H. A., De Silva, S. L., Chea, L., Nuon, S., Saykao, P., Bak, N. & Biggs, B. A. (2005). Knowledge about hepatitis and previous exposure to hepatitis viruses in immigrants and refugees from the Mekong Region. Aust N Z J Public Health 29, 64–68.[CrossRef][Medline]
Chinchai, T., Labout, J., Noppornpanth, S., Theamboonlers, A., Haagmans, B. L., Osterhaus, A. D. & Poovorawan, Y. (2003). Comparative study of different methods to genotype hepatitis C virus type 6 variants. J Virol Methods 109, 195–201.[CrossRef][Medline]
Choo, Q. L., Richman, K. H., Han, J. H., Berger, K., Lee, C., Dong, C., Gallegos, C., Coit, D., Medina-Selby, R. & other authors (1991). Genetic organization and diversity of the hepatitis C virus. Proc Natl Acad Sci U S A 88, 2451–2455.
Kuiken, C., Yusim, K., Boykin, L. & Richardson, R. (2005). The Los Alamos hepatitis C sequence database. Bioinformatics 21, 379–384.
Kumar, S., Tamura, K. & Nei, M. (2004). MEGA3: integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment. Brief Bioinform 5, 150–163.
Li, C., Fu, Y., Lu, L., Ji, W., Yu, J., Hagedorn, C. H. & Zhang, L. (2006). Complete genomic sequences for hepatitis C virus subtypes 6e and 6g isolated from Chinese patients with injection drug use and HIV-1 co-infection. J Med Virol 78, 1061–1069.[CrossRef][Medline]
Lu, L., Nakano, T., He, Y., Fu, Y., Hagedorn, C. H. & Robertson, B. H. (2005). Hepatitis C virus genotype distribution in China: predominance of closely related subtype 1b isolates and existence of new genotype 6 variants. J Med Virol 75, 538–549.[CrossRef][Medline]
Lu, L., Nakano, T., Li, C., Fu, Y., Miller, S., Kuiken, C., Robertson, B. H. & Hagedorn, C. H. (2006). Hepatitis C virus complete genome sequences identified from China representing subtypes 6k and 6n and a novel, as yet unassigned subtype within genotype 6. J Gen Virol 87, 629–634.
Lu, L., Li, C., Fu, Y., Gao, F., Pybus, O. G., Abe, K., Okamoto, H., Hagedorn, C. H. & Murphy, D. (2007a). Complete genomes of hepatitis C virus (HCV) subtypes 6c, 6l, 6o, 6p and 6q: completion of a full panel of genomes for HCV genotype 6. J Gen Virol 88, 1519–1525.
Lu, L., Li, C., Fu, Y., Thaikruea, L., Thongswat, S., Maneekarn, N., Apichartpiyakul, C., Hotta, H., Okamoto, H. & other authors (2007b). Complete genomes for hepatitis C virus subtypes 6f, 6i, 6j and 6m: viral genetic diversity among Thai blood donors and infected spouses. J Gen Virol 88, 1505–1518.
Mizokami, M., Gojobori, T., Ohba, K., Ikeo, K., Ge, X. M., Ohno, T., Orito, E. & Lau, J. Y. (1996). Hepatitis C virus types 7, 8 and 9 should be classified as type 6 subtypes. J Hepatol 24, 622–624.[CrossRef][Medline]
Murphy, D. G., Willems, B., Deschenes, M., Hilzenrat, N., Mousseau, R. & Sabbah, S. (2007). Use of sequence analysis of the NS5B region for routine genotyping of hepatitis C virus with reference to C/E1 and 5' untranslated region sequences. J Clin Microbiol 45, 1102–1112.
Noppornpanth, S., Lien, T. X., Poovorawan, Y., Smits, S. L., Osterhaus, A. D. & Haagmans, B. L. (2006a). Identification of a naturally occurring recombinant genotype 2/6 hepatitis C virus. J Virol 80, 7569–7577.
Noppornpanth, S., Sablon, E., De Nys, K., Truong, X. L., Brouwer, J., Van Brussel, M., Smits, S. L., Poovorawan, Y., Osterhaus, A. D. & Haagmans, B. L. (2006b). Genotyping hepatitis C viruses from Southeast Asia by a novel line probe assay that simultaneously detects core and 5' untranslated regions. J Clin Microbiol 44, 3969–3974.
Pybus, O. G., Charleston, M. A., Gupta, S., Rambaut, A., Holmes, E. C. & Harvey, P. H. (2001). The epidemic behavior of the hepatitis C virus. Science 292, 2323–2325.
Rispeter, K., Lu, M., Lechner, S., Zibert, A. & Roggendorf, M. (1997). Cloning and characterization of a complete open reading frame of the hepatitis C virus genome in only two cDNA fragments. J Gen Virol 78, 2751–2759.[Abstract]
Robertson, B., Myers, G., Howard, C., Brettin, T., Bukh, J., Gaschen, B., Gojobori, T., Maertens, G., Mizokami, M. & other authors (1998). Classification, nomenclature, and database development for hepatitis C virus (HCV) and related viruses: proposals for standardization. International Committee on Virus Taxonomy. Arch Virol 143, 2493–2503.[CrossRef][Medline]
Ronquist, F. & Huelsenbeck, J. P. (2003). MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574.
Sandres-Saune, K., Deny, P., Pasquier, C., Thibaut, V., Duverlie, G. & Izopet, J. (2003). Determining hepatitis C genotype by analyzing the sequence of the NS5b region. J Virol Methods 109, 187–193.[CrossRef][Medline]
Schmidt, H. A., Strimmer, K., Vingron, M. & von Haeseler, A. (2002). Tree-puzzle: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18, 502–504.
Shepard, C. W., Finelli, L. & Alter, M. J. (2005). Global epidemiology of hepatitis C virus infection. Lancet Infect Dis 5, 558–567.[CrossRef][Medline]
Shinji, T., Kyaw, Y. Y., Gokan, K., Tanaka, Y., Ochi, K., Kusano, N., Mizushima, T., Fujioka, S., Shiraha, H. & other authors (2004). Analysis of HCV genotypes from blood donors shows three new HCV type 6 subgroups exist in Myanmar. Acta Med Okayama 58, 135–142.[Medline]
Simmonds, P. (2004). Genetic diversity and evolution of hepatitis C virus – 15 years on. J Gen Virol 85, 3173–3188.
Simmonds, P., Holmes, E. C., Cha, T. A., Chan, S. W., McOmish, F., Irvine, B., Beall, E., Yap, P. L., Kolberg, J. & Urdea, M. S. (1993). Classification of hepatitis C virus into six major genotypes and a series of subtypes by phylogenetic analysis of the NS-5 region. J Gen Virol 74, 2391–2399.
Simmonds, P., Smith, D. B., McOmish, F., Yap, P. L., Kolberg, J., Urdea, M. S. & Holmes, E. C. (1994). Identification of genotypes of hepatitis C virus by sequence comparisons in the core, E1 and NS-5 regions. J Gen Virol 75, 1053–1061.
Simmonds, P., Bukh, J., Combet, C., Deleage, G., Enomoto, N., Feinstone, S., Halfon, P., Inchauspe, G., Kuiken, C. & other authors (2005). Consensus proposals for a unified system of nomenclature of hepatitis C virus genotypes. Hepatology 42, 962–973.[CrossRef][Medline]
Thaikruea, L., Thongsawat, S., Maneekarn, N., Netski, D., Thomas, D. L. & Nelson, K. E. (2004). Risk factors for hepatitis C virus infection among blood donors in northern Thailand. Transfusion 44, 1433–1440.[CrossRef][Medline]
Tokita, H., Okamoto, H., Tsuda, F., Song, P., Nakata, S., Chosa, T., Iizuka, H., Mishiro, S., Miyakawa, Y. & Mayumi, M. (1994). Hepatitis C virus variants from Vietnam are classifiable into the seventh, eighth, and ninth major genetic groups. Proc Natl Acad Sci U S A 91, 11022–11026.
Tokita, H., Okamoto, H., Iizuka, H., Kishimoto, J., Tsuda, F., Lesmana, L. A., Miyakawa, Y. & Mayumi, M. (1996). Hepatitis C virus variants from Jakarta, Indonesia classifiable into novel genotypes in the second (2e and 2f), tenth (10a) and eleventh (11a) genetic groups. J Gen Virol 77, 293–301.
Tokita, H., Okamoto, H., Iizuka, H., Kishimoto, J., Tsuda, F., Miyakawa, Y. & Mayumi, M. (1998). The entire nucleotide sequences of three hepatitis C virus isolates in genetic groups 7–9 and comparison with those in the other eight genetic groups. J Gen Virol 79, 1847–1857.[Abstract]
Received 16 November 2007;
accepted 24 January 2008.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| INT J SYST EVOL MICROBIOL | MICROBIOLOGY | J GEN VIROL |
| J MED MICROBIOL | ALL SGM JOURNALS | |