|
|
||||||||
1 Department of Pathology, University of Texas Medical Branch at Galveston, 301 University Blvd, Galveston, TX 77555-0609, USA
2 University of California at Berkeley, Berkeley, CA 94720, USA
3 School of Natural Sciences, University of Texas, Austin, TX 78712, USA
4 Department of Microbiology and Immunology, University of Texas Medical Branch at Galveston, 301 University Blvd, Galveston, TX 77555-0609, USA
5 Sealy Center for Vaccine Development and Center for Biodefense and Emerging Infectious Diseases, University of Texas Medical Branch at Galveston, 301 University Blvd, Galveston, TX 77555-0609, USA
Correspondence
Alan D. T. Barrett
abarrett{at}utmb.edu
| ABSTRACT |
|---|
|
|
|---|
Supplementary tables are available in JGV Online.
| INTRODUCTION |
|---|
|
|
|---|
YFV is the prototype member of the family Flaviviridae, the genus Flavivirus, consisting of a single-stranded, positive-sense RNA genome, almost 11 kb in length. A type I 5'-terminal cap (m7GpppAmp) precedes the genome. The genome is flanked by 5' and 3' non-coding regions (NCR) and translated as a single open reading frame (ORF) that is co- and post-translationally cleaved into 10 proteins, including three structural proteins (C, prM/M and E) and seven non-structural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B and NS5) (Barrett & Monath, 2003
).
Understanding genetic relationships of this medically important viral pathogen is valuable for monitoring of future YFV outbreaks. Based on the examination of prM/M, E and 3'NCR of the YFV genome, previous studies have identified seven genotypes of YFV (Mutebi et al., 2001
; Wang et al., 1996
). Five African genotypes have been proposed, which include west African genotype I (strains from Nigeria and Ivory Coast), west African genotype II (strains from Senegal, plus additional Nigerian and Ivory Coast strains), east African genotype (strains from Kenya and Uganda), east/central African genotype [strains from Zaire (Democratic Republic of Congo), Uganda, Ethiopia and central African Republic] and Angolan genotype (Mutebi et al., 2001
). Two South American genotypes have also been proposed: South American genotype I (Brazil, Columbia, Panama and Trinidad) and South American genotype II (predominantly Peru, plus a few isolates from Brazil and Trinidad) (Wang et al., 1996
; Bryant & Barrett, 2003
; Barrett & Monath, 2003
).
In the present study, we sought to expand upon these genetic relationships using full genome analysis of east African (Uganda48a), east/central African (Ethiopia61b) and Angolan (Angola71) genotypes. Strain Angola71 was isolated from a human in early 1971, during a YF outbreak that occurred in this region during this time period (Pinto & Filipe, 1973
). Prior to this outbreak, there had been no reported outbreaks in this country since 1872. Factors that led to the emergence of a YF outbreak in a region that had previously been free of the disease for nearly a century are unknown; however, genomic analysis of this particular virus strain may reveal potential contributing molecular characteristics. In addition, YFV strain Ethiopia61b was isolated during a human epidemic that occurred in this region in 1961 and 1962 (Andral et al., 1968
; Serie et al., 1968
), while Uganda48a was isolated in Uganda in 1948 from an unknown source, likely from a sylvan outbreak in this region at the time (Smithburn et al., 1949
). Using these strains, we sought to expand upon the very limited full genome database for YFV, of which this report is the first to present genomic sequences of strains from east and central Africa.
| METHODS |
|---|
|
|
|---|
|
Phylogenetic and sequence analyses.
Sequence analysis was performed using Contig Express. Translations and alignments were performed using Vector NTI. Percentage similarities and differences were calculated using MegAlign (DNASTAR). NetNGlyc (Gupta et al., 2004; http://www.cbs.dtu.dk/services/NetNGlyc/) was utilized for glycosylation site prediction. PAUP (Swofford, 2003
) was utilized for the construction of phylogenetic trees by the neighbour-joining method (HKY85 parameter). In addition, GCUA version 1.1 (McInerney, 1998
) was used for codon usage analysis and percentage GC calculations. For genomic codon usage analysis, the first 10 000 nt of the ORF was utilized (GCUA 1.1 is limited to 10 000 nt).
| RESULTS |
|---|
|
|
|---|
Nucleotide sequence variation among diverse YFV strains
Uganda48a, Ethiopia61b and Angola71 were compared to previously published African YFV genomic sequences. In the neighbour-joining phylogenetic tree shown in Fig. 1
, Uganda48a and Ethiopia61b showed a high level of identity with each other but were both distinct from Angola71. Together, these newly sequenced strains formed a separate clade from the west African strains. In addition, phylogenetic relationships similar to those shown in Fig. 1
were observed when phylogenetic trees were constructed for each structural and non-structural protein gene independently, as well as the 3'NCR (data not shown).
|
|
Amino acid sequence variation between various YFV genotypes
Differences in amino acid composition ranged from 0·2 to 7·6 % for the various genotype-representative YFV strains compared with the prototype Asibi strain (Table 2
). Alignment of the ORF of the various strains is shown in Fig. 2
. Several amino acid substitutions characterized strains from west Africa, east Africa and Angola. Angola71, Uganda48a and Ethiopia61b shared many common amino acid substitutions and together comprised the east and central African lineage. The Angolan genotype possessed 52 additional genotype-specific amino acid substitutions (six in the C protein, one in prM/M, six in NS1, seven in NS2A, two in NS2B, seven in NS3, three in NS4A, two in NS4B and 18 in NS5), while the east African genotype-representative strain Uganda48a had 21 genotype-specific substitutions (four in the C protein, one in prM/M, one in E, four in NS1, two in NS2A, two in NS4B and seven in NS5) and Ethiopia61b had 23 (two in the C protein, three in prM/M, one in E, two in NS1, four in NS2A, four in NS3, one in NS4A and six in NS5). The west Africa I-representative strain, Ivory Coast, had 37 aa substitutions that differentiated it from west Africa II genotype strains (three in C, six in E, two in NS1, two in NS2A, one in NS2B, four in NS3, one in NS4A, four in NS4B and 14 in NS5; Supplementary Table S3 available in JGV Online).
|
|
|
|
|
In the E protein, all YFV strains examined had one site at position 309 and one site at position 469 which, like position 55 of M, may not be glycosylated due to its location in a hydrophobic region of the protein. N-linked glycosylation sites in NS1 were at Asn 130 and Asn 208, consistent with previous findings (Muylaert et al., 1996
). All strains also possessed two possible sites in NS4B (positions 63/64 and 89/90) and two possible sites in NS5 (positions 226/227 and 232/233).
| DISCUSSION |
|---|
|
|
|---|
In particular, a deletion of Thr (or His when compared to CAR77-900) identified in the N terminus of NS4B of strains from west Africa may be of potential significance (Fig. 3
). While little is known about the structure and function of YFV NS4B, for other members of the family Flaviviridae there is evidence that the protein may play a role in vector competence (Hanley et al., 2003
), may serve as an interferon antagonist (Munoz-Jordan et al., 2003
; Jones et al., 2005
) and may facilitate virus replication through association with intracellular membranes (Elazar et al., 2004
; Gao et al., 2004
). Furthermore, the deletion is located in an area of the protein that is highly variable for east and central African strains compared with the published west African YFV strains (Figs 2 and 3![]()
) and is predicted to be outside the membrane in hydrophobicity plots (data not shown). Comparison of this region of YFV NS4B with the same region of other flaviviruses (Fig. 3b
) revealed that Yokose virus, a close relative of YFV, also possesses an amino acid at the position of the additional amino acid in the east and central African strains, and has an amino acid signature that is more similar to these strains than west African strains of YFV. In addition, regions of the ORF that were highly variable between eastern and central African strains, and strains from western Africa, were also observed in C and NS5 proteins. The significance of this variability is unknown; however, C protein substitutions were located in the C-terminal hydrophobic sequence that is cleaved from the protein in the mature virion (Chambers et al., 1990
), and NS5 substitutions were located outside the polymerase motifs (Lai et al., 1999
), which were conserved for all strains examined.
Results of codon usage analysis revealed more similar codon usage when genomic sequences were examined than when individual genes were studied (Table 3
). The cause of codon usage bias in YFV may be a combination of factors, including mutational bias (due to genomic compositional constraints) and/or translational selection on areas of the YFV genome. GC3s were therefore plotted against Nc values and compared to the expected Nc if base composition is the only factor involved (solid curve) (Fig. 4
). Results suggested that while base compositional constraints may account for most of the codon usage bias in some genes (e.g. NS2B of Ethiopia61b that overlaps the curve), it is unlikely to be the single determining factor, as some points are located farther from the expected Nc curve. This is consistent with previous studies suggesting that while base composition may account for the majority of codon usage bias between different flavivirus groups, other factors may also be involved (Jenkins et al., 2001
; Jenkins & Holmes, 2003
). It is therefore possible that translational selection, involving increased translational efficiency due to preferential usage of a codon for which there is an abundant cognate tRNA, may also contribute to the observed codon usage bias. For example, vector and vertebrate host species vary according to geographical location for YFV. Since these different cell types may exhibit very different codon/translational preferences, this could contribute to the differences in codon usage bias for various genotypes. This possibility may be unlikely, however, since previous studies did not find a correlation between codon usage bias and arthropod association (tick vs mosquito vs no vector) (Jenkins et al., 2001
). In order to fully explore the potential importance of translational selection in the codon usage bias of YFV, experiments must be undertaken to determine the cellular levels of tRNA availability in response to viral gene expression.
Since glycosylation of YFV M, E and NS1 proteins may be important for the function of these proteins (Ruiz-Linares et al., 1989
; Ballinger-Crabtree & Miller, 1990
; Muylaert et al., 1996
), we also analysed sequences of the various YFV genomes for potential N-linked glycosylation sites. Positions of N-linked glycosylation motifs in E, prM/M and NS1 were conserved for all virus strains. In addition, conserved N-linked glycosylation motifs were also detected in the NS4B and NS5 proteins. Whether or not these additional sites are glycosylated in vivo remains to be determined. While topology predictions of NS4B indicate three major hydrophobic segments (which are therefore unlikely to be glycosylated), predicted N-linked glycosylation sites are located outside these regions. YFV NS5 serves as the viral RNA polymerase and is therefore unlikely to be glycosylated.
In conclusion, this study supports the existence of previously identified genotypes of YFV, on the basis of full-genome phylogenetic analysis. High degrees of genetic diversity are found within the YFV species, while potential glycosylation sites are generally conserved. Phenotypic evaluation of differences among these viruses remains to be assessed, in particular, the potential significance of highly variable regions in C and NS5, as well as the Thr/His deletion in NS4B of west African viruses. Identification of genetic relationships and potential corresponding phenotypic diversity of YFV is important for monitoring of future outbreaks.
| ACKNOWLEDGEMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
Ballinger-Crabtree, M. E. & Miller, B. R. (1990). Partial nucleotide sequence of South American yellow fever virus strain 1899/81: structural proteins and NS1. J Gen Virol 71, 21152121.
Barrett, A. D. & Monath, T. P. (2003). Epidemiology and ecology of yellow fever virus. Adv Virus Res 61, 291315.[CrossRef][Medline]
Bryant, J. E. & Barrett, A. D. (2003). Comparative phylogenies of yellow fever isolates from Peru and Brazil. FEMS Immunol Med Microbiol 39, 103118.[CrossRef][Medline]
Bryant, J., Wang, H., Cabezas, C., Ramirez, G., Watts, D., Russell, K. & Barrett, A. (2003). Enzootic transmission of yellow fever virus in Peru. Emerg Infect Dis 9, 926933.[Medline]
Chambers, T. J., Hahn, C. S., Galler, R. & Rice, C. M. (1990). Flavivirus genome organization, expression, and replication. Annu Rev Microbiol 44, 649688.[CrossRef][Medline]
Chang, G. J., Cropp, B. C., Kinney, R. M., Trent, D. W. & Gubler, D. J. (1995). Nucleotide sequence variation of the envelope protein gene identifies two distinct genotypes of yellow fever virus. J Virol 69, 57735780.[Abstract]
Colebunders, R., Mariage, J. L., Coche, J. C. & 9 other authors (2002). A Belgian traveler who acquired yellow fever in the Gambia. Clin Infect Dis 35, e113e116.[CrossRef][Medline]
Elazar, M., Liu, P., Rice, C. M. & Glenn, J. S. (2004). An N-terminal amphipathic helix in hepatitis C virus (HCV) NS4B mediates membrane association, correct localization of replication complex proteins, and HCV RNA replication. J Virol 78, 1139311400.
Gao, L., Aizaki, H., He, J. W. & Lai, M. M. (2004). Interactions between viral nonstructural proteins and host protein hVAP-33 mediate the formation of hepatitis C virus RNA replication complex on lipid raft. J Virol 78, 34803488.
Hahn, C. S., Dalrymple, J. M., Strauss, J. H. & Rice, C. M. (1987). Comparison of the virulent Asibi strain of yellow fever virus with the 17D vaccine strain derived from it. Proc Natl Acad Sci U S A 84, 20192023.
Hanley, K. A., Manlucu, L. R., Gilmore, L. E., Blaney, J. E., Jr, Hanson, C. T., Murphy, B. R. & Whitehead, S. S. (2003). A trade-off in replication in mosquito versus mammalian systems conferred by a point mutation in the NS4B protein of dengue virus type 4. Virology 312, 222232.[CrossRef][Medline]
Jenkins, G. M. & Holmes, E. C. (2003). The extent of codon usage bias in human RNA viruses and its evolutionary origin. Virus Res 92, 17.[CrossRef][Medline]
Jenkins, G. M., Pagel, M., Gould, E. A., de A Zanotto, P. M. & Holmes, E. C. (2001). Evolution of base composition and codon usage bias in the genus Flavivirus. J Mol Evol 52, 383390.[Medline]
Jones, M., Davidson, A., Hibbert, L., Gruenwald, P., Schlaak, J., Ball, S., Foster, G. R. & Jacobs, M. (2005). Dengue virus inhibits alpha interferon signaling by reducing STAT2 expression. J Virol 79, 54145420.
Lai, V. C., Kao, C. C., Ferrari, E., Park, J., Uss, A. S., Wright-Minogue, J., Hong, Z. & Lau, J. Y. (1999). Mutational analysis of bovine viral diarrhea virus RNA-dependent RNA polymerase. J Virol 73, 1012910136.
Lepiniec, L., Dalgarno, L., Huong, V. T., Monath, T. P., Digoutte, J. P. & Deubel, V. (1994). Geographic distribution and evolution of yellow fever viruses based on direct sequencing of genomic cDNA fragments. J Gen Virol 75, 417423.
McInerney, J. O. (1998). GCUA: general codon usage analysis. Bioinformatics 14, 372373.
Monath, T. P. (1999). Facing up to re-emergence of urban yellow fever. Lancet 353, 1541.[Medline]
Munoz-Jordan, J. L., Sanchez-Burgos, G. G., Laurent-Rolle, M. & Garcia-Sastre, A. (2003). Inhibition of interferon signaling by dengue virus. Proc Natl Acad Sci U S A 100, 1433314338.
Mutebi, J. P., Wang, H., Li, L., Bryant, J. E. & Barrett, A. D. (2001). Phylogenetic and evolutionary relationships among yellow fever virus isolates in Africa. J Virol 75, 69997008.
Mutebi, J. P., Rijnbrand, R. C. A., Wang, H., Ryman, K. D., Wang, E., Fulop, L. D., Titball, R. & Barrett, A. D. T. (2004). Genetic relationships and evolution of genotypes of yellow fever virus and other members of the yellow fever virus group within the Flavivirus genus based on the 3' noncoding region. J Virol 78, 96529665.
Muylaert, I. R., Chambers, T. J., Galler, R. & Rice, C. M. (1996). Mutagenesis of the N-linked glycosylation sites of the yellow fever virus NS1 protein: effects on virus replication and mouse neurovirulence. Virology 222, 159168.[CrossRef][Medline]
Pinto, M. R. & Filipe, A. R. (1973). Arbovirus studies in Luanda, Angola. 1. Virological and serological studies during a yellow fever epidemic. Bull W H O 49, 3135.[Medline]
Pisano, M. R., Nicoli, J. & Tolou, H. (1997). Homogeneity of yellow fever virus strains isolated during an epidemic and a post-epidemic period in West Africa. Virus Genes 14, 225234.[CrossRef][Medline]
Robertson, S. E., Hull, B. P., Tomori, O., Bele, O., LeDuc, J. W. & Esteves, K. (1996). Yellow fever: a decade of reemergence. JAMA 276, 11571162.[Abstract]
Ruiz-Linares, A., Cahour, A., Despres, P., Girard, M. & Bouloy, M. (1989). Processing of yellow fever virus polyprotein: role of cellular proteases in maturation of the structural proteins. J Virol 63, 41994209.
Serie, C., Casals, J., Panthier, R., Bres, P. & Williams, M. C. (1968). Studies on yellow fever in Ethiopia. 2. Serological study of the human population. Bull W H O 38, 843854.[Medline]
Smithburn, K. C., Haddow, A. J. & Lumsden, W. H. R. (1949). An outbreak of sylvan yellow fever in Uganda with Aedes (Stegomyia) africanus Theobald as principal vector and insect host of the virus. Am J Trop Med 43, 7489.
Swofford, D. L. (2003). PAUP*: Phylogenetic analysis using parsimony (* and other methods), version 4.0b10. Sunderland, MA: Sinauer Associates.
Tomori, O. (1999). Impact of yellow fever on the developing world. Adv Virus Res 53, 534.[Medline]
Wang, E., Ryman, K. D., Jennings, A. D., Wood, D. J., Taffs, F., Minor, P. D., Sanders, P. G. & Barrett, A. D. (1995). Comparison of the genomes of the wild-type French viscerotropic strain of yellow fever virus with its vaccine derivative French neurotropic vaccine. J Gen Virol 76, 27492755.
Wang, E., Weaver, S. C., Shope, R. E., Tesh, R. B., Watts, D. M. & Barrett, A. D. (1996). Genetic variation in yellow fever virus: duplication in the 3' noncoding region of strains from Africa. Virology 225, 274281.[CrossRef][Medline]
Wright, F. (1990). The effective number of codons' used in a gene. Gene 87, 2329.[CrossRef][Medline]
Received 3 June 2005;
accepted 1 December 2005.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| INT J SYST EVOL MICROBIOL | MICROBIOLOGY | J GEN VIROL |
| J MED MICROBIOL | ALL SGM JOURNALS | |