|
|
||||||||
1 Clear Springs Foods Inc., PO Box 712, Buhl, ID 83316, USA
2 Department of Biological Sciences, Campus Box 8007, Idaho State University, Pocatello, ID 83209-8007, USA
Correspondence
Vern Winston
winsvern{at}isu.edu
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Because of physical and functional constraints, very few mutations result in an increase in virus fitness (Domingo, 2006
). Mutations that result in decreased fitness of the virus are removed from the gene pool by negative selection. Some (if not most) changes are neutral. They have no negative effect on fitness, but also do not provide a selective advantage (Domingo, 2006
). In the case of virus surface proteins, those that demonstrate enhanced fitness might behave in a number of ways. In one instance, the change might result in an improved interaction with the host, by more efficient host binding, entry or uncoating of the virus. Alternatively, the changes might disrupt the interaction of the virus with the proteins of the host immune system. Specific antibodies or receptors on the cells of the immune system might not recognize the altered epitopes as effectively, resulting in an enhanced ability of the virus to escape the defence systems of the host. These sites are under positive selection and are identifiable because the number of non-synonymous amino acid changes at these sites exceeds the number of synonymous changes (Domingo, 2006
; Yang et al., 2000
). Until the advent of high-speed computers, identification of sites of positive selection was not reliable. However, with the widespread availability of high-speed computers, it has been possible to develop methods to identify individual sites under positive selection. Two of the most commonly used programs to identify sites of positive selection are PAML (Yang, 2007
) and MrBayes (Huelsenbeck & Dyer, 2004
). In our study, recent (1990–2006) isolates of IHNV were obtained from commercial fish farms in the state of Idaho, USA. A 303 nt segment of the major surface (G) protein gene was amplified and sequenced. The sequences were evaluated by using a fully Bayesian method (MrBayes; Huelsenbeck & Dyer, 2004
) to identify codons where the rate of non-synonymous mutation exceeded that of synonymous mutation in a manner consistent with positive selection.
To test the relevance of these predictions, attempts were made to correlate the location of positively selected sites on the amino acid sequence with the three-dimensional (3D) structure of the virus protein. As the structure of the IHNV G surface protein has not been determined, we used a software algorithm (CHPmodels; Lund et al., 2002
) to construct a 3D representation of the IHNV G protein. This approach utilized the structural and sequence homologies between the IHNV G sequence and the vesicular stomatitis virus (VSV) surface protein sequence (Roche et al., 2007
) to construct a model of the IHNV molecule. The locations of positively selected sites were mapped onto the predicted 3D model of the IHNV protein. The predicted sites on the IHNV sequence mapped onto the surface of the protein that would be expected to be in contact with antibodies and/or cellular receptors, and at sites reported by others (Huang et al., 1996
; Troyer et al., 2000
) to be sites of mutation in the IHNV molecule. These regions also corresponded to the major antigenic sites of the VSV surface protein (Vandepol et al., 1986
).
| METHODS |
|---|
|
|
|---|
Isolation of RNA, RT-PCR and sequencing.
RNA was isolated from cell lysates by using a QIAamp viral RNA mini kit (Qiagen) following the manufacturer's instructions. DNA was synthesized from viral RNA by using an RT-PCR kit (Qiagen One-Step RT-PCR) as directed by the manufacturer, using outer primers described by Emmenegger et al. (2000)
. Before sequencing, PCR primers were removed by using ExoSAP shrimp alkaline phosphatase (USB). PCR products were sequenced by using an ABI 3100 Genetic Analyzer (Idaho State University Molecular Research Core Facility), BigDye chemistry (ABI) and the inner primers described by Emmenegger et al. (2000)
as sequencing primers. Each PCR product was sequenced in both directions and the output was analysed by using the Staden Package (Staden et al., 2000
) for evaluation of base calls and production of a contiguous alignment of the complementary sequence.
Sequence analysis.
Sequences were aligned by codon and trimmed to the 303 nt mid-G sequence reported by others (Emmenegger et al., 2000
; Troyer et al., 2000
; Kurath et al., 2003
; Troyer & Kurath, 2003
). Reference sequences included GenBank accession numbers AF237983–AF237992, which represent earlier Idaho isolates (Troyer et al., 2000
), L40878, a representative of M clade isolates (Nichol et al., 1995
), and L40881 (SRCV), used by others (Troyer & Kurath, 2003
) as an outgroup sequence. Bayesian analysis was performed by using MrBayes 3.1.2 (Huelsenbeck & Dyer, 2004
) in parallel mode. Specific commands for MrBayes were: lset nucmodel=codon nst=2 omegavar=m3, report possel=yes. Each chain was run for 1.2x106 cycles. The sump and sumt commands were used to tabulate posterior probabilities of positive selection of each amino acid site, and to build consensus trees. Results of estimations obtained before the process reached convergence were discarded. Typically, the first 200 000 cycles were discarded.
Mapping of IHNV sites on VSV 3D structure.
The 3D structure of the IHNV G protein was predicted by using the CPHmodels 2.0 homology modelling server (http://www.cbs.dtu.dk/services/CPHmodels/) (Lund et al., 2002
). The amino acid sequence of the IHNV G protein (GenBank accession no. AAC42146
[GenBank]
; SRCV strain) was used as input to the web interface. The software identified the B chain of the VSV surface protein (2CMZ.pdb) as the highest-scoring template candidate, and constructed a 3D representation of the IHNV protein based on this template. The locations of positively selected sites were visualized by using VMD software (Visual Molecular Dynamics). A CLUSTAL W alignment (Thompson et al., 1994
) was used to confirm the correspondence between IHNV and VSV sequences (data not shown).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
|
|
|
Structural comparisons
If positive selection is arising as a result of the interaction between virus and host, either as a function of antigen–antibody interaction or as a result of enhanced binding to cells, the sites involved must be located on the outside surface of the virus so that these interactions can occur. To visualize the possible location of positively selected sites on the virus, the 3D modelling software CPHmodels (Lund et al., 2002
) was used. This software identified the B chain of the surface protein of VSV (2CMZ.pdb) as the highest-scoring template candidate. A 3D model of the IHNV protein was constructed by using the VSV protein as a template. The resulting structures are shown in Fig. 3
(a–c), with the corresponding prefusion form of the VSV structure shown in Fig. 3(d)
[note that, in Fig. 3(a–c)
, only the mid-G portion of the IHNV molecule is represented]. In this figure, the molecules are oriented so that the view is from the top, facing directly toward the membrane. Fig. 3(a–c)
are labelled to represent sites identified as being under positive selection at the >95 % (red) and >70 % (green) confidence levels by MrBayes. Fig. 3(a)
represents sites identified when all members of the dataset were included. Fig. 3(b, c)
represent sites identified when only the members of clade A–B or clade C–D, respectively, were used in the analysis. Fig. 3(d)
is labelled to reflect the locations of amino acids that change in monoclonal antibody escape mutants of VSV (Vandepol et al., 1986
). Fig. 3(e, f)
represent the result of mapping sites of IHNV G amino acid changes reported by others (Huang et al., 1996
; Troyer et al., 2000
) on the prefusion form of the molecule.
All of the IHNV G sites identified by the Bayesian approach as undergoing positive selection were on or near the top surface of the molecule. This is consistent with the hypothesis that these sites may be involved in interaction with host antibodies. The cluster of IHNV G sites from aa 270 to 277 is in a prominent
-helix (helix E), which should be readily accessible to antibodies. This region corresponds to the VSV A2 region, which is one of the two major epitopic regions of that virus (Fig. 3d
). Helix E is also in a region of the molecule that does not change shape as the molecule converts from its prefusion state in the extracellular virion to the pH-activated state in the lysosome (Roche et al., 2007
). This would allow for more amino acid substitutions in this region. Amino acid changes in a hinge region, for example, would be more damaging to the function of the protein. Notable is aa 274, which is conserved in all of these sequences. This could indicate that this site is critical for the stabilization of this helix or for the binding of the virus to the host cell. The pattern of sites that were identified as undergoing positive selection by the Bayesian approach was in general agreement with reports of others (Huang et al., 1996
; Troyer et al., 2000
) (Fig. 3e, f
). However, Huang et al. (1996)
observed variation at aa 230 and 231 but, in our study, these sites were absolutely conserved. The difference between our study and theirs could be explained by the fact that, in their study, the virus had been passaged repeatedly in culture to produce antibody-escape mutants. It is known that repeated passage in culture can allow amino acid changes that are not observed in vivo (Novella et al., 2005
). Troyer et al. (2000)
reported no changes at these sites, which may reflect the fact that the virus used in that study had not been passaged extensively. Finally, the A1 epitopic region of the VSV structure appeared to share the same general region of the molecule as aa 78 and 81 of IHNV G, identified by Huang et al. (1996)
as an epitope of IHNV. This would suggest that it might be informative if future studies also sequenced this region of this molecule.
Further work is needed to explore the suggestions provided by these results. A determination of the 3D structure of the IHNV G protein is needed to confirm the location of the positively selected sites on the surface of the protein identified by the Bayesian approach. If it is possible to identify the regions of the G protein that are undergoing rapid selection, it might be possible to design vaccines whose sequences mirror the specific patterns of change being observed. Conversely, the fact that areas of the protein are conserved may imply that change in these areas is impossible if the virus is to remain viable. Vaccines directed toward these vital regions might be more effective, because the virus is prevented by structural constraints from mutating in these areas.
| ACKNOWLEDGEMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
Emmenegger, E. J., Meyers, T. R., Burton, T. O. & Kurath, G. (2000). Genetic diversity and epidemiology of infectious hematopoietic necrosis virus in Alaska. Dis Aquat Organ 40, 163–176.[Medline]
Huang, C., Chien, M. S., Landolt, M., Batts, W. & Winton, J. (1996). Mapping the neutralizing epitopes on the glycoprotein of infectious haematopoietic necrosis virus, a fish rhabdovirus. J Gen Virol 77, 3033–3040.
Huelsenbeck, J. P. & Dyer, K. A. (2004). Bayesian estimation of positively selected sites. J Mol Evol 58, 661–672.[CrossRef][Medline]
Huelsenbeck, J. P., Jain, S., Frost, S. W. D. & Kosakovsky-Pond, S. L. (2006). A Dirichlet process model for detecting positive selection in protein-coding DNA sequences. Proc Natl Acad Sci U S A 103, 6263–6268.
Kurath, G., Garver, K. A., Troyer, R. M., Emmenegger, E. J., Einer-Jensen, K. & Anderson, E. D. (2003). Phylogeography of infectious haematopoietic necrosis virus in North America. J Gen Virol 84, 803–814.
Lund, O., Nielsen, M., Lundegaard, C. & Worning, P. (2002). CPHmodels 2.0: X3M a computer program to extract 3D models. In Abstracts of the CASP5 conference (the Fifth Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction), Asilomar, CA, USA, 2002, A102. http://www.cbs.dtu.dk/services/CPHmodels/abstract.php
Nichol, S. T., Rowe, J. E. & Winton, J. R. (1995). Molecular epizootiology and evolution of the glycoprotein and non-virion protein genes of infectious hematopoietic necrosis virus, a fish rhabdovirus. Virus Res 38, 159–173.[CrossRef][Medline]
Novella, I. S., Gilbertson, D. L., Borrego, B., Domingo, E. & Holland, J. J. (2005). Adaptability costs in immune escape variants of vesicular stomatitis virus. Virus Res 107, 27–34.[CrossRef][Medline]
Oshima, K. H., Arakawa, C. K., Higman, K. H., Landolt, M. L., Nichol, S. T. & Winton, J. R. (1995). The genetic diversity and epizootiology of infectious hematopoietic necrosis virus. Virus Res 35, 123–141.[CrossRef][Medline]
Roche, S., Rey, F., Gaudin, Y. & Bressanelli, S. (2007). Structure of the prefusion form of the vesicular stomatitis virus glycoprotein G. Science 315, 843–848.
Staden, R., Beal, K. F. & Bonfield, J. K. (2000). The Staden package, 1998. Methods Mol Biol 132, 115–130.[Medline]
Thompson, J. D., Higgins, D. G. & Gibson, T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22, 4673–4680.
Tordo, N., Benmansour, A., Calisher, C., Dietzgen, R. G., Fang, R.-X., Jackson, A. O., Kurath, G., Nadin-Davis, S., Tesh, R. B. & Walker, P. J. (2005). Family Rhabdoviridae. In Virus Taxonomy: Eighth Report of the International Committee on Taxonomy of Viruses, pp. 623–644. Edited by C. Fauquet, M. A. Mayo, J. Maniloff, U. Desselberger & L. A. Ball. San Diego, CA: Academic Press.
Troyer, R. M. & Kurath, G. (2003). Molecular epidemiology of infectious hematopoietic necrosis virus reveals complex virus traffic and evolution within Southern Idaho aquaculture. Dis Aquat Organ 55, 175–185.[Medline]
Troyer, R. M., Lapatra, S. E. & Kurath, G. (2000). Genetic analyses reveal unusually high diversity of infectious haematopoietic necrosis virus in rainbow trout aquaculture. J Gen Virol 81, 2823–2832.
Vandepol, S. B., Lefrancois, L. & Holland, J. J. (1986). Sequences of the major antibody binding epitopes of the Indiana serotype of vesicular stomatitis virus. Virology 148, 312–325.[CrossRef][Medline]
Yang, Z. (2007). PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24, 1586–1591.
Yang, Z., Nielsen, R., Goldman, N. & Pedersen, A.-M. (2000). Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 155, 431–449.
Received 17 September 2007;
accepted 1 November 2007.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| INT J SYST EVOL MICROBIOL | MICROBIOLOGY | J GEN VIROL |
| J MED MICROBIOL | ALL SGM JOURNALS | |