J Gen Virol Try IJSEM Online
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Gen Virol 88 (2007), 1315-1318; DOI 10.1099/vir.0.82681-0

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Nibert, M. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nibert, M. L.
Agricola
Right arrow Articles by Nibert, M. L.

Short Communication

‘2A-like’ and ‘shifty heptamer’ motifs in penaeid shrimp infectious myonecrosis virus, a monosegmented double-stranded RNA virus

Max L. Nibert

Department of Microbiology and Molecular Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA

Correspondence
Max L. Nibert
mnibert{at}hms.harvard.edu


   ABSTRACT
TOP
ABSTRACT
MAIN TEXT
REFERENCES
 
Penaeid shrimp infectious myonecrosis virus (IMNV) is a monosegmented double-stranded RNA virus that forms icosahedral virions and is tentatively assigned to the family Totiviridae. New examinations of the IMNV genome sequence revealed features not noted in the original report. These features include (i) two encoded ‘2A-like’ motifs, which are likely involved in open reading frame (ORF) 1 polyprotein ‘cleavage’; (ii) a 199 nt overlap between the end of ORF1 in frame 1 and the start of ORF2 in frame 3; and (iii) a ‘shifty heptamer’ motif and predicted RNA pseudoknot in the region of ORF1–ORF2 overlap, which probably allow ORF2 to be translated as a fusion with ORF1 by –1 ribosomal frameshifting. Features (ii) and (iii) bring the predicted ORF2 coding strategy of IMNV more in line with that of its closest phylogenetic relative, Giardia lamblia virus, as well as with that of several other members of the family Totiviridae including Saccharomyces cerevisiae virus L-A.


   MAIN TEXT
TOP
ABSTRACT
MAIN TEXT
REFERENCES
 
Penaeid shrimp infectious myonecrosis virus (IMNV) was isolated at the University of Arizona Aquaculture Pathology Laboratory from farmed Penaeus vannamei shrimp received from growers in north-eastern Brazil (Lightner et al., 2004Down; Poulos et al., 2006Down). As the virus name implies, the disease in these shrimp was characterized by necrosis of skeletal muscle, especially in the distal abdominal segments and tail fan. This disease was reproduced in the laboratory by injection of purified IMNV virions into pathogen-free P. vannamei (Poulos et al., 2006Down).

Many basic features of IMNV were reported along with its genome sequence by Poulos et al. (2006) (GenBank Downaccession no. AY570982 [GenBank] ). IMNV virions are icosahedral and approximately 40 nm in diameter. They contain one major capsid protein (MCP), which has a relative mass of 106 000 and an unblocked N terminus of sequence IVSMENQSEID. The genome is a single molecule of double-stranded RNA (dsRNA), 7560 bp long. It contains two extended open reading frames (ORF) in different frames of the genomic plus strand: ORF1 in frame 1 (nt 136–4953) and ORF2 in frame 3 (nt 5241–7451). ORF1 (1605 aa) encodes a 179 kDa protein that includes the N-terminal sequence of the MCP, starting at Ile705. The protein spanning aa 705–1605 of the ORF1 product has a sequence-predicted mass of 99 kDa, consistent with the relative mass of the MCP, which thus appears to be cleaved from a larger ORF1 precursor. A 60 aa region at the N terminus of the full-length ORF1 product shares sequence similarities with dsRNA-binding proteins. ORF2 (736 aa) encodes an 85 kDa protein that contains a series of motifs characteristic of an RNA-dependent RNA polymerase (RdRp). Proteins representing ORF2 and the first 704 aa (80 kDa) of ORF1 have yet to be identified, although minor protein bands reported to have relative masses of 149 000, 42 000 and 24 000 were seen in overloaded gel lanes of IMNV virions and may reflect additional virion components. Phylogenetic analysis based on the RdRp region links IMNV to members of the family Totiviridae of monosegmented dsRNA viruses with icosahedral virions and most closely to Giardia lamblia virus (GLV, GenBank accession no. L13218 [GenBank] ).

To date, IMNV is the only member of the Totiviridae (assignment still tentative) to infect a host other than a fungus or protozoan and therefore other distinctive features might be expected. Given the close phylogenetic relationship between the RdRp regions of IMNV and GLV, however, the apparent lack of ORF2 translation as a fusion with ORF1 in IMNV seems somewhat curious. I therefore examined the published genome sequence of IMNV for further clues about its coding strategies.

Examination of aa 1–704 in IMNV ORF1 revealed two conserved nonapeptides, GDVESNPGP and GDVEENPGP, at aa 86–94 and 370–378, respectively. A BLASTP search and review of the literature showed that these sequences are identical to the 2A or ‘2A-like’ motifs in the polyproteins of Foot-and-mouth disease virus (GDVESNPGP), Cricket paralysis virus (GDVESNPGP), Thosea asigna virus (GDVEENPGP) and others (Donnelly et al., 2001Down; Palmenberg et al., 1992Down). The preceding list includes plus-strand RNA viruses from the families Picornaviridae, Dicistroviridae and Tetraviridae. In these other viral proteins, a cotranslational event, either autocleavage between Gly and Pro or failure of the Gly–Pro peptide bond to be formed, results in separation of the first polypeptide ending in Gly from the second polypeptide starting with Pro. The presence of two such motifs in the N-terminal third of IMNV ORF1 preceding the MCP suggests that the ORF1 product is a polyprotein that is cotranslationally ‘cleaved’ into consecutive 93, 284 and 1228 aa fragments. The 1228 aa fragment would be further cleaved either co- or post-translationally, and by a different mechanism, to yield a 327 aa fragment and the 901 aa MCP that begins with Ile705 as described above.

The functions of the 93 aa (10 kDa), 284 aa (32 kDa) and 327 aa (38 kDa) fragments of IMNV ORF1 are unknown, although the 93 aa fragment encompasses the N-terminal region previously noted to share sequence similarities with dsRNA-binding proteins (Poulos et al., 2006Down). Interestingly, 2A-like sequences are also found in the NS34 protein of group C rotaviruses (family Reoviridae, genus Rotavirus, of 11-segmented dsRNA viruses) and allow a C-terminal 69 aa fragment of NS34, also with similarities to dsRNA-binding proteins, to be released from the protein (Donnelly et al., 2001Down). Thus, at least two distinct dsRNA viruses, IMNV and group C rotaviruses, appear to use 2A-like sequences to release small N- or C-terminal dsRNA-binding fragments, which might be involved in either virus replication or modulation of the host’s innate immune response.

Poulos et al. (2006)Down reported that IMNV ORF1 and ORF2, in frames 1 and 3, respectively, are non-overlapping. Though true that the first standard start codon (AUG, nt 5241–5243) in ORF2 begins 288 nt downstream of the ORF1 stop codon (UAA, nt 4951–4953), there are no in-frame stop codons upstream of this ORF2 start codon until the UGA at nt 4749–4751 (Fig. 1Down). Thus, ORF2, when defined as the region devoid of stop codons, begins 199 nt upstream of the end of ORF1, meaning that ORF1 and ORF2 are overlapping. This creates the possibility that ORF2 could be translated as a fusion with ORF1 if ribosomal –1 frameshifting was to occur within the region of overlap.


Figure 1
View larger version (105K):
[in this window]
[in a new window]

 
Fig. 1. IMNV sequences encompassing the region of ORF1–ORF2 overlap. Nucleotide sequences (lower case) representing the end of ORF1 and the start of ORF2, as identified by Poulos et al. (2006)Down, are underlined at top and doubly underlined at bottom, respectively. Nucleotide positions are indicated at left. The deduced amino acid sequence of the end of ORF1 [in frame 1 (marked f1 at left)] is indicated in single-letter code beneath the encoding nucleotide sequences; ORF1 terminates with the UAA codon (*) at nt 4951–4953. The deduced amino acid sequence of the newly identified start of ORF2 [in frame 3 (marked f3 at left)] is indicated in single-letter code beneath the encoding nucleotide sequences; ORF2 begins after the UGA codon (*) at nt 4749–4751. The shifty heptamer sequence GGGUUUU at nt 4803–4809 in the region of ORF1-ORF2 overlap is thickly overlined. The regions of base pairing in the predicted RNA pseudoknot structure are thinly overlined and labelled to indicate the pairing partners (1a–1b, 2a–2b and 3a–3b). The predicted amino acid sequence of the ORF1–ORF2 fusion over the illustrated region is indicated by shading; the sequence across the proposed –1 frameshift is predicted to be SGFYEC as shown.

 
Ribosomal –1 frameshifting is often associated with a ‘shifty (or slippery) heptamer’ motif, XXXYYYZ, where X is A, C, G or U, Y is A or U, and Z is A, C or U (Bekaert et al., 2003Down; Jacks et al., 1988Down). An RNA pseudoknot shortly downstream of the heptamer, helping to pause the translating ribosome, commonly increases the frequency of frameshifting (Brierley et al., 1989Down; Giedroc et al., 2000Down; Plant et al., 2003Down). For IMNV ORF2 to be translated as a fusion with ORF1, –1 frameshifting would need to occur within the region of ORF1–ORF2 overlap, nt 4752–4950. In fact, the sequence GGGUUUU is found at nt 4803–4809 (Fig. 1Up), which qualifies as a shifty heptamer. Immediately preceding this sequence in IMNV is the dinucleotide UC (Fig. 1Up), which may favour –1 frameshifting (Bekaert & Rousset, 2005Down). Moreover, using the web-based program HPknotter (Huang et al., 2005Down) with the pknotsRG (version 1.2) kernel (Reeder & Giegerich, 2004Down), a class-1 H-type pseudoknot with a minimal free energy of –10.1 kcal mol–1 (42.3 kJ mol–1) was predicted for nt 4818–4862, shortly after the shifty heptamer (Fig. 1Up). Assuming that the ORF1–ORF2 fusion is cleaved after Asn704 to generate the N terminus of the MCP region at Ile705, as for the non-fused MCP (Poulos et al., 2006Down), the size of the fused MCP–RdRp is predicted to be 1734 aa or 196 kDa.

Phylogenetic comparisons involving the RdRp region sequences of IMNV have identified GLV as its closest relative (Crawford et al., 2006Down; Poulos et al., 2006Down). GLV is a member of the family Totiviridae and prototype of the genus Giardiavirus. Its 6277 nt genome includes two ORFs, which are in different frames and overlap by 220 nt (Wang et al., 1993Down). A ribosomal –1 frameshift in the region of overlap allows ORF2 to be translated as a fusion with ORF1 (Li et al., 2001Down; Wang et al., 1993Down). The new findings for IMNV thus bring its ORF2 coding strategy more in line with that of GLV. Notably, some other members of the Totiviridae also use –1 frameshifting to translate ORF2 as a fusion with ORF1, including the well-studied Saccharomyces cerevisiae virus L-A (Dinman et al., 1991Down).

The newly identified coding features of IMNV are summarized in Fig. 2Down and provide clear predictions for future work. The MCP–RdRp fusion would be expected to assemble in only a few copies per virion, with the MCP region occupying a position in the putatively ‘T=2’ capsid and the RdRp region projecting into the virion interior, where it could function in viral transcription as described for S. cerevisiae virus L-A (Ribas & Wickner, 1998Down; Naitow et al., 2002Down). Some of the predicted N-terminal fragments of the IMNV ORF1 polyprotein may also be assembled into virions. Although the RdRp region sequences of IMNV are most closely related to those of GLV, IMNV probably deserves to typify a new genus because of its very different host (crustacean arthropod vs flagellated protozoan) and the distinctive features of its ORF1 polyprotein.


Figure 2
View larger version (16K):
[in this window]
[in a new window]

 
Fig. 2. Diagram of IMNV coding strategies. Upper and lower diagrams are shown on the same scale. Nucleotide positions are indicated across the top. The first AUG codon in ORF2 is shown at right and is the start site suggested by Poulos et al. (2006)Down. Predicted protein products, after polyprotein processing and –1 frameshifting, are numbered 1–5 and described at lower right. The predicted MCP–RdRp fusion is shaded. dsRBM, dsRNA-binding motifs; MCP, major capsid protein; RdRp, RNA-dependent RNA polymerase motifs. See text for further explanations.

 
A final finding that arose during these analyses is that 2A-like sequences are also found in the genome sequences of certain members of the family Reoviridae, genus Cypovirus, of ten-segmented dsRNA viruses that infect insects and assemble virions occluded in cytoplasmic polyhedra. Specifically, 2A-like sequences are found in the segment 5 translation products (879–881 aa) of cypovirus 1 isolates from Bombyx mori (GDIESNPGP, aa 227–235; GenBank accession no. AB035733 [GenBank] ), Dendrolimus punctatus (GDVESNPGP, aa 227–235; accession no. AY163248 [GenBank] ) and Lymantria dispar (GDVESNPGP, aa 227–235; accession no. AF389466 [GenBank] ), as well as the cypovirus 18 isolate from Operophtera brumata (GDVESNPGP, aa 228–236; accession no. DQ192245 [GenBank] ). Thus, 2A-like ‘cleavage’ sequences appear to have been incorporated into the coding strategies of a number of different dsRNA viruses.


   ACKNOWLEDGEMENTS
 
I thank Said Ghabrial for helpful discussions and Tim Baker, S. G., Don Lightner, Bonnie Poulos and Kathy Tang for helpful comments on the manuscript. I also thank the anonymous reviewer who kindly suggested that there is indeed a properly positioned, predicted RNA pseudoknot in IMNV.


   REFERENCES
TOP
ABSTRACT
MAIN TEXT
REFERENCES
 
Bekaert, M. & Rousset, J. P. (2005). An extended signal involved in eukaryotic –1 frameshifting operates through modification of the E site tRNA. Mol Cell 17, 61–68.[CrossRef][Medline]

Bekaert, M., Bidou, L., Denise, A., Duchateau-Nguyen, G., Forest, J. P., Froidevaux, C., Hatin, I., Rousset, J. P. & Termier, M. (2003). Towards a computational model for –1 eukaryotic frameshifting sites. Bioinformatics 19, 327–335.[Abstract/Free Full Text]

Brierley, I., Digard, P. & Inglis, S. C. (1989). Characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an RNA pseudoknot. Cell 57, 537–547.[CrossRef][Medline]

Crawford, L. J., Osman, T. A., Booy, F. P., Coutts, R. H., Brasier, C. M. & Buck, K. W. (2006). Molecular characterization of a partitivirus from Ophiostoma himal-ulmi. Virus Genes 33, 33–39.[CrossRef][Medline]

Dinman, J. D., Icho, T. & Wickner, R. B. (1991). A –1 ribosomal frameshift in a double-stranded RNA virus of yeast forms a gag–pol fusion protein. Proc Natl Acad Sci U S A 88, 174–178.[Abstract/Free Full Text]

Donnelly, M. L., Hughes, L. E., Luke, G., Mendoza, H., ten Dam, E., Gani, D. & Ryan, M. D. (2001). The ‘cleavage’ activities of foot-and-mouth disease virus 2A site-directed mutants and naturally occurring ‘2A-like’sequences. J Gen Virol 82, 1027–1041.[Abstract/Free Full Text]

Giedroc, D. P., Theimer, C. A. & Nixon, P. L. (2000). Structure, stability and function of RNA pseudoknots involved in stimulating ribosomal frameshifting. J Mol Biol 298, 167–185.[CrossRef][Medline]

Huang, C.-H., Lu, C. L. & Chiu, H.-T. (2005). A heuristic approach for detecting RNA H-type pseudoknots. Bioinformatics 21, 3501–3508.[Abstract/Free Full Text]

Jacks, T., Power, M. D., Masiarz, F. R., Luciw, P. A., Barr, P. J. & Varmus, H. E. (1988). Characterization of ribosomal frameshifting in HIV-1 gag–pol expression. Nature 331, 280–283.[CrossRef][Medline]

Li, L., Wang, A. L. & Wang, C. C. (2001). Structural analysis of the –1 ribosomal frameshift elements in giardiavirus mRNA. J Virol 75, 10612–10622.[Abstract/Free Full Text]

Lightner, D. V., Pantoja, C. R., Poulos, B. T., Tang, K. F. J., Redman, R. M., Pasos de Andrade, T. & Bonami, J. R. (2004). Infectious myonecrosis: new disease in Pacific white shrimp. Glob Aquac Advocate 7, 85

Naitow, H., Tang, J., Canady, M., Wickner, R. B. & Johnson, J. E. (2002). L-A virus at 3.4 Å resolution reveals particle architecture and mRNA decapping mechanism. Nat Struct Biol 9, 725–728.[CrossRef][Medline]

Palmenberg, A. C., Parks, G. D., Hall, D. J., Ingraham, R. H., Seng, T. W. & Pallai, P. V. (1992). Proteolytic processing of the cardioviral P2 region: primary 2A/2B cleavage in clone-derived precursors. Virology 190, 754–762.[CrossRef][Medline]

Plant, E. P., Jacobs, K. L., Harger, J. W., Meskauskas, A., Jacobs, J. L., Baxter, J. L., Petrov, A. N. & Dinman, J. D. (2003). The 9 Å solution: how mRNA pseudoknots promote efficient programmed –1 ribosomal frameshifting. RNA 9, 168–174.[Abstract/Free Full Text]

Poulos, B. T., Tang, K. F., Pantoja, C. R., Bonami, J. R. & Lightner, D. V. (2006). Purification and characterization of infectious myonecrosis virus of penaeid shrimp. J Gen Virol 87, 987–996.[Abstract/Free Full Text]

Reeder, J. & Giegerich, R. (2004). Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics. BMC Bioinformatics 5, 104[CrossRef][Medline]

Ribas, J. C. & Wickner, R. B. (1998). The Gag domain of the Gag–Pol fusion protein directs incorporation into the L-A double-stranded RNA viral particles in Saccharomyces cerevisiae. J Biol Chem 273, 9306–9311.[Abstract/Free Full Text]

Wang, A. L., Yang, H.-M., Shen, K. A. & Wang, C. C. (1993). Giardiavirus double-stranded RNA genome encodes a capsid polypeptide and a gag–pol-like fusion protein by a translational frameshift. Proc Natl Acad Sci U S A 90, 8595–8599.[Abstract/Free Full Text]

Received 2 November 2006; accepted 18 December 2006.


This article has been cited by other articles:


Home page
J. Gen. Virol.Home page
G. A. Luke, P. de Felipe, A. Lukashev, S. E. Kallioinen, E. A. Bruno, and M. D. Ryan
Occurrence, function and evolutionary origins of '2A-like' sequences in virus genomes
J. Gen. Virol., April 1, 2008; 89(4): 1036 - 1042.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Nibert, M. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nibert, M. L.
Agricola
Right arrow Articles by Nibert, M. L.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
INT J SYST EVOL MICROBIOL MICROBIOLOGY J GEN VIROL
J MED MICROBIOL ALL SGM JOURNALS