J Gen Virol Faster Access
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Gen Virol 69 (1988), 603-612; DOI 10.1099/0022-1317-69-3-603
© 1988 Society for General Microbiology

This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Takkinen, K.
Right arrow Articles by Pettersson, R. F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Takkinen, K.
Right arrow Articles by Pettersson, R. F.
Agricola
Right arrow Articles by Takkinen, K.
Right arrow Articles by Pettersson, R. F.

Nucleotide Sequence of the Rubella Virus Capsid Protein Gene Reveals an Unusually High G/C Content

Kristiina Takkinen1, Gabriele Vidgren1, Jonas Ekstrand3, Ulf Hellman2, Nisse Kalkkinen1, Christer Wernstedt2 and Ralf F. Pettersson3

1 Recombinant DNA Laboratory, University of Helsinki, Valimotie 7, SF-00380 Helsinki, Finland,
2 Ludwig Institute for Cancer Research, Uppsala Branch, Box 595, S-751 23 Uppsala, Sweden
and3 Ludwig Institute for Cancer Research, Stockholm Branch, Box 60202, S-104 01 Stockholm, Sweden

The nucleotide sequence of the rubella virus capsid protein (C) gene has been determined from a cDNA clone derived from the 40S genomic RNA. The sequence covers the coding region of the C protein (831 nucleotides), 70 nucleotides of the 5' untranslated region, and the 5' end of the downstream E2 membrane protein gene. The capsid gene is unusually rich in C (41.6%) and G (31.2%) residues (G + C 72.8%), and poor in A (15.4%) and U residues (11.8%). There are regions with long runs of up to 45% C or 35% G residues. The codon usage is non-random, with a strong preference for C and G residues in the third position. Starting from two in-frame AUG codons (seven amino acid residues apart) an open reading frame (ORF) was identified that extended in frame into the ORF coding for the downstream E2 membrane protein gene. Since the amino terminus of the capsid protein is blocked, we could not determine which of the AUGs serve as the initiating codon. To verify that the deduced ORF was correct, we have determined the amino acid sequence of 13 tryptic peptides corresponding to one-third of the C protein. Our data show that the C protein is about 277 residues in length (Mr about 30750). It is very hydrophilic and rich in prolines (14.1%) and arginines (14.4%). Clusters of these amino acids are concentrated in the aminoterminal third of the C protein. No sequence homology to the capsid protein of several alphaviruses was observed. Together with our previous sequence data we have now completed the sequence of the genes coding for the structural proteins C, E2 and E1 of rubella virus.

Keywords: rubella virus, nucleotide sequence, capsid protein sequence

Received 13 October 1987; accepted 16 November 1987.


This article has been cited by other articles:


Home page
Clin. Microbiol. Rev.Home page
J.-Y. Lee and D. S. Bowden
Rubella Virus Replication and Links to Teratogenicity
Clin. Microbiol. Rev., October 1, 2000; 13(4): 571 - 587.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
INT J SYST EVOL MICROBIOL MICROBIOLOGY J GEN VIROL
J MED MICROBIOL ALL SGM JOURNALS
Copyright © 1988 by the Society for General Microbiology.