J Gen Virol Try IJSEM Online
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Gen Virol 87 (2006), 1917-1933; DOI 10.1099/vir.0.81792-0

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplementary table
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Escasa, S. R.
Right arrow Articles by Arif, B. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Escasa, S. R.
Right arrow Articles by Arif, B. M.
Agricola
Right arrow Articles by Escasa, S. R.
Right arrow Articles by Arif, B. M.
© 2006 Society for General Microbiology

Sequence analysis of the Choristoneura occidentalis granulovirus genome

Shannon R. Escasa1, Hilary A. M. Lauzon1, Amanda C. Mathur1, Peter J. Krell2 and Basil M. Arif1

1 Laboratory for Molecular Virology, Great Lakes Forestry Centre, Sault Ste Marie, ON P6A 2E5, Canada
2 Department of Molecular and Cellular Biology, University of Guelph, Guelph, ON N1G 2W1, Canada

Correspondence
Basil M. Arif
barif{at}nrcan.gc.ca


   ABSTRACT
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
REFERENCES
 
The genome of the Choristoneura occidentalis granulovirus (ChocGV) isolated from the western spruce budworm, Choristoneura occidentalis, was sequenced completely. It was 104 710 bp long, with a 67.3 % A+T content and contained 116 potential open reading frames (ORFs) covering 88.4 % of the genome. Of these, 29 ORFs were conserved in all fully sequenced baculovirus genomes, 30 were GV-specific, 53 were present in some nucleopolyhedroviruses (NPVs) and/or GVs, three were common to ChocGV and Choristoneura fumiferana GV (ChfuGV) and one was so far unique. To date, ChocGV is the only GV identified that contains a homologue of the apoptosis inhibitor protein P35/P49, present in some group I NPVs. It is also the first GV without a Xestia c-nigrum GV ORF 26 homologue. Five homologous regions (hrs)/repeat regions, lacking typical NPV hr palindromes were identified. ChocGV hrs were similar to each other but not to other GV hrs. A 1.8 kb repeat region with a high A+T content (81 %) and multiple repeats of 21–210 bp was found between choc36 and 37. This area resembled the non-homologous region origin of DNA replication (non-hr ori) identified in Cryptophlebia leucotreta GV (CrleGV) and Cydia pomonella GV (CpGV). Based on the mean amino acid identities of homologous proteins, ChocGV was closest to fully sequenced genomes CpGV (52.3 %) and CrleGV (52.1 %). The closest amino acid identity was to individual ORFs from the partially sequenced ChfuGV genome (97.2 % in 38 ORFs). Phylogenetic analysis placed ChocGV in a clade with CrleGV and CpGV.

The GenBank/EMBL/DDBJ accession number for the sequence reported in this paper is DQ333351.

Comparison of the ChocGV genome with other granuloviruses is available as a supplementary table in JGV Online.


   INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
REFERENCES
 
Granuloviruses (GVs) have so far only been identified in Lepidoptera, while nucleopolyhedroviruses (NPVs) have been identified mainly in Lepidoptera, but also in Diptera and Hymenoptera (Blissard et al., 2000Down). GVs infect both agricultural and forest insect pests, making them potentially important as biological insecticides (Rashidan et al., 2004aDown). The sequencing of the Choristoneura occidentalis GV (ChocGV) genome brings the number of totally sequenced GVs to eight.

The western spruce budworm, Choristoneura occidentalis, and the eastern spruce budworm, Choristoneura fumiferana, were previously considered to be the same species separated geographically by the Rocky Mountains. They are now considered to be different species. C. occidentalis is the most widely distributed and destructive defoliator of coniferous trees in Western North America. The first recorded outbreak was in 1909 on Vancouver Island, Canada. The insects prefer new growth, damaging leaves, flowers, cones and affecting tree survival and regeneration (Fellin & Dewey, 1999Down). Damage begins in the spring when larvae mine into needles and enter the swollen buds. Webs are spun on new foliage, where insects feed by chewing needles off at their base. Trees may recover unless repeated severe defoliation occurs over 3–5 years (http://www.pfc.forestry.ca). Previous work by Arif et al. (1986)Down describes comparisons of granuloviruses isolated from Choristoneura spp. Baculoviruses that have been isolated from C. occidentalis and C. fumiferana are cross-infective (Fellin & Dewey, 1999Down). This manuscript reports the complete sequencing and analysis of the ChocGV genome and compares it with other baculovirus genomes.


   METHODS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
REFERENCES
 
Virus preparation and DNA extraction.
ChocGV was propagated in C. fumiferana larvae by placing third instar larvae on an artificial diet (10 per cup), surface treated with 3.8x108 occlusion bodies (OBs). Infected larvae were collected 3 weeks later, homogenized and stirred for 2 h in 0.5 % SDS. The homogenate was filtered and centrifuged at 5000 r.p.m. (13.1 rotor; Beckman J2-21) for 15 min at 15 °C. The OB pellet was washed three times in deionized water, recentrifuged and suspended in 10 ml water. The sample was loaded on to a 45 % sucrose cushion and centrifuged (SW28 rotor; Beckman ultracentrifuge) at 8000 r.p.m. for 20 min at 15 °C. The OB pellet was resuspended in water and centrifuged at 5000 r.p.m. for 15 min at 15 °C. OBs were dissolved in an alkaline solution (1.0 M sodium carbonate and 0.4 M sodium thioglycollate) and placed on to a continuous 10–45 % sucrose gradient and centrifuged at 20 000 r.p.m. (SW28 rotor; Beckman Ultracentrifuge) for 1.25 h at 4 °C. The banded virions were withdrawn (with a syringe), diluted with TE (10 mM Tris and 1.0 mM EDTA) and centrifuged at 22 000 r.p.m. (SW28 rotor; Beckman Ultracentrifuge) for 2 h at 4 °C. The pellet was suspended in 500 µl TE. Proteinase K (25 µl, 20 mg ml–1) was added and incubated at 37 °C for 30 min followed by another 25 µl proteinase K with 1 % SDS (final concentration) and incubated at 50 °C for 30 min. DNA was extracted with phenol/chloroform and dialysed against several changes of TE for 24 h at 4 °C.

DNA sequencing and analysis.
ChocGV genomic DNA was nebulized, cloned into pUC18 and sequenced to yield 10 times coverage (Applied Biosystems; BGI Life Technology). Base calling was done using the software Phred (Ewing et al., 1998Down; Ewing & Green, 1998Down), pUC18 contaminations were removed with CrossMatch (Green, 2004Down), and the sequence was assembled using Phrap (Green, 1996Down). Gaps and regions with low quality data identified after preliminary assembly were filled in or refined by sequencing PCR amplified products.

Sequence analysis was carried out using NCBI open reading frame (ORF) finder (Wheeler et al., 2003Down), BLAST (Altschul et al., 1990Down), DNASTAR (Burland, 2000Down) and the Multi-purpose automated genomics project investigation environment (Magpie), available through the University of Calgary (Gaasterland & Sensen, 1996Down). Methionine-initiated ORFs potentially encoding 50 or more amino acids were considered for further analysis. Smaller ORFs with baculovirus homologues were also considered. DNA regions containing potential ORFs with unusually large repeats were deemed as non-coding and analysed separately. Repeated sequences were identified using REPuter (Kurtz & Schleiermacher, 1999Down) and Tandem Repeats Finder (Benson, 1999Down). ExPASy (Gasteiger et al., 2003Down), Prosite (Hulo et al., 2004Down) and SMART (Schultz et al., 1998Down) were used to identify protein motifs. Multiple sequence alignments and percentage amino acid identities were carried out using DNASTAR's MEGALIGN CLUSTAL W (Thompson et al., 1994Down) with default conditions. GeneDoc was used to shade conserved sequences in aligned proteins (Nicholas & Nicholas, 1997Down). Seven GV genomes and 11 NPV genomes were used to identify gene conservation in ChocGV. The genomes included were from: Cydia pomonella GV (CpGV; Luque et al., 2001Down), Cryptophlebia leucotreta GV (CrleGV; Lange & Jehle, 2003Down), Adoxophyes orana GV (AdorGV; Wormleaton et al., 2003Down), Phthorimaea operculella GV (PhopGV; GenBank accession no. AF499596 [GenBank] ), Plutella xylostella GV (PlxyGV; Hashimoto et al., 2000Down), Xestia c-nigrum GV (XecnGV; Hayakawa et al., 1999Down), Agrotis segetum GV (AgseGV; GenBank accession no. NC_005839 [GenBank] ), Autographa californica multicapsid NPV (AcMNPV; Ayres et al., 1994Down), Helicoverpa armigera NPV (HearNPV G4; Chen et al., 2001Down), Spodoptera litura NPV (SpltNPV; Pang et al., 2001Down), Spodoptera exigua MNPV (SeMNPV; IJkel et al., 1999Down), Lymantria dispar MNPV (LdMNPV; Kuzio et al., 1999Down), Orgyia pseudotsugata MNPV (OpMNPV; Ahrens et al., 1997Down), Choristoneura fumiferana defective NPV, (CfDEFNPV; Lauzon et al., 2005Down), Epiphyas postvittana NPV (EppoNPV; Hyink et al., 2002Down), Rachiplusia ou MNPV (RoMNPV; Harrison & Bonning, 2003Down), Bombyx mori NPV (BmNPV; Gomi et al., 1999Down) and Neodiprion lecontei NPV (NeleNPV; Lauzon et al., 2004Down). Promoter regions were found using Promoter Scan (Prestridge, 1995Down) and InterPro (Mulder et al., 2003Down). Gene parity plots (Hu et al., 1998Down) compared the gene order of ChocGV with CpGV, CrleGV, AdorGV and AcMNPV. Concatamers of 29 conserved baculovirus proteins from 27 genomes were used to determine phylogeny. A most parsimonious tree was constructed using CLUSTAL W protein alignments and PAUP 4.0b10 (Swofford, 2000Down) with maximum-parsimony, heuristic search, step-wise addition option and bootstrap analysis (1000 replicates). Genomes used in phylogenetic analysis included those previously listed plus the following: HearNPV C1 (Zhang et al., 2005Down), Helicoverpa zea SNPV (HzSNPV; Chen et al., 2002Down), Mamestra configurata NPV A (MacoNPV A; Q. Li et al., 2002), MacoNPV B, (L. Li et al., 2002), Adoxophyes honmai NPV (AdhoNPV; Nakai et al., 2003Down), Choristoneura fumiferana MNPV (CfMNPV; de Jong et al., 2005Down), Neodiprion sertifer NPV (NeseNPV; Garcia-Maruniak et al., 2004Down) and Culex nigripalpus NPV (CuniNPV; Afonso et al., 2001Down).


   RESULTS AND DISCUSSION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
REFERENCES
 
ChocGV sequence analysis
Approximately 11.5 times coverage of the genome was achieved by generating 1 209 500 nt of raw data from 1793 sequencing reads. An overall error rate of 0.009 % was estimated. The ChocGV genome contained 104 710 bp, slightly larger than the 99 kb estimated by restriction endonuclease analysis. The discrepancy in size was due to the presence of 10 HindIII fragments less than 1.0 kb identified by sequencing that were not seen on agarose gels. ChocGV is the third smallest GV sequenced to date, with PlxyGV (100 999 bp) and AdorGV (99 657 bp) being smaller (Supplementary Table S1 available in JGV Online). The A+T content of ChocGV was 67.3 %, second only to CrleGV with 67.7 % (Lange & Jehle, 2003Down). ChocGV contained 116 methionine-initiated ORFs, 62 (53.4 %) clockwise and 54 (46.6 %) counterclockwise, with respect to granulin that was designated ORF 1 (Table 1Down, Fig. 1Down). Choc41 (46 aa) was considered a potential ORF as it showed baculovirus homology (Table 1Down). There were minor overlaps between certain ORFs; the longest (139 bp) covered the 3' end of alkaline-exonuclease and the 5' end of helicase-2.


View this table:
[in this window]
[in a new window]
 
Table 1. Characteristics of the ChocGV genome

Predicted ORFs are compared with homologues in eight GVs and AcMNPV. Direction of transcription is noted by the symbols: > (+ve strand) and < (–ve strand). Promoter motifs are identified as early (E) and late (L). Protein motifs and other homologues are shown in the ‘Comments' column. Late promoter motif (DTAAG), where D=A, G or T within 150 nt from start codon.

 

Figure 1
View larger version (31K):
[in this window]
[in a new window]
 
Fig. 1. Representation of ChocGV linear ORF map. The arrows indicate direction of transcription and relative size of ORFs. ChocGV ORF numbers and homologous ORFs in AcMNPV and CpGV are shown above the arrows and gene names below the arrows. Genome position is indicated by the scale (kb). ORFs are shaded according to the key.

 
Intergenic spaces and a non-homologous region origin of DNA replication
Most intergenic spaces in the genome were less than 50 bp (48 spaces) or 50–300 bp (36 spaces). Four were between 300 and 600 bp and three were greater than 1.0 kb. The first large intergenic space was between choc21 and 22 (1018 bp) and the second between choc22 and 23 (1545 bp). These areas were expected to contain homologues to cp28 and cp30, respectively, but neither ORF was present (Fig. 1Up), and no BLAST matches with other baculovirus genomes were found. The third intergenic space (3441 bp) between choc36 and 37 contained a 1.8 kb region with multiple direct repeats that was extremely AT rich (81 %). Initial computer analysis suggested the presence of a 1144 aa ORF in this region with 27 leucine zippers. Leucine zippers contain leucine residues (L) at every seventh position over at least seven helical turns (Gattiker et al., 2002Down) that establish amphipathy to help stabilize a long {alpha}-helix (Landschulz et al., 1988Down). The leucine zipper motif is: [L(X6)L(X6)L(X6)L] (Bateman et al., 2004Down). This potential ORF did not have any baculovirus homologues. Sequencing errors are more common within repeat regions as frequent recombination can lead to differences in data between clones (Ahrens et al., 1997Down). Also, ChocGV was not plaque purified, increasing the possibility of clonal variation. Manual examination revealed variation among different clones covering the same area. Other fully sequenced GV genomes contain three smaller ORFs in this region (cp49, cp50 and cp51; Fig. 1Up). The presence of three ORFs between choc36 and 37 was suggested by BLASTN analysis. These potential ORFs had high identity to three ORFs from ChfuGV (GenBank accession nos AAN77189 [GenBank] , AAN77190 [GenBank] and AAN77191 [GenBank] ). The beginning of the region also showed low nucleotide identity to cp50/crle47 and its 3' region showed low identity to xecn47. Manual examination of the trace data showed several areas where sequencing errors were possible or where the addition or subtraction of a single base might cause a frameshift. Two of these potential changes involved areas with large strings of adenines. If bases were added to cause frameshifts, the first potential ORF between choc36 and 37 would be larger than its ChfuGV homologue (GenBank accession no. AAN77189 [GenBank] ) as it would have three leucine zippers instead of one, suggesting that duplication of repeat regions has occurred in the ChocGV genome. The second potential ChocGV ORF had three repeats of 210 bp constituting 84 % of the ORF and the third lacked repeats but had a transmembrane domain at its 5' end.

There were many factors to consider when interpreting the large space between choc36 and choc37: (i) the space had multiple repeats, (ii) it contained ambiguous bases in the trace data, (iii) ChocGV was not plaque purified and clonal variation existed in this region, (iv) no other baculovirus genomes contained homologues to the large potential protein between choc36 and 37, and (v) difficulties arose in deciding where bases should be added or deleted to cause a frameshift producing three small ORFs. Because of the combination of these factors, decisions to have one large ORF or three smaller ORFs between choc36 and 37 were both rejected. Instead, DNA regions containing ORFs with large, unusual repeats were considered non-coding and were analysed separately as was done with the genome of CrleGV (Lange & Jehle, 2003Down).

Intergenic spaces greater than 1.0 kb are found in other GVs. XecnGV has a 1473 bp intergenic region (Hayakawa et al., 1999Down) in approximately the same location as the first large intergenic space in ChocGV. CrleGV has a 1.8 kb intergenic space between crle26 and crle27 that contains a 300 bp AT-rich region, direct repeats, short palindromes and is considered to be a potential non-homologous region origin of DNA replication (non-hr ori) (Jehle, 2002Down; Lange & Jehle, 2003Down). CpGV has a repeat region of 1.13 kb with six short imperfect repeats, three large imperfect repeats and an AT-rich section also considered to be a non-hr ori. The CpGV non-hr ori region, however, is located within three ORFs (cp25–cp27) none of which have baculoviral homologues (Huang & Levin, 1999Down). The non-hr ori repeats in CpGV and CrleGV are not found elsewhere in the genomes and are not similar to their homologous regions (hrs) (Luque et al., 2001Down; Lange & Jehle, 2003Down). Other baculoviruses with non-hr oris include: AcMNPV (Kool et al., 1994Down), OpMNPV (Pearson et al., 1993Down), SeMNPV (Heldens et al., 1997Down) and Spodoptera littoralis (SpliMNPV) (Huang & Levin, 1999Down). These regions are complex, contain direct and inverted repeats and are between 800 and 4000 bp long (Luque et al., 2001Down). The ChocGV 1.8 kb repeat region between choc36 and 37 showed similarities to the non-hr ori regions in CrleGV and CpGV as it had multiple repeats not found elsewhere in the genome, was not similar to hrs and was very AT rich.

Hrs regions
Typically, baculoviruses have many hrs interspersed throughout their genomes. These regions may act as enhancers of RNA transcription, as origins of DNA replication and as sites of recombination (Hayakawa et al., 2000Down). Most NPV hrs contain 30 bp palindromes within direct repeats, whereas GV repeat regions are more variable and often lack palindromes (Wormleaton et al., 2003Down). XecnGV contains nine hrs each with three to six direct repeats lacking a palindromic core (Hayakawa et al., 1999Down). PlxyGV has four large repeat regions centred on a palindrome that more closely resemble NPV hrs (Hashimoto et al., 2000Down), while AdorGV has nine repeat regions unlike NPV hrs (Wormleaton et al., 2003Down). ChocGV contained five hrs/repeat regions varying in size and number of direct repeats. The first (hr-1), had three repeated sequences in a 157 bp area, hr-2, four repeated sequences within 129 bp, and hr-3, 10 repeated sequences within 288 bp. The fourth and fifth hrs/repeat regions had four repeated sequences in a 106 bp area and 162 bp area, respectively (Fig. 2aDown). Direct repeats averaged 27 bp with the shortest being 19 bp and the longest 39 bp. These regions were not like NPV hrs and did not have embedded palindromes. Each hr was homologous to all other ChocGV hrs but not to hrs in other baculoviruses. Despite the differences between GV and NPV hrs, and between GV hrs themselves, the relative locations of hrs in ChocGV were similar to the locations of three of the hrs in CrleGV, and to eight of 13 repeats in CpGV (Fig. 2bDown).


Figure 2
View larger version (58K):
[in this window]
[in a new window]
 
Fig. 2. Alignment of ChocGV hrs/repeat regions. (a) ChocGV hrs/repeat regions are numbered from 1 to 5, with repeats designated by letter relative to their position in the genome. The direct repeat consensus sequence is shown below the alignment. Shading reflects the degree of nucleotide conservation: black, 100 %; dark grey, >85 % and light grey, >60 %. (b) The location of ChocGV five hrs/repeat regions is compared with that of CrleGV and CpGV. Homologous ORFs within each section are shaded the same colour.

 
Gene content and organization
ChocGV contained all 29 core genes found in baculovirus genomes sequenced to date. It was the first fully sequenced GV lacking a xecn26 homologue. Of 116 total ORFs, 30 were found only in GVs, 53 were found in some GVs and/or NPVs, three only in ChocGV/ChfuGV (choc4, 10 and 11), and one (choc87) was so far unique. Since the ChfuGV genome has not yet been fully sequenced, a homologue to choc87 may still be identified. Of the ORFs found only in ChocGV/ChfuGV, choc4 and choc11 showed 100 % amino acid identity to their ChfuGV homologues (GenBank accession nos AAM60756 [GenBank] and AAM60754 [GenBank] ). Choc10 shared 80 % amino acid identity with its ChfuGV homologue (GenBank accession no. AAM60753 [GenBank] ) and showed strong homology to conserved domains in translation initiation factors 5 and 2 (eIF5C and eIF2B_5) from Anopheles gambiae (GenBank accession no. EAA05225 [GenBank] ) (e=1.0 e-125, 55.5 % amino acid identity). Translation initiation factors are responsible for the binding of the initiator Met-tRNAi and mRNA to ribosomes (Das & Maitra, 2000Down). Choc36 had a 100 % amino acid identity to a ChfuGV ORF (GenBank accession no. AAN77188 [GenBank] ), which is listed in GenBank as a homologue to xecn152 (enhancin). Therefore, it would be expected that choc36 would also be a homologue to xecn152. However, BLASTP results showed that choc36 did not have a baculovirus match but had a high identity (9.0 e-10, 30.7 % amino acid identity) to a protein phosphatase regulatory subunit 15A in ORF 93 of Amsacta moorei entomopoxvirus (AmEPV; Bawden et al., 2000Down). CLUSTAL W alignments showed that ChfuGV (GenBank accession no. AAN77188 [GenBank] ) and choc36 shared 23.8 % amino acid identity with xecn152. Therefore, choc36 was accepted as a xecn152 homologue. Choc25 showed 21.4 % amino acid identity to a previously unique CrleGV ORF 9 and 34.3 % to a protein from a parvo-like virus [ORF 2 Yamanashi isolate from Bombyx mori densovirus (BmDNV-2; Bando et al., 1995Down)]. Other ChocGV ORFs had homologues in NPVs but not GVs. Choc20 shared 27.7 % amino acid identity with LdMNPV ORF 7 and 29.8 % with CfDEFNPV ORF 145. Choc53 shared a 24.4 % amino acid identity to LdMNPV ORF 111 and 27.8 % with MacoNPV A ORF 46.

Choc62 and choc63 had BLASTP matches to cp82 at 35.6 and 25.6 % amino acid identity, respectively, but shared only 10.3 % amino acid identity with each other. The combined size of choc62 (261 bp) and choc63 (597 bp) was similar to that of cp82 (921 bp), suggesting that the ORFs corresponded to different regions of cp82. Choc62 aligned with the 3' end of cp82 (664–921 bp), while choc63 aligned with the 5' end (89–746 bp). The sequencing data did not reveal any sequencing errors that could link the two potential ORFs as one. The function of cp82 is not known. It is conceivable that low expression of a cp82 homologue could take place in ChocGV even if a computational frameshift had occurred resulting in choc62 and choc63. This suggestion is corroborated by the fact that low gene expression of lacZ was detected in AcMNPV p10 deletion mutants containing a translational frameshift in the lacZ region (Williams et al., 1989Down).

Inhibitors of apoptosis
Apoptosis plays an important role in virus replication and in cellular response to infection (O'Brien, 1998Down). Viruses have acquired genes, the products of which evade or suppress programmed cell death. There are two types of baculovirus proteins that suppress apoptosis, P35/P49 homologues and inhibitors of apoptosis (IAPs) (Du et al., 1999Down). P35 acts as a direct inhibitor of proteases, while IAPs act upstream to prevent activation of the proteases (Vaux & Strasser, 1996Down). Functional homologues of P35 are found in some but not all NPVs, whereas IAPs are present in both NPVs and GVs. ChocGV is the first GV with a P35/P49 homologue. Choc15 shared the highest amino acid identity with SpliMNPV P49 (GenBank accession no. AJ006751 [GenBank] ) (27 %), SpltNPV ORF 55 P49 (25.7 %) and RoMNPV ORF 128 P35 (21 %). Choc15 showed a BLASTP match (1.0 e-07, 19.1 % amino acid identity) with a previously unreported P35 homologue from AmEPV (AMV010) (Bawden et al., 2000Down) (Fig. 3Down). While these values are not very high, functionality awaits further analysis. It is not yet known if choc15 functions as a P35 or as a P49 homologue as it shares conserved amino acids with both but its functionality and target of activity are yet to be determined. P35 proteins have a large loop domain known as the reactive site loop that contains the caspase recognition motif DQMDG at its apex, while P49 proteins contain the motif TVTDG (Pei et al., 2002Down). Choc15 and AMV010 appear to have different active site motifs than those of P35 and P49. Two motifs, 92DCSND96 and 86VYNFD90 in choc15 and AMV010, respectively, were found in the same relative location as the active motifs of P35 and P49 (Fig. 3Down). It has been shown that AMV010 is an active apoptotic suppressor and that mutating the aspartic acid residue (D90) within the above motif results in an inactive protein (R. Clem, personal communication). Experiments are under way to verify the activity of choc15 and to ascertain the essentiality of D96 within its putative active site motif. The molecular mass of choc15 was 42.6 kDa, which differs from P49 (49 kDa) and P35 (35 kDa) homologues. It is also interesting to note that the location of choc15 within ChocGV was similar to that of iap-3 in both CpGV and CrleGV (Fig. 2bUp). Five groups of iap genes have been identified in baculoviruses but not all are active suppressors of apoptosis (Ikeda et al., 2004Down). Homologues of iap-3, generally identified as active inhibitors of apoptosis, are found in both NPVs and GVs, but iap-5 homologues have so far been found only in GVs (Wormleaton et al., 2003Down). ChocGV contained two iap homologues; choc84 (iap-3) and choc95 (iap-5). Most IAP proteins contain baculovirus inhibitor repeats (BIRs) and a RING finger (Schultz et al., 1998Down). Both choc84 and choc95 contained two BIRs and one zinc finger ring. IAP and P35 are not homologues and are likely unrelated in ancestry and mechanism of action (Crook et al., 1993Down).


Figure 3
View larger version (67K):
[in this window]
[in a new window]
 
Fig. 3. Choc15 alignment with P35 homologues. Proteins aligned include AmEPV (ORF 10), AcMNPV (ORF 136), BmNPV (ORF 112), RoMNPV (ORF 128), HycuNPV (GenBank accession no. AAO17287), LeseNPV (GenBank no. AAF78504), SpltMNPV (ORF 55) and SpliMNPV (GenBank no. AJ006751). Shading depicts conserved amino acids: black, 100 %; dark grey, >80 % and light grey, >60 %. The caspase recognition motifs in P35 homologues are boxed, underlined in P49 homologues and boxed in choc15 and AMV010.

 
Replication, translation and structural genes
All genes involved in DNA replication or expression listed by Herniou et al. (2003)Down were found in ChocGV (Table 2Down). Of the replication- and transcription-specific genes present in some but not all baculoviruses (Hayakawa et al., 2000Down), ChocGV did not have late expression factor 7 (lef-7) or lef-12 typically found in group I NPVs. The genome also lacked immediate-early gene 2 (ie-2), proliferating cell nuclear antigen (pcna), pe38, ribonuclease reductase 1 (rr1), rr2 and dutpase.


View this table:
[in this window]
[in a new window]
 
Table 2. ChocGV genes grouped according to function

Genes present in ChocGV are represented by an ORF# and those missing are represented by CpGV and/or XecnGV ORFs.

 
Homologues of all conserved baculoviral genes encoding structural proteins were found in ChocGV (Table 2Up). Granulin and ODV-E25 shared 90.9 and 72.2 % amino acid identity, respectively, with homologues from the seven sequenced GVs, making them two of the most highly conserved proteins in ChocGV. All three genes involved in per os infectivity were present in ChocGV, namely p74 (Faulkner et al., 1997Down) (choc46), per os infectivity factor-1 (pif-1) (Kikhno et al., 2002Down) (choc56) and pif-2 (Pijlman et al., 2003Down) (choc35). These three genes are conserved among all baculoviruses sequenced to date (Lauzon et al., 2004Down; Garcia-Maruniak et al., 2004Down).

Three ChocGV ORFs contained sequences corresponding to the N terminus of the polyhedron envelope protein (PEP). These ORFs were found in similar locations and shared high similarity with other GV PEPs. ChocGV PEP proteins (choc17, 18 and 19) and CpGV PEP proteins (cp20, 22 and 23) averaged 70.6 and 68.5 % amino acid identity, respectively, with the PEP proteins from CrleGV (crle20, 23, 24). PEP is found on the surface of OBs and is important for the formation of polyhedra as it stabilizes and prevents them from fusing (Bateman et al., 2004Down).

P10 proteins are characterized by having various common structural and functional domains, including a coiled-coil region, followed by a proline-rich area [(G/A/V/L/I) P (D/E/N) (V/L/I/P) P], a variable region and finally a basic region at the C-terminal. P10 proteins from different baculoviruses are characterized by the size differences among their shared domains and their sequences are generally poorly conserved (Van Oers & Vlak, 1997Down). Among the seven sequenced GV genomes, only three have P10. AdorGV and XecnGV each have one (ador13 and xecn5, respectively), while PlxyGV has three (plxy2, plxy21 and plxy50). The mean amino acid identity of choc45 with GV P10 proteins was only 14.5 %, but was 30 % with nine NPV P10s (Fig. 4aDown), with the highest match being op133 (46.2 %), making choc45 closer to NPV P10s than to GV P10s. ChfuGV P10 (GenBank accession no. AAN77200 [GenBank] ), which is identical to choc45, clustered within group I NPV P10s in a phylogenetic tree (Lange & Jehle, 2003Down), and the gene arrangement of p10 in ChfuGV was identical to that of many NPVs (Rashidan et al., 2004bDown).


Figure 4
View larger version (69K):
[in this window]
[in a new window]
 
Fig. 4. P10 homologues. (a) Alignment of baculovirus P10 proteins: ChocGV_a (choc45), ChocGV_b (choc18), CrleGV (crle23 PEP/P10), AcMNPV (ORF 137), CfDEFNPV (ORF 131), OpMNPV (ORF 133), EppoNPV (ORF 120), CfMNPV (ORF 129), SeMNPV (ORF 130), SpltNPV (ORF 19), LdMNPV (ORF 41) and HearNPV_C1 (ORF 21). The coiled-coil region consists of heptad repeats shaded light grey (*), proline-rich regions in dark grey and the basic region in black. (b) This model, modified from Van Oers & Vlak (1997)Down, shows how P10 domains vary between choc45 P10 and choc18 (P10/PEP). Shading in (b) corresponds to that in (a). (c) The alignment of choc45 and AmEPV ORF 32 (FALPE) show both have an identical proline-rich stretch at their carboxyl ends not found in any other baculovirus P10 (except ChfuGV). They share a 30.5 % amino acid identity, higher than choc45 with GV P10s.

 
ChocGV contained a second P10-like motif attached to a PEP protein (choc18). It contained a coiled-coil region with a heptad of hydrophobic and hydrophilic amino acids. The sizes of the domains for choc45 (P10) and the P10 region of choc18 are shown in Fig. 4(b)Up. Along with their P10 proteins, AdorGV and XecnGV also have PEP/P10 homologues with a P10-like motif (ador17 and xecn19, respectively). Other GVs with PEP/P10 proteins include: CpGV (cp22), PhopGV (phop20), CrleGV (crle23) and AgseGV (agse19). The P10 domains within crle23 and choc18 shared 51.1 % amino acid identity and the PEP domains 87.8 %. The close association of P10 and the polyhedron envelope has been well documented in many NPVs and it is known that the presence of P10 is essential for the formation of the polyhedron envelope. In some GVs, the functional association of PEP and P10 might have been conserved in a single protein (Lange & Jehle, 2003Down). Further work must be done to fully understand the role of the different P10 homologues in ChocGV.

Entomopoxviruses produce cytoplasmic fibrils made of filament-associated late proteins (FALPE), while baculoviruses have filaments in the nuclei and cytoplasm of infected cells made of P10 protein (Alaoui-Ismaili & Richardson, 1998Down). FALPE homologues share some of the structural features of P10 proteins (a proline-rich region and a basic C-terminal tail), but amino acid identity between FALPE and P10 proteins is low (Alaoui-Ismaili & Richardson, 1998Down). AMV032 (FALPE) and choc45 (P10), however, share a high amino acid identity (30.5 %) due to their identical proline-rich region, alternating proline (P) and glutamic acid (E) (Fig. 4cUp) not seen in other NPV or GV P10s to date (except ChfuGV). However, these EP repeats have been seen in Trypanosoma sp. procyclins (glycosyl phosphatidylinositol-anchored proteins that contain EP/GPEET amino acid repeats at the C-terminal) (Rashidan et al., 2004bDown; Ruepp et al., 1999Down). In phylogenetic analysis (data not shown), FALPE and choc45 grouped with each other, suggesting a strong evolutionary relationship.

Auxiliary genes
Auxiliary genes are not essential for replication but may give the virus a selective advantage in nature (O'Reilly, 1997Down). ChocGV did not appear to contain baculovirus repeat orfs (bros), iap-1, chitinase, cathepsin and some other auxiliary genes present in some GVs (Table 2Up). Bro genes are highly repetitive (16 present in LdMNPV) and conserved, but their function is not yet clear (Kuzio et al., 1999Down). Chitinases are directly involved in the degradation of insect cuticle during moulting, and cathepsin is involved with insect liquefaction (Slack et al., 1995Down; Hawtin et al., 1997Down). The absence of these two genes from ChocGV is manifested by the larvae not liquefying upon death. It is not yet clear as to why ChocGV lacks chitinase and cathepsin. ChocGV, however, had an ORF (choc9) with a coiled-coil domain, a chitin-binding domain (type2 ChtBD2), and a high amino acid identity with crle8 (72.7 %) and cp9 (66.7 %), which also have ChtBD2. Chitin-binding domains are extracellular domains that contain six cysteines needed to form three disulfide bridges (Schultz et al., 1998Down). Auxiliary genes present in ChocGV are listed in Table 2Up. Ubiquitin (ubq) was the second most conserved gene in ChocGV, having a mean of 78.8 % amino acid identity to homologues in seven sequenced GVs. All GV genomes sequenced to date have three fibroblast growth factors (fgfs). ChocGV is no exception, with choc57, 102 and 114 in the same relative locations as fgfs in other GVs. FGFs are ubiquitous in nature and are highly conserved in structure and amino acid sequence. They are found throughout the genome and play important roles in cell proliferation, differentiation and tissue repair (Ornitz & Nobuyuki, 2001Down).

Comparison of ChocGV with other baculoviruses
Comparative analysis of baculoviruses can provide insight into their evolutionary history. Similarities and differences in amino acid sequences and in gene order help place baculoviruses in groups sharing gene characteristics and overall genome relatedness. ChocGV and ChfuGV are almost identical in size and restriction enzyme digestion profiles. Comparison of DNA profiles from both viruses cut with EcoRI, HindIII, KpnI and BamHI reveals only minor differences in band size ranging from 0.05 to 5 kb, with the largest difference being in HindIII-P. Comparison of structural proteins also reveals only minor size differences between the two viruses (Arif et al., 1986Down). CLUSTAL W alignment of individual ChocGV sequences with ChfuGV sequences revealed 100 % amino acid identity for 27 of 38 ChfuGV ORFs sequenced to date (Table 1Up), with the mean amino acid identity for all being 97.2 %. With similar restriction patterns, biological relationships and high genomic homology, ChocGV and ChfuGV may be variants of the same viral species. The fully sequenced baculoviruses with the highest mean amino acid identity with ChocGV were CpGV (106 homologous ORFs, 52.3 % amino acid identity), CrleGV (106 ORFs, 52.1 % amino acid identity) and PhopGV (103 ORFs, 48.1 % amino acid identity) (Supplementary Table S1 available in JGV Online). The NPV with the highest mean amino acid identity to ChocGV was HearNPV G4 (65 homologous ORFs averaging 31.2 %).

Gene arrangement in baculoviruses is a reflection of their evolutionary history, with more closely related viruses sharing a higher degree of gene collinearity (Hu et al., 1998Down). Gene parity plots compare the positions of homologous genes in different genomes and are used to illustrate conservation between baculovirus genomes (Herniou et al., 2003Down). Examination of the number of ORFs in the same order as ChocGV (diagonal line in Fig. 5Down), showed that ChocGV shared the highest gene order with CrleGV (88.8 % collinearity), CpGV (87.9 %), AdorGV (85.3 %) and AcMNPV (24 %) (Fig. 5Down). The positions of iap-3 and p10 were the most variable. In ChocGV, p10 was found immediately downstream of p74, as in most NPVs. However in most GVs, p74 is distal from p10. One drawback of using parity plot analysis as compared with phylogenetic trees is that they do not give quantitative information on genome relatedness (Herniou et al., 2001Down).


Figure 5
View larger version (12K):
[in this window]
[in a new window]
 
Fig. 5. Gene parity plot comparison of ChocGV with AcMNPV (a), CpGV (b), CrleGV (c) and AdorGV (d). Homologous genes are plotted based on their relative location in the genomes. Those without homologues are aligned on the x or y axes, respectively. The positions of ChocGV p10 and iap-3 relative to AcMNPV, CpGV, CrleGV and AdorGV are noted. Each diamond represents an ORF.

 
Phylogenetic analysis
Traditionally, phylogenies were determined by analysing the sequences of individual genes such as polyhedrin, granulin (Zanotto et al., 1993Down), dna polymerase (Bulach et al., 1999Down) or ecdysteroid UDP-glucosyltransferase (Barrett et al., 1995Down) from several genomes. Recently, however, a concatenation of shared proteins has been shown to produce more reliable trees (Herniou et al., 2001Down). A most parsimonious tree produced using concatamers of the 29 conserved proteins from 27 fully sequenced baculovirus genomes (Fig. 6Down) reflected the three major taxonomic divisions within the lepidopteran baculoviruses: group I NPV, group II NPV and GVs. ChocGV grouped in a clade with CpGV and CrleGV. This was expected, as they shared the highest mean amino acid identity to ChocGV. The Hymenopteran and Dipteran baculoviruses grouped in separate clades. A new baculovirus classification system has been proposed for these non-lepidopteran baculoviruses, as they do not group with lepidopteran NPVs or GVs (Herniou et al., 2003Down; Lauzon et al., 2004Down).


Figure 6
View larger version (17K):
[in this window]
[in a new window]
 
Fig. 6. The most parsimonious tree based on concatenation of 29 conserved proteins from 27 baculovirus genomes. Bootstrap values (%) for 1000 replicates are shown. Lepidopteran baculoviruses branched into three major groups: group I and II NPVs andGVs. The Hymenopteran baculoviruses NeleNPV and NeseNPV, and the Dipteran virus CuniNPV, formed their own branches.

 
Based on sequence and phylogenetic analyses of complete genomes, ChocGV was most closely related to CpGV, but individual ORFs showed the highest identity to those in the partially sequenced ChfuGV genome. ChocGV, however, had some features that were closer to NPVs than to GVs. A P35/P49 homologue, previously found only in NPVs, was present in ChocGV. ChocGV P10 shared a higher amino acid identity with NPV P10s than with GV P10s and ChocGV contained two ORFs (choc20 and 53) previously found only in NPVs. The relationship of ChocGV P10 to AmEPV FALPE was also noteworthy, with choc45 being the only baculovirus ORF identified to date that contains a proline-rich region similar to that in FALPE. The common set of granulovirus genes has dropped, as ChocGV is the first GV sequenced without a xecn26 homologue. As more baculovirus genomes are sequenced, more information will become available about genes that are essential for NPVs and GVs and on the evolution of the family Baculoviridae.


   ACKNOWLEDGEMENTS
 
This research was supported by grants from Genome Canada through the Ontario Genomics Institute and the Canadian Biotechnology Strategy Fund. We would like to thank the University of Calgary (Department of Biochemistry and Molecular Biology) for the use of Magpie in aiding in genome sequence analysis. We thank Dr Rollie Clem for making some of his data on AMV010 P33 available to us.


   REFERENCES
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
REFERENCES
 
Afonso, C. L., Tulman, E. R., Lu, Z., Balinsky, C. A., Moser, B. A., Becnel, J. J., Rock, D. L. & Kutish, G. F. (2001). Genome sequence of a baculovirus pathogenic for Culex nigripalpus. J Virol 75, 11157–11165.[Abstract/Free Full Text]

Ahrens, C. H., Russell, R. L., Funk, C. J., Evans, J. T., Harwood, S. H. & Rohrmann, G. F. (1997). The sequence of the Orgyia pseudotsugata multinucleocapsid nuclear polyhedrosis virus genome. Virology 229, 381–399.[CrossRef][Medline]

Alaoui-Ismaili, M. H. & Richardson, C. D. (1998). Insect virus proteins (FALPE and p10) self associate to form filaments in infected cells. J Virol 72, 2213–2223.[Abstract/Free Full Text]

Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990). Basic local alignment search tool. J Mol Biol 215, 403–410.[CrossRef][Medline]

Arif, B. M., Zhang, G.-Y. & Jamieson, P. (1986). A comparison of three granulosis viruses isolated from Choristoneura spp. J Invertebr Pathol 48, 180–186.[CrossRef]

Ayres, M. D., Howard, S. C., Kuzio, J., Lopez-Ferber, M. & Possee, R. D. (1994). The complete DNA sequence of Autographa californica nuclear polyhedrosis virus. Virology 202, 586–605.[CrossRef][Medline]

Bando, H., Hayakawa, T., Asano, S., Sahara, K., Nakagaki, M. & Iizuka, T. (1995). Analysis of the genetic information of a DNA segment of a new virus from silkworm. Arch Virol 140, 1147–1155.[CrossRef][Medline]

Barrett, J. W., Krell, P. J. & Arif, B. M. (1995). Characterization, sequencing and phylogeny of the ecdysteroid UDP-glucosyltransferase gene from two distinct nuclear polyhedrosis viruses isolated from Choristoneura fumiferana. J Gen Virol 76, 2447–2456.[Abstract/Free Full Text]

Bateman, A., Coin, L. L., Eddy, S. R. & 10 other authors (2004). The Pfam protein families database. Nucleic Acids Res 32, D138–D141.[Abstract/Free Full Text]

Bawden, A. L., Glassberg, K. J., Diggans, J., Shaw, R., Farmerie, W. & Moyer, R. M. (2000). Complete genomic sequencing of the Amsacta moorei entomopoxvirus: analysis and comparison with other poxviruses. Virology 274, 120–139.[CrossRef][Medline]

Benson, G. (1999). Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res 27, 573–580.[Abstract/Free Full Text]

Blissard, G. W., Black, B., Crook, N., Keddie, B. A., Posse, R., Rohrmann, G., Theilmann, D. & Volkman, L. (2000). Baculoviridae. In Virus Taxonomy; Seventh Report of the International Committee on Taxonomy of Viruses, pp. 195–202. Edited by M. H. V. van Regenmortel, C. M. Fauquet, D. H. L. Bishop, E. B. Carstens, M. K. Estes, S. M. Lemon, J. Maniloff, M. A. Mayo, D. J. McGeoch, C. R. Pringle & R. B. Wickner. San Diego: Academic Press.

Bulach, D. M., Kumar, C. A., Zaia, A., Liang, B. & Tribe, D. E. (1999). Group II nucleoployhedrovirus subgroups revealed by phylogenetic analysis of polyhedrin and DNA polymerase gene sequences. J Invertebr Pathol 73, 59–73.[CrossRef][Medline]

Burland, T. G. (2000). DNASTAR'S Lasergene sequence analysis software. Methods Mol Biol 132, 71–91.[Medline]

Chen, X., IJkel, W. F. J., Tarchini, R. & 8 other authors (2001). The sequence of the Helicoverpa armigera single nucleocapsid nucleopolyhedrovirus genome. J Gen Virol 82, 241–257.[Abstract/Free Full Text]

Chen, X., Zhang, W. J., Wong, J. & 9 other authors (2002). Comparative analysis of the complete genome sequences of Helicoverpa zea and Helicoverpa armigera single-nucleocapsid nucleopolyhedroviruses. J Gen Virol 83, 673–684.[Abstract/Free Full Text]

Crook, N. E., Clem, R. J. & Miller, L. K. (1993). An apoptosis-inhibiting baculovirus gene with a zinc finger-like motif. J Virol 67, 2168–2174.[Abstract/Free Full Text]

Das, S. & Maitra, U. (2000). Mutational analysis of mammalian translation initiation factor 5 (eIF5): role of interaction between the beta subunit of eIF2 and eIF5 in eIF5 function in vitro and in vivo. Mol Cell Biol 20, 3942–3950.[Abstract/Free Full Text]

de Jong, J. G., Lauzon, H. A. M., Dominy, C., Poloumienko, A., Carstens, E., Arif, B. M. & Krell, P. J. (2005). Analysis of the Choristoneura fumiferana nucleopolyhedrovirus genome. J Gen Virol 86, 929–943.[Abstract/Free Full Text]

Du, Q., Lehavi, D., Faktor, O., Qi, Y. & Chejanovsky, N. (1999). Isolation of an apoptosis suppressor gene of the Spodoptera littoralis nucleopolyhedrovirus. J Virol 73, 1278–1285.[Abstract/Free Full Text]

Ewing, B. & Green, P. (1998). Base-calling of automated sequencer traces using Phred. II Error probabilities. Genome Res 8, 186–194.[Abstract/Free Full Text]

Ewing, B., Hillier, L., Wendl, M. C. & Green, P. (1998). Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Res 8, 175–185.[Abstract/Free Full Text]

Faulkner, P., Kuzio, J., Williams, G. V. & Wildon, J. A. (1997). Analysis of p74, a PDV envelope protein of Autographa californica nucleopolyhedrovirus required for occlusion body infectivity in vivo. J Gen Virol 78, 3091–3100.[Abstract]

Fellin, D. G. & Dewey, J. E. (1999). Western Spruce Budworm. Forest insect and disease leaflet 53. USDA forest service.

Gaasterland, T. & Sensen, C. W. (1996). Fully automated genome analysis that reflects user needs and preferences. A detailed introduction to the MAGPIE system architecture. Biochimie 78, 302–310.[Medline]

Garcia-Maruniak, A., Maruniak, J. E., Zanotto, P. M. A., Doumbouya, A. E., Liu, J. C., Merritt, T. M. & Lanoie, J. S. (2004). Sequence analysis of the genome of the Neodiprion sertifer nucleopolyhedrovirus. J Virol 78, 7036–7051.[Abstract/Free Full Text]

Gasteiger, E., Gattiker, A., Hoogland, C., Ivanyi, I., Appel, R. D. & Bairoch, A. (2003). ExPASy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res 31, 3784–3788.[Abstract/Free Full Text]

Gattiker, A., Gasteiger, E. & Bairoch, A. (2002). ScanProsite: a reference implementation of a Prosite scanning tool. Appl Bioinformatics 1, 107–108.[Medline]

Gomi, S., Majima, K. & Maeda, S. (1999). Sequence analysis of the genome of Bombyx mori nucleopolyhedrovirus. J Gen Virol 80, 1323–1337.[Abstract]

Green, P. (1996). Documentation for Phrap (http://bozeman.mbt.washington.edu).

Green, P. (2004). CrossMatch (http://phrap.org/phrap.docs/general.html).

Harrison, R. L. & Bonning, B. C. (2003). Comparative analysis of the genomes of Rachiplusia ou and Autographa californica multiple nucleopolyhedrovirus. J Gen Virol 84, 1827–1842.[Abstract/Free Full Text]

Hashimoto, Y., Hayakawa, T., Ueno, Y., Fujita, T., Sano, Y. & Matsumoto, T. (2000). Sequence analysis of the Plutella xylostella granulovirus genome. Virology 275, 358–372.[CrossRef][Medline]

Hawtin, R. E., Zarkowska, T., Arnold, K., Thomas, C. J., Gooday, G. W., King, L. A., Kuzio, J. A. & Possee, R. D. (1997). Liquefaction of Autographa californica nucleopolyhedrovirus-infected insects is dependent on the integrity of virus-encoded chitinase and cathepsin genes. Virology 238, 243–253.[CrossRef][Medline]

Hayakawa, T., Ko, R., Okano, K., Seong, S. I., Goto, C. & Maeda, S. (1999). Sequence analysis of the Xestia c-nigrum granulovirus genome. Virology 262, 277–297.[CrossRef][Medline]

Hayakawa, T., Rohrmann, G. F. & Hashimoto, Y. (2000). Patterns of genome organization and content in lepidopteran baculoviruses. Virology 278, 1–12.[CrossRef][Medline]

Heldens, J. G. M., Broer, R., Zuidema, D., Goldbach, R. W. & Vlak, J. M. (1997). Identification and functional analysis of a non-hr origin of DNA replication in the genome of Spodoptera exigua multicapsid nucleopolyhedrovirus. J Gen Virol 78, 1497–1506.[Abstract]

Herniou, E. A., Luque, T., Chen, X., Vlak, J. M., Winstanley, D., Cory, J. S. & O'Reilly, D. R. (2001). Use of whole genome sequence data to infer baculovirus phylogeny. J Virol 75, 8117–8126.[Abstract/Free Full Text]

Herniou, E. A., Olszewski, J. A., Cory, J. S. & O'Reilly, D. R. (2003). The genome sequence and evolution of baculoviruses. Annu Rev Entomol 48, 211–234.[CrossRef][Medline]

Hu, Z. H., Arif, B. M., Jin, F., Martens, J. W. M., Chen, X. W., Sun, J. S., Zuidema, D. G., Goldbach, R. W. & Vlak, J. M. (1998). Distinct gene arrangement in the Buzura suppressaria single-nucleocapsid nucleopolyhedrovirus genome. J Gen Virol 79, 2841–2851.[Abstract]

Huang, J. & Levin, D. B. (1999). Identification and functional analysis of a putative non-hr origin of DNA replication from Spodoptera littoralis type B multinucleocapsid nucleopolyhedrovirus. J Gen Virol 80, 2263–2274.[Abstract/Free Full Text]

Hulo, N., Sigrist, C. J., Le Saux, V., Langendijk-Genevaux, P. S., Bordoli, L., Gattiker, A., De Castro, E., Bucher, P. & Bairoch, A. (2004). Recent improvements to the PROSITE database. Nucleic Acids Res 32, D134–D137.[Abstract/Free Full Text]

Hyink, O., Dellow, R. A., Olsen, M. J., Caradoc-Davies, K. M. B., Drake, K., Herniou, E. A., Cory, J. S. O'Reilly D. R. & Ward, V. K. (2002). Whole genome analysis of the Epiphyas postvittana nucleopolyhedrovirus. J Gen Virol 83, 957–971.[Abstract/Free Full Text]

IJkel, W. F. J., van Strien, E. A., Heldens, J. G. M., Broer, R., Zuidema, D., Goldbach, R. W. & Vlak, J. M. (1999). Sequence and organization of the Spodoptera exigua multicapsid nucleopolyhedrovirus genome. J Gen Virol 80, 3289–3304.[Abstract/Free Full Text]

Ikeda, M., Yanagimoto, K. & Kobayashi, M. (2004). Identification and functional analysis of Hyphantria cunea nucleopolyhedrovirus iap genes. Virology 321, 359–371.[CrossRef][Medline]

Jehle, J. A. (2002). The expansion of a hypervariable, non-hr ori-like region in the genome of Cryptophlebia leucotreta granulovirus provides in vivo evidence for the utilization of baculovirus non-hr oris during replication. J Gen Virol 83, 2025–2034.[Abstract/Free Full Text]

Kikhno, I., Gutierrez, S., Croizier, L., Croizier, G. & Ferber, M. L. (2002). Characterization of pif, a gene required for the per os infectivity of Spodoptera littoralis nucleopolyhedrovirus. J Gen Virol 83, 3013–3022.[Abstract/Free Full Text]

Kool, M., Goldbach, R. W. & Vlak, J. M. (1994). A putative non-hr origin of DNA replication in the HindIII-K fragment of Autographa californica multiple nucleocapsid nuclear polyhedrosis virus. J Gen Virol 75, 3345–3352.[Abstract/Free Full Text]

Kurtz, S. & Schleiermacher, C. (1999). REPuter fast computation of maximal repeats in complete genomes. Bioinformatics Applications Note 15, 426–427.[Abstract/Free Full Text]

Kuzio, J., Pearson, M. N., Harwood, S. H., Funk, C. J., Evans, J. T., Slavicek, J. M. & Rohrmann, G. F. (1999). Sequence and analysis of the genome of a baculovirus pathogenic for Lymantria dispar. Virology 253, 17–34.[CrossRef][Medline]

Landschulz, W. H., Johnson, P. F. & McKnight, S. L. (1988). The leucine zipper: a hypothetical structure common to a new class of DNA binding proteins. Science 240, 1759–1764.[Abstract/Free Full Text]

Lange, M. & Jehle, J. A. (2003). The genome of the Cryptophlebia leucotreta granulovirus. Virology 317, 220–236.[CrossRef][Medline]

Lauzon, H. A. M., Lucarotti, C. J., Krell, P. J., Feng, Q., Retnakaran, A. & Arif, B. M. (2004). Sequence and organization of the Neodiprion lecontei nucleopolyhedrovirus genome. J Virol 78, 7023–7035.[Abstract/Free Full Text]

Lauzon, H. A. M., Jamieson, P. B., Krell, P. J. & Arif, B. M. (2005). Gene organization and sequencing of the Choristoneura fumiferana defective nucleopolyhedrovirus genome. J Gen Virol 86, 945–961.[Abstract/Free Full Text]

Li, L., Donly, C., Li, Q., Willis, L. G., Keddie, B. A., Erlandson, M. A. & Theilmann, D. A. (2002a). Identification and genomic analysis of a second species of nucleopolyhedrovirus isolated from Mamestra configurata. Virology 297, 226–244.[CrossRef][Medline]

Li, Q., Donly, C., Li, L., Willis, L. G., Theilmann, D. A. & Erlandson, M. (2002b). Sequence and organization of the Mamestra configurata nucleopolyhedrovirus genome. Virology 294, 106–121.[CrossRef][Medline]

Luque, L., Finch, R., Crook, N., O'Reilly, D. R. & Winstanley, D. (2001). The complete sequence of the Cydia pomonella granulovirus genome. J Gen Virol 82, 2531–2547.[Abstract/Free Full Text]

Mulder, N. J., Apweiler, R., Attwood, T. K. & 34 other authors (2003). The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res 31, 315–318.[Abstract/Free Full Text]

Nakai, M., Goto, C., Kang, W., Shikata, M., Luque, T. & Kunimi, Y. (2003). Genome sequence and organization of a nucleopolyhedrovirus isolated from the smaller tea tortrix, Adoxophyes honmai. Virology 316, 171–183.[CrossRef][Medline]

Nicholas, K. B. & Nicholas, H. B. (1997). GeneDoc: a tool for editing and annotating multiple sequence alignments. (Distributed by authors.) Version 2.6.002.

O'Brien, V. (1998). Viruses and apoptosis. J Gen Virol 79, 1833–1845.[Medline]

O'Reilly, D. R. (1997). Auxiliary genes of baculoviruses. In The Baculoviruses, pp. 267–300. Edited by L. K. Miller. New York: Plenum.

Ornitz, D. M. & Nobuyuki, I. (2001). Protein family review: fibroblast growth factors. Genome Biol 2, 3005.1–3005.12.

Pang, Y., Yu, J., Wang, L. & 7 other authors (2001). Sequence analysis of the Spodoptera litura multicapsid nucleopolyhedrovirus genome. Virology 287, 391–404.[CrossRef][Medline]

Pearson, M. N., Bjornson, R. M., Ahrens, C. & Rohrmann, G. F. (1993). Identification and characterization of a putative origin of DNA replication in the genome of a baculovirus pathogenic for Orgyia pseudotsugata. Virology 197, 715–725.[CrossRef][Medline]

Pei, Z., Reske, G., Huang, Q., Hammock, B. D., Qi, Y. & Chejanovsky, N. (2002). Characterization of the apoptosis suppressor protein P49 from the Spodoptera littoralis nucleopolyhedrovirus. J Biol Chem 277, 48677–48684.[Abstract/Free Full Text]

Pijlman, G. P., Pruijssers, A. J. P. & Vlak, J. M. (2003). Identification of pif-2, a third conserved baculovirus gene required for per-os infection of insects. J Gen Virol 84, 2041–2049.[Abstract/Free Full Text]

Prestridge, D. S. (1995). Predicting Pol II promoter sequences using transcription factor binding sites. J Mol Biol 249, 923–932.[CrossRef][Medline]

Rashidan, K. K., Nassoury, N., Giannopoulos, P. N., Mauffette, Y. & Guertin, C. (2004a). Identification, characterization, and phylogenic analysis of conserved genes within odvp-6e/odv-e56 gene region of Choristoneura fumiferana granulovirus. J Biochem Mol Biol 37, 206–212.[Medline]

Rashidan, K. K., Nassoury, N., Giannopoulos, P. N., Mauffette, Y. & Guertin, C. (2004b). Identification, characterization and phylogenic analysis of conserved genes within the p74 gene region of Choristoneura fumiferana granulovirus genome. J Biochem Mol Biol 37, 700–708.[Medline]

Ruepp, S., Kurath, U., Renggli, C. K., Drun, R. & Roditi, I. (1999). Glutamic acid/alanine-rich protein from Trypanosoma congolense is the functional equivalent of ‘EP’ procyclin from Trypanosoma brucei. Mol Biochem Parasitol 98, 151–156.[CrossRef][Medline]

Schultz, J., Milpetz, F., Bork, P. & Ponting, C. P. (1998). SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci U S A 95, 5857–5864.[Abstract/Free Full Text]