J Gen Virol Email Content Delivery
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Gen Virol 87 (2006), 1595-1602; DOI 10.1099/vir.0.81585-0

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Sanjuán, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sanjuán, R.
Agricola
Right arrow Articles by Sanjuán, R.
© 2006 Society for General Microbiology

Quantifying antagonistic epistasis in a multifunctional RNA secondary structure of the Rous sarcoma virus

Rafael Sanjuán

Instituto de Biología Molecular y Celular de Plantas, CSIC-UPV, 46022 Valencia, Spain

Correspondence
Rafael Sanjuán
rafsaver{at}ibmcp.upv.es


   ABSTRACT
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES
 
Recent studies have suggested that antagonistic epistasis (i.e. mutations having smaller effects in combination than alone) may be common among RNA viruses, in contrast to other biological systems. Here, by re-analysing previously published data from a random viral library, selection and epistasis coefficients were estimated in the U5-IR stem and loop of the Rous sarcoma virus, a region that adopts a conserved secondary structure and is involved in various essential steps of viral infection. The estimated mutational fitness effects are extremely high and genetic interactions are antagonistic on average. This pattern might be representative of RNA virus genomes, which show high compaction and frequent secondary structures. The implications for RNA virus adaptability are explored.


   INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES
 
Epistasis occurs when mutations interact in their fitness effects (Wolf et al., 2000Down). Epistatic selection can be demonstrated by testing whether mutations act multiplicatively on fitness-related traits (Elena & Lenski, 1997Down; Whitlock & Bourguet, 2000Down; Sanjuán et al., 2004bDown). For deleterious mutations, if the observed fitness is above the expected multiplicative value, i.e. if mutations lessen their impact as they accumulate, epistasis is antagonistic. On the contrary, if mutations are more harmful as they accumulate, epistasis is synergistic. Genetic interactions are believed to play a central role in evolution, for they should be involved in basic processes as the genesis of genetic diversity (Wright, 1931Down, 1982Down), speciation (Dobzhansky, 1936Down; Muller, 1939Down), the origin and maintenance of recombination (Crow, 1970Down; Kondrashov, 1988Down), or the accumulation of deleterious mutations through genetic drift (Kondrashov, 1994Down; Butcher, 1995Down).

In contrast to the number of theoretical studies, there are still relatively few data on the nature of epistasis in real populations. Many experimental studies have sought to detect epistatic selection in eukaryotes and prokaryotes and, alongside some cases of synergistic epistasis (Mukai, 1969Down; de Visser et al., 1996Down; Whitlock & Bourguet, 2000Down), no deviation from multiplicative effects has been observed in other cases (Elena & Lenski, 1997Down; de Visser & Hoekstra, 1998Down; Elena, 1999Down; Elena & Lenski, 2001Down; Burch et al., 2003Down). Recent studies have provided new insights that antagonistic epistasis might be the norm among RNA viruses, in contrast to other biological systems (Burch & Chao, 2004Down; Sanjuán et al., 2004bDown; Bonhoeffer et al., 2004Down). First, after carrying out a mutation accumulation experiment, Burch & Chao (2004)Down reported that, in bacteriophage {phi}6, mutations are less harmful in low fitness genetic backgrounds than in high fitness backgrounds. Second, after performing a site-directed mutagenesis study in the vesicular stomatitis virus (VSV), Sanjuán et al. (2004b)Down concluded that, on average, the fitness of genotypes carrying two random mutations was above the multiplicatively expected value. Finally, Bonhoeffer et al. (2004)Down measured the productivity of nearly 10 000 human immunodeficiency type 1 virus genotypes carrying mutations that had accumulated in patients, predominately on drug therapy, in the reverse transcriptase and protease genes, concluding that, on average, fitness was higher than expected in the absence of epistasis.

The success in finding epistasis among RNA viruses, in contrast to more complex genomes, might stem from technical reasons, namely the ability to perform a higher number of replicates and to manipulate genomes more finely, but it could also reflect some specific genetic features. First, RNA viruses have extremely compact genomes, with little redundancy, few non-essential regions and frequent gene overlapping. Hence, most mutations are likely to affect a large portion of the whole set of viral functions and as a consequence, single mutations can be strongly deleterious (Domingo et al., 1996Down, 2001Down; Novella, 2003Down; Sanjuán et al., 2004aDown). Conversely, multiple mutations are likely to hit repeatedly the same functions. In this latter case, mutational effects can overlap with a higher chance than if non-related genes are hit, thus producing antagonistic epistasis (Wilke et al., 2003Down; Elena et al., 2006Down). Second, RNA frequently adopts secondary structures, with hairpin stems formed by base-pairing stretches. Single mutations can destabilize these structures, but there exists the possibility that subsequent mutations, despite their standalone deleterious effects, restore base-pairing, thereby compensating the effects of previous mutations. Here, I focus on the U5-IR region of the Rous sarcoma virus (RSV), which is adjacent to the primer binding site and adopts a conserved stem and loop structure. The region is multifunctional and essential for virus survival, since it is needed for efficient initiation of reverse transcription, encodes the integrase recognition site at the end of the U5 viral DNA and, may have a role in RNA packaging. Taking advantage of a previously published random library dataset (Johnson et al., 2004Down), I developed a simple method for estimating the intensity of selection and quantifying epistasis in this region of RSV. As expected, the data are consistent with strong mutational effects and antagonistic epistasis.


   METHODS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES
 
Rationale.
In a randomized library, PCR-based mutagenesis with degenerate primers is used to convert a short target sequence into a random stretch, where all four nucleotides are found with equal probability. Thus, for a target of length L, a swarm of 4L different allelic variants is created. Following mutagenesis, viral particles carrying the randomized sequence are recovered by transfection. Since this moment, selection will modify the nucleotide frequency at each position, removing unviable genotypes and pushing the fittest genotype towards fixation. The mode and timing by which the nucleotide composition of the population is driven by selection can be used to estimate selection and epistasis coefficients.

Model.
Given that for each target site, initially all four nucleotides have the same frequency, the probability of finding a nucleotide other than that of the wild-type is 0.75 for each site. Hence, in the target sequence, the number of differences to the wild-type, i, follows a binomial distribution,Down


Formula 001

(1)

In the absence of selection, this initial distribution will remain unchanged if newly arising mutations are neglected (this assumption is reasonable in the short term, because the mutant frequency is high enough not to be noticeably affected by de novo mutation). Similarly, genetic recombination can be neglected provided that the target stretch is short enough to ensure a quasi-complete linkage between nucleotide positions during the time scale in which the experiment takes place. Here, the target is no more than 9 nt long, and less than 10 generations elapse during the evolution experiment. The model is deterministic since selection is the only factor assumed to modify the initial mutant frequency distribution. Under the multiplicative effects model, the expected fitness, E(W), associated to genotypes carrying i mutations isDown


Formula 002

(2)
where s is the selection coefficient against deleterious mutations (0<=s<=1), here assumed to be constant for simplicity. After g generations, the frequency of genotypes carrying i mutations will be reduced relative to the wild-type sequence by a factor of (1–s)gi. The frequency of genotypes carrying i mutations after g generations will beDown


Formula 003

(3)
where C is a constant satisfying Formula .

In the presence of epistasis, the expected fitness of genotypes carrying i mutations can be written asDown


Formula 004

(4)
(Wilke et al., 2003Down), where e stands for a mean epistasis coefficient. Epistasis is predominantly synergistic if e<0, effects are multiplicative if e=0 and, if 0<e<=1, epistasis is antagonistic. After g generations, the frequency of genotypes carrying i mutations will be reduced relative to the wild-type by a factor of (1–s)gi(1–e). The frequency of genotypes carrying i mutations after g generations will beDown


Formula 005

(5)
where C' satisfies Formula .

Parameter estimation.
The likelihood of observing i mutations is given by equation 1 for the neutral model, equation 3 for the multiplicative model and equation 5 for the epistatic model, where parameters L and g are experimentally given. For the multiplicative model, parameter s can be estimated, given the data, by maximum-likelihood (ML), assuming that all observations are independent and that hence the overall likelihood is the product of individual likelihoods. For the epistatic model, the parameter vector (s, e) can be estimated in the same manner. Numerical calculations can be done as follows: N, L, g, the number of possible alleles per site (here, four), plus initial values for the selection coefficient (s0) and the epistasis coefficient (e0), when necessary, are set as input parameters. A precision parameter, p, is also defined. In the first round, all combinations in the range s0±10p0 and e0±10p0 are explored in a discrete manner with step size p0. In all cases, I have used s0=0.5, e0=0 and p0=0.05. The output consists of a maximum-likelihood (ML) s1 value (multiplicative model) or a (s1, e1) vector (epistatic model) and its associated log-likelihood. After this first round, s1 or (s1, e1) can be used as new input and p can be set to a lower value to obtain more precise ML estimators.

Model test and confidence intervals.
The neutral, multiplicative and epistatic models are nested and therefore can be compared using a likelihood ratio test (LRT) (Sokal & Rohlf, 2000Down). If Ln, Lm and Le are the ML values corresponding to the neutral, multiplicative and epistatic model, respectively, quantities 2(logLm–logLn) and 2(log Le–log Lm) follow a {chi}2 distribution with one degree of freedom under the null hypotheses s=0 and e=0, respectively. More sophisticated methods for model selection (Johnson & Omland, 2004Down) always led to the same conclusions. The bounds of 95 % confidence intervals can be estimated as the parameter values for which the likelihood of the model drops two log-units. This ‘two logs' rule has indeed the same basis as the LRT, since two is approximately half the percentile 95 % of a {chi}2 distribution with one degree of freedom. Confidence intervals are approximately symmetrical and thus they are given in the form of a simple error through the text.

RSV dataset.
Fig. 1Down shows the RSV U5-IR wild-type primary and secondary structure according to Johnson et al. (2004)Down. In this work, the stem and loop were subjected to random mutagenesis in separate experiments. The U5-IR stem library included positions 83–86 and 96–99, hence 48 combinations were possible. The U5-IR loop library included positions 87–95, thus 49 combinations were possible. At each step of the library construction, transfection and in vivo culture, at least 106 clones were sampled and therefore, both libraries should include all possible sequences. Johnson et al. (2004)Down explored the possibility of a bias in the sequence composition by sequencing 37 individual clones from the U5-IR loop library and no deviations from randomness were observed.


Figure 1
View larger version (13K):
[in this window]
[in a new window]
 
Fig. 1. The U5-IR stem and loop, located around the primer binding site, at the 5' end of RSV genome (modified from Johnson et al., 2004Down). This structure is required for efficient initiation of reverse transcription and for DNA integration. The shaded regions indicate nucleotides that were randomized in the U5-IR stem (dark grey) and U5-IR loop (light grey).

 
In the original work, following mutagenesis and transfection, infection passages were performed in plates containing ~2x106 cells at an m.o.i. of 0.2 particles per cell. Given that host cells are transformed by RSV, the initially infected cells remain virus-productive until the end of an infection passage, here lasting 72 h. Approximately 3–4 viral particles are released per h per cell (J. Leis, personal communication), yielding approximately 108 particles at the end of each passage. Given that viral absorption takes ~1 h, when particles from the second infection cycle are released, all cells should be already infected with viruses released from the first cycle, thus preventing additional infection cycles. This means that ~20 % of particles produced after an infection passage came from the primarily infected cells, whereas the remaining 80 % came from the second cycle. Therefore, on average, ~1.8 infection cycles took place per passage. Although the definition of a generation in viruses is not straightforward, here, I follow the convention that a generation matches an infection cycle. This convention is based on the fact that synthesis of new viral particles from progenitor particles is concomitant to the release of daughter particles from the host cell. The possibility that more than a single replication round could occur within a cell is ignored.

Two infection passages were carried out by Johnson et al. (2004)Down for the U5-IR stem experiment, that is, g ~3.6 viral generations were completed, whereas for the case of U5-IR loop, five passages were done, thus g ~9 generations were completed. For the U5-IR stem, of 50 clones sequenced, 31 had a wild-type genotype, whereas 19 retained one to eight mutations. For the U5-IR loop, of 37 viral clones, five clones had a wild-type sequence, whereas 32 retained one to nine mutations.

Accounting for heterogeneity in the number of generations.
The above model assumes that all individuals have undergone the same number of generations. This would be true for m.o.i.=1 particle per cell, since a single infection could take place per infection passage. Provided that no more than two generations are completed in each infection, in general, for m.o.i.=m and after n passages, the fraction of the population undergoing g generations can be calculated asDown


Formula 006

(6)
Formula Equation 3 is thus modified as follows:Down


Formula 007

(7)

Analogously, equation 5 becomesDown


Formula 008

(8)

For a given m-value, ML estimation of parameters s and e, confidence intervals and LRT were done in the same manner as for the model that assumed no heterogeneity in the number of generations.


   RESULTS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES
 
Fit to alternative models and parameter estimation
Fig. 2Down(a) shows the expected frequency of each mutational class under the assumption of neutrality, superposed to the observed frequencies for the U5-IR stem. The log-likelihood for the neutral model (equation 1) is logLn=–402.085. The multiplicative model (equation 3; Fig. 2bDown) provides a substantially better fit, with a log-likelihood of logLm=–140.556. This improvement is highly significant according to an LRT ({chi}2=523.058, P<0.001). The estimated selection coefficient associated to single-nucleotide substitutions is s=0.494±0.035. In turn, the fit is largely improved by introducing epistasis in the model (equation 5; Fig. 2cDown), yielding logLe=–73.084 ({chi}2=134.944, P<0.001). This improvement seems to be due to the ability of the latter model to account for the high frequency of wild-type sequences in the population and the bimodal shape of the observed mutant frequency. For this model, the estimated mean selection coefficient associated to single-nucleotide substitutions is s=0.838±0.002, significantly larger than for the multiplicative effects model, and the mean epistatic coefficient is highly positive (e=0.651±0.033).


Figure 2
View larger version (14K):
[in this window]
[in a new window]
 
Fig. 2. Observed (black thin bars) and predicted (grey bars) mutant frequencies for the U5-IR stem after two infection passages. (a) Neutral model. (b) Multiplicative model. (c) Epistatic model.

 
Results for the U5-IR loop are analogous to those obtained for the U5-IR stem. Fig. 3Down(a) shows the expected frequency of each mutational class under the assumption of neutrality and the observed frequencies for the U5-IR loop. The log-likelihood for the neutral model is logLn=–141.058. The multiplicative model (Fig. 3bDown) fits the data significantly better (logLm=–101.061, {chi}2=79.994, P<0.001) and the estimated selection coefficient associated to single-nucleotide substitutions in this region is s=0.106±0.019. Nonetheless, the fit to the multiplicative model is improved by the epistatic model (logLe=–77.878, {chi}2=46.366, P<0.001; Fig. 3cDown). As was the case for the stem region, this better fit seems related to the ability to account for the high wild-type frequency and the bimodal shape of the observed mutant frequency. For the epistatic model, the estimated selection coefficient associated to single-nucleotide substitutions is s=0.477±0.027 and the epistatic coefficient is e=0.666±0.051.


Figure 3
View larger version (15K):
[in this window]
[in a new window]
 
Fig. 3. Observed (black thin bars) and predicted (grey bars) mutant frequencies for the U5-IR loop after five infection passages. (a) Neutral model. (b) Multiplicative model. (c) Epistatic model.

 
Accounting for heterogeneity in the number of generations
Since the m.o.i. is lower than one particle per cell, more than a single generation can be completed per infection passage. Some viruses will undergo one generation each passage, but others will have the opportunity to undergo a second cycle. This heterogeneity can be accounted for in both multiplicative (equation 6) and epistatic models (equation 7), whereas the neutral model remains unchanged. For an m.o.i. of 0.2 particle per cell, the multiplicative model gives an estimate of s=0.606±0.038 for the U5-IR stem and s=0.114±0.025 for the U5-IR loop. In both cases, the multiplicative model provides a significantly better fit than the neutral model (Table 1Down). Indeed, the estimated selection coefficients are even larger than those obtained with models that do not account for heterogeneity in the number of generations. The epistatic model gives an estimate of s=0.871±0.019 and e=0.552±0.044 for the U5-IR stem, whereas s=0.488±0.036 and e=0.644±0.050 for the U5-IR loop. In both cases, the fit is significantly better than for the multiplicative model (Table 1Down). In sum, the results are fully consistent with the simpler models that do not account for heterogeneity in the number of generations.


View this table:
[in this window]
[in a new window]
 
Table 1. ML estimations, for different m.o.i. values, of the selection and epistasis coefficients using models that account for heterogeneity in the number of generations

The error terms correspond to 95 % confidence intervals. For the neutral model, logLn=–402.085. In all cases, according to LRT, the multiplicative model provides a significantly better fit than the neutral model, which in turn is significantly improved by the epistatic model (P<0.0001 in all cases).

 
The above estimates rely on an m.o.i. that is subject to experimental error. To explore to what extent changes in this parameter can affect the results, the m.o.i. has been varied from 0.1 to 1.0 particle per cell. As the m.o.i. increases, changes in genotype frequencies take place in lower numbers of generations and thus, the estimated selection coefficients increase. However, in the range 0.1–1.0, the choice of the m.o.i. value does not substantially affect the results: selection is strong and epistasis is predominantly antagonistic (Table 1Up). Analogous results could be obtained by letting the parameter g vary in the models that do not account for heterogeneity in the number of generations (not shown).

Inferred evolutionary dynamics
The mean population fitness, the variance in the number of mutations and the frequency of the wild-type sequence provide an overview of the population dynamics viruses would follow according to the models considered above. In doing that, I have focused on the experimentally ideal situation where the m.o.i. is one particle per cell. In this particular case, models that do not account for heterogeneity in the number of generations and models that do account for this heterogeneity are equivalent. The mean fitness of the population can be calculated from the expected fitness (equations 2 and 4) and frequency (equations 3 and 5) of each mutant class. The variance in the number of mutations per genotype is Formula , {epsilon} being the mean number of mutations per genotype in the entire population. Finally, the frequency of the wild-type in the population is directly given by equations 3 and 5. I have explored the expected dynamics of adaptation under the multiplicative and epistatic models, setting both s and e to their corresponding ML estimates. Fig. 4Down shows changes in mean population fitness and variance in mutation number for the U5-IR stem and loop. Although the rate of adaptation is initially faster for the multiplicative model, the situation is reversed after few generations and the time to reaching the maximal fitness is shorter for the epistatic model. The effect of including epistasis has two effects on the model. First, it inflates the estimated selection coefficient favouring the fittest sequence. Second, the maximal diversity in the number of mutations reached by the population is larger for positive epistatic coefficients, because genotypes carrying relatively large numbers of mutations are transiently more abundant due to their milder selective disadvantage.


Figure 4
View larger version (12K):
[in this window]
[in a new window]
 
Fig. 4. Expected change in mean population fitness (a, c) and variance in mutation number (b, d) through time according to the multiplicative (black circles) and epistatic (white circles) models. Parameters were set to their ML-value. (a, b) U5-IR stem. (c, d) U5-IR loop.

 
More generally, I have explored a large region of the parameter space letting s and e move simultaneously along intervals (0, 1) and (–1, 1), respectively. As expected, the selection coefficient strongly determines the rate at which the population reaches its optimum. Although negative epistasis coefficients strengthen the advantage of the wild-type, whereas positive coefficients weaken it, epistasis modifies the rate of adaptation to a much lesser extent than does the selection coefficient (Fig. 5Down). Similar results are obtained for the fixation rate of the wild-type (not shown). In contrast, the maximum variance in the number of mutations along the selective sweep is more strongly influenced by the epistasis coefficient than by the selection coefficient (Fig. 5Down).


Figure 5
View larger version (48K):
[in this window]
[in a new window]
 
Fig. 5. Joint effect of selection and epistasis coefficients upon the rate of adaptation and the maximum variance in mutation number along the fixation process. The rate of adaptation was calculated as the inverse of the time necessary to reach 0.99 mean population fitness.

 

   DISCUSSION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES
 
RNA viruses show lower genetic robustness than more complex DNA organisms (Elena et al., 2006Down). In mutation accumulation studies, drastic fitness losses, ranging from 22 % in bacteriophage {phi}6 (Chao, 1990Down) to 82 % in Human immunodeficiency virus 1 (Yuste et al., 1999Down), have been observed. Burch & Chao (2004)Down estimated selection coefficients ranging from s=0.054 in phage {phi}6 to s=0.230 in poliovirus 1, well above values reported for more complex systems (Keightley, 1994Down; Kibota & Lynch, 1996Down; Elena et al., 1998Down; Bruggeman et al., 2003Down). These differences are supported by a site-directed mutagenesis study (Sanjuán et al., 2004aDown), where a selection coefficient of s=0.514 was estimated for single-nucleotide substitutions in VSV. High sensitivity to mutation is further supported by the lack of neutrality associated to synonymous changes (Novella et al., 2004Down). In general, the genetic basis for this reduced robustness might lie on the compactness of RNA virus genomes and the abundance of secondary structures displaying a regulatory function (Novella, 2003Down). Here, selection coefficients of s=0.838 and s=0.477 have been estimated for the U5-IR stem and loop of RSV, respectively. These are non-translated regions, but carry out essential functions for virus survival. The larger value obtained for the U5-IR stem is in accordance with the stronger constraints imposed by the necessity of maintaining a tight secondary structure. On one hand, these results are a good example of low genetic robustness in RNA viruses. On the other hand, it must be acknowledged that they are based on a single region and thus might not be representative of the entire RSV genome.

Three recent studies have reported antagonistic epistasis in RNA viruses (Burch & Chao, 2004Down; Sanjuán et al., 2004bDown; Bonhoeffer et al., 2004Down). Given the heterogeneity of the viral models employed, it seems very plausible that antagonistic epistasis among deleterious mutations is a general feature of RNA viruses. Although fitness is defined in a relative scale in which there is room for accumulating unlimited deleterious effects, in genomes with relatively few functions as those of RNA viruses, mutations are likely to hit repeatedly the same function. When this occurs, the deleterious effects caused by different mutations can partially overlap, hence producing antagonistic epistasis. Interestingly, low genetic robustness and antagonistic epistasis might be related genome properties. In RNA viruses, single mutations can cause significant fitness losses but, after the initial loss, further mutations may have a comparatively lesser impact. Supporting this possibility, theoretical work, RNA folding (Wilke & Adami, 2001Down) and bacteriophage T7 (You & Yin, 2002Down) simulations have shown that selection coefficients associated to single mutations are highly correlated to epistasis coefficients. Antagonistic epistasis can be a consequence of abundant RNA secondary structures as well. The latter play a key role in the regulation of gene expression (Hofacker et al., 2004Down). Indeed, compensatory mutations have been proven to cause an excess of antagonistic epistasis in computer simulations with RNA secondary structures (Wilke et al., 2003Down). Johnson et al. (2004)Down showed that some nucleotides in the U5-IR region influence selection of nucleotides at nearby positions, mainly leading to the re-establishment of base pairing after mutagenesis. The results obtained here support these conclusions.

The epistatic model has been designed to describe the adaptive dynamics of a randomized sequence. When populations face new environments, for instance when a virus is challenged by the immunological system or invades a new host, the alleles at some loci (as for example a region encoding an epitope) can become a pool of highly unadapted variants. Hence, the picture of an initially random sequence stretch seems biologically relevant. However, the generality of the model is limited by the assumptions of no genetic drift, no recombination and constant selection coefficients. The first of these assumptions is reasonable in the U5-IR experimental setting, where population sizes were kept high enough to ensure that the effect of drift is negligible. The same is true for recombination, since the mutated loci are tightly linked and the number of experimental passages is low. Finally, mutational effects are obviously variable and hence, the assumption of constant selection and epistasis coefficients is not realistic. Though assessing how variation in selection coefficients might affect the reliability of the analysis would require a more extensive mathematical analysis, some verbal arguments can be made. In the first approach, the model assumed that all viruses at the end of the experiment had undergone a constant number of generations. However, for m.o.i. lower than one particle per cell, after several passages, the population conforms to a pool of lineages that have gone through different numbers of generations and consequently, have experienced variable amounts of selection. For example, for the U5-IR loop library, five discrete classes can be distinguished. Presumably, the effect of introducing variable selection coefficients could be similar to that of introducing variable numbers of generations, since analogously, the model would allow for classes of genotypes under different selection intensities. Regarding epistasis, the present analysis can only measure its overall direction and hence, the results do not preclude the existence of synergistic interactions. Though from a molecular standpoint, it is straightforward that both synergistic and antagonistic interactions must be common, from an evolutionary standpoint, the question of whether mean deviations from multiplicative effects occur is more relevant.

Keeping the limitations of the model in mind, it is predicted that large selection coefficients and strong antagonistic epistasis facilitate fast adaptation, while retaining a transiently high genetic variance. These predictions are in agreement with RNA virus high diversity and rapid adaptation to changing environments (Holland et al., 1982Down; Novella et al., 1995Down; Domingo & Holland, 1997Down). So-called ‘memory genomes’, defined as variants that were formerly fitter but are now maintained at a low frequency in the population (but still higher than expected under a simple mutation-selection balance), have been reported in RNA viruses (Ruiz-Jarabo et al., 2000Down). According to Figs 2 and 3UpUp, some genotypes genetically distant from the wild-type, probably located on the top of neighbouring adaptive peaks, can be transiently maintained in the population at relatively high frequencies, hence creating a multimodal mutant frequency distribution and potentially facilitating re-adaptation to former environments. In contrast, experiments carried out with bacteria have shown that mutations conferring adaptation to specific niches are costly in terms of adaptability (Buckling et al., 2003Down). Whether differences in genomic organization and epistasis are responsible for differences between RNA virus and DNA organism adaptability is an important issue that still remains to be elucidated. I suggest that after episodes of genetic drift, intense mutagenesis or environmental changes, RNA virus populations could reach suboptimal fitness peaks and that, due to antagonistic epistasis, these suboptimal variants could be maintained in the population, increasing its diversity and, together with high selection coefficients, accelerating adaptation to novel environments.


   ACKNOWLEDGEMENTS
 
This work was supported by an I3P postdoctoral contract from CSIC, Spain. I thank two anonymous referees for their constructive comments on the manuscript.


   REFERENCES
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES
 
Bonhoeffer, S., Chappey, C., Parkin, N. T., Whitcomb, J. M. & Petropoulos, C. J. (2004). Evidence for positive epistasis in HIV-1. Science 306, 1547–1550.[Abstract/Free Full Text]

Bruggeman, J., Debets, A. J., Wijngaarden, P. J., de Visser, J. A. G. M. & Hoekstra, R. F. (2003). Sex slows down the accumulation of deleterious mutations in the homothallic fungus Aspergillus nidulans. Genetics 164, 479–485.[Abstract/Free Full Text]

Buckling, A., Wills, M. A. & Colegrave, N. (2003). Adaptation limits diversification of experimental bacterial populations. Science 302, 2107–2109.[Abstract/Free Full Text]

Burch, C. L. & Chao, L. (2004). Epistasis and its relationship to canalization in the RNA virus {phi}6. Genetics 167, 559–567.[Abstract/Free Full Text]

Burch, C. L., Turner, P. E. & Hanley, K. A. (2003). Patterns of epistasis in RNA viruses: a review of the evidence from vaccine design. J Evol Biol 16, 1223–1235.[CrossRef][Medline]

Butcher, D. (1995). Muller's ratchet, epistasis and mutation effects. Genetics 141, 431–437.[Abstract]

Chao, L. (1990). Fitness of RNA virus decreased by Muller's ratchet. Nature 348, 454–455.[CrossRef][Medline]

Crow, J. F. (1970). Genetic loads and the cost of natural selection. In Mathematical Topics in Population Genetics, pp. 128–177. New York: Springer.

de Visser, J. A. G. M. & Hoekstra, R. F. (1998). Synergistic epistasis between loci affecting fitness: evidence in plants and fungi. Genet Res 71, 39–49.[CrossRef]

de Visser, J. A. G. M., Hoekstra, R. F. & van den Ende, H. (1996). The effect of sex and deleterious mutations on fitness in Chlamydomonas. Proc R Soc Lond B Biol Sci 263, 193–200.

Dobzhansky, T. H. (1936). Studies on hybrid sterility. II. Localization of sterility factors in Drosophila pseudoobscura hybrids. Genetics 21, 113–135.[Free Full Text]

Domingo, E. & Holland, J. J. (1997). RNA virus mutations and fitness for survival. Annu Rev Microbiol 51, 151–178.[CrossRef][Medline]

Domingo, E., Escarmís, C., Sevilla, N., Moya, A., Elena, S. F., Quer, J., Novella, I. S. & Holland, J. J. (1996). Basic concepts in RNA virus evolution. FASEB J 10, 859–864.[Abstract]

Domingo, E., Biebricher, C. K., Eigen, M. & Holland, J. J. (2001). Quasispecies and RNA Virus Evolution: Principles and Consequences. Edited by Landes Bioscience, Austin, Texas.

Elena, S. F. (1999). Little evidence for synergism among deleterious mutations in a nonsegmented RNA virus. J Mol Evol 49, 703–707.[CrossRef][Medline]

Elena, S. F. & Lenski, R. E. (1997). Test of synergistic interactions among deleterious mutations in bacteria. Nature 390, 395–398.[CrossRef][Medline]

Elena, S. F. & Lenski, R. E. (2001). Epistasis between new mutations and genetic background and a test of genetic canalization. Evolution Int J Org Evolution 55, 1746–1752.[CrossRef][Medline]

Elena, S. F., Ekunwe, L., Hajela, N., Oden, S. A. & Lenski, R. E. (1998). Distribution of fitness effects caused by random insertion mutations in Escherichia coli. Genetica 102–103, 349–358.[CrossRef]

Elena, S. F., Carrasco, P., Daròs, J. A. & Sanjuán, R. (2006). Mechanisms of genetic robustness in RNA viruses. EMBO Rep 7, 168–173.[CrossRef][Medline]

Hofacker, I. L., Stadler, P. F. & Stocsits, R. R. (2004). Conserved RNA secondary structures in viral genomes: a survey. Bioinformatics 20, 1495–1499.[Abstract/Free Full Text]

Holland, J. J., Spindler, K., Horodyski, F., Grabau, E., Nichol, S. & VandePol, S. (1982). Rapid evolution of RNA genomes. Science 215, 1577–1585.[Abstract/Free Full Text]

Johnson, J. B. & Omland, K. S. (2004). Model selection in ecology and evolution. Trends Ecol Evol 19, 101–108.[CrossRef][Medline]

Johnson, M., Morris, S., Chen, A., Stavnezer, E. & Leis, J. (2004). Selection of functional mutations in the U5-IR stem and loop regions of the Rous sarcoma virus genome. BMC Biol 2, 8.[Medline]

Keightley, P. D. (1994). The distribution of mutation effects on viability in Drosophila melanogaster. Genetics 138, 1315–1322.[Abstract]

Kibota, T. T. & Lynch, M. (1996). Estimate of the genomic mutation rate deleterious to overall fitness in E. coli. Nature 381, 694–696.[CrossRef][Medline]

Kondrashov, A. S. (1988). Deleterious mutations and the evolution of sexual reproduction. Nature 336, 435–440.[CrossRef][Medline]

Kondrashov, A. S. (1994). Muller's ratchet under epistatic selection. Genetics 136, 1469–1473.[Abstract]

Mukai, T. (1969). The genetic structure of natural populations of Drosophila melanogaster. VII. Synergistic interaction of spontaneous mutant polygenes controlling viability. Genetics 61, 749–761.[Free Full Text]

Muller, H. J. (1939). Reversibility of evolution considered from the standpoint of genetics. Biol Rev Camb Philos Soc 14, 261–280.

Novella, I. S. (2003). Contributions of vesicular stomatitis virus to the understanding of RNA virus evolution. Curr Opin Microbiol 6, 399–405.[CrossRef][Medline]

Novella, I. S., Duarte, E. A. Elena S. F., Moya, A., Domingo, E. & Holland, J. J. (1995). Exponential increases of RNA virus fitness during large population transmissions. Proc Natl Acad Sci U S A 92, 5841–5844.[Abstract/Free Full Text]

Novella, I. S., Zárate, S., Metzgar, D. & Ebendick-Corpus, B. E. (2004). Positive selection of synonymous mutations in vesicular stomatitis virus. J Mol Biol 342, 1415–1421.[CrossRef][Medline]

Ruiz-Jarabo, C. M., Arias, A., Baranowski, E., Escarmís, C. & Domingo, E. (2000). Memory in viral quasispecies. J Virol 74, 3543–3547.[Abstract/Free Full Text]

Sanjuán, R., Moya, A. & Elena, S. F. (2004a). The distribution of fitness effects caused by single-nucleotide substitutions in an RNA virus. Proc Natl Acad Sci U S A 101, 8396–8401.[Abstract/Free Full Text]

Sanjuán, R., Moya, A. & Elena, S. F. (2004b). The contribution of epistasis to the architecture of fitness in an RNA virus. Proc Natl Acad Sci U S A 101, 15376–15379.[Abstract/Free Full Text]

Sokal, R. R. & Rohlf, F. J. (2000). Biometry: the Principles and Practice of Statistics in Biological Research. New York, USA: Freeman.

Whitlock, M. C. & Bourguet, D. (2000). Factors affecting the genetic load in Drosophila: synergistic epistasis and correlations among fitness components. Evolution Int J Org Evolution 54, 1654–1660.[CrossRef][Medline]

Wilke, C. O. & Adami, C. (2001). Interaction between directional epistasis and average mutational effects. Proc R Soc Lond B Biol Sci 268, 1469–1474.[Medline]

Wilke, C. O., Lenski, R. E. & Adami, C. (2003). Compensatory mutations cause excess of antagonistic epistasis in RNA secondary structure folding. BMC Evol Biol 3, 3.[CrossRef][Medline]

Wolf, J. B., Brodie, E. D., III & Wade, M. J. (2000). Epistasis and the Evolutionary Process. Edited by Oxford University Press, Oxford, UK.

Wright, S. (1931). Evolution in mendelian populations. Genetics 16, 97–159.[Free Full Text]

Wright, S. (1982). The shifting balance theory and macroevolution. Annu Rev Genet 16, 1–19.[Medline]

You, L. & Yin, J. (2002). Dependence of epistasis on environment and mutation severity as revealed by in silico mutagenesis of phage T7. Genetics 160, 1273–1281.[Abstract/Free Full Text]

Yuste, E., Sánchez-Palomino, S., Casado, C., Domingo, E. & López-Galíndez, C. (1999). Drastic fitness loss in human immunodeficiency virus type 1 upon serial bottleneck events. J Virol 73, 2745–2751.[Abstract/Free Full Text]

Received 4 October 2005; accepted 2 February 2006.


This article has been cited by other articles:


Home page
J. Virol.Home page
P. Carrasco, F. de la Iglesia, and S. F. Elena
Distribution of Fitness and Virulence Effects Caused by Single-Nucleotide Substitutions in Tobacco Etch Virus
J. Virol., December 1, 2007; 81(23): 12979 - 12984.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
F. de la Iglesia and S. F. Elena
Fitness Declines in Tobacco Etch Virus upon Serial Bottleneck Transfers
J. Virol., May 15, 2007; 81(10): 4941 - 4947.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. Sanjuan, J. Forment, and S. F. Elena
In Silico Predicted Robustness of Viroid RNA Secondary Structures. II. Interaction between Mutation Pairs
Mol. Biol. Evol., November 1, 2006; 23(11): 2123 - 2130.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Sanjuán, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sanjuán, R.
Agricola
Right arrow Articles by Sanjuán, R.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
INT J SYST EVOL MICROBIOL MICROBIOLOGY J GEN VIROL
J MED MICROBIOL ALL SGM JOURNALS