Effect of Regulatory Architecture on Broad versus Narrow Sense Heritability

Yunpeng Wang; Jon Olav Vik; Stig W. Omholt; Arne B. Gjuvsland

doi:10.1371/journal.pcbi.1003053

Abstract

Additive genetic variance (V_A) and total genetic variance (V_G) are core concepts in biomedical, evolutionary and production-biology genetics. What determines the large variation in reported V_A/V_G ratios from line-cross experiments is not well understood. Here we report how the V_A/V_G ratio, and thus the ratio between narrow and broad sense heritability (h²/H²), varies as a function of the regulatory architecture underlying genotype-to-phenotype (GP) maps. We studied five dynamic models (of the cAMP pathway, the glycolysis, the circadian rhythms, the cell cycle, and heart cell dynamics). We assumed genetic variation to be reflected in model parameters and extracted phenotypes summarizing the system dynamics. Even when imposing purely linear genotype to parameter maps and no environmental variation, we observed quite low V_A/V_G ratios. In particular, systems with positive feedback and cyclic dynamics gave more non-monotone genotype-phenotype maps and much lower V_A/V_G ratios than those without. The results show that some regulatory architectures consistently maintain a transparent genotype-to-phenotype relationship, whereas other architectures generate more subtle patterns. Our approach can be used to elucidate these relationships across a whole range of biological systems in a systematic fashion.

Author Summary

The broad-sense heritability of a trait is the proportion of phenotypic variance attributable to genetic causes, while the narrow-sense heritability is the proportion attributable to additive gene effects. A better understanding of what underlies variation in the ratio of the two heritability measures, or the equivalent ratio of additive variance V_A to total genetic variance V_G, is important for production biology, biomedicine and evolution. We find that reported V_A/V_G values from line crosses vary greatly and ask if biological mechanisms underlying such differences can be elucidated by linking computational biology models with genetics. To this end, we made use of models of the cAMP pathway, the glycolysis, circadian rhythms, the cell cycle and cardiocyte dynamics. We assumed additive gene action from genotypes to model parameters and studied the resulting GP maps and V_A/V_G ratios of system-level phenotypes. Our results show that some types of regulatory architectures consistently preserve a transparent genotype-to-phenotype relationship, whereas others generate more subtle patterns. Particularly, systems with positive feedback and cyclic dynamics resulted in more non-monotonicity in the GP map leading to lower V_A/V_G ratios. Our approach can be used to elucidate the V_A/V_G relationship across a whole range of biological systems in a systematic fashion.

Citation: Wang Y, Vik JO, Omholt SW, Gjuvsland AB (2013) Effect of Regulatory Architecture on Broad versus Narrow Sense Heritability. PLoS Comput Biol 9(5): e1003053. https://doi.org/10.1371/journal.pcbi.1003053

Editor: William Stafford Noble, University of Washington, United States of America

Received: August 10, 2012; Accepted: March 23, 2013; Published: May 9, 2013

Copyright: © 2013 Wang et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: This work was supported by the Research Council of Norway (http://www.rcn.no) under the eVITA program, project number 178901/V30, and by the Virtual Physiological Rat Project (http://virtualrat.org) funded through NIH grant P50-GM094503. The development of the cgptoolbox was supported by the VPH-Network of Excellence (http://vph-noe.eu) through exemplar project EP7. NOTUR, the Norwegian metacenter for computational science, provided computing resources under project nn4653k. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The broad-sense heritability of a trait, , is the proportion of phenotypic variance attributable to genetic causes, while the narrow-sense heritability , is the proportion attributable to additive gene action. The total genetic variance includes the variance explained by intra-locus dominance () and inter-locus interactions (). The reasons for and importance of this non-additive genetic variance that distinguishes the two heritability measures has been subject to substantial controversy for more than 80 years (e.g., [1]–[6]). It was recently shown through statistical arguments that for traits with many loci at extreme allele frequencies, much of the genetic variance becomes additive with h²/H² (or equivalently V_A/V_G) typically >0.5 [3]. In populations with intermediate allele frequencies, such as controlled line crosses, lower V_A/V_G ratios are often reported [7], [8]. This is illustrated in Table 1, which summarizes estimated V_A/V_G ratios from a collection of studies on such populations. This wide range of h²/H² ratios reported for line crosses cannot be explained by an allele-frequency argument, and putative explanations must be based on how the regulatory architecture of the underlying biological systems shape the genotype-phenotype (GP) map.

Download:

Table 1. Examples of reported V_A/V_G ratios of from line-crossing experiments.

https://doi.org/10.1371/journal.pcbi.1003053.t001

It is important to understand the causal underpinnings of the observed variation in h²/H² ratios within and between biological systems for several reasons. In human quantitative genetics, where twin studies are commonly used, most heritability estimates refer to H² [9]. In cases where h²/H² is low, this can lead to unrealistic expectations about how much of the underlying causative variation may be located by linear QTL detection methods [6]. On the other hand, low narrow sense heritability for a given complex trait does not necessarily imply that the environment determines much of the variation. In evolutionary biology, additive variance is the foremost currency for evolutionary adaptation and evolvability. Important questions in this context are for example (i) to which degree is there selection on the regulatory anatomies themselves to maintain high additive variance, (ii) are there organizational constraints in building adaptive systems such that in some cases a low h²/H² ratio must of necessity emerge while the proximal solution is still selected for? Moreover, in production biology with genetically modified, sexually reproducing organisms, one would like to ensure that the modifications would be passed over to future generations in a fully predictable way. Thus, one would like to ensure that the modification becomes highly heritable in the narrow sense.

As a step towards a physiologically grounded understanding of the variation of the h²/H² relationship across biological systems or processes, we posed the question: Are there regulatory structures, or certain classes of phenotypes, more likely to generate low V_A/V_G ratios than others? Addressing this question requires the linking of genetic variation to computational biology in a population context (e.g., [10]–[19]), so-called causally-cohesive genotype-phenotype (cGP) modeling [15], [17], [18]. We applied this approach to five well-validated computational biology models describing, respectively, the glycolysis metabolic pathway in budding yeast [20], the cyclic adenosine monophosphate (cAMP) signaling pathway in budding yeast [21], the cell cycle regulation of budding yeast [22], the gene network underlying mammalian circadian rhythms [23], and the ion channels determining the action potential in mouse heart myocytes [24] These models differ in their regulatory architecture; below, we show that they also differ in the range of V_A/V_G ratios that they can exhibit. In particular, positive feedback regulation and oscillatory behavior seem to dispose for low V_A/V_G ratios. The results suggest that our approach can be used in a generic manner to probe how the h²/H² ratio varies as a function of regulatory anatomy.

Methods

Simulations of cGP models

The five cGP models were built and analyzed with the cgptoolbox (http://github.com/jonovik/cgptoolbox) an open-source Python package developed by the authors; further source code specific to the simulations in this paper is available on request. In the following we describe the three main parts of the workflow: (i) the mapping from genotypes to parameters, (ii) the mapping from parameters to phenotypes, i.e. solving the dynamic models and (iii) the setup of Monte-Carlo simulations combining the two mappings (Figure S1). For each model, we briefly describe its origins, the software used to solve it, which parameters were subject to genetic variation, what phenotypes were recorded, and criteria for omitting outlying datasets. Figures S2, S3, S4, S5, S6 shows graphical representations of the five model systems and Text S1 contains more detailed descriptions of all five models.

Genotype to parameter mapping.

For each model, the following procedure was repeated 1000 times (see also “Monte Carlo simulations” below) for different selections of parameters to be subjected to simulated genetic variation. We started by sampling three polymorphic loci, each determining one or two parameters in the dynamic model. Tables of eligible loci with corresponding parameters and their baseline values are listed in Table S1, S2, S3, S4, S5, corresponding to the cAMP, glycolysis, cell cycle, circadian and action potential models respectively. Heritable variation in a chosen parameter was generated for a single biallelic locus with allele indexes 0 and 1 in the following manner. First, two numbers r₁ and r₂ were sampled uniformly in the interval [0.7, 1.3]. The parameter value for a homozygote 00 was set to where b is the baseline value, for a homozygote 11 the parameter value was . The heterozygous genotype 01 was assigned the average of the two homozygotes , resulting in an additive mapping from genotypes to parameter values.

cAMP model.

The model of the complete cAMP signaling pathway in S. cerevisiae [21] taking the external glucose level as input was downloaded as SBML code (http://www.biomedcentral.com/content/supplementary/1752-0509-3-70-s1.xml) and integrated using PySCeS [25]. Genetic variation was introduced on association/dissociation and phosphorylation/dephosphorylation rates of signal proteins (see Figure S2 and Table S1). The initial steady state concentrations before adding external glucose, the peak values after adding glucose and the time taken to reach peak values of cellular proteins were recorded as phenotypes (see Figure 1A for phenotype illustration and Table S6 for phenotype descriptions).

Download:

Figure 1. Phenotypes derived from the cGP models.

Graphical illustration of the phenotypes recorded for the five cGP models studied. Time courses (state variable on y-axis, time on x-axis) for the baseline parameter set are displayed for all five models. A. In the absence of external glucose all state variables (concentration of cAMP is shown) in the cAMP model [21] converge to a stable steady state (blue circle on y-axis). After addition of external glucose (5 mM added at time 50) we see a rapid change followed by a slow return to the original steady state. In addition to the original steady state, the extremal concentration (top blue circle) as well as the time to reach the extremum (blue line) was recorded as phenotypes. B. Metabolite concentrations (internal glucose (GLCi), glucose-6-phospate (G6P) and fructose-6-phospate (F6P) are shown) in the glycolysis model [20] all converge to a stable steady state, indicated by open circles. The steady state concentrations for 13 metabolites were recorded as phenotypes from this model. C. For the cell cycle model [22] we recorded the peak level and the time from bottom to peak as for the circadian model (Figure 1D), and in addition we recorded cell cycle events such as bud emergence at the time when [BUD] = 1 indicated by the black arrow. D. mRNA and protein concentrations (mRNA for Bmal1 (MB), Cry (MC) and Per (MP) are shown) in the circadian model [23] converge to a limit cycle. In addition to the period of oscillation (long blue line) for each of the 16 variables the peak level (open blue circle) as well as the time from bottom to peak (short blue line) were recorded as phenotypes. E. We used the base level, peak level, amplitude, time to peak, and time to 25%, 50%, 75% and 90% recovery of the action potential and calcium transient as cell level phenotypes of the action potential model [24]. An action potential is shown in the figure.

https://doi.org/10.1371/journal.pcbi.1003053.g001

Glycolysis model.

The model published by Teusink et al. [20] describes glycolysis in S. cerevisiae through the kinetics of 13 glycolytic enzymes determining the fluxes of metabolite state variables. Genetic variation was introduced on maximal reaction rates for the enzymes (see Figure S3 and Table S2). We downloaded the model from the BioModels database (http://www.ebi.ac.uk/biomodels-main/BIOMD0000000064) in SBML L2 V1, and solved it with PySCeS [25] to find the stable steady state concentrations of metabolites, which were used as phenotypes (see Figure 1B and Table S7). Datasets were discarded if one or more of the genotypes did not give a stable steady state, as can happen due to a saddle-node bifurcation [26].

Cell cycle model.

The model of the consensus control mechanisms of the cell cycle in S. cerevisae modeled by algebraic/differential equations that describe the continuous changes in state variables and discontinuities due to cellular events [22] was obtained from the CellML repository (http://models.cellml.org/workspace/chen_calzone_csikasznagy_cross_novak_tyson_2004). Genetic variation was introduced on the production and decay rates of various proteins (see Figure S4 and Table S3). The published model contains reset rules (events) for both parameters and state variables, but the CellML implementation only includes the parameter (k_mad2, k_bub2 and k_lte1) rules. Reset rules for state variables [BUD], [SPN], and [ORI] as described in the model paper, were implemented by solving the model with rootfinding for the relevant variables. The model was numerically integrated using the CVODE solver [27] until convergence of cell division time, cell cycle events. The peak levels and time to peak levels for the cytosolic protein concentrations, together with the timing of cell division events were recorded as phenotypes (see Figure 1C for phenotype illustrations and Table S8 for phenotype descriptions).

Circadian model.

The model of the mammalian circadian clock published by Leloup and Goldbeter [23] describes the dynamics of mRNA and proteins of three genes in the cytosol and nucleus. Genetic variation was introduced on mRNA decay rates (see Figure S5 and Table S4). The model was downloaded from CellML repository (http://models.cellml.org/workspace/leloup_goldbeter_2004) and integrated using CVODE [27] until convergence to its limit cycle. As phenotypes we used the bottom levels and time to from bottom level to peak value of the concentrations of mRNAs, proteins and protein complexes. In addition, we recorded the period of oscillations (see Figure 1D for phenotype illustrations and Table S9 for phenotype descriptions).

Action potential model.

The model of [24] is an extension of [28] and describes the action potential and calcium transient of a mouse heart muscle cell. We obtained CellML code from the authors and the file is available as supplementary material in [17]. Numerical integration was done using CVODE [27]. Genetic variation was introduced on the maximal conductances of ion channels and pump affinities (see Figure S6 and Table S5). Phenotypes were generated by simulated regular pacing as done in [17], [18], with a stimulus potassium current of −15 V/s was lasting for 3 ms at the start of each stimulus interval. The model was simulated to convergence as described in [17]; datasets that failed to converge were discarded. The initial level, peak level, amplitude, and time to 25, 50, 75 and 90% recovery of the action potential and calcium transient were recorded as the cell level phenotypes (see Figure 1E for phenotype illustrations and Table S10 for phenotype descriptions).

Monte Carlo simulations.

For each model we performed 1000 Monte Carlo simulations as follows (see Figure S1 for an illustration). We first sampled three polymorphic loci for introduction of genetic variation and sampled the genotype-to-parameter map as described above. Then all 27 possible three-locus genotypes were enumerated, mapped into 27 parameter sets and for each parameter set the dynamic model was solved and phenotypes (as described above and in Figure 1) were obtained. To avoid artifacts arising from numerical noise datasets with low variability were omitted from the genetic analysis. Absolute variability was measured as the difference between the maximum and minimum values of a phenotype in a dataset, and relative variability as the ratio of the absolute variation to the mean phenotype of the same dataset. The threshold values for each phenotype and the number of datasets exceeding the thresholds are listed in Tables S6, S7, S8, S9, S10, for the cAMP, glycolysis, cell cycle, circadian and action potential models, respectively.

Statistical analysis

Decomposition of genetic variance.

A single Monte Carlo simulation results in genotype-to-phenotype maps comprised by 27 genotypic values (i.e. the phenotype values corresponding to the 27 genotypes) for a given phenotype. We used the NOIA framework [29] to compute the resulting genetic variance () in a hypothetical F₂ population and decompose it into additive () and non-additive components ( and ). This was done with the function linearGPmapanalysis in the R package noia (http://cran.r-project.org/web/packages/noia/) version 0.94.1.

Monotonicity of GP-maps.

We build on the definitions of monotonicity and the indexing of alleles introduced in [30]. Given a simulated GP map with 27 genotypic values we measured the degree of order-breaking for a particular locus k by the allele substitution effects at that locus. For a fixed background genotype at all other loci (as indicated in eq. (14) in [30]), we computed the difference in genotypic value when substituting a 0-allele with a 1-allele (i.e. when going from 00 to 01 or from 01 to 11 at locus k). We collected substitution effects across all 9 background genotypes to compute N, the sum of all negative substitution effects, and A, the sum of absolute values of all substitution effects. If the GP map is monotone for locus k then , and if it is order-breaking for locus k the .

Results/Discussion

System classification and phenotype dimensionality

The five cGP models studied in this work differ in several ways, both in their function and the underlying network structure. The glycolysis and cAMP models are both pathway models with an acyclic series of reactions transforming inputs to outputs. The glycolysis model [20] is more advanced than the metabolic models in earlier genetically oriented studies (e.g., [3], [31], [32]) as it contains many different types of enzyme kinetics as well as negative feedback regulation of some enzyme activities through product inhibition. The cAMP model [21] contains a number of negative feedback loops, but overall it also has a clear pathway structure where the glucose signal is relayed from G-proteins to cAMP to the target kinase PKA. Both these two models have in common relatively simple dynamics with solutions converging to a stable steady state (Figure 1A and B).

In contrast, the three other models show cyclic behavior resulting from an interplay between positive and negative feedback loops (Figure 1 C–E). However, there are clear differences between these models too. The heart cell model [24] is an excitable system with feedback mechanisms including calcium-induced calcium release and several voltage-dependent ion channels. In contrast to pacemaker cells, it relies on external pacing to initiate the action potential. The circadian rhythm model [23], [33] is a gene expression network with intertwined positive and negative transcriptional feedback loops, giving a limit cycle oscillator with sustained oscillations even in continuous darkness. The cell cycle model [22] centers around a positive feedback loop between B-type cyclins in association with cyclin dependent kinase and inhibitors of the cyclin dependent kinase, which establishes a hysteresis loop causing the cell cycle to emerge from transitions between the two alternative stable steady states.

This crude classification of the five cGP models into pathway models and more complex regulatory systems is clearly reflected in the effective dimensionality of the phenotypes arising in our Monte Carlo simulations. We studied the phenotypic dimensionality for all five cGP models by Principal Component Analysis (PCA) for each Monte Carlo simulation (Figure 2). Across all simulated datasets, 95% of phenotypic variation of the glycolysis and cAMP models can be explained by the first 3 principal components, the cell cycle and heart cell models require the first 5 principal components, and 7 components are required for the circadian model. Since the genotype-to-parameter maps are additive for all five models, these differences in the effective dimensionality of high-level phenotypes indicate that the mappings from parameters to phenotypes are simpler for the pathway models than the other three models. This, together with results on the effect of positive feedback on statistical epistasis in gene regulatory networks [11], suggested that the glycolysis and cAMP models might result in higher V_A/V_G ratios than the other three models.

Download:

Figure 2. Dimensionality of phenotypic variation.

The proportion of total phenotypic variation explained (y axis) versus the number of principal components (x axis) across all five cGP models (colour coded). For each Monte Carlo data set the matrices containing the full genotype-phenotype map for all M recorded phenotypes was standardized to unit variance before principal components analysis. Each boxplot summarizes explained variance for close to 1000 Monte Carlo simulations.

https://doi.org/10.1371/journal.pcbi.1003053.g002

The ratio of additive genetic variance to total genetic variance

The results confirmed our expectations regarding high V_A/V_G ratios for the glycolysis and cAMP models. Furthermore, a number of distinct patterns emerged. The cAMP model shows the overall highest V_A/V_G ratios values (Figure 3A and Table S6), with mean and median values above 0.99 across all recorded phenotypes. The 0.05-quantile (i.e. only 5 percent of the Monte Carlo simulations show lower values than this) V_A/V_G values were above 0.97 for all phenotypes and no values lower than 0.6 were observed. In other words, an intra- and inter-locus additive model of gene action very well approximates the genotype-phenotype maps arising from this cGP model.

Download:

Figure 3. The empirical cumulative distribution function of V_A/V_G ratios for phenotypes of the cAMP (A) and the glycolysis (B) models.

A. The empirical cumulative distribution functions (y axis) of V_A/V_G ratios (x axis) for all phenotypes studied in the cAMP model: The initial steady state concentrations before adding external glucose of the cyclic adenosine monophosphate (cAMP), the G-protein Ras2a (Ras2a), the guanine-nucleotide-exchange factor (Cdc25), the protein kinase A (PKAi). The peak values after adding glucose of these proteins (cAMPv, Ras2av, Cdc25v and PKAiv), the Kelch repeat homologue protein (Krhv), the G-protein Gpa2a (Gpa2av), and the phosphodiesterase (Pde1v). The time taken to reach the peak values (cAMPt, Ras2at, Cdc25t, PKAit, Krht, Gpa2at, Ped1t). See Table S6 for further phenotype descriptions and numerical summaries of the distribution of V_A/V_G ratios. B. The empirical cumulative distribution function (y axis) of V_A/V_G ratios (x axis) for the steady state concentrations of 13 metabolites in the glycolysis model: acetaldehyde (ACE), 1,3-bisphospoglycerate (BPG), fructose-1,6-bisphosphate (F16P), fructose 6-phosphate (F6P), glucose 6-phosphate (G6P), glucose in cell (GLCi), nicotinamide adenine dinucleotide (NADH), phosphates in adenine nucleotide (P), 2-phosphoglyerate (P2G), 3-phosphoglycerate (P3G), phosphoenolpyruvate (PEP), pyruvate (PYP), and trio-phosphate (TRIO). See Table S7 for further phenotype descriptions and numerical summaries of the distribution of V_A/V_G ratios.

https://doi.org/10.1371/journal.pcbi.1003053.g003

The glycolysis model also has mean and median V_A/V_G values close to 1 for all phenotypes (Figure 3B and Table S7). But compared to the cAMP model, the numbers are clearly lower; the lowest recorded mean value (phenotype BPG) is 0.9 and 0.05-quantile values are below 0.7 for some phenotypes. A few V_A/V_G values below 0.5 are observed for all phenotypes. The distribution of V_A/V_G ratios for the cell cycle model (Figure S7 and Table S8) is quite similar to that of the glycolysis model, with a lowest mean V_A/V_G value of 0.93 for time to peak for Sic1 and with 0.05-quantiles below 0.8 for some phenotypes. V_A/V_G values below 0.1 are observed for a few Monte Carlo simulations in some phenotypes.

For each of the cAMP, glycolysis and cell cycle models the distributions of V_A/V_G ratios were quite similar across all phenotypes, and a large majority of the Monte Carlo simulations showed very high ratios. The circadian clock model differs from these three models both in terms of displaying large variation between phenotypes and in terms of having a much larger proportion of low V_A/V_G values (Figure 4A and Table S9). Four phenotypes stand out with V_A/V_G distributions that resemble a uniform distribution U(0,1). These are the time from bottom to peak for the phosphorylated and unphosphorylated proteins of Per and Cry, and they have median V_A/V_G values ranging from 0.46 to 0.70 and 0.05-quantile values in the range 0.04 to 0.10. The remaining phenotypes have somewhat higher V_A/V_G values, but over half of the recorded phenotypes have 0.05-quantiles below 0.6. Median V_A/V_G values are below 0.9 for the majority of phenotypes of the action potential model. And all recorded phenotypes have a large proportion of low V_A/V_G ratios (Figure 4B and Table S10) with 0.05-quantiles in the range 0.18-0-35. The distributions are quite similar across action potential and calcium transient phenotypes, but the time to 90% repolarization for the action potential shows somewhat higher values than the others.

Download:

Figure 4. The empirical cumulative distribution function of V_A/V_G ratios for phenotypes of the circadian model (A) and the action potential model (B).

The empirical cumulative distribution functions (y axis) of V_A/V_G ratios (x axis) for phenotypes studied in the circadian model and the heart cell model. A. The upper-left panel (Bmal1) shows phenotypes related to bmal1 gene, including the mRNA (MB), the unphosphorylated/phosphorylated protein in cytosol (BC/BCP) and nucleus (BN/BNP). The bottom-right panel (Per) is for per gene, including the mRNA (MP), the unphosphorylated protein (PC) and the phosphorylated protein (PCP). The bottom concentration (solid line) and the time take to peak (dashed line) of each species are recorded phenotypes. The bottom-left panel (Cry) is related to cry gene, including the mRNA (MC), the unphosphorylated protein (CC) and phosphorylated protein (CCP). The upper-right panel (Complex) is for protein complexes PCC, PCN, PCCP and PCNP. The period of circadian rhythm (Period, dotted line) is also shown. See Table S9 for further phenotype descriptions and numerical summaries of the distribution of V_A/V_G ratios. B. The empirical cumulative distribution functions (y axis) of V_A/V_G ratios (x axis) for phenotypes studied in the action potential model: time to 25%, 50%, 75% and 90% of initial values, the amplitude, initial values (Base), peak values, time to reach peak values of action potential (left panel) and calcium transient (right panel) are shown. See Table S10 for further phenotype descriptions and numerical summaries of the distribution of V_A/V_G ratios.

https://doi.org/10.1371/journal.pcbi.1003053.g004

All five cGP models are capable of creating V_A/V_G ratios close to 1, and except for two phenotypes for the circadian model all median values of V_A/V_G are well above 0.5. This supports the hypothesis [30] that biological systems tend to involve regulatory machinery that in general leads to considerable additive genetic variance even at intermediate allele frequencies. That is not to say that selection cannot sometimes produce regulatory solutions that tend towards low V_A/V_G ratios; in fact, the incidence of low V_A/V_G ratios varied markedly among the five models that we studied. Because the genotype-parameter maps were purely additive, all non-additive genetic variance is a result of non-linearity in the ODE models. The expected effect of introducing non-additivity in the genotype-parameter maps would be a further decrease in the V_A/V_G ratios.

Our finding that models with complex regulation involving positive feedback loops tend to give lower V_A/V_G agrees with a previous study on gene regulatory networks [11]. Considering the relatively high V_A/V_G ratios of the cell cycle model compared to the circadian and action potential models, the following quote from Tyson and Novak's [34] discussion of why the cell-cycle is better understood as a hysteresis loop than as a limit cycle oscillator (LCO), is highly informative: “Generally speaking, the period of an LCO is a complicated function of all the kinetic parameters in the differential equations. However, the period of the cell division cycle depends on only one kinetic property of the cell: its mass-doubling time.” This seems to explain why the genotype-phenotype maps arising from the cell-cycle models are much more linear than the maps from the circadian model, which is an LCO.

Monotonicity explains much of the V_A/V_G patterns

In a given population V_A/V_G is a function of allele frequencies as well as the GP map, and GP maps with strong interactions can still give high V_A/V_G values in populations with extreme allele frequencies [3]. In populations with intermediate allele frequencies the V_A/V_G values are determined mainly by the shape of the genotype-phenotype map, and the observed differences between the five cGP models in the distribution of V_A/V_G values motivates a search for underlying explanatory principles.

The recently proposed concept of monotonicity (or order-preservation) of GP maps seems to be one such principle. In short, a GP map is said to be monotone if the ordering of genotypes by gene content (the number of alleles of a given type) is preserved in the ordering of the associated phenotypic values (see [30] for details). Figure 5 depicts three extreme types of GP maps seen in our simulations. Nearly additive GP maps as shown in Figure 5A give V_A/V_G values very close to one. GP maps with strong magnitude epistasis, but still order-preserving, typically result in intermediate V_A/V_G values (Figure 5B), while highly non-monotone or order-breaking GP maps (Figure 5C) showing strong overdominance and/or sign epistasis result in V_A/V_G values close to zero.

Download:

Figure 5. Three distinct types of genotype-phenotype maps.

Examples of three distinct types of genotype-phenotype maps seen in our simulations. For each subfigure the phenotypic value is shown on the y-axis while the x-axises, line colours and plot columns indicate the genotype at the three loci. A. A nearly additive map exemplified by the GP map of the time to peak concentration of Cdc25 (V_A/V_G = 0.99) in the cAMP model; B. A fully monotone but non-additive map exemplified by the GP map of the concentration of P2G protein (V_A/V_G = 0.41) in the glycolysis model; and, C. A strongly non-monotonic map is found the time to peak concentration of the PC protein (V_A/V_G = 0.03) from the circadian model.

https://doi.org/10.1371/journal.pcbi.1003053.g005

Based on recent results from studies of gene regulatory networks [30], we anticipated that the three cGP models with complex regulation involving positive feedback would result in considerably more non-monotone or order-breaking GP maps than the two pathway models. To test this, we measured the amount of order-breaking in all simulated GP maps (see Methods) and found a very clear pattern (Figure 6). While the datasets from the glycolysis and cAMP models contained only 1.1% and 1.3% GP maps with order-breaking for any locus, those from the cell cycle, circadian and action potential models contained 20.7%, 44.4% and 69.5%, respectively. Moreover, monotone GP maps gave higher V_A/V_G values than non-monotone GP maps for all five cGP models (Mann-Whitney test; p-values below 1e-10 for all five models).

Download:

Figure 6. The number of loci for which the GP-map shows order-breaking.

The number of Monte Carlo simulations where the GP-map for a given phenotype is clearly order-breaking (GP maps with N/A>0.05, see Methods) is shown for the cAMP model (A), the glycolysis model (B), the cell cycle model (C), the circadian model (D) and the action potential model (E). Only phenotypes with at least one Monte carlo simulation resulting in an order-breaking GP map are shown.

https://doi.org/10.1371/journal.pcbi.1003053.g006

However, despite the fact that the glycolysis model rarely shows order-breaking even for a single locus, it possesses much lower V_A/V_G values than the cAMP model. A plausible explanation is that the steady-state concentrations of metabolites can markedly increase for parameter values close to a saddle-node bifurcation point [26]. Simulation outcomes with unstable steady states were discarded, but in those cases where one of the genotypes (i.e. parameter sets) come close to the bifurcation point without crossing it we get genotype-phenotype maps as in Figure 5B, where one genotype (or a small set) gives much higher phenotypic values than the others. Such GP maps, similar to the duplicate factor model in Hill et al. [3], are known to give low V_A/V_G ratios despite being monotonic. Similar GP maps giving V_A/V_G ratios close to zero were also found by Keightley [32] in his study of metabolic models possessing null alleles at all loci.

Considerations on the genericity of the results

Our main reason for restricting the sampled genetic variation of parameters to within 30% of the published baseline values was to avoid qualitative (or topological) changes of the dynamics. Such qualitative changes are often biologically realistic descriptions of knockouts or other large genetic changes, for example action potentials of alternating amplitude (alternans) [17]; loss of stable circadian oscillation [23]; and non-viable cell-cycle mutants phenotypes [22]. However, since the heritability and variance component concepts are defined for phenotypes showing continuous rather than discrete variation, we sought to avoid such qualitative changes here.

We ran simulations with five polymorphic loci for the cAMP (Figure S8A), glycolysis (Figure S8B), cell cycle (Figure S9) and action potential (Figure S10) models (the circadian model describes only three genes explicitly). The resulting V_A/V_G values were slightly lower than with three loci, but the overall shape of the distributions and the clear differences between models did not change. This indicates that our findings are of general relevance for oligogenic traits.

It should be emphasized that the five studied cGP models differ in several other aspects than those highlighted here, such as the system size (number of state variables) and the process time scales. These features could also contribute to the observed variation in the distributions of V_A/V_G ratios. However, such structural differences are unavoidable when the aim is to compare experimentally validated models designed to describe specific biological systems. A complementary approach is to study generic models where system size and equation structure is fixed, while the connectivity matrix can be changed to describe a family of systems [35]. This facilitates graph-theoretic comparison of systems at the expense of some biological realism. We anticipate that the major conclusions from such studies will be similar to ours, but it may very well be that other important generic insights may also come to the fore.

All the models in our study describe parts of the cellular machinery and the resulting phenotypes are thus cellular rather than organismal. We do not think this is a major shortcoming in terms of the main conclusions that emerge from our results. However, we anticipate that application of our approach on multiscale models including cellular, tissue and whole-organ phenotypes [36] will provide a much improved foundation for explaining how properties of the GP map vary across and within biological systems in terms of regulatory anatomy and associated genetic variation [37], [38].

As our approach can be used together with any computational biology model, it has the potential to contribute substantially to a theoretical foundation capable of predicting when we are to expect low or high V_A/V_G or h²/H² ratios from the principles of regulatory biology. Causally cohesive genotype-phenotype modeling thus appears to qualify as a promising approach for integrating causal models of biological networks and physiology with quantitative genetics [39]–[44].

Supporting Information

Figure S1.

Flowchart of Monte Carlo simulations and analysis. Flowchart of the Monte Carlo simulations described in the Methods section “Monte Carlo simulations” and subsequent analysis described in the Methods section “Statistical analysis”.

https://doi.org/10.1371/journal.pcbi.1003053.s001

(PDF)

Figure S2.

Graphical representation of cAMP model. Figure modified from http://www.biomedcentral.com/1752-0509/3/70/figure/F7. Red numbers, correspond to the rows in Table S1, and indicate the model elements where genetic variation was introduced.

https://doi.org/10.1371/journal.pcbi.1003053.s002

(PDF)

Figure S3.

Graphical representation of glycolysis model. Figure modified from the CellML model repository (http://models.cellml.org/workspace/teusink_passarge_reijenga_esgalhado_vanderweijden_schepper_walsh_bakker_vandam_westerhoff_snoep_2000). Red numbers, correspond to the rows in Table S2, and indicate the model elements where genetic variation was introduced.

https://doi.org/10.1371/journal.pcbi.1003053.s003

(PDF)

Figure S4.

Graphical representation of cell cycle model. Figure modified from the CellML model repository (http://models.cellml.org/workspace/chen_calzone_csikasznagy_cross_novak_tyson_2004). Red numbers, correspond to the rows in Table S3, and indicate the model elements where genetic variation was introduced.

https://doi.org/10.1371/journal.pcbi.1003053.s004

(PDF)

Figure S5.

Graphical representation of circadian model. Figure modified from the CellML model repository (http://models.cellml.org/workspace/leloup_goldbeter_2004). Red numbers, correspond to the rows in Table S4, and indicate the model elements where genetic variation was introduced.

https://doi.org/10.1371/journal.pcbi.1003053.s005

(PDF)

Figure S6.

Graphical representation of action potential model. Figure modified from the CellML model repository (http://models.cellml.org/workspace/bondarenko_szigeti_bett_kim_rasmusson_2004). Red numbers, correspond to the rows in Table S5, and indicate the model elements where genetic variation was introduced.

https://doi.org/10.1371/journal.pcbi.1003053.s006

(PDF)

Figure S7.

The empirical cumulative distribution function of V_A/V_G ratios for phenotypes of the cell cycle model. The empirical cumulative distribution functions (y axis) of V_A/V_G ratios (x axis) for all phenotypes studied in the cell cycle model. The phenotypes are divided into 3 groups. Cell events refer to the discrete events defined in the model paper and include timing of budding (Bud), timing of DNA replication (Rep) and timing of alignment of chromosomes on the metaphase plates (Spn). Peak concentration include the concentration of the phosphorylated anaphase-promoting complex (APCP), the G1-stabilizing protein Cdc6, the B-type Cyclin protein 2 (Clb2), the S-phase promoting B-type Cyclin (Clb5), the starter kinase (Cln2) and the G1 phase stabilizing protein (Sci1). The time to peak phenotypes include the time to reach peak concentrations of APCP, Cdc6, Clb2, Clb5, Cln2 and Sci1. See Table S8 for further phenotype descriptions and numerical summaries of the distribution of V_A/V_G ratios.

https://doi.org/10.1371/journal.pcbi.1003053.s007

(PDF)

Figure S8.

The empirical cumulative distribution function of V_A/V_G ratios for phenotypes of the cAMP (A) and the glycolysis (B) models with 5 polymorphic loci. Figure 3 shows results from simulations with 3 polymorhpic loci. A. The empirical cumulative distribution functions (y axis) of V_A/V_G ratios (x axis) for all phenotypes studied in the cAMP model: The initial steady state concentrations before adding external glucose of the cyclic adenosine monophosphate (cAMP), the G-protein Ras2a (Ras2a), the guanine-nucleotide-exchange factor (Cdc25), the protein kinase A (PKAi). The peak values after adding glucose of these proteins (cAMPv, Ras2av, Cdc25v and PKAiv), the Kelch repeat homologue protein (Krhv), the G-protein Gpa2a (Gpa2av), and the phosphodiesterase (Pde1v). The time taken to reach the peak values (cAMPt, Ras2at, Cdc25t, PKAit, Krht, Gpa2at, Ped1t). B. The empirical cumulative distribution function (y axis) of V_A/V_G ratios (x axis) for the steady state concentrations of 13 metabolites in the glycolysis model acetaldehyde (ACE), 1,3-bisphospoglycerate (BPG), fructose-1,6-bisphosphate (F16P), fructose 6-phosphate (F6P), glucose 6-phosphate (G6P), glucose in cell (GLCi), nicotinamide adenine dinucleotide (NADH), phosphates in adenine nucleotide (P), 2-phosphoglyerate (P2G), 3-phosphoglycerate (P3G), phosphoenolpyruvate (PEP), pyruvate (PYP), and trio-phosphate (TRIO).

https://doi.org/10.1371/journal.pcbi.1003053.s008

(PDF)

Figure S9.

The empirical cumulative distribution function of V_A/V_G ratios for phenotypes of the cell cycle model with 5 polymorphic loci. Figure S7 shows results from simulations with 3 polymorhpic loci. The empirical cumulative distribution functions (y axis) of V_A/V_G ratios (x axis) for all phenotypes studied in the cell cycle model. The phenotypes are divided into 3 groups. Cell events refer to the discrete events defined in the model paper and include timing of budding (Bud), timing of DNA replication (Rep) and timing of alignment of chromosomes on the metaphase plates (Spn). Peak concentration include the concentration of the phosphorylated anaphase-promoting complex (APCP), the G1-stabilizing protein Cdc6, the B-type Cyclin protein 2 (Clb2), the S-phase promoting B-type Cyclin (Clb5), the starter kinase (Cln2) and the G1 phase stabilizing protein (Sci1). The time to peak phenotypes include the time to reach peak concentrations of APCP, Cdc6, Clb2, Clb5, Cln2 and Sci1.

https://doi.org/10.1371/journal.pcbi.1003053.s009

(PDF)

Figure S10.

The empirical cumulative distribution function of V_A/V_G ratios for phenotypes of the action potential model with 5 polymorphic loci. Figure 4B shows results from simulations with 3 polymorhpic loci. The empirical cumulative distribution functions (y axis) of V_A/V_G ratios (x axis) for phenotypes studied in the action potential model: time to 25%, 50%, 75% and 90% of initial values, the amplitude, initial values (Base), peak values, time to reach peak values of action potential (left panel) and calcium transient (right panel) are shown.

https://doi.org/10.1371/journal.pcbi.1003053.s010

(PDF)

Table S1.

Polymorphic model elements of the cAMP model. A list of cAMP model elements and parameters used to manifest genetic variation. Parameter names from the original publication (Table 4 in [21], names used in the SBML file retrieved from http://www.biomedcentral.com/content/supplementary/1752-0509-3-70-s1.xml and baseline values with units.

https://doi.org/10.1371/journal.pcbi.1003053.s011

(PDF)

Table S2.

Polymorphic model elements of the glycolysis model. A list of glycolysis model elements and parameters used to manifest genetic variation. Parameter names from the original publication [20], names used in the SBML file retrieved from http://www.ebi.ac.uk/biomodels-main/BIOMD0000000064 and baseline values with units.

https://doi.org/10.1371/journal.pcbi.1003053.s012

(PDF)

Table S3.

Polymorphic model elements of the cell cycle model. A list of cell cycle model elements and parameters used to manifest genetic variation. Parameter names from Table 1 and Table 2 in the original publication [22], names used in the CellML file retrieved from http://models.cellml.org/workspace/chen_calzone_csikasznagy_cross_novak_tyson_2004 and baseline values with units.

https://doi.org/10.1371/journal.pcbi.1003053.s013

(PDF)

Table S4.

Polymorphic model elements of the circadian model. A list of circadian model elements and parameters used to manifest genetic variation. Parameter names from Table 1 (parameter set 4) in the original publication [23], names used in the CellML file “leloup_goldbeter_2004.cellml” retrieved from http://models.cellml.org/workspace/leloup_goldbeter_2004/ and baseline values with units.

https://doi.org/10.1371/journal.pcbi.1003053.s014

(PDF)

Table S5.

Polymorphic model elements of the action potential model. A list of action potential model elements and parameters used to manifest genetic variation. Parameter names from Table B1 in the original publication [24], names used in the CellML file which is available as supplementary material (filename “LNCS model.zip”) at doi:10.3389/fphys.2011.00106 and baseline values with units.

https://doi.org/10.1371/journal.pcbi.1003053.s015

(PDF)

Table S6.

Summary of phenotype descriptions, variability thresholds and distribution of V_A/V_G ratios for the cAMP model. The first three columns list the phenotype abbreviations used in this study, a text description of the phenotypes and their units. The thresholds used to filter out dataset with very low relative and/or absolute variability are listed in the next two columns, followed by the number of Monte Carlo simulations (out of 1000) passing the threshold. The last 7 columns contain quantiles and means of the V_A/V_G values for the datasets passing the variability threshold.

https://doi.org/10.1371/journal.pcbi.1003053.s016

(PDF)

Table S7.

Summary of phenotype descriptions, variability thresholds and distribution of V_A/V_G ratios for the glycolysis model. The first three columns list the phenotype abbreviations used in this study, a text description of the phenotypes and their units. The thresholds used to filter out dataset with very low relative and/or absolute variability are listed in the next two columns, followed by the number of Monte Carlo simulations (out of 1000) passing the threshold. The last 7 columns contain quantiles and means of the V_A/V_G values for the datasets passing the variability threshold.

https://doi.org/10.1371/journal.pcbi.1003053.s017

(PDF)

Table S8.

Summary of phenotype descriptions, variability thresholds and distribution of V_A/V_G ratios for the cell cycle model. The first three columns list the phenotype abbreviations used in this study, a text description of the phenotypes and their units. The thresholds used to filter out dataset with very low relative and/or absolute variability are listed in the next two columns, followed by the number of Monte Carlo simulations (out of 1000) passing the threshold. The last 7 columns contain quantiles and means of the V_A/V_G values for the datasets passing the variability threshold.

https://doi.org/10.1371/journal.pcbi.1003053.s018

(PDF)

Table S9.

Summary of phenotype descriptions, variability thresholds and distribution of V_A/V_G ratios for the circadian model. The first three columns list the phenotype abbreviations used in this study, a text description of the phenotypes and their units. The thresholds used to filter out dataset with very low relative and/or absolute variability are listed in the next two columns, followed by the number of Monte Carlo simulations (out of 1000) passing the threshold. The last 7 columns contain quantiles and means of the V_A/V_G values for the datasets passing the variability threshold. Abbreviations: phosphorylated – phos., cytosolic – cyt., nuclear – nuc., bottom concentration – b.c., peak concentration – p.c.

https://doi.org/10.1371/journal.pcbi.1003053.s019

(PDF)

Table S10.

Summary of phenotype descriptions, variability thresholds and distribution of V_A/V_G ratios for the action potential model. The first three columns list the phenotype abbreviations used in this study, a text description of the phenotypes and their units. The thresholds used to filter out dataset with very low relative and/or absolute variability are listed in the next two columns, followed by the number of Monte Carlo simulations (out of 1000) passing the threshold. The last 7 columns contain quantiles and means of the V_A/V_G values for the datasets passing the variability threshold.

https://doi.org/10.1371/journal.pcbi.1003053.s020

(PDF)

Text S1.

More detailed descriptions of the five cGP models.

https://doi.org/10.1371/journal.pcbi.1003053.s021

(PDF)

Acknowledgments

We are thankful to Katherine C. Chen for help on implementing and solving the cell cycle model.

Author Contributions

Conceived the study: ABG SWO. Performed simulations and analysis: YW ABG JOV. Wrote the paper: YW JOV SWO ABG.

References

1. Carlborg Ö, Haley CS (2004) Opinion: Epistasis: too often neglected in complex trait studies? Nat Rev Genet 5: 618–625 .
- View Article
- Google Scholar
2. Fisher RA (1930) The genetical theory of natural selection. Oxford: Clarendon Press. 302p.
3. Hill WG, Goddard ME, Visscher PM (2008) Data and Theory Point to Mainly Additive Genetic Variance for Complex Traits. PLoS Genet 4: e1000008.
- View Article
- Google Scholar
4. Long N, Gianola D, Rosa GJM, Weigel KA (2011) Marker-assisted prediction of non-additive genetic values. Genetica 139: 843–854 .
- View Article
- Google Scholar
5. Wright S (1931) Evolution in Mendelian Populations. Genetics 16: 97–159.
- View Article
- Google Scholar
6. Zuk O, Hechter E, Sunyaev SR, Lander ES (2012) The mystery of missing heritability: Genetic interactions create phantom heritability. Proc Natl Acad Sci U S A 109: 1193–1198 .
- View Article
- Google Scholar
7. Mackay TFC, Stone EA, Ayroles JF (2009) The genetics of quantitative traits: challenges and prospects. Nat Rev Genet 10: 565–577 .
- View Article
- Google Scholar
8. Hill WG (2010) Understanding and using quantitative genetic variation. Philos Trans R Soc Lond B Biol Sci 365: 73–85 .
- View Article
- Google Scholar
9. Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits. Sunderland, Mass.: Sinauer Associates. 980p.
10. Gertz J, Gerke JP, Cohen BA (2010) Epistasis in a quantitative trait captured by a molecular model of transcription factor interactions. Theor Popul Biol 77: 1–5 .
- View Article
- Google Scholar
11. Gjuvsland AB, Hayes BJ, Omholt SW, Carlborg Ö (2007) Statistical epistasis is a generic feature of gene regulatory networks. Genetics 175: 411–420 .
- View Article
- Google Scholar
12. Omholt SW, Plahte E, Oyehaug L, Xiang K (2000) Gene regulatory networks generating the phenomena of additivity, dominance and epistasis. Genetics 155: 969–980.
- View Article
- Google Scholar
13. Peccoud J, Vander Velden K, Podlich D, Winkler C, Arthur L, et al. (2004) The selective values of alleles in a molecular network model are context dependent. Genetics 166: 1715–1725.
- View Article
- Google Scholar
14. Pumir A, Shraiman B (2011) Epistasis in a Model of Molecular Signal Transduction. PLoS Comput Biol 7: e1001134.
- View Article
- Google Scholar
15. Rajasingh H, Gjuvsland AB, Våge DI, Omholt SW (2008) When Parameters in Dynamic Models Become Phenotypes: A Case Study on Flesh Pigmentation in the Chinook Salmon (Oncorhynchus tshawytscha). Genetics 179: 1113–1118.
- View Article
- Google Scholar
16. Salazar-Ciudad I, Jernvall J (2010) A computational model of teeth and the developmental origins of morphological variation. Nature 464: 583–586 .
- View Article
- Google Scholar
17. Vik JO, Gjuvsland AB, Li L, Tøndel K, Niederer S, et al. (2011) Genotype-Phenotype Map Characteristics of an In silico Heart Cell. Front Physiol 2: 106 .
- View Article
- Google Scholar
18. Wang Y, Gjuvsland AB, Vik JO, Smith NP, Hunter PJ, et al. (2012) Parameters in dynamic models of complex traits are containers of missing heritability. PLoS Comput Biol 8: e1002459 .
- View Article
- Google Scholar
19. Welch SM, Dong Z, Roe JL, Das S (2005) Flowering time control: gene network modelling and the link to quantitative genetics : Modelling complex traits for plant improvement. Aust J Agric Res 56: 919–936.
- View Article
- Google Scholar
20. Teusink B, Passarge J, Reijenga CA, Esgalhado E, van der Weijden CC, et al. (2000) Can yeast glycolysis be understood in terms of in vitro kinetics of the constituent enzymes? Testing biochemistry. Eur J Biochem 267: 5313–5329.
- View Article
- Google Scholar
21. Williamson T, Schwartz J-M, Kell DB, Stateva L (2009) Deterministic mathematical models of the cAMP pathway in Saccharomyces cerevisiae. BMC Syst Biol 3: 70 .
- View Article
- Google Scholar
22. Chen KC, Calzone L, Csikasz-Nagy A, Cross F, Novak B, et al. (2004) Integrative Analysis of Cell Cycle Control in Budding Yeast. Mol Biol Cell 15: 3841–3862 .
- View Article
- Google Scholar
23. Leloup J-C, Goldbeter A (2004) Modeling the mammalian circadian clock: sensitivity analysis and multiplicity of oscillatory mechanisms. J Theor Biol 230: 541–562 .
- View Article
- Google Scholar
24. Li L, Niederer SA, Idigo W, Zhang YH, Swietach P, et al. (2010) A mathematical model of the murine ventricular myocyte: a data-driven biophysically based approach applied to mice overexpressing the canine NCX isoform. Am J Physiol Heart Circ Physiol 299: H1045–H1063 .
- View Article
- Google Scholar
25. Olivier BG, Rohwer JM, Hofmeyr J-HS (2005) Modelling cellular systems with PySCeS. Bioinformatics 21: 560–561 .
- View Article
- Google Scholar
26. Reijenga KA, van Megen YMGA, Kooi BW, Bakker BM, Snoep JL, et al. (2005) Yeast glycolytic oscillations that are not controlled by a single oscillophore: a new definition of oscillophore strength. J Theor Biol 232: 385–398 .
- View Article
- Google Scholar
27. Cohen S, Hindmarsh C (1996) CVODE, a stiff/nonstiff ODE solver in C. Computers in physics 10: 138–143.
- View Article
- Google Scholar
28. Bondarenko VE, Szigeti GP, Bett GCL, Kim S-J, Rasmusson RL (2004) Computer model of action potential of mouse ventricular myocytes. Am J Physiol Heart Circ Physiol 287: H1378–H1403 .
- View Article
- Google Scholar
29. Alvarez-Castro JM, Carlborg Ö (2007) A unified model for functional and statistical epistasis and its application in quantitative trait Loci analysis. Genetics 176: 1151–1167 .
- View Article
- Google Scholar
30. Gjuvsland AB, Vik JO, Woolliams JA, Omholt SW (2011) Order-preserving principles underlying genotype-phenotype maps ensure high additive proportions of genetic variance. J Evol Biol 24: 2269–2279 .
- View Article
- Google Scholar
31. Kacser H, Burns JA (1981) The molecular basis of dominance. Genetics 97: 639–666.
- View Article
- Google Scholar
32. Keightley PD (1989) Models of quantitative variation of flux in metabolic pathways. Genetics 121: 869–876.
- View Article
- Google Scholar
33. Leloup J-C, Goldbeter A (2003) Toward a detailed computational model for the mammalian circadian clock. Proc Natl Acad Sci U S A 100: 7051–7056 .
- View Article
- Google Scholar
34. Tyson JJ, Novak B (2001) Regulation of the eukaryotic cell cycle: molecular antagonism, hysteresis, and irreversible transitions. J Theor Biol 210: 249–263 .
- View Article
- Google Scholar
35. Siegal ML, Promislow DEL, Bergman A (2007) Functional and evolutionary inference in gene networks: does topology matter? Genetica 129: 83–103 .
- View Article
- Google Scholar
36. Hunter P, Coveney PV, De Bono B, Diaz V, Fenner J, et al. (2010) A vision and strategy for the virtual physiological human in 2010 and beyond. Philos Transact A Math Phys Eng Sci 368: 2595–2614 .
- View Article
- Google Scholar
37. Houle D, Govindaraju DR, Omholt S (2010) Phenomics: the next challenge. Nat Rev Genet 11: 855–866.
- View Article
- Google Scholar
38. Dermitzakis ET (2012) Cellular genomics for complex traits. Nat Rev Genet 13: 215–220 .
- View Article
- Google Scholar
39. Benfey PN, Mitchell-Olds T (2008) From genotype to phenotype: systems biology meets natural variation. Science 320: 495–497 .
- View Article
- Google Scholar
40. Flint J, Mackay TFC (2009) Genetic architecture of quantitative traits in mice, flies, and humans. Genome Research 19: 723–733 .
- View Article
- Google Scholar
41. Moore A (2012) Towards the new evolutionary synthesis: Gene regulatory networks as information integrators. Bioessays 34: 87–87 .
- View Article
- Google Scholar
42. Nadeau JH, Dudley AM (2011) Genetics. Systems genetics. Science 331: 1015–1016 .
- View Article
- Google Scholar
43. Rockman MV (2008) Reverse engineering the genotype-phenotype map with natural genetic variation. Nature 456: 738–744 .
- View Article
- Google Scholar
44. Sieberts SK, Schadt EE (2007) Moving toward a system genetics view of disease. Mamm Genome 18: 389–401 .
- View Article
- Google Scholar
45. Le Rouzic A, Alvarez-Castro JM, Carlborg Ö (2008) Dissection of the genetic architecture of body weight in chicken reveals the impact of epistasis on domestication traits. Genetics 179: 1591–1599 .
- View Article
- Google Scholar
46. Prows DR, Hafertepen AP, Gibbons WJ, Winterberg AV, Nick TG (2007) A genetic mouse model to investigate hyperoxic acute lung injury survival. Physiol Genomics 30: 262–270 .
- View Article
- Google Scholar
47. Jordan KW, Carbone MA, Yamamoto A, Morgan TJ, Mackay TFC (2007) Quantitative genomics of locomotor behavior in Drosophila melanogaster. Genome Biol 8: R172 .
- View Article
- Google Scholar
48. Wang P, Lyman RF, Shabalina SA, Mackay TFC, Anholt RRH (2007) Association of polymorphisms in odorant-binding protein genes with variation in olfactory response to benzaldehyde in Drosophila. Genetics 177: 1655–1665 .
- View Article
- Google Scholar
49. Zhang JF, Waddell C, Sengupta-Gopalan C, Potenza C, Cantrell RG (2007) Diallel analysis of root-knot nematode resistance based on galling index in upland cotton. Plant Breeding 126: 164–168.
- View Article
- Google Scholar
50. Cuevas HE, Staub JE, Simon PW (2010) Inheritance of beta-carotene-associated medocarp colour and fruit maturity of melon(Cucumis melo L.). Euphytica 173: 129–140 .
- View Article
- Google Scholar
51. Flint-Garcia SA, Buckler ES, Tiffin P, Ersoz E, Springer NM (2009) Heterosis is prevalent for multiple traits in diverse maize germplasm. PLoS ONE 4: e7433 .
- View Article
- Google Scholar
52. Lyimo HJF, Pratt RC, Mnyuku RSOW (2011) Heritability and gene effect estimates for components of partial resistance to grey leaf spot of maize by generation mean analysis. Plant Breeding 130: 633–639 .
- View Article
- Google Scholar
53. Kearsey MJ, Pooni HS, Syed NH (2003) Genetics of quantitative traits in Arabidopsis thaliana. Heredity 91: 456–464 .
- View Article
- Google Scholar
54. Chakravarthi DVN, Rao YV, Rao MVS, Manga V (2010) Genetic analysis of in vitro callus and production of multiple shoots in eggplant. Plant Cell Tiss Organ Cult 102: 87–97 .
- View Article
- Google Scholar

[ref1] 1. Carlborg Ö, Haley CS (2004) Opinion: Epistasis: too often neglected in complex trait studies? Nat Rev Genet 5: 618–625 .
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Fisher RA (1930) The genetical theory of natural selection. Oxford: Clarendon Press. 302p.

[ref3] 3. Hill WG, Goddard ME, Visscher PM (2008) Data and Theory Point to Mainly Additive Genetic Variance for Complex Traits. PLoS Genet 4: e1000008.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref4] 4. Long N, Gianola D, Rosa GJM, Weigel KA (2011) Marker-assisted prediction of non-additive genetic values. Genetica 139: 843–854 .
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref5] 5. Wright S (1931) Evolution in Mendelian Populations. Genetics 16: 97–159.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref6] 6. Zuk O, Hechter E, Sunyaev SR, Lander ES (2012) The mystery of missing heritability: Genetic interactions create phantom heritability. Proc Natl Acad Sci U S A 109: 1193–1198 .
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref7] 7. Mackay TFC, Stone EA, Ayroles JF (2009) The genetics of quantitative traits: challenges and prospects. Nat Rev Genet 10: 565–577 .
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref8] 8. Hill WG (2010) Understanding and using quantitative genetic variation. Philos Trans R Soc Lond B Biol Sci 365: 73–85 .
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref9] 9. Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits. Sunderland, Mass.: Sinauer Associates. 980p.

[ref10] 10. Gertz J, Gerke JP, Cohen BA (2010) Epistasis in a quantitative trait captured by a molecular model of transcription factor interactions. Theor Popul Biol 77: 1–5 .
View Article
Google Scholar

[25] View Article

[26] Google Scholar

[ref11] 11. Gjuvsland AB, Hayes BJ, Omholt SW, Carlborg Ö (2007) Statistical epistasis is a generic feature of gene regulatory networks. Genetics 175: 411–420 .
View Article
Google Scholar

[28] View Article

[29] Google Scholar

[ref12] 12. Omholt SW, Plahte E, Oyehaug L, Xiang K (2000) Gene regulatory networks generating the phenomena of additivity, dominance and epistasis. Genetics 155: 969–980.
View Article
Google Scholar

[31] View Article

[32] Google Scholar

[ref13] 13. Peccoud J, Vander Velden K, Podlich D, Winkler C, Arthur L, et al. (2004) The selective values of alleles in a molecular network model are context dependent. Genetics 166: 1715–1725.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref14] 14. Pumir A, Shraiman B (2011) Epistasis in a Model of Molecular Signal Transduction. PLoS Comput Biol 7: e1001134.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref15] 15. Rajasingh H, Gjuvsland AB, Våge DI, Omholt SW (2008) When Parameters in Dynamic Models Become Phenotypes: A Case Study on Flesh Pigmentation in the Chinook Salmon (Oncorhynchus tshawytscha). Genetics 179: 1113–1118.
View Article
Google Scholar

[40] View Article

[41] Google Scholar

[ref16] 16. Salazar-Ciudad I, Jernvall J (2010) A computational model of teeth and the developmental origins of morphological variation. Nature 464: 583–586 .
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref17] 17. Vik JO, Gjuvsland AB, Li L, Tøndel K, Niederer S, et al. (2011) Genotype-Phenotype Map Characteristics of an In silico Heart Cell. Front Physiol 2: 106 .
View Article
Google Scholar

[46] View Article

[47] Google Scholar

[ref18] 18. Wang Y, Gjuvsland AB, Vik JO, Smith NP, Hunter PJ, et al. (2012) Parameters in dynamic models of complex traits are containers of missing heritability. PLoS Comput Biol 8: e1002459 .
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref19] 19. Welch SM, Dong Z, Roe JL, Das S (2005) Flowering time control: gene network modelling and the link to quantitative genetics : Modelling complex traits for plant improvement. Aust J Agric Res 56: 919–936.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref20] 20. Teusink B, Passarge J, Reijenga CA, Esgalhado E, van der Weijden CC, et al. (2000) Can yeast glycolysis be understood in terms of in vitro kinetics of the constituent enzymes? Testing biochemistry. Eur J Biochem 267: 5313–5329.
View Article
Google Scholar

[55] View Article

[56] Google Scholar

[ref21] 21. Williamson T, Schwartz J-M, Kell DB, Stateva L (2009) Deterministic mathematical models of the cAMP pathway in Saccharomyces cerevisiae. BMC Syst Biol 3: 70 .
View Article
Google Scholar

[58] View Article

[59] Google Scholar

[ref22] 22. Chen KC, Calzone L, Csikasz-Nagy A, Cross F, Novak B, et al. (2004) Integrative Analysis of Cell Cycle Control in Budding Yeast. Mol Biol Cell 15: 3841–3862 .
View Article
Google Scholar

[61] View Article

[62] Google Scholar

[ref23] 23. Leloup J-C, Goldbeter A (2004) Modeling the mammalian circadian clock: sensitivity analysis and multiplicity of oscillatory mechanisms. J Theor Biol 230: 541–562 .
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref24] 24. Li L, Niederer SA, Idigo W, Zhang YH, Swietach P, et al. (2010) A mathematical model of the murine ventricular myocyte: a data-driven biophysically based approach applied to mice overexpressing the canine NCX isoform. Am J Physiol Heart Circ Physiol 299: H1045–H1063 .
View Article
Google Scholar

[67] View Article

[68] Google Scholar

[ref25] 25. Olivier BG, Rohwer JM, Hofmeyr J-HS (2005) Modelling cellular systems with PySCeS. Bioinformatics 21: 560–561 .
View Article
Google Scholar

[70] View Article

[71] Google Scholar

[ref26] 26. Reijenga KA, van Megen YMGA, Kooi BW, Bakker BM, Snoep JL, et al. (2005) Yeast glycolytic oscillations that are not controlled by a single oscillophore: a new definition of oscillophore strength. J Theor Biol 232: 385–398 .
View Article
Google Scholar

[73] View Article

[74] Google Scholar

[ref27] 27. Cohen S, Hindmarsh C (1996) CVODE, a stiff/nonstiff ODE solver in C. Computers in physics 10: 138–143.
View Article
Google Scholar

[76] View Article

[77] Google Scholar

[ref28] 28. Bondarenko VE, Szigeti GP, Bett GCL, Kim S-J, Rasmusson RL (2004) Computer model of action potential of mouse ventricular myocytes. Am J Physiol Heart Circ Physiol 287: H1378–H1403 .
View Article
Google Scholar

[79] View Article

[80] Google Scholar

[ref29] 29. Alvarez-Castro JM, Carlborg Ö (2007) A unified model for functional and statistical epistasis and its application in quantitative trait Loci analysis. Genetics 176: 1151–1167 .
View Article
Google Scholar

[82] View Article

[83] Google Scholar

[ref30] 30. Gjuvsland AB, Vik JO, Woolliams JA, Omholt SW (2011) Order-preserving principles underlying genotype-phenotype maps ensure high additive proportions of genetic variance. J Evol Biol 24: 2269–2279 .
View Article
Google Scholar

[85] View Article

[86] Google Scholar

[ref31] 31. Kacser H, Burns JA (1981) The molecular basis of dominance. Genetics 97: 639–666.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

[ref32] 32. Keightley PD (1989) Models of quantitative variation of flux in metabolic pathways. Genetics 121: 869–876.
View Article
Google Scholar

[91] View Article

[92] Google Scholar

[ref33] 33. Leloup J-C, Goldbeter A (2003) Toward a detailed computational model for the mammalian circadian clock. Proc Natl Acad Sci U S A 100: 7051–7056 .
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref34] 34. Tyson JJ, Novak B (2001) Regulation of the eukaryotic cell cycle: molecular antagonism, hysteresis, and irreversible transitions. J Theor Biol 210: 249–263 .
View Article
Google Scholar

[97] View Article

[98] Google Scholar

[ref35] 35. Siegal ML, Promislow DEL, Bergman A (2007) Functional and evolutionary inference in gene networks: does topology matter? Genetica 129: 83–103 .
View Article
Google Scholar

[100] View Article

[101] Google Scholar

[ref36] 36. Hunter P, Coveney PV, De Bono B, Diaz V, Fenner J, et al. (2010) A vision and strategy for the virtual physiological human in 2010 and beyond. Philos Transact A Math Phys Eng Sci 368: 2595–2614 .
View Article
Google Scholar

[103] View Article

[104] Google Scholar

[ref37] 37. Houle D, Govindaraju DR, Omholt S (2010) Phenomics: the next challenge. Nat Rev Genet 11: 855–866.
View Article
Google Scholar

[106] View Article

[107] Google Scholar

[ref38] 38. Dermitzakis ET (2012) Cellular genomics for complex traits. Nat Rev Genet 13: 215–220 .
View Article
Google Scholar

[109] View Article

[110] Google Scholar

[ref39] 39. Benfey PN, Mitchell-Olds T (2008) From genotype to phenotype: systems biology meets natural variation. Science 320: 495–497 .
View Article
Google Scholar

[112] View Article

[113] Google Scholar

[ref40] 40. Flint J, Mackay TFC (2009) Genetic architecture of quantitative traits in mice, flies, and humans. Genome Research 19: 723–733 .
View Article
Google Scholar

[115] View Article

[116] Google Scholar

[ref41] 41. Moore A (2012) Towards the new evolutionary synthesis: Gene regulatory networks as information integrators. Bioessays 34: 87–87 .
View Article
Google Scholar

[118] View Article

[119] Google Scholar

[ref42] 42. Nadeau JH, Dudley AM (2011) Genetics. Systems genetics. Science 331: 1015–1016 .
View Article
Google Scholar

[121] View Article

[122] Google Scholar

[ref43] 43. Rockman MV (2008) Reverse engineering the genotype-phenotype map with natural genetic variation. Nature 456: 738–744 .
View Article
Google Scholar

[124] View Article

[125] Google Scholar

[ref44] 44. Sieberts SK, Schadt EE (2007) Moving toward a system genetics view of disease. Mamm Genome 18: 389–401 .
View Article
Google Scholar

[127] View Article

[128] Google Scholar

[ref45] 45. Le Rouzic A, Alvarez-Castro JM, Carlborg Ö (2008) Dissection of the genetic architecture of body weight in chicken reveals the impact of epistasis on domestication traits. Genetics 179: 1591–1599 .
View Article
Google Scholar

[130] View Article

[131] Google Scholar

[ref46] 46. Prows DR, Hafertepen AP, Gibbons WJ, Winterberg AV, Nick TG (2007) A genetic mouse model to investigate hyperoxic acute lung injury survival. Physiol Genomics 30: 262–270 .
View Article
Google Scholar

[133] View Article

[134] Google Scholar

[ref47] 47. Jordan KW, Carbone MA, Yamamoto A, Morgan TJ, Mackay TFC (2007) Quantitative genomics of locomotor behavior in Drosophila melanogaster. Genome Biol 8: R172 .
View Article
Google Scholar

[136] View Article

[137] Google Scholar

[ref48] 48. Wang P, Lyman RF, Shabalina SA, Mackay TFC, Anholt RRH (2007) Association of polymorphisms in odorant-binding protein genes with variation in olfactory response to benzaldehyde in Drosophila. Genetics 177: 1655–1665 .
View Article
Google Scholar

[139] View Article

[140] Google Scholar

[ref49] 49. Zhang JF, Waddell C, Sengupta-Gopalan C, Potenza C, Cantrell RG (2007) Diallel analysis of root-knot nematode resistance based on galling index in upland cotton. Plant Breeding 126: 164–168.
View Article
Google Scholar

[142] View Article

[143] Google Scholar

[ref50] 50. Cuevas HE, Staub JE, Simon PW (2010) Inheritance of beta-carotene-associated medocarp colour and fruit maturity of melon(Cucumis melo L.). Euphytica 173: 129–140 .
View Article
Google Scholar

[145] View Article

[146] Google Scholar

[ref51] 51. Flint-Garcia SA, Buckler ES, Tiffin P, Ersoz E, Springer NM (2009) Heterosis is prevalent for multiple traits in diverse maize germplasm. PLoS ONE 4: e7433 .
View Article
Google Scholar

[148] View Article

[149] Google Scholar

[ref52] 52. Lyimo HJF, Pratt RC, Mnyuku RSOW (2011) Heritability and gene effect estimates for components of partial resistance to grey leaf spot of maize by generation mean analysis. Plant Breeding 130: 633–639 .
View Article
Google Scholar

[151] View Article

[152] Google Scholar

[ref53] 53. Kearsey MJ, Pooni HS, Syed NH (2003) Genetics of quantitative traits in Arabidopsis thaliana. Heredity 91: 456–464 .
View Article
Google Scholar

[154] View Article

[155] Google Scholar

[ref54] 54. Chakravarthi DVN, Rao YV, Rao MVS, Manga V (2010) Genetic analysis of in vitro callus and production of multiple shoots in eggplant. Plant Cell Tiss Organ Cult 102: 87–97 .
View Article
Google Scholar

[157] View Article

[158] Google Scholar

Figures

Abstract

Author Summary

Introduction

Methods

Simulations of cGP models

Genotype to parameter mapping.

cAMP model.

Glycolysis model.

Cell cycle model.

Circadian model.

Action potential model.

Monte Carlo simulations.

Statistical analysis

Decomposition of genetic variance.

Monotonicity of GP-maps.

Results/Discussion

System classification and phenotype dimensionality

The ratio of additive genetic variance to total genetic variance

Monotonicity explains much of the VA/VG patterns

Considerations on the genericity of the results

Supporting Information

Acknowledgments

Author Contributions

References

Monotonicity explains much of the V_A/V_G patterns