Academia.eduAcademia.edu

Sequential Branch Points in Hierarchical Cell Fate Determination

Sequential Branch Points in Hierarchical Cell Fate Determination David Foster, Sui Huang, Stuart Kauffman arXiv:0903.4215v1 [q-bio.MN] 25 Mar 2009 Institute for Biocomplexity and Bioinformatics, University of Calgary, Biological Sciences Building, 2500 University Dr. N.W., Calgary, Alberta, Canada T2N 1N4 Abstract Multi-potent stem or progenitor cells undergo a sequential series of binary fate decisions which ultimately generates the diversity of differentiated cell. Efforts to understand cell fate control have focused on simple gene regulatory circuits that predict the presence of multiple stable states, bifurcations and switch-like transition. However, existing gene network models do not explain more complex properties of cell fate dynamics such as the hierarchical branching of developmental paths. Here, we construct a generic minimal model of the genetic regulatory network which naturally exhibits five elementary characteristics of cell differentiation stability, branching, exclusivity, directionality, and promiscuous expression. We argue that a modular architecture comprised of repeated network elements in principle reproduces these features of differentiation by repressing irrelevant modules and hence, channeling the dynamics in the high-dimensional phase space through a simpler, lower dimensional subspace. We implement our model both with ordinary differential equations (ODEs) to explore the role of bifurcations in producing the oneway character of differentiation and with stochastic differential equations (SDEs) to demonstrate the effect of noise on the system in simulations. We further argue that binary cell fate decisions are prevalent in cell differentiation due to general features of dynamical systems. This minimal model makes testable predictions about the structural basis for directional, discrete and diversifying cell phenotype development and thus, can guide the evaluation of real gene regulatory networks that govern differentiation. Key words: cell differentiation, genetic regulatory network, dynamics, modularity 1. Introduction The genetic regulatory network plays an essential role in controlling cell differentiation and maintaining the stability of cell types. Each cell type has its own stable gene expression profile, the set of genes in its genome which are actively expressed [1]. Thus, understanding cell differentiation requires a model of the behavior of many interacting genes which establish the expression profiles associated with each cell type. Gene regulatory interactions are often described as a network, with each node representing a gene and links between two genes indicating a regulatory interaction. The large scale characteristics of genetic regulatory networks have been studied in some detail [2, 3], but only in terms of network structure: the degree sequence [4], the prevalence of network motifs [5, 6], and so forth. In terms of GRN dynamics, most efforts have been “bottom-up” - devoted to simulating specific regulatory interactions or analyzing models of small genetic circuits [7, 8] and therefore focused on specific subsections of the network. Our goal is to work in a “top-down” fashion to construct a Preprint submitted to Journal of Theoretical Biology September 24, 2009 minimal dynamical model which captures many important features of cell differentiation. Most significantly, we capture the sequential branching differentiation of cells from the multipotent progenitor cell to more lineage-restricted cells and finally to terminal cells. To construct our model, we first note five elementary characteristics of cell differentiation which we wish to capture: stability, directionality, branching, exclusivity, and promiscuous expression. Of course, this is not meant to be an all encompassing or fully detailed description of cell differentiation. However, these simplified characteristics prove rich enough to create an interesting model. We review here our definition of each characteristic. Stability : Cell types should be stable states of the dynamics. A cell which is in specific cell type should remain in that cell type for a “long time” (relative to the time scale of gene activation, transcription and translation), unless altered by an external signal. Directionality: Progenitor cells should reliably differentiate into a variety of more specialized cell types [9] and should only very rarely to regain pluripotency by de-differentiating [10]. Branching: Differentiation should occur through discrete lineage specifications, in which a multipotent cell makes a binary fate decision and commits to one of two different lineages. Exclusivity: Once cells in our model have committed to a specific lineage, they should lose the ability to produce cell types from the other lineage. Each differentiation event both selects a specific lineage and forecloses the possibility of the other. Promiscuous expression: Multipotent progenitor cells should simultaneously express genes specific to all of their prospective lineages. For example, a hematopoitic stem cell expresses genes found in all of the lineages into which it can ultimately differentiate [11]. 2. Model specification To create a model which exhibits these characteristics, we take our inspiration from the purported network properties of real GRN. Biological networks exhibit modularity [12, 13, 14]. Modular networks can be divided into largely independent subnetworks, called modules. The nodes in a given module interact strongly with other nodes in the module, but only weakly with nodes in other modules. In our model, the network is composed of many modules, each responsible for a specific cell fate decision. These modules influence each other in an hierarchically organized fashion, as shown in Fig. 1, which ensures that differentiation events occur in a branching, sequential fashion. The dynamical state associated with each cell type is a product of the dynamics of the associated module. Arrows pointing from one module to another indicate influence of the former on the latter. Influence between modules acts as follows: a parameter in the regulated module depends on the expression level of a gene in the regulating module. In the top pairs of panels the system is in the “Multi” (multi-potent) state, and in the bottom pair, the system is in one of the two available progenitor cell types. On the dynamical side, the module A undergoes a change in its state (the specifics of which we will discuss shortly). In making the change, it exerts an influence on the two modules downstream, in this case turning off module C and allowing module B to enter a state which describes “Pro 1” (Progenitor 1). Notice that like module A, modules B and C are also linked to down-stream modules on which they exert influence, and that module A is itself influenced by a module “higher up” on the differentiation cascade. This allows the process to proceed sequentially through as many levels as are required. Our model suggests an important role for modularity: restricting the available flows in the state space of the dynamics. These restrictions ensure that only the appropriate pathways of differentiation are available to a given cell type. The modular structure of the network is what imposes the branching structure on cell differentiations. 2 Network Cell type x1 A x2 Multi B x3 C x4 x5 x1 A x2 x4 Pro 2 Multi B x3 Pro 1 x6 C x5 x6 Pro 1 Figure 1: Transition of a module in our network from the multipotent cell type to one of the progenitor cell types. Modules consist of two genes, each of which activates its own transcription and inhibits the transcription of the other gene. In the top panels, both genes in module A are active at a moderate level, represented by the gray coloration. This intermediate steady state is poised between the two subordinate lineages. Modules B and C are also both expressing their genes at a moderate level, as a result of the influence of module A. In the lower panels, the cell has differentiated from the multipotent cell type to progenitor type 1. Module A has switched from the intermediary stable state to the stable state in which gene x1 is highly active, whereas gene x2 has become inactive. As a result, module C has become completely inactive due to the signals from module A, whereas module B remains in the intermediate steady state, and is thus prepared to make further cell fate decisions. 3 3. Dynamics of a single module Previous work has studied the behavior of GRNs using a variety of different dynamical system models: Boolean networks [15, 16, 17], differential equations [18, 19], and chemical master equations [20, 21]. Although each model has its merits, for our current purposes differential equations provide the best combination of rich behavior and analytical tractability. To model a set of N interacting genes as differential equations, we associate the expression level (number of proteins produced by the gene which are present in the cell) of each gene with variable xi . The state of the system is given by the vector x = [x1 . . . xN ]. Since genes influence the transcription rates of other genes, the rate of transcription (or equally the instantaneous rate of change of the expression level) of each gene, ẋi , is given by: ẋi = Fi (x) (1) If we plot the possible expression levels of each gene as one axis in a graph, the state space of the dynamics is given by {R+ }N . For a given state, the growth or decay of the expression level of each gene defines a vector ẋ = [ ẋ1 . . . ẋN ], which points in the direction that the state of the system will move in state space under the dynamics of the system. This is the trajectory of the system. A stable fixed point is any point in state space in which the state of the system is “trapped” (under the deterministic dynamics, ignoring noise). Formally, this occurs for any point x⋆ such that ẋi⋆ = Fi (x⋆ ) = 0 for all i. In our model, each cell type is represented by a stable fixed point of the dynamics. Our goal is to construct the simplest plausible set of genetic regulatory interactions for which the state of the system moves, either by random fluctuations or external signals, through a sequence of branching, exclusive steps from the fixed point which represents a multipotent cell type, to states which represent a series of intermediary cell types, and finally to a terminal cell type. Huang et al [22] provide experimental evidence to show that the genetic circuit controlling hematopoetic (blood cell) stem cells consists of a pair of mutually repressing genes, each of which also acts as a self-activator. Such a feedback loop is alleged to be an over-represented motif in real genetic networks [4]. We have chosen the Huang switch for our cell-fate controlling module, as shown in Fig. 1. The differential equations describing this system are: b1 ψn1 a1 x1n + − k1 x1 x1n + θ1n x2n + ψn1 n n a2 x b2 ψ x˙2 = n 2 n + n 2 n − k2 x2 x2 + θ2 x1 + ψ2 x˙1 = (2) These equations model the dynamics of self activation and mutual inhibition with Hill functions. The parameters a1 , a2 give the maximum transcription rate due to self activation, and θ1 , θ2 describe the concentration of protein x1 or x2 at which the inflection point in the Hill function occurs. Parameters b1 , b2 give the basal rate of transcription, which is suppressed by a downward sloping Hill function with ψ1 , ψ2 controlling the concentration of protein at which its inflection point occurs. The cooperativity of the interactions is given by n, which controls the steepness of the Hill function. The rate of protein degradation is controlled by k1 , k2 . To simplify our analysis we impose certain symmetries on the equations, and select specific values for some parameters which produce the desired behavior. We make the maximum contributions of the basal expression rate and the self-activation the same (a1 = b1 ) and thus have 4 a single parameter which controls the overall maximum transcription rate of the gene. For simplicity, we also set the cross-over points of the two Hill functions (for the self-activation and inhibition) to the same value (θ1 = ψ1 = θ). Finally, we choose to make all parameter values a, θ, and k identical between the two equations (so k1 = k2 ), thus making the behavior symmetric between x1 and x2 . Thus we can write the new set of parameter values as: a1 , a2 , b1 , b2 = a θ1 , θ2 , ψ1 , ψ2 = θ = 1 n=4 k1 , k2 = k (3) We choose specific parameter values θ = 1 and n = 4. These parameter choices are not unique, and do not require careful tuning: as long as n ≥ 4 a wide range of values of θ, a, and k produce the desired tri-stable behavior. This leaves us with the new simplified equations:   4  x1 1   − kx1 + 4 x˙1 = a  4  x1 +4 1 x2 + 1   x 1   − kx2 x˙2 = a  4 2 + 4 x2 + 1 x1 + 1 (4) These equations have only two free parameters: a and k. These are the parameters which are influenced by other modules, following the schematic shown in Fig 1, and as detailed below. In order to better understand the influence of these parameters, we explore the bifurcation which occurs as they are change. A bifurcation is a dynamical event in which a small change of a parameter leads to a qualitative change in the dynamics of the system. Bifurcations can destabilize fixed points, or cause points which were previously unstable to become stable. While a fixed point may be stable under the deterministic dynamics of the system, in the presence of noise even a stable fixed point may not trap the system indefinitely. Thus bifurcations which change the stability of a fixed point are the dynamical equivalents of cell fate decisions in differentiating cells. For example, a parameter change may destabilize the fixed point representing a multipotent cell type, thus driving the state to a different fixed point, which represents the new progenitor cell type. For a = 2 and k = 1, the two variable system is “tri-stable”, meaning that there are three stable fixed points in the dynamcs. This state space is shown in the top left panel of Fig 2. The three stable fixed points are given by black dots: there is an intermediate fixed point, characterized by moderate amounts of x1 and x2 , and two exterior fixed points, characterized by large amounts of one variable, and small amounts of the other. Leaving the parameter a = 2 fixed and tuning k from 1 to 1.5 drives a bifurcation from tri-stablity to bi-stabliltiy, as shown in the subsequent three panels of Fig. 2. The intermediate fixed point is destabilized, leaving only the two exterior stable fixed points. The bifurcation also occurs if the overall transcription rate, a, is increased while k is kept constant –only the ratio of these parameters have a real effect on the qualitative dynamics. Their absolute values merely set the time scale. In their analysis of the tri-stable switch, Huang et al [22] argue that the “intermediate” fixed point represents the hematopoitic stem cell, and the two “exterior” fixed points represent the two committed lineages. They provide evidence that the differentiation from hematopoesis to the erythroid or myelomonocytic lineages is caused by the destabilization of the intermediate fixed point via a subccritical pitchfork bifurcation. This in turn forces the cell to shift to one of the 5 k=1 k = 1.166 k = 1.333 k = 1.5 [x2 ] [x2 ] [x1 ] [x1 ] Figure 2: Each panel shows the state space of the two gene toggle switch for the parameter values chosen (a = 2 and k = 1, 1.166, 1.333, 1.5). For k = 1.0 there are three stable fixed points in the dynamics (shown as black dots) and two unstable fixed points (shown as gray diamonds). The arrows give the trajectory of the system at each point in phase space, with longer arrows indicating faster movement and shorter arrows slower movement. As k changes from 1 to 1.5 the module undergoes a subcritical pitchfork bifurcation [23] (shown in more detail in Fig. 3). The two interior unstable fixed points converge on the central stable fixed point eventually destabilizing it when k ≈ 1.3, leaving only the two exterior stable fixed points. 6 [x1 ] a. b. c. k Figure 3: The x1 position of the fixed points as the parameter k changes from 1.0 to 1.5. The black circles represent stable fixed points while the gray diamonds represent unstable fixed points. As the module undergoes a subcritical pitchfork bifurcation, a cell at the intermediate stable fixed point moves along the trajectory shown by the arrow a). When k ≈ 1.3 the fixed point is destabilized and the cell is pushed by any random fluctuations from the now unstable fixed point to one of the exterior fixed points following either trajectory b) or the dotted trajectory. Once the cell has been drawn to one of the exterior fixed points, if the module reverses the bifurcation and once again becomes tri-stable, the cell will remain at its new stable fixed point, following trajectory c). So once the state of the system has undergone the bifurcation, it cannot easily be driven back to its starting position, thus making the differentiation event modeled by this bifurcation one-way. 7 two other fixed points. In Fig. 3 we show the bifurcation diagram corresponding to this event. The black dots indicate the stable fixed points, and the grey, the unstable fixed points. The x-axis shows the value of the bifurcation parameter, k. When k = 1.0 the circuit is tri-stabile: there are three stable fixed points. When k = 1.3, the intermediate stable fixed point is destabilized by the approach of the two unstable fixed points. Thus, the state of the system must migrate to one of the two remaining stable points. However, starting from one of the exterior fixed points, if we reverse the process and decrease the bifurcation parameter, the system will not return to the intermediate steady state–the exterior fixed points remain stable to small perturbations throughout. Thus, to reverse the cell differentiation, not only would the the intermediate fixed point need to be restabilized, the state of the system would also need to be driven from the exterior fixed point. This ensures that cell differentiation is directional: a simple, transient, one parameter perturbation can drive the system toward specificity, but there is no similarly simple way to de-differentiate back to multipotency. Noise is ubiquitous in biological systems and important for many functions [24]. The GRN as a whole, but especially the genes responsible for the process of development (and cell differentiation), must be organized for reliable performance. Thus we require that our model behaves robustly in the presence of noise. Feedback loops [25] and modularity [26] have been shown to contribute to robustness, and because our model makes use of these design characteristics, we find that robustness is easily attained. We simulate noise in the dynamics using a Langevin equation: ẋi = Fi (x) + gi (x)ηi (t) (5) Here, Fi (x) is the same function as in the deterministic dynamics. In the second term, an arbitrary function gm i (x) describes the influence of the random function ηi (t) on the variables. We choose to introduce only additive noise: gi (x) is simply a constant value for all i, with no dependence on x, as suggested elsewhere [27]. We can then simulate the evolution of the system forward by one time step of size ∆t with the equations: √ xi (t + ∆t) = xi (t) + ∆t ∗ ẋi (t) + ∆t ∗ g ∗ σi (6) The parameter g sets the amount of noise in the dynamics; we have chosen to make it the same for every gene. The parameter σi drawn at each time step from a gaussian distribution with mean 0 and standard deviation 1. The influence of the random variable is similar to a random walk. Since the variables in our system describe the concentration of protein products, we also require that no variable be allowed to drop below zero, since a negative concentration is non-physical. In the presence of noise, we find that the transition between stability and instability for the intermediate fixed point is no longer abrupt. As the module approaches the bifurcation point, the restoring force which pushes the state of the system back towards the intermediate fixed point weakens. We simulated 500 cells starting at each of the three stable fixed points for values of k = 1, 1.1, 1.2, 1.3 with g = .1 and ∆t = .001 for 40, 000 time steps (40 units of t, a fairly lengthy period in simulation time relative to the timescale of transcription and translation). The distribution of the final states for cells starting at the intermediate fixed point spreads as k increased (shown in Fig. 4). As the value of k increases, the variance of the gene expression increases. This behavior may soon be observable in vivo, using flow cytometry to measure the approximate expression level of a gene for each cell in a population of cells approaching differentiation. By observing the variance of genes as cells differentiate, we may gain a clearer picture of which genes are controlling cell fate decisions: the expression level of significant genes should become more varied across the population as the cells approach a cell fate decision. 8 k=1 k = 1.1 k = 1.2 k = 1.3 [x2 ] [x2 ] [x1 ] [x1 ] Figure 4: The behavior of the module in the presence of noise as it approaches the bifurcation. Each panel shows the final positions of 500 simulations (“cells”) starting from the central fixed point simulated for 40 units of t (in ∆t = .001 time steps). The noise magnitude is g = .1 and the overall activity a = 2.0 for various values of k. Black circles show the positions of the stable fixed points and gray squares show the end points of each simulation. As k increases, the variance of expression levels of x1 and x2 increases near the intermediate fixed point. For k = 1.0, 1.1 none of the simulated cells “escape” from the intermediate fixed point. For k = 1.2 approximately 50 cells (or 10%) escape, divided evenly between the two exterior fixed points. When k = 1.3 approximately 90% escape. In similar simulations at the same noise strenght, g = .1, if the “cells” start near one of the exterior fixed points, none of them transitioned to the intermediate fixed point for any parameter values tested. 9 Before the actual bifurcation point is reached, we see non-trivial numbers of cells have already moved to one of the exterior, ‘differentiated’ fixed points. This indicates that for values of k ≤ 1.2 the intermediate fixed point becomes metastable (weakly stable). Thus in the presence of noise, the state of the system will not remain near this fixed point indefinitely. As we discuss later, progenitor cell types may be metastable fixed points in which the state of the network rests for a short while before continuing its differentiation. This would explain the difficulty of culturing and maintaining such cell types in vitro. Finally, our model provides a potential dynamical explanation of the observed prevalence of binary (as opposed to trinary, etc) branches in cell differentiation. The sub-critical pitchfork bifurcation found in our modules arises from the topology of the regulatory interactions and is present for many choices of parameter values without careful tuning. This bifurcation leads naturally to binary cell differentiations. In Fig. 4 the distribution of final states does not spread evenly throughout the state space: it spreads preferentially along a pair of trajectories which correspond to the two sides of the unstable eigenvector of the intermediate fixed point. These trajectories, in turn, lead to one of the two exterior fixed points. In general, only one of the eigenvalues of the fixed point becomes positive at the bifurcation point. Thus, near the interior fixed point, flows along the other eigenvectors still push the state of the system toward the interior fixed point, concentrating flows along the unstable eigenvector. Trinary branchings, since they cannot depend on such simple mechanics of the state space, are comparatively much more fragile and require much more careful selection of parameters. We have found in our model that binary branching occurs ‘for free’ from a robustly occurring pitchfork bifurcation, with the simulated cells flowing away from their starting state along one of the two ends of the single, destabilized eigenvector. 4. Sequential Differentiation Now we are ready to build the full hierarchical model. In Fig. 5, we show a differentiation event in more detail: three modules are shown, A, B, and C. Module A influences B and C. In the top panel, module A is in the intermediate fixed point, but this fixed point is only meta-stable. This meta-stability is caused by the influence of the signal ∆A . If module A is part of a larger cascade, then the signal will originate from a gene in the module above A in the hierarchy; if A is at the top of the hierarchy then the signal may originate from outside the cell. Module A expresses moderate levels of x1 and x2 , which in turn influence the parameters kB , aB and kC , aC in modules B and C, through the functional relationships: kB = G(x1 ), kC = G(x2 ) aB = H(x1 ), aC = H(x2 ) (7) For simplicity, the same functions G and H are used throughout the network. In Fig. 6 we show how the piece-wise linear functions G and H respond to changes in the variable on which they depend. The specific functional form chosen is not important; sequential differentiation will occur as long as the functions reflect two basic requirements. First, if the value of x decreases dramatically, indicating that the gene x has been shut off, then transcription rate, a, of the downstream module, which is under the influence of x, should be dramatically decreased. To achieve this, for small values of x, H(x) should be small, effectively silencing the genes in the influenced module. Second, if the value of x increases substantially, then the fixed point in the influenced module should be destabilized, preparing it to undergo differentiation. Thus, the function G(x), 10 x1 kA = G(∆A ) = 1.25 x1 Module A kB = G(x1 ) = 1.0 x3 aB = H(x1 ) = 2.0 intermediate x2 kC = G(x2 ) = 1.0 x5 aC = H(x2 ) = 2.0 x4 x3 Module B x2 meta-stable intermediate x4 x1 kA = G(∆A ) = 1.25 Module B x4 x3 x6 x2 x2 x1 Module A meta-stable x1 differentiated kB = G(x1 ) = 1.25 x3 aB = H(x1 ) = 2.0 x5 Module C Noise drives differentiation: x6 x2 kC = G(x2 ) = 1.0 x5 aC = H(x2 ) = 0.2 inactive x4 Module C x6 x5 x6 Figure 5: The modular network and consequences of a single cell fate decision, in detail. In the top panel, the state of the system is poised in the meta-stable intermediate stable state of module A. Modules B and C are both in the intermediate stable state, expressing x3 , x4 , x5 , and x6 at a moderate level. In the lower panel, module A has undergone differentiation, driven by noise, switching to one of the two exterior stable states, x1 high, x2 low. This in turn has caused module B to become meta-stable by increasing the value of kB from 1.0 to 1.25. Module C is inactivated by the change in parameter aC from 2.0 to 0.2, essentially silencing the genes in the module. The k and a parameters in all three modules are functionally dependent on one of the genes in the module directly above them. In the case of module A, we denote the stimulus as a generic external signal ∆A to reflect the fact that it could come from another gene, or some external differentiation signal. 11 G(x) H(x) x Figure 6: The piecewise linear response functions G and H. In the top panel we show the function G, which controls the decay rate k, and in the bottom panel we show the function H, which controls the transcription rate a. Specifically, G = 1.0 for x ≤ 2.7 and G = 1.25 for x ≥ 3.0 and interpolates between these values for 2.7 ≤ x ≤ 3.0. Similarly H = 0.2 for x ≤ 1.0 and G = 2.0 for x ≥ 1.5 and interpolates between these values for 1.0 ≤ x ≤ 1.5. which controls k, should increase slightly in value in response to higher values of x. This higher value of G(x) will drive the bifurcation above. Returning to Fig. 5, moderate levels of x1 and x2 result in parameters in modules B and C which maintain stable, moderate expression levels of their genes x3 , x4 , x5 and x6 . Note that any modules downstream of B and C would also express their genes at an intermediate level. Since module A is in merely a meta-stable state, the state of the system is driven by noise after some time to one of the two exterior fixed points, in this case x1 high, x2 low. The result is shown in the lower panel: module C has been silenced by the decrease in expression of x2 (thus also shutting off all the genes which are in modules downstream from it, under the control of x5 or x6 ). Module B, under the influence of increased x1 , has become meta-stable, and is thus waiting to drive a differentiation event as the system “chooses” between expressing x3 and x4 . The suggested links that integrate the modules correspond to the simplest possible regulatory connections that generate the observed branching behavior. Whether real networks utilize similar connections remains to be seen, however, modules of mutually regulating transcription factors clearly are linked in real GRN. 12 Module A x1 x2 Meta-stable state Decided Module B x3 x4 Meta-stable state Decided Module C x5 x6 Inactivated t Figure 7: A simulation of three interacting modules with noise g = 0.1. At t = 5 the intermediate fixed point of module A is changed from stable to meta-stable by an external stimulus. For the next ≈ 10 units of time t the top module remains meta-stable, then finally falls into the x1 high exterior fixed point, as shown in the top panel. This silences module C (and thus genes x5 and x6 ) which are rendered inactive, as shown by the bottom panel. Finally, in the middle panel we see module B, which became meta-stable after module A made its cell fate decision and entered the exterior fixed point. Module B remains meta-stable for ≈ 25 units of t until it transitions to an exterior fixed point, completing the differentiation of the cell through this segment of the network. 13 5. Whole network results Fig. 7 shows the result of a simple three module simulation, with the noise parameter g = 0.1. The differentiation tree functions properly in the presence of noise: at the outset, all genes are expressed at a moderate level. At time t = 5, a signal is introduced which destabilizes the top module. After some time, the state of the the top module is decided in favor of x1 . As x2 decreases, we see that genes x5 and x6 are also silenced. The genes x3 and x4 continue to be expressed at moderate levels for some time, until through random fluctuations, the system exits the meta-stable state, resulting in increased x4 expression and decreased x3 expression. At the outset, when any outcome was possible and the cell multipotent, all of the genes were expressed. As the cell differentiates and loses its pluripotency, the genes which correspond to the lineages which are no longer accessible to the cell are silenced. As seen in the simulation described above, individual cells can remain in the meta-stable state for a long time before continuing their differentiation. We simulated 2000 iterations of the model, and measured the length of time the system spent in meta-stable states. We found an exponential distribution of waiting times, with some of our simulated cells spending as many as 290 units of t in the meta-stable state. For the purposes of comparison, we note that once the state escapes the region of state space close to the meta-stable fixed point and the transition to one exterior fixed points begins, the transition often occurs in less than 5 units of t. So while individual cells cannot be maintained in these meta-stable states indefinitely, they can be quite long lasting. We have not considered the proliferation of cells, as described by Hoffmann et al [27]. The proliferation rate is a function of the state of the GRN. Thus if the proliferation rate is high in the meta-stable and intermediate states, a population of cells could be maintained in the meta-stable state indefinitely. The number of progenitor cells found in vivo is the result of a balance between cells differentiating into the meta-stable state from more multipotent cell types, progenitor cells proliferating, and progenitor cells differentiating out of the progenitor cell type to a more lineage committed state through the stochastic dynamics described above. The metastability of the dynamical fixed point may provide an explanation for the limited capacity for self renewal of some progenitor cell types in vitro. 6. Conclusion Our model reproduces all of the desired behaviors of cell differentiation by using established network properties of real genetic nets. Cell differentiation occurs in a branching, exclusive fashion between a dynamically distinct cell types. A cell type higher in the differentiation tree exhibits promiscuous expression of all its accessible lineages. Cell fate decisions are directional: small parameter changes can easily drive differentiation down the tree, but not back up again. It is easy to add additional control mechanisms to this model. For example, we could add a necessity for external signals, in concert with the influence of “higher” modules, to drive each bifurcation. Our goal however is merely to create the minimal outlines of a dynamical system capable of capturing the general observed behaviors of cell differentiation. In addition, our model suggests a new role for modularity and an explanation for the prevalence of binary cell fate decisions. Modules allow for the easy implementation of repeated behaviors, in this case facilitating the repeated branchings observed in cell fate decisions. A hierarchical, modular network topology serves to channel trajectories in the state space through lower dimensional sub-spaces. This channels the state of the system through an orderly progression of fixed 14 points, representing the various cell types. Many of the important properties of cell differentiation can be captured quite naturally by a system of loosely coupled genetic modules. Further we found that binary cell fate decisions emerge effortlessly and robustly from the topology of our network modules due to the behavior of flows near the intermediate fixed point. The dynamical simplicity and robustness of binary cell fate decisions could explain their prevalence in cell differentiation. Building progressively more accurate whole-system models that recapitulate key functionalities provides an important complement to the traditional “bottom up” approach, which seeks maximal integration of all available details. A “top down” approach, focused on less detailed minimal models, allows us to focus on understanding the influence of the topology of the genetic regulatory network on the resulting dynamics. The details of our individual modules and their interconnections are of course over simplified and incomplete. However, emerging evidence from the topology of real genetic networks suggests that the overall modular architecture is likely to be correct. We hope that future experimental results will show to what degree the genetic regulatory network in general, and the cell differentiation network in particular can be treated as a collection of loosely coupled modules. As large-scale models improve through interaction with experimental evidence, we believe that the top down and bottom up approaches can eventually “meet in the middle”, fitting carefully specified subsections of the genetic network into a larger network framework. 7. Acknowledgements The authors would like to thank Jacob Foster for useful comments and discussions. References [1] Kauffman, S.A, 1969. Metabolic stability and epigenesis in randomly constructed genetic nets. J. Theor. Biol. 22, 437-467. [2] Davidson, E., et al., 2002. A Genetic Regulatory Network for Development. Science 295, 1669-1678. [3] Alon, U., 2009. Biological Networks: The Tinkerer as an Engineer. Science 301, 1866-1867. [4] Lee, T.I., et al., 2002. Transcriptional Regulatory Networks in Saccharomyces cerevisiae. Science 298, 799-804. [5] Shen-Orr, S., Milo, R., Mangan, S., Alon, U., 2002. Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genetics 31, 64-68. [6] Milo, R., et al., 2002. Network Motifs: Simple Building Blocks of Complex Networks. Science 298, 824-827. [7] Chickarmane, V., et al., 2006. Transcriptional Dynamics of the Embryonic Stem Cell Switch. PLoS Comput. Biol. 2(9) e123. [8] Ackers, G.K., Johnson, A.D., Shea, M.A., 1982. Quantitative Model for Gene Regulation by lambda Phage Repressor. Proc. Natl. Acad. Sci. 79, 1129-1133. [9] Huang, S., Eichler, G., Bar-Yam, Y., Ingber, D.E., 2005. Cell Fates as High-Dimensional Attractor States of a Complex Gene Regulatory Network. Phys. Rev. Lett. 94, 128701. [10] Kal, T., Spradling, A., 2004. Differentiating germ cells can revert into functional stem cells in Drosophila melanogaster ovaries. Nature 428, 564-569. [11] Cross, M., Enver, T., 1997. The lineage commitment of haemopoietic progenitor cells. Current Opinion in Genetics & Development 7, 609-613. [12] Hartwell, L.H., Hopfield, J.J., Leibler, S., Murray, A.W., 1999. From molecular to modular cell biology. Nature 402, C47-52. [13] Ihmels, J., et al., 2002. Revealing modular organization in the yeast transcriptional network. Nature Genetics 31, 370-377. [14] Ravasz, E., Somera, A.L., Mongru, D.A., Oltvai, Z.N., Barabasi, A.L., 2002. Hierarchical Organization of Modularity in Metabolic Networks. Science 297, 1551-1555. [15] Kauffman, S.A., 2004. The ensemble approach to understand genetic regulatory networks. Physica A 340, 733-740. 15 [16] Kauffman, S.A., Peterson, C., Samuelsson, B., Troein, C., 2003. Random Boolean network models and the yeast transcriptional network. Proc. Natl. Acad. Sci. USA 100, 14796-14799. [17] Espinosa-Soto, C., Padilla-Longoria, P., Alvarez-Buylla, E.R., 2004. A Gene Regulatory Network Model for CellFate Determination during Arabidopsis thaliana Flower Development That Is Robust and Recovers Experimental Gene Expression Profiles. The Plant Cel 16, 2923-2939. [18] Gardner, T.S., Cantor, C.R., Collins, J.J., 2000. Construction of a genetic toggle switch in Escherichia coli. Nature 403, 339-342. [19] Chickarmane, V., Troein, C., Nuber, U.A., Sauro, H.M., Peterson, C., 2006. Transcriptional Dynamics of the Embryonic Stem Cell Switch. PLoS Comput. Biol. 2(9). [20] Ribeiro, A., Zhu, R., Kauffman, S.A., 2006. A General Modeling Strategy for Gene Regulatory Networks with Stochastic Dynamics. Journal of Computational Biology 13(9), 1630-1639. [21] Danos, V., Feret, J., Fontana, W., Harmer, R., Krivine, J., 2008. Rule-based modelling symmetries, refinements. Proceedings of FMSB 2008, Lecture Notes in Bio Informatics, to appear. [22] Huang, S., Guo, Y., May, G., Enver, T., 2007. Bifurcation dynamics in lineage-commitment in bipotent progenitor cells. Developmental Biology 305, 695-713. [23] Strogatz, S., 1994. Nonlinear Dynamics and Chaos. Perseus Publishing, Cambridge, 55-60. [24] Elowitz, M.B., 2002. Stochastic Gene Expression in a Single Cell. Science 297, 1183-1186. [25] Eldar, A., et al., 2002. Robustness of the BMP morphogen gradient in Drosophila embryonic patterning. Nature 419, 304-308. [26] Variano, E.A., McCoy, J.H., 2004. Networks, Dynamics, and Modularity. Phys. Rev. Lett. 92, 188701. [27] Hoffmann M., Chang H.H., Huang S., Ingber D.E., Loeffler M., et al., 2008. Noise-Driven Stem Cell and Progenitor Population Dynamics. PLoS One 3(8):e2922. doi:10.1371/journal.pone.0002922 16