The role of spatial averaging in the precision of gene expression patterns

During embryonic development, differentiating cells respond via gene expression to positional cues from morphogen gradients. While gene expression is often highly erratic, embryonic development is precise. We show by theory and simulations that diffusion of the expressed protein can enhance the precision of its expression domain. While diffusion lessens the sharpness of the expression boundary, it also reduces super-Poissonian noise by washing out bursts of gene expression. Balancing these effects yields an optimal diffusion constant maximizing the precision of the expression domain.

During embryonic development, differentiating cells respond via gene expression to positional cues from morphogen gradients. While gene expression is often highly erratic, embryonic development is precise. We show by theory and simulations that diffusion of the expressed protein can enhance the precision of its expression domain. While diffusion lessens the sharpness of the expression boundary, it also reduces super-Poissonian noise by washing out bursts of gene expression. Balancing these effects yields an optimal diffusion constant maximizing the precision of the expression domain.
Embryonic development is driven by orderly, spatial patterns of gene expression that assign each cell in the embryo its particular fate. Experiments in recent years have vividly demonstrated that gene expression is often highly stochastic [1]. Yet, embryonic development is exceedingly precise. A vivid example is the Bicoid-Hunchback system in the early Drosophila embryo, which has become a paradigm for understanding the formation of spatial gene-expression patterns. Shortly after fertilization, the morphogen protein Bicoid (Bcd) forms an exponential concentration gradient along the anteriorposterior axis of the embryo, which provides positional information to the differentiating nuclei. One of the target genes of Bcd is hunchback (hb), which is expressed in the anterior half of the embryo. The posterior boundary of the hb expression domain is very sharp: by cell cycle 13, the position of the boundary varies only by about one nuclear spacing [2,3,4]. This precision is higher than the best achievable precision for a time-averaging based read-out mechanism of the Bcd gradient [3]. Intriguingly, the study of Gregor et al. revealed that the Hb concentrations in neighboring nuclei exhibit spatial correlations and the authors suggest that this implies a form of spatial averaging enhancing the precision of the posterior Hb boundary [3]. However, the mechanism for spatial averaging remained unclear.
In this manuscript, we analytically and numerically study the Bcd-Hb system. Our analysis reveals a simple, yet powerful mechanism for spatial averaging, which is based on the diffusion of Hb itself. We show analytically that Hb diffusion between neighboring nuclei reduces the super-Poissonian part of the noise in its concentration, with a factor which depends on the diffusion length of Hb and the dimensionality of the system. In essence, diffusion reduces the noise by washing out bursts in gene expression. This mechanism is generic, and applies not only to any developmental system, but also to any biochemical network in general. For example, if a signaling protein is activated at one end of the cell and then has to diffuse to another place to activate another system, e.g. the messenger CheY in bacterial chemotaxis, then our results show that the non-Poissonian noise in the activa-tion of the signaling protein is washed out by diffusion; for this reason it may be beneficial to spatially separate the in-and output of a signaling pathway. Our analysis also reveals that while Hb diffusion reduces the noise, it also lessens the steepness of its expression boundary [3]. The interplay between these two antagonistic effects leads to an optimal diffusion constant of D ≃ 0.1 µm 2 / s that maximizes the precision of the hb expression domain.
We consider a minimal model of the Bcd-Hb system, which contains the necessary ingredients for understanding the mechanism of spatial averaging. This model includes the stochastic and cooperative activation of Hb by Bcd in each nucleus, and the diffusion of Hb between neighboring nuclei. To model the cooperative binding of Bcd to the hb promoter, we assume that Bcd proteins can bind sequentially to five binding sites on the hb promoter. When all five sites are occupied, the promoter is active (activity n = 1) and Hb proteins are produced stochastically with rate β; otherwise the promoter is inactive (activity n = 0). Hb proteins are degraded stochastically with rate µ. To obtain a lower bound on the precision of the hb expression domain, we assume that Bcd binds to the promoter with a diffusion limited on-rate k on = 4παD B /V = 24αD B /d 3 = 8.4 × 10 −5 s −1 where α ≃ 3 nm is the dimension of a binding site, d ≃ 6.5 µm the diameter of a nucleus and D B ≃ 0.32 µm 2 / s the Bcd diffusion constant [5]. Since the on-rate is assumed to be diffusion-limited, cooperativity of Hb activation is tuned via the Bcd off-rate k off (j) = a/b j which decreases with increasing number j of promoter-bound Bcd. Using a = 12.3 s −1 and b = 6 the average promoter activity as function of the number of Bcd molecules B in the nucleus approximately follows the Hill-function with h = 5 and K eq = 690. The Hill-coefficient h was inferred from the relation between Hb and Bcd concentration, and the activation threshold K eq , where n(K eq ) = 0.5, is the average number of Bcd molecules, B , in a nucleus at the Hb domain boundary [3].
To describe the formation of the spatial Hb pattern, we place N = 64 × 64 = 4096 nuclei on a square lattice with spacing ℓ = 8.5 µm [3]. Hb diffuses over the lattice with diffusion constant D. With reflecting boundary conditions in one and periodic in the other direction, our model is a cylinder which is symmetric around the anterior-posterior x-axis. Differential activation of Hb is induced by the Bcd profile which decreases exponentially with the distance x from the anterior pole at x = 0. The decay length is λ B = 119.5 µm [3] and the threshold position for Hb activation is set to x t = 0.5L; L = 64ℓ = 544 µm is the length of the embryo. Diffusion of Bcd between nuclei induces fluctuations in the Bcd copy number on the time-scale Because τ d is much smaller than the time-scale for promoter binding, k −1 on ≃ 10 4 s, Bcd copy number fluctuations are efficiently averaged out by slow binding of Bcd to the promoter. We therefore assume that the total number of Bcd in a nucleus is constant and given by Eq. (2). The state of the system is described by the number of promoter bound Bcd and the number of Hb in every nucleus. The state changes by reactions inside a nucleus and diffusion of Hb between nuclei. The dynamics is described by a reaction-diffusion master equation which we solve numerically [6].   (5) and (8). All simulations are for the full, two-dimensional system.
We now consider the Hb boundary width when the Hb diffusion constant D is zero. Fig. 2a shows that this decreases with increasing lifetime of the Hb protein. To understand this behavior, we note that the boundary width is, to a good approximation, given by where σ(x t ) is the standard deviation of the Hb copy number and | H(x t ) ′ | is the magnitude of the Hb gradient at the boundary position x t [3,7]. When the Hb lifetime τ is much longer than the time-scale k off (5) −1 of the promoter state fluctuations, then the noise in Hb copy number becomes Poissonian and the variance is given by the mean: σ 2 (x t ) = H(x t ) = n(x t ) β/µ = 0.5βτ . On the other hand, the steepness of the boundary, | H(x t ) ′ |, increases linearly with τ when D = 0 and the synthesis rate β is kept constant. Eq. (3) thus predicts that the boundary width ∆x decreases as τ −1/2 for large τ ≫ k off (5) −1 , which is indeed observed in the simulations (see Fig. 2a). Hence, the hb expression boundary could be made arbitrarily precise if the lifetime of the Hb protein could be increased indefinitely.
In practice the averaging time cannot be made arbitrarily long [3]. Ultimately, it is limited by the nuclear division time τ nd , which at cell cycle 13 is roughly 30 minutes. Fig. 2a shows that for τ ≃ τ nd and vanishing Hb diffusion constant D = 0, the boundary width would be ∆x ≃ 4 nuclei, which is larger than the precision measured experimentally [3]. The limited averaging time thus puts strong constraints on the precision that can be achieved via the mechanism of time-averaging alone. Fig. 2a reveals, however, that the precision of the hb expression domain can be enhanced significantly by increasing the Hb diffusion constant to a finite value. This may seem surprising, since the steepness of the boundary, | H(x t ) ′ |, decreases with increasing Hb diffusion constant (Fig. 1) and this-as Eq. (3) shows-tends to increase the boundary width. However, increasing the Hb diffusion constant also reduces the noise in the Hb copy number. This is the mechanism of spatial averaging, which we now study analytically. To elucidate the mechanism of spatial averaging, we first analyze how the steepness of the Hb boundary, | H(x t ) ′ |, depends on the Hb diffusion length λ, and then how the noise in the Hb copy number at the boundary, σ(x t ), depends on λ (see Eq. (3)). To calculate the Hb profile, we note that a nucleus at a position x i produces Hb with an average rate β n(x i ) (Fig. 3a). In one dimension, the steady state number of Hb molecules in a nucleus of size ℓ at x is approximately The total Hb profile is the sum of H(x) ps over all point sources x i = 0, ℓ, 2ℓ, . . . . To do the sum, we approximate the activity n(x i ) from Eqs. (1) and (2) by a piecewise linear function which behaves as n(x i ) = 0.5 + m(x t − x i ) around the boundary for x t − 1/2m < x i < x t + 1/2m and which is unity below and zero above the boundary.
It increases linearly with the number of Hb molecules β/µ and decreases as the Hb diffusion length increases. Due to the rotational symmetry, the slope in the full, twodimensional system is also given by Eq. (5). Fig. 3b plots Eq. (5) as function of λ and compares the approximation with two-dimensional simulation results. It is seen that the agreement is very good.  (8)) in an isolated volume has been calculated numerically. These results are compared to the Hb noise strength for the full, two-dimensional system (see Fig. 1) in a nucleus at the boundary with n = 0.5 (triangles).
Next, we study the effect of Hb diffusion on the variance in the Hb copy number, σ 2 (see Eq. (3)). For clarity, we first do the calculations for one-dimensional diffusion and extend our approximation to general dimensions afterwards. To compute σ 2 , we exploit the observation made above that each nucleus acts as a point source of Hb (Fig. 3a), and that the expression of Hb in each nucleus is an independent stochastic process. We denote the variance of the Hb copy number in a nucleus of size ℓ at a distance x from the point source as σ 2 ps (x). In a uniform space, the total Hb variance in a nucleus is the sum σ 2 = i σ 2 ps (x i ) over point sources at all x i . To calculate σ 2 ps (x), we assume that the Hb dynamics is fast on the time-scale of promoter switching, such that the Hb concentration switches between zero and H(x) ps / n , with H(x) ps given by Eq. (4). In this limit, where the second term is due to promoter switching with variance 1 − n n , and the first term describes the Poisson noise coming from the production, diffusion and decay of Hb when the promoter is active. Fig. 4a shows σ 2 ps (x) for different values of the diffusion length λ. For small λ, σ 2 ps (x) is large for small x, but reduces quickly with increasing x. Hence, the variance at a given nucleus is determined by few nuclei in the immediate neighborhood but each individual contribution is large. For increasing λ the number of nuclei contributing to the variance increases, but the individual contributions of nearby nuclei are smaller; this is because diffusion washes out the bursts of Hb production at the source. To see which of the two opposing effects dominates, we sum Eq. (6) over x i to obtain the total variance: Compared to the variance in an isolated nucleus, the overall effect of diffusion in a one-dimensional, uniform space is a reduction of the non-Poissonian noise by (4λ/ℓ) −1 . The Poissonian part cannot be reduced because diffusion itself is Poissonian. Indeed, spatial averaging can reduce the effect of noise in gene expression, but only if this is super-Poissonian. Simulations also allow us to compute σ 2 ps (x) for slower Hb dynamics. In this case, the non-Poissonian part of the variance is smaller. However, the qualitative dependence of x remains the same: the non-Poissonian part is washed out by diffusion over a distance proportional to λ (inset Fig. 4a).
The finite range of non-Poissonian fluctuations implies the following simple model.
This simple expression elucidates that it is the super-Poissonian part of the noise, σ 2 0 − H , which is reduced by spatial averaging. Since N scales with the dimensionality d as N ∼ λ d , this reduction becomes more efficient in higher dimensions. Interestingly, this expression also reveals that if hb expression would be sub-Poissonian [8], diffusion would increase the noise. Fig. 4b shows the non-Poissonian part of the variance in a uniform, two-dimensional system as function of the diffusion length; the exact, numerical result is compared to the prediction of Eq. (8) with N chosen to be the number of nuclei within a distance 2λ of a given nucleus along the edges of a square lattice, N (λ) = 8(λ/ℓ) 2 . For sufficiently large λ for which N (λ) > 1, the approximation is excellent. Fig. 4b also compares the results for the uniform system to those of the full, non-uniform system in which a Bcd gradient activates Hb. Clearly, the wellstirred approximation describes the variance at the Hb boundary very well; only for λ > 5ℓ is the variance at the boundary significantly influenced by nuclei far away from the boundary with smaller variance.
Finally, we can combine Eqs. (5) and (8) with Eq. (3) to predict the boundary width ∆x. Fig. 2b shows that the prediction agrees very well with the simulations. This figure also shows that the two antagonistic effects of Hb diffusion-reducing the slope but also the variance of the Hb concentration-lead to a diffusion constant that optimizes the boundary width for a fixed Hb lifetime. Interestingly, this minimal width is less than one nuclear spacing, as found experimentally [3]. This suggests that for achieving the necessary precision, mechanisms based on multiple gradients or interactions between multiple Bcd targets are not required, although these may provide robustness against embryo-to-embryo variations [9,10,11]. Since the exact mechanism for Hb transport is unknown, it is tempting to speculate that the precision achieved via spatial averaging could be increased further by separating the two effects of Hb diffusion using an anisotropic transport of Hb: slow transport along the anterior-posterior axis would allow for steep spatial profiles, while fast diffusion along the perpendicular direction would allow for effective spatial averaging. The benefit of anisotropic transport could imply the existence of active mechanisms for Hb transport.
Spatial averaging can only be beneficial when the noise in Hb production has a super-Poissonian component. The observation of spatial correlations in Hb by Gregor et al. indicates that this is the case [3]. The question arises whether bursts in gene expression are inevitable and spatial averaging a prerequisite for achieving a precise hb expression domain. The Hb lifetime is limited by the nuclear division time, while the time-scale for promoter state fluctuations is limited by the diffusion of Bcd. The magnitude of the bursts could be reduced by reducing the promoter strength, but this would lower the Hb copy number and hence the steepness of the Hb boundary; the net result would, in fact, be a decrease of its precision. In a model with explicit hb mRNA, the effect of promoter state fluctuations could be alleviated by reducing the transcription rate. However, to achieve a sufficiently steep and precise Hb boundary with a low number of hb mRNA, the translation rate had to be increased such that Hb production becomes super-Poissonian again. It thus appears that spatial averaging is a fundamental mechanism for generating precise patterns of gene expression. Clearly, it will be of interest to study how other modes of transport, e.g. sub-diffusion or active transport, affect the mechanism of spatial averaging.
We thank Filipe Tostevin for a critical reading of the manuscript. This work is supported by FOM/NWO (TE, PRtW) and The Royal Society (MH).