PBoC2 - Problems - Hints02 08 13 2
PBoC2 - Problems - Hints02 08 13 2
Preface v
6 Entropy Rules! 13
iii
iv CONTENTS
This document contains a series of hints to guide the reader with some of the
more difficult problems in the book. The goal of the hints is to give the reader
a better sense of what we are asking for in the problem since often the problem
statements can be complemented by extra information. Second, the goal of the
hints is to sketch a logical attack on the problem of interest.
If our readers find a specific problem obscure or deserving of a hint, please
let us know and we will consider adding a hint for the problem of interest.
v
Chapter 2
Biological Structures:
Rulers at Many Different
Scales
• Lambda repressor/DNA complex: 1LMB.pdb from the PDB has the DNA-
binding region of lambda repressor complexed with DNA.
7
8 CHAPTER 2. RULERS AT MANY DIFFERENT SCALES
If we recall that
N
X N!
(p + q)N = pn1 q N −n1 , (2.2)
n1 =0
n 1 !(N − n 1 )!
9
Chapter 5
11
12 CHAPTER 5. MECHANICAL AND CHEMICAL EQUILIBRIUM
which is a net ATP gain of one per pyruvate molecule. We also gained one
NADH molecule per pyruvate in the conversion from glyceraldehyde-3-phosphate
into 1,3-bisphosphoglycerate.
Finally, we turn to the synthesis on valine itself. We see that we start off
with two pyruvate molecules and that in the process of getting to valine we
had to pay one NADPH molecule. We also had to transform glutamate into
2-ketoglutarate. The cell will have to pay an extra energy cost to turn this
2-ketoglutarate back into glutamate.
Let’s go to “glutamate biosynthesis III”. Notice that the precursor for glu-
tamate is in fact 2-ketoglutarate. In order to obtain it one NADPH molecule
and an ammonia molecule have to be paid. Inorganic ions such as phosphate
and ammonium are present in bacterial growth medium, which always contains
a mixture of salts as well as a carbon source such as glucose. As noted in ta-
ble 5.1Constructing the Cell: Managing the Mass and Energy Budget of the
Celltable.5.1, however, we ignore the cost of phosphate and ammonium.
The net cost of making valine out of two pyruvate molecules is two NADPH
molecules. If we add the cost of making two pyruvate molecules it results in
• One glucose molecule consumed.
• Two ATP molecules gained.
Entropy Rules!
13
14 CHAPTER 6. ENTROPY RULES!
From eqn. 6.8 we see that we need L in terms of Ltot , Rtot , and Kd . We do this
by using eqns. 6.4 and 6.5 to rewrite eqn. 6.6
LR = Ltot − L
R = Rtot − LR = Rtot − Ltot + L
L · (Rtot − Ltot + L)
Kd = , (6.9)
Ltot − L
which we can solve for L which can then be substituted backed into the expres-
sion for pbound .
with a similar expression for the binding of CO. It is worth exploring how ex-
pressions like this arise by using simple lattice models like those explored in the
chapter.
15
Chapter 8
∂G b2 1 ∂ ∂G 1 ∂2G ∂2G
= [ (r )+ 2 + ], (8.1)
∂N 6 r ∂r ∂r r ∂θ2 ∂z 2
where G is a function of the arguments r, θ, z and N . The strategy the reader
should adopt is separation of variables by adopting a trial solution of the form
1 dn b2 1 1 ∂ ∂R 1 1 ∂2Θ 1 ∂2Z
= [ (r )+ 2 2
+ ] (8.3)
N dN 6 R r ∂r ∂r r Θ ∂θ Z ∂z 2
The concept with separation of variables is that the two sides are functions of
different variables but are equal to each other. The only way to guarantee that
is true is that both sides are equal to the same constant. For example, for the
case shown above, we have both equal to a separation constant −k 2 , resulting
in
dn
= −k 2 N, (8.4)
dN
which implies in turn
2
n(N ) = e−k N
. (8.5)
17
18 CHAPTER 8. THE STRUCTURE OF THE MACROMOLECULES
We can once again banish different terms to different sides of the equation
resulting in
b2 1 1 ∂ ∂R 1 1 ∂2Θ b2 1 ∂ 2 Z
− [ (r )+ 2 2
− k2 = . (8.6)
6 R r ∂r ∂r r Θ ∂θ 6 Z ∂z 2
Once again, given that we have a function of r and θ on the left side and a
function of z on the right side, both sides must be equal to a constant which
this time we christen −m2 . This process is repeated until we have solved four
separate differential equations for the four unknown functions. Next, the reader
needs to impose boundary conditions which amounts to making sure that the
solution vanishes on the surface of the cylinder. This leads us to the ability to
write the solution as
∞ X
∞ X
∞ α2 2
X pj b 2 2 2
+ q 6L
π b αpj
G(r, θ, z, N ) = e−( 6a2 2 )N
Jp ( r)
p=0 j=1 q=1
a
qπz
×sin (Apjq cos pθ + Bpjq sin pθ). (8.7)
L
To determine the constants Apjq and Bpjq , we now need to appeal to our “initial
condition” which tells us the origin of the polymer chain (i.e. the tether point).
In particular, we have
∞ X ∞
∞ X
X αpj qπz
G(r, θ, z, 0) = Jp ( r)sin (Apjq cos pθ + Bpjq sin pθ). (8.8)
p=0 j=1 q=1
a L
The integrals on the right are all straightforward since each of them reflects a
convenient orthogonality relation of the form
Z a
αp0 j 0 αpj a2
Jp0 ( r)Jp ( r)rdr = δpp0 δjj 0 Jp+1 (αpj )2 , (8.10)
0 a a 2
and
L
q 0 πz
Z
qπz L
sin sin dz = δqq0 (8.11)
0 L L 2
and Z 2π
cos pθ cos p0 θdθ = δpp0 π. (8.12)
0
HINTS TO THE PROBLEMS 19
G(r, θ, z, N )
p(r, θ, z, N ) = R a R L R 2π . (8.13)
0 0 0
G(r, θ, z, N ) rdrdθdz
Once the reader does this algebra, the remaining step for making contact with
the data is to figure out how to collapse the probability distribution so it is only
a function of z or the perpendicular distance from the axis of the cell.
The sum goes over all possible choices, which for a three-dimensional walk are:
li ∈ {±x̂, ±ŷ, ±ẑ} for each of the N steps, i = 1, 2, . . . , N . The function δl
returns 1 if l is zero, otherwise the value of the function is 0.
The next step is to use the mathematical identity
Z 2π Z 2π Z 2π
1 1 1
δ(l) = eilx kx dkx · eily ky dky · eilz kz dkz , (8.15)
2π 0 2π 0 2π 0
where the vector l = (lx , ly , lz ) has integer coordinates. This will lead to an
expression for the number of random walks that return to the origin written as
an integral from which the scaling with N can be derived.
21
Chapter 10
Putting this into the equation for the beam energy leads to an expression for
the beam energy in terms of the amplitudes θ̃2n−1 . The average values of the
amplitudes can in turn be obtained from equipartition. The idea here is that the
energy of the beam acquires the form of a sum over independent modes (each
one having an energy quadratic in the amplitude), and in thermal equilibrium
each mode has an average energy of kB T .
23
Chapter 11
Biological Membranes:
From the Golgi Apparatus
to Membrane Fusion
1 A2
Z
hAact − Ai = q 2 h|h̃(q|2 id2 q (11.4)
2 (2π)2
By substituting the result for hh(q)2 i, we can then find a relation between the
tension and the area change.
25
26 CHAPTER 11. BIOLOGICAL MEMBRANES
Effectively, the differential equation of interest will be the part we have labeled
“stuff” above.
The differential equation can be shown to be related to a Bessel differential
equation. The most intuitive idea is to recognize that problems with cylindrical
symmetry like that of our ion channel often lead to equations that can be beaten
into the form of Bessel’s differential equation which is of the form
d2 u du
r2 +r + k 2 r2 u = 0, (11.7)
dr2 dr
or to the modified Bessel equation which can be written as
d2 u du
r2 +r − k 2 r2 u = 0. (11.8)
dr2 dr
Note that these two equations are not the most general form of the Bessel or
modified Bessel equation, but suffice for our purposes. The details of how to
solve such an equation itself are beyond the scope of this book, but we will
show the relation between the differential equation we need to solve given in
eqn. 11.172 and the Bessel equation. In particular, we can write the modified
Bessel equation as
∇2 u = k 2 u. (11.9)
If we now apply ∇2 to both sides of this equation, we are left with
∇4 u = k 2 ∇2 u, (11.10)
∇4 u = k 4 u, (11.11)
HINTS TO THE PROBLEMS 27
One way to obtain the energy is to use the divergence theorem which permits
us to convert the energy over the entire membrane into an integral just around
the boundary of the protein itself. Effectively, the divergence theorem in this
form amounts to a sophisticated version of integration by parts. The theorem
can be stated as Z Z
F · n ds = ∇ · F dA. (11.23)
∂Ω Ω
Our aim is to figure out how the free energy of deformation can be written such
that it is clear that the right side is the divergence of some vector. In particular,
our goal is to rewrite the term (∇2 u)2 which can be built around the observation
that
∇ · (∇2 u∇u) = ∇3 u · ∇u + (∇2 u)2 . (11.24)
In addition, we have
which means that if we subtract these two equations, we can isolate (∇2 u)2 as
We can now plug this way of rewriting (∇2 u)2 back into the expression for the
free energy of deformation. The result is
Z Z 2
Kb Kt u
Ghydrophobic = [−∇ · (u∇3 u − ∇2 u∇u) + u∇4 u]d2 r + d2 r.
2 2 a
This very interesting result brings us much closer to having an expression that
will permit us to invoke the divergence theorem. In particular, if we rewrite this
as
Z Z
Kb Kt Kb 4
Ghydrophobic = [−∇ · (u∇3 u − ∇2 u∇u)]d2 r + ( 2 u + ∇ u)ud2 r,
2 2a 2
(11.27)
we note that the term in parentheses in the second integral is nothing more than
the equilibrium differential equation we solved in the first place and is thus zero.
Hence, we can now rewrite the energy as
Z Z
Kb 3 2 2 Kb
Ghydrophobic = [−∇·(u∇ u−∇ u∇u)]d r = − r·(u∇3 u−∇2 u∇u)rdθ.
2 2
(11.28)
Chapter 13
Random Walks: A
Statistical View of
Biological Dynamics
∂ ∂2
c(x, t) = D 2 c(x, t). (13.1)
∂t ∂x
Let’s √
take the Fourier transform of both sides (which just means multiplying by
eikx / 2π and integrating) and take stock of the situation:
Z +∞ Z +∞
1 ∂ D ∂2
√ eikx c(x, t) dx = √ eikx 2 c(x, t) dx (13.2)
2π −∞ ∂t 2π −∞ ∂x
Z +∞ Z +∞
∂2
∂ 1 ikx D
√ e c(x, t) dx = √ eikx 2 c(x, t) dx (13.3)
∂t 2π −∞ 2π −∞ ∂x
Z +∞
∂ D ∂2
c̃(k, t) = √ eikx 2 c(x, t) dx. (13.4)
∂t 2π −∞ ∂x
All that we’ve done so far is to exchange the order of integration and differenti-
ation on the LHS and then used the definition of c̃. Now comes the important
part. The key is to integrate by parts on the RHS to get the partial deriva-
tives w.r.t x off of c(x, t) and move them onto the exponential instead. How is
this done? Recall that for two functions F and G that both depend on x the
following identity holds (the product rule):
∂ ∂F ∂G
(F G) = G+F . (13.5)
∂x ∂x ∂x
29
30 CHAPTER 13. A STATISTICAL VIEW OF BIOLOGICAL DYNAMICS
Let’s suppose that either or both of F and G vanish at ±∞, then we have:
Z +∞ Z +∞
∂G ∂F
F dx = − G dx. (13.8)
−∞ ∂x −∞ ∂x
This is just what we wanted; we’ve succeeded in moving the derivative off of G
on the LHS and onto F on the RHS, at the cost of a minus sign. Let’s go back
to Eq. 13.4 and integrate by parts:
Z +∞
∂ D ∂2
c̃(k, t) = √ eikx 2 c(x, t) dx (13.9)
∂t 2π −∞ ∂x
+∞
D ∂c
= √ eikx
2π ∂x −∞
Z +∞
D ∂ ikx ∂
−√ e c(x, t) dx (13.10)
2π −∞ ∂x ∂x
Z +∞
D ∂
= −√ (ik)eikx c(x, t) dx (13.11)
2π −∞ ∂x
+∞ Z +∞
Dik D
= − √ eikx c(x, t) +√ (ik) (13.12)
2π −∞ 2π −∞
∂ ikx
× e c(x, t) dx
∂x
Z +∞
D
= + √ (ik)2 eikx c(x, t) dx (13.13)
2π −∞
= −k 2 Dc̃(k, t). (13.14)
We integrated by parts twice and had to assume that the flux ∂c/∂x and con-
centration c(x, t) both vanish at infinity which are both very reasonable assump-
tions. At first glance it may seem like we haven’t gained that much. But hold
on! Instead of having derivatives w.r.t. both x and t, we now have derivatives
w.r.t. time t only! We have transformed our PDE into an ODE; c̃ is a function
of k but for our purposes it’s just a parameter; let’s rewrite our equation to
make this all explicit:
d
c̃(t; k) = −k 2 D c̃(t; k). (13.15)
dt
Of course this is an equation we know very well how to solve, and we can just
write down the answer: 2
c̃(k, t) = c̃(k, 0)e−k Dt . (13.16)
HINTS TO THE PROBLEMS 31
d2 ρ
1 dT 11 d dρ 1 1 dρ
= r = +r 2 . (13.19)
D T dt ρ r dr dr ρ r dr dr
Since both sides of the equation depend on different variables, they must be
equal to a constant, which we set equal to K 2 . This means that the solution
for the temporal part is
2
T (t) = e−DK t
(13.20)
and the resulting equation for the spatial part
d2 ρ dρ
r2 2
+r + K 2 r2 ρ = 0. (13.21)
dr dr
This last equation can be written in a Bessel form by making the substitution
z = kr. Once the solutions are in hand, the boundary conditions need to
be imposed. In particular, we have the following expression for the boundary
condition
We can determine the coefficients ai using the initial condition c(r, 0).
Chapter 15
• The filament with length n might grow into a filament with length n + 1,
contributing with a term −v+ p+ (n, t).
• Finally, a filament that was shrinking can undergo a “rescue” and turn
into a growing one, giving the term +f−+ p− (n, t).
∂p+ (n, t)
= v+ (p+ (n − 1, t) − p+ (n, t)) − f+− p+ (n, t) + f−+ p− (n, t) (15.1)
∂t
33
34 CHAPTER 15. CYTOSKELETON UNDER CONSTRUCTION
dp+ (n, t)
v+ = f−+ p− − f+− p+ (15.2)
dn
dp− (n, t)
v− = f−+ p− − f+− p+ .
dn
This is a linear system of first order differential equations. We can write the
RHS of both equations in matrix form using
−f+− /v+ f−+ /v+
(15.3)
−f+− /v− f−+ /v−
We want to look for its eigenvalues and eigenvectors which will permit us to
write down the solutions themselves.
Dynamics of Molecular
Motors
35
Chapter 17
37
38 CHAPTER 17. BIOLOGICAL ELECTRICITY
u(1) = pC (V ) (17.6)
u(2) = pO (V )
u(3) = pI (V )
u(4) = V (x, t)
where Jn (x) is the n-th Bessel function of the first kind. Another useful identify
to keep in mind is the one relating the integral of Bessel functions such as
Z
J0 (x)x dx = xJ1 (x). (18.5)
(d) The first zero of the intensity, related to the resolution limit, will be given by
the first zero of the first Bessel function of the first kind, J1 (x). The zeros of this
39
40 CHAPTER 18. LIGHT AND LIFE
Organization of Biological
Networks
41
Chapter 21
With these definitions, the calculations of the chemical potentials comes down
to figuring out the entropy for the number of ways of arranging the base pairs
and nucleosomes.
(b) With the chemical potentials in hand we turn to computing the probability
p(i) that the ith site of a genomic region of length N is occupied by a nucleosome,
given that the origin is nucleosome free. This probability, p(i) = 1 − p̄(i), where
p̄(i) is the probability that the ith site is nucleosome free (i.e., it is a ”base pair”
in the sense of the urn model). If we introduce
nmax
nuc
X [(i − nnuc d) + nnuc ]!
S(i) = eβ(i−nnuc d)µbp eβnnuc µnuc , (21.2)
n =0
(i − n nuc d)!n nuc !
nuc
as in the text, as the thermodynamic weight that takes into account all the
possible ways of covering i sites on the genome with base pairs and nucleosomes,
then
S(i − 1)S(N − i)
p̄(i) = . (21.3)
S(N )
The numerator in the above expression is the weight of all the states in which
the ith site is not covered by a nucleosome while the denominator is the weight
43
44 CHAPTER 21. SEQUENCES, SPECIFICITY, AND EVOLUTION
of all the states regardless of whether the ith site is nucleosome free or not.
The numerator is obtained by considering the ith site nucleosome free and then
noting that this leaves the first i − 1 sites before it and the N − i sites after it
free for occupying with nucleosomes or base pairs. Since occupying these two
stretches of genomic DNA with free base pairs and nucleosomes are independent
of each other, the total weight is the product of weights, S(i-1) S(N-i).
We can use Mathematica to make plots of p(i) for different values of the
average linker length hLi.