1 Resistors, Capacitors, and Transistors: Today's Lecture: Scaling Laws For Transistors
1 Resistors, Capacitors, and Transistors: Today's Lecture: Scaling Laws For Transistors
Announcements:
Midterm on Feb. 28: Papers assigned for midterm:
• Architectural and Organizational Tradeoffs in the Design of the MultiTitan CPU. Norman P. Jouppi.
• A 0.18-µm CMOS IA-32 Processor With a 4-GHz Integer Execution Unit. Glenn Hinton, Michael Upton,
et al.
Note that the IA-32 paper was previously assigned on the reading list but was never covered in lecture. I plan to
post practice questions within a week.
To keep the units straight, voltage is measured in “volts”, where one volt is one joule per coulomb. A joule is a unit of
energy (one kilogram∗ meter2 /second2 ), and a coulomb is a unit of electrical charge (−6.24 ∗ 1018 electrons). Current
is measured in “amperes ’ where one ampere is one coulomb per second. Resistance is measured in ohms, where one
ohm is one volt per ampere.
Now consider a rectangular bar of some resistive material (you can sketch it in in figure 1). Let ℓ be the length of
the bar, and h and w be the height and width respectively. We expect the resistance to be proportional to the length, ℓ,
and inversely proportional to the cross-sectional area, wh. The constant of proportionality is called the restivity of the
material and is typically written with the Greek letter ρ. We have
ℓ
R = wh ρ (2)
1
Figure 1: A resistor as a rectangular bar of metal
2
Figure 3: A capacitor as a two parallel conductors
It’s handy to know formulas for resistors in series and resistors in parallel. They are:
Rseries = R1 + R2
(4)
1/Rparallel = (1/R1 ) + (1/R2 )
1.2 Capacitors
A capacitor stores charge. A capacitor is typically formed by having two conductors separated by an insulator (you
can draw the sketch in figure 3). To continue our water examples, a capacitor is like a water tank. You can pump water
into the tank (from the bottom), but the more water that you pump in, the more pressure you need to exert to add more
water. If we assume a linear relationship, we get
Q = CV (5)
where Q is the charge stored in the capacitor in coulombs, and C is the “capacitance” of the capaictor. Capacitance is
measured in “farads” where one farad is one coulomb per volt.
The capacitance of a water tank is proportional to its cross-sectional area. Likewise, if a capacitor consists of
two parallel plates, each a rectangle that is w by ℓ, then the capacitance is proportional to wℓ. To figure out the
constant of proportionality, we need to look a little more closely at our water tank analogy. In a water tank, pressure
builds as the tank is filled because of the weight of the water. Electrons don’t weigh very much. Instead, the pressure
develops because electrons are repelled from one another. Charge builds up on the plates of a capacitor because the
electrons on one plate are attracted to the positive charges on the other plate, even though they can’t get there through
the insulator. We can think of the insulator as a flexible barrier. Although the electrons can’t get through, they can
deform it and make room for more electrons. The thicker the insulator is, the harder it will be to deform. Thus, the
capacitance is inversely proportional to d, the distance between the two plates, and we write ǫ to indicate the constant
of proportionality. We get:
C = wl d ǫ (6)
where ǫ is the “dielectric constant” in units of farads per meter. It is common to measure the “dielectric constant” as a
multiple of that for a vacuum:
ǫ = ǫ0 ǫR (7)
where ǫ0 is the dielectric constant for a vacuum (ǫ0 = 8.854 ∗ 10−12 F/m), and ǫR is the relative dielectric constant.
For glass (the common insulator between layers in a chip), the relative dielectric constant is about 4. For silicon nitride
(the insulator between the gate and the channel, the relative dielectric constant is about 7.5. About the lowest dielectric
constant for a solid is for teflon which is around 2.
3
Figure 4: A transistor
We can take our formulas above, and derive more useful formulas for understanding capacitors. First, we can
calculate how much energy it takes to charge a capacitor from 0 to V volts. Note that a volt times a coulomb is a
joule. For each coulomb that we shove into the capacitor, we can calculate what pressure is applied. This gives us the
formula: Z u
E = Cu du
0 (8)
1 2
= 2 CV
Note that C du is the incremental charge, and u is the pressure that must be exerted to move that charge onto the
capacitor.
Next, we can differentiate equation 5 with respect to time and get
d d
dt Q = C dt
(9)
d
I = C dt
where we’ve used the relationship that current is the rate of change of charge. In all of these formulas, we’ve as-
sumed that C is constant. When capacitors are formed by semiconductors (i.e. around the transistors), the actual
capacitance depends on the voltages of the various semiconductors. You can think of this as a water tank with varying
cross-sectional area (curvy walls). For the analysis presented in this class, we’ll make the simplifying approximation
that capacitances don’t vary. This obliterates many details of real circuit behaviour, but will be adequate for us to
understand the basic scaling and asymptotic properties.
It’s handy to know formula for capacitors in parallel (we won’t be worrying about capacitors in series in this class).
The formula is:
Cparallel = C1 + C2 (10)
1.3 Transistors
As you can draw in figure 4, a transistor can be modeled as a switch that has some resistance when it is on. Likewise,
it has capacitances from its gate, source, and drain, to ground. We write ℓ for the “length” of the transistor, this is the
4
Figure 5: A simple RC circuit
distance from the source to the drain. We write w for the “width” of the transistor, this is the parallel extend of the
source and drain along the channel. Typically, transistors are much wider than they are long, but these are this is the
standard terminology.
When a n-channel transistor is conducting, the gate has attracted a thin layer of electrons to the top of the channel.
We can think of this as a sheet of resistive material, and we’ll write ρn to denote the sheet resisitivity of this material.
From this, we have that the on-resistance of a n-channel device is
ℓ
Ron,n (w, ℓ) = w ρn (11)
Likewise, we can write ρp fo the sheet resistivity of a p-channel transistor when it is conducting to get
ℓ
Ron,p (w, ℓ) = w ρp (12)
The capacitance of the gate and the channel form a parallel plate capacitor, and we conclude:
wℓ
Cgate (w, ℓ) = dSi3 N4 ǫSi3N4 (13)
Finally, the source and drain capacitances are proportional to the width of the transistor. The source capacitor is formed
by the source-depletionLayer-substrate sandwich and likewise for the drain.
2 Delays
2.1 RC circuits
The critical thing to understand is that the product of a resistance and a capacitance is a time:
Consider the circuit you can draw in figure 5. Let the switch set the input to 0 volts for all time up to time 0. Then,
the voltage on the capacitor will be 0 volts at time 0. At time 0 flip the switch to set the input voltage to U volts. We
5
Figure 6: A two-inverter chain
have:
IR (t) = (Vin (t) − Vout (t))/R, current through the resistor
d
IC (t) = C dt Vout , current through the capactor
IR (t) = IC (t), Kirchoff’s current law
d Vin (t)−Vout (t)
dt Vout (t) = , a little algebra
RC
(15)
Vout (0) = 0, assumed
Vin (t) = U, assumed, for t > 0
Vout (t) = (1 − e−t/RC )U
Thus, RC is the time for the signal to transition to (1 − e−1 ) ≈ 0.63 of its final value. Since we’re not worried about
little constants here and there in this presentation, we’ll consider this to be the transition time of the circuit.
rn = ℓ0 ∗ ρn,
rp = ℓ0 ∗ ρp, (16)
cg = dSiℓ0N ǫSi3 N4
3 4
Typically, rp ≈ 2rn . Let α = rp /rn . We’ll make the additional assumption that gates are designed so that the
resistance of the pull-up and pull-down networks are equal. For example, for an inverter, this means that the width of
the p-channel transistor will be α times that of the n-channel one. This isn’t necessarily optimal, but it’s close enough,
and it simplifies the analysis.
6
First, we’ll consider an inverter that drives one other inverter that is the same size as itself (draw it in figure 6). For
the delay, we get:
7
Wire Resistance: λ−1
Recall that the resistance of a rectangular bar is (ℓ/(wh))ρ. Scaling w, ℓ, and d all by λ scales the resistance by
λ/(λ2 ) = λ−1 .
Note that the transition from aluminum wiring to copper provided a one-time, reduction of wire resistance by
about 30much opportunity to go further in this direction (the resistivity of silver is only slighly lower than that
of copper).
Wire Capacitance: λ
Same reasoning as for transistor capacitance.
Short Wire Delay: 1
Just multiply wire resistance by wire capacitance. Note that this is for wires whose length scale with everything
else. This is what is meant by “Short Wire”.
Long Wire Delay (unbuffered): λ−2
Real designs have some fraction of their wires that cross the entire chip. Thus, w and h scale with λ, but ℓ
remains fixed. We now get a capacitance of ((w + h)r/d)ǫ where r is the distance across the chip, w is the wire
width, h is wire height, and d is wire spacing. We have that w, h, and d scale as λ, and r and ǫ are constant.
Thus, long wire capacitance scales as 1.
The resistance of a long wire is r/(wh))ρ which scales as λ−2 thus long wire delay scales as λ−2 .
Buffer spacing: λ1.5
As noted earlier, the clock period for logic scales as λ. Wire delay can be reduced by inserting buffers. Wire
delay is minimized by choosing the separation between wires such that the wire segment delay equals the buffer
delay. We’ve shown that buffer delay (a special case of gate delay) scales as λ. Thus, we need to choose our
wire length, x such that wire delay scales as λ as well. The delay for a wire segment of length ℓ is:
ℓ (w+h)ℓ
δ(ℓ) = wh ρ ∗ d ǫ
Noting that w, h, and d scale as λ, and ρ and ǫ are constant, we get that δ(ℓ) scales as ℓ2 /λ2 . Thus, if the wire
delay is to scale as λ, then ℓ2 must scale as λ3 . This means that ℓ must scale as λ1.5 .
Thus, if we reduce the transistor length √
by a factor of two, the number of (smaller) gates that a wire can cross
per FO4 delay goes down by a factor of 2. If the clock period remains a fixed number of F04 delays, then the
number of optimally buffered segments
√ that can be traversed in a clock period remains constant under scaling.
However, these segments traverse λ fewer gates. Thus, the number of gates within one clock period of another
goes down by a factor of λ. In other words, the region of synchronous design shrinks.
Power consumption (first analysis): 1 Power is, to a rough approximation, α2 nCVdd 2
f where n is the number of logic
gates; C is the capacitance per logic gate; Vdd is the power supply voltage; f is the clock frequency; and α is
the fraction of clock cycles that each node changes. If we just scale the design, α remains constant; n goes as
λ−2 ; C goes as λ; Vdd goes as λ, and f goes as λ−1 . We multiply it all together and see that power consumption
remains constant. This seems like wondeful news.
8
3.3 Voltage scaling