PowerDistributionNetworkDesignForVLSI PDF
PowerDistributionNetworkDesignForVLSI PDF
POWER DISTRIBUTION
NETWORK DESIGN
FOR VLSI
ffirs.qxd 3/24/2004 11:23 AM Page iii
POWER DISTRIBUTION
NETWORK DESIGN
FOR VLSI
QING K. ZHU
Intel Corporation
Matrix Semiconductor Inc., U.S.A.
Copyright © 2004 by John Wiley & Sons, Inc. All rights reserved.
Published by John Wiley & Sons, Inc., Hoboken, New Jersey.
Published simultaneously in Canada.
No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or
by any means, electronic, mechanical, photocopying, recording, scanning or otherwise, except as
permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior
written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to
the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, (978) 750-8400, fax
(978) 646-8600, or on the web at www.copyright.com. Requests to the Publisher for permission should
be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ
07030, (201) 748-6011, fax (201) 748-6008.
Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in
preparing this book, they make no representation or warranties with respect to the accuracy or
completeness of the contents of this book and specifically disclaim any implied warranties of
merchantability or fitness for a particular purpose. No warranty may be created or extended by sales
representatives or written sales materials. The advice and strategies contained herein may not be
suitable for your situation. You should consult with a professional where appropriate. Neither the
publisher nor author shall be liable for any loss of profit or any other commercial damages, including
but not limited to special, incidental, consequential, or other damages.
For general information on our other products and services please contact our Customer Care
Department within the U.S. at 877-762-2974, outside the U.S. at 317-572-3993 or fax 317-572-4002.
Wiley also publishes its books in a variety of electronic formats. Some content that appears in print,
however, may not be available in electronic format.
ISBN 0-471-65720-4
10 9 8 7 6 5 4 3 2 1
ftoc.qxd 3/24/2004 11:26 AM Page v
CONTENTS
Preface vii
1 Introduction 1
1.1 Power Supply Noise 2
1.2 Power Network Modeling 4
1.3 Modelling of Switching Currents 12
1.4 On-Chip Decoupling Capacitance 16
1.5 On-Chip Inductance 20
1.6 Process Scaling Impacts 28
1.7 Summary 32
2 Design Perspectives 33
2.1 Planning for Communication Chips 34
2.2 Planning for Microprocessor Chips 44
2.3 IBM CAD Methodology 55
2.4 Design for IR Drop 62
2.5 Package-Level Methodology 67
2.6 Summary 73
3 Electromigration 75
3.1 Basic Definitions and EM Rules 75
3.2 EM Analysis Tool 80
3.3 Full-Chip EM Methodology 83
3.4 Summary 85
v
ftoc.qxd 3/24/2004 11:26 AM Page vi
vi CONTENTS
4 IR Voltage Drop 87
4.1 Causes of IR Drop 87
4.2 Overview of IR Analysis 89
4.3 Static Analysis Approach 96
4.4 Dynamic Analysis Approach 99
4.5 Circuit Analysis with IR Drop Impacts 103
4.6 Summary 103
Glossary 191
References 199
Index 205
fpref.qxd 3/24/2004 11:28 AM Page vii
PREFACE
vii
fpref.qxd 3/24/2004 11:28 AM Page viii
viii PREFACE
and CAD tools for the power distribution of the VLSI chip and
package. The user guide of the VoltageStorm™ tool from Cadence
Design Systems, Inc. is referred to throughout [51], together with
the author’s experience using this tool in designs.
The book is organized into seven chapters. Chapter 1 is an in-
troduction to the power supply network, power network modeling,
decoupling capacitors, and process scaling trends. Chapter 2 illus-
trates the design perspectives for the power distribution network,
including power network planning, layout specifications, decou-
pling capacitance insertion, modeling and analysis of power net-
works, and IR drop analysis and reduction. Chapter 3 explores
electromigration phenomena for the on-chip power distribution
network.
Chapter 4 discusses IR drop analysis methodology. It is taken
primarily from the VoltageStorm™ tool, using both static and dy-
namic analysis methods. The static method is performed for some
level worst-case IR drop analysis without the knowledge of input
vectors at the chip’s primary inputs. Chapter 5 describes the com-
mands and user interfaces of the VoltageStorm™ tool from Ca-
dence Design Systems, Inc. [51]. Chapter 6 lists the microproces-
sor design examples, with a focus on on-chip power distribution.
Readers will gain the insights into industry chip design for power
distribution networks from these examples.
Chapter 7 discusses the flip-chip and package design issues,
since the package is a part of the global power distribution. A case
study has been provided in this chapter for selecting the package
options, based on the performance requirements for the power
supply. Power network measurement techniques from silicon are
also discussed at the end of Chapter 7.
A glossary of key words and basic terms is provided at the end
of the book to help understand the basic concepts in VLSI design
and power distribution.
With the continually decreasing supply voltages and the in-
creasing transistor switching currents on-chip, power supply nois-
es on-chip remains the challenging issue for high-performance
chip design. More and more research will be needed in the future
in CAD tools for switching current modeling and accurate power
network analysis. The design methodology for power delivery will
need to consider the performance, layout area, and package tech-
nology optimization for future chips.
The author would like to thank Mr. George J. Telecki at John
fpref.qxd 3/24/2004 11:28 AM Page ix
PREFACE ix
Wiley & Sons, Inc. for providing the chance to get this book pub-
lished. He also thanks his co-workers in Intel Corporation, includ-
ing David Ayers, Alex Waizman, and Bendik Kleveland. Finally,
he appreciates the strong support from family members, includ-
ing wife Huiling Song and two sons Phillip and Michael.
c01.qxd 12/16/2003 11:21 AM Page 1
1
INTRODUCTION
2 INTRODUCTION
3
Supply voltage (V) 2.5
2
1.5
1
0.5
0
0.25 0.18 0.13 0.1
Minimum feature size (µm)
(a)
Gate oxide thickness (A)
60
50
40
30
20
10
0
0.25 0.18 0.13 0.1
Minimum feature size (µm)
(b)
Figure 1-1. Power supply (a) and gate oxide scaling (b) trends.
4 INTRODUCTION
cussed in [8]. The VLSI design basic to the power network de-
sign, such as metal sizing equations, can be found in [9]. Inter-
connect scaling issues in the deep-submicron process can be
found in [10].
6 INTRODUCTION
Switching Switching
Is circuit Is circuit
Rd Rd
Decoupling Rs Decoupling Rs
capacitor capacitor
Cd Cd
Cs Cs
C D
Vss Line
Vcc SW
Lvcc Vccdie
Rdecap Rsw
Vc(t=0)=Vcc Cdecap Csw
Lvss Vc(t=0)=0
Vssdie
Cdecap>>Csw
RCdecap<<RCsw
G=Vcc
G=Vcc
Cgate
S Cgate
D
Rds_on/4
Rds_on/2 Rds_on/2
Rds_on
8 INTRODUCTION
(a)
RC:
Net pattern
matching
RC
library
segment
Break Break
line 1 line 2
Break Break
line 3 line 4
(b)
34]. In addition, the SPF can include the device section that mod-
els the extracted devices from the physical layout.
In general, the capacitance can be formed between any poly-
gons in the layout, although the closer ones have more significant
capacitances, and thus have more impact on the total capacitance
of the net. Figure 1-7 shows the possible capacitances between the
gates and metal lines in the physical layouts.
The capacitance to the substrate is dominant over other cou-
pling capacitances in the old one or two metals technology. But
the situation changes in the latest submicron technology with sev-
en to eight metal layers, since the top-level metals are far away
from the substrate, and the total capacitance of these top-level
metals is more impacted by the coupling capacitances between ad-
jacent lines in the same layer or adjacent layers of the layout.
In addition, the spacing between metal lines is continually
scaled, so the coupling capacitance between neighboring metal
lines becomes more and more important. The calculation of the re-
sistance or capacitance can be done through the direct solution of
the well-known Maxwell’s EM equations or Green’s functions
[17].
A complex geometrical layout can require an extremely long
computational time using the direct EM field solution. Therefore,
10 INTRODUCTION
12 INTRODUCTION
Id = CloadVcc f (1-2)
In Equation (1-2), Cload is the total output load of the driver, in-
cluding the gate load and interconnect load; Vcc is the supply volt-
age; and f is the switching activity of Cload. Although the charge
and discharge dynamic current Id is a predominant component of
c01.qxd 12/16/2003 11:21 AM Page 13
112pS
20.7pS
179pS
(a)
Figure 1-9. Switching noise simulation based on power grid modelling. (a) Sim-
ulation result. (Figure continues on next page)
c01.qxd 12/16/2003 11:21 AM Page 14
14 INTRODUCTION
VCCDRV VCC21
VSSDRV VSS21
(b)
M6
M5
(c)
Figure 1-9 (continued). (b) Simulated circuit. (c) M5 and M6 power grid model-
ling. (Figure continues on next page)
c01.qxd 12/16/2003 11:21 AM Page 15
(d)
Vcc
A
I(t)
B D
Cload
Vss
(a)
I(t)
Tp/2 Tp/2
i(n) i(p)
tr tf
t
(b)
16 INTRODUCTION
voltage
1.43V
1.3V: normal voltage
Thresholds
1.17V
violation
Voltage waveform at the node
time
Figure 1-11. Supply voltage thresholds and noisy nodes definition [83].
c01.qxd 12/16/2003 11:21 AM Page 18
18 INTRODUCTION
Nominal voltage:
Vcc = 1.3V
Voltage thresholds:
Vcc:[1.17V - 1.43V]
Noisy nodes:
Node 25 (min V = 0.47V)
Node 10 (min V = 1.15V)
(a)
Nominal voltage:
Vcc = 1.3V
Voltage thresholds:
Vcc:[1.17V - 1.43V]
Noisy nodes:
None
Decoupling capacitors:
Node 25
Node 10
(b)
Figure 1-12. Adding decoupling capacitors at noisy nodes [83]. (a) Nodes 10 and
25 are noisy. (b) Adding more capacitors on Nodes 10 and 25.
c01.qxd 12/16/2003 11:21 AM Page 19
one example with the simulated voltages of two nodes (Node 25 and
Node 10) in the power network.
The minimum voltages (0.47 V and 1.15 V) of these nodes are
less than the required lower threshold (1.17 V), and thus they are
noisy nodes. The decoupling capacitor is added at each of these
two noisy nodes and the voltages eventually satisfy the required
thresholds, as shown in Figure 1-12(b).
Figure 1-13 shows the high-level decoupling capacitance opti-
mization flow [83]. Procedure I adds the decoupling capacitors at
the noisy nodes. Procedure II removes the unnecessary decou-
pling capacitance overallocated initially.
We have done experiments on a power network model with
about 100 RLC grids and decoupling capacitors. Current sources
have been added at each node in the model for transistor transi-
tions with the current waveforms, as shown in Figure 1-10(b). The
Simulate the power network model with RLC elements and current sources.
Identify the “noisy” nodes by comparing the voltage results with the specified thresholds.
While (there is “noisy” node){
For (each “noisy” node){
Add a step size of the decoupling capacitance.
}
Simulate the power network model with the updated decoupling capacitance.
Identify “noisy” nodes by comparing simulation voltages with the required thresholds.
}
20 INTRODUCTION
450
400 Vcc = 0.0375V
350
300
250
200
Vcc = 0.075V
150
100
Vcc = 0.15V
50
0
0% 10% 20% 30% 40% 50%
22 INTRODUCTION
450
Vcc = 0.0375V
Decoupling Capacitance
400
350
300
250
200 Vcc = 0.075V
Decoupling
150
100 Vcc = 0.15V
50
0
0% 10% 20% 30% 40% 50%
⌬VccIncreasing
Vcc IncreasingRate
Rate
(f)
Figure 1-15. Decoupling capacitor [43]. (a) PN Junction. (b) MOS varactor.
when the chip becomes faster and larger in size. The characteris-
tic impedance is Z0 = 兹L 苶/
苶C苶. Adding decoupling capacitors will in-
crease the capacitance but does not affect the inductance of the
power planes. As a result, Z0 is reduced, and current spikes gener-
ate smaller voltage drops because ⌬V = Z0⌬I
Low impedance of the power network helps the pulse response
and curbs the instantaneous fluctuations. The impedance Z0 can
be further reduced by lowering the inductance L of the power net-
work. This section presents a metal wire design method to reduce
the inductance by carefully selecting the sizes and spaces of pow-
er lines.
Figure 1-16(a) shows five different combinations of the widths
and spaces for two adjacent Vcc and Vss lines [21]. The inductance
and resistance of these five combinations are shown in Figure
1-16(b) and Figure 1-16(c) for 10,000 m long power lines. The in-
ductance is calculated by using a two-dimensional model with the
current loops between adjacent Vss and Vcc lines. The first-order
estimation of the unit-length loop inductance for two adjacent Vcc
and Vss lines is as follows:
s
L = ᎏ (1-5)
w
24 INTRODUCTION
4.24 0.8
Case 1 Medium width pair of
minimum spaced M5 M5
Vcc
2.12
0.8
Case 2 Try half width pair of
minimum spaced M5 M5
Vcc
1.64 0.84
Case 3 Narrow width pair of
minimum spaced M6 M6
Vcc
37 0.84
Case 4 Wide minimum spaced M6 M6
lines pair Vcc
4.24 22
Case 5 Spread out medium width
M5 M5
Vcc
(a)
1.2
L [nH/1000u]
1.1
4.24 22
1
M5
37 0.84
0.9
M6
0.8
0.7
4.24 0.8
2.12 M5
0.6
0.8
M5
0.5
0.4
1.64 0.84
M6 Freq [MHz]
0.3
1 10 100 1,000 10,000
(b)
Figure 1-16. Characterization results of Vdd/Vss metal structures [21]. (a) Vcc
and Vss cases. (b) On-chip inductance characterizations.
c01.qxd 12/16/2003 11:21 AM Page 25
35
1.64 0.84
R[⍀/1000u]
30.8 M6 30.8
30
27.6
26.3 26.6
25
2.12
0.8
20
M5
4.24 0.8
M5 16.5
15 14.2
13.2
4.24 22
10
M5
5.92
37 0.84
5
M6
3.97
1.3
Freq [MHz]
0
1 10 100 1,000 10,000
(c)
35
1.64 0.84
R[⍀ /1000u]
30.8 M6 30.8
30
27.6
26.3 26.6
25
2.12
0.8
20
M5
4.24 0.8
M5 16.5
15 14.2
13.2
4.24 22
10
M5
5.92
37 0.84
5
M6
3.97
1.3
Freq [MHz]
0
1 10 100 1,000 10,000
(d)
26 INTRODUCTION
1,000
τ[pS]
37 0.84
391.97 M6
391.97 391.97 391.97 391.97 391.97 384.17
341.45
264.67
178.14
4.24 22
95.72
100 M5
90.15
(e)
s =
冪莦 ᎏ
f
(1-6)
conductor larger than approximately 2s will not reduce the effec-
tive resistance of the line.
Figure 1-16(c) shows the resistance plots over the frequency for
the five line configurations shown in Figure 1-16(a). The skin ef-
fects are observed at the higher frequencies with the increased re-
sistances for all configurations. Case 4, shown in Figure 1-16(c),
which has the largest width, shows the skin effect at the lowest
frequency due to its large width.
The impedance of a power line is calculated as follows:
|Z(f)| = 兹R
苶2苶+ 苶
苶苶(2 苶fL
苶苶 )2 (1-7)
= L/R (1-8)
28 INTRODUCTION
Scenario A
The line width and space are both reduced by S, assuming the
line thickness change is negligible in process shrinking. The unit-
length resistance is increased by 1/S. The unit-length capacitance
is reduced in S by assuming that the plate capacitance is reduced
by 1/S2 but the coupling capacitance increases by 1/S due to the
smaller line space.
Vcc Vss
Preferred
Vcc Vss Vcc Vss Vcc Vss Vcc Vss Vcc Vss
The die size is reduced by S2, and the length of power lines is
scaled in S. The line resistance for the power network is not
changed, and the line capacitance for the power network or long
signal lines is reduced by S2.
Based on Equation (1-5), the unit-length inductance between
two adjacent Vcc and Vss lines is not changed, because the line space
(s) and line width (w) are both reduced by S. The total line induc-
tance is reduced in S, due to the power line length scaled in S.
Chip clock frequency is assumed to increase by 1/S2, which is a
simplification of the fact that the microprocessor frequency will
roughly double every two years for the next process generation. In
Scenario A, the logic of the chip is changed very little and the
number of toggling transistors per clock cycle is kept unchanged.
The channel length and width of each device are both scaled
down in S. The average gate capacitance is down by S2. So the total
c01.qxd 12/16/2003 11:21 AM Page 30
30 INTRODUCTION
Scenario B
The die size is assumed to be not changed in this scenario, so the
global line length is not changed. The line resistance of the power
network is increased by 1/S. The line capacitance of the power
network, or long signals, is reduced in S, since the unit-length ca-
pacitance is down in S, as derived in Scenario A.
Based on Equation (1-5), the unit-length inductance between
two adjacent Vcc and Vss lines is not changed due to the line space
(s) and the line width (w), both reduced by S. The total line induc-
tance is not changed because the global line length is not changed.
The chip clock frequency is supposed to increase by 1/S2 about
every two years for each process generation. In Scenario B, new
c01.qxd 12/16/2003 11:21 AM Page 31
32 INTRODUCTION
1.7 SUMMARY
2
DESIGN PERSPECTIVES
34 DESIGN PERSPECTIVES
IO pads (0.28mm)
Boundary scan bank (0.10mm)
Fabric ESRAM
10.9mm 2.2mm
Routing channel (0.50mm)
15.40mm
36 DESIGN PERSPECTIVES
The IR drop or voltage drop is estimated for either Vdd or Vss net-
works. Let us assume the Vdd worst-case drop is ⌬Vdd, and the Vss
worst-case drop is ⌬Vss. So the total worst-case IR drop across the
Vdd and Vss networks is (⌬Vdd + ⌬Vss). Let us assume the voltage
(Vdd) at the inputs of the Vdd pads is Vmax, and the Vss voltage at
the inputs of the Vss pads is 0 V. Therefore, the lowest voltage
Vmin in the chip is estimated based on the following equation:
Figure 2-1 shows the floor plan of the communication chip. The
area is about 15.40 × 19.33 mm. This chip is in a wire bonding
package with Vdd and Vss pads on the chip’s four boundaries.
The power lines cross the main regions as follows: Fabric,
ESRAM, standard cells, and routing channels.
(a)
(b)
38 DESIGN PERSPECTIVES
vices to extract the average current. Figure 2-3 shows the current
models used for each unit region in this example. In addition, the
currents will be different in different regions of the chip due to
different circuit density and switching activity.
The modeling of the current sources can be improved continual-
ly during the chip design stages as more circuits are designed and
more accurate current estimations are obtained. In addition, the
power grid current modeling can be further optimized based on
some test chip or earlier version chip’s power measurement. The
initial specifications of the power grid will come up based on the
simulation model, as shown in Figure 2-2. Figure 2-4 shows the
power routing specifications in the fabric tile region of this com-
munication chip [45].
The simulation result for the power grid model in this chip is
shown in Figure 2-5. The simulation is done for the IR drop analy-
sis. The worst-case IR drop, based on Figure 2-5, is about 99 mV
(1.71 – 1.6112 V). The lowest (Vdd – Vss) voltage across the chip is
about 1.512 V (1.6112 – 0.0998 V).
For the communication chip power grid design shown in Figure
2-1, due to the wire bonding package technology in which all the
Vdd and Vss pads are located on the chip boundaries, many power
straps are required across different regions and routing channels.
In our case, the IR drop target is about 100 mV for each Vdd or Vss
network across the chip.
The following specifications are given for the power routing on
the chip for the Vdd network; the Vss network has the same specifi-
cations and equal metal lines in the routing [45].
40 DESIGN PERSPECTIVES
+ Vdd = 1.7100
+ Vss = 0.
+ xi_2865.n_5 = 1.6910
+ xi_2866.n_5 = 1.6932
+ xi_2868.n_5 = 1.6980
+ xi_218.n_5 = 1.6926
+ xi_219.n_5 = 1.6951
+ xi_2867.n_5 = 1.6890
+ xi_636.n_5 = 1.6945
+ xi_427.n_5 = 1.6386
+ xi_638.n_5 = 1.6846
+ xi_637.n_5 = 1.6817
+ xi_4.n_5 = 1.6659
+ xi_432.n_5 = 1.6281
+ xi_431.n_5 = 1.6504
+ xi_2870.n_5 = 1.7029
+ xi_840.n_5 = 1.6959
+ xi_424.n_5 = 1.6503
+ xi_423.n_5 = 1.7007
+ xi_428.n_5 = 1.6112
+ xi_425.n_5 = 1.6508
+ xi_434.n_5 = 1.6529
+ xi_430.n_5 = 1.6439
+ xi_429.n_5 = 1.6120
+ xi_433.n_5 = 1.6261
+ xi_426.n_5 = 1.6692
+ xi_6339.n_5 = 1.6979
+ xi_220.n_5 = 1.7008
Figure 2-6 shows the complete power grid (Vdd) simulation model.
The node voltages in the simulation by DC analysis are shown in
this figure and the lowest voltage is about 1.32 V at the center of
c02.qxd 12/16/2003 11:50 AM Page 42
42 DESIGN PERSPECTIVES
3 .8 mA 131 .0 mA
1 26.7mA 1 49 .3 mA
1 .7 2 V 1 .5 9 V1 02 .8 mA 1 .4 9 V 4 0. 6mA 1 .4 6 V 1.46 V 1 .5 8 V
1 .8 V Vdd
240m A 240m A 240m A 120m A
58m A 178m A
12.1 mA 54.3 mA 103 .4 mA 1 02.1mA 43.4 mA 5 6. 8mA
102 .7mA 2 19 .6 mA 1 .8 V
1 .8 V Vdd 3 8.7 mA 8 3. 2mA 2 66 .6 mA Vdd
1 69 .3 mA 1.66 V
102 .7mA 1 .7 3 V2 51 .4 mA 1 .5 2V 1.36 V 1.3 2 V 1.40 V
2 19 .6 mA
1 .8 V Vdd
1 02.7mA
Vdd
1 .8 V Vdd 17m A 157m A 280m A 280m A 280m A 140m A 1 .8 V
1 .8 V 1 .8 V 1 .8 V 1 .8 V
Current and Voltag e D istribut ions Vdd Vdd Vdd 1 .8 V Vdd
Vdd
i @ l Ch l S t 8 /28 /01
1.8
Lowest voltage V (fabric center)
1.6
1.4
1.2 30um pad
1 60um pad
0.8 90um pad
0.6 120um pad
0.4
0.2
0
1 (34um) 2 (68um) 3 (102um) 4 (136um) 5 (170um)
# 34um Lines per VDD Bus
(a)
Lowest voltage V (fabric center)
(b)
44 DESIGN PERSPECTIVES
兹苶
L苶
/苶
C
Q= ᎏ (2-2)
R
46 DESIGN PERSPECTIVES
48 DESIGN PERSPECTIVES
In order to plan the metal grid design for the full-chip power net-
work, the package model and decoupling capacitor model have to
be included in the entire AC analysis. A reasonably good AC pow-
er network model must be built. We discussed power network
modeling and characterization in Chapter 1.
In this section, we will examine the power network AC analysis
model from two high-performance microprocessors [47–48]. At the
minimum, the analysis must account for the Vcc source, the moth-
erboard Vcc/Vss traces, the board decoupling capacitors, the CPU
socket, the package pin, the power planes, the on-package decou-
pling capacitances, the CPU I/O, core circuits, and the global clock
distribution network.
c02.qxd 12/16/2003 11:51 AM Page 49
With this AC model, the CPU I/O and core can be toggled to
mimic the execution of the CPU, and the power network perfor-
mance can be measured and analyzed. The AC model from a high-
performance microprocessor is made of three submodels: the
package model, the I/O model, and the CPU core model. These
models are shown in Figure 2-12.
The I/O and core cell models are represented by an array of the
circuit models to model the global power grid on the M4 and M3
layers across the chip, with the switching current tied to each core
cell to model the switching activity of the circuit, as shown in Fig-
ure 2-13. The current model can be a triangular or other current
c02.qxd 12/16/2003 11:51 AM Page 50
50 DESIGN PERSPECTIVES
Figure 2-13. I/O and CPU core power network modeling [47].
waveform from the circuit simulation of this design. The I/O mod-
el will include the detailed I/O circuits.
Since the global clock tree will consume a lot of power, in this
model the detailed model of the clock tree is included for the
whole power network simulation. In addition, the decoupling ca-
pacitors are included in this model, as shown in Figure 2-13.
As shown in Figure 2-13, the total chip is partitioned into 180
core cells in this AC model. Each cell represents about 1150 ×
1000m2 of area in the chip. Each cell includes the modeling of
M4 Vcc/Vss, M3 Vcc/Vss and the back power plane network. The on-
chip decoupling capacitors are added in the model to simulate the
effectiveness of such capacitors.
c02.qxd 12/16/2003 11:51 AM Page 52
52 DESIGN PERSPECTIVES
54 DESIGN PERSPECTIVES
culty lies in routing the filled decoupling capacitors to the Vcc and
Vss lines in the layout. Once the decoupling capacitors are insert-
ed into the layout, the schematic should be updated with the in-
serted decoupling capacitors to make sure the layout versus
schematic (LVS) is clean in the layout verification.
When we update the schematic, the decoupling capacitors can
add one nMOS device, with the total gate area equal to the sum of
all the individual decoupling capacitors.
56 DESIGN PERSPECTIVES
nections to the chip, and the I/O pins to the board interface are
all included.
To analyze the on-chip power supply voltage drop, we need to
model the resistance, capacitance, and inductance of each power
bus segment. The nominal resistance at 25°C, R25 = Rs/width, is
determined by each layer’s sheet resistance Rs and the width of
the power line.
At an operating temperature of 85°C, the resistance is in-
creased with the following well-known linear model to reflect the
increase of the temperatures:
Z = R + j L (2-4)
58 DESIGN PERSPECTIVES
(a)
(b)
(a)
(b)
Figure 2-19. Decoupling capacitors and RC modeling [49]. (a) n-well junction
capacitor. (b) Thin oxide capacitor.
c02.qxd 12/16/2003 11:51 AM Page 60
60 DESIGN PERSPECTIVES
itor Cckt is derived from the built-in capacitance between Vdd and
ground in nonswitching circuits, as shown in Figure 2-20. The to-
tal capacitance C, the sum of Cp and Cn, from nonswitching cir-
cuits, is estimated as [49]:
62 DESIGN PERSPECTIVES
where Iavg is the average current switched by the logic gates from
the power lines originating from a Vdd pad. The term IR drop (⌬V)
is derived from Equation (2-6), which is based on the product of
the current I flowing through the effective resistance Reff. Based
on Equation (2-6), the methods to reduce the voltage IR drop are
summarized as follows:
Figure 2-21 shows a power supply connected to the chip pads. The
resistors in this figure are the effective resistances in the Vdd and
Vss power grid distribution. R11–R14 are for Vdd and R21–R24 are
for Vss. G1–G4 are for logic gates. When the designers are doing
the transistor-level simulation, the voltages (V1–V4) are assumed
to be equal.
In reality, due to the power grid resistances, the Vdd voltage
will be reduced due to the current flowing through resistors
R11–R14, whereas the Vss voltage will be increased due to the
same current flowing through resistors R21–R24. The worst-case
drop between the Vdd and Vss at any logic gate G1–G4 should be
estimated as follows:
where ⌬Vmax is the worst-case voltage drop between Vdd and Vss,
⌬Vdd is the IR drop of the Vdd distribution, and ⌬Vss is the IR drop
64 DESIGN PERSPECTIVES
noise [6, 52]. The IR drop is defined as the average of the peak
currents in the power network multiplied by the effective resis-
tance from the power supply pads to the center of the chip. There-
fore, in the wire-bonding environment, we can observe the worst-
case IR drop or the lowest supply voltage at the center of the chip.
Flip-chip technology, which provides area pads on the top of the
chip, can ease this problem and this package technology is seen to
be more popular for the chips employing 0.13 m process technol-
ogy due to the IR drop problem.
The following example shows the IR drop problem in a wire-
bonding package technology with five metal layers with 0.25 m
process technology. M5 is completely used for power straps to re-
duce the IR drop. Readers can see the severity of the IR drop
problem in the case of the wire-bonding package technology in the
communication chip.
A postlayout simulation methodology has been described as fol-
lows [54]. The methodology has been used in the standard cell de-
sign style in a Vdd and ground mesh structure, as shown in Figure
2-22. The standard cell design style has the regular rows of cells
aligned in multiple rows, and the power lines of the standard cells
are butted together in the same row. The circuit simulation to a
set of standard cells is used to understand the parameters that
impact the IR drop.
66 DESIGN PERSPECTIVES
앫 Simulate all standard cells and classify them into two class-
es: negligible IR drop impact and severe IR drop impact. The
latter class for all the standard cells will have current from
the Vdd to the cell at the switching points greater than the
current threshold (i.e., 1 mA).
앫 Draw the schematic of the Vdd mesh, featuring a metal resis-
tor for each vertical or horizontal metal segment of the power
mesh. It is recommended that a contact or via resistance be
inserted in order to improve accuracy. In the postlayout, the
RC extraction tool can be used to get the complete RC net-
work [59, 60].
앫 At each cell of the power grid, add a current source to model
the sum of the switching current of cells tied from this point.
앫 Partition the whole chip into smaller areas based on the cur-
rent source points in the above modeling. Inside each area,
we can calculate the average current from Vdd to all cells be-
longing to the area.
앫 A worst-case assumption can be made that all the cells in
this area will switch at the same time if we do not have the
switching activity patterns. But the best way is to decide
that the ratio of the cells will switch based on switching ac-
tivity patterns, so the worst-case whole switching total cur-
rent can be multiplied by this ratio (20%, 30%, or 40%) to get
a more realistic current consumption.
앫 The estimated average currents are taken as the current
sources. In addition, the current sources can be modeled as
triangular or trapezoidal waveforms, as shown in Figure 2-
18.
앫 Simulate the Vdd or Vss model with the interconnect RC and
current sources. If you have a large-sized power grid, the fast
circuit simulator will be preferred.
The standard cells simulation can be done using the stimuli vec-
tors to model the transient current waveform from Vdd to the
gates. The simulation can be done in different corners of the
process, with different temperatures, supply voltages, and transi-
c02.qxd 12/16/2003 11:51 AM Page 67
tion times of the input signals to the standard cells. Figure 2-23
shows the schematic of a few standard cells in the design [54].
68 DESIGN PERSPECTIVES
cause large sections of the die get turned on and off at various
times [61].
There are three ways to handle the di/dt: (1) lower the induc-
tance so that V = L · di/dt becomes lower, (2) add decoupling ca-
pacitance in strategic locations, and (3) identify and reduce,
where possible, high sources of di/dt in the design.
In order to get a rough idea of the magnitude of the problem, as
seen from the package pins, let us look at the maximum allowable
package–die loop inductances for several Intel microprocessors, as
shown in Table 2-3 [61]. The L · di/dt noise generated on the chip
can be calculated as follows in Table 2-3:
where L is the loop inductance, Icc is the total current from the
power supply to the circuits of the chip, and Tc is the clock cycle
time.
Table 2-3 calculates the inductance L, using Equation (2-8),
based on the power supply noise upper limit, about 5% of Vdd. If
we know the power supply noise upper limit, the Icc(average) of
the chip, and the clock cycle time or clock frequency, Equation (2-
8) can derive the maximum allowable loop inductance L. This
simple model shows dramatic reduction of the maximum allow-
able inductance in the design for the power network in high-per-
formance microprocessors with increasing frequencies.
Given an initial stimulus on the circuit, the power network Vcc
and Vss will try to oscillate 180 degrees out of phase at the ringing
frequency as follows:
1
0 = ᎏ (2-9)
兹苶
L苶C
앫 Supply the chip with as many Vdd and Vss pins as possible to
reduce the LVcc/Vss loop inductance.
앫 Add the decoupling capacitors on the die so that the highest
frequency components of di/dt do not need to be supplied by
highly inductive paths through the package and board.
앫 Try different architecture techniques to limit di/dt, especial-
ly in the case of clock gating for power saving.
70 DESIGN PERSPECTIVES
The timing failures are easy to catch during testing, but relia-
bility problems are not. Low-power design introduces its own set
of problems. An ideal low-power design would result in low values
of Iavg and di/dt. All units on the die would use small currents
when active and very little current when inactive.
Low-power designs for microprocessors can typically result in
reducing the maximum current peaks moderately, reducing the
time spent at peak levels greatly, and causing very low values of
current when the chip is carrying out easy tasks or is in standby
mode [61].
One concern is the use of lower voltage to achieve low power.
Although low power supply voltages help lower the power con-
sumed, higher transistor counts and higher frequency rates usu-
ally keep the Icc relatively high.
Lower Vcc usually means maintaining a lower absolute value of
the voltage noise. Considering the IR drop across the die, power
supply guard bands, and tester guard bands, very little margin is
left for the on-die power supply oscillations. Since the di/dt usu-
ally remains fairly high, large values of decoupling capacitance
are needed.
Decoupling capacitance reduces the power supply noise by
charging up during the steady state and supplying current during
the time at which the circuit switches. Also, decoupling capaci-
tance filters out the differential mode noise on the Vss line from
the power supply by keeping the Vdd and Vss constant.
Some amount of decoupling capacitance exists naturally on the
chip—capacitance of n-wells to the substrate, capacitance of the
circuits that are not switching, capacitance between the Vdd and
Vss traces, etc. A conservative estimate is that only 10–20% of the
circuits on the chip switch at any given time; the remaining cir-
cuits act as decoupling capacitors [61].
Additional decoupling capacitance is usually placed on the die
opportunistically if there exist unutilized areas on the die. One
example of this opportunistic capacitance placement is in the
routing channels with empty spaces. The difficulty is greater in
routing to the power grids for Vdd and Vss to these decoupling ca-
pacitors. The need for on-die decoupling capacitance is growing
with the increased operating frequency and increased die size.
A very common example of a large number of drivers switching
simultaneously occurs in wide signal buses. For example, in the
case of a microprocessor, the worst-case scenario is with the
c02.qxd 12/16/2003 11:51 AM Page 71
write-back bus on four different ports, for a total 292 bits switch-
ing simultaneously. Each bit drives a 5 pF load with a CMOS in-
verter size of pMOS = 120 m and nMOS = 78 m.
Figure 2-25 shows a plot of the maximum supply voltage drop
as a function of the total width of a p-transistor switching simul-
taneously from low to high for this write-back bus. The write-back
bus drivers are laid out in a strip 1000 m tall and 6000 m long
[61].
The power supply noise is obtained by simulating bus drivers in
a power grid model for this microprocessor, with the resistance
and inductance of lines and decoupling capacitors properly mod-
eled. In Figure 2-25, the amount of the decoupling (CD) related to
the total load (CD/Cload) is varied to show the effects on the power
supply noise [61].
Identifying potential noisy areas on the die based on the loca-
tions of wide signal buses is fairly easy. However, it is not an easy
task to find clumps of simultaneously switching random logic
gates on the die. Such clumps as commonly used can be as bad as
the example given above in terms of injecting noise into the sup-
ply rails. Hot spots can be identified by summing up the driver
sizes (pMOS only or nMOS only) that switch in the same timing
window from adjacent devices in the design.
Figure 2-25. Voltage drop versus driver size and decoupling capacitance [61].
c02.qxd 12/16/2003 11:51 AM Page 72
72 DESIGN PERSPECTIVES
2.6 SUMMARY 73
2.6 SUMMARY
3
ELECTROMIGRATION
76 ELECTROMIGRATION
0
i(t)dt (3-2)
冪莦 冕莦莦i莦(t莦)d莦t莦
T0
S
irms = ᎏ 2
(3-3)
T0 0
78 ELECTROMIGRATION
Idc
Cmax = ᎏᎏ (3-5)
s·fsw·Vdd
Irms
Cmax = ᎏ –1 (3-6)
fsw·Vdd
The maximum current allowed through all contact and via in-
terfaces is described as follows. The number of contacts and vias
placed across a line, perpendicular to the direction of the current
flow, must be maximized or increased as soon as the line width
permits, per layout rule restrictions, as shown in Figure 3-3.
If multiple vias are used, the allowable current value equals
the allowable current per via times the number of vias. In all
cases, the total current must not exceed the interconnecting metal
line current limit, as shown in Table 3-1.
Multiple vias, or maximum coverage arrays of vias, added down
the metal strip in the direction of the current flow do not increase
the maximum current flow. Only the first via, or row of the via ar-
80 ELECTROMIGRATION
ray, contributes to the current flow due to the nature of the inlaid
copper process [64]. Multiple vias, or arrays of vias, must be used
to increase the reliability in case of blocked or resistive vias.
앫 An interconnect database
앫 Device capacitance data
앫 Driver-strength database
앫 Electromigration limits for all design layers
82 ELECTROMIGRATION
a step voltage function at the driver inputs. For example, the de-
fault value for both variables is 10 ⍀.
You must set the voltage range and cycle time by using the ac-
tivity command. You can improve the quality of the estimate by
adjusting the activity ratio on a per-net basis over consecutive
analyses. The tool will report the nets that cannot be passed in
the current density check.
Method 2 produces more accurate currents in the net than
Method 1 and is almost as fast as Method 1. It requires the driver
information—the direction and strength of the ports driving a
net—to make more realistic current estimates. The tool will calcu-
late the driver data and place it into a file. This type of analysis
uses the same algorithm as Method 1, which enables you to re-
peat the electromigration checks for nets that failed in Method 1,
calculating worst-case values without driver information. When
you specify Method 2, you must use the Load Driver command to
load driver strength information.
Method 3 is the slowest but most accurate method. Consider
using this method only for critical nets, that is, nets that allow
failures during the electromigration analysis with Methods 1 or 2.
Using the driver information, the tool uses a simulation method
to determine Javg, Jpeak, and Jrms in every resistor. This analysis
gives the most accurate results for each resistor in the net but re-
quires a longer run time compared to Methods 1 and 2. Method 3
will require the driver information—the direction and strength of
the ports driving a net—to make a more realistic current estima-
tion.
When you specify Method 3, you must use the Load Driver com-
mand to load driver strength information. But more accurate
analysis using the detailed simulation in Method 3 will increase
the run time. Method 3 only analyzes the nets that failed in
Method 2.
Method 4 performs the electromigration analysis by using pre-
calculated average, RMS, and peak device currents. It can also de-
fine groups of devices that either charge or discharge a net. This
methodology assumes that truly parallel devices, which are tran-
sistors with the drain, gate, source, and bulk connected to the
same node, act together as a unit. It derives a separate solution
for each driver charging or discharging the net.
For devices with no current specified, it assumes a zero current
and does not calculate a separate solution. If you do not specify a
current for any of the devices connected to the net, the tool issues
c03.qxd 12/16/2003 12:05 PM Page 83
84 ELECTROMIGRATION
3.4 SUMMARY 85
3.4 SUMMARY
4
IR VOLTAGE DROP
88 IR VOLTAGE DROP
The location and design of I/O pads are a further source of the
IR drop. Simultaneously switching output pads, which always
have a large load, creates a strong demand for the power current
and causes IR drop. The placement of I/O pads and power pins is
a difficult design challenge. I/O rings normally have independent
power rings and pads to prevent I/O ring IR drops from affecting
the internal chip power.
Another common source of IR drop problems is the isolation of
block power grids. It is common to isolate the power grids for sen-
sitive blocks in the design, such as phase-locked loops and memo-
ries. However, power grid problems can result from excessive iso-
lation or insufficient isolation.
Excessive isolation occurs when the block’s power grid is so well
isolated that the resistance from the power pad to the block is ex-
cessive, causing the IR drop. Insufficient isolation occurs when
neighboring blocks create an IR drop that will impact the sensi-
tive block. The IR drop in the sense amplifier is of particular con-
cern for the memory design.
Many low-power design methodologies apply techniques to re-
duce the average power dissipation of a block. Techniques such as
gated clocking isolate the power demands to the times of the block
activity. Low power consumption does not necessarily mean low
IR drop. If we design the block power grid on the basis of average
power consumption, undersized power buses will create IR drop
problems.
The last source of IR drop problems is errors in connecting glob-
al power grids to block power grids. It is common to design the
global and local power grids separately. The power grid is de-
signed to attach the block power grid at a large number of points
after the block is finally placed.
Either manual or automatic techniques are used to insert the
vias in the design where the grids are to be connected. This
process may cause the attached points to be missed, resulting in a
large IR drop to a portion of this chip.
Power grid analysis helps to identify weak spots in the power net-
work. Weak spots are the lower supply voltages that result in ex-
cessive IR drop or ground bounce. A good power grid analysis tool
c04.qxd 12/16/2003 12:14 PM Page 90
90 IR VOLTAGE DROP
not only helps you find such weak spots, but also helps you under-
stand what you must change to improve the weak spots. The IR
drop analysis tool VoltageStorm™ Transistor-Level PGS from Ca-
dence Design Systems will perform this task [51].
It includes static, activity-based, and dynamic analyses. Power
grid analysis involves the extraction of power grid and netlist
data from your chip layout, followed by the analysis of the power
grid and netlist. The interface between circuit netlist analysis and
power grid analysis is implemented by using the tap currents.
In most cases, each tap current is a transistor current, but it
could emanate from a variety of elements. Tap currents are cur-
rents arising from the connection of transistors to the power grid.
Figure 4-1 shows a typical netlist analysis view of transistors con-
nected to a power grid. Each transistor is modeled with a tap cur-
rent, as shown in Figure 4-1(b).
If the netlist has a million transistors connected to the Vdd
wire, data for a million transistors is passed to the power grid
(a)
(b)
Figure 4-1. Tap current model of each transistor tied to Vdd [51].
c04.qxd 12/16/2003 12:14 PM Page 91
92 IR VOLTAGE DROP
94 IR VOLTAGE DROP
(a)
(b)
96 IR VOLTAGE DROP
control tends to create time steps that are too small for practical
use in the power grid analysis. It is used in netlist analysis,
but not in the power grid analysis. We can manually control the
step size used in the power grid analysis by setting the parame-
ters.
98 IR VOLTAGE DROP
where A is the activity ratio of the gate, CGATE is the total capaci-
tance of the nets in the gate including the load capacitance, Vdd is
the supply voltage, and F is the chip frequency.
Computing tap current on the basis of net activity introduces
two additional requirements for the layout extraction: (1) para-
sitic capacitances must now be computed for signal nets, and (2)
back-annotation of the net names from the schematic.
We can also derive the average transistor currents by perform-
ing vector-based simulation in the netlist analysis. This can
achieve more accurate average power grid currents by using the
transistor-level simulation of several vectors. This approach is
most commonly used at the block level for electromigration analy-
sis.
The tool uses one test vector input file, performs the simula-
tion over the vectors provided, and tracks the tap currents [51].
c04.qxd 12/16/2003 12:14 PM Page 99
It tracks the average, peak, and RMS currents at once and re-
ports them in three separate tap current files. Each tape of tap
current data provides a different perspective of simulation be-
haviour, allowing us to select which is the best suited to the
need.
Computing the average tap current on the basis of the vector
simulation requires one more additional condition for the layout
RC extraction: the parasitic capacitances should now be computed
for signal nets. If the vector input signals are not labeled in the
GDSII input, you must back-annotate the signal names in the
schematic to the extracted netlist from the layout.
Dynamic power grid analysis is the next step to improve the tap
current estimation accuracy, based on the input vectors at I/O
pins. It also includes the time variation of the currents in the
analysis. Rather than averaging the currents as in the static pow-
er analysis, this dynamic power analysis enables us to see the fine
time variation of currents over a clock cycle.
The challenge in the dynamic power analysis is to find the
weakness in the power grid by using the minimal amount of com-
putational time. A technique in the dynamic analysis includes the
capability for a form of vector compression in the creation of the
dynamic tap current data [51].
The vector compression is intended to create an effective worst-
case IR drop test vector by merging the behavior of many vectors
into a single equivalent vector set. The dynamic power grid analy-
sis introduces two additional requirements for the extraction be-
yond those of static analysis as follows:
are generated from the RC extraction from the layout; and the
transistor capacitances are embedded in the dynamic tap cur-
rents extraction. The decoupling capacitances are also included in
the transistor capacitances.
The dynamic power analysis processes the dynamic current
data as piecewise constant current sources. The recommended
step size is about a single gate delay. Another criterion is to use
one-tenth of the clock cycle as the step size, so if our clock cycle is
10 ns, it will use 1 ns as the step size in the dynamic simulation.
If we want to include the pin inductance, a smaller step size is
required, such as one-twentieth of the clock cycle. The power grid
solution is performed by constructing and solving the massive ma-
trix problem. The size of the matrix describing the resistive con-
nectivity of a full-chip Vdd network can be very large.
The number of resistors in the Vdd network can be the number
of metal layers times the number of transistors in the circuit. In
the five-to-six metal layers process, the ratio will be five to six
times; and 10 million transistors will have 50 million resistors in
the network. The matrix to solve the power grid analysis is huge.
VoltageStorm™ from Cadence Design Systems uses vector
compression to reduce the overall computational time, because
the time to solve a large matrix for each of the large number of
time points can be very large [51]. If we simulate the chip for 100
vectors, and select 10 steps per clock cycle in the dynamic analy-
sis, we may perform 1000 solutions of the power grid. This may
not be practical with existing computational resources. The vector
compression reduces the number of solutions to 10. It is useful
when our objective is to resolve the temporal issues of the static
analysis or to estimate the magnitude of worst-case the IR drop
more precisely.
The dynamic analysis will introduce the time correlation to the
analysis data. The chips are synchronous in their behavior, with
the clock being the synchronous signal. Introducing the temporal
correlation in the dynamic analysis splits the activity occurring at
different portions in the clock cycle, rather than modeling the
clock cycles as a single time-averaged value. The key is to improve
the resolution in a clock cycle, not across many clock cycles or vec-
tors [51].
For example, assume that we split a 10 ns clock cycle into 10
buckets {B1 – B10} of 1 ns each, B1–B10. B1 corresponds to the
interval 0.0–1.0 ns into the clock cycle, B2 to the interval 1.0–2.0
c04.qxd 12/16/2003 12:14 PM Page 101
ns, and so on. Figure 4-4 illustrates the current for gate G1 over a
clock cycle [51].
If gate G1 can only switch in the time interval corresponding to
bucket B2 in the dynamic analysis, the current value for gate G1
in buckets B1 and B3–B10 should be 0.0 A in all clock cycles. The
current value in bucket B2 may be 0.0 A in some clock cycles and
nonzero in others. Over the 100 vectors, 1000 total buckets corre-
spond to gate G1. The 1000 buckets correspond to 100 vectors and
B1–B10 offsets into each vector.
The second concept in vector compression is that of peak analy-
sis, with a goal of finding the worst-case current. When simulat-
ing to determine the peak current of a transistor, we take the
maximum value found for the transistor currents at each time
point.
We would like to find the worst-case set of current buckets for
gate G1 to create a current waveform for a single worst-case clock
cycle. We want to find the peak over many vectors, but meanwhile
want to preserve the time offsets or buckets in clock cycles.
In summary, the vector compression technique will assign the
worst-case bucket with the largest current to the specific gate
(e.g., G1) for many vectors and so on for all gates. For example,
Table 4-1 shows the peak currents in different input vectors at
bucket 2 for Gate 1, so 2.1 mA is used for the largest current for
Gate 1.
4.6 SUMMARY
The analysis of the power grid can be done either by static or dy-
namic methods. The static method uses the average current and
current scaling factor to estimate the static number for the IR
c04.qxd 12/16/2003 12:14 PM Page 104
drop. It is faster and easier to identify the weak spots in the pow-
er distribution grid by using this method.
The dynamic method improves the accuracy by simulating the
power grid and tap currents in multiple time points of the clock
cycle, similar to the transient analysis of circuit simulation for
both the circuit netlist and the power grid resistance network.
The dynamic method is not usually practical for a full-chip scale
due to the long simulation time required in multiple input test
vectors, but it is worthwhile to try it out using one or two test vec-
tors or the vector compression technique [51].
The best solution to a given IR drop problem depends on the
type of the IR drop, the chip architecture, the chip layout, and the
functionality. Several approaches can be used in the circuit and
layout design to fix power drop problems as follows [51]:
5
POWER GRID ANALYSIS
This chapter will explain how to use CAD tools to help you find
the weak spots in the power grid. We chose to use the Volt-
ageStorm™ tool from Cadence Design Systems, Inc. although sev-
eral other CAD tools perform similar tasks [51]. Weak spots are
implementation characteristics that result in excessive IR drop,
electromigration stress, or pin currents during the operation of
the chip.
There are three approaches to finding weak spots. The first is
finding the weaknesses in the power grid that are likely to impact
the proper functioning of the chip, regardless of the magnitude of
the impact. This approach is quite common and best addressed by
using static analysis. It is strongly recommended to use static
analysis before dynamic analysis, because static analysis can find
the problems quickly.
The second approach to finding weak spots is to predict a worst-
case IR drop vector on the basis of the limited coverage of the vec-
tors for analysis.
The third approach to finding weak spots is to address the pre-
cise voltage drop on the grid for a specific test vector. This ap-
proach is common in memory design or when the cost of changing
a design is high and we want to determine the exact magnitude of
the IR drop.
This chapter is organized in six sections. Section 5-1 describes
the data preparation and provides an overall introduction to a
CAD tool used for the IR drop analysis. Section 5-2 explains the
steps needed to execute the CAD tool. Section 5-3 discusses ad-
5.1 INTRODUCTION
The following sections show the steps used to load the input data-
bases, do the power grid analysis, and show the IR drop analysis
[51]. The next section will show a more specific design example for
the application of this CAD flow.
which will turn these transistors off, are assigned 0.0 A cur-
rent. We can exit the Thunder window as follows:
Thunder > quit
An alternative way of using Thunder is to create a com-
mand file, for example: ipeak.cmd, which contains the three
Thunder commands introduced. Then we can use the com-
mand line version of Thunder to perform the analysis by en-
tering this command as follows:
Shell > thunder.tty ipeak.cmd
The above command creates the VDD.ipeak output file,
the same as the pwrnet command’s output. Next, we can run
Lightning by using the following steps, which will load the
power grid RC network modeling, specify the power source
pin locations, load the Ipeak current data file (VDD.ipeak)
generated in the above steps, and then solve the linear net-
work of the power grid modeling with RC and tap currents,
and show the lowest voltage across the full-chip power grid.
3. Move to the working directory containing the Vdd power grid
database:
Shell>> cd $lightning_working_directory
4. Run Lightning:
Shell>> lightning
Lightning > load design_VDD.mhdr
The above step loads the binary power grid database for
Vdd and displays the power grid in the plotter window. The
metal layers are shown in different colors—such as M3 in
purple, M2 in tan and M1 in blue—in the layout display:
Lightning > putvsrc M3 Vsrc1 3.3 24000 17000
Lightning > putvsrc M3 Vsrc2 3.3 284000 17000
Lightning > putvsrc M3 Vsrc3 3.3 24000 12000
Lightning > putvsrc M3 Vsrc4 3.3 284000 12000
The above step is used to define where the power source
pins are placed. Four Vdd pads are placed in the chip bound-
ary. M3 is the power line, which is started from the Vdd in-
put pin. {24000 17000}, etc. are the X–Y locations of the pads
in the layout with the drawn dimensions.
Be sure to name each source differently. After each com-
mand, a white dot corresponding to the placement of the
voltage source in the layout plotter window can be seen.
The voltage is actually placed at the power grid subnode on
M3 near the specified location. Units are in m in general
c05.qxd 12/16/2003 12:22 PM Page 110
ters, and each filter contains a range. We can establish a set of fil-
ter ranges by using the following methods:
shows where the circuit has the largest IR drops, as well as the
voltage trends from the power pins to each area of the chip.
A design example shows that the red colors in the plot, which
has the largest IR drop, are located in the central control units on
the left side of the chip [51]. The reason for this is that the power
routing for much of the control circuitry is provided only from one
side of the block, yielding high IR drops at the isolated end of the
power bus, whereas the power is supplied to the top and bottom
blocks from both sides of the block.
We can also use the VoltageStorm™ tool to view the geometric
distribution of tap currents. We can use the scan tc command to
view the total currents in the design. We can also create filters to
plot the tap currents. Before doing this, we can set the analysis
tape to Tap_Current, as shown in Table 5-1. One design example
shows that the larger areas of currents are located in the data
path units of the chip and smaller currents in the control units.
Resistor current value distribution has different characteristics
than node voltages. The maximum current should be near to the
power pins and the minimum near the transistors. After observ-
ing the IR drop in the plots, the next examination is of the current
flows in the circuits to create the IR drop. The current flow trends
may not be as you expected or currents from several power pins
may merge in the middle of the chip to create a high current
through the wires with high IR drops.
We can use scan rc command to get the data for the resistor
current, and set the analysis type to resistor_current before the
filtering and plotting. Finally, we can use the plot rc command to
obtain colorful plots with the highest-current chip area in red, the
medium resistor current area in orange, and the low-current area
in green.
Once we understand how current flows through the chip, we ex-
amine the current densities of metal wires, which is a first-order
indication of the electromigration failures within the chip. Use
the Current_Density analysis type to examine them.
We can change the current density limits for metal layers. The
current density reporting is not based on the actual value, but on
the ratio of wire current density over the required limit. If the ra-
tio is more than 1.0, there is a potential electromigration failure
in that area.
Summarizing the modeling, analysis, and viewing results using
the VoltageStorm™ (Thunder and Lightning) tool suite from Ca-
dence, the recommended design flow is described as follows [51]:
c05.qxd 12/16/2003 12:22 PM Page 114
solve the power grid again, which will save the computa-
tional time for a large-size chip.
10. Iterate between using the scan ir command and setting the
filter to derive a good set of filters to observe the IR drop.
These filters are generally equal-sized steps. We can gener-
ate the plot of the IR drop by using the plot ir command.
Use the image command to create a GIF image of the IR
drop plot.
11. Iterate between using the scan rc command and setting the
filter to derive a good set of filters to observe resistor cur-
rent flow in the chip. These filters are generally decreased
in magnitude logarithmically. We can generate the plot of
the resistor current by using the plot rc command. Use the
image command to create a GIF image of the resistor cur-
rents plot.
12. Create a plot for each layer with only the metal layer and
its error layer turned on. For example, turn off the grid and
all errors, then turn on M2 and M2 errors and save the im-
age. These plots help in understanding the behavior of each
metal layer.
13. Iterate between the scan rj command and filter setting to
derive a good set of filters to observe the resistor current
density in the design. These filters are generally decreased
in magnitude logarithmically. We can generate the plot of
the current density by using the plot rj command.
Look for the few wire segments that have the highest val-
ues. Create a GIF image of this plot using the image com-
mand. If we define the appropriate model parameters, we
can also generate a plot of electromigration risk.
14. Create a plot for each layer with only the metal layer and
its error layer turned on. For example, turn off the grid and
all errors, and turn on M2 and M2 errors and save the im-
age. These plots help us to identify the specific wires most
likely to fail because of electromigration.
#! /bin/csh -f
#This is the flow for core Vdd/Vss nets IR static analysis
#(ocelot, 64bit, 4GB Memory, 30GB disk)
#/chip/thunder/tablegen tablegen.cmd
cd /remote/chamfs3/jonathan/simplex/viper_d_2/xt
xtc64 chip_xt.cmd
net_profile chip_cmln.net
cd /remote/chamfs4/home/qing/simplex/viper_d/thunder/static_1
thunder.tty64 run.cmd
cd /remote/chamfs4/home/qing/simplex/viper_d/thunder/static_2
thunder.tty64 run.cmd
cd /remote/chamfs4/home/qing/simplex/viper_d/thunder
itaputil combine static_1/vdd.ipeak static_2/vdd.ipeak vdd.ipeak
itaputil combine static_1/vss.ipeak static_2/vss.ipeak vss.ipeak
cd /remote/chamfs4/home/qing/simplex/viper_d/firePOWER
runFEX_p_a&
runFEX_p_b&
runFEX_p_c&
runFEX_p_d
cd /remote/cougar3/simplex/viper_d/firePOWER
mergenet _s VDD_simplex -o add /remote/chamfs4/home/qing/simplex/viper_d/firePOWER/chip_cmln_*.hdr
mergenet _s VSS_simplex -o vss /remote/chamfs4/home/qing/simplex/viper_d/firePOWER/chip_cmln_*.hdr
cd /remote/cougar3/simplex/viper_d/lightning/static_vdd
lightning.tty64 run.cmd
cd /remote/cougar3/simplex/viper_d/lightning/static_vss
lightning.tty64 run.cmd
(a)
Figure 5-3. Voltage plots of power distribution networks [53]. (a) Vdd.
where A is the activity ratio of the gate, Cgate the total capaci-
tance of the nets including the wires and gates, F the clock fre-
c05.qxd 12/16/2003 12:22 PM Page 121
(b)
quency of the chip, and Vdd the supply voltage. This equation for
the average current is derived by considering the charge, Q, re-
quired to charge the outputs of the gate in a clock cycle interval
(1/AF).
This derivation of average current is not a function of transistor
sizes. If your design has multiple clocks, select one clock to be the
reference for the activity analysis, and scale the gates associated
with other clocks accordingly.
For example, if CLK1 has a period of 10 ns and CLK2 has a pe-
riod of 15 ns, and CLK1 is to be the reference for activity-based
analysis, scale the activity of gates in the CLK2 domain by 0.666.
On the other hand, if we have the actual toggle numbers of all
nets, use CLK1 as the reference to divide the net toggle counts to
derive the activity values.
Here are steps to run the activity-based static power grid
analysis using the VoltageStorm™ tool [51].
c05.qxd 12/16/2003 12:22 PM Page 122
When we use the VDD.avg file, for example, the IR drop in this
case goes down as far as 3.265 V [51]. However, when we use the
VDD.max file, the IR drop goes down to 2.772 V [51], but this
number may be an overestimation of the peak IR drop in the pow-
er grid, because it models all the transistors turned on to their
maximum currents at the same time.
The actual peak IR drop is somewhere between that reported
using the VDD.max file and that reported using the VDD.avg file.
Use the VDD.max file only on the small blocks to perform an easy
pass and fail screening of the block power grid.
If we want to apply peak currents to large designs to which
many vectors have been applied, we will see an unrealistic mea-
sure of the IR drop. We can use it on small blocks in which many
gates could potentially switch at the same time.
5.6 SUMMARY
With the complexity of the power grid and reduced power supply
voltages in modern VLSI chips, CAD tools are necessary to assist
designers in finding failures or weak spots in power network de-
signs. This chapter discusses the most popular tool, Volt-
ageStorm™ from Cadence, with modeling and analysis capability,
and explains how to use this CAD tool to aid in IR drop analysis
and improvement.
The tool provides the following capabilities: (1) modeling of the
power network in the resistance network, (2) modeling the tran-
sistor switching current in the tap current, and (3) solving the
power network model in the linear circuit.
The tool also provides the capability to help designers locate
and fix errors in the power grid layout. For example, PGS explo-
ration is one example that uses the internal power grid analysis
database to fix the power grid and output a list of changes needed
with zero violations for the power grid. Layout designers can use
up these changes, as necessary, to fix the power grid design.
c06.qxd 12/16/2003 12:24 PM Page 135
6
MICROPROCESSOR
DESIGN EXAMPLES
and both AC and DC drops are reduced. For the local metal lay-
ers, a tree-based distribution was chosen, with custom width se-
lection for the trunks and branches according to the area current
drain requirements. The global power grids and associated local
tree structures are shown in Figure 6-1 [66].
It is difficult to optimize the power distribution using a single
C4 bump pitch for both the I/O and the core due to their different
requirements. In the core, the optimization is primarily driven by
the potential for power collapse but constrained by the effective
routing channel space available for global signals. However, in
the I/O area, power collapse, minimization of the interconnect
length to a C4 bump, and package-level routability are some of
the additional constraints.
A 252 m bump pitch for the core and 235 m bump pitch for
the I/Os were chosen [66]. The overlap region between the core
and I/O area is strapped with custom power grids. In the I/O ring
design, special attention was paid to the placement of signals and
power/ground bumps and their ratio, such that loop inductance is
minimized while maintaining the continuous return paths for I/O
signals.
The processor is packaged in a six-layer organic land grid array
(OLGA) package. Dedicated power and ground planes are used to
minimize the package-level power distribution and the noise due
to package-level power distribution. Power distribution was de-
signed with two different Vcc supplies to enable lower-power ap-
plications.
The core power supply voltage level can be dropped significant-
ly while maintaining the I/Os and other special analog circuits
with a different supply. All of the special circuits within the core
were verified at a 1.1 V supply voltage to enable this voltage scal-
ability.
c06.qxd 12/16/2003 12:24 PM Page 137
(a)
(b)
Figure 6-1. (a) Global power grid (M4 and M5) and (b) local power trees for the
Intel IA-32 Pentium-III chip [66].
c06.qxd 12/16/2003 12:24 PM Page 138
are located at the bottom of the chip. The control and execution
units are located in the middle of the chip.
The instruction catch and decoder blocks are located at the top
of the chip. The clock is distributed from the PLL output up to the
flip-flops through a balanced tree network. All the inputs of flip-
flops and clock buffers are connected through a clock grid network
to minimize clock skew.
The main power network uses a grid in M5/M6 and M7 (three
metal layers). There are 2065 solder bumps, of which 1251 are
used for Vdd and Vss. These bumps are area-distributed over the
chip area by the flip-chip technology. The I/O contains 800 solder
bumps, 470 of which are signal bumps, whereas 330 are used for
power and ground. The bumps in the core area and in the channel
regions are placed away from the active circuitry to prevent soft
errors due to alpha particles released from the bumps.
c06.qxd 12/16/2003 12:24 PM Page 141
Table 6-2. Process and device parameters for the Hitachi SuperH™ CPU [68]
Technology 0.2 m, P-sub, triple-well CMOS
Gate channel length (Lg) 0.2 m (1.8 V device) and 0.35 m (3.3 V device)
Gate oxide thickness (tox) 4.5 nm (1.8 V device) and 8 nm (3.3 V device)
Threshold voltage (Vth) 0.15 V (1.8 V device) and 0.45 V (3.3 V device)
Metal layers Metal 1–3 (0.88 m pitch) and Metal 4–5
(1.76 m pitch)
Area 6.84 × 6.84 mm2
Transistor count 3.3 M
c06.qxd 12/16/2003 12:24 PM Page 142
the active mode, the fluctuation in the substrate bias causes signif-
icant threshold voltage variation and lowers the operating speed.
The peak overshoot of the substrate noises can be reduced by
lowering the supply voltage or increasing the source and sub-
strate diffusion capacitances. The decap time of the noise depends
on the substrate impedance. A long decap time that exceeds the
cycle time causes the substrate noise to accumulate.
To reduce the substrate impedance and achieve substrate bias-
ing, the switched substrate impedance scheme has been devel-
oped. This scheme switches the substrate impedance, as well as
the substrate bias, according to the operation mode. Figure 6-6
shows the switched impedance scheme for this microprocessor.
A standby controller and a vbb controller (VBC controller) con-
trol the voltage of the substrates, denoted as vbp for the pMOS
substrate and vbn for the nMOS substrate. In the standby mode,
these are driven with a high-voltage, high-output impedance dri-
ver in the VBC macro. In the active mode, the substrates are driv-
en with about 10000 switch cells over the chip [68].
Each switch cell consists of two thick-tox and high-threshold
voltage MOS transistors. One transistor with a gate signal cbp is
connected to vbp and add. Another with a gate signal cbn is connect-
ed to vbn and vss. These transistors reduce the substrate imped-
c06.qxd 12/16/2003 12:24 PM Page 143
ance; in other words, they keep the substrate biases of the MOS
transistors equal to their local power supplies.
Therefore, even if the local power supply drops due to a power
line pump or simultaneous switching noise, the substrate bias is
quickly recovered. The VBC macro consists of four circuits—
VBCP, VBCN, VBCI, and VBCG—and is fed by supply voltages
add (normally 1.8 V) and vwell (3.3 V). VBCG generates vsub volt-
age, which is a negative voltage used as the third voltage source
in the VBC macro. The vsub voltage is equal to add – vwell = 1.8 V –
3.3 V = –1.5 V.
Figure 6-7 shows the waveforms of a complete transition from
active mode to standby mode. When the microprocessor goes from
the active to the standby mode, the standby controller stops all
c06.qxd 12/16/2003 12:24 PM Page 144
1.8 V logic circuits. After that, it issues a vbbenb signal. Then the
VBC macro drives cbp up to vwell (3.3 V) and cbn down to vsub (–1.5
V). These signals turn off all switch cells. The VBC macro also dri-
ves cbp to 3.3 V and vbn to –1.5 V. This mode transition takes
about 50 s.
Figure 6-8 shows the layout of a standard cell and a switch cell
for random logic circuitry. Both cells have the same height. In a
conventional CMOS cell, the substrate biasing lines, vbp and vbn,
are connected to the power lines (add and vss) locally. In the new
scheme, these lines are interconnected separately to bias the sub-
strate.
The substrate bias lines vbp and vbn are interconnected by M1
and are parallel to the power lines add and vss. The switch cell has
additional vertical power lines add and vss interconnected by M2.
Furthermore, between add and vss, there are four metal lines: two
c06.qxd 12/16/2003 12:25 PM Page 145
are the substrate biasing lines vbp and vbn and the other two are
the gate lines cbp and cbn.
In order to reduce the chip area overhead, the design uses iden-
tical heights for each cell compared to the conventional CMOS
cell, as shown in Figure 6-8 [68]. The width of the power lines to
M1 is reduced to about 77% that of the conventional CMOS cell.
This increases the impedance of the power lines.
To reduce the impedance, the power lines are routed in a fine
mesh structure. Figure 6-9 shows the metal routing of vbp, vbn, cbp,
cbn, and power lines. The switch cells are placed in rows, and the
distance between two switch cells is about 200 m. The thicker
metal levels of M4 and M5 also form a coarse power line mesh that
reduces the impedance of the power lines. The chip area overhead
of the switch cells is less than 2% because the switch cells are
placed under the power lines in M2, as shown in Figure 6-9.
The data flow in the data path is designed so as to be parallel to
the power lines and p- or n-wells. This layout will reduce the
c06.qxd 12/16/2003 12:25 PM Page 146
Figure 6-11 shows the decoupling capacitor cell that fits under
the data flow wiring tracks. The cell is double bit-pitch wide (43.2
m) and 14 tracks tall (25.2 m). Two out of the 14 horizontal
wiring tracks are specially blocked for the decoupling capacitor
wiring so the capacitor can fit right under the wiring tracks. A
low-resistance layout of the capacitor cell provides a fast time con-
stant of about 85 ps.
This die with 750 I/O signals and 1735 power bumps is flip-chip-
attached to a multilayered ceramic land grid array package [70].
Figure 6-12 shows the die micrograph of the chip [70]. The pack-
c06.qxd 12/16/2003 12:25 PM Page 149
This simulation was done after the core was attached to the
pad ring and the result shows a black region in the bottom right
of the die. This large IR drop being highlighted is where the
power supply connections between the core and the pad rings are
incomplete. A hook-up is added here later to fix this IR drop
problem.
Voltage regulation requirements of each generation of micro-
processors are more critical as the on-chip voltage decreases and
the AC current increases. Distributed thin-oxide capacitors are
used for supporting instantaneous current variations within the
die, but are insufficient to compensate for the tank circuit formed
by the parasitic LC in line with the supply distribution.
Simulation shows nearly an order of magnitude increase in
supply network AC impedance seen by an internal gate at reso-
nance. This resonant frequency is much lower than the system
clock frequency but can limit the speed performance. A special
voltage regulator circuit is placed 99 times to reduce the reso-
nance from the board to the package to the chip.
The voltage regulator circuit increases the charge stored or de-
livered by a given amount of added decoupling capacitors by ac-
tively increasing the voltage across the capacitor’s terminals. The
operation is done by stacking fully charged equal value capacitors
c06.qxd 12/16/2003 12:25 PM Page 152
the series switches, are completely off. Then Vinst drops, causing
node B to fall, cutting off N2. Slightly later, node A falls, turning
on P1. This changes C2 from being in shunt with C1 to being in
series. Similarly, the mirror devices, P2 and N1, are being cut off
and turned on, respectively. This allows the series-connected C1
and C2 to discharge into the power grid, which forces Vinst up. In
the next time section, where Vinst > Vave, node A rises, cutting off
P1, and then node B rises, turning on N2. Similarly, N1 turns off
and then P2 turns on. This switches C1 and C2 into the shunt
mode, allowing them to be charged by Vinst and forces Vinst to drop.
Once Vinst = Vave, node B returns to Vdd/2, which returns the cir-
cuit to the weakly charging mode.
The switched capacitors are enhancement mode MOSFET de-
vices, laid out in a waffle-type structure to maximize capacity
[70]. The regulators are evenly distributed across the chip in 99
instances, which are directly hooked up to the main global power
grid.
Care has been taken in shielding sensitive signals and in man-
aging high-current-density paths. The regulators are placed un-
derneath the global routing channels to reduce the layout area
impacts.
6.7 SUMMARY
7
PACKAGE AND I/O DESIGN
FOR POWER DELIVERY
The length of the electrical connections between the chip and the
substrate can be further reduced using flip-chip or C4 technology.
This technology is achieved by distributing the I/O solder bumps
Power Distribution Network Design for VLSI, by Qing K. Zhu 157
ISBN 0-471-65720-4 © 2004 John Wiley & Sons, Inc.
c07.qxd 12/19/2003 9:06 AM Page 158
over the die, flipping the chips over, aligning them with the con-
tact pads on the substrate, and connecting the solder bumps be-
tween the chip and package to make connections.
This saves silicon area and increases the maximum number of
I/O and power/ground terminals available with a given die size.
This package also provides more efficiently routed signal and
power/ground interconnections on the chips. Therefore, modern
high-speed chips and microprocessors use this flip-chip technolo-
gy to achieve high speed and lower power noise.
For example, the 450 MHz RISC microprocessor from Motorola
has a chip footprint with a total of 794 C4 or flip-chip pads [72].
Two hundred and sixty-six pads are used for 64-bit bus transfer,
64-bit L2 interface, and control. The remaining C4 pads are used
for power and ground and possible extension to 128 bit bus trans-
fer and L2 interface options.
The 1.8 V Vdd and ground C4 pads are distributed over the core
of the chip to reduce the voltage drop and feed the internal power
structure. The signal I/Os are distributed around the periphery to
reduce the wiring congestion in the package substrate and to iso-
late the ESD structures from the internal circuits.
L2 cache interface C4s are placed along the left side (bits
0–63) and bottom (bits 64–128) of the chip. This allows for an op-
timal multichip module design of this processor, with two
SRAMs using the 360-pin solution. The data transfer signals are
on the right side of the chip, and the address/control signals are
at the top.
A total of 236 Vdd and ground C4 pads are used for the internal
1.8 V core and supply 1.8 V power to off-chip I/O drivers and re-
ceivers, 55 Vdd and ground C4s for the external L2 interface, and
73 Vdd and ground C4s for the external bus transfer address and
control [72].
Flip-chip connection technology as the first level chip-to-pack-
age connection option traditionally is regarded as being the con-
trolled collapse chip connection (C4) process, which was originat-
ed by IBM [73].
Figure 7-1 shows the schematic, which is a bare IC device
flipped upside down with its active area or I/O side attached to a
substrate via a connecting medium. The device may be any of the
substrates providing an interconnection network between the
flipped active device and other active, or even passive devices,
such as the decoupling capacitors.
c07.qxd 12/19/2003 9:06 AM Page 159
where Leff is the effective inductance of the power and ground con-
nections, di/dt is the peak rate of the change of the currents for
each driver, and N is the number of drivers used during the
switching. di is the current demand of each driver during the
switching event, and dt is the rise and fall time of the signal.
In reality, the ⌬V does not increase linearly with Leff or N, be-
cause any increase in ⌬V will slow down the circuits and reduce
the di/dt. The effective inductance Leff is primarily a function of
the package design. Reducing Leff requires minimizing the induc-
tances of the power and ground distribution networks and also
the use of the decoupling capacitors.
The decoupling capacitor placed between the power and ground
pins of each chip can act as a local source of the charges during
the switching events, so that not all of the switching current has
to be supplied from the system ground to minimize the local
change in voltage. Figure 7-2 shows the equivalent circuit model
for a CMOS output driving a capacitance [74].
c07.qxd 12/19/2003 9:06 AM Page 161
Table 7-2. Noise budgets for package and system level at 6°C [74]
Noise Source Noise Budget (mV)
Load reflections 100
Interconnect impedance mismatch 100
Crosstalk 100
Simultaneous switching noise (SSN) 150
AC noise 25
Signal IR drop 25
Vcc IR drop 14
Internal chip noise 50
1
f = ᎏᎏ (7-3)
2兹L苶C
苶苶d
苶C
兹L 苶苶d
Q= ᎏ (7-4)
R
very high frequency with the flip-chip package and use both the
package-level and on-chip decoupling capacitors.
The degree of ground bounce depends on multiple factors, such
as the total current inputs to the chip, the clock delay and skew,
and the switching activity factor. With reduction in clock delay
and clock skew, the higher harmonic components will become
stronger, and the most acceptable design technique would be
staged decoupling, with both on-package and on-chip decoupling
to reduce the package resonance to a small value.
With no additional on-package decoupling capacitance, it re-
quires a very large on-chip capacitance to decouple the package in-
ductance. As the power and ground inductance decrease to meet
the simultaneous switching noise reduction requirement, the re-
quired on-chip capacitance becomes larger. On-package decoupling
capacitors should be used to decouple the package inductance.
The value should be high enough to make the first resonant fre-
quency and the resonant impedance sufficiently low. The reso-
nant frequency, as specified in Equation (7-3), should be four to
five times smaller than the clock frequency.
The following parameters should be considered when predict-
ing the high-frequency ground bounce:
Pcore
Bounce(fclk) = ᎏᎏ (7-5)
3.3 · Zin(fclk)
Here Pcore is the power dissipation due to the core gate, including
the flip-flops. As the power dissipation increases, it becomes nec-
essary to decouple at very high frequency.
With less on-chip decoupling, it is important to reduce the chip-
to-package inductance with integrated decoupling, along with
large high-performance on-package decoupling. The bounce mag-
nitudes observed in the simulations are less than 70% of the val-
ues predicted by the above equations [75].
The delay derating factor for the ASIC standard cell library is
Kv = 1.03 for a 160 mV reduction in the supply voltage, or 3% in-
crease in the delay for a 5% reduction in the voltage [75]. For 320
mV peak bounce on both Vdd and Vss, the delay penalty is 6%, ap-
proximately the dynamic effect of the bounce with an average ef-
fect.
For a critical path in a 100 MHz system, if only a 5 ns delay
with gate and loads is produced by this bounce, the delay penalty
is 300 ps [75]. Unless the low-frequency bounce is designed within
a controlled limit, the effect on chip power consumption may be
noticeable. This results are from the fact that the power consump-
tion from the ground bounce affects all gates in the chip.
The simulations indicate a significant variation in the power
dissipation. For example, for a very high performance package
with no on-package decoupling, the power dissipation may vary
from 13.0 W to 18.9 W [75]. One effect of the power consumption
from low-frequency bounce is that it is dependent on the relative
position of the clock frequency and resonance frequency.
A study methodology for the ground bounce and decoupling ca-
pacitance has been proposed as follows [75]:
that includes the package. Figure 7-5 shows the simulation result
of the Vss bounce noise for a flip-chip package [75].
Vcc
250 m
Vss
(a)
(a)
Vcc
Vss
250 m
Vcc
Vss
(b)
(b)
(a) (b)
the chip to feed the middle of the chip. Both M4 and M5 are as-
sumed to be 1.8 m thick and about 21 m⍀/square. This is an op-
timistic assumption, since only the last layer can be made signifi-
cantly thicker than the other layers. M3 is assumed to be 0.8 m
thick and about 47 m⍀/square.
The average current drawn by the chip is about 12.5 A and the
supply voltage is about 2.5 V. We assume in all cases 30% for Vcc
and 30% for Vss. M3 is used for equalization of about 5%/5%
Vcc/Vss. The effective resistance of M4 for Vss or Vcc is increased by
70 m⍀/square [76].
The average voltage drop can now be calculated by considering
uniform current injection from one side. The current is reduced by
two times, the resistance is only 0.5 square, and the current is re-
duced linearly from the edge of the chip to the middle as follows:
where the Itot is the total current consumed by the chip and Rs is
the metal sheet resistance. For the case of the interdigitated pow-
er supply in M4 with the Vcc/Vss metal widths of 30 m/30 m, as
shown in Figure 7-6(b), the power is supplied only from two sides
and only M4 is used to carry the average current.
Based on Equation (7-6), for this case, the voltage drop is calcu-
lated as: Vcc_drop = 6.25 · (52.5e – 3/2) · 0.5 = 82 mV. The average
drop in the Vss is dependent on the number of substrate taps. The
number of taps is determined by peak noise considerations, so the
average voltage drop will be small.
On average, we could get the Vcc ~ Vss = 130 mV, so we could
achieve a good routability with only 40% Vcc in M4 and a tolerable
average voltage drop. But this power routing configuration with
the wire-bonding package has high inductance and, therefore, it
has a high switching noise drop across the package and chip.
We consider the second option of the metal routing for Vcc and
Vss using the wire-bonding package. The M4 Vcc and Vss are 30/30
m in width and the M5 Vcc and Vss are also 30/30 m in width.
We assume that the M4 and M5 have the same metal thickness.
We also assume that the power supply is from all four sides of the
chip, so the inductance will be reduced.
We can roughly estimate that the voltage drop from the chip
side to the middle is reduced to 220 mV/2 = 110 mV, and the
routability is also improved significantly with the fifth metal lay-
er (M5) added for the power routing [76].
The C4 power distribution is quite different. The resistance in
the package plane is only 2.36 m⍀/square, so the voltage drop in
the package from the edge of the die to the center, assuming a
uniformly distributed current injection to the chip, is about (12.5
A/4) · Rs/4 ⬵ 2 mV [76].
One suggestion is to place the power routing on the package in-
stead of on the chip. The maximum number of solder bumps on
the 17 × 17mm2 chip with a minimum pitch of 250 m is 172/0.252
⬵ 4600.
Since the landing pad of the solder ball is 70 × 70 m, the total
area used, if we use a maximum number of solder balls, is 4600 ·
0.072 = 23 m2, which is about 8% (23/17 · 17) of the chip area. By
using about half of the solder bumps for power and ground, we
need little local routing in M4/M3 from the solder bumps. In addi-
tion to reducing the inductance, the C4 technology also reduces
the on-chip power routing significantly.
c07.qxd 12/19/2003 9:06 AM Page 172
1. The current source for Vcc and Vss. The current waveform
can be injected between the local Vcc and Vss power supplies.
2. The decoupling capacitances, with the modeling of parasitic
capacitance and an explicit decoupling capacitance.
3. Power network metal RL modeling.
Figure 7-9. Full-chip and package model of power distribution network [76].
el. It turns out that the n-well decoupling capacitance is not very
effective in absorbing short spikes, due to the high lateral resis-
tance in the well.
Figure 7-11 shows the average noise with and without the 100 nF
on-chip decoupling capacitors. Obviously, the noise performance is
better with the additional decoupling capacitors. However, the
noise without the additional decoupling capacitors is not that se-
vere, so the intrinsic parasitic capacitance does help significantly.
In order to find the exact requirement of the on-chip decoupling
capacitance, a better knowledge of the clocking strategy, the pow-
er saving requirements, and bus protocol is necessary. In the pow-
er network noise simulation, the switching currents are modeled
on the power grid model. The modeling of the switching currents
is the key to the power noise results.
Figure 7-12 shows the current waveform used in Figure 7-11’s
result. It uses a waveform peaking at the beginning of the cycle
with the rising edge of the inserted clock and falling off to 50%
and 20% of the peak at the middle and the end of the cycle, re-
spectively.
The average current is 500 mA and the peak is about 800 mA,
and with 25 cells in the full-chip modeling this results in current
Figure 7-12. Switching current waveforms injected into each cell [76].
1. Normal mode: all 25 cells in the full-chip model use the cur-
rent waveform shown in Figure 7-12, with an average cur-
rent of 0.5 A each and total current 12.5 A.
2. Power saving mode: 2 × 3 or a total of six units in the lower-
right corner are turned off in one cycle to simulate the effect
of the power saving in large units, about 24%.
3. Peak power mode: the current in one unit in the center of
the chip is five times larger for five cycles in order to simu-
late the effect of local peak and average activity.
4. I/O noise mode: we assume that in the worst case, 150 I/Os
switch with 75 I/Os at two sides. In the model with five ele-
ments per side, the current in each side element is ramped
to (75/5) · 70 mA = 1 A in 1 ns, and back to 600 mA after 2
ns, and kept high for 8 ns by assuming the bus speed is half
the clock frequency.
c07.qxd 12/19/2003 9:06 AM Page 177
Table 7-4 shows the simulation results for the above test condi-
tions in different package and power routing configurations. The
metal utilization and approximate resistance and inductance val-
ues are summarized in the table. The inductance and metal uti-
lization of the C4 technology is much lower than the cases in the
wire-bonding technology.
Table 7-5 shows the result for the Vcc–Vss noise comparisons. It
is interesting to note that the power saving and peak power condi-
tions cause larger power peaks than the I/O noise. In the past,
I/Os have been known to cause most of the noises.
(a)
(b)
Figure 7-13. Switching power noises in (a) Vcc and (b) Vss for the peak power
test [76].
c07.qxd 12/19/2003 9:07 AM Page 180
Figure 7-14. Measured noise at the pins of the chip package [77].
tio between the noise at the die and the noise at the pins for vari-
ous decoupling capacitors at the board.
Figure 7-16 shows the on-board decoupling capacitor model for
a ceramic 1 F capacitor with 15 m⍀ parasitic resistance and 2.1
nH parasitic inductance [77].
Figure 7-15 shows that for a majority of the test range frequen-
cies, noise at the die becomes much greater relative to the noise at
the pins as the number of capacitors increase on the board. As the
frequency increases to greater than 60 MHz, the noise at the pins
quickly becomes much greater than the noise at the die, due to
the resonance effects. A measurement bandwidth is selected to
achieve a more consistent relationship between the pins and die
noise over this frequency range.
To understand the effects of varying the number of decoupling
capacitors at the board, a model is developed for the package and
die’s parasitic to be used in the SPICE simulation [77]. Using pre-
viously taken test data, it is possible to plot how the simulation
model compares to the actual device.
Figure 7-17 shows the discrepancies between the simulation
data and empirical data over a range of 20 MHz to 100 MHz for
the zero capacitance case. The discrepancy seen between the em-
pirical data and the model is likely due to residual impedance
found on the board [77].
A Pentium-II chip scheme dedicated roughly 75% of the M4 lay-
er and 12% of the M3 layer to Vcc and Vss routing [78]. The Vss re-
sistance is significantly lower than could have been achieved by
using all of M3 and M4 for power routing.
Since M4 is the only thick (low-resistance) metal layer, the
main supply current was constrained to the latest M4 routing di-
mension. Hence, the bulk of the Vcc and Vss pins are located to
the left and right of the die, where Vcc bond wires tie the pack-
age Vcc planes to the left and right edges of the die, and where a
regular array of parallel M4 Vcc lines terminate, as shown in
Figure 7-18.
The objectives of the measurement are as follows:
M4 Vcc, 20.6 m
91.2 m
M4 Vcc, 20.6 m
GCLK
M4 Vss, 10.3 m
die for a single part, and for a single slice through the die
center for many others. The mapping is done with the part
running a high-power pattern.
2. AC snapshots. AC waveforms are taken of the Vcc, Vss, and
Vcc–Vss for 33 locations around the die and cavity. Some of
these were taken using picoprobes for both transistors and
passive differentials, and comparable results were obtained.
The differential probes for this type of study are used to au-
tomatically subtract the Vcc and Vss with a low-noise result.
Since there is no need for time-consuming averaging, stor-
ing, and subtracting of waveforms, the measurement is an
order of magnitude faster than with the FET probes, which
allows the engineer more time to search the patterns for the
worst-case voltage spikes. Maximum di/dt patterns were
used for these snapshots.
age drop at the worst-case point on the die is at most 100 mV [78].
The 100 mV DC drop at 133 MHz and 2.755 V is accumulated as
follows:
In spite of the fact that the Vcc metal grid is about six times as
wide as the Vss grid, 80% of the IR drop is in the Vcc supply. Note
that in both the package and on the die, the Vcc drop is about four
times that of the Vss.
This is due to the fact that the Vcc has to go through an extra
set of vias and through bond wires, and then it must laterally tra-
verse the metal grid. The Vss current travels to the interior of the
die on a dedicated metal plane and then has a very short path ver-
tically up through the die.
Had the Vss current been distributed in the same manner as the
Vcc current, it would have increased the total supply drop by at
least 60% and it would have required essentially all of the M4 and
M3 planes to be used for the power and clock routing [78]. Thirty-
three points around the die and bond cavity were probed using a
variety of high-speed probes. All of the locations were probed us-
ing the stop-clock pattern, which halts the high power loop and
then allows it to resume [78].
Several of the more interesting locations were probed while
running patterns tailored to exercise the local power grid, but
none of these produced voltage spikes as bad as the stop-clock pat-
tern.
The AC measurements for the patterns and positions include
the following:
앫 pnl_vc (VDD): power pads for core logic and I/O interface
with the nominal voltage 1.8 V. These I/O cells provide pow-
er to the standard cell core area and interface between the
core rings and I/O pads. These pads are paired with the
ground pads pnl_gcs.
앫 pnl_gcs (GND/SGND): ground pads for core standard cell log-
ic, the ground, and substrate ground connections within the
I/O set.
앫 pnl_go (VSSO): ground pads for output drivers. Included in
these pads are ESD protection circuits. These pads must be
included in the pad ring at regular intervals to provide good
power distribution and ESD protection for I/Os.
앫 pnl_go (VSSO): power pads for the output drivers only. In-
cluded in these pads are ESD protection circuits. These pads
must be included in the pad ring at regular intervals to pro-
vide good power distribution and ESD protection for the I/Os.
These pins operate at 3.3 V nominal.
앫 pnl_vop: power pads for the output drivers, power for
predrivers and for input buffers. These pins are nominally at
3.3 V.
앫 pnl_vp (VDDP): supplies the voltage for the predrivers and
the first stage of the input receiver and is nominally 3.3 V.
(a)
(b)
cut. The core power and ground voltages (Vdd and Vcc) remain in-
tact.
In this diagram, the VDD2.5 supplies power to I/Os that have a
2.5 V reference voltage, and the VDD3.3 supplies power to the
I/Os that use a 3.3 V reference voltage. The break cell is inserted
between them to separate power and ground for the VDDO,
VDDP, and VSSO buses. In Figure 7-20(b), break cells isolate the
noise that can occur on power bus connections between high-
speed LVDS I/Os and slower TTL I/Os.
7.6 SUMMARY
GLOSSARY
192 GLOSSARY
strate, and connecting the solder bumps between the chip and
package to make the connections.
Capacitance: the charge storage capability between two conduc-
tors.
Capacitance Model: the mathematical equations used to estimate
the interconnect capacitances. They contain the variables of geo-
metrical parameters associated with neighboring metal lines.
Chip: the packaged integrated circuits that can be used as a basic
building block in complex electrical system designs.
Characterization: the process used to reveal the dependence of
electrical performance on design parameters.
Circuit Simulation: the method of using computer programs to
model transistors and interconnects, and solve the IV current
and voltage equations. The final results are presented as text
files or graphical waveforms.
Circuit Timing Analysis: the task of analyzing the hold time and
set-up time requirements in sequential logic and other timing
constraints across the chip.
Contact: the connection from metal in one layer to diffusion or
polysilicon layers.
Core: the chip without the I/O region.
Crossbar Leakage Current: the current from Vdd to ground due to
a possible short between pMOS and nMOS transistors in the
circuit. This is a kind of wasted current for circuit operations.
Decoupling Capacitors: refers to the capacitors added between
Vdd and Vss lines, used to protect the power supply voltage
from sudden switching currents. These decoupling capacitors
are required inside the chip, on the package, and in the sys-
tem board.
Deep-Submicron Process: refers to the VLSI process technology
with about 0.18 m minimum feature size or less.
Delay: the time needed from the 50% Vdd of the input signal to the
50% Vdd of the output signal through the circuits.
Delta-I Current: same as di/dt noise; the change of switching cur-
rents in a short period.
Design Guidelines: the set of guidelines provided by senior design-
ers for the IC design team to follow, to meet the performance,
area, and power requirements of chip design.
Design Methodology: the set of design guidelines, CAD tools, and
design data flows used to design an IC chip from the conceptual
idea to the final working silicon.
gloss.qxd 12/16/2003 12:45 PM Page 193
GLOSSARY 193
194 GLOSSARY
GLOSSARY 195
196 GLOSSARY
GLOSSARY 197
198 GLOSSARY
REFERENCES
200 REFERENCES
Aided Design of Integrated Circuits and Systems, Vol. 17, No. 8, Aug. 1998,
p. 645.
12. L. T. Pillage and R. A. Rohrer, “Asymptotic Waveform Evaluation for Timing
Analysis,” IEEE Trans. on Computer-Aided Design, Vol. 9, No. 4, April 1990,
p. 352.
13. R. Kielkowski, Inside SPICE, McGraw-Hill, 1994.
14. K. L. Shepard, S. M. Carey, E. K. Cho, B. W. Curran, R. F. Hatch, D. E. Hoff-
man, S. A. McCabe, G. A. Northrop, and R. Seigler, “Design Methodology for
the S/390 Parallel Enterprise Server G4 Microprocessor,” IBM Journal of
Research and Development, Vol. 41, No. 4/5, May 1997, p. 515.
15. K. L. Shepard and T Zian, “Return-Limited Inductance: A Practical Ap-
proach to On-Chip Inductance Extraction,” IEEE Transactions on Computer-
Aided Design of Integrated Circuits and Systems, Vol. 19, No. 4, April 2000,
p. 425.
16. M. Basel, “Accurate and Efficient Extraction of Interconnect Circuits for
Full-Chip Timing Analysis,” in Proceedings of Design Automation Confer-
ence, 1995, p. 118.
17. A. K. Goel, “High-Speed Interconnections ,” Wiley, 1994, Chapter 2.
18. F. Najm, “Transition Density: a New Measure of Activity in Digital Cir-
cuits,” IEEE Trans. on Computer-Aided Design, Vol. 12, No. 2, Feb. 1993, p.
310.
19. M. Xakellis and F. Najm, “Statistical Estimation of the Switching Activity in
Digital Circuits,” in Proceedings of 31st ACM/IEEE Design Automation Con-
ference, 1994, p. 728.
20. E. Grim, Technical Presentations, Intel Corporation, 1999.
21. A. Waizman, Technical Presentations, Intel Corporation, 1998.
22. D. Ayers, Private Communications, Intel Corporation, 1998.
23. T. Burton, Technical Presentations, Intel Corporation, 1998.
24. Q. Zhu, “A New Technique: Decap (Decoupling Capacitance) Sizing and In-
sertion Based on Power Noise Violation Nodes,” USA Patent # 6446016, Sep.
2002.
25. Y. L. Le Coz and R. B. Iverson, “A Stochastic Algorithm for High-Speed Ca-
pacitance Extraction in Integrated Circuits,” Solid-State Electronics, Vol. 35,
No. 7, July 1992, p. 1005.
26. P. Larsson, “Resonance and Damping in CMOS Circuits with on-Chip De-
coupling Capacitance,” IEEE Transactions on Circuits and Systems—I: Fun-
damental Theory and Applications, Vol. 45, No. 8, Aug. 1998, p. 849.
27. The National Technology Roadmap for Semiconductors, Semiconductor Re-
search Corporation, 1997.
28. Q. Zhu and W. W.-M. Dai, “High Speed Clock Network Sizing Optimization
Based on Distributed RC and RLC Interconnect Models,” IEEE Transactions
on Computer-Aided Design of Integrated Circuits and Systems, Vol. 15, No.
9, Sept. 1996, p. 1106.
29. G. K. Rao, Multilevel Interconnect Technology, McGraw-Hill, 1993.
30. TSMC Technology Roadmap, www.tsmc.com, 2002.
refs.qxd 12/16/2003 12:48 PM Page 201
REFERENCES 201
31. TSMC 0.18 m Logic 1P6M Salicide 1.5 V/3.3 V Design Rule, Taiwan Semi-
conductor Manufacturing Co., Nov. 2000.
32. P. E. Allen and D. R. Holberg, CMOS Analog Circuit Design, Oxford Univer-
sity Press, 1987.
33. Star-RCXT User Guide, Synopsys Corporation, 2002.
34. Star-SimXT User Guide, Synopsys Corporation, 2002.
35. R. Kumar, Noise Design and Analysis, Intel Corporation, 1997.
36. A. K. Goel, High-Speed VLSI Interconnections: Modeling, Analysis and Sim-
ulation, Wiley, 1994.
37. P. DeWilde and Z.-Q. Ning, Models for Large Integrated Circuits, Kluwer Ac-
ademic Publishers, 1990.
38. C. S. Walker, Capacitance, Inductance, and Crosstalk Analysis, Artech
House, 1990.
39. Virtuoso User Guide, Cadence Design Systems, Inc., 2002.
40. HSPICE User Guide, Synopsys, Inc., 2002.
41. N. D. Arora et al., “Modelling and Extraction of Interconnect Capacitances
for Multilayer VLSI Circuits,” IEEE Trans. on Computer-Aided Design of In-
tegrated Circuits and Systems, Vol. 15, No. 1, pp. 58–67, Jan. 1996.
42. Q. Zhu, “Star-RCXT Capacitance Accuracy Study,” T-RAM, Inc., Feb. 2002.
43. J. Savoj and B. Razavi, High-Speed CMOS Circuits for Optical Receivers,
Kluwer Academic Publishers, 2001.
44. W. S. Song and L. A. Glasser, “Power Distribution Techniques for VLSI
Circuits,” IEEE Journal of Solid-State Circuits, Vol. SC-21, No. 1, Feb.
1986.
45. Q. Zhu, Power Grid Design and Specifications, Chameleon Systems, Inc.,
2001.
46. D. Ayers, Microprocessor Power Network Design, Intel Corporation, 1998.
47. Y. Jiang, P6C AC Analysis, Intel Corporation, 1995.
48. Y. Jiang, P6C Decoupling Capacitor Methodology, Intel Corporation, 1995.
49. H. H. Chen and D. D. Ling, “Power Supply Analysis Methodology for Deep-
Submicron VLSI Chip Design,”in Proceedings of 34th Design Automation
Conference, 1997, p. 638.
50. B. J. Rubin, “An Electromagnetic Approach for Modeling High-Performance
Computer Package,” IBM Journal of Research and Development, Vol. 34, pp.
585–600, July 1990.
51. VoltageStorm Transistor-Level PGS User Guide, Cadence Design Systems,
Inc., 2002.
52. A. Chandrakasan, W. J. Bowhill, and F. Fox, Design of High-Performance
Microprocessor Circuits, IEEE Press, 2001, Chapter 24.
53. Q. Zhu and J. Pabustan, Post-Layout Static IR Analysis Flow Based on Sim-
plex Tool, Chameleon Systems, Inc., 2001.
54. V. L. Bars, IR Drop Evaluation in a Power/Ground Mesh, Project Report on
UC Santa Cruz Extension, 1997.
55. S. Chowdhury and J.S. Barkatullah, “Estimation of Maximum Currents in
refs.qxd 12/16/2003 12:48 PM Page 202
202 REFERENCES
REFERENCES 203
INDEX
AC Analysis, 191, 48, 53, 55 Design Guidelines, 192, 28, 33, 73,
Activity-based Analysis, 191, 106, 162
107. 119. 120, 121, Design Methodology, 192, 75, 84
Design Rule, 193, 7, 10, 84
Back-annotation, 191, 196, 116 Device, 1, 4, 9, 20, 34, 37, 53, 55–56,
Block-Level Power Distribution, 191 58, 60, 69, 71, 80, 82, 84–85, 88,
Break Cell, 191, 188, 189, 106–107, 116–117, 138–139,
By-pass Capacitor, 191 141–142, 153
Device-Level Extraction, 193
C4 Package, 191, 2, 4, 20, 34, 44, 45, Di/Dt, 1, 16, 20, 29, 30–31, 33–35,
63 67–70, 150, 155–156, 160, 185,
Capacitance, 191, 4, 6, 8, 9, 10, 11, 12 190
Capacitance Model, 191, 10, 1, 2, 8, Die, 193, 1, 12, 28, 29, 30, 44, 45, 53,
Chip, 191, 1, 2, 4, 6, 7, 8, 12 67, 68
Characterization, 191, 24, 25, 32, 48 Die Size, 193, 28, 29, 30, 44, 70, 72,
Circuit Simulation, 191, 37, 51, 104, 139
107, 162 Die Micrograph, 193, 139, 140, 148,
Circuit Timing Analysis, 191 149
Contact, 192, 10, 11, 53, 66, 76, 78, DRC, 193, 46.
79, 84 Dynamic Analysis, 193, 91, 94, 95, 99
Core, 192, 117, 136, 140, 150, 158,
172, 188 ECO (Engineering Change Order),
Crossbar Leakage Current, 192 193, 104, 133
Electromigration, 193, 3, 56, 75, 76,
Decoupling Capacitor, 192, 1, 2, 4, 6, 80, 81, 82, 83, 84, 85
7, 16, 19, 23, 44, 46, 52, 53, 54, ESR, 193, 72
55, 57, 59, 61 Extracted Parasitic, 193
Deep-Submicron Process, 192, 4, 44 Extraction, 196, 7, 10, 66, 99, 173
Delay, 192, 20, 27, 34, 58, 100, 103,
126, 160. Flip-Chip, 193, 43, 61, 65, 104, 139,
Delta-I Current, 192, 147 148, 153, 156, 157, 158, 159
206 INDEX
Floorplanning, 193, 33, 34, 35, 61 On-chip Inductance, 195, 2, 7, 20, 24,
Full Chip Power Distribution, 193, 4 180
Gate Capacitance, 193, 20, 29, 30, Package, 2, 4, 16, 20, 33–34, 36, 38,
166. 41, 44–45, 48, 50, 53, 55–56, 60,
Gate Delay, 193, 100, 103, 127 63, 65, 67–68, 72–73, 107, 136,
GDSII, 193, 99 139, 148, 151, 155–158
Global Power Network, 194, 33 Parasitic,195, 7, 8, 10, 20, 98, 175,
Ground Bounce, 194, 89, 160, 164, 177, 151, 161, 162, 178, 181,
165, 167 183.
Parasitic Capacitance, 195, 99, 107,
Hot Spot, 194, 12, 16, 57, 61, 71, 172, 174, 175
186 Peak Current, 195, 58, 59, 65, 77, 81,
101, 108, 110, 116, 117, 118,
Impedance, 194, 23, 25, 27, 57 125, 127, 129
Interconnect RC, 194, 7, 66 Physical Design, 195
IR Drop, 194, 27, 29, 30–36, 38, Piecewise Linear, 195, 58, 96, 107
62–66, 87–104 Place & Route, 195
I/O Library, 194, 188. Power Bus, 195, 42, 55, 56, 57, 60, 61,
89, 104, 113, 181, 189
L*di/dt Noise, 194, 1, 20, 29–35, 67 Power Distribution, 195, 1, 2, 4–5, 20,
Leakage Current, 194, 13, 141, 28–30, 32–35, 42, 44–45, 55–56,
146–147 69, 73, 76, 84, 97, 104
Linear Network, 194, 1, 109. Power Grid, 195, 1, 4, 6–7, 12, 14, 20,
Loop Inductance, 194, 23, 29, 68, 69, 32–33, 35–38, 41–43, 45, 47–50,
136, 150 62–63.
Low-pass Filter, 194, 1, 44 Power Grid Analysis, 195, 1, 33, 89,
LVS, 194, 54 91–108, 116, 121–129
Power Network, 196, 1–7, 12, 16–20,
Mean Time to Fail (MTF), 194, 54, 76, 28–32, 33–34, 44, 48, 50–55, 58,
85 65, 68, 73, 83, 89, 104, 116,
Metal Capacitance, 194, 29, 174 118, 129, 133, 135, 140, 157,
Metal Ion, 194, 75 162, 167–168, 172, 175, 180
Metal Structure, 195, 24, 25 Power Strap, 196, 38, 65, 118, 119
Metal Layer, 4, 9–10, 33–34, 73, 76, Power Supply Voltage, 196, 1, 53,
84, 100, 109, 113, 115, 135, 56–57, 59–61, 70, 88, 116, 133,
140–141, 147, 153 136, 156
Metal Utilization, 195, 167, 172, Power Switching Noise, 196, 88
177 Pre-layout Netlist, 196
Modeling, 195, 1–8, 12–16, 27, 32, 35,
37, 38, 48, 50–51, 57, 59, 63, RC Back-annotation, 196
72–73, 92, 161. RC Extraction, 196, 7, 10, 66, 99, 173
RC Data, 196, 8,
Netlist, 8, 90, 92–93, 95–97, 99–134 RC Netlist, 196, 8
Noise, 195, 1–2, 12–13, 16. 20, 30–33, Reflection Noise, 196, 161
44, 60–61, 67–69, 71, 72, 88, Regulator, 196, 72, 150–153, 156, 163
136, 141, 142, 146, 147, 150, Resonance, 196, 150–151, 163–165,
155, 156, 157, 159, 161, 175 181, 183
Noisy Nodes, 195, 17, 18, 19, 32 RLC Segment, 196, 4
index.qxd 12/16/2003 10:08 AM Page 207
INDEX 207
Routing, 33, 36, 38, 39, 42, 46, 48, 57, Tap Current, 197, 85, 90–91, 94–104,
70, 88, 113, 130–131, 136, 153, 106, 109–133
155, 167–172, 177, 181, 183, Technology Parameters, 197, 135, 147
185 Top Metal Layer, 197, 4, 135
RMS Current, 196, 77, 81, 84, 99, Transistor, 197, 1–4, 13, 19, 28–30,
128 45, 67, 73, 83–85, 88–104,
106–133, 135–156
Scaling, 196, 1–3, 28–32, 62, 76, 88, Transistor-Level Simulation, 197, 63,
103, 107, 110, 118, 119, 173 83, 98
Simulation, 197, 3, 6, 12, 19, 20, 38, Transmission Line, 197, 161
40, 42, 45–46, 51–52, 58, 60, 63,
65, 80, 83, 92, 98–99, 104, Unit-length Capacitance, 198, 10, 28,
106–107, 116, 118–119, 30
124–125, 127, 162–163, Unit-length Inductance, 198, 4, 29
165–167, 172, 175, 177, 181, Unit-length Resistance, 198, 4, 28
183–184
Simultaneous Switching Noise (SSN), Via, 198, 10–11, 32, 34, 55, 72, 76,
195, 143, 157, 159, 161, 162, 78–80, 84–85, 89, 129–130, 186
164, 190 Via Resistance, 198, 66
Standard Parasitic Format (SPF), Vector-Based Analysis, 198
197, 8, 9. Voltage Distribution, 198, 1
Standby Mode, 197, 141–143, 146 Voltage Fluctuation, 198, 12, 16
Static Analysis, 197, 92–93, 96, 100, Voltage Threshold, 198, 17
105, 107–108, 110, 116–117, Voltage Regulation, 198, 151
119, 124, 125, 127–128
Switching Current, 197, 1, 4, 12, 15, Weak Spot, 198, 89–90, 92–93,
45, 49, 55, 66–67, 73, 133, 155, 104–106, 126, 133
160, 175–176 Wire Bonding, 198, 36, 38, 43, 61, 63,
Switching Factor, 197, 28, 31–32, 76, 65, 118, 159, 168–169, 171–172,
93, 98, 103, 110, 118–119, 173 177