Chapter 8
Chapter 8
Circuits
A Design Perspective
Jan M. Rabaey
Anantha Chandrakasan
Borivoje Nikolic
Design
Methodologies
December 10, 2002
Productivity (Trans./Staff-Month)
Logic Transistors per Chip (K)
1,000 10,000
X
100 X X
1,000
Xx X
X
2.5m 10 21%/Yr. compound 100
Productivity growth rate
1 10
1981
1983
1985
1987
1989
1991
1993
1995
1997
1999
2001
2003
2005
2007
2009
A growing gap between design complexity and design productivity
Source: sematech97
MEMORY
INPUT/OUTPUT
CONTROL
DATAPATH
Courtesy: Philips
Domain-specific processor
100-1000
Energy Efficiency (in MOPS/mW)
Embedded microprocessor
10-100
(e.g. DSP)
Configurable/Parameterizable
Hardwired custom
1-10
0.1-1
Custom Semicustom
Cell-based Array-based
Routing
channel
Rows of cells
Routing channel
Functional requirements are
module reduced by presence
(RAM, of more interconnect
multiplier, …) layers
[Brodersen92]
Cell-structure
hidden under
interconnect layers
x0 x1
x2
AND OR
plane plane
f0 f1
x0 x1 x2
minterm
x0 x0 x1 x1 x2 x2 f0 f1
Pull-up devices Pull-up devices
BUFFER
B U FFER PR E-C H AR G E
CHARGE
BUFFER
PRE-
B U FFER PR E-C H AR G E
PRE-CHARGE
BUFFER
B U FFER PR E-C H AR G E
BUFFER
PRE-
B U FFER PR E-C H AR G E
• Output buffers and the input buffers
of the next stage are shared.
Area:
RPLAs (2 layers) 1.23
SCs (3 layers) - 1.00, 1
NPLAs (4 layers) 1.31
Delay
RPLAs 1.04
SCs 1.00
0.6
NPLAs 1.09
Synthesis time: for RPLA , synthesis time equals design time;
SCs and NPLAs still need P&R.
Also: RPLAs are regular and predictable
0.2
0 2 4 6 area
HDL
HDL
Pre-Layout
Pre-Layout Structural
Design Iteration
Simulation
Simulation Logic
LogicSynthesis
Synthesis
Floorplanning
Floorplanning
Post-Layout
Post-Layout
Simulation
Simulation Placement
Placement Physical
Circuit
CircuitExtraction
Extraction Routing
Routing
Tape-out
Physical
PhysicalSynthesis
Synthesis
Place-and-Route
Place-and-Route
Optimization
Optimization
Artwork
© Digital Integrated Circuits2nd Design Methodologies
Late-Binding Implementation
Array-based
Pre-diffused Pre-wired
(Gate Arrays) (FPGA's)
polysilicon
VD D
metal
rows of Uncommited
uncommitted possible
cells GND contact Cell
routing
channel Committed
Cell
(4-input NOR)
Out
PMOS
PMOS
NMOS
NMOS
NMOS
CLR
Q
CLK
Q
GND
Memory
Subsystem
Via-programmable cross-point
metal-5 metal-6
programmable via
n+ antifuse diffusion
2 l
: programmed node
NA NA f 1 f 0
product
1 terms
j -wide OR array
D Q
j OUT
j
macrocell
CLK
A B C i i inputs
B 1
SA Y
1
C
D 1
SB
S0
S1
In Out
Memory
Out 00 00
01 1
10 1
11 0
ln1 ln2
D4 Bits xxxx
Logic control
D3 xx xx
function xx
xx x xx x
D2 of xx
xxx
D1
Logic xx xx
functionx x
x
of x
xxx x
F4
Bits xxxx
F3 Logic xx control xx
function xx
xx x xx x
F2 of xx
xxx
F1
xx xx
x
xxxxx x
H x
P
Multiplexer Controlled
by Configuration Program
Xilinx 4000 Series
Cell
Horizontal
tracks
Vertical tracks
Connect Box
Interconnect
Point
t PIA
LAB1 LAB2
LAB
PIA
t PIA
LAB6
Array-based Mesh-based
(MAX 3000-7000) (MAX 9000)
Standard-cell like
floorplan
I/O B u ffe rs
I/O B u ffe rs
R ows o f logic m odule s
R outing channels
I/O Buffers
12 Quad
8 Single
4 Double
3 Long
Direct
CLB 2 Connect
3 Long
12 4 4 8 4 8 4 2
Xilinx XC4000ex
600k transistors
208-pin PGA
fclock = 50 MHz
P
av = 3.6 W @ 5V
Analog
Multi-
Spectral + 1 Gbit DRAM cost, performance, and
RAM
Imager energy are the real issues!
Preprocessing
DSP and control intensive
64 SIMD Processor C Mixed-mode
Array + SRAM system Combines programmable and
+2 Gbit application-specific modules
Image Conditioning DRAM
100 GOPS Recog-
Software plays crucial role
nition
Reconfigurable
• 1.2 Million transistors
Data-path • 40 MHz at 1V
• 2 extra supplies: 0.4V,
Interface
Embedded memories
Embedded PowerPc
Hardwired multipliers
High-speed I/O