Lecture 9
Lecture 9
Methodology
Productivity (Trans./Staff-Month)
Logic Transistors per Chip (K)
1,000,000 10,000,000
1,000 10,000
100 1,000
1997
1981
1985
1993
2001
2005
2009
1989
Courtesy: Philips
Custom Semicustom
Cell-based Array-based
Domain-specific processor
100-1000
Energy Efficiency (in MOPS/mW)
Embedded microprocessor
10-100
(e.g. DSP)
Configurable/Parameterizable
Hardwired custom
1-10
0.1-1
x0 x1
x2
AND OR
plane plane
f0 f1
x0 x1 x2
minterm
x0 x0 x1 x1 x2 x2 f0 f1
Pull-up devices Pull-up devices
BUFFER
B U FFER PR E-C H AR G E
CHARGE
BUFFER
PRE-
B U FFER PR E-C H AR G E
PRE-CHARGE
BUFFER
B U FFER PR E-C H AR G E
BUFFER
PRE-
B U FFER PR E-C H AR G E
• Output buffers and the input buffers
of the next stage are shared.
Area:
RPLAs (2 layers) 1.23
SCs (3 layers) - 1.00, 1
NPLAs (4 layers) 1.31
Delay
RPLAs 1.04
SCs 1.00
0.6
NPLAs 1.09
Synthesis time: for RPLA , synthesis time equals design time;
SCs and NPLAs still need P&R.
Also: RPLAs are regular and predictable
0.2
0 2 4 6 area
Routing
channel
Rows of
cells
Functional Routing channel
module requirements are
reduced by presence
(RAM, of more interconnect
multiplier, layers
)
[Brodersen92]
Cell-structure
hidden under
interconnect layers
Synopsys
© Digital Integrated Circuits2nd and F. Brewer 2003 DesignCompiler Design Methodologies
The “Design Closure” Problem
Courtesy
© Digital Integrated Circuits2nd and F. Brewer 2003 Synopsys Design Methodologies
Gate Array — Sea-of-gates
polysilicon
VD D
metal
rows of Uncommited
uncommitted possible
cells GND contact Cell
routing
channel Committed
Cell
(4-input NOR)
Out
PMOS
PMOS
NMOS
NMOS
NMOS
Random Logic
Memory
Subsystem
Via-programmable cross-point
metal-5 metal-6
programmable via
[Pileggi02]
© Digital Integrated Circuits2nd and F. Brewer 2003 Design Methodologies
Pre-wired Arrays:
Classification of prewired arrays (or field-programmable devices):
Based on Programming Technique
Fuse-based (program-once)
Non-volatile EPROM based
RAM based
Programmable Logic Style
Array-Based
Look-up Table
Programmable Interconnect Style
Channel-routing
Mesh networks
n+ antifuse diffusion
2
: programmed node
NA NA f 1 f 0
B 1
SA Y
1
C
D 1
SB
S0
S1
H1 H2 H0 EC
D4 S/R Bypass
Logic Din
control
D3 YQ
function F’
G’
D SD Q
D2 F H’
D1
Logic EC RD
function G’
H’
H 1 Y
F4
S/R Bypass
F3 Logic control XQ
Din
function F’ D SD Q
F2 G’
G H’
F1
EC RD
clock
H’ 1 X
F’
Multiplexer Controlled
by Configuration Program
Xilinx 4000 Series
Cell
Horizontal
tracks
Vertical tracks
Connect Box
Interconnect
Point
t PIA
LAB1 LAB2
LAB
PIA
t PIA
LAB6
Array-based Mesh-based
(MAX 3000-7000) (MAX 9000)
Standard-cell like
floorplan
I/O Buffers
I/O Buffers
Rows o f logic m odule s
Routing c hannels
I/O Buffers
12 Quad
8 Single
4 Double
3 Long
Direct
CLB 2 Connect
3 Long
12 4 4 8 4 8 4 2
Xilinx XC4000ex
Analog
Multi-
Spectral
RAM + 1 Gbit DRAM cost, performance, and
Imager Preprocessing energy are the issues!
DSP and control intensive
64 SIMD Processor C
Array + SRAM system Mixed-mode
+2 Gbit Software design is crucial
Image Conditioning DRAM
100 GOPS Recog-
nition
Embedded memories
Embedded PowerPc
Hardwired multipliers
High-speed I/O
Courtesy: Xilinx
© Digital Integrated Circuits2nd and F. Brewer 2003 Design Methodologies
Summary
Design Choice forced by System Tradeoffs
Deep Sub-micron Challenges
Regularity (Design flexibility at smallest scales)
Power consumption!
Interconnection Parasitics
Nanoscopic Devices/Modules
New circuit solutions are bound to emerge
Who can afford design in the years to come?