0% found this document useful (0 votes)

70 views

L15: Custom and ASIC VLSI Integration

- Curt Schurgers Introductory Digital Systems Laboratory 3 Custom Design / Layout Itanium has 6 integer execution units like this 9-1 Mux 5-1 Mux a CARRYGEN g64 node1 ck1 REG sum sumb to Cache SUMGEN + LU Hand crafting the layout to achieve maximum clock rates (> 1Ghz) Exploits regularity in datapath structure to optimize interconnects.

Uploaded by

pinoytsikboy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views

L15: Custom and ASIC VLSI Integration

Uploaded by

pinoytsikboy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

L15: Custom and ASIC VLSI Integration

Acknowledgements:
Materials in this lecture are courtesy of the following people and used with permission.
- Rabaey, J., A. Chandrakasan, B. Nikolic. Digital Integrated Circuits: A Design Perspective.
Prentice Hall, 2003.
- Curt Schurgers

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 1

Layout 101

Cross-Section VDD p-type substrate

n-type well

metal/pdiff
contact
Wp

IN OUT

VDD Wn

contact
Ln frommetal
S to ndiff
G Circuit Representation GND

D metal poly n+ p+
diff diff
IN OUT
D Layout
Follow simple design rules (contract
G
between process and circuit designers)
S
(Courtesy of Chris Terman. Used with permission.)
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 3
Custom Design/Layout
Itanium has 6 integer execution units like this
a
9-1 Mux

5-1 Mux
g64
CARRYGEN

node1

SUMSEL
sum sumb

REG
ck1
to Cache
9-1 Mux

2-1 Mux

SUMGEN s0
+ LU s1
b

LU : Logical
Unit
1000um

From register files / Cache / Bypass

Multiplexers

Shifter

Adder stage 1
Wiring
Die photograph of the
Loopback Bus
Loopback Bus

Loopback Bus

Adder stage 2

Wiring
Itanium integer datapath
Bit slice 63

Bit slice 2
Bit slice 1
Bit slice 0

Adder stage 3

Sum Select Bit-slice Design Methodology

To register files / Cache

Hand crafting the layout to achieve maximum clock rates (> 1Ghz)
Exploits regularity in datapath structure to optimize interconnects
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 4
The ASIC Approach
Design Capture Behavioral

Verilog
Verilog(or
(orVHDL
VHDL))
Pre-Layout
Pre-Layout
Simulation Structural
Simulation
Design Iteration

Logic
LogicSynthesis
Synthesis

Floorplanning
Floorplanning
Post-Layout
Post-Layout
Simulation
Simulation Placement
Placement Physical

Circuit
Circuit Routing
Routing
Extraction
Extraction

Tape-out
Most Common Design Approach for Designs up to 500Mhz
Clock Rates
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 5
Standard Cell Example

Power Supply Line (VDD) Delay in (ns)!!

3-input NAND cell

(from ST Microelectronics):
C = Load capacitance
T = input rise/fall time
Ground Supply Line (GND)

Each library cell (FF, NAND, NOR, INV, etc.) and the variations on size
(strength of the gate) is fully characterized across temperature, loading, etc.
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 6
Standard Cell Layout Methodology

2-level metal technology Current Day Technology

Cell-structure hidden under interconnect layers

With limited interconnect layers, dedicated routing channels

between rows of standard cells are needed
Width of the cell allowed to vary to accommodate complexity
Interconnect plays a significant role in speed of a digital circuit
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 7
Verilog to ASIC Layout
(the push button approach)

After
Synthesis
module adder64 (a, b, sum);
input [63:0] a, b;
output [63:0] sum;

assign sum = a + b;
endmodule

After Routing

After
Placement

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 8

The “Design Closure” Problem

VDD BUS

CL
d1 l1
CI
λ = =5
d2 l2
CI CL
CL
Wire-to-wire capacitance causes
inter-wire delay dependencies

Iterative Removal of Timing Violations (white lines)

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 9
Macro Modules

256×32 (or 8192 bit) SRAM Generated by hard-macro module generator

Generate highly regular structures (entire memories,

multipliers, etc.) with a few lines of code
Verilog models for memories automatically generated
based on size

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 10

Clock Distribution

D Q

(Image removed due to copyright considerations.)

D Q

For 1Ghz clock, skew budget is 100ps. IBM Clock Routing

Variations along different paths arise
from:
• Device: VT, W/L, etc.
• Environment: VDD, °C
• Interconnect: dielectric thickness
variation
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 11
The Power Supply Wires are Not Ideal!

To VDD Grid

Ccoup
To VDD Grid
Receiver

Cint Rd

Driver

GROUND GRID

Pad Pad

The IR-drop problem causes internal power supply voltage

to be less than the external source

(Courtesy of Prof. David Blaauw. Used with permission.)

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 12
Analog Circuits: Clock Frequency
Multiplication (Phase Locked Loop)

down

VCO produces high frequency square wave

Divider divides down VCO frequency
PFD compares phase of ref and div
Loop filter extracts phase error information
Used widely in digital systems for clock synthesis
(a standard IP block in most ASIC flows)
(Courtesy of Michael Perrott. Used with permission.)
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 13
Behavioral Transformations

There are a large number of implementations of the same

functionality
These implementations present a different point in the
area-time-power design space
Behavioral transformations allow exploring the design
space a high-level

Optimization metrics: power

1. Area of the design
2. Throughput or sample time TS
3. Latency: clock cycles between
the input and associated output
change area
4. Power consumption
5. Energy of executing a task time
6. …

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 14

Fixed-Coefficient Multiplication

Conventional Multiplication X3 X2 X1 X0
Y3 Y2 Y1 Y0
Z=X·Y
X 3 · Y0 X 2 · Y0 X 1 · Y0 X 0 · Y0
X 3 · Y1 X 2 · Y1 X 1 · Y1 X 0 · Y1
X 3 · Y2 X 2 · Y2 X 1 · Y2 X 0 · Y2
X 3 · Y3 X 2 · Y3 X 1 · Y3 X 0 · Y3
Z7 Z6 Z5 Z4 Z3 Z2 Z1 Z0

Constant multiplication (become hardwired shifts and adds)

X3 X2 X1 X0
Z = X · (1001)2 1 0 0 1
X3 X2 X1 X0
X3 X2 X1 X0
Z7 Z6 Z5 Z4 Z3 Z2 Z1 Z0

X Z
Y = (1001)2 = 23 + 20
<< 3
shifts using wiring
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 15
Transform: Canonical Signed Digits (CSD)

Canonical signed digit representation is used to increase the number of

zeros. It uses digits {-1, 0, 1} instead of only {0, 1}.

Iterative encoding: replace 0 1 1 … 1 1 1 0 0 … 0 -1

string of consecutive 1’s
2N-2 + … + 21 + 20 2N-1 - 20

Worst case CSD has 50% non zero bits

01101111 0 1 1 0 1 1 1 1 0 1 1 1 0 0 0 -1
=

10010001 1 0 0 -1 0 0 0 -1

X << 7 Z
<< 4
Shift translates to re-wiring
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 16
Algebraic Transformations

Commutativity Distributivity
A C B
A B A B
B A
C
⇔
⇔

A+B=B+A (A + B) C = AB + BC

Associativity Common sub-expressions

A B B C X Y
X Y X
C A
⇔
⇔

A B
A B
(A + B) + C = A + (B+C)
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 17
Transforms for Efficient Resource Utilization

A B C D E FG H I
Time multiplexing: mapped to
3 multipliers and 3 adders
1

distributivity
A C B D E FG H I

Reduce number of operators

1
to 2 multipliers and 2 adders

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 18

A Very Useful Transform: Retiming

Retiming is the action of moving delay around in the systems

Delays have to be moved from ALL inputs to ALL outputs or vice versa

D
D
D
D
D

Cutset retiming: A cutset intersects the edges, such that this would result in two disjoint
partitions of these edges being cut. To retime, delays are moved from the ingoing to the
outgoing edges or vice versa.

D
Benefits of retiming:
• Modify critical path delay
• Reduce total number of registers

(Courtesy of Prof. Charles E. Leiserson. Used with permission.)

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 19
Retiming Example: FIR Filter

x(n) D D D Symbol for multiplication

Direct form
h(0) h(1) h(2) h(3) K
y (n) = h(n) ⊗ x(n) = ∑ x(n − i ) ⋅ h(i )
y(n) i =0

associativity of
x(n) the addition
D D D

(10) h(0) h(1) h(2) h(3) Tclk = 22 ns

y(n)

(4) retime
x(n)

h(0) h(1) h(2) h(3)

Transposed form Tclk = 14 ns
y(n) D D D

Note: here we use a first cut analysis that assumes the delay of a chain of operators is the sum
of their individual delays. This is not accurate.
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 20
Pipelining, Just Another Transformation
(Pipelining = Adding Delays + Retiming)

Contrary to retiming,
pipelining adds extra registers
to the system

add input
registers
D D How to pipeline:
1. Add extra registers at
D D all inputs
2. Retime

retime

D D

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 21

The Power of Transforms: Lookahead
y(n) = x(n) + A y(n-1) x(n) y(n)
y(n) loop
x(n)
unrolling D A 2D
A D A

y(n) = x(n) + A[x(n-1) + A y(n-2)]

Try pipelining
this structure distributivity
x(n) y(n)

D 2D
How about pipelining A A A
this structure! associativity

x(n) y(n)
x(n) y(n)
retiming
A D D D D 2D
A A2
A2
precomputed
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 22
Scan Testing
... Idea: have a mode in which all registers are chained
into one giant shift register which can be loaded/
0 read-out bit serially. Test remaining (combinational)
1 logic by
ScanShift (1) in “test” mode, shift in new values for all
shift out register bits thus setting up the inputs to the
combinational logic
0 (2) clock the circuit once in “normal” mode, latching
1 the outputs of the combinational logic back into
CLK the registers
ScanShift
(3) in “test” mode, shift out the values of all
shift in register bits and compare against expected
ScanShift shift in results.

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 23

Trends: “Chip in a Day”
(Matlab/Simulink to Silicon…)

S reg X reg
Add, Mult2
Sub,
Shift
Mac1 Mac2
Mult1

Map algorithms directly to silicon - bypass writing Verilog!

(Courtesy of R. Brodersen. Used with permission.)
L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 24
Trends: Watermarking of Digital Designs

Fingerprinting is a technique to deter people from illegally

redistributing legally obtained IP by enabling the author of the IP to
uniquely identify the original buyer of the resold copy.
The essence of the watermarking approach is to encode the author's
signature. The selection, encoding, and embedding of the signature
must result in minimal performance and storage overhead.

(Images removed due to copyright considerations.)

L15: 6.111 Spring 2004 Introductory Digital Systems Laboratory 25

Prep For TI - Digital (With Sample Questions)
No ratings yet
Prep For TI - Digital (With Sample Questions)
7 pages
L16: Power Dissipation in Digital Systems
No ratings yet
L16: Power Dissipation in Digital Systems
23 pages
Avlsi Mel ZG623
No ratings yet
Avlsi Mel ZG623
5 pages
L10: Analog Building Blocks (Opamps, A/D, D/A)
No ratings yet
L10: Analog Building Blocks (Opamps, A/D, D/A)
29 pages
VLSI DESIGN TOOLS LAB MANUAL 1.1 (1)
No ratings yet
VLSI DESIGN TOOLS LAB MANUAL 1.1 (1)
29 pages
ME Vlsi Design
No ratings yet
ME Vlsi Design
28 pages
22Scheme_VLSI Lab Manual
No ratings yet
22Scheme_VLSI Lab Manual
101 pages
VLSI Lab Manual
No ratings yet
VLSI Lab Manual
83 pages
Me Vlsi - Syllabi
No ratings yet
Me Vlsi - Syllabi
14 pages
Vlsi Lecture
No ratings yet
Vlsi Lecture
5 pages
6 DHX
No ratings yet
6 DHX
4 pages
VLSI Expert
No ratings yet
VLSI Expert
15 pages
Mtech Vlsi - Second Sem
No ratings yet
Mtech Vlsi - Second Sem
16 pages
688CC
No ratings yet
688CC
11 pages
Chapter 1 - Overview On Digital IC Design
100% (1)
Chapter 1 - Overview On Digital IC Design
45 pages
Bits Pilani
No ratings yet
Bits Pilani
3 pages
Nandha College of Technology: Academic Year 2022-23 (Even Semester)
No ratings yet
Nandha College of Technology: Academic Year 2022-23 (Even Semester)
9 pages
VLSI
No ratings yet
VLSI
24 pages
DSD Dica Lesson Plan-C
No ratings yet
DSD Dica Lesson Plan-C
3 pages
Cell B
No ratings yet
Cell B
73 pages
AICTE
No ratings yet
AICTE
17 pages
DSD Lec 1
No ratings yet
DSD Lec 1
32 pages
ECNG 3016 Advanced Digital Electronics: Eneral Nformation
No ratings yet
ECNG 3016 Advanced Digital Electronics: Eneral Nformation
11 pages
Eee 4232
No ratings yet
Eee 4232
135 pages
EL-408 VLSI System Design (Revised-2022)
No ratings yet
EL-408 VLSI System Design (Revised-2022)
58 pages
Floating Point Multipliers: Simulation & Synthesis Using VHDL
No ratings yet
Floating Point Multipliers: Simulation & Synthesis Using VHDL
40 pages
lesson plan 18jr
No ratings yet
lesson plan 18jr
3 pages
Vlm Zg515 Course Handout
No ratings yet
Vlm Zg515 Course Handout
7 pages
VLSI Lab Manual
No ratings yet
VLSI Lab Manual
77 pages
Academic Course Description
No ratings yet
Academic Course Description
5 pages
VLSI Anna University New Syllabus 2013
No ratings yet
VLSI Anna University New Syllabus 2013
33 pages
M.tech Vlsi Syllabus: D.A.John & K.Martin, Analog Integrated Circuit Design, Wiley, 1997
No ratings yet
M.tech Vlsi Syllabus: D.A.John & K.Martin, Analog Integrated Circuit Design, Wiley, 1997
5 pages
Birla Institute of Technology and Science, Pilani: Pilani Campus AUGS/ AGSR Division
No ratings yet
Birla Institute of Technology and Science, Pilani: Pilani Campus AUGS/ AGSR Division
3 pages
Arduino Signal Processing
No ratings yet
Arduino Signal Processing
7 pages
l1
No ratings yet
l1
28 pages
ECE syllabus_5th Semeseter
No ratings yet
ECE syllabus_5th Semeseter
16 pages
(Exam) (Exam) (Assignment) : Semester 1 Year 3
No ratings yet
(Exam) (Exam) (Assignment) : Semester 1 Year 3
7 pages
VLSI Lab Manual
No ratings yet
VLSI Lab Manual
41 pages
Vlsi Design PDF
No ratings yet
Vlsi Design PDF
22 pages
Syllabus
No ratings yet
Syllabus
183 pages
Pipelining Verilog
No ratings yet
Pipelining Verilog
26 pages
VLSI
No ratings yet
VLSI
22 pages
Recommended Back Loaded Horn Type Enclosure
No ratings yet
Recommended Back Loaded Horn Type Enclosure
1 page
Satellite Subsystems: Telemetry, Tracking and Command System
No ratings yet
Satellite Subsystems: Telemetry, Tracking and Command System
2 pages
Honeywell MS4105, MS7505, MS8105
No ratings yet
Honeywell MS4105, MS7505, MS8105
2 pages
IES 1988 Question Paperpdf
No ratings yet
IES 1988 Question Paperpdf
8 pages
Diagrama Alternador Mazda 3 2006
50% (2)
Diagrama Alternador Mazda 3 2006
2 pages
172DIP PCB Mount Miniature Reed Relay/: SPDT and DPDT 0.25 Amp Rated
No ratings yet
172DIP PCB Mount Miniature Reed Relay/: SPDT and DPDT 0.25 Amp Rated
2 pages
Navid Lashkarian, Signal Processing Division, Xilinx Inc., San Jose, USA, Chris Dick, Signal Processing Division, Xilinx Inc., San Jose, USA
No ratings yet
Navid Lashkarian, Signal Processing Division, Xilinx Inc., San Jose, USA, Chris Dick, Signal Processing Division, Xilinx Inc., San Jose, USA
6 pages
Vumatel Installation Guide
No ratings yet
Vumatel Installation Guide
6 pages
Analog Electronics
No ratings yet
Analog Electronics
47 pages
Ds 5510 5516 0610
No ratings yet
Ds 5510 5516 0610
2 pages
Microphone University: by Mikkel Nymand
No ratings yet
Microphone University: by Mikkel Nymand
4 pages
Design and Construction of A Solar Powered Streetlight System
100% (1)
Design and Construction of A Solar Powered Streetlight System
7 pages
Bscthesis 09
No ratings yet
Bscthesis 09
41 pages
Week-2 Analog Electronics Notes and Experiments
No ratings yet
Week-2 Analog Electronics Notes and Experiments
19 pages
LM 380 N
No ratings yet
LM 380 N
10 pages
Vouchers
No ratings yet
Vouchers
17 pages
Reference Signal and Use
No ratings yet
Reference Signal and Use
28 pages
UVS 610 UHF Valve Sensor: Accessory To MPD 600
No ratings yet
UVS 610 UHF Valve Sensor: Accessory To MPD 600
2 pages
Installation Guide Fluence Keith Merrow Custom Set: Downloaded From Manuals Search Engine
No ratings yet
Installation Guide Fluence Keith Merrow Custom Set: Downloaded From Manuals Search Engine
12 pages
Concepts of The BTS Multiplexing Mode
No ratings yet
Concepts of The BTS Multiplexing Mode
2 pages
Resume MOHD ZAKRI ABDULLAH TAHIR PDF
No ratings yet
Resume MOHD ZAKRI ABDULLAH TAHIR PDF
6 pages
BPC Final Exam Instrumentation 2020
No ratings yet
BPC Final Exam Instrumentation 2020
5 pages
FPGA DS 02056 4 1 MachXO2 Family Data Sheet
No ratings yet
FPGA DS 02056 4 1 MachXO2 Family Data Sheet
119 pages
TM4C123G LaunchPad Workshop
No ratings yet
TM4C123G LaunchPad Workshop
336 pages
How To Make Series and Parallel Connections of An SCR
No ratings yet
How To Make Series and Parallel Connections of An SCR
4 pages
Unistar 15 A2
No ratings yet
Unistar 15 A2
1 page
SET-1 (Analog Communications) - ECE Max. Marks: 10 M Time: 60 Min Date: Answer Any TWO Questions 2x 5 Marks 10 Marks
No ratings yet
SET-1 (Analog Communications) - ECE Max. Marks: 10 M Time: 60 Min Date: Answer Any TWO Questions 2x 5 Marks 10 Marks
2 pages
CIT423 2023_1
No ratings yet
CIT423 2023_1
2 pages
Owner's Manual: For Latest Instructions Please Go To
No ratings yet
Owner's Manual: For Latest Instructions Please Go To
8 pages
Class - Analog CMOS Design & Tech - Part-12
No ratings yet
Class - Analog CMOS Design & Tech - Part-12
15 pages