0% found this document useful (0 votes)

141 views116 pages

Physical Design Automation

Uploaded by

ijalab1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

141 views116 pages

Physical Design Automation

Uploaded by

ijalab1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 116

Physical

Physical Design
Design
Automation
Automation
Speaker:
Debdeep Mukhopadhyay
Dept of Comp. Sc and Engg
IIT Madras, Chennai

June 20, 2021 National Workshop on VLSI 1

Design 2006
Synthesis Flow
High-Level
Synthesis

Logic
Synthesis

Physical
Design

Fabrication and
Packaging
2
Figures adopted with permission from Prof. Ciesielski, UMASS
Physical Design
Circuit
Design

Partitioning

Floorplanning
&
Placement

Routing

Fabrication
3
What is Backend?
• Physical Design:
1. FloorPlanning : Architect’s job

2. Placement : Builder’s job

3. Routing : Electrician’s job

At sub-micron level
4
So, what is Partitioning?

System Level Partitioning System

PCBs
Board Level Partitioning

Chips

Chip Level Partitioning

Subcircuits
/ Blocks
5
Partitioning of a Circuit

6
Why partition ?
• Ask Lord Curzon 
– The most effective way to solve problems of high
complexity : Parallel CAD Development
• System-level partitioning for multi-chip designs
– Inter-chip interconnection delay dominates system
performance
• IO Pin Limitation
• In deep-submicron designs, partitioning defines
local and global interconnect, and has significant
impact on circuit performance

7
Objectives
• Since each partition can correspond to a chip,
interesting objectives are:
– Minimum number of partitions
• Subject to maximum size (area) of each partition
– Minimum number of interconnections between
partitions
• Since they correspond to off-chip wiring with more
delay and less reliability
• Less pin count on ICs (larger IO pins, much higher
packaging cost)
– Balanced partitioning given bound for area
of each partition

8
Circuit Representation
• Netlist: B
– Gates: A, B, C, D A
– Nets: {A,B,C}, {B,D}, {C,D}

• Hypergraph: C D
– Vertices: A, B, C, D
– Hyperedges: {A,B,C}, {B,D}, {C,D}

– Vertex label: Gate size/area B

– Hyperedge label: A
Importance of net (weight)

C D
9
Circuit Partitioning:
Formulation
Bi-partitioning formulation:
Minimize interconnections between partitions
c(X,X’)

X X’

•Minimum cut: min c(x, x’)

•minimum bisection: min c(x, x’) with |x|= |x’|
•minimum ratio-cut: min c(x, x’) / |x||x’|

10
A Bi-Partitioning Example
a c 100 e
100 100
100 100
9
min-cut
4

b 10 d 100 f

mini-ratio-cut min-bisection

Min-cut size=13
Min-Bisection size = 300
Min-ratio-cut size= 19

Ratio-cut helps to identify natural clusters

11
Iterative Partitioning
Algorithms
• Greedy iterative improvement
method (Deterministic)
– [Kernighan-Lin 1970]

• Simulated Annealing (Non-

Deterministic)

12
Restricted Partition Problem
• Restrictions:
– For Bisectioning of circuit
– Assume all gates are of the same size
– Works only for 2-terminal nets

• If all nets are 2-terminal, hypergraph  graph

b b
a a

c d c d
Hypergraph Graph
Representation Representation 13
Problem Formulation
• Input: A graph with
– Set vertices V (|V| = 2n)
– Set of edges E (|E| = m)
– Cost cAB for each edge {A, B} in E
• Output: 2 partitions X & Y such that
– Total cost of edge cuts is minimized
– Each partition has n vertices
• This problem is NP-Complete!!!!!

14
A Trivial Approach
• Try all possible bisections and find the best one
• If there are 2n vertices,
# of possibilities = (2n)! / n!2 = nO(n)

• For 4 vertices (a,b,c,d), 3 possibilities

1. X={a,b} & Y={c,d}
2. X={a,c} & Y={b,d}
3. X={a,d} & Y={b,c}

• For 100 vertices, 5x1028 possibilities

• Need 1.59x1013 years if one can try 100M

possbilities per second 15

Definitions
• Definition 1: Consider any node a in block X.
The contribution of node a to the cutset is
called the external cost of a and is denoted
as Ea, where
Ea =Σcav (for all v in Y)
• Definition 2: The internal cost Ia of node a
in X is defined as follows:
Ia =Σcav (for all v in X)

16
Example
• External cost (connection) Ea = 2
• Internal cost Ia = 1

X b Y
c
a d

17
Idea of KL Algorithm
• Da = Decrease in cut value if moving a = Ea-Ia
– Moving node a from block X to block Y would decrease the
value of the cutset by Ea and increase it by Ia

X b Y X b Y
c
c
a d a d

Da = 2-1 = 1
Db = 1-1 = 0
18
Useful Lemmas

• To maintain balanced partition, we must move a

node from Y to X each time we move a node
from X to Y
• The effect of swapping two modules a in X with
b in Y is characterized by the following lemma:
• Lemma 1: If two elements a in X and b in Y are
interchanged, the reduction in the cost is given
by:
gain(a,b)= gab = Da + Db - 2cab

19
Example
• If switch a & b, gain(a,b) = Da+Db-2cab
– cab: edge cost for ab

X b Y X b Y
c c
d
a a d
gain(a,b) = 1+0-2 = -1

20
Useful Lemmas

• The following lemma tells us how to update the D-

values after a swap.
• Lemma 2: If two elements a in X and b in Y are
interchanged, then the new D-values are given by
D’k = Dk + 2cka - 2ckb; for all k in X – {a}
D’m = Dm + 2cmb - 2cma; for all m in Y – {b}

• Notice that if a module j is neither

connected to a nor to b then cja = cjb = 0,
and, Dj=D’j
21
Overview of KL Algorithm
• Start from an initial partition {X,Y} of n elements each
• Use lemmas 1 and 2 together with a greedy procedure to
identify two subsets A in X, and B in Y, of equal
cardinality, such that when interchanged, the partition
cost is improved
• A and B may be empty, indicating
in that case that the current
partition can no longer be improved

22
Idea of KL Algorithm
• Start with any initial legal partitions X and Y
• A pass (exchanging each vertex exactly once) is described below:
1. For i := 1 to n do
From the unlocked (unexchanged) vertices,
choose a pair (A,B) s.t. gain(A,B) is largest
Exchange A and B. Lock A and B.
Let gi = gain(A,B)
2. Find the k s.t. G=g1+...+gk is maximized
3. Switch the first k pairs
• Repeat the pass until there is no
improvement (G=0)

23
Greedy Procedure to Identify A,
B at Each Iteration
1. Compute gab for all a in X and b in Y
2. Select the pair (a1, b1) with maximum gain g1 and lock
a1 and b1
3. Update the D-values of remaining free cells and
recompute the gains
4. Then a second pair (a2, b2) with maximum gain g2 is
selected and locked. Hence, the gain of swapping the
pair (a1, b1) followed by the (a2, b2) swap is G2 = g1 +
g2.

24
Greedy ….(contd.)
5. Continue selecting (a3, b3), … , (ai, bi), … ,
(an, bn) with gains g3, … , gi, … , gn
6. The gain of making the swap of the first k
pairs is Gk = g1+…+gk. If there is no k such
that Gk > 0 then the current partition
cannot be improved; otherwise choose the
k that maximizes Gk, and make the
interchange of {a1, a2, … , ak} with {b1, b2,
… , bk} permanent

25
Partitioning:
Simulated Annealing

June 20, 2021 National Workshop on VLSI 26

Design 2006
State Space Search
Problem
• Combinatorial optimization problems (like partitioning) can
be thought as a State Space Search Problem.
• A State is just a configuration of the combinatorial objects
involved.
• The State Space is the set of all possible states
(configurations).
• A Neighbourhood Structure is also defined (which states
can one go in one step).
• There is a cost corresponding to each state.
• Search for the min (or max) cost state.

27
Greedy Algorithm
• A very simple technique for State Space
Search Problem.
• Start from any state.
• Always move to a neighbor with the min
cost (assume minimization problem).
• Stop when all neighbors have a higher cost
than the current state.

28
Problem with Greedy Algorithms
• Easily get stuck at local minimum.
• Will obtain non-optimal solutions.
Cost

State

• Optimal only for convex (or concave for

maximization) funtions.

29
Greedy Nature of KL
• KL is almost greedy algorithms.
Pass 1 Pass 2
Cut Value

Partitions

• Purely greedy if we
Move 1 consider a pass as a “move”.
Cut Value

Move 2 A B

A Move

B A
Partitions 30
Simulated Annealing
• Very general search technique.
• Try to avoid being trapped in local
minimum by making probabilistic moves.
• Popularize as a heuristic for optimization
by:
– Kirkpatrick, Gelatt and Vecchi, “Optimization
by Simulated Annealing”, Science,
220(4598):498-516, May 1983.

31
Basic Idea of Simulated
Annealing
• Inspired by the Annealing Process:
– The process of carefully cooling molten metals in order
to obtain a good crystal structure.
– First, metal is heated to a very high temperature.
– Then slowly cooled.
– By cooling at a proper rate, atoms will have an increased
chance to regain proper crystal structure.
• Attaining a min cost state in simulated
annealing is analogous to attaining a good
crystal structure in annealing.

32
Simulated Annealing
Temperature
Cost dropping
Drop back

State
33
The Simulated Annealing Procedure
Let t be the initial temperature.
Repeat
Repeat
– Pick a neighbor of the current state randomly.
– Let c = cost of current state.
Let c’ = cost of the neighbour picked.
– If c’ < c, then move to the neighbour (downhill move).
– If c’ > c, then move to the neighbour with
probablility e-(c’-c)/t (uphill move).
Until equilibrium is reached.
Reduce t according to cooling schedule.
Until Freezing point is reached.

34
Things to decide when using SA
• When solving a combinatorial problem,
we have to decide:
– The state space
– The neighborhood structure
– The cost function
– The initial state
– The initial temperature
– The cooling schedule (how to change t)
– The freezing point

35
Common Cooling Schedules
• Initial temperature, Cooling schedule,
and freezing point are usually
experimentally determined.
• Some common cooling schedules:
– t = t, where  is typically around 0.95
– t = e-t t, where  is typically around 0.7
– ......

36
Hierarchical Design
• Several blocks after partitioning:

• Need to:
– Put the blocks together.
– Design each block.
Which step to go first?

37
Hierarchical Design
• How to put the blocks together
without knowing their shapes and the
positions of the I/O pins?
• If we design the blocks first, those
blocks may not be able to form a
tight packing.

38
Floorplanning
The floorplanning problem is to plan the positions and
shapes of the modules at the beginning of the design
cycle to optimize the circuit performance:
– chip area
– total wirelength
– delay of critical path
– routability
– others, e.g., noise, heat
dissipation, etc.

39
Floorplanning v.s. Placement
• Both determines block positions to optimize the
circuit performance.
• Floorplanning:
– Details like shapes of blocks, I/O pin positions, etc.
are not yet fixed (blocks with flexible shape are
called soft blocks).
• Placement:
– Details like module shapes and I/O pin positions are
fixed (blocks with no flexibility in shape are called
hard blocks).

40
Floorplanning Problem
• Input:
– n Blocks with areas A1, ... , An
– Bounds ri and si on the aspect ratio of block Bi
• Output:
– Coordinates (xi, yi), width wi and height hi for each block
such that hi wi = Ai and
ri  hi/wi  si
• Objective:
– To optimize the circuit performance.

41
Bounds on Aspect Ratios
If there is no bound on the aspect ratios, can we pack
everything tightly?
- Sure!

But we don’t want to layout blocks as long strips, so we

require
ri  hi/wi  si for each i.

42
Slicing and Non-Slicing
Floorplan
• Slicing Floorplan:
One that can be obtained by
repetitively subdividing
(slicing) rectangles
horizontally or vertically.

• Non-Slicing Floorplan:
One that may not be obtained
by repetitively subdividing
alone.

43
Polar Graph Representation
• A graph representation of floorplan.
• Each floorplan is modeled by a pair of directed acyclic
graphs:
– Horizontal polar graph
– Vertical polar graph
• For horizontal (vertical) polar graph,
– Vertex: Vertical (horizontal) channel
– Edge: 2 channels are on 2 sides of a block
– Edge weight: Width (height) of the block

Note: There are many other graph representations.

44
Polar Graph: Example

Vertical Polar Graph

Horizontal Polar Graph
45
Simulated
Simulated Annealing
Annealing using
using Polish
Polish Expression
Expression
Representation
Representation

D.F. Wong and C.L. Liu,

“A New Algorithm for Floorplan Design”
DAC, 1986, pages 101-107.

June 20, 2021 National Workshop on VLSI 46

Design 2006
Representation of Slicing Floorplan

Slicing Floorplan Slicing Tree

V
1 3
H H
4 5
2 1 H 3
2 6 7
V V
6 7 4 5

Polish Expression
(postorder traversal
of slicing tree) 21H67V45VH3HV
47
Polish Expression
• Succinct representation of slicing floorplan
– roughly specifying relative positions of blocks
• Postorder traversal of slicing tree
1. Postorder traversal of left sub-tree
2. Postorder traversal of right sub-tree
3. The label of the current root
• For n blocks, a Polish Expression contains n operands (blocks)
and n-1 operators (H, V).
• However, for a given slicing floorplan, the corresponding
slicing tree (and hence polish expression) is not unique.
Therefore, there is some redundancy in the representation.

48
Skewed ST and Normalized PE
• Skewed Slicing Tree:
– no node and its right son are the same.
• Normalized Polish Expression:
– no consecutive H’s or V’s.

Slicing Floorplan Slicing Tree (Skewed) Slicing Tree

V V
1 3
H H H H
4 5 2 1 H 3 2 1 V H
2 6 7 V V 6 7 V 3

6 7 4 5 4 5

21H67V45VH3HV 21H67V45V3HHV
Polish Expression 49
Normalized Polish Expression
• There is a 1-1 correspondence between Slicing
Floorplan, Skewed Slicing Tree, and Normalized
Polish Expression.
• Will use Normalized Polish Expression to
represent slicing floorplans.
– What is a valid NPE?
• Can be formulated as a state space search
problem.

50
Neighborhood Structure
• Chain: HVHVH.... or VHVHV....
16H35V2HV74HV
Chains

• The moves:
M1: Swap adjacent operands (ignoring chains)
M2: Complement some chain
M3: Swap 2 adjacent operand and operator
(Note that M3 can give you some invalid NPE.
So checking for validity after M3 is needed.)

51
Example of Moves
1 1
M1
2 4
5 5
3 4 3 2
34V2H5V1H 32V4H5V1H M3

1 1
4 5 5
M2 3 2
3 2 4
32V45VH1H 32V45HV1H
52
Shape Curve
• To represent the possible shapes of a block.

Block with several

Soft block existing design
h h
Feasible
Feasible region
region
wh = A

(0,0) w (0,0) w
53
Combining Shape Curves
h 1
2 2
• 12V: 1
12V
w

12H
2 h

• 12H: 1 1
2
w 54
Find the Best Area for a
NPE
• Recursively combining shape curves.
Pick the
best
2 V
1
3 1 H

3 2

55
Updating Shape Curves after Moves

• If keeping k points for each shape curve, time for

shape curve computation for each NPE is O(kn).
• After each move, there is only small change in the
floorplan. So there is no need to start shape
curve computation from scratch.
• We can update shape curves incrementally after
each move.
• Run time is about O(k log n).

56
Initial Solution
• 12V3V4V...nV

1 2 3 .... n

57
Annealing Schedule
• Ti = Ti-1 where =0.85
• At each temperature, try k x n moves
(k is around 5 to 10)
• Terminate the annealing process if
– either # of accepted moves < 5%
– or the temperate is low enough

58
Problem formulation
• Input:
– Blocks (standard cells and macros) B1, ... , Bn
– Shapes and Pin Positions for each block Bi
– Nets N1, ... , Nm
• Output:
– Coordinates (xi , yi ) for block Bi.
– No overlaps between blocks
– The total wire length is minimized
– The area of the resulting block is minimized or given a fixed
die
• Other consideration: timing, routability, clock,
buffering and interaction with physical synthesis

59
Importance of Placement
• Placement is a key step in physical design
• Poor placement consumes large area,
leads to difficult/ impossible routing task
• Ill placed layout cannot be improved by
high quality routing
• Quality of placement:
– Layout area
– Routability
– Performance (usually timing, measured by
delay of critical/ longest net)
60
Placement
affects chip area

61
…And also Wire Length

62
Force Directed Approach
• Transform the placement problem to the
classical mechanics problem of a system of
objects attached to springs
• Analogies:
– Module (Block/Cell/Gate) = Object
– Net = Spring
– Net weight = Spring constant
– Optimal placement = Equilibrium configuration

63
An Example

Resultant
Force

64
Force Calculation
• Hooke’s Law:
– Force = Spring Constant x Distance
• Can consider forces in x- and y-direction separately:

Distance d ij  ( x j  xi ) 2  ( y j  yi ) 2
Net Cost cij (xj, yj)

F  cij ( x j  xi ) 2  ( y j  yi ) 2
F
Fx  cij ( x j  xi ) Fx
Fy  cij ( y j  yi ) (xi, yi)
Fy
65
Problem Formulation
• Equilibrium: j cij (xj - xi) = 0 for all module i
• However, trivial solution: xj = xi for all i, j.
Everything placed on the same position!
• Need to have some way to avoid overlapping
• A method to avoid overlapping:
– Add some repulsive force which is inversely
proportional to distance (or distance squared)
• Solution of force equations correspond to the
minimum potential energy of system
– n
PE   [( Fxi ) 2 ( Fyi ) 2 ]
i 1

66
Comments on
Force-Directed Placement
 Use directions of forces to guide the
search
 Usually much faster than simulated
annealing
x Focus on connections, not shapes of blocks
x Only a heuristic; an equilibrium
configuration does not necessarily give a
good placement
? Successful or not depends on the way to
eliminate overlapping
67
Routing in design flow
B

A C
Post Placed
Netlist

INV

Routing
AND
Process of finding
OR
geometric layouts of the
net

Floorplan/Placement
68
The Routing Problem
• Apply it after Placement
• Input:
– Netlist
– Timing budget for, typically, critical nets
– Locations of blocks and locations of pins
• Output:
– Geometric layouts of all nets
• Objective:
– Minimize the total wire length, the number of vias, or just
completing all connections without increasing the chip
area.
– Each net meets its timing budget.

69
The Routing Constraints
• Examples:
– Placement constraint
– Number of routing layers
– Delay constraint
– Meet all geometrical constraints (design rules)
– Physical/Electrical/Manufacturing constraints:
• Crosstalk

70
Steiner Tree
• For a multi-terminal net, we can construct a
spanning tree to connect all the terminals
together.
• But the wire length will be large.
• Better use Steiner Tree: Steiner
A tree connecting all terminals and some Node
additional nodes (Steiner nodes).
• Rectilinear Steiner Tree:
Steiner tree in which all the edges run
horizontally and vertically.

71
Routing Problem is Very Hard
• Minimum Steiner Tree Problem:
– Given a net, find the Steiner tree with the
minimum length.
– Input :An edge weighted graph G=(V,E) and
a subset D (demand points)
– Output: A subset of vertices V’(such that
D is covered) and induces a tree of
minimum cost over all such trees
– This problem is NP-Complete!
72
Heuristic Algorithms
• Use MST (minimum spanning tree)
algorithms to start with
– CostMST/CostRMST≤3/2
– Heuristics can guarantee that the weight of
RST is at most 3/2 of the weight of the
optimal tree
• Apply local modifications to reach a RMST
(rectilinear minimum steiner tree)

73
Kinds of Routing
• Global Routing
• Detailed Routing
– Channel
– Switchbox
• Others:
– Maze routing
– Over the cell routing
– Clock routing

74
General Routing Paradigm
Two phases:

75
Extraction and
Timing Analysis
• After global routing and detailed
routing, information of the nets can be
extracted and delays can be analyzed.
• If some nets fail to meet their timing
budget, detailed routing and/or global
routing needs to be repeated.

76
Routing Regions

77
Global Routing

Global routing is divided into 3 phases:

1. Region definition
2. Region assignment
3. Pin assignment to routing regions

78
Maze
Maze Routing
Routing

June 20, 2021 National Workshop on VLSI 79

Design 2006
Maze Routing Problem
• Given:
– A planar rectangular grid graph.
– Two points S and T on the graph.
– Obstacles modeled as blocked vertices.
• Objective:
– Find the shortest path connecting S and T.
• This technique can be used in global or
detailed routing (switchbox) problems.

80
Grid Graph
S S
S 

T X X 
T
X X  T

Area Routing Grid Graph Simplified

(Maze) Representation

Blocked cells

81
Maze Routing

82
Lee’s Algorithm
“An Algorithm for Path Connection and
its Application”, C.Y. Lee, IRE
Transactions on Electronic Computers,
1961.

83
Basic Idea
• A Breadth-First Search (BFS) of the
grid graph.
• Always find the shortest path
possible.
• Consists of two phases:
– Wave Propagation
– Retrace

84
An Illustration

S
0 1 2 3
1 2 3
3 4 5
T
5 4 5 6

85
Wave Propagation
• At step k, all vertices at Manhattan-
distance k from S are labeled with k.
• A Propagation List (FIFO) is used to
keep track of the vertices to be
considered next.
S S S
0 0 1 2 3 0 1 2 3
1 2 3 1 2 3
3 3 4 5
T T T
5 4 5 6
After Step 0 After Step 3 After Step 6 86
Retrace
• Trace back the actual route.
• Starting from T.
• At vertex with k, go to any vertex
with label k-1.
S
0 1 2 3
1 2 3
3 4 5
T
5 4 5 6
Final labeling
87
How many grids visited using Lee’s algorithm?
13 121110 7 6 7 7 9 10
12 1110 9 6 5 6 7 8 9 101112
1110 9 8 7 6 5 4 7 8 9 1011
10 9 8 7 6 5 4 3 6 7 8 9 10
7 6 54 3 2 1 2 3 4 5 67 8 9
6 5 4 3 2 1 S1 23 4 5 6 7 8
9 8 7 6 3 2 1 2 3 4 5 6 78 9
10 9 8 7 3 5 6 7 8 9 10
1110 9 8 9 10 76 7 8 9 1011
12 11 10 11121110 9 8 9 101112
13 12 11121312 1110 9 10111213
12 13 1312 1110 111213
13 13 1211 1213
1312 T 13
13
88
Time and Space Complexity

• For a grid structure of size w  h:

• Time per net = O(wh)
• Space = O(wh log wh) (O(log wh) bits are needed during
exploration phase + one additional bit to indicate blocked or
not)
• For a 2000  2000 grid structure:
• 12 bits per label
• Total 6 Mbytes of memory!

• For 4000 x 4000, 48 M bytes!

89
Acker’s coding :
Improvement to Lee’s Algorithm
• The vertices in wave-front L are always
adjacent to the vertices L-1 and L+1 in
the wavefront
• Soln: the predecessor of any wavefront is
labeled different from its successor
• 0,0,1,1,0,….
• Need to indicate blocked or not
• Hence can do away with 2 bits
• Time complexity is not improved
90
Acker’s Technique

S
0 1 0 1
1 0 1
1 0 1
T
1 0 1 0

91
Detailed
Detailed Routing
Routing

June 20, 2021 National Workshop on VLSI 92

Design 2006
Detailed routing
• Global routing do not define wires
• They define routing regions
• Detailed router places actual wires
within regions, indicated by the
global router
• We consider the channel routing
problem here…

93
Channel Routing
• A channel is the routing region bounded
by two parallel rows of terminals
• Assume top and bottom boundary
• Each terminal is assigned a number to
indicate which net it belongs to
• 0 indicates : does not require an
electrical connection

94
Channel Routing

channel

95
Channel Routing
Terminals
Via

Upper boundary

Tracks Dogleg

Lower boundary

Trunks Branches
96
Channel Routing
0 1 4 5 1 6 7 0 4 9 10 10

2 3 5 3 5 2 6 8 9 8 7 9
How to connect all the points with the same
label with the smallest no. of tracks
(to minimize the channel height)? 97
Horizontal Constraint
Graph (HCV)
0 1 6 1 2 3 5
1 2

6 3 5 4 0 2 4
6 3

5 4
0 1 6 1 2 3 5
6
1
3
5
4 Clique of size 4
2
98
Left-Edge Algorithm
1. Sort the horizontal segments of the
nets in increasing order of their left
end points.
2. Place them one by one greedily on
the bottommost available track.

99
Left-Edge Algorithm
0 1 6 1 2 3 5

6 3 5 4 0 2 4

1. Sort by left end points. 2. Place nets greedily.

0 1 6 1 2 3 5
0 1 6 1 2 3 5
6
1 5
3 3
5 1 2
4 6 4
2
6 3 5 4 0 2 4
6 3 5 4 0 2 4
100
Vertical Constraint
Graph and Doglegs
1 2

1 2

VCG : Cycle
2 1
2 imposes a
1 imposes a vertical
vertical 1 2
constraint on 2, as
constraint
top terminal belongs
on 1
to 1 and bottom
terminal belongs to 2

Dogleg
2 1
101
The
The Cadence
Cadence
Tutorial
Tutorial

June 20, 2021 National Workshop on VLSI 102

Design 2006
Silicon Ensemble (Cadence)

• LEF: Cell boundaries, pins, routing layer (metal)

spacing and connect rules.

• DEF: Contains netlist information, cell placement,

cell orientation, physical connectivity.

• GCF: Top-level timing constraints handed down by

the front end designer are handed to the SE,
using PEARL.

103
The files required
• Pre-running file:
• se.ini- initialization file for SE.

• Create the following directories:

• lef, def, verilog (netlist) , gcf.

• Type seultra –m=300 &, opens SE in graphical

mode.

104
Importing required files
• Import LEF (in the order given):
• header.lef, xlitecore.lef, c8d_40m_dio_00.lef
• Import gcf file:
• Import verilog netlist, xlite_core.v,
c8d_40m_dio_00.v, padded_netlist.v
• Import the gcf file as system constraints file.
• Import the .def file for the floor-planning

105
Structure of a Die
• A Silicon die is mounted inside a chip package.
• A die consists of a logic core inside a power ring.
• Pad-limited die uses tall and thin pads which
maximises the pads used.
• Special power pads are used for the VDD and VSS.
• One set of power pads supply one power ring that
supplies power to the I/O pads only: Dirty Power.
• Another set of power pads supply power to the
logic core: Clean Power.

106
• Dirty Power: Supply large
transient current to the output
transistor.
• Avoids injecting noise into the
internal logic circuitry.
• I/O Pads can protect against ESD
as it has special circuit to protect
against very short high voltage
pulses.
107
Design Styles
• PAD limited design: The number of
PADS around the outer edge of the die
determines the die size , not the
number of gates.

• Opposite to that we have a core-

limited design.

108
Concept of clock Tree

Main
Branch

Side
Branches

Clock
Pad

109
CLOCK DRIVER
A1, B1, C1
CLK D1, D2, E1
D3, E2, F1
C1 C2 CL

Clock
Spine

An important result:

The delay through a chain of CMOS gates is minimized when the

ratio between the input capacitance C1 and the load C2 is about 3.

110
Clock and the cells

B1 E1
E2
B2

CLK
D1
D2
F1

111
• All clocked elements are driven from one
net with a clock spine, skew is caused by
differing interconnect delays and loads
(fanouts ?).
• If the clock driver delay is much larger
than the inter-connect delay, a clock
spline achieves minimum skew but with
latency.
• Spread the power dissipation through the
chip.
• Balance the rise and the fall time.
112
Placement
• Row based ASICS.
• Interconnects run in horizontal and
vertical directions.
• Channel Capacity: Maximum number
of horizontal connections.
• Row Utilization

113
Routing
• Minimize the interconnect length.

• Maximize the probability that the

detailed router can completely finish the
job.

• Minimize the critical path delay.

114
Conclusion: Our backend
flow
1. Loading initial data.
2. Floor-planning
3. I/O Placing
4. Planning the power routing : Adding Power rings , stripes
5. Placing cells
6. Placing the clock tree.
7. Adding filler cells.
8. Power routing : Connect the rings to the follow pins of the cells.
9. Routing ( Global and final routing )
10. Verify Connectivity, geometry and antenna violations.
11. Physical verification (DRC and LVS check using Hercules).

Thank You

115
Main references
• Algorithms for VLSI Physical Design
Automation (Hardcover) by Naveed A.
Sherwani

• Application-Specific Integrated Circuits,

M. J. Sebastian Smith

• Silicon-Ensemble Tool, Cadence®

116

Krage Et Al, 2020
No ratings yet
Krage Et Al, 2020
17 pages
Lecture 07: PARTITIONING: Prof. Indranil Sengupta
No ratings yet
Lecture 07: PARTITIONING: Prof. Indranil Sengupta
143 pages
Lecture4 Partition2
100% (1)
Lecture4 Partition2
51 pages
VLSI Physical Design Automation: Lecture 4. Circuit Partitioning (II)
No ratings yet
VLSI Physical Design Automation: Lecture 4. Circuit Partitioning (II)
51 pages
FPGAPartitioning
No ratings yet
FPGAPartitioning
83 pages
Lec 06
No ratings yet
Lec 06
78 pages
Lecture3 Partition
100% (1)
Lecture3 Partition
24 pages
Course:-Cad For Vlsi Partitioning Algorithm (K-L Algorithm and F-M Algorithm)
No ratings yet
Course:-Cad For Vlsi Partitioning Algorithm (K-L Algorithm and F-M Algorithm)
9 pages
23 Partitioning Algorithms
No ratings yet
23 Partitioning Algorithms
23 pages
Comparative Analysis of KL and SA Partitioning Algorithms Implemented On VLSI Circuit Partitioning
No ratings yet
Comparative Analysis of KL and SA Partitioning Algorithms Implemented On VLSI Circuit Partitioning
7 pages
Optimization Report
No ratings yet
Optimization Report
9 pages
FALLSEM2018-19 - ECE5019 - TH - TT531 - VL2018191001170 - Reference Material I - KL Algo For Partitioning
No ratings yet
FALLSEM2018-19 - ECE5019 - TH - TT531 - VL2018191001170 - Reference Material I - KL Algo For Partitioning
30 pages
System Partitioning
No ratings yet
System Partitioning
3 pages
8.CS G553-Compre Answer Key-I Sem 2018-2019
No ratings yet
8.CS G553-Compre Answer Key-I Sem 2018-2019
7 pages
Partitioning PDF
No ratings yet
Partitioning PDF
11 pages
Daa Combined
No ratings yet
Daa Combined
407 pages
Algorithm Design Methods: Spring 2007
No ratings yet
Algorithm Design Methods: Spring 2007
44 pages
687lec13 Partitioning
No ratings yet
687lec13 Partitioning
26 pages
05 Partition
No ratings yet
05 Partition
22 pages
Design Automation: ENGR 3430 - Digital VLSI Mark L. Chang Spring '07
No ratings yet
Design Automation: ENGR 3430 - Digital VLSI Mark L. Chang Spring '07
52 pages
Example: Figure 6: (A) A Circuit To Be Partitioned (B) Its Corre-Sponding Graph
No ratings yet
Example: Figure 6: (A) A Circuit To Be Partitioned (B) Its Corre-Sponding Graph
11 pages
Chapter 3 Greedy Algorithm
No ratings yet
Chapter 3 Greedy Algorithm
13 pages
Chap 2
No ratings yet
Chap 2
53 pages
Dynamic Programming: Design and Analysis of Algorithms
No ratings yet
Dynamic Programming: Design and Analysis of Algorithms
41 pages
DAA Assignment PDF
No ratings yet
DAA Assignment PDF
13 pages
1 Greedy Algorithms: 1.1 Activity Selection Problem
No ratings yet
1 Greedy Algorithms: 1.1 Activity Selection Problem
6 pages
Lec 2
No ratings yet
Lec 2
33 pages
Optimal Sequential Partitions of Graphs (Brian W. Kernighan) (1971)
No ratings yet
Optimal Sequential Partitions of Graphs (Brian W. Kernighan) (1971)
7 pages
CS 4700: Foundations of Artificial Intelligence: Bart Selman Selman@cs - Cornell.edu Local Search
No ratings yet
CS 4700: Foundations of Artificial Intelligence: Bart Selman Selman@cs - Cornell.edu Local Search
43 pages
Printed File
No ratings yet
Printed File
33 pages
Algorithms ADA
No ratings yet
Algorithms ADA
10 pages
05 DP1
No ratings yet
05 DP1
19 pages
Unit-Iv Syllabus What Is Greedy Approach?: Greedy: Interval Scheduling, Minimum Cost
No ratings yet
Unit-Iv Syllabus What Is Greedy Approach?: Greedy: Interval Scheduling, Minimum Cost
15 pages
KL Partitioning
No ratings yet
KL Partitioning
29 pages
(It-Ebooks-2017) It-Ebooks - Design and Analysis of Algorithms Lecture Notes (MIT 6.046J) - Ibooker It-Ebooks (2017) PDF
No ratings yet
(It-Ebooks-2017) It-Ebooks - Design and Analysis of Algorithms Lecture Notes (MIT 6.046J) - Ibooker It-Ebooks (2017) PDF
135 pages
DAA
No ratings yet
DAA
62 pages
Dynamic Programming
No ratings yet
Dynamic Programming
22 pages
CS 258F Winter 2006 Homework 1 (Due Monday Jan. 30, 2006)
No ratings yet
CS 258F Winter 2006 Homework 1 (Due Monday Jan. 30, 2006)
1 page
Solutions Practice Set DP Greedy
No ratings yet
Solutions Practice Set DP Greedy
6 pages
Algorithm Design and Analysis (CS60007) Assignment 1: 1 Interval Scheduling
No ratings yet
Algorithm Design and Analysis (CS60007) Assignment 1: 1 Interval Scheduling
6 pages
ADSA Notes
No ratings yet
ADSA Notes
24 pages
DAA
No ratings yet
DAA
7 pages
Introduction To Algorithms
No ratings yet
Introduction To Algorithms
28 pages
CC204
No ratings yet
CC204
8 pages
Daa Two Mark Questions
No ratings yet
Daa Two Mark Questions
9 pages
Design and Analysis of Algorithm
No ratings yet
Design and Analysis of Algorithm
33 pages
Daa Unit-Ii
No ratings yet
Daa Unit-Ii
68 pages
DAA Unit3 Greedy Methods
No ratings yet
DAA Unit3 Greedy Methods
61 pages
Genetic Algorithm For Circuit Partitioning
No ratings yet
Genetic Algorithm For Circuit Partitioning
6 pages
Algorithm and Its Respective Problems - Docx - 20241203 - 203341 - 0000
No ratings yet
Algorithm and Its Respective Problems - Docx - 20241203 - 203341 - 0000
16 pages
Lab-Viva Questions
No ratings yet
Lab-Viva Questions
6 pages
Algorithm To CHK
No ratings yet
Algorithm To CHK
14 pages
Eee G626 921 C 2017 1
No ratings yet
Eee G626 921 C 2017 1
7 pages
LAC (Location JArea Code) Optimisation
No ratings yet
LAC (Location JArea Code) Optimisation
28 pages
Daa Part 2C
No ratings yet
Daa Part 2C
10 pages
Wa0036.
No ratings yet
Wa0036.
92 pages
Greedy DP
No ratings yet
Greedy DP
81 pages
Solutions to Problems in Fluids and Turbomachinery
From Everand
Solutions to Problems in Fluids and Turbomachinery
Rahul Basu
No ratings yet
Calculus-II (Mathematics) Question Bank
From Everand
Calculus-II (Mathematics) Question Bank
Mohmmad Khaja Shareef
No ratings yet
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet
Shortcuts to College Calculus Refreshment Kit
From Everand
Shortcuts to College Calculus Refreshment Kit
Juan Acevedo
No ratings yet
Redhawk Linux User'S Guide: 0898004-300 April 2003
No ratings yet
Redhawk Linux User'S Guide: 0898004-300 April 2003
233 pages
Iare Vlsi Notes 2
No ratings yet
Iare Vlsi Notes 2
247 pages
Place & Route Tutorial #1: I. Setup
No ratings yet
Place & Route Tutorial #1: I. Setup
13 pages
I. Setup For Cadence Innovus
No ratings yet
I. Setup For Cadence Innovus
20 pages
Lecture 02 HDL PDF
No ratings yet
Lecture 02 HDL PDF
216 pages
Asic/Fpga C Asic/Fpga C Chip Design Chip Design: Power Di Power Di Ssipation Ssipation
No ratings yet
Asic/Fpga C Asic/Fpga C Chip Design Chip Design: Power Di Power Di Ssipation Ssipation
51 pages
Tutorial Innovus
No ratings yet
Tutorial Innovus
47 pages
6 Prime Time
No ratings yet
6 Prime Time
42 pages
Compressors, Gas Dynamics and Gas Turbines MCQ
100% (1)
Compressors, Gas Dynamics and Gas Turbines MCQ
11 pages
Forms of EMH
No ratings yet
Forms of EMH
33 pages
Applications of Nanotechnology in Agriculture
No ratings yet
Applications of Nanotechnology in Agriculture
417 pages
2022 Lutomirski - Strength Reduction Factors
No ratings yet
2022 Lutomirski - Strength Reduction Factors
9 pages
Emergent Ecapture Pro Manual v0.1.7 (2022-08-05)
No ratings yet
Emergent Ecapture Pro Manual v0.1.7 (2022-08-05)
128 pages
Wear3 PDF
No ratings yet
Wear3 PDF
8 pages
Relevance of Free Jet Model For Soil Erosion by Impinging Jets
No ratings yet
Relevance of Free Jet Model For Soil Erosion by Impinging Jets
15 pages
Keyboard Shortcuts Linux
No ratings yet
Keyboard Shortcuts Linux
1 page
SAP HANA SQL Script Reference en
No ratings yet
SAP HANA SQL Script Reference en
256 pages
Unit III 1
No ratings yet
Unit III 1
11 pages
AE264 Spring2014 HW1
No ratings yet
AE264 Spring2014 HW1
3 pages
Variable Reviewer
No ratings yet
Variable Reviewer
24 pages
Chapter - 2 - Fundamentals of C++ Programming
100% (2)
Chapter - 2 - Fundamentals of C++ Programming
16 pages
The "Everything We Could Find On Microsoft VBA" List: Microsoft Support Knowledge Base
0% (1)
The "Everything We Could Find On Microsoft VBA" List: Microsoft Support Knowledge Base
3 pages
Cells and Tissues
No ratings yet
Cells and Tissues
5 pages
Quant Checklist Module 16 by Aashish Arora
No ratings yet
Quant Checklist Module 16 by Aashish Arora
60 pages
Selective Determination of Fe (III) in Fe (II) Samples by UV-spectrophotometry With The Aid of Quercetin and Morin
No ratings yet
Selective Determination of Fe (III) in Fe (II) Samples by UV-spectrophotometry With The Aid of Quercetin and Morin
8 pages
IEC Certification Kit Release Notes
No ratings yet
IEC Certification Kit Release Notes
27 pages
Water Level Indicator
No ratings yet
Water Level Indicator
29 pages
3200AMMe - Part 4
100% (1)
3200AMMe - Part 4
207 pages
Chapter 5
No ratings yet
Chapter 5
30 pages
Preview-9781108945509 A49238094
No ratings yet
Preview-9781108945509 A49238094
36 pages
Grade 8-Ls-13-Light-Work Book
No ratings yet
Grade 8-Ls-13-Light-Work Book
6 pages
6.state in React
No ratings yet
6.state in React
31 pages
Stragieretal.2019 Efficacyofanewstrengthtrainingdesign The3 7method
No ratings yet
Stragieretal.2019 Efficacyofanewstrengthtrainingdesign The3 7method
12 pages
Laser Maser
No ratings yet
Laser Maser
4 pages
Matlab Programmming Previous Papers
No ratings yet
Matlab Programmming Previous Papers
4 pages
Network How To
100% (1)
Network How To
139 pages
Physics 2020 QP Set 1 English
No ratings yet
Physics 2020 QP Set 1 English
10 pages