Lecture 7: Logical Effort
Hyeon-Min Bae
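Logical effort g: how much harder it is to drive a gate than to drive an equivalent inverter.
$$\frac{C_{g,\text{nor}}}{C_{g,\text{inv}}} = \frac{5}{3} \;\text{(2 input)}, \qquad \frac{C_{g,\text{nor}}}{C_{g,\text{inv}}} = \frac{2N+1}{3} = g_{\text{nor}} \;\text{(N input)}$$
$$\frac{C_{\text{int,nor}}}{C_{\text{int,inv}}} = \frac{6}{3} = 2 \;\text{(2 input)}, \qquad \frac{C_{\text{int,nor}}}{C_{\text{int,inv}}} = \frac{2N+N}{3} = N = p_{\text{nor}} \;\text{(N input)}$$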
Copyright 2022, Nanoscale advanced integrated systems lab, KAIST 4
Logical effort
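[Figure: inverter with PMOS width 2W ($C_P = 2C_{unit}$) and NMOS width W ($C_N = C_{unit}$), intrinsic output capacitance $C_{int}$, driving a load $C_L$.]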
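$$\text{Delay} \sim R_W (C_{int} + C_L)$$
$$t_p = k R_W C_{int}\left(1 + \frac{C_L}{C_{int}}\right) = t_{p0}\left(1 + \frac{f C_{gin}}{\gamma C_{gin}}\right) = t_{p0}\left(1 + \frac{f}{\gamma}\right)$$
$C_{int} = \gamma C_{gin}$ with $\gamma \approx 1$
$f = C_L / C_{gin}$: effective fanout
$R = R_{unit}/W$; $C_{int} = W C_{unit}$
$t_{p0} = 0.69 R_{unit} C_{unit}$: intrinsic delay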
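As a quick numerical check of the single-stage model above, the following Python sketch evaluates $t_{p0}$ and $t_p$. The function names and the sample values of $R_{unit}$, $C_{unit}$ and $f$ are illustrative assumptions, not values from the lecture.

```python
def intrinsic_delay(r_unit, c_unit):
    """t_p0 = 0.69 * R_unit * C_unit (seconds for ohms and farads)."""
    return 0.69 * r_unit * c_unit


def stage_delay(t_p0, f, gamma=1.0):
    """t_p = t_p0 * (1 + f / gamma), with f = C_L / C_gin."""
    return t_p0 * (1.0 + f / gamma)


# Assumed example values: R_unit = 10 kOhm, C_unit = 1 fF, fanout f = 4.
t_p0 = intrinsic_delay(10e3, 1e-15)
print(f"t_p0 = {t_p0 * 1e12:.2f} ps")
print(f"t_p  = {stage_delay(t_p0, f=4) * 1e12:.2f} ps")
```

With $\gamma = 1$, a fanout of 4 gives a stage delay of five intrinsic delays.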
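$$t_p = t_{p0,\text{inv}} \sum_{j=1}^{N} \left(p_j + \frac{f_j g_j}{\gamma_{\text{inv}}}\right)$$
$t_p$ is minimized when $f_i g_i = f_{i+1} g_{i+1} = \dots = h$ (constant).
Note: compared to the inverter chain, the logical effort $g_i$ now appears. If all gates are inverters, $g_i = 1$.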
Apply to Inverter Chain
[Figure: chain of N inverters (stages 1, 2, ..., N) from In to Out, driving a load $C_L$.]
$$t_{p,j} \sim R_{unit} C_{unit}\left(1 + \frac{C_{gin,j+1}}{\gamma C_{gin,j}}\right)$$
$$t_p = \sum_{j=1}^{N} t_{p,j} = t_{p0} \sum_{j=1}^{N} \left(1 + \frac{C_{gin,j+1}}{\gamma C_{gin,j}}\right), \qquad C_{gin,N+1} = C_L$$
$$\frac{dt_p}{dC_{gin,j}} = \frac{t_{p0}}{\gamma}\left(\frac{1}{C_{gin,j-1}} - \frac{C_{gin,j+1}}{C_{gin,j}^2}\right)$$
Optimum when
$$\frac{dt_p}{dC_{gin,j}} = 0 \;\rightarrow\; \frac{C_{gin,j}}{C_{gin,j-1}} = \frac{C_{gin,j+1}}{C_{gin,j}} = f \quad \text{(scale factor)}$$
- each stage has the same effective fanout (Cout/Cin)
- each stage has the same delay
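The optimum condition above means the input capacitances form a geometric series with ratio $f = (C_L / C_{gin,1})^{1/N}$. A minimal Python sketch of that sizing follows; the function names and the example numbers (3 stages, $C_L = 64\,C_{gin,1}$) are assumptions chosen for illustration.

```python
def size_inverter_chain(c_in, c_load, n_stages):
    """Return the per-stage fanout f and the input capacitance of each stage."""
    f = (c_load / c_in) ** (1.0 / n_stages)
    caps = [c_in * f ** j for j in range(n_stages)]
    return f, caps


def chain_delay(t_p0, c_in, c_load, n_stages, gamma=1.0):
    """Total delay N * t_p0 * (1 + f / gamma) of the equally loaded chain."""
    f, _ = size_inverter_chain(c_in, c_load, n_stages)
    return n_stages * t_p0 * (1.0 + f / gamma)


# Assumed example: 3 stages, C_L = 64 * C_gin,1, t_p0 normalized to 1.
f, caps = size_inverter_chain(1.0, 64.0, 3)
print("fanout per stage:", f)
print("stage input capacitances:", caps)
print("total delay (in units of t_p0):", chain_delay(1.0, 1.0, 64.0, 3))
```

Each stage then sees the same fanout and contributes the same delay, as the two points above state.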
$$\prod_j f_j = F = \frac{C_L}{C_{g1}}$$
F: overall electrical effort
$$\prod_j g_j = G$$
G: path logical effort
$$\prod_j f_j \prod_j g_j = h^N = H = FG$$
H: path effort
F: known
G: known
H = FG: known
$h = H^{1/N} \rightarrow f_i = h / g_i$
The $f_i$ carry the information needed to find the scale factors.
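The procedure above (compute F and G, form H, take the N-th root, then divide by each $g_i$) can be sketched in a few lines of Python. The function name and the example path (inverter, 2-input NAND, inverter with $C_L = 6\,C_{g1}$) are hypothetical and only serve to exercise the formulas.

```python
from math import prod


def path_effort(logical_efforts, c_load, c_g1):
    """Return (F, G, H, h, [f_i]) for an unbranched path."""
    n = len(logical_efforts)
    F = c_load / c_g1              # overall electrical effort
    G = prod(logical_efforts)      # path logical effort
    H = F * G                      # path effort
    h = H ** (1.0 / n)             # optimal stage effort f_i * g_i
    fs = [h / g for g in logical_efforts]
    return F, G, H, h, fs


# Assumed example path: inverter (g = 1), 2-input NAND (g = 4/3), inverter (g = 1).
F, G, H, h, fs = path_effort([1.0, 4.0 / 3.0, 1.0], c_load=6.0, c_g1=1.0)
print("F =", F, "G =", G, "H =", H, "h =", h)
print("stage electrical efforts f_i:", fs)
```

The product of the returned $f_i$ equals F, which is a convenient sanity check on any sizing.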
Logical Effort of Gates
[Figure: normalized delay d versus fan-out h (axis ticks 1 to 7), with one line for the 2-input NAND ($t_{pNAND}$) and one for the inverter ($t_{pINV}$); annotation: F (fan-in).]
2-input NAND: g = 4/3, p = 2, d = (4/3)h + 2
Inverter: g = 1, p = 1, d = h + 1
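[Figure: three-stage path (stages 1, 2, 3) with electrical efforts $f_1, f_2, f_3$, scale factors $S_1 = 1, S_2, S_3$, gate capacitances $C_{g1}, C_{g2}, C_{g3}$, intrinsic capacitances $C_{int1}, C_{int2}, C_{int3}$, and load $C_L$.]
$$C_{g2} = f_1 C_{g1} = S_2 C_{g2,ref}$$
$$f_1 g_1 S_1 = S_2 g_2; \qquad S_{i+1} g_{i+1} = f_i g_i S_i$$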
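S = 1 (reference NOR gate)
$$\frac{C_{g,\text{nor}}}{C_{g,\text{inv}}} = \frac{5}{3} \;\text{(2 input)}, \qquad \frac{2N+1}{3} = g_{\text{nor}} \;\text{(N input)}$$
$$\frac{C_{\text{int,nor}}}{C_{\text{int,inv}}} = \frac{6}{3} = 2 \;\text{(2 input)}, \qquad \frac{2N+N}{3} = N = p_{\text{nor}} \;\text{(N input)}$$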
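S = 1 (reference NAND gate)
$$\frac{C_{g,\text{nand}}}{C_{g,\text{inv}}} = \frac{4}{3} \;\text{(2 input)}, \qquad \frac{N+2}{3} = g_{\text{nand}} \;\text{(N input)}$$
$$\frac{C_{\text{int,nand}}}{C_{\text{int,inv}}} = \frac{6}{3} = 2 \;\text{(2 input)}, \qquad \frac{2N+N}{3} = N = p_{\text{nand}} \;\text{(N input)}$$
For 3 input: g = 5/3, p = 3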
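The N-input formulas for g and p can be tabulated with a small helper. This is a sketch only: the function name `logical_effort` and the gate labels are assumptions, and it covers just the inverter, NAND and NOR cases given in these slides.

```python
from fractions import Fraction


def logical_effort(gate, n_inputs=1):
    """Return (g, p) for an inverter, N-input NAND, or N-input NOR."""
    if gate == "inv":
        return Fraction(1), Fraction(1)
    if gate == "nand":
        return Fraction(n_inputs + 2, 3), Fraction(n_inputs)
    if gate == "nor":
        return Fraction(2 * n_inputs + 1, 3), Fraction(n_inputs)
    raise ValueError(f"unsupported gate type: {gate}")


for gate, n in [("inv", 1), ("nand", 2), ("nand", 3), ("nor", 2), ("nor", 3)]:
    g, p = logical_effort(gate, n)
    print(f"{gate}{n}: g = {g}, p = {p}")
```

Exact fractions are used so that values such as 4/3 and 5/3 print as written on the slides rather than as rounded decimals.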
Example
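• $F = 10C_{g1}/C_{g1} = 10$
• $G = g_1 g_2 g_3 = 5/3 \times 5/3 \times 1 = 25/9$
• $FG = H = 250/9 \rightarrow h = (250/9)^{1/3} \approx 3$
• $f_i g_i = h = 3$
• $f_1 = 3/5 \times 3 = 9/5 = 1.8$
• $f_2 = 3/5 \times 3 = 9/5 = 1.8$
• $f_3 = 3$
• → This completes the design → determine the scale factors ($S_1$, $S_2$, $S_3$)
• → Determine the overall delay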
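$$t_{p,min} = t_{p0,\text{inv}} \sum_{i=1}^{3} \left(p_i + \frac{f_i g_i}{\gamma_{\text{inv}}}\right) = 100\,\text{ps} \times (2 + 3 + 3 + 3 + 1 + 3) = 1500\,\text{ps}$$
$$t_{p1} = 100\,\text{ps} \times 5 = 500\,\text{ps}$$
$$t_{p2} = 100\,\text{ps} \times 6 = 600\,\text{ps}$$
$$t_{p3} = 100\,\text{ps} \times 4 = 400\,\text{ps}$$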
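The per-stage and total delays of this example can be reproduced with a short Python sketch. It takes the slide's values ($t_{p0,inv}$ = 100 ps, parasitic delays 2, 3, 1, stage effort 3) and assumes $\gamma_{inv} = 1$; the function name is illustrative.

```python
def path_delay(t_p0, parasitics, stage_efforts, gamma=1.0):
    """Return (per-stage delays, total) for t_p0 * sum(p_i + f_i * g_i / gamma)."""
    per_stage = [t_p0 * (p + fg / gamma) for p, fg in zip(parasitics, stage_efforts)]
    return per_stage, sum(per_stage)


# Values from the example: t_p0 = 100 ps, p = (2, 3, 1), f_i * g_i = 3 per stage.
per_stage, total = path_delay(100.0, [2, 3, 1], [3, 3, 3])
print("per-stage delays (ps):", per_stage)
print("total delay (ps):", total)
```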
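$$f_1 = \frac{C_{g2}}{C_{g1}} \;\rightarrow\; f_1 C_{g1} = C_{g2}$$
$$f_1 S_1 C_{g1,ref} = S_2 C_{g2,ref}$$
$$f_1 S_1 g_{2nor} = S_2 g_{3nand}$$
$$S_2 = 1.8 \times 1 \times \frac{5}{3} \times \frac{3}{5} = 1.8$$
$$f_2 S_2 g_{3nand} = S_3 g_{inv}$$
$$S_3 = 1.8 \times 1.8 \times \frac{5}{3} = 5.4$$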
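The scale factors follow from the recurrence $S_{i+1} g_{i+1} = f_i g_i S_i$. The sketch below applies it to the example's numbers (2-input NOR, 3-input NAND, inverter); the function name is an assumption made for illustration.

```python
def scale_factors(fs, gs, s1=1.0):
    """Apply S_{i+1} = f_i * S_i * g_i / g_{i+1}, starting from S_1 = s1."""
    s = [s1]
    for i in range(len(gs) - 1):
        s.append(fs[i] * s[i] * gs[i] / gs[i + 1])
    return s


# Example values: f = (1.8, 1.8, 3), g = (5/3, 5/3, 1) for NOR2, NAND3, inverter.
sizes = scale_factors([1.8, 1.8, 3.0], [5.0 / 3.0, 5.0 / 3.0, 1.0])
print("S1, S2, S3 =", [round(s, 3) for s in sizes])
```

Only the first N-1 electrical efforts are used; the last one, $f_3$, fixes the ratio of the load to the final gate's input capacitance.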
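[Figure: four-stage path (stages 1 to 4) with on-path gate capacitances $C_{g1}$, $C_{g2}$, $C_{g3}$, $C_{g4}$, load $C_L$, and off-path capacitances $C_{o2,1}$, $C_{o2,2}$ and $C_{o4}$.]
$$b_1 = \frac{C_{g2} + C_{o2}}{C_{g2}}; \qquad C_{o2} = C_{o2,1} + C_{o2,2}$$
$$b_2 = \frac{C_{g3}}{C_{g3}} = 1; \qquad b_3 = \frac{C_{g4} + C_{o4}}{C_{g4}}, \qquad b_4 = 1$$
$$F = \frac{C_L}{C_{g1}} = \frac{f_1}{b_1}\,\frac{f_2}{b_2}\,\frac{f_3}{b_3}\,\frac{f_4}{b_4} \quad \rightarrow \text{on-path capacitance ratio}$$
$$= \left[\frac{C_{g2} + C_{o2}}{C_{g1}} \cdot \frac{C_{g2}}{C_{g2} + C_{o2}}\right] \left[\frac{C_{g3}}{C_{g2}} \cdot \frac{C_{g3}}{C_{g3}}\right] \left[\frac{C_{g4} + C_{o4}}{C_{g3}} \cdot \frac{C_{g4}}{C_{g4} + C_{o4}}\right] \left[\frac{C_L}{C_{g4}}\right]$$
$$F = \frac{\prod f_i}{\prod b_i} = \frac{\prod f_i}{B} \;\rightarrow\; \prod f_i = FB$$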
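To make the bookkeeping concrete, the sketch below computes each $f_i$ and $b_i$ from on-path and off-path capacitances and checks that $\prod f_i = FB$. The capacitance values are made up for illustration, and the function name is hypothetical.

```python
from math import prod


def branching_analysis(c_on, c_off):
    """c_on = [C_g1, ..., C_gN, C_L]; c_off[i] = off-path load at the output of stage i+1."""
    n = len(c_off)
    f = [(c_on[i + 1] + c_off[i]) / c_on[i] for i in range(n)]      # stage electrical effort
    b = [(c_on[i + 1] + c_off[i]) / c_on[i + 1] for i in range(n)]  # stage branching effort
    F = c_on[-1] / c_on[0]                                          # on-path capacitance ratio
    B = prod(b)                                                     # path branching effort
    return f, b, F, B


# Assumed capacitances (arbitrary units): branches after stages 1 and 3 only.
f, b, F, B = branching_analysis([1.0, 2.0, 4.0, 8.0, 16.0], [4.0, 0.0, 8.0, 0.0])
print("f =", f, "b =", b)
print("F =", F, "B =", B, "prod(f) =", prod(f))
```

Stages without a branch get $b_i = 1$, matching $b_2$ and $b_4$ in the equations above.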
Branching
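$$GFB = H = \prod_i f_i g_i$$
Same as the unbranched case:
$$f_i g_i = h \;\rightarrow\; h = H^{1/N}$$
$$\prod_i f_i g_i = \prod_i \frac{c_{i+1,on} + c_{i+1,off}}{c_i}\, g_i = \prod_i \frac{c_{i+1,on}}{c_i} \cdot \frac{c_{i+1,on} + c_{i+1,off}}{c_{i+1,on}}\, g_i = GFB$$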
$$t_{p,min} = t_{p0,\text{inv}} \sum_{j=1}^{N} \left(p_j + \frac{f_j g_j}{\gamma}\right) = t_{p0,\text{inv}} \left(\sum_{j=1}^{N} p_j + \frac{N H^{1/N}}{\gamma}\right)$$
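$$\frac{f_i}{b_i} = \frac{C_{g,i+1}}{C_{g,i}} \;\rightarrow\; C_{g,i+1} = \frac{f_i}{b_i}\, C_{g,i}$$
($f_i / b_i$ is the electrical effort after factoring out the branching effort.)
$$\frac{f_i}{b_i}\, S_i\, g_i\, C_{ref,inv} = S_{i+1}\, g_{i+1}\, C_{ref,inv}$$
$$S_{i+1} = \frac{f_i}{b_i}\, S_i\, \frac{g_i}{g_{i+1}}$$
$$S_i = \frac{g_1 S_1}{g_i} \prod_{j=1}^{i-1} \frac{f_j}{b_j}$$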
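The recurrence and the closed form for the scale factors should agree. The following sketch evaluates both for a hypothetical three-stage path; the gate types, efforts and branching values are assumptions chosen only to exercise the formulas.

```python
from math import prod


def scale_factors_recursive(fs, bs, gs, s1=1.0):
    """S_{i+1} = (f_i / b_i) * S_i * g_i / g_{i+1}."""
    s = [s1]
    for i in range(len(gs) - 1):
        s.append(fs[i] / bs[i] * s[i] * gs[i] / gs[i + 1])
    return s


def scale_factor_closed_form(i, fs, bs, gs, s1=1.0):
    """S_i = (g_1 * S_1 / g_i) * prod_{j=1}^{i-1} f_j / b_j, with 1-based stage index i."""
    ratio = prod(fs[j] / bs[j] for j in range(i - 1))
    return gs[0] * s1 / gs[i - 1] * ratio


# Assumed path: inverter, 2-input NOR, inverter, with a branch after stage 1.
gs = [1.0, 5.0 / 3.0, 1.0]
fs = [4.0, 2.0, 3.0]
bs = [2.0, 1.0, 1.0]
print(scale_factors_recursive(fs, bs, gs))
print([scale_factor_closed_form(i, fs, bs, gs) for i in (1, 2, 3)])
```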
[Figure: driver examples labeled $C_{inv}$ and $1000C_{inv}$; $C_{inv}$ and $C_{out}$; $C_{inv}$, $C_A$ and $C_{out}$.]
$$\text{Delay} = \sum_i T_{p0}\,(p_i + g_i f_i) = \sum_i T_{p0} \left(p_i + g_i \frac{c_{i+1}}{c_i}\right)$$
$$g_{i-1} \frac{c_i}{c_{i-1}} = g_i \frac{c_{i+1}}{c_i}$$
• Back to basics: a constant gf across all stages results in minimum delay.
• The concept of FO4 applies.
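One way to see why FO4 shows up is to search for the number of inverter stages that minimizes the delay when driving a load of $1000C_{inv}$. The sketch below assumes an inverter-only chain with g = 1 and p = 1 and measures delay in units of the unit delay; the function names and the search limit are illustrative choices.

```python
def chain_delay(n_stages, path_effort, p=1.0, g=1.0):
    """Normalized delay N * (p + g * f), with equal stage fanout f = path_effort ** (1 / N)."""
    f = path_effort ** (1.0 / n_stages)
    return n_stages * (p + g * f)


def best_stage_count(path_effort, max_stages=12):
    """Stage count in 1..max_stages with the smallest normalized delay."""
    return min(range(1, max_stages + 1), key=lambda n: chain_delay(n, path_effort))


n_best = best_stage_count(1000.0)
print("best number of stages:", n_best)
print("fanout per stage:", 1000.0 ** (1.0 / n_best))
print("normalized delay:", chain_delay(n_best, 1000.0))
```

Under these assumptions the search settles on five stages with a per-stage fanout close to 4.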
[Figure: sizing example with labels w, x, y and 1, A, B, where A = 8 and B = 64.]
$$P_{dyn} = C_L V_{DD}^2 f_{0\to1}$$
$$f_{0\to1} = \alpha_{0\to1} f_{clk}$$
$\alpha_{0\to1}$: number of 0→1 transitions in $T_{clk}$ (activity factor)
$\alpha_{0\to1} > 0$, and it can be greater than 1.
The energy for 0→1 and 1→0 transitions and $P_{dyn}$ do not depend on the NMOS and PMOS sizes.
$P_{dyn}$ can be reduced by reducing $V_{DD}$, $C_L$ and $\alpha$.
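A small Python sketch of the dynamic power formula shows how strongly each knob acts. The numbers (10 fF, 1.0 V, activity 0.1, 1 GHz) are assumed example values, not figures from the lecture.

```python
def dynamic_power(c_load, v_dd, alpha, f_clk):
    """P_dyn = C_L * V_DD^2 * alpha_{0->1} * f_clk, in watts for SI inputs."""
    return c_load * v_dd ** 2 * alpha * f_clk


# Assumed example: C_L = 10 fF, V_DD = 1.0 V, alpha = 0.1, f_clk = 1 GHz.
p_nominal = dynamic_power(10e-15, 1.0, 0.1, 1e9)
p_half_vdd = dynamic_power(10e-15, 0.5, 0.1, 1e9)
print(f"nominal:    {p_nominal * 1e6:.3f} uW")
print(f"half V_DD:  {p_half_vdd * 1e6:.3f} uW")
```

Because $V_{DD}$ enters squared, halving it cuts $P_{dyn}$ to a quarter, whereas $C_L$ and $\alpha$ act linearly.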
$$\alpha_{0\to1} = p_0\, p_1 = p_0 (1 - p_0)$$
$$\alpha_{0\to1} = \frac{N_0}{2^N} \cdot \frac{N_1}{2^N} = \frac{N_0 (2^N - N_0)}{2^{2N}}$$
$N_0$: the number of zero entries in the output column of the truth table
$N_1$: the number of one entries in the output column of the truth table
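The truth-table formula can be evaluated by enumerating all input combinations. This sketch assumes independent, equally likely inputs, which is what the $2^N$ denominators imply; the function name is illustrative.

```python
from fractions import Fraction
from itertools import product


def activity_factor(func, n_inputs):
    """alpha_{0->1} = N0 * N1 / 2^(2N), counted from the truth table of func."""
    outputs = [int(bool(func(*bits))) for bits in product((0, 1), repeat=n_inputs)]
    n0 = outputs.count(0)
    n1 = len(outputs) - n0
    return Fraction(n0 * n1, 4 ** n_inputs)


print("2-input AND:", activity_factor(lambda a, b: a & b, 2))
print("2-input NOR:", activity_factor(lambda a, b: not (a | b), 2))
print("2-input XOR:", activity_factor(lambda a, b: a ^ b, 2))
```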
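[Figure: two implementations of F from inputs A, B, C, D with intermediate nodes O1 and O2. Chain: O1 is formed from A and B, O2 from O1 and C, and F from O2 and D. Tree: O1 is formed from A and B, O2 from C and D, and F from O1 and O2.]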
                  O1       O2       F
P1, chain         1/4      1/8      1/16
P0, chain         3/4      7/8      15/16
P(0→1), chain     3/16     7/64     15/256
P1, tree          1/4      1/4      1/16
P0, tree          3/4      3/4      15/16
P(0→1), tree      3/16     3/16     15/256
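The table entries can be reproduced by propagating signal probabilities through the two structures. The sketch assumes 2-input AND gates and independent inputs with P(1) = 1/2, which is consistent with the P1 values in the table; the helper names are hypothetical.

```python
from fractions import Fraction


def and_p1(p_a, p_b):
    """P(output = 1) of a 2-input AND with independent inputs."""
    return p_a * p_b


def alpha_01(p1):
    """P(0->1) = P0 * P1."""
    return (1 - p1) * p1


half = Fraction(1, 2)

# Chain: O1 = A.B, O2 = O1.C, F = O2.D
chain_o1 = and_p1(half, half)
chain_o2 = and_p1(chain_o1, half)
chain_f = and_p1(chain_o2, half)
chain = [alpha_01(p) for p in (chain_o1, chain_o2, chain_f)]

# Tree: O1 = A.B, O2 = C.D, F = O1.O2
tree_o1 = and_p1(half, half)
tree_o2 = and_p1(half, half)
tree_f = and_p1(tree_o1, tree_o2)
tree = [alpha_01(p) for p in (tree_o1, tree_o2, tree_f)]

print("chain:", chain, "sum =", sum(chain))
print("tree: ", tree, "sum =", sum(tree))
```

Summing the three P(0→1) values gives 91/256 for the chain and 111/256 for the tree under these assumptions.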
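[Figure: two input orderings for a three-input function F. Left: A and B are combined first, then C. Right: B and C are combined first, then A.]
$P(A = 1) = 0.5$
$P(B = 1) = 0.2$
$P(C = 1) = 0.1$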
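The effect of input ordering can be estimated from the intermediate node's transition probability. The sketch below assumes both stages are 2-input AND gates with independent inputs; the gate type is not stated in the extracted text, so treat it as an assumption.

```python
from fractions import Fraction

# Signal probabilities from the slide.
p = {"A": Fraction(1, 2), "B": Fraction(1, 5), "C": Fraction(1, 10)}


def alpha_01(p1):
    """P(0->1) = P0 * P1."""
    return (1 - p1) * p1


def intermediate_activity(first, second):
    """Transition probability of the internal node when `first` AND `second` are combined first."""
    return alpha_01(p[first] * p[second])


ab_first = intermediate_activity("A", "B")
bc_first = intermediate_activity("B", "C")
output_activity = alpha_01(p["A"] * p["B"] * p["C"])  # same for both orderings

print("A.B first:", ab_first, "=", float(ab_first))
print("B.C first:", bc_first, "=", float(bc_first))
print("output F: ", output_activity, "=", float(output_activity))
```

Under the AND-gate assumption, combining the two least active inputs (B and C) first gives the lower internal-node activity, while the output activity is the same for both orderings.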