Post-Si Validation Tutorial
Post-Si Validation Tutorial
net/publication/299562791
CITATIONS READS
5 1,765
3 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by T.M. Mak on 02 April 2016.
6000
~1/4 sec
Pentium 4 of real time
5000
execution
4000
3000
2000
1000 Full-
chip
0
40'98
43'98
46'98
49'98
52'98
03'99
06'99
09'99
12'99
15'99
18'99
21'99
24'99
27'99
30'99
33'99
36'99
39'99
42'99
45'99
48'99
51'99
09/29/03 TM Mak Post Silicon Validation 5
Cost of a processor bug
10B$
Lost sales
1B$
Recall d,
n
f ou is
Cost
100M$
g is e it
b u siv
r a pen
10M$ late e ex
he or
Time-to-market T m
the
1M$
spec
Margin to spec
09/29/03 TM Mak Post Silicon Validation 10
Fail Shmoos
VDD/Freq Fail Shmoo Pattern List: ALU
+----+----+----+----+----+----+----+----+
40.0 MHz |***************AAAAAABBCCCCCCCCCCCCCCCCCC| 25 ns
38.5 MHz |*****************AAAAABBCCCCCCCCCCCCCCCCC|
37.0 MHz |********************AAAAABBCCCCCCCCCCCCCC|
26
27
ns
ns
• Indicates
35.7 MHz |**********************AAAABBCCCCCCCCCCCCC|
34.5 MHz |************************AAAABBBCCCCCCCCCC|
28
29
ns
ns performance curve of
33.3 MHz |*************************AAAABBBBCCCCCCCC|
32.3 MHz |***************************AAABBBBCCCCCCC|
30
31
ns
ns various failure modes
31.2 MHz |****************************AAABBBBCCCCCC| 32 ns
30.3 MHz |*****************************AABBBBBCCCCC| 33 ns – Letter order is NOT
29.4 MHz |******************************AABBBBBCCCC| 34 ns
27.8 MHz |*******************************ABBBBBCCCC| 36 ns pattern order
27.0 MHz |*******************************ABBBBBCCCC| 37 ns
26.3 MHz |********************************ABBBBBCCC|
25.0 MHz |********************************ABBBBBCCC|
38
40
ns
ns
• EG:
23.8 MHz |*********************************ABBBBCCC|
23.3 MHz |*********************************DDEEECCC|
42
43
ns
ns – ‘A’ is the first speed-
22.2 MHz |*********************************DDDEEECC|
20.8 MHz |*********************************DDDDEEEC|
45
48
ns
ns
limiting failure mode at
20.0 MHz |**********************************DDDEEEC|
18.9 MHz |**********************************DDDDEEE|
50
53
ns
ns
high frequency & low
17.9 MHz |**********************************DDDDDEE| 56 ns VDD
16.9 MHz |**********************************DDDDDDE| 59 ns
+----+----+----+----+----+----+----+----+
VDD 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0
– ‘D’ and ‘E’ are hard
low voltage failures
Character Cycle Vector Pattern
---------------------------------------------------- modes even at slow
A
B
116392
89228
2347
5467
gecc1000
geba1113 speed.
C 122 55 ext003
D 83535 1178 geba1113
E 83288 855 geba1113
750 Legend
force_temperature Temp range
Count Axis
500 0
110
250
FF INF SNaN SNaN SNaN SNaN SNaN SNaN SNaN SNaN SNaN QNaN QNaN QNaN QNaN QNaN QNaN QNaN QNaN QNaN QNaN
FE
F0-FD
Ex
Dx
Cx
Bx
Ax
9x
8x
7x
6x
5x
4x
3x
2x
1x
02-0F
01
00 0 DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN DEN
• This table represents the dataspace for a single precision operand (32-bits). The
yellow areas represent the special portions of the dataspace: Zero, Infinity,
Denormals, and NaNs. These rows account for less than 1% of the dataspace.
• The green areas representing “interesting” areas because they contain boundary
conditions. These areas also account for less than 1% of the dataspace.
• The blue areas represent the remaining 98%+ of the dataspace
VCTERM VCTERM
1.89 in 2.12 in
0.8 in
(48 mm) (54 mm)
iA Processor iA Processor iA Chipset
0.1 in
Charge Device
Model
- -
- -
-
- -
- - IESD -
- - -
Machine Model
di/dt
is large -
- -
-
-
IESD
-
-
-
- -
Human Body
Model
09/29/03 TM Mak Post Silicon Validation 41
Infant Mortality and Life-test
Infant Mortality (declining failure rate)
Failure Due to Latent Reliability Defects
Goals: 500 DPM within 0-30 days & 200 Wearout (increasing failure rate)
Rate Due to oxide wearout, EM, hot-e,
FIT within 0-1 year
etc Goal: <0.1% failing for intrinsic
Cumulative Fallout Vs. Time reliability mechanisms
(follows a lognormal distribution)
~1 year 7 YR Time
Wearout
Target
0hr 12hr
6hr 24hr 48hr 168hr 500hr 1000hr
Models
↓ Specification
↓ Property
HDL -- Behavior =? Verification
↓ HDL -- RTL behavior of nand4 is
begin process(a,b,c,d) Usually piecemeal,
begin
DESIGN
Detect Errors
ERRORS AM
RE
ST
UP
Prevent Errors VE
MO
09/29/03 TM Mak Post Silicon Validation 48
Diagnosis tools
• Controllability & observability of 1s and 0s
– States and data
• Use DFT (design-for-test) and DFD (design-for-debug)
hooks
– Scan, and everything else
• Fault diagnosis T2 T1
– Identify set of all modeled faults
– Use fault simulation to find which tests detect which faults
– Given pass / fail status of a specimen for each test, intersect
fault lists to diagnose the fault
– Success is a function of how well the modeled faults match
reality
• Diagnosis tools are often of little help for silicon debug
Fault List
09/29/03 TM Mak Post Silicon Validation 49
Silicon Debug
• Runs all sort of tests on a sample of new units
• Avoid chasing after defects
– If failure signatures (on new samples) are random, skip until
you can find systematic failures
• Try different frequencies, voltages, temperatures and
get the test to pass
– If no pass region, it is a logic error (functional bug)
– If a passing region can be found, it is a circuit problem
– If it works at lower frequency, it is a performance (delay)
problem
– Shmoo holes can exist (fails only under specific conditions
and not above or below)
Tester System
Expensive Cheap
Per pin control Hard to control pins
Deterministic Non-deterministic
Hard vector generation Easy vector generation
Enables easy debug Complicated debug
1 2 3 4 5 6
Sample Holder
and fixturing
Probestation base
-
Filter grid Servo Amp
+ Vref
Secondary electrons
emitted from surface
IC Line: Potential Vs
Metal2
Metal3
Die
Backside
Substrate
a) Flip Chip MCM with 2 Chips A & B and LICAs. Chip B b) First Globally thin Chip B only to a thcikness
needs to be probed ~100µm - using a fast wet chemical etch
Metal 1 10µm
d) Magnified view of region in circle in c). Final Probe hole c) Magnified view of silicon chip B in MCM in the region
drilled at the base of the LCE generated trench to expose an of the circle. Use Laser Chemical Etch to mill a local
N+ diffusion (NAC) diode. The E-Beam probes the N+ trench to within 10 µ m of the P-Well and active circuits .
diffusion directly. The tapered holes improve electron The trench walls are sloped to minimize the amount of
collection through the hole for the E-Beam probing and FIB silicon removed.
imaging
Stepped (tapered)
probe hole
P+
Probe hole back
filled with Tungsten
to enhance contrast ILD0 NAC Metal 1
of SEM cross
section (not present
for probing)
Coarse LCE
Trench Etch Step
Fine LCE
Trench Etch Step
Silicon
Substrate
Optical
power
power
• Place DFD probe-
able diodes at gate time time
inputs With NO input With applied
electrical signal electrical signal
IR Objective
Mode Locked Laser
Lens
100MHz, 1.064µm,
~30ps pulse width
Photodiodes
To Detection
Laser sampling pulses Electronics
time
09/29/03 TM Mak Post Silicon Validation 100
Picosecond Imaging Circuit
Analysis
Lands
Cl2 filled Cell
SiCl Active Area
Silicon
Substrate
Focused Gas
Focused Deliver
Ion Beam
Ion Beam Needle
1um
Metal Line (signal)
Drain
Contact
re: Asenov et al
09/29/03 TM Mak Post Silicon Validation 114
Other debug/analysis tools
• Material analysis tools
– Fab/assembly process debug
be?
cracks through
the layers
Delamination at
interfaces
Cracks in plated
through hole Via adhesion
issues
C4 Bump
C4 Bump Void
Plated
Through
Hole
Stacked Vias Solder joints Here, the package is intact and has not
been tampered with in the region of
interest
IR radiation
High speed
IR camera
Image/data
processing defect
Heat conduction
Flash Lamps
Monitor surface ∆T as
Surface excitation Data process
a function of time (in msec)
IR image
IR image
Reflection SAM
image
Good
region