0% found this document useful (0 votes)

143 views6 pages

DFT Strategy For Arm Cores

DFT Strategy for Arm Cores

Uploaded by

yellow51

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

143 views6 pages

DFT Strategy For Arm Cores

DFT Strategy for Arm Cores

Uploaded by

yellow51

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Product How To: DFT strategy for ARM

processor-based designs
Chris AllsupKun Chung, - January 22, 2013

One of the most significant design trends of the decade is the widespread use of ARM® multicore
processors in systems-on-chip (SoCs). Designers’ ability to easily and cost-effectively employ
multiple, high-performance embedded processors as needed to meet the computational
requirements of the end application has helped fuel the explosive growth in mobile computing,
networking infrastructure, and digital home entertainment systems. But from the design-for-test
(DFT) perspective, is there a strategy for easily and cost-effectively testing multicore designs? A key
challenge is already emerging: as the number of processor cores increases, it has become
increasingly difficult to maintain high test quality without a requisite increase in cost stemming from
the need to allocate substantially more pins for digital test.

This is an important consideration at Samsung Electronics, which designs a variety of SoCs

containing ARM multicore processors. Simply adding more chip-level pins for testing conflicts with
packaging constraints and can potentially undermine other cost-saving techniques that rely on
utilizing fewer pins (see sidebar on multisite testing). What is needed instead is a DFT strategy
optimized for designs that use multicore processors—a strategy in which the architecture and
automation elements work in tandem to lower test cost without compromising test quality or
significantly increasing automatic test pattern generation (ATPG) runtime.

This article provides an example of an optimized DFT architecture, referred to as “shared I/O.” It is
enabled by Synopsys’ synthesis-based test solution, which has been used successfully in Samsung’s
multicore processor designs. The experience demonstrates that shared I/O is a better approach than
the standard DFT architecture for testing multicore designs since it reduces test costs by utilizing
fewer pins while providing the same or better test time reduction.

Dedicated test pins

Figure 1 shows a standard DFT architecture for a design with quad-core processors and user-defined
logic at the top level. The cores need not be identical but all blocks contain an embedded scan
compressor-decompressor (CODEC) to reduce the number of tester cycles required for achieving
high test coverage, which in turn reduces test cost [1, 2].
Figure 1. The standard DFT architecture for a quad-core design with CODECs embedded in
each core and at the top level. Because each CODEC uses its own dedicated pins, the
number of pins needed to test the design grows large as more processor cores are added.

The amount of compression implemented for a particular CODEC determines the number and length
of its scan chains, and is chosen to ensure an approximately uniform scan chain length L across all
the digital logic in the design. While more compression shortens the chain length and achieves
greater test time reduction, the amount of compression applied is constrained in practice by a
minimum number of scan inputs to the CODEC as well as routing considerations. Even so, as the
number of cores increases, it becomes essential to keep the number of scan I/O needed to test each
CODEC reasonably small to avoid exceeding the number of chip-level pins available for testing.

Shared test pins

The optimized DFT architecture is illustrated in Figure 2. The chip-level test pins are uniformly
connected to all the CODECs in the design, and an integration block added to the CODEC outputs
enables full observability of the scan chains. Assuming a design contains N CODECs, then log2(N)
test input pins are allocated to select which logic is being observed on the output pins.

Figure 2. The optimized architecture shares uniformly-connected test input pins and uses
integration logic to observe the CODEC outputs. The pin count increases by just log2(N)
with the number of processor cores, N.

Synopsys’ DFTMAX™ compression was used to implement both DFT architectures for a 20-nm
design containing four identical ARM processor cores, one of the latest versions of the Cortex-A
series, plus user-defined logic. Only a few modifications to the original DFT scripts were required to
implement the optimized architecture. Table 1 compares test pin count and normalized TetraMAX®
ATPG stuck-at pattern count results for the standard versus shared I/O architectures using
equivalent chain length and high fault coverage for both scenarios:

Table 1. Shared I/O results in fewer ATPG patterns than the standard architecture even
when only half as many pins are used.

Despite consuming half the test pin resources, shared I/O required substantially fewer patterns (and
tester cycles)—30% fewer for the power-aware patterns, which are generated to avoid false failures
during production testing [3]. The decrease in pattern count can be explained in part by an increase
in ATPG efficiency that comes “for free” when the scan inputs are shared among multiple cores.
However, in this application, designers also used additional tools in the Synopsys product, enabled
when processor cores are identical, that enhance both pattern efficiency and the ability to isolate
defective parts. Optimizations for identical cores
Optimizations for identical cores
The select lines going into the integration block in Figure 2 control logic were used to isolate
defective scan chains, making it possible to determine which of several faulty values belong to which
scan chains in which cores. The integration block also contains XOR trees that provide virtually the
same high observability as the standard approach that relies on dedicated connections to output
pins. When the processor cores are identical, it is possible to improve diagnostics accuracy using a
technique borrowed from image processing. “Swizzling” or rotating the order of a CODEC’s outputs
with respect to the order used in its neighboring CODEC, depicted in Figure 3, ensures that a fault
can be detected and isolated to a particular core.

Figure 3. “Swizzling” or port bit rotation is one of the techniques DFTMAX uses to improve
diagnostics and ATPG efficiency for designs utilizing identical processor cores.
As an added benefit, scan chain isolation and output port rotation provide TetraMAX enhanced
ability to manage unknown logic values (X’s) captured across multiple scan chains and cores and,
more generally, to improve fault coverage, pattern count, and runtime. Pattern generation efficiency
is improved further through the use of automation that focuses the ATPG effort on a single processor
core while fault simulating the entire set. These DFT and ATPG optimizations, applied in
combination, make it possible to trade-off test pin count versus test cycle count reduction to achieve
significant cost savings when utilizing multiple identical processor cores.

Flexible architecting
We have observed that the shared I/O strategy, with minor variations to the architecture of Figure 2,
holds up well as we scale the number of processor cores up or down. If a large portion of the logic in
a design is external to the cores, we can easily implement a hybrid arrangement wherein a subset of
the pins are shared among the identical cores while other pins are dedicated to the top-level
CODEC. When testing this “mixed-shared” variant on the same quad-core design using 33 pins, we
found the pattern counts decreased by about 15% compared with the standard architecture of
Figure 1.

In contrast, when the processor core count is relatively high, the blocks are partitioned across the
SoC based on topology constraints that might be at odds with sharing all the scan inputs and outputs
among all the cores. For these large multi-processor designs, sharing subsets of pin groups among
subsets of identical cores as shown in Figure 4—a feature automated in DFTMAX—avoids routing
congestion and timing issues. For example, we recently implemented group-shared I/O for a large
design with many processor cores, a strategy that led to the same test quality benefits as the
traditional approach but utilized fewer than half the test pins. This freed-up more ATE channels for
multi-site testing that resulted in a 59% reduction in test execution cost per wafer.

Figure 4. Group-shared I/O reduces routing overhead for partitioned designs with many
identical processor cores.
Conclusion
A DFT strategy optimized for testing multicore processor designs is needed to achieve both high-
quality and cost-effective manufacturing test, especially as the number of cores per design
increases. The shared I/O architecture we have highlighted lowers the cost of testing ARM
processor-based designs and other multicore SoCs in two fundamental ways: First, it reduces the
number of test pins required for efficient compression of high-coverage test patterns, which
decreases packaging costs and facilitates deployment of other cost-saving methodologies such as
multisite testing. Second, it reduces ATPG pattern count, which decreases test cycle time and,
therefore, test execution cost. Automation in the test solution enables flexible implementation of
shared I/O architecture variants. Optimizations such as scan chain isolation and output port rotation,
embedded in the integration logic, improve diagnostics accuracy and facilitate greater ATPG
efficiency for making economical pin-count and pattern-count tradeoffs.

Sidebar: Multi-site Testing

Multi-site testing is a technique that reduces test time and cost by screening multiple dies
simultaneously. It uses an ATE system fitted with a probe card specifically designed for parallel load
operations. The maximum degree of parallelism depends on the number of ATE channels and the
number of pins per die available for testing. The cost saving from multisite testing increases with the
multi-site count (i.e., the degree of parallelism), M, according to:

where α is a constant. The exponential reflects the efficiency of the system, which declines as the
parallelism increases for a variety of reasons. For instance, if M is large, stepping over a wafer will
not fit into the probe card’s footprint, leading to more touchdowns than the number of devices per
wafer divided by M [4].

In many situations mixed signal, embedded memory, flash, and quiescent current testing are the
time bottlenecks, with scan testing consuming only a small portion of the total test time even though
it requires the most pins. When production volumes are very high, substantial cost savings can be
achieved by “sacrificing” some of these pins in the interests of parallelism that reduces total test
time even though the scan test time itself increases.

If you liked this feature, and would like to see a weekly collection of related features
delivered directly to your inbox, sign up for the Test & Measurement World newsletter
here.

References
1. DFTMAX Compression Backgrounder, Fall 2009, Synopsys, Inc.
2. Allsup, C., “The Economics of Implementing Scan Compression to Reduce Test Data Volume and
Test Application Time,” Proc. International Test Conf., 2006.
3. Bahl, S.; Mattiuzzo, R.; Khullar, S.; Garg, A.; Graniello, S.; Abdel-Hafez, K.S.; Talluto, S., “State
of the Art Low Capture Power Methodology,” Proc. International Test Conf., 2011.
4. Kuntzsch, C.; Shah, M.; Mittermaier, N., “Massive Test Cost Reduction by Advanced SCAN
Testing,” SNUG Germany 2010 Proceedings.

About the Authors

Kun Young Chung is the DFT manager of the Design Technology Team at Samsung Electronics. He
received BS and MS degrees from Seoul National University in 1996 and 1998, and a MS and PhD in
electrical engineering from the University of Southern California in 2002 and 2008. Kun Young’s
research interests include delay testing, power-aware DFT, scan compression, and 3D IC testing.

Chris Allsup, marketing manager in Synopsys’ synthesis and test group, has more than 20 years
combined experience in IC design, field applications, sales, and marketing. He earned a BSEE
degree from UC San Diego and an MBA degree from Santa Clara University. Chris has authored
numerous articles and papers on design and test.

● Fundamentals of Multicore Processing

● Multicore and virtualization in automotive environments
● Getting around multicore walls: the road less traveled
● Synopsys unveils multicore optimization technology

M32 Opcenito
No ratings yet
M32 Opcenito
14 pages
Overview of Scan Insertion
No ratings yet
Overview of Scan Insertion
3 pages
Tetramax DF Ds
No ratings yet
Tetramax DF Ds
4 pages
Benefits of Moving To Plug-And-Play Hierarchical DFT
No ratings yet
Benefits of Moving To Plug-And-Play Hierarchical DFT
5 pages
Importent Document For DFT PDF
No ratings yet
Importent Document For DFT PDF
10 pages
Fault Models PDF
No ratings yet
Fault Models PDF
33 pages
Start Practice Exam Test Questions Part 1 of The Series
No ratings yet
Start Practice Exam Test Questions Part 1 of The Series
20 pages
DFT Some
No ratings yet
DFT Some
79 pages
ATPG and Fault Simulation: Alberto Bosio
No ratings yet
ATPG and Fault Simulation: Alberto Bosio
63 pages
Mbist - GD
No ratings yet
Mbist - GD
350 pages
4 Bist
No ratings yet
4 Bist
165 pages
Liebert PEX3 - UM-ENG-ASIA
No ratings yet
Liebert PEX3 - UM-ENG-ASIA
125 pages
DFT Lbist PDF
100% (1)
DFT Lbist PDF
10 pages
Memory Bist Insertion Time
No ratings yet
Memory Bist Insertion Time
4 pages
Fault Simulation
No ratings yet
Fault Simulation
46 pages
05 Testing PDF
No ratings yet
05 Testing PDF
118 pages
Design For Testing
No ratings yet
Design For Testing
14 pages
Memory Fault Types
No ratings yet
Memory Fault Types
22 pages
Table of Contents: The IDDQ Mystery
No ratings yet
Table of Contents: The IDDQ Mystery
9 pages
Fault Models, Detection & Simulation Fault Models, Detection & Simulation - Organization Organization
No ratings yet
Fault Models, Detection & Simulation Fault Models, Detection & Simulation - Organization Organization
31 pages
Week 3
No ratings yet
Week 3
34 pages
Chapter 7 DFT TurboScan 2003 7
No ratings yet
Chapter 7 DFT TurboScan 2003 7
115 pages
Dftmax Compression Shared I/O: Minimize The Cost of Testing Arm Processor-Based Designs and Other Multicore Socs
No ratings yet
Dftmax Compression Shared I/O: Minimize The Cost of Testing Arm Processor-Based Designs and Other Multicore Socs
5 pages
Atpg Low Power
No ratings yet
Atpg Low Power
11 pages
Chain Test: For Any Design To Do Chain Test Flow Is Same. Here We Did For Adaptive - Scan - TCL File in Lab5 Basic Chain Test
100% (1)
Chain Test: For Any Design To Do Chain Test Flow Is Same. Here We Did For Adaptive - Scan - TCL File in Lab5 Basic Chain Test
2 pages
Tessent Plltest User'S Manual: Software Version 2014.1 March 2014
No ratings yet
Tessent Plltest User'S Manual: Software Version 2014.1 March 2014
146 pages
Bi - Directional Signals in IEEE P1500 Standard
No ratings yet
Bi - Directional Signals in IEEE P1500 Standard
68 pages
Test Logic Insertion For Optimization of Testing Parameters On Lower Technology Node
No ratings yet
Test Logic Insertion For Optimization of Testing Parameters On Lower Technology Node
6 pages
Impact of Peer Pressure To Decision Making Skills
No ratings yet
Impact of Peer Pressure To Decision Making Skills
43 pages
The Introduction To Edt (Testkompress) : High Test Quality & Low Test Cost
No ratings yet
The Introduction To Edt (Testkompress) : High Test Quality & Low Test Cost
29 pages
Testing
No ratings yet
Testing
23 pages
Skripsi Tanpa Bab Pembahasan
No ratings yet
Skripsi Tanpa Bab Pembahasan
40 pages
DFT Timing Design Methodology For At-Speed BIST: February 2003
No ratings yet
DFT Timing Design Methodology For At-Speed BIST: February 2003
7 pages
Structural Faults in DFT
No ratings yet
Structural Faults in DFT
14 pages
Loes BRCM
No ratings yet
Loes BRCM
22 pages
At Speed Atpg
No ratings yet
At Speed Atpg
34 pages
Ieee p1687 Internal Jtag Taps Embedded Instrumentation White Paper
No ratings yet
Ieee p1687 Internal Jtag Taps Embedded Instrumentation White Paper
12 pages
Ijtag One
No ratings yet
Ijtag One
11 pages
Tutorial Design Compiler
No ratings yet
Tutorial Design Compiler
9 pages
Formulation of FEM For Two-Dimensional Problems
No ratings yet
Formulation of FEM For Two-Dimensional Problems
24 pages
Assignment II - Testability For VLSI Design
No ratings yet
Assignment II - Testability For VLSI Design
6 pages
Design For Testability and Scan
No ratings yet
Design For Testability and Scan
19 pages
Scan Through TAP
No ratings yet
Scan Through TAP
10 pages
DFT Lab Description
No ratings yet
DFT Lab Description
22 pages
Cell Based ATPG
No ratings yet
Cell Based ATPG
8 pages
Logic Design For Single On-Chip Test Clock Generation For N Clock Domain - Impact On SOC Area and Test Quality
No ratings yet
Logic Design For Single On-Chip Test Clock Generation For N Clock Domain - Impact On SOC Area and Test Quality
5 pages
Vlsi Test and Testability DFT Advisor Report
No ratings yet
Vlsi Test and Testability DFT Advisor Report
15 pages
242-Article Text-280-1-10-20180718 PDF
No ratings yet
242-Article Text-280-1-10-20180718 PDF
14 pages
Testability Concepts and DFT
No ratings yet
Testability Concepts and DFT
5 pages
An Introduction To Scan Test For Test Engineers: Part 2 of 2
No ratings yet
An Introduction To Scan Test For Test Engineers: Part 2 of 2
9 pages
4new Text Document
No ratings yet
4new Text Document
4 pages
Real Estate Brochure - Ekta World
No ratings yet
Real Estate Brochure - Ekta World
31 pages
DLD Assignment 9
No ratings yet
DLD Assignment 9
29 pages
A Robust Scan Insertion Methodology
No ratings yet
A Robust Scan Insertion Methodology
5 pages
LBIST
No ratings yet
LBIST
10 pages
Useful Book Ieee1500 DFT
No ratings yet
Useful Book Ieee1500 DFT
63 pages
DFT DRC - PPT 0
No ratings yet
DFT DRC - PPT 0
3 pages
Corporate Organization PDF
No ratings yet
Corporate Organization PDF
25 pages
Thorough Test Means Testing Through The RAM
No ratings yet
Thorough Test Means Testing Through The RAM
2 pages
Dftmax Ultra WP
No ratings yet
Dftmax Ultra WP
4 pages
BIST Controller
No ratings yet
BIST Controller
3 pages
Usman & Nduka (2022)
No ratings yet
Usman & Nduka (2022)
4 pages
Alat Mesin Fiamma All
No ratings yet
Alat Mesin Fiamma All
45 pages
Sharmeine B. Tenedero: E-Mail
No ratings yet
Sharmeine B. Tenedero: E-Mail
3 pages
DFT Student
No ratings yet
DFT Student
82 pages
SpyGlass MBIST Reference
No ratings yet
SpyGlass MBIST Reference
86 pages
Git Github
No ratings yet
Git Github
5 pages
RF Heirarchical 2010
No ratings yet
RF Heirarchical 2010
6 pages
RSPO Free Prior and Informed Consent FPIC Guide 2022 - RSPO GUI T08 002 V2 ENG
No ratings yet
RSPO Free Prior and Informed Consent FPIC Guide 2022 - RSPO GUI T08 002 V2 ENG
65 pages
Applied Use of Parabolas
No ratings yet
Applied Use of Parabolas
7 pages
Syhunliong v. Rivera GR No. 200148
No ratings yet
Syhunliong v. Rivera GR No. 200148
21 pages
Vinay Sharma: Director, Ni2designs
No ratings yet
Vinay Sharma: Director, Ni2designs
6 pages
Different File Formats
No ratings yet
Different File Formats
10 pages
DFT - Scan Insertion
No ratings yet
DFT - Scan Insertion
33 pages
Low Test Coverage Debug
No ratings yet
Low Test Coverage Debug
5 pages
Second Admission List - Political - Science
No ratings yet
Second Admission List - Political - Science
26 pages
Swapnil Arvindrao ShegekarThesis-2
No ratings yet
Swapnil Arvindrao ShegekarThesis-2
78 pages
Mbist Test 1736512778
No ratings yet
Mbist Test 1736512778
6 pages
Dis W23
No ratings yet
Dis W23
1 page
PPL (P1) Checkride POA
No ratings yet
PPL (P1) Checkride POA
33 pages
Esg and Risk
No ratings yet
Esg and Risk
22 pages
GeM-Bidding-7098423 - Nalco
No ratings yet
GeM-Bidding-7098423 - Nalco
10 pages
Wilson Step 5 Passages
No ratings yet
Wilson Step 5 Passages
13 pages
Ap15 Compsci A q2
No ratings yet
Ap15 Compsci A q2
9 pages
14.4 V & 18 V Family Handout-R1
No ratings yet
14.4 V & 18 V Family Handout-R1
2 pages
HD3 IDS Plough Specs
No ratings yet
HD3 IDS Plough Specs
6 pages
2025 Final Program
No ratings yet
2025 Final Program
10 pages
Teaching Students With Severe Disabilities 6th Edition by David Westling Erik W Carter
No ratings yet
Teaching Students With Severe Disabilities 6th Edition by David Westling Erik W Carter
318 pages
Application-Specific Integrated Circuit ASIC A Complete Guide
From Everand
Application-Specific Integrated Circuit ASIC A Complete Guide
Gerardus Blokdyk
No ratings yet

DFT Strategy For Arm Cores

Uploaded by

DFT Strategy For Arm Cores

Uploaded by

Product How To: DFT strategy for ARM

This is an important consideration at Samsung Electronics, which designs a variety of SoCs

Dedicated test pins

Shared test pins

Sidebar: Multi-site Testing

About the Authors

● Fundamentals of Multicore Processing

You might also like