0% found this document useful (0 votes)

70 views6 pages

Optimization of Noc Wrapper Design Under Bandwidth and Test Time Constraints

Interface

Uploaded by

Marwan Ahmed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views6 pages

Optimization of Noc Wrapper Design Under Bandwidth and Test Time Constraints

Interface

Uploaded by

Marwan Ahmed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Optimization of NoC Wrapper Design Under Bandwidth and Test Time Constraints

Fawnizu Azmadi Hussin, Tomokazu Yoneda, and Hideo Fujiwara

Graduate School of Information Science, Nara Institute of Science and Technology
Kansai Science City, 630-0192, Japan
fawniz-h, yoneda, fujiwara@is.naist.jp
Abstract
In this paper, two wrapper designs are proposed for corebased test application based on Networks-on-Chip (NoC)
reuse. It will be shown that the previously proposed NoC
wrapper does not efficiently utilize the NoC bandwidth, which
may result in poor test schedules. Our wrappers (Type 1 and
Type 2) complement each other to overcome this inefficiency
while minimizing the overhead. The Type 2 wrapper uses
larger area overhead to increase bandwidth efficiency, while
the Type 1 takes advantage of some special configurations
which may not require a complex and high-cost wrapper. Two
wrapper optimization algorithms are applied to both wrapper
designs under channel bandwidth and test time constraints,
resulting in very little or no increase in the test application
time compared to conventional TAM approaches.

1. Introduction
The NoC [1] provides abundant communication resources,
which makes the traditional approach of adding extraneous
Test Access Mechanism (TAM) [2, 3] overkill. Several research groups have published works on NoC test scheduling
[4, 5, 6] utilizing the NoC as the test data transportation path
from external testers to the CUTs. Test scheduling for the
NoC router [6, 7] and crosstalk test of the interconnects [8]
have also been discussed. In these approaches, each CUT is
wrapped by an IEEE 1500 [9] compatible wrapper in order to
provide isolation and access during the test application.
Many NoC architectures have been proposed such as SPIN
[10], thereal [11, 12], SoCIN [13], NOSTRUM [14], QNoC
[15], and HERMES [16]; all are based on a synchronous communication between nodes. Several other types of NoCs such
as CHAIN [17], NEXUS [18], and ANoC [19] are based on
Globally Asynchronous Locally Synchronous (GALS) communication. The copious NoC architectures highlight the
growing interest in NoC as a next generation SoC interconnect.
With regard to the NoCs Design-for-Testability (DFT), the
authors in [20] presented an architecture called ANoC-TEST,
which targets the Asynchronous Networks-on-Chip (ANoC)
[19]. In [21], the proposed NoC wrapper takes advantage of

12th IEEE European Test Symposium (ETS'07)

0-7695-2827-9/07 $20.00 2007

the guaranteed bandwidth and latency provided by the NoC to

ensure test data integrity. Their experimental results showed
that in terms of core test time, the proposed NoC wrapper is
comparable to the TAM-based IEEE 1500 wrapper, and NoCreuse [4, 5, 6] capable. However, due to the constraint of
the parallel-serial conversion at the input port, the proposed
wrapper requires much higher guaranteed bandwidth on the
NoC than the actual rate of the test data loaded into the test
wrapper. This is further explained in Sect. 4.2
In this paper, we are proposing two types of NoC wrappers
based on the guaranteed bandwidth and latency. The wrappers
complement each other in order to optimize the NoC bandwidth utilization and minimize the test application time. For
a given bandwidth or a test application time constraint, the
proposed wrapper optimization algorithm finds the optimum
configuration using a binary search algorithm.
The rest of the paper is organized as follows: The NoC
model and the IP core model are described in Sect. 2 and 3,
respectively. In Sect. 4, a detailed description of the proposed
NoC wrapper architecture is given. The wrapper optimization methodology is explained in Sect. 5. Some experimental
results on selected benchmark circuits are given in Sect. 6.
Finally, concluding remarks are offered in Sect. 7

2. NoC Model
The proposed wrapper utilizes the functional communication channel between a test source and a CUT. The delivery
channel can be a dedicated path or a transparent virtual channel. The wrapper is topology independent; it can be used for
any NoC architecture as long as minimum sustainable bandwidth and latency are guaranteed during the test application
of the target CUT. The quality-of-service guarantees ensure
that the test data are available at the CUT at the right time. In
this paper, the thereal [11, 12] NoC is used to explain the
wrapper design and optimization.
The thereal NoC routers [11] provide both guaranteed
and best-effort services. The guaranteed throughput (GT)
router guarantees uncorrupted, lossless, and ordered data
transfer, and both latency and throughput over a finite time interval. It also implements a network interface (NI) [12]NI
kernel and NI shellswhich connects the network routers to

I/O

ATE
Channel 1

NoC
l

NIS

ATE

I/O

ATE
Channel 2

Virtua
l chan
ne

I/O Port 1

Core
5

NIS

Core
6

NIS
NIS

NIS
NIK

NIK

NIS
NIK

I/O Port 2

NIS

nel
Virtual chan

SoC
Core
1
Core
2
Core
3

Network Interface
(input side)
1 2

PDI

Core
4

PCI

IP Core
PDO
1 2

NIS

From other cores, PIs, etc

PCO

Network Interface
(output side)

Internal
scan chains
To other cores, POs, etc

Figure 1. NoC model based on the thereal NoC

Figure 3. IP core model interfaced to the NI port

the IP cores by means of shared-memory abstraction (Fig. 1)

utilizing a transaction-based protocol.
Figure 1 shows a NoC model based on the thereal architecture consisting of four GT routers
. The NI
supports multiple communication protocols required by the
IP cores. Two of the NI shells are labelled I/O port 1 and
I/O port 2, which can be used to interface the external ATE
ports to the NoC. Two virtual channels (VC) are shown connecting the ATE on port 1 to core 2 and the ATE on port 2
to core 3. Each VC is guaranteed a minimum bandwidth,
. The term represents the max

imum link bandwidth between each pair of GT routers
for some link
and along the VC path. If

can be allocated to
,
the
remaining

other VCs in order to allow simultaneous test applications of
multiple CUTs.
Figure 2 shows a simplified timing diagram of an AXI
burst write transaction [22]. In order to reuse the NoC during test, the ATE needs to communicate with the CUT using
the read/write transactions. Furthermore, the test methodology can be extended to reuse the embedded processors as test
sources and sinks in place of the external ATE.

into primary data outputs (PDO), consisting of RDATA[31:0]

signals (not included in Fig. 2), and primary control outputs
(PCO), consisting of BRESP[1:0] signals.
With the new classifications, core I/Os can be categorized
as PDI, PDO, PCI, PCO, and other PI/POs which are not connected to the communication port of the NoC as shown in
Fig. 3. The PDIs and PDOs are used to carry the test vectors from the ATE to the CUT, and the test responses from
the CUT to the ATE, respectively. The PCIs and PCOs are
needed to operate in the functional mode during the test application to ensure that the read/write transactions, by which
the test data and responses are transmitted, execute properly.
Since the CUT is not operating in the normal mode, the PCO
signals must be generated by a wrapper controller. Special
boundary cells proposed in [21] are used for PCOs to make
the NoC operate in the normal mode to transfer the test responses. For all other PI/POs, the IEEE 1500 boundary cells
are used.

3. IP Core Model
IP core I/Os consist of primary inputs (PI), primary outputs (PO), scan inputs (SI) and scan outputs (SO). A subset
of the PIs can be categorized into primary data inputs (PDI)
and primary control inputs (PCI). Assuming that the CUT
communicates with the NoC by means of the AXI protocol
(Fig. 2), PDI would be made up of WDATA[31:0] signals,
while PCI consists of ADDR[31:0], AVALID, DLAST, and
DVALID signals. Some PO signals can also be categorized
T0

CLK
ADDR[31:0]

AVALID
WDATA[31:0]

D(A0)

D(A1)

D(A2)

DVALID
OK

Figure 2. AXI burst-write transaction

12th IEEE European Test Symposium (ETS'07)

0-7695-2827-9/07 $20.00 2007

Core wrapper design for a TAM-based test architecture has

been explained in [2, 3]. For a CUT, given internal scan
chains (ISC) of length, , primary inputs,
primary outputs, bidirectionals, and
wrapper scan chains
(WSC), the WSCs are formed while minimizing the maximum scan-in and scan-out depths. Scan-in elements consist
of zero or more inputs, bidirectionals, and ISCs. Scan-out
elements consist of zero or more outputs, bidirectionals, and
ISCs.
Figure 4 shows
for a CUT with , and
flip-flops, ,
, and . The scan elements are
optimally divided to form scan chains with maximum scan-in
depth, , and maximum scan-out depth, , respectively; this is an optimal wrapper scan chain design [2].
As a result, the total test application time (TAT) can be calculated by equation (1), where is the number of test vectors.
For Fig. 4, the TAT is, clock cycles.

DLAST

BRESP[1:0]

4. NoC Wrapper Architecture

(1)

When using TAMs as the delivery channel, the scan chain

inputs and outputs are connected directly to the ATE input and
output channels through the TAM wires. In order to reuse the

Scan-in depth, i,k

Scan-out depth, so,k

Scan-in depth, si,k

11
11

4
3

From
PDI
port

4
3

9
9

To
PDO
port

shift
Legends:

NoC as the delivery channel, the scan chains are connected to

the existing functional connections. Therefore, the test control and synchronization are no longer at the hand of the ATE,
rendering the IEEE 1500 wrapper inadequate. Sections 4.1
4.3 explain how these problems are addressed in the proposed
NoC wrappers.

The proposed Type 1 wrapper uses the same approach as in
[2, 3] when forming the wrapper scan chains which minimizes
. For a given number of wrapper scan chains,

, and the PDI bit-width, , the number of PDI bits that
can be used to carry the test data for each wrapper scan chain,
, is given by equation (2) [21]. To differentiate these PDI
bits, those that can carry the test data are called input data
boundary cells, IDBC (shaded black in Fig. 5). If

(Eqn. (3)), some PDI bits cannot be used to carry the test data.
A similar analysis can be done for the output data boundary
cells (ODBC), resulting in equations (4) and (5).

(2)
(3)
(4)
(5)

For the CUT with 8-bit PDI/PDOs, and three wrapper scan
chains (Fig. 5), means that each
wrapper scan chain is interfaced to two IDBC/ODBC cells.
In addition,

means that the
remaining two PDI/PDO bits cannot be used to carry the test
data. These unused PDI/PDO bits become part of the wrapper
scan chain, with no extra functionality. Since in
typical cases, the following discussion on the PDI on the input
port also applies to the PDO on the output port.
During the test application, IDBC cells are loaded with the
test data in one clock cycle, in the normal operation mode
(refer to Fig. 5). The IDBC cells change into the test mode,
during which the test data are serially shifted for two clock
cycles to empty the contents into the scan chains. After completion, the IDBC cells change again into the normal mode to
capture the next incoming data from the PDI port. This operation is controlled by a test controller which keeps track of
the number of loads and shifts using counters [23].
For the NoC wrapper with a scan-in depth of nine (Fig. 5),
after four repetitions of loads and shifts, the first eight bits of

12th IEEE European Test Symposium (ETS'07)

0-7695-2827-9/07 $20.00 2007

Scan-out depth, o,k

Figure 4. TAM-based wrapper scan chains made up

of PI/PO boundary cells (square) and internal scan
chains (rectangle)

load

IDBC/ODBC
Normal boundary cell
Internal scan chain

Figure 5. Type 1 NoC wrapper architecture

each scan chains are loaded with the test data. To load the
last bit, the IDBC cells are loaded with new test data and a
single shift clock is applied. However, before applying the
capture cycle, the IDBC must also be loaded with valid test
data. After the last single shift, only part of the IDBC cells
contain valid test data. Reloading the IDBC data from the PDI
port can corrupt the valid data currently in the IDBC cells.
To overcome this problem, the first

shift
cycles of every test pattern must shift in dummy bits into the
scan chains. After the scan chains are completely loaded, another clock cycle is required to load the IDBC cells with valid
test data before applying the capture cycle. Here, a formal
definition of a new terminology based on this new scheme is
given.
[Definition] The Scan-in (scan-out) elements for the Type
1 NoC wrapper consist of the unused IDBC (ODBC) cells,
bidirectional cells, and internal scan chains (i.e. excluding all
the IDBC/ODBC cells). The maximum scan-in and scan-out
depths are denoted by and , respectively (Fig. 5).
As a result of the new test scheme, the number of shift
cycles required for the Type 1 NoC wrapper is summarized by
equations (6) and (7). Equation (8) gives the total TAT, where
the additional represents the final load of the IDBC data
prior to the capture cycle. For the NoC wrapper in Fig. 5,
clock cycles, smaller than based on
equation (1).

(6)

(7)

(8)

For a CUT with

wrapper scan chains and
scan frequency, its scan rate/bandwidth is given by

. As shown in the previous example
(Fig. 5), some PDI bits cannot be used to carry the test data
due to the Type 1 wrappers input architecture constraint. In
order to supply the test data to the CUT at

rate, the
required channel bandwidth on the NoC is given in equation
(9). For the NoC wrapper in Fig. 5, the scan and required
bandwidths are bits-per-second () and , respectively.

4000

1000

Number of wrapper scan chains

0
2

8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 42 44 46 48 50 52 54 56 58 60 62 64

4
3

To PDO port

2000

Scan-out depth, so,k

Test frequency = 100 MHz

From PDI port

3000

Scan-in depth, si,k

Required NoC bandwidth

Bandwidth (Mbps)

5000

shift

Scan bandwidth

6000

shift

Figure 6. Scan rate and required bandwidth of a

Type 1 NoC wrapper for p93791s Core 6 [25]

Legends:

load

(9)

Figure 6 shows the required bandwidth of the proposed

Type 1 NoC wrapper (Fig. 5) compared to the actual scan
bandwidth for an ITC02 benchmark circuit. For some number of wrapper scan chains, the required bandwidth is almost twice that of the scan bandwidth. For these cases (i.e.
), the Type 1 NoC wrapper is inefficient in terms
of NoC bandwidth utilization, similar to the NoC wrapper
in [21]. For other cases, it is as efficient as the TAM-based
wrapper while having the advantage of NoC reuse support capability with minimal area overhead. In the next section, an
alternate wrapper architecture is proposed to overcome this
limitation.

#
$"
!"
Section 4.2 has shown that the Type 1 wrapper is inefficient
in terms of bandwidth utilization. The Type 2 NoC wrapper in
Fig. 7, is designed to complement the Type 1 wrapper in this
aspect. Extra load/shift registers are added to the PDI/PDO
ports, similar to the buffer architecture in [23] for the reuse
of the SoCs functional bus and the bandwidth matching registers in [24]. The load/shift registers translate the PDI bitwidth into the number of wrapper scan chains using parallelserial shift registers. As a result, the required NoC bandwidth
matches the scan bandwidth. The TAT for the Type 2 NoC
wrapper is also the same as the TAM-based wrappers in equation (1). This is achieved at the cost of area overhead of
load/shift registers and a more complex control scheme to realize the bit-width conversion. Therefore, it is important that
the Type 2 wrapper is used only when necessary. The next
section looks at two proposed optimization schemes.

Load/shift register
element
Normal boundary cell
Internal scan chain

Figure 7. Type 2 NoC wrapper with an I/O interface

time ( or ) and bandwidth ( or ) must be considered.

The problem of optimizing the number of wrapper scan
chains (
) is formally defined as follows.
: Given a core with functional inputs,
functional outputs, bidirectionals, internal scan chains of length
, and a maximum bandwidth for the virtual channel between the core and the ATE, ,
find the number of wrapper scan chains,
, such that
() the TAT is minimum, () the required bandwidth,
, and ()
is minimum subject to objectives () and ().
: Given a core as in , and a maximum TAT, ,
find the number of wrapper scan chains,
, such
that () the required bandwidth, , is minimum, ()
TAT , and ()
is minimum subject to objectives () and ().
A similar problem for a TAM-based wrapper design has
been proven to be NP-complete in [2]. Therefore, heuristic algorithms are proposed to solve both and . Figure 9 illustrates graphically the search steps for (when
) for core 17 of the p93791 [25] benchmark circuit. Since the TAT and the required bandwidth are
monotonic decreasing and increasing with respect to
, respectively, binary search algorithms can be used to find the
optimal value of
. At each search step, the optimal wrapper scan chains which minimizes are formed using the proposed algorithm in [2], described in Sect. 4. For
the Type 1 wrapper, binary search takes place in steps 1 and
2 (refer to Fig. 9). In step 1, the maximum number of scan

chains,

, such that is located. In step 2,

the search is restricted to

to find the optimal

Parallel core tests are performed according to an optimum

test schedule under constraints. Figure 8 shows an example
of a bin-packing optimization [2, 3], where a rectangle represents the bandwidth-TAT ratio for every CUT. For an optimal
packing, the new entry into the bin must be selected properly,
which means that all possible bandwidth-TAT combinations
must be explored. For optimum results, both the available test

12th IEEE European Test Symposium (ETS'07)

0-7695-2827-9/07 $20.00 2007

Bandwidth

5. Optimization of the NoC Wrappers

Core 2

B1
B2

Core 1

Core 3
Core 4

T2
Time

Figure 8. Test schedule optimization

Table 2. TAT comparison for the circuit in [21]

Table 1. p93791s Core 6 [25] with 64-bit PDI/PDOs

TAT (clock cycles)
%
nsc
TAM [2, 3] Type 1 NoC increase
1 5,317,007
5,312,372 -0.09%
2 2,658,613
2,656,404 -0.08%
3 1,809,815
1,812,442 0.15%
4 1,358,456
1,359,988 0.11%
5 1,126,316
1,127,848 0.14%
6
907,097
909,286 0.24%
7
793,217
794,749 0.19%
8
679,337
680,212 0.13%
9
674,957
676,489 0.23%
10
565,457
566,770 0.23%
11
561,077
562,171 0.19%
12
455,738
455,956 0.05%

nsc
13
14
15
16-19
20-21
22
23
24-38
39-42
43-45
46
47-64

%
TAT (clock cycles)
TAM [2, 3] Type 1 NoC increase
451,577
452,452 0.19%
451,358
451,576 0.05%
447,197
448,072 0.20%
341,858
342,076 0.06%
337,478
338,134 0.19%
333,317
333,754 0.13%
231,478
231,258 -0.10%
227,978
228,196 0.10%
223,598
223,816 0.10%
219,218
219,436 0.10%
115,848
115,847 0.00%
114,317
114,535 0.19%

value for
. The progression of the binary search is graphically illustrated in Fig. 9. As a result,
(Type 1) with
a TAT of 65,098 clock cycles.
For the Type 2 wrapper,
is directly calculated since

is a linear function of
. Binary search in step 2
(similar to the Type 1 wrapper) results in
with a
TAT of 32,766 clock cycles. Clearly a better result for the
Type 2 wrapper. In this case ( ), the Type
1 wrapper is unable to utilize efficiently the allocated bandwidth because of the constraint in its I/O architecture.
A similar heuristic is implemented for and some selected cases for both algorithms are presented in Sect. 6.

6. Experimental Results
In order to evaluate the effectiveness of the proposed
methodology, we have conducted experiments on three IP
cores. Core 17 and core 6 (the largest of p93791 circuit)
from the ITC02 benchmark [25] are selected in order to offer
comparisons with TAM-based approaches [2, 3]. Another IP
corean example core from [21]allows some comparison
with an NoC wrapper to be offered.
A TAT comparison between the proposed Type 1 NoC
wrapper and TAM-based approaches is given in Table 1, for
core 6 with bits. In all cases, the differences are
always less than 0.25%; the proposed Type 1 NoC wrapper
does not incur noticable penalty on the TAT. In fact, some reductions are achieved for
and scan chains. For
the Type 2 NoC wrapper, the TAT is the same as the TAMbased approach because the added interface between the CUT
and the NoC port does not constraint the scan chain design.
The Type 2 wrappers required bandwidth matches the scan
bandwidthan improvement due to the extra load/shift registers.
For the circuit from [21], the TAT is given in Table 2. Compared to the TAM-based wrapper, the proposed Type 1 NoC
wrapper is better for smaller number of wrapper scan chains.
For wider scan chains, the TATs are about 3% longer. However, compared to the NoC wrapper design in [21]1 , the Type
1 wrapper is always superior.
1 Based

on the corrected results obtained from the paper author because of

reporting error in the original published literature.

12th IEEE European Test Symposium (ETS'07)

0-7695-2827-9/07 $20.00 2007

nsc
1
2
3
4
5
6

TAM
[2, 3]
5,532
2,771
1,858
1,396
1,363
1,363

Amory
[21]1
5,532
2,771
1,858
1,451
1,429
1,418

Proposed
Type 1
Type 2
5,300
5,532
2,660
2,771
1,780
1,858
1,428
1,396
1,406
1,363
1,395
1,363

% increase
Type 1 / TAM
Type 1 / Amory
-4.19%
-4.19%
-4.01%
-4.01%
-4.20%
-4.20%
2.29%
-1.59%
3.15%
-1.61%
2.35%
-1.62%

Further, we implemented the wrapper architecture proposed in [21] and compared the results (Table 3) for the a
larger IP core (core 6 of p93791). The TAT and the required

bandwidth,
, (column 3) are obtained for selected

(column 1). Using
(column 4) as input

to , the corresponding
, , and TAT for the proposed Type 2 wrapper are obtained. Using at most the bandwidth required by [21], the proposed wrapper gives shorter
TATs. For
scan chains (last row), [21] requires 33%
more bandwidth to obtain a comparable TAT.
Table 4 compares the Type 1 and Type 2 wrappers when
and are applied. For , both
wrappers result in similar performancea slight advantage
for Type 1 in terms of area overhead. At
, Type 2 is clearly the winner, with only 0.8% bandwidth overhead to achieve 32.5% TAT reductions. For
, Type 2 requires 31% smaller bandwidth
with less than 0.7% TAT overhead. On the other hand, at

, Type 1 wrapper is superior due to its minimal wrapper hardware overhead. The results illustrate the
tradeoffs between the two types of NoC wrappers for a given
constraint, which can be explored during the test schedule optimization.

7. Conclusion
We have proposed two versions of the NoC wrapper that
requires minimal overhead on the test application time and
area overhead. The previously proposed wrapper design did
not handle the problem of inefficient bandwidth utilization.
In this paper, we have proposed two heuristics that find the
optimal wrapper design for a given maximum bandwidth or
maximum test application timeimportant for test schedule
optimization.
Table 3. TAT comparison with [21] (Core 6)
nsc
11
15
22
24

Amory [21]
TAT
562,172
448,073
333,755
228,416

Breq
1,280
1,600
3,200
3,200

Bmax
1,280
1,600
3,200
3,200

nsc
12
16
24
24

Proposed (Type 2)
Breq
TAT
1,200
455,738
1,600
341,858
2,400
227,978
2,400
227,978

%incr. TAT
-18.9%
-23.7%
-31.7%
-0.2%

Table 4. Selected optimization results (Core 17)

Given:
B Bmax (Mbps)
1,700
3,000
T

Tmax
70,000
200,000

Type 1

Type 2

nsc
15
21

Breq (Mbps)
1,600
2,133

TAT
97,648
96,129

nsc
15
23

Breq (Mbps)
1,500
2,300

nsc
22
8

Breq (Mbps)
TAT
3,200
65,098
800
193,128

nsc
22
8

Breq (Mbps)
TAT
2,200
65,530
800
192,912

TAT
97,215
64,882

1.E+06

TAT (clock cycles)

Test Application Time

1.E+05

Type 1
wrapper

Step 2

Type 2
Type 1

Type 2
wrapper
Step 2

1.E+04
1

11 12 13

14 15

16 17 18

19 20

21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64

Step 1

Required Bandwidth

6.E+09

Bmax = 5.6E+09

Required
Bandwidth
(bps)

4.E+09

2.E+09

Step 1

Number of wrapper scan chains

0.E+00
1

Figure 9. Optimization of NoC wrapper design for a given . In step 2 (Type 1), the dotted lines represent the
search space which halves in every progression of the binary search

The proposed wrapper does not incur large test time overhead (against TAM-based designs) for the same number of
wrapper scan chains (about 3% for a very small circuit, and
less than 0.25% for larger circuits). The wrappers scale well
for large circuits. The advantage of the proposed wrapper is
that NoC reuse is possible with only small test time overhead. With additional allowances on the area overhead, the
proposed wrapper (Type 2) can efficiently utilize the NoC
bandwidth with zero overhead on the test application time.

Acknowledgements
This work was supported in part by Japan Society for the
Promotion of Science (JSPS) under Grants-in-Aid for Scientific Research B (No.15300018) and for Young Scientists B
(No.18700046).

References
[1] L. Benini and G. D. Micheli, Networks on Chips: A New SoC
Paradigm, IEEE Computer, 35(1), pp. 70-80, 2002.
[2] V. Iyengar, K. Chakrabarty, and E. J. Marinissen, Test Wrapper and
Test Access Mechanism Co-Optimization for System-on-Chip, Journal
of Electronic Testing: Theory and Applications 18, pp. 23-230, 2002.
[3] S. K. Goel and E. J. Marinissen, SoC Test Architecture Design for Efficient Utilization of Test Bandwidth, ACM Trans. Design Automation
of Electronic Systems, Vol. 8(4), Oct. 2003, pp. 399-429.
[4] E. Cota, L. Carro, and M. Lubaszewski, Reusing and On-Chip Network
for the Test of Core-Based Systems, ACM Trans. Design Automation
of Electronic Systems, Vol. 9, No. 4, October 2004, pp. 471-499.
[5] A. M. Amory, E. Cota, M. Lubaszewski, and F. G. Moraes, Reducing
Test Time With Processor Reuse in Network-on-Chip Based Systems,
In Proc. Integrated Circuits and Systems Design, 2004, pp. 111-116.
[6] C. Liu, Z. Link, and D.K. Pradhan, Reuse-Based Test Access and Integrated Test Scheduling for Network-on-Chip, In Proc. Design, Automation and Test in Europe, 2006, pp. 303-308
[7] A. M. Amory, E. Briao, E. Cota, M. Lubaszewski, and F. G. Moraes, A
Scalable Test Strategy for Network-on-Chip Routers, In Proc. IEEE
International Test Conference, 2005, pp. 591-599.
[8] C. Grecu, P. Pande, A. Ivanov, and R. Saleh, BIST for Network-onChip Interconnect Infrastructure, VLSI Test Symposium, 2006, pp. 3035.
[9] E. J. Marinissen, R. Kapur, M. Lousberg, T. McLaurin, M. Ricchetti,
and Y. Zorian, On IEEE P1500 standard for embedded core test, Journal of Electronic Testing: Theory and Applications, 2002, pp. 365-383.

12th IEEE European Test Symposium (ETS'07)

0-7695-2827-9/07 $20.00 2007

[10] P. Guerrier and A. Greiner, A Generic Architecture for On-Chip

Packet-Switched Interconnection, In Proc. Design, Automation and
Test in Europe, 2000, pp. 250-256.
[11] E. Rijpkema, Trade Offs in the Design of a Router with both Guaranteed and Best-Effort Services for Networks on Chip, In Proc. Design,
Automation and Test in Europe, 2003, pp. 10350-10355.
[12] A. Radulescu, et al., An Efficient On-Chip NI Offering Guaranteed
Services, Shared-Memory Abstraction, and Flexible Network Configuration, IEEE Trans. Computer-Aided Design of Integrated Circuits and
Systems, Vol. 24(1), pp. 4-17, Jan. 2005.
[13] C. A. Zeferino and A. A. Susin, SoCIN: A Parametric and Scalable
Network-on-Chip, In Proc. Symposium on Integrated Circuits
and Systems Design, 2003, pp. 169-174.
[14] M. Millberg, E. Nilsson, R. Thid, and A. Jantsch, Guaranteed Bandwidth using Looped Containers in Temporally Disjoint Networks within
the Nostrum Network on Chip, In Proc. Design, Automation and Test
in Europe, 2004, pp. 890-895.
[15] E. Bolotin, I. Cidon, R. Ginosar, and A. Kolodny, QNOC: QoS Architecture and Design Process for Network on Chip, Journal of Systems
Architecture: The Euromicro Journal, 50(23), pp. 105-128, Feb. 2004.
[16] F. G. Moraes, N. Laert, V. Calazans, A. V. de Mello, L. H. Mller, L.
C. Ost, HERMES: an Infrastructure for Low Area Overhead PacketSwitching Networks on Chip, Integration, the VLSI Journal, 38(1), pp.
69-93, Oct. 2004.
[17] J. Bainbridge and S. Furber, Chain: a Delay-Insensitive Chip Area
Interconnect, IEEE Micro, Vol. 22(5), pp. 16-23, Sept./Oct. 2002.
[18] A. Lines, Asynchronous Interconnect for Synchronous SoC Design,
IEEE Micro, Vol. 24(1), pp. 32-41, Jan./Feb. 2004.
[19] E. Beigne, F. Clermidy, P. Vivet, A. Clouard, and M. Renaudin, An
Asynchronous NoC Architecture Providing Low Latency Service and
its Multi-level Design Framework, In Proc. IEEE Intl Symposium on
Asynchronous Circuits and Systems, 2005, pp. 54-63.
[20] X.-T. Tran, J. Durupt, F. Bertrand, V. Beroulle, and C. Robach, A DFT
Architecture for Asynchronous Networks-on-Chip, In Proc. IEEE European Test Symposium, 2006, pp. 219-224.
[21] A. M. Amory, K. Goossens, E. J. Marinissen, M. Lubaszewski, and F.
Moraes, Wrapper Design for the Reuse of Networks-on-Chip as Test
Access Mechanism, European Test Symposium, 2006, pp. 213-218.
[22] ARM, AMBA AXI Protocol Specification, March 2004.
[23] F. A. Hussin, T. Yoneda, A. Orailoglu, and H. Fujiwara, PowerConstrained SOC Test Schedules Through Utilization of Functional
Buses, Intl Conference on Computer Design, 2006, pp. 230-236.
[24] A. Khoche, Test Resource Partitioning for Scan Architectures using
Bandwidth Matching, Digest of Workshop on Test Resource Partitioning, 2002, pp. 1.4.1-1.4.8.
[25] E. J. Marinissen, V. Iyengar, and K. Chakrabarty, A Set of Benchmarks
For Modular Testing of SOCs, In Proc. International Test Conference,
2002, pp. 519-528.

Duality and Modern Economics - Cornes PDF
No ratings yet
Duality and Modern Economics - Cornes PDF
304 pages
Giai Tich 2 Nam2017 Nhom 6
No ratings yet
Giai Tich 2 Nam2017 Nhom 6
3 pages
A Case Study of Hierarchical Diagnosis For Core-Ba
No ratings yet
A Case Study of Hierarchical Diagnosis For Core-Ba
6 pages
IET Computers Digital Tech - 2017 - Khan - Comparative Analysis of Network On Chip Simulation Tools
No ratings yet
IET Computers Digital Tech - 2017 - Khan - Comparative Analysis of Network On Chip Simulation Tools
9 pages
Eetop - CN - Designing Reliable and Efficient Networks On Chips by Dr. Srinivasan Murali (Aut
100% (1)
Eetop - CN - Designing Reliable and Efficient Networks On Chips by Dr. Srinivasan Murali (Aut
200 pages
Network On Chip
No ratings yet
Network On Chip
6 pages
P1500-Wrapper - Creation
No ratings yet
P1500-Wrapper - Creation
24 pages
Bi - Directional Signals in IEEE P1500 Standard
No ratings yet
Bi - Directional Signals in IEEE P1500 Standard
68 pages
Multistage Interconnection Network For Mpsoc: Performances Study and Prototyping On Fpga
No ratings yet
Multistage Interconnection Network For Mpsoc: Performances Study and Prototyping On Fpga
6 pages
SOC Testing
No ratings yet
SOC Testing
27 pages
A Plan For Router Testing in Network-On-Chip
No ratings yet
A Plan For Router Testing in Network-On-Chip
46 pages
GinosarNOC Tutorial
No ratings yet
GinosarNOC Tutorial
35 pages
Network On-Chip and Its Research Challenges: K. Paramasivam
100% (1)
Network On-Chip and Its Research Challenges: K. Paramasivam
5 pages
An Efficient On-Chip Network Interface Offering Guaranteed Services, Shared-Memory Abstraction, and Flexible Network Configuration
No ratings yet
An Efficient On-Chip Network Interface Offering Guaranteed Services, Shared-Memory Abstraction, and Flexible Network Configuration
6 pages
Noc Topologies01
No ratings yet
Noc Topologies01
16 pages
Eetop - CN - Ayan Mandal, Sunil P. Khatri, Rabi Mahapatra (Auth.) - Source-Synchronous Networ
No ratings yet
Eetop - CN - Ayan Mandal, Sunil P. Khatri, Rabi Mahapatra (Auth.) - Source-Synchronous Networ
151 pages
Integrated Circuit IP Cores System On A Chip
No ratings yet
Integrated Circuit IP Cores System On A Chip
3 pages
SoCBUS Switched Network On Chip For Hard Real Time Embedded Systems
No ratings yet
SoCBUS Switched Network On Chip For Hard Real Time Embedded Systems
8 pages
Week 9 Course Material
No ratings yet
Week 9 Course Material
65 pages
Networks On Chip, Router Architectures and Performance Challenges
No ratings yet
Networks On Chip, Router Architectures and Performance Challenges
6 pages
Noc Router Area Opt Flow Co1
No ratings yet
Noc Router Area Opt Flow Co1
8 pages
Frame Work For Designexp
No ratings yet
Frame Work For Designexp
15 pages
Verification of Noc - Interconnect
No ratings yet
Verification of Noc - Interconnect
6 pages
Attia 2011
No ratings yet
Attia 2011
6 pages
Survey of Network-On-chip Proposals
No ratings yet
Survey of Network-On-chip Proposals
13 pages
An Efficient Interfacing Approach For Heavily-Communicating NoC-Based Systems
No ratings yet
An Efficient Interfacing Approach For Heavily-Communicating NoC-Based Systems
20 pages
Performance Comparison of 2D and 3D Torus Network-on-Chip Architectures
No ratings yet
Performance Comparison of 2D and 3D Torus Network-on-Chip Architectures
4 pages
Hallenges and Romising Esults in O Rototyping Sing S: C P R N CP U Fpga
No ratings yet
Hallenges and Romising Esults in O Rototyping Sing S: C P R N CP U Fpga
10 pages
Week 10 Course
No ratings yet
Week 10 Course
50 pages
Designing A WISHBONE Protocol Network Adapter For An Asynchronous Network-on-Chip
No ratings yet
Designing A WISHBONE Protocol Network Adapter For An Asynchronous Network-on-Chip
7 pages
05 Testing PDF
No ratings yet
05 Testing PDF
118 pages
Testing
No ratings yet
Testing
118 pages
附件2
No ratings yet
附件2
6 pages
Power-Aware Soc Test Planning For Effective Utilization of Port-Scalable Testers
No ratings yet
Power-Aware Soc Test Planning For Effective Utilization of Port-Scalable Testers
19 pages
Overview of The Ieee p1500 Standard
No ratings yet
Overview of The Ieee p1500 Standard
10 pages
Data Communication and Computer Networks: Submitted To: Dr. Muhammad Kaleem Ullah Submitted By: Registration #
100% (1)
Data Communication and Computer Networks: Submitted To: Dr. Muhammad Kaleem Ullah Submitted By: Registration #
32 pages
Compare Performance of 2D and 3D Mesh Architectures in Network-On-Chip
No ratings yet
Compare Performance of 2D and 3D Mesh Architectures in Network-On-Chip
5 pages
Delay Optimization: Use of Dynamic Routing
No ratings yet
Delay Optimization: Use of Dynamic Routing
8 pages
An Efficient Algorithm For Wrapper and Tam Co-Optimization To Reduce Test Application Time in Core Based Soc
No ratings yet
An Efficient Algorithm For Wrapper and Tam Co-Optimization To Reduce Test Application Time in Core Based Soc
9 pages
Design and Application of A Co-Simulation Framework For Chisel-EECS-2021-133
No ratings yet
Design and Application of A Co-Simulation Framework For Chisel-EECS-2021-133
49 pages
Routing in Wireless Mesh Networks
From Everand
Routing in Wireless Mesh Networks
Raghav Kumar
No ratings yet
NoC Memory Controller
No ratings yet
NoC Memory Controller
74 pages
Survey of Network On Chip (Noc) Architectures & Contributions
No ratings yet
Survey of Network On Chip (Noc) Architectures & Contributions
15 pages
2-D Mesh of Switches and Resources Providing Physical-Architectural Level Design Integration Type of Resources Basic Module
No ratings yet
2-D Mesh of Switches and Resources Providing Physical-Architectural Level Design Integration Type of Resources Basic Module
5 pages
No Care A Power
No ratings yet
No Care A Power
7 pages
Scientific Programming - 1994 - Rosing - Low Latency Messages On Distributed Memory Multiprocessors
No ratings yet
Scientific Programming - 1994 - Rosing - Low Latency Messages On Distributed Memory Multiprocessors
9 pages
Modeling and Analysis of A Cache Coherent Interconnect
No ratings yet
Modeling and Analysis of A Cache Coherent Interconnect
84 pages
Proactive Deadlock Prevention Based On Traffic Cla
No ratings yet
Proactive Deadlock Prevention Based On Traffic Cla
12 pages
Javed Sethi2020 - Review of Network On Chip Routing Algorithms
No ratings yet
Javed Sethi2020 - Review of Network On Chip Routing Algorithms
13 pages
Network On Chip
No ratings yet
Network On Chip
43 pages
Applying The Benefits of Network On A Chip Architecture To FPGA System Design
No ratings yet
Applying The Benefits of Network On A Chip Architecture To FPGA System Design
9 pages
System-On-Chip (Soc) Testing
No ratings yet
System-On-Chip (Soc) Testing
18 pages
A SECURE DATA AGGREGATION TECHNIQUE IN WIRELESS SENSOR NETWORK
From Everand
A SECURE DATA AGGREGATION TECHNIQUE IN WIRELESS SENSOR NETWORK
Dr Chaitra HV
No ratings yet
Comparative Analysis of Different Topologies Based On Network-on-Chip Architectures
No ratings yet
Comparative Analysis of Different Topologies Based On Network-on-Chip Architectures
6 pages
Why Use The IEEE 1500 Standard?: 2.1.1 Test Reuse and Partitioning
No ratings yet
Why Use The IEEE 1500 Standard?: 2.1.1 Test Reuse and Partitioning
8 pages
Final Exam System On Chip Solutions in Networking SS 2007
No ratings yet
Final Exam System On Chip Solutions in Networking SS 2007
10 pages
Improvement of Compilers
No ratings yet
Improvement of Compilers
4 pages
Exam #2 For Computer Networks (CIS 6930) SOLUTIONS : Problem #1
No ratings yet
Exam #2 For Computer Networks (CIS 6930) SOLUTIONS : Problem #1
4 pages
NoC Seminar Report
No ratings yet
NoC Seminar Report
24 pages
Managing Qos Flows at Task Level in Noc-Based Mpsocs: Everton Carara, Ney Calazans, Fernando Moraes
No ratings yet
Managing Qos Flows at Task Level in Noc-Based Mpsocs: Everton Carara, Ney Calazans, Fernando Moraes
6 pages
Noc Bench Spec Part1 v03 With XML
No ratings yet
Noc Bench Spec Part1 v03 With XML
13 pages
Essential Guide to CAN Communication
From Everand
Essential Guide to CAN Communication
Engineer's Essentials
No ratings yet
Important Valeo Qs
100% (1)
Important Valeo Qs
7 pages
Project Description:: Comm14 Version (I) Microprocessor
No ratings yet
Project Description:: Comm14 Version (I) Microprocessor
7 pages
Part 3 Itroduction To Wan Ch1 Part2
No ratings yet
Part 3 Itroduction To Wan Ch1 Part2
22 pages
Companies Database 1
No ratings yet
Companies Database 1
36 pages
117101004
No ratings yet
117101004
4 pages
Static Routing: Prof DR Hesham Arafat
No ratings yet
Static Routing: Prof DR Hesham Arafat
26 pages
Part 2 Routing Prt3
No ratings yet
Part 2 Routing Prt3
8 pages
TCP and Udp: Connectionless Low Overhead Datagrams
No ratings yet
TCP and Udp: Connectionless Low Overhead Datagrams
11 pages
DHCP - Dynamic Host Configuration Protocol: IP Addresses and Other Information Can Be Obtained
No ratings yet
DHCP - Dynamic Host Configuration Protocol: IP Addresses and Other Information Can Be Obtained
8 pages
Altera Max Plus Tutorial
No ratings yet
Altera Max Plus Tutorial
15 pages
Network Support The Way We Live: Modern Technologies and Trends For Each Give A Brief Discussion
No ratings yet
Network Support The Way We Live: Modern Technologies and Trends For Each Give A Brief Discussion
8 pages
OSI Reference Model: ITE PC v4.0 © 2007 Cisco Systems, Inc. All Rights Reserved. Cisco Public
No ratings yet
OSI Reference Model: ITE PC v4.0 © 2007 Cisco Systems, Inc. All Rights Reserved. Cisco Public
8 pages
Compact Yet High-Performance (Cyhp) Library For Short Time-To-Market With New Technologies
No ratings yet
Compact Yet High-Performance (Cyhp) Library For Short Time-To-Market With New Technologies
7 pages
N Nterconnect Rchitecture FOR Etworking Ystems On Hips: A I A N S C
No ratings yet
N Nterconnect Rchitecture FOR Etworking Ystems On Hips: A I A N S C
10 pages
Electronics Graduation Project Competition 2014: Abstract
No ratings yet
Electronics Graduation Project Competition 2014: Abstract
1 page
OFDM in Verilog
No ratings yet
OFDM in Verilog
6 pages
Maxima and Minima
No ratings yet
Maxima and Minima
4 pages
Math F111
No ratings yet
Math F111
2 pages
Brij Kishor
No ratings yet
Brij Kishor
102 pages
Alder - Multivariate Calculus
100% (1)
Alder - Multivariate Calculus
198 pages
Altair's Student Guides - CAE and Design Optimisation - Basics
100% (29)
Altair's Student Guides - CAE and Design Optimisation - Basics
69 pages
Mathematics (2 Unit) : Based On 1983 Syllabus, Written in 2004
No ratings yet
Mathematics (2 Unit) : Based On 1983 Syllabus, Written in 2004
16 pages
Calculus Based Optimization: Eaching Uggestions M6-6
No ratings yet
Calculus Based Optimization: Eaching Uggestions M6-6
2 pages
Maxima Minima
No ratings yet
Maxima Minima
2 pages
History of Derivatives
No ratings yet
History of Derivatives
9 pages
C++ Chapter 5 Arrays and String
No ratings yet
C++ Chapter 5 Arrays and String
15 pages
Key Concept: Ansal Lasses Wave Optics
No ratings yet
Key Concept: Ansal Lasses Wave Optics
11 pages
Optimization of Hydraulics Parameters
100% (1)
Optimization of Hydraulics Parameters
22 pages
Arihant Class12 Mathematics Revisionnotes-1
100% (1)
Arihant Class12 Mathematics Revisionnotes-1
19 pages
Math F111
No ratings yet
Math F111
2 pages
Maximum or Minimum
No ratings yet
Maximum or Minimum
38 pages
Rational and Radical Function - Domain and Range PDF
No ratings yet
Rational and Radical Function - Domain and Range PDF
9 pages
Operation Research
No ratings yet
Operation Research
23 pages
MaxEnt Lambda Files
No ratings yet
MaxEnt Lambda Files
6 pages
AUGS/ AGSR Division: Birla Institute of Technology and Science, Pilani Pilani Campus
No ratings yet
AUGS/ AGSR Division: Birla Institute of Technology and Science, Pilani Pilani Campus
4 pages
Question Bank AOT
No ratings yet
Question Bank AOT
4 pages
Applications of Differentiation
No ratings yet
Applications of Differentiation
96 pages
12 Mathematics Sp05
No ratings yet
12 Mathematics Sp05
27 pages
Lec05 Proof of Correctness Solving Local Minima in Grid
No ratings yet
Lec05 Proof of Correctness Solving Local Minima in Grid
24 pages
S&Z 1.6 PDF
No ratings yet
S&Z 1.6 PDF
27 pages
Section 9.4 Lagrange Multipliers Problems 1-15 Odd, 19-25 Odd
No ratings yet
Section 9.4 Lagrange Multipliers Problems 1-15 Odd, 19-25 Odd
4 pages
Stacking Sequence Design of Composite Laminates For Maximum Strength Using Genetic Algorithms
No ratings yet
Stacking Sequence Design of Composite Laminates For Maximum Strength Using Genetic Algorithms
15 pages
3.6 Illustrating Extreme Value Theorem
No ratings yet
3.6 Illustrating Extreme Value Theorem
16 pages
Lesson 7 Optimization Problems
No ratings yet
Lesson 7 Optimization Problems
16 pages

Optimization of Noc Wrapper Design Under Bandwidth and Test Time Constraints

Uploaded by

Optimization of Noc Wrapper Design Under Bandwidth and Test Time Constraints

Uploaded by

Optimization of NoC Wrapper Design Under Bandwidth and Test Time Constraints

Fawnizu Azmadi Hussin, Tomokazu Yoneda, and Hideo Fujiwara

12th IEEE European Test Symposium (ETS'07)

the guaranteed bandwidth and latency provided by the NoC to

From other cores, PIs, etc

Figure 1. NoC model based on the thereal NoC

Figure 3. IP core model interfaced to the NI port

the IP cores by means of shared-memory abstraction (Fig. 1)

into primary data outputs (PDO), consisting of RDATA[31:0]

Figure 2. AXI burst-write transaction

12th IEEE European Test Symposium (ETS'07)

Core wrapper design for a TAM-based test architecture has

4. NoC Wrapper Architecture

When using TAMs as the delivery channel, the scan chain

Scan-in depth, i,k

Scan-out depth, so,k

Scan-in depth, si,k

NoC as the delivery channel, the scan chains are connected to

12th IEEE European Test Symposium (ETS'07)

Scan-out depth, o,k

Figure 4. TAM-based wrapper scan chains made up

Figure 5. Type 1 NoC wrapper architecture

For a CUT with

Number of wrapper scan chains

Scan-out depth, so,k

Test frequency = 100 MHz

From PDI port

Scan-in depth, si,k

Required NoC bandwidth

Figure 6. Scan rate and required bandwidth of a

Figure 6 shows the required bandwidth of the proposed

Figure 7. Type 2 NoC wrapper with an I/O interface

time ( or ) and bandwidth ( or ) must be considered.

, such that   is located. In step 2,

 to find the optimal

Parallel core tests are performed according to an optimum

12th IEEE European Test Symposium (ETS'07)

5. Optimization of the NoC Wrappers

Figure 8. Test schedule optimization

Table 2. TAT comparison for the circuit in [21]

Table 1. p93791s Core 6 [25] with 64-bit PDI/PDOs

on the corrected results obtained from the paper author because of

12th IEEE European Test Symposium (ETS'07)

Table 4. Selected optimization results (Core 17)

TAT (clock cycles)

Test Application Time

Number of wrapper scan chains

12th IEEE European Test Symposium (ETS'07)

[10] P. Guerrier and A. Greiner, A Generic Architecture for On-Chip

You might also like

, such that is located. In step 2,

to find the optimal