A Method For Improving Power Grid Resilience To Electromigration-Caused Via Failures
A Method For Improving Power Grid Resilience To Electromigration-Caused Via Failures
A Method For Improving Power Grid Resilience To Electromigration-Caused Via Failures
Abstract— Electromigration (EM) has become a major power lifetime. For example, MTTF of a wire is a decreasing function
grid reliability problem in VLSI. In this paper, we first demon- of its current density. Power grid interconnections suffer most
strate that EM reliability analysis of a power grid can be from EM due to large current densities carried by them.
converted to analyzing EM reliability of the grid vias. We develop
a model for calculating EM lifetime of via-arrays and observe In dual-damascene copper metallization, special attention
that making power grid EM-immortal carries a huge metal area must be paid to vias, because they are where the maximum
overhead and possibly makes routing of both power and signal atomic flux divergence (AFD) occurs. According to [4], AFD
networks too difficult to complete. We propose a method for is directly associated with EM lifetime. In this paper, we
trading off power grid integrity and reliability to minimize the demonstrate that power grid EM analysis can be converted
total metal area overhead needed to achieve the desired grid
life time under power integrity constraints. Experimental results to analyzing EM reliability of the power grid vias.
show that using our method, both EM reliability and power The need for low resistance power grid leads to implemen-
integrity can be met, while the additional metal area used is tations with wide wires and multivias (via-arrays). Single large
significantly reduced. vias is not used due to via aspect ratio limit. Current density
Index Terms— Electromigration (EM), power integrity, of an individual via is reduced due to the presence of multiple
via-array. vias in an array. However, current is not evenly distributed
into vias of an array, and such vias fail at different rates due
I. I NTRODUCTION to EM. Also, when one via fails, the current redistributes into
remaining vias, which again changes the via failing rate. In this
test results [9] also suggest that vias are the EM critical spots,
Fig. 1. EM effect in dual-damascene copper metallization. (a) Electron therefore, in this paper, we ignore the minor diffusion paths
moving upward. (b) Electron moving downward. and focus on the dominant EM problem at via sites. At the
TABLE I anode, copper atoms are accumulated and compressed causing
C OPPER D IFFUSION PATHS AND THE A CTIVATION E NERGY a back-stress, which resists EM-induced void nucleation [10].
In extreme cases, when the compressive stress of copper atoms
at the anode exceeds a certain value, an extrusion may occur.
However, in dual-damascene Cu metallization, this would
imply that barrier layer is destroyed and copper atoms diffuse
into the dielectric layer. This is very unlikely to occur, thus in
dual-damascene copper metallization voids are considered the
main EM-caused failures [11].
(Cu) body as the upper level metal, but are separated from the Whether or not a wire is affected by EM depends not only
lower level metal by a barrier layer, which prevents copper on its current density, but also on its length. This is known
diffusion into the dielectric, as shown in Fig. 1. Typical barrier as Blech length effect [10], and any wire shorter than Blech
layer materials are Ta, TaN, TiN, and TiW. length is considered EM-immortal. Blech length effect can be
According to the basic EM theory, Cu atoms will gradually briefly described as follows: copper atoms are forced to move
move when moving electrons collide with them. In Cu metal- toward anode but cannot pass through the barrier layer, so they
lization, there are several potential atom diffusion paths. They accumulate there and create a compressive stress at the anode.
are listed in Table I [5], [6] and indicated in the bottom metal This compressive back-stress compensates the stress caused by
in Fig. 1(a). electron flow. Mathematical description of Blech length effect
From Table I, we can see that the surface and interface is given by
σmax
are the dominant diffusion paths in copper metallization. It ( j · L)critical = ∗ (2)
has been observed that the EM defect locations are related Z eρ
to those dominant diffusion paths. In [7], EM test results where j is the current density, L is the wire length, σ max is the
show that regions around via locations are the most EM-weak. maximum back-stress that can be generated (the upper bound),
In [8], finite-element method (FEM) simulation is performed is the atomic volume, Z ∗ is the effective valence, e is the
to investigate the nature of the asymmetries, and to identify electron’s charge, and ρ is the resistivity. When maximum
the likely failure locations in the via structures at different back-stress is generated on a wire, the corresponding jL
test conditions. Using the site with maximum AFD as the product is called the critical jL product. It is considered a
void location site, a good correlation between the model constant for a given technology. Given a current density j on a
predictions and experimental observations is obtained. Here, wire, if the wire length l < (jL)critical /j , the wire is considered
we briefly summarize the correlation between EM defect spot EM-immortal. Blech length of a wire is a decreasing function
and diffusion paths. In the wire body, due to high activation of its current density [10] rather than a constant, so a better
energy of grain boundary and bulk, the resulting AFD is much way to interpret Blech length effect is to take the jL product
smaller, which can be mathematically described as AFD = value of a wire and compare it to (jL)critical value to determine
d N/dt ≈ 0, where N is the atomic concentration and t is its EM immorality.
time. Therefore, the wire body is usually not the EM defect The above analyzes were all developed for a simple point-
site. Most of the voids nucleate near vias, since via regions are to-point wire. For a complex network topology, in which
mainly associated with surface and interface diffusion paths at least one node has a degree larger than two (a via con-
with small activation energy. Thus, near vias the nonzero tributes 1°), such as a power grid, it is nontrivial to extract
AFDs are created. In Fig. 1(a), the via acts as cathode for the the proper jL product. The problem of complex topology EM
upper layer metal. This scenario is also referred to as upstream analysis has been addressed in [12]. We briefly explain how
electron flow. For this case, a void is nucleated above the via to apply their results.
bottom barrier layer. In Fig. 1(b), the via acts as cathode for the Fig. 2 shows an example of a general complex topology.
lower layer metal. This is a downstream electron flow scenario. The nodes labeled X denote vias, the edges indicate metal
A void is nucleated below the via bottom barrier layer. In wire segments; all wires are on the same layer. Arrows denote
an extreme case, when a void spans the whole via width, the direction of electron flow. It is a complex topology since at
an open circuit is created. In practice, a typical EM failure least one node has a degree greater than two, e.g., node B, has
criterion is set to 10% of wire resistance increase. Real EM degree 4 due to incident edges AB, BC, EB, and via B. In [12],
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
each via, given the wire dimensions, applied voltage and via
resistance Rvia .
Using superposition property of a resistive circuit, we solve
the system each time with only one voltage source present and
all others shorted to ground. Due to symmetry of the system,
when only Va is applied, I1_a_via = I13_a_via , I2_a_via =
I14_a_via , and so on, where Ii_a_via denotes the current flowing
through via i when Va is applied. The currents I1_a_via ,
I2_a_via , I3_a_via , and I4_a_via are different due to the voltage
gradient between left and right end of the horizontal wire. We
define this as the forward voltage gradient of Va , denoted by
f = V1a /V2a , where Via is defined as voltage of via i ’s top
surface when Va is applied. The currents I1_a_via , I5_a_via , are
Fig. 4. Even/uneven current distribution in a via-array. (a) 3-D orthogonal
wire structure with 4 × 4 via-array. (b) Even distribution. (c) Uneven
different due to the voltage gradient between the left end of
distribution. (d) Extreme uneven. a horizontal wire and the bottom end of a vertical wire. This
is defined as the lateral voltage gradient of Va , denoted by
l = V1a /V5a .
We compute f using resistance divider shown in Fig. 6.
Rvia + R is the equivalent resistance from a via to ground and
R = R B //RT . We have
V1a 3r + R R I1_a_via
f = = = . (8)
V2a 2r + R R I2_a_via
The lateral voltage gradient l is computed using resistances
Fig. 5. 4 × 4 via-array. (a) Grid model. (b) Shortest paths.
approximated by the shortest paths shown in Fig. 5(b).
There are two shortest length paths: path_5 and path_1, and
A 3-D orthogonal wire structure with 4 × 4 via-array is only one of them pass through via 5. Thus, the resistance
built, as shown in Fig. 4(a). We change the currents flowing difference between path_5 and path_1 is r /2. We have
in each wire segment, and record the current densities through V1a Rtotal_path RL + RB
the horizontal cross-sectional plane of multiple vias, as shown l= = = . (9)
V5a Rtotal_path − r/2 R L + R B − r/2
in Fig. 4(b)–(d). It can be observed that when all wire currents
are identical as in Fig. 4(b), current distribution into the vias is Based on this analysis, we can express all via currents first
uniform. Greater differences among the wire segment currents in terms of I1_a_via and the value of I1_a_via can be derived
cause the current split into the vias to be more nonuniform. using the forward and lateral voltage gradients.
This effect can be explained by the differences among path Experimental results indicate that the maximal inaccuracy
resistances through different vias. A pure resistive mesh is is <1% of the total current value. This error is contributed
created in SPICE, and the simulation results validate this mostly by the inaccuracy of forward gradient computation.
observation. Based on the resistive mesh model, a fast current Our method can be easily extended to M × N via-array by
distribution into each via can be determined. A detailed modifying the equivalent circuit and shortest paths.
description can be found in [17]. We briefly summarize the There is also a local current crowding effect even in a single
approach below. via. Tan et al. [18] investigate this phenomenon, and observe
Consider two orthogonal wires on Metal 1 and Metal 2 that the effect is due to the much larger average current density
connected through a 4 × 4 via-array composed of equal size along the interface between the liner and via. This behavior
vias uniformly spaced in horizontal and vertical directions, affects the EM defect location around the via region.
connected through a via-array, as shown in Fig. 5(a). The Due to the variation of the microstructure inside each
lengths of wires extending from the intersection area are L T , interconnect, same current density may result in different
L B , L L , and L R (assume L T = L B and L L = L R ) and their EM lifetimes, therefore the overall EM lifetime of a via-
resistances are RT , R B , R L , and R R . The unit length of mesh array is not uniquely determined, even with known current
gird is d and its unit resistance is r . Voltages Va , Vb , Vc , and distribution. To account for this variation, a large number
Vd are applied at the wire ends. Vd is set to be the reference of simulations are required to obtain the overall via-array
(Vd = 0). Our objective is to compute current flowing through EM lifetime distribution. A complete via-array EM lifetime
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 7. Timeline for via p EM failure. (a) No void before t p−1. (b) Void
nucleates at t ∗ .
TABLE II
EM A NALYSIS R ESULTS AND C OMPARISONS
Fig. 17. Observations of the current flow behavior on the grid. (a) Via KCL. According to Kirchhoff’s current law (KCL), the current
(b) Current change trend. (c) Top-down view.
flowing into a network equals to the current flowing out of
the network. In Fig. 17(a), the dashed boxes can be viewed
first translate the via-array sensitivity list to track sensitivity as networks, therefore Iloads = I12 = I23 = · · · . In Fig. 17(b),
list by collecting all sensitivities of EM-weak via-arrays on the upsized via provides a less resistive path between the
the track, and choosing the track for upsizing based on the two metal layers, therefore it carries more current (denoted
track sensitivity. by a + sign). However, the current density through the
The objective of retaining power integrity within the upsized via is decreased. This can be proved using Thevenin’s
required lifetime while minimizing the cost of extra metal area theorem [26].
can be stated as Consider Fig. 18, a power grid in our case is a linear electri-
cal network with voltage and current sources and resistances.
min A = wi · li It can be replaced at terminals A-B by an equivalent voltage
i ∈ {upsized tracks} source Ueq in series connected to an equivalent resistance
Req . The resistor connecting A-B is the via-array that is to
s. t.∀i ∈ {power grid nodes} : Vi ≤ Vth
be upsized.
∀ j, k ∈ {power tracks} ∧ j//k : d jk ≥ dmin (17) Assume that the horizontal cross-sectional area of the via-
where A is the total extra metal area; wi is the width array is upsized to α (α > 1) times the original area. The
resistance of the via-array will be 1/α of the original value.
increase of track i ; li is the length of track i ; Vi is the
voltage drop on node i after all EM failures occur within the This is mathematically stated in (18). In (18), Iv and Iv
required lifetime; Vth is the threshold voltage drop, defined are the via-array currents before and after upsizing, jv and
as Vth = 10%Vdd in our case; djk is the distance between jv are the via-array current densities before and after upsiz-
track j and track k; and dmin is the minimum allowed distance ing, A is the via-array horizontal cross-sectional area before
between two tracks by the process technology. A first thought upsizing
to obtain a solution would be using greedy algorithm that by ⎧
⎨ Iv = Ueq1 > Iv = R U+R eq
iterations of selecting and upsizing each time a track with the Req + α Rv eq v
(18)
⎩ jv = Ueq1 · 1 = Ueq Ueq
greatest sensitivity until power integrity is achieved. In this αA
Req + α Rv A·(α Req +Rv ) < jv = A·(Req +Rv ) .
way, in each iteration, we obtain the local optimal solution
for the current condition; and it is expected to achieve a near Since the total amount of current between two adjacent
optimal solution at the end. However, two concerns would layers is constant, and the upsized via will carry more current,
arise during each iteration step. First, after each step the sensi- so other vias will carry less current (denoted by − sign),
tivity of remaining via-arrays would change; second, the global the amount of current decrease will gradually diminish as the
current distribution of power grid interconnects would change, distance between the other vias and the upsized via increases,
which may create new EM violations or eliminate some since the via that is the nearest to the upsized via will be
of the existing ones. So the true greedy algorithm requires affected the most. Fig. 17(c) shows the top view of a power
updating of track sensitivity list and power grid global current grid. When a track and the vias beneath it is upsized, it
distribution by simulating the upsized power grid at each decreases the current flowing through their first immediate
iteration step. The huge computation time prohibits us from neighboring vias; the effect on other vias on the same layer
taking this path. Instead, we apply a heuristic approach based is negligible. In Fig. 17(b), currents flowing through vias
on the following observations: 1) the total current flowing on different via layers may increase (denoted by ? mark),
through all vias between any two adjacent metal layers is equal especially those beneath and above the upsized via. This may
to the total amount of current drawn by all current loads; 2) the potentially create new EM violations. However, at this point,
current flowing through the via-arrays upsized at the present we will put this problem aside, and later provide a simulation-
iteration step increases, but the current density decreases; and based check to resolve it.
3) the current flowing through all other via-arrays that are Based on these observations, our heuristic greedy upsizing
on the same via layer as the upsized ones decreases with the Algorithm 1 is summarized below followed by brief expla-
largest current decrease on the first immediate neighboring nations. In the algorithm, SS is the track sensitivity list; we
track. We justify these observations below. first translate via-array sensitivity into track sensitivity. Then,
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
we sort the track sensitivity list and upsize the track with the VI. E XPERIMENTAL R ESULT
greatest sensitivity. The test criterion in step 2 is a necessary We first perform EM analysis on several IBM power grid
condition since if the total amount of voltage drop increase benchmarks. The analysis includes jL filter, fast via search,
on the target nodes is not satisfied, there is at least one power and via-array EM lifetime check. The results are summarized
integrity violation among the target nodes. We continue the in Table II. The column labeled #v gives the total number of
track upsizing until the criterion is reached to avoid the time via-arrays in the benchmark; the column labeled #jL lists the
consuming power grid simulation. In step 8, we provide a sim- number of via-arrays that are EM-mortal due to Blech length
ple track sensitivity updating approach. First, the upsized via- violation; the column labeled #fast provides the number of
array lies on both horizontal and vertical tracks; if one track is potentially EM-weak via-arrays among EM-mortal ones that
upsized, the via-array sensitivity contribution to the other track were found during the fast via search; the column labeled
is subtracted. Second, according to the heuristic observation, #EM-weak gives the number of via-arrays that are finally
the current through the first immediate neighboring via-array identified as EM-weak during the via-array EM lifetime check;
to the upsized track will be significantly reduced and will the column labeled #EM-weak-even lists the EM-weak via-
be considered EM-safe, so its sensitivity contribution is sub- arrays found assuming current is evenly distributed into a via-
tracted. This heuristic step may introduce new EM-violations, array; columns labeled t_fast and t_full are runtimes with fast
therefore to guarantee the robustness of the algorithm, in via search and with direct full via search (jL filter time is
steps 10 and 11, power grid is simulated to capture the possible not included); the column labeled t_improve is the runtime
newly created EM-violations and verify power integrity. Even- improvement with and without fast via search. We make sev-
tually the algorithm will either fail if after all possible power eral observations. First, the jL filter is very effective in quickly
track upsizing, power integrity is still not achieved, or other- sifting out EM-immortal wires. On the benchmarks we have
wise succeed in finding a suboptimal solution for minimizing tried, the filtering rate reaches up to 99.96% (IBMPGNEW2)
extra metal area used to achieve power integrity and EM in the best case, and in worst case is still 78.63% (IBMPG2).
reliability. Second, the fast via search, in general, is useful for further
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TABLE III
P OWER G RID T RACK U PSIZING R ESULTS AND C OMPARISONS
Fig. 19. Visualized track upsizing results. (a) Upsizing for IBMPG2 layer 4–5. (b) Upsizing for IBMPG3 layer 2–3. (c) Upsizing for IBMPG4 layer 3–4.
(d) Upsizing for IBMPGNEW2 layer 1–2.
fast filtering out of non-EM-weak via-arrays, however, for a respectively; the column labeled A_improve is the area
specific case (IBMPG4) it does not help, since all EM-mortal improvement of greedy upsizing compared to full or optimal
via-arrays are eventually identified as EM-weak. Third, assum- upsizing; the column labeled t_greedy and t_opt are runtimes
ing even current split into individual vias in the array, much of greedy and optimal upsizing, respectively; the column
fewer via-arrays are identified as EM-weak, which indicates labeled t_improve is the runtime improvement of greedy
that assuming the even current distribution model misses many upsizing compared to optimal upsizing.
EM-weak via-arrays; the miss rate in the worst case reaches We make several observations. The number of via-arrays
up to 76.98%. Forth, the runtime improvement is significant that are upsized using greedy algorithm is much smaller than
for those cases when fast via search filtering rate is not zero; when full upsizing is evoked, but a little greater than in the
the improvement can reach up to 83.05% in the best case. case of optimal upsizing. The corresponding area comparisons
Next, we perform power track upsizing on the IBM power are also provided. For greedy algorithm, the area improvement
grid benchmarks. The results are summarized in Table III. We compared with full upsizing is significant, and in the best case
compare the results for full upsizing, greedy upsizing, and reaches up to 95%. As expected, in area improvement, the
optimal upsizing. Full upsizing strengthens all EM-weak via- optimal exceeds the greedy upsizing, but with a very small
arrays; greedy upsizing applies our heuristic greedy algorithm; advantage. In the worst case, the results of greedy upsizing
optimal upsizing is based on greedy algorithm, but it includes are only 6% worse than the optimal.
a power grid simulation to update sensitivity list and possibly The greedy upsizing algorithm’s runtime is almost an order
adds new EM-weak via-arrays each time a power track is of magnitude faster than that of the optimal algorithm. In
upsized. In Table III, the column labeled target is the number Fig. 19, we show several visualized power track upsizing
of target nodes found in the benchmark; the column labeled results comparing the greedy and full upsizing. X marks
v_greedy is the number of via-arrays that are strengthened denote those EM-weak via-arrays that are critical to power
using heuristic greedy algorithm; the column labeled #v_opt integrity; O marks denote those that are not.
is the number of via-arrays that are strengthened using Finally, we provide a comparison between our via-centric
true optimal upsizing; the columns labeled A_total, A_full, and a traditional wire segment-centric EM analysis. The tradi-
A_greedy, and A_opt are total metal area of the power grid and tional wire segment-centric EM analysis ignores the fact that
extra metal area used for full, greedy, and optimal upsizing, vias are the locations where AFD occurs, which eventually
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Malgorzata Marek-Sadowska (F’97) received the Sani R. Nassif (F’08) received the bachelor’s degree
M.S. and Ph.D. degrees from the Technical Univer- from the American University of Beirut, Beirut,
sity of Warsaw, Warsaw, Poland. Lebanon, and the master’s and Ph.D. degrees from
She was an Assistant Professor with the Tech- Carnegie-Mellon University, Pittsburgh, PA, USA, in
nical University of Warsaw, from 1976 to 1982. 1980, 1981 and 1985 respectively.
From 1982 to 1990, she was a Research Engineer He was with Bell Laboratories, Berkeley Heights,
with the Electronics Research Laboratory, University NJ, USA, until 1996 and the IBM Austin Research
of California, Berkeley, CA, USA. In 1990, she Laboratory, Austin, TX, USA, until 2013. He is now
was a Professor with the Department of Electrical a CEO of Radyalis, a firm focused on applying EDA
and Computer Engineering, University of California, techniques in the area of Cancer radiation therapy.
Santa Barbara, CA, USA. He has authored numerous conference and journal
She was an Associate Editor from 1991 to 1993, and Editor-In-Chief of publications, received five Best Paper awards (IEEE Trans. CAD, ICCAD,
IEEE T RANSACTIONS ON CAD from 1993 to 1995. She is currently an DAC, ISQED and ICCD), and given Keynotes at conferences like ASPDAC,
Associate Editor of IEEE T RANSACTIONS ON VLSI. ESSCIRC, BMAS, SISPAD, SEMICON, PATMOS, SELSE and VLSI-SOC.
He is the president of the IEEE Council on EDA, a member of the ACM and
the AAAS, and an IBM Master Inventor with more than 75 patents.