Related-Key Cryptanalysis of The Full AES-192 and AES-256

Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

Related-key Cryptanalysis of the Full AES-192

and AES-256
Alex Biryukov and Dmitry Khovratovich
University of Luxembourg
Abstract. In this paper we present two related-key attacks on the full
AES. For AES-256 we show the rst key recovery attack that works for
all the keys and has 2
99.5
time and data complexity, while the recent
attack by Biryukov-Khovratovich-Nikolic works for a weak key class and
has much higher complexity. The second attack is the rst cryptanalysis
of the full AES-192. Both our attacks are boomerang attacks, which are
based on the recent idea of nding local collisions in block ciphers and
enhanced with the boomerang switching techniques to gain free rounds
in the middle.
1 Introduction
The Advanced Encryption Standard (AES) [10] a 128-bit block cipher, is one
of the most popular ciphers in the world and is widely used for both commercial
and government purposes. It has three variants which oer dierent security
levels based on the length of the secret key: 128, 192, 256-bits. Since it became
a standard in 2001 [1], the progress in its cryptanalysis has been very slow. The
best results until 2009 were attacks on 7-round AES-128 [11,12], 10-round AES-
192 [6,14], 10-round AES-256 [6,14] out of 10, 12 and 14 rounds respectively.
The two last results are in the related-key scenario.
Only recently there was announced a rst attack on the full AES-256 [7]. The
authors showed a related-key attack which works with complexity 2
96
for one
out of every 2
35
keys. They have also shown practical attacks on AES-256 (see
also [8]) in the chosen key scenario, which demonstrates that AES-256 can not
serve as a replacement for an ideal cipher in theoretically sound constructions
such as Davies-Meyer mode.
In this paper we improve these results and present the rst related-key at-
tack on AES-256 that works for all the keys and has a better complexity (2
99.5
data and time). We also develop the rst related key attack on the full AES-192.
In both attacks we minimize the number of active S-boxes in the key-schedule
(which caused the previous attack on AES-256 to work only for a fraction of all
keys) by using a boomerang attack [16] enhanced with boomerang switching tech-
niques. We nd our boomerang dierentials by searching for local collisions [9,7]
in a cipher. The complexities of our attacks and a comparison with the best
previous attacks are given in Table 1.
This paper is structured as follows: In Section 3 we develop the idea of local
collisions in the cipher and show how to construct optimal related-key dieren-
tials for AES-192 and AES-256 . In Section 4 we briey explain the idea of a
boomerang and an amplied boomerang attack. In Sections 5 and 6 we describe
an attack on AES-256 and AES-192, respectively.
Attack Rounds # keys Data Time Memory Source
192
Partial sums 8 1 2
127.9
2
188
? [11]
Related-key rectangle 10 64 2
124
2
183
? [6,14]
Related-key
amplied boomerang
12 4 2
123
2
176
2
152
Sec. 6
256
Partial sums 9 256 2
85
2
226
2
32
[11]
Related-key rectangle 10 64 2
114
2
173
? [6,14]
Related-key dierential 14 2
35
2
131
2
131
2
65
[7]
Related-key boomerang 14 4 2
99.5
2
99.5
2
77
Sec. 5
Table 1. Best attacks on AES-192 and AES-256
2 AES Description and Notation
We expect that most of our readers are familiar with the description of AES and
thus point out only the main features of AES-256 that are crucial for our attack.
AES rounds are numbered from 1 to 14 (12 for AES-192). We denote the
i-th 192-bit subkey (do not confuse with a 128-bit round key) by K
i
, i.e. the
rst (whitening) subkey is the rst four columns of K
0
. The last subkey is K
7
in AES-256 and K
8
in AES-192. The dierence in K
i
is denoted by K
i
. Bytes
of a subkey are denoted by k
l
i,j
, where i, j stand for the row and column index,
respectively, in the standard matrix representation of AES, and l stands for the
number of the subkey. Bytes of the plaintext are denoted by p
i,j
, and bytes of the
internal state after the SubBytes transformation in round r are denoted by a
r
i,j
,
with A
r
depicting the whole state. Let us also denote by b
r
i,j
byte in position
(i, j) after the r-th application of MixColumns.
Features of AES-256. AES-256 has 14 rounds and a 256-bit key, which is two
times larger than the internal state. Thus the key schedule consists of only 7
rounds. One key schedule round consists of the following transformations:
k
l+1
i,0
S(k
l
i+1,7
) k
l
i,0
C
l
, 0 i 3;
k
l+1
i,j
k
l+1
i,j1
k
l
i,j
, 0 i 3, 1 j 3;
k
l+1
i,4
S(k
l+1
i,3
) k
l
i,4
, 0 i 3;
k
l+1
i,j
k
l+1
i,j1
k
l
i,j
, 0 i 3, 5 j 7,
where S() stands for the S-box, and C
l
for the round-dependent constant.
Therefore, each round has 8 S-boxes.
Features of AES-192. AES-192 has 12 rounds and a 192-bit key, which is 1.5
times larger than the internal state. Thus the key schedule consists of 8 rounds.
One key schedule round consists of the following transformations:
k
l+1
i,0
S(k
l
i+1,5
) k
l
i,0
C
l
, 0 i 3;
k
l+1
i,j
k
l+1
i,j1
k
l
i,j
, 0 i 3, 1 j 5.
Notice that each round has only four S-boxes.
3 Local Collisions in AES
SubBytes
ShiftRows
MixColumns
Key schedule round
Key schedule round
disturbance
correction
Fig. 1. A local collision in AES-256.
The notion of a local collision comes from
the cryptanalysis of hash functions with
one of the rst applications by Chabaud
and Joux [9]. The idea is to inject a dif-
ference into the internal state, causing a
disturbance, and then to correct it with
the next injections. The resulting dier-
ence pattern is spread out due to the mes-
sage schedule causing more disturbances
in other rounds. The goal is to have as
few disturbances as possible in order to
reduce the complexity of the attack.
In the related-key scenario we are
allowed to have dierence in the key,
and not only in the plaintext as in the
pure dierential cryptanalysis. However
the attacker can not control the key it-
self and thus the attack should work for
any key pair with a given dierence.
Local collisions in AES-256 are best understood on a one-round example
(Fig. 1), which has one active S-box in the internal state, and ve non-zero byte
dierences in the two consecutive subkeys. This dierential holds with probabil-
ity 2
6
if we use an optimal dierential for an S-box:
0x01
SubBytes
= 0x1f;

0x1f
0
0
0

MixColumns
=

0x3e
0x1f
0x1f
0x21

Due to the key schedule the dierences spread to other subkeys thus forming
the key schedule dierence. The resulting key schedule dierence can be viewed
as a set of local collisions, where the expansion of the disturbance (also called
disturbance vector) and the correction dierences compensate each other. The
probability of the full dierential trail is then determined by the number of
active S-boxes in the key-schedule and in the internal state. The latter is just
the number of the non-zero bytes in the disturbance vector.
Therefore, to construct an optimal trail we have to construct a minimal-
weight disturbance expansion, which will become a part of the full key schedule
dierence. For the AES key schedule, which is mostly linear, this task can be
viewed as building a low-weight codeword of a linear code. Simultaneously, cor-
rection dierences also form a codeword, and the key schedule dierence code-
word is the sum of the disturbance and the correction codewords. In the simplest
trail the correction codeword is constructed from the former one by just shifting
four columns to the right and applying the S-boxMixColumns transformation.
Disturbance
Correction
+
Key schedule
=
Fig. 2. Full key schedule dierence (4.5 key-schedule rounds) for AES-256.
An example of a good key-schedule pattern for AES-256 is depicted in Fig-
ure 2 as a 4.5-round codeword. In the rst four key-schedule rounds the distur-
bance codeword has only 9 active bytes (red cells in the picture), which is the
lower bound. We want to avoid active S-boxes in the key schedule as long as
possible, so we start with a single-byte dierence in byte k
4
0,0
and go backwards.
Due to a slow diusion in the AES key schedule the dierence aects only one
more byte per key schedule round. The correction (grey) column should be po-
sitioned four columns to the right, and propagates backwards in the same way.
The last column in the rst subkey is active, so all S-boxes of the rst round are
active as well, which causes an unknown dierence in the rst (green) column.
This alien dierence should be canceled by the plaintext dierence.
4 Related Key Boomerang and Amplied Boomerang
Attacks
In this section we describe two types of boomerang attacks in the related-key
scenario.
A basic boomerang distinguisher [16] is applied to a cipher E
K
() which is
considered as a composition of two sub-ciphers: E
K
() = E
1
E
0
. The rst sub-
cipher is supposed to have a dierential , and the second one to have a
dierential , with probabilities p and q, respectively. In the further text
the dierential trails of E
0
and E
1
are called upper and lower trails, respectively.
In the boomerang attack a plaintext pair results in a quartet with probability
p
2
q
2
. The amplied boomerang attack [13] (also called rectangle attack [4]) works
in a chosen-plaintext scenario and constructs N
2
p
2
q
2
2
n
quartets of N plaintext
pairs. We refer to [16,13] for the full description of the attacks.
In the original boomerang attack paper by Wagner [16] it was noted that
the number of good ciphertext quartets is actually higher, since an attacker may
consider other and (with the same and ). This observation can be applied
to both types of boomerang attacks. As a result, the number Q of good quartets
is expressed via amplied probabilities p and q as follows:
Q = p
2
q
2
2
n
N
2
,
where
p =

P[ ]
2
; q =

P[ ]
2
. (1)
4.1 Related-key attack model
The related-key attack model [3] is a class of cryptanalytic attacks in which the
attacker knows or chooses a relation between several keys and is given access to
encryption/decryption functions with all these keys. The goal of the attacker is
to nd the actual secret keys. The relation between the keys can be an arbitrary
bijective function R (or even a family of such functions) chosen in advance by
the attacker (for a formal treatment of the general related key model see [2,15]).
In the simplest form of this attack, this relation is just a XOR with a constant:
K
2
= K
1
C, where the constant C is chosen by the attacker. This type of
relation allows the attacker to trace the propagation of XOR dierences induced
by the key dierence C through the key schedule of the cipher. However, more
complex forms of this attack allow other (possibly non-linear) relations between
the keys. For example, in some of the attacks described in this paper the attacker
chooses a desired XOR relation in the second subkey, and then denes the implied
relation between the actual keys as: K
2
= F
1
(F(K
1
) C) = R
C
(K
1
) where
F represents a single round of the AES-256 key schedule, and the constant C is
chosen by the attacker.
1
Compared to other cryptanalytic attacks in which the attacker can manipu-
late only the plaintexts and/or the ciphertexts the choice of the relation between
secret keys gives additional degree of freedom to the attacker. The downside of
this freedom is that such attacks might be harder to mount in practice. Still,
designers usually try to build ideal primitives which can be automatically used
without further analysis in the widest possible set of applications, protocols, or
modes of operation. Thus resistance to such attacks is an important design goal
for block ciphers, and in fact it was one of the stated design goals of the Rijndael
algorithm, which was selected as the Advanced Encryption Standard.
In this paper we use boomerang attacks in the related-key scenario. In the
following sections we denote the dierence between subkeys in the upper trail
by K
i
, and in the lower part by K
i
.
4.2 Boomerang switch
Here we analyze the transition from the sub-trail E
0
to the sub-trail E
1
, which
we call the boomerang switch. We show that the attacker can gain 1-2 middle
rounds for free due to a careful choice of the top and bottom dierentials. The
position of the switch is a tradeo between the sub-trail probabilities, that should
minimize the overall complexity of the distinguisher. Below we summarize the
switching techniques that can be used in boomerang or amplied boomerang
attacks on any block cipher.
Ladder switch. By default, a cipher is decomposed into rounds. However, such
decomposition may not be the best for the boomerang attack. We propose not
only to further decompose the round into simple operations but also to exploit
the existing parallelism in these operations. For example some bytes may be
independently processed. In such case we can switch in one byte before it is
transformed and in another one after it is transformed, see Fig. 3 for an illus-
tration.
An example is our attack on AES-192. Let us look at the dierential trails
(see Fig. 8). There is one active S-box in round 7 of the lower trail in byte
b
7
0,2
. On the other hand, the S-box in the same position is not active in the
upper trail. If we would switch after ShiftRows in round 6, we would pay the
probability in round 7 afterwards. However, we switch all the state except b
0,2
after MixColumns, and switch the remaining byte after the S-box application in
round 7, where it is not active. We thus do not pay for this S-box.
Feistel switch. Surprisingly, a Feistel round with an arbitrary function (e.g., an
S-box) can be passed for free in the boomerang attack (this was rst observed
1
Note that due to low nonlinearity of AES-256 key schedule such subkey relation
corresponds to a xed XOR relation in 28 out of 32 bytes of the secret key, and a
simple S-box relation in the four remaining bytes.
a
1
a
2
a
3
S S S
k
1
k
2
k
3
S S S
E
0
E
1
E
0
/ E
1
boundary
Fig. 3. The ladder switch in a toy three S-box block. A switch either before or
after the S-box layer would cost probability, while the ladder does not.
in the attack on cipher Khufu in [16]). Suppose the internal state (X, Y ) is
transformed to (Z = X f(Y ), Y ) at the end of E
0
. Suppose also that the E
0
dierence before this transformation is (
X
,
Y
), and that the E
1
dierence
after this transformation is (
Z
,
Y
).
As a result, variable Y in the four iterations of a boomerang quartet takes
two values: Y
0
and Y
0

Y
for some Y
0
. Then the f transformation is guaranteed
to have the same output dierence
f
in the quartet. Therefore, the decryption
phase of the boomerang creates the dierence
X
in X at the end of E
0
for
free. This trick is used in the switch in the subkey in the attack on AES-192.
S-box switch. This is similar to the Feistel switch, but costs probability only
in one of the directions. Suppose that E
0
ends with an S-box Y S(X) with
dierence If the output of an S-box in a cipher has dierence and if the same
dierence comes from the lower trail, then propagation through this S-box is
for free on one of the faces of the boomerang. Moreover, the other direction can
use amplied probability since specic value of the dierence is not important
for the switch
2
.
5 Attack on AES-256
In this section we present a related key boomerang attack on AES-256.
2
This type of switch was used in the original version of this paper, but is not needed
now due to change in the trails. We describe it here for completeness, since it might
be useful in other attacks.
5.1 The trail
The boomerang trail is depicted in Figure 7, and the actual values are listed in
Tables 3 and 2. It consists of two similar 7-round trails: the rst one covers rounds
18, and the second one covers rounds 814. The trails dier in the position of
the disturbance bytes: the row 1 in the upper trail, and the row 0 in the lower
trail. This fact allows the Ladder switch.
The switching state is the state A
9
(internal state after the SubBytes in
round 9) and a special key state K
S
, which is the concatenation of the last four
columns of K
3
and the rst four columns of K
4
. Although there are active S-
boxes in the rst round of the key schedule, we do not impose conditions on
them. As a result, the dierence in column 0 of K
0
is unknown yet.
Related keys We dene the relation between four keys as follows (see also
Figure 4). For a secret key K
A
, which the attacker tries to nd, compute its
second subkey K
1
A
and apply the dierence K
1
to get a subkey K
1
B
, from
which the key K
B
is computed. The relation between K
A
and K
B
is a constant
XOR relation in 28 bytes out of 32 and is computed via a function k

i,0
=
k
i,0
S(k
i+1,7
) S(k
i+1,7
c
i+1,7
), i=0,1,2,3, with constant c
i+1,7
= k
0
i+1,7
for the four remaining bytes.
The switch into the keys K
C
, K
D
happens between the 3rd and the 4th
subkeys in order to avoid active S-boxes in the key-schedule using the Ladder
switch idea described above. We compute subkeys K
3
and K
4
for both K
A
and K
B
. We add the dierence K
3
to K
3
A
and compute the upper half (four
columns) of K
3
C
. Then we add the dierence K
4
to K
4
A
and compute the lower
half (four columns) of K
4
C
. From these eight consecutive columns we compute
the full K
C
. The key K
D
is computed from K
B
in the same way.
K
0
K
1
K
1
K
1
K
A
K
B
K
3
K
4
K
3
K
D
K
4
K
2
K
3
K
4
K
5
K
3
K
4
K
3
K
C
K
4
Fig. 4. AES-256: Computing K
B
, K
C
, and K
D
from K
A
.
Finally, we point out that dierence between K
C
and K
D
can be computed in
the backward direction deterministically since there would be no active S-boxes
till the rst round. The secret key K
A
, and the three keys K
B
, K
C
, K
D
computed
from K
A
as described above form a proper related key quartet. Moreover, due
to a slow diusion in the backward direction, as a bonus we can compute some
values in K
i
even for i = 0, 1, 2, 3 (see Table 2). Hence given the byte value
k
l
i,j
for K
A
we can partly compute K
B
, K
C
and K
D
.
Internal state The plaintext dierence is specied in 9 bytes. We require that
all the active S-boxes in the internal state should output the dierence 0x1f so
that the active S-boxes are passed with probability 2
6
. The only exception is
the rst round where the input dierence in nine active bytes is not specied.
Let us start a boomerang attack with a random pair of plaintexts that t the
trail after one round. Active S-boxes in rounds 37 are passed with probability
2
6
each, so the overall probability is 2
30
.
We switch the internal state in round 9 with the Ladder switch technique:
the row 1 is switched before the application of S-boxes, and the other rows are
switched after the S-box layer. As a result, we do not pay for active S-boxes at
all in this round.
The second part of the boomerang trail is quite simple. Three S-boxes in
rounds 1014 contribute to the probability, which is thus equal to 2
18
. Finally
we get one boomerang quartet after the rst round with probability 2
30301818
= 2
96
.
5.2 The attack
The attack works as follows. Do the following steps 2
25.5
times:
1. Prepare a structure of plaintexts as specied below.
2. Encrypt it on keys K
A
and K
B
and keep the resulting sets S
A
and S
B
in
memory.
3. XOR
C
to all the ciphertexts in S
A
and decrypt the resulting ciphertexts
with K
C
. Denote the new set of plaintexts by S
C
.
4. Repeat previous step for the set S
B
and the key K
D
. Denote the set of
plaintexts by S
D
.
5. Compose from S
C
and S
D
all the possible pairs of plaintexts which are equal
in 56 bits
c
c
c
c
c
c
c
.
6. For every remaining pair check if the dierence in p
i,0
, i > 1 is equal on both
sides of the boomerang quartet (16-bit lter). Note that k
0
i,7
= 0 so k
0
i,0
should be equal for both key pairs (K
A
, K
B
) and (K
C
, K
D
).
7. Filter out the quartets whose dierence can not be produced by active S-
boxes in the rst round (one-bit lter per S-box per key pair) and active
S-boxes in the key schedule (one-bit lter per S-box), which is a 2 2+2 = 6-
bit lter.
8. Gradually recover key values and dierences simultaneously ltering out the
wrong quartets.
Each structure has all possible values in column 0 and row 0, and constant
values in the other bytes. Of 2
72
texts per structure we can compose 2
144
ordered
pairs. Of these pairs 2
14489
= 2
72
pass the rst round. Thus we expect one
right quartet per 2
9672
= 2
24
structures, and three right quartets out of 2
25.5
structures.
Let us now compute the number of noisy quartets. About 2
1445616
=
2
72
pairs come out of step 6. The next step applies a 6-bit lter, so we get
2
72+25.56
= 2
91.5
candidate quartets in total.
The remainder of this section deals with gradual recovering of the key and
ltering wrong quartets. The key bytes are recovered as shown in Figure 5.
2 1
0
1 3
0D
0D
4
3D
4
5
5 0
0 5
Fig. 5. Gradual key recovery. Digits stand for the steps, D means dierence.
1. First, consider 4-tuples of related key bytes in each position (1, j), j < 4. Two
dierences in a tuple are known by default. The third dierence is unknown
but is equal for all tuples (see Table 2, where it is denoted by X) and gets
one of 2
7
values. We use this fact for key derivation and ltering as follows.
Consider key bytes k
0
2,2
and k
0
2,3
. The candidate quartet proposes 2
2
candi-
dates for both 4-tuples of related-key bytes, or 2
4
candidates in total. Since
the dierences are related with the X-dierence, which is a 9-bit lter, this
step reveals two key bytes and the value of X and reduces the number of
quartets to 2
91.55
= 2
86.5
.
2. Now consider the value of k
0
1,0
, which is unknown yet and might be dierent
in two pairs of related keys. Let us notice that it is determined by the value of
k
0
2,7
, and k
0
2,7
= 0, so that k
0
1,0
is the same for both related key pairs and
can take 2
7
values. Each guess of k
0
1,0
proposes key candidates for byte
k
0
2,0
, where we have a 8-bit lter for the 4-tuple of related-key bytes. We
thus derive the value of k
0
1,0
in all keys and reduce the number of candidate
quartets to 2
85.5
.
3. The same trick holds for the unknown k
0
1,4
, which can get 2
7
possible values
and can be computed for both key pairs simultaneously. Each of these values
proposes four candidates for k
0
1,1
, which are ltered with an 8-bit lter. We
thus recover k
0
1,1
and k
0
1,4
and reduce the number of quartets to 2
79.5
.
4. Finally, we notice that k
0
1,4
is completely determined by k
0
1,0
, k
0
1,1
, k
0
1,2
, k
0
1,3
,
and k
0
2,7
. There are at most two candidates for the latter value as well as for
k
0
1,4
, so we get a 6-bit lter and reduce the number of quartets to 2
72.5
.
5. Each quartet also proposes two candidates for each of key bytes k
0
0,0
, k
0
2,2
,
and k
0
3,3
. Totally, the number of key candidates proposed by each quartet is
2
6
.
The key candidates are proposed for 11 bytes of each of four related keys.
However, these bytes are strongly related so the number of independent key
bytes on which the voting is performed is signicantly smaller than 11 4. At
least, bytes k
0
0,0
, k
0
1,1
, k
0
2,2
and k
0
3,3
of K
A
and K
C
are independent so we recover
15 key bytes with 2
78.5
proposals. The probability that three wrong quartets
propose the same candidates does not exceed 2
80
.
We thus estimate the complexity of the ltering step as 2
77.5
time and mem-
ory. We recover 3 7 + 8 8 = 85 bits of of K
A
(and 85 bits of K
C
) with 2
99.5
data and time and 2
77.5
memory.
The remaining part of the key can be found with many approaches. One is
to relax the condition on one of the active S-boxes in round 3 thus getting four
more active S-boxes in round 2, which in turn leads to a full-dierence state
in round 1. The condition can be actually relaxed only for the rst part of the
boomerang (the key pair (K
A
, K
B
)) thus giving a better output lter. For each
candidate quartet we use the key bytes, that were recovered at the previous
step, to compute A
1
and thus signicantly reduce the number of keys that are
proposed by a quartet. We then rank candidates for the rst four columns of
K
0
A
and take the candidate that gets the maximal number of votes. Since we
do not make key guesses, we expect that the complexity of this step is smaller
than the complexity of the previous step (2
99.5
). The right quartet also provide
information about four more bytes in the right half of K
0
A
that correspond to
the four active S-boxes in round 2. The remaining 8 bytes of K
A
can be found
by exhaustive search.
6 Attack on AES-192
The key schedule of AES-192 has better diusion, so it is hard to avoid active S-
boxes in the subkeys. We construct a related-key boomerang attack with two sub-
trails of 6 rounds each. The attack is an amplied-boomerang attack because we
have to deal with truncated dierences in both the plaintext and the ciphertext,
the latter would be expensive to handle in a plain boomerang attack.
6.1 The trail
The trail is depicted in Figure 8, and the actual values are listed in Tables 4
and 5. The key schedule codeword is depicted in Figure 6.
Related keys We dene the relation between four keys similarly to the attack
on AES-256. Assume we are given a key K
A
, which the attacker tries to nd.
We compute its subkey K
1
A
and apply the dierence K
1
to get the subkey K
1
B
,
from which the key K
B
is computed. Then we compute the subkeys K
4
A
and
K
4
B
and apply the dierence K
4
to them. We get subkeys K
4
C
and K
4
D
, from
which the keys K
C
and K
D
are computed.
Now we prove that keys K
A
, K
B
, K
C
, and K
D
form a quartet, i.e. the
subkeys of K
C
and K
D
satisfy the equations K
l
C
K
l
D
= K
l
, l = 1, 2, 3.
Disturbance
Correction
Key schedule
+
=
E
0
Disturbance
Correction
Key schedule
+
=
E
1
Fig. 6. AES-192 key schedule codeword.
The only active S-box is positioned between K
3
and K
4
, whose input is k
3
0,5
.
However, this S-box gets the same pair of inputs in both key pairs (see the
Feistel switch in Sec. 4.2). Indeed, if we compute k
3
0,5
from K
4
, then it is
equal to k
3
0,5
= 0x01. Therefore, if the active S-box gets as input and 1
in K
A
and K
B
, respectively, then it gets a1 and a in K
C
and K
D
, respectively.
As a result, K
3
C
K
3
D
= K
3
, the further propagation is linear, so the four keys
form a quartet.
Due to a slow diusion in the backward direction, we can compute some
values in K
l
even for small l (Table 5). Hence given k
l
i,j
for K
A
we can partly
compute K
B
, K
C
and K
D
, which provides additional ltration in the attack.
Internal state The plaintext dierence is specied in 10 bytes
c
c
c
c
c c
c
c c
c
, the dif-
ference in the other six bytes not restricted. The three active S-boxes in rounds
24 are passed with probability 2
6
each. In round 6 (the switching round) we
ask for the xed dierence only in a
6
0,2
, the other two S-boxes can output any
dierence such that it is the same as in the second related-key pair. Therefore,
the amplied probability of round 6 equals to 2
623.5
= 2
13
. We switch be-
tween the two trails before the key addition in round 6 in all bytes except b
6
0,2
,
where we switch after the S-box application in round 7 (the Ladder switch). This
trick allows us not to take into account the only active S-box in the lower trail
in round 7. The overall probability of the rounds 36 is 2
3613
= 2
31
.
The lower trail has 8 active S-boxes in rounds 812. Only the rst four active
S-boxes are restricted in the output dierence, which gives us probability 2
24
for the lower trail. The ciphertext dierence is fully specied in the middle two
rows, and has 35 bits of entropy in the other bytes. More precisely, each c
0,
is
taken from a set of size 2
7
, and all the c
3,
should be the same on both sides
of the boomerang and again should belong to a set of size 2
7
. Therefore, the
ciphertext dierence gives us a 93-bit lter.
6.2 The attack
We compose 2
73
structures of type
c
c
c
c
c c
c
c c
c
with 2
48
texts each. Then we encrypt
all the texts with the keys K
A
and K
C
, and their complements w.r.t. P on
K
B
and K
D
. We keep all the data in memory and analyze it with the following
procedure:
1. Compose all candidate plaintext pairs for the key pairs (K
A
, K
B
) and (K
C
, K
D
).
2. Compose and store all the candidate quartets of the ciphertexts.
3. For each guess of the subkey bytes: k
0
0,3
, k
0
2,3
, and k
0
0,5
in K
A
; k
7
0,5
in K
A
and K
B
:
(a) Derive values for these bytes in all the keys from the dierential trail.
Derive the yet unknown key dierences in K
0
and K
8
.
(b) Filter out candidate quartets that contradict K
8
.
(c) Prepare counters for the yet unknown subkey bytes that correspond to
active S-boxes in the rst two rounds and in the last round: k
0
0,0
, k
0
0,1
,
k
0
1,2
, k
0
3,0
in keys K
A
and K
C
, k
8
0,0
, k
8
0,1
, k
8
0,2
, k
8
0,3
in keys K
A
and
K
B
, i.e. 16 bytes in total.
(d) For each candidate quartet derive possible values for these unknown
bytes and increase the counters.
(e) Pick the group of 16 subkey bytes with the maximal number of votes.
(f) Try all possible values of the yet unknown 9 key bytes in K
0
and check
whether it is the right key. If not then go to the rst step.
Right quartets. Let us rst count the number of right quartets in the data.
Evidently, there exist 2
128
pairs of internal states with the dierence A
2
.
The inverse application of 1.5 rounds maps these pairs into structures that we
have dened, with 2
48
pairs per structure. Therefore, each structure has 2
48
pairs that pass 1.5 rounds, and 2
73
structures have 2
121
pairs. Of these pairs
2
(12131)2128
= 2
52
right quartets can be composed after the switch in the
middle. Of these quartets 2
52224
= 16 right quartets come out of the last
round.
Steps 12. The rst two steps are the most time-consuming part of the attack.
We propose the following approach based on the ideas of [5] and the fact that
pairs of plaintexts in a right quartet should belong to the same structure:
Having encrypted the data, group all the ciphertexts into buckets according
to the 88-bit ciphertext lter: xed dierences in the middle rows, equal
dierences in the last row.
Prepare a two-dimensional table of plaintexts indexed by the indices of struc-
tures and a key.
For every pair (C
A
, C
C
) of ciphertexts in a same bucket, that were encrypted
under K
A
and K
C
, respectively:
Check if the pair satises the additional 5-bit lter in the dierences
corresponding to the active S-boxes, where there are only 2
7
possibilities
per byte.
If yes, detect structures S
A
and S
C
, to which the corresponding plain-
texts belong, and insert the pair in a table into a cell indexed by these
structures.
Repeat the previous step for the keys K
B
and K
D
.
For every pair of structures compose all the possible quartets of plaintexts.
Put all the quartets into a hash table indexed by the two dierences c
3,0
.
Every bucket contains 2
(12188)25
= 2
61
pairs. The overall number of pairs is
2
88+61
= 2
149
pairs, or 2
3
pairs for a pair of structures. Therefore, we compose
2
32+732
= 2
152
candidate quartets and then rank them according to c
3,0
. We
thus get 2
14
groups of quartets each having 2
138
candidate quartets.
Step 3(a-b) (ltering). We apply the K
8
lter and analyze 2
138
candidate
quartets. We know k
0
0,3
from the guess and a
1
0,3
from the trail, which gives
us an 8-bit lter on p
0,3
, and a 16-bit lter on the quartets. We also know k
0
2,3
so for both pairs of plaintexts in each of remaining 2
122
quartets we compute
a
1
2,3
. Since a
1
2,3
is a value in the column that should collapse to one non-zero
byte b
1
0,1
by MixColumns, we derive all the values on its diagonal and b
1
0,1
.
Actually, the value of b
1
0,1
can be restricted to two options, since we know k
0
0,1
,
k
0
0,1
, and a
2
0,1
. For a given dierence in the plaintext and provided with A
1
there exist 8 possible combinations of k
0
0,1
, k
0
1,2
, and k
0
3,0
, and the probability
that any of them matches the two options for a
1
0,1
is 1/16. Therefore, the value
of p
2,3
restricts the other three dierences on its diagonal by 3 + 4 = 7 bits.
One more bit comes from the fact that only half of a
1
0,1
can be converted into
a
2
0,1
. Therefore, we consider only 2
1222(7+1)
= 2
106
quartets.
Step 3(c-e) (counting). Both pairs of plaintexts in a quartet propose key can-
didates: the rst pair for K
A
and the second pair for K
B
. Each pair proposes
one candidate for (k
0
0,1
, k
0
1,2
, k
0
3,0
) and the two candidates for k
0
0,0
, so we have 2
10
candidates for 16 key bytes. The probability that 16 false quartets propose the
same key candidate can be upper bounded by 2
1161612815
= 2
64
.
The overall time complexity is the number of quartets analyzed at Step 3
times the number of the key guesses. Now we point out that we do not have to
guess all 8 bits of k
7
0,5
since we need only the output S-box dierence. Therefore,
we try 2
38+27
= 2
38
key guesses, so the time complexity of the attack is 2
138+38
= 2
176
, and the data complexity is 2
123
.
7 Conclusions
We presented related-key boomerang attacks on the full AES-192 and the full
AES-256. The dierential trails for the attacks are based on the idea of nding
local collisions in the block cipher. We showed that optimal key-schedule trails
should be based on low-weight codewords in the key schedule. We also exploit
various boomerang-switching techniques, which help us to gain free rounds in
the middle of the cipher. However, both our attacks are still mainly of theoretical
interest and do not present a threat to practical applications using AES.
Acknowledgements. The authors thank Vincent Rijmen and anonymous re-
viewers for their valuable comments, which helped to improve the paper. Dmitry
Khovratovich is supported by PRP Security & Trust grant of the University
of Luxembourg.
References
1. FIPS-197: Advanced Encryption Standard, November 2001, available at http://
csrc.nist.gov/publications/fips/fips197/fips-197.pdf.
2. Mihir Bellare and Tadayoshi Kohno. A theoretical treatment of related-key attacks:
Rka-prps, rka-prfs, and applications. In EUROCRYPT03, volume 2656 of LNCS,
pages 491506. Springer, 2003.
3. Eli Biham. New types of cryptanalytic attacks using related keys. J. Cryptology,
7(4):229246, 1994.
4. Eli Biham, Orr Dunkelman, and Nathan Keller. The rectangle attack - rectangling
the Serpent. In EUROCRYPT01, volume 2045 of LNCS, pages 340357. Springer,
2001.
5. Eli Biham, Orr Dunkelman, and Nathan Keller. New results on boomerang and
rectangle attacks. In FSE02, volume 2365 of LNCS, pages 116. Springer, 2002.
6. Eli Biham, Orr Dunkelman, and Nathan Keller. Related-key boomerang and
rectangle attacks. In EUROCRYPT05, volume 3494 of LNCS, pages 507525.
Springer, 2005.
7. Alex Biryukov, Dmitry Khovratovich, and Ivica Nikolic. Distinguisher and related-
key attack on the full AES-256. In CRYPTO09, volume 5677 of LNCS, pages
231249. Springer, 2009.
8. Alex Biryukov, Dmitry Khovratovich, and Ivica Nikolic. Examples of dierential
multicollisions for 13 and 14 rounds of AES-256, 2009, available at http://eprint.
iacr.org/2009/242.pdf.
9. Florent Chabaud and Antoine Joux. Dierential collisions in SHA-0. In
CRYPTO98, 1998.
10. Joan Daemen and Vincent Rijmen. The Design of Rijndael. AES the Advanced
Encryption Standard. Springer, 2002.
11. Niels Ferguson, John Kelsey, Stefan Lucks, Bruce Schneier, Michael Stay, David
Wagner, and Doug Whiting. Improved cryptanalysis of Rijndael. In FSE00,
volume 1978 of LNCS, pages 213230. Springer, 2000.
12. Henri Gilbert and Marine Minier. A collision attack on 7 rounds of Rijndael. In
AES Candidate Conference, pages 230241, 2000.
13. John Kelsey, Tadayoshi Kohno, and Bruce Schneier. Amplied boomerang attacks
against reduced-round MARS and Serpent. In FSE00, volume 1978 of LNCS,
pages 7593. Springer, 2000.
14. Jongsung Kim, Seokhie Hong, and Bart Preneel. Related-key rectangle attacks on
reduced AES-192 and AES-256. In FSE07, volume 4593 of LNCS, pages 225241.
Springer, 2007.
15. Stefan Lucks. Ciphers secure against related-key attacks. In FSE04, volume 3017
of LNCS, pages 359370. Springer, 2004.
16. David Wagner. The boomerang attack. In FSE99, volume 1636 of LNCS, pages
156170. Springer, 1999.
Disclaimer on colors. We intensively use colors in our gures in order to
provide better understanding on the trail construction. In gures, dierent colors
refer to dierent values, which is hard to depict in black and white. However,
we also list all the trail dierences in the tables, so all the color information is
actually dubbed.
Trail details. By A
i
we denote the upper trail dierence in the internal
state after the S-box layer, and by A
i
the same for the lower trail.
Table 2. Key schedule dierence in the AES-256 trail.
K
i
0
? 00 00 00 3e 3e 3e 3e
? 01 01 01 ? 21 21 21
? 00 00 00 1f 1f 1f 1f
? 00 00 00 1f 1f 1f 1f
1
00 00 00 00 3e 00 3e 00
00 01 00 01 21 00 21 00
00 00 00 00 1f 00 1f 00
00 00 00 00 1f 00 1f 00
2
00 00 00 00 3e 3e 00 00
00 01 01 00 21 21 00 00
00 00 00 00 1f 1f 00 00
00 00 00 00 1f 1f 00 00
3
00 00 00 00 3e 00 00 00
00 01 00 00 21 00 00 00
00 00 00 00 1f 00 00 00
00 00 00 00 1f 00 00 00
4
00 00 00 00 3e 3e 3e 3e
00 01 01 01 ? ? ? ?
00 00 00 00 1f 1f 1f 1f
00 00 00 00 1f 1f 1f 1f
K
i
0
? ? ? ? ? ? ? 00
X X X X 1f 1f 1f 00
? ? ? ? 1f 1f 1f 00
? ? ? ? 21 21 21 00
1
? 01 ? 00 ? ? 00 00
X 00 X 00 1f 1f 00 00
? 00 ? 00 1f 1f 00 00
? 00 ? 00 21 21 00 00
2
? ? 00 00 ? 00 00 00
X X 00 00 1f 00 00 00
? ? 00 00 1f 00 00 00
? ? 00 00 21 00 00 00
3
? 01 01 01 3e 3e 3e 3e
X 00 00 00 1f 1f 1f 1f
? 00 00 00 1f 1f 1f 1f
? 00 00 00 21 21 21 21
4
01 00 01 00 3e 00 3e 00
00 00 00 00 1f 00 1f 00
00 00 00 00 1f 00 1f 00
00 00 00 00 21 00 21 00
5
01 01 00 00 3e 3e 00 00
00 00 00 00 1f 1f 00 00
00 00 00 00 1f 1f 00 00
00 00 00 00 21 21 00 00
6
01 00 00 00 3e 00 00 00
00 00 00 00 1f 00 00 00
00 00 00 00 1f 00 00 00
00 00 00 00 21 00 00 00
7
01 01 01 01 ? ? ? ?
00 00 00 00 1f 1f 1f 1f
00 00 00 00 1f 1f 1f 1f
00 00 00 00 21 21 21 21
Table 3. Non-zero internal state dierences in the AES-256 trail.
P
? 00 00 00
? ? ? ?
? 00 ? 00
? 00 00 ?
A
1
? 00 00 00
1f ? 1f 1f
00 00 ? 00
00 00 00 ?
A
3
00 00 00 00
00 1f 00 1f
00 00 00 00
00 00 00 00
A
5
00 00 00 00
00 1f 1f 00
00 00 00 00
00 00 00 00
A
7
00 00 00 00
00 1f 00 00
00 00 00 00
00 00 00 00
A
7
1f 1f 1f 1f
00 00 00 00
00 00 00 00
00 00 00 00
A
9
1f 00 1f 00
00 00 00 00
00 00 00 00
00 00 00 00
A
11
1f 1f 00 00
00 00 00 00
00 00 00 00
00 00 00 00
A
13
1f 00 00 00
00 00 00 00
00 00 00 00
00 00 00 00
C
00 00 00 00
00 00 00 00
00 00 00 00
00 00 00 00
Table 4. Internal state dierence in the AES-192 trail.
P
? ? 3e ?
1f 1f ? 1f
1f 1f 1f ?
? 21 21 21
A
1
1f ? 00 1f
00 00 ? 00
00 00 00 ?
? 00 00 00
A
2
00 1f 00 00
00 00 00 00
00 00 00 00
00 00 00 00
A
3
00 1f 1f 00
00 00 00 00
00 00 00 00
00 00 00 00
A
4
00 00 00 1f
00 00 00 00
00 00 00 00
00 00 00 00
A
5
00 00 00 00
00 00 00 00
00 00 00 00
00 00 00 00
A
6
00 1f 1f 1f
00 00 00 00
00 00 00 00
00 00 00 00
A
7
00 00 00 1f
00 00 00 00
00 00 00 00
00 00 00 00
A
6
1f 1f 1f 1f
00 00 00 00
00 00 00 00
00 00 00 00
A
7
00 00 1f 00
00 00 00 00
00 00 00 00
00 00 00 00
A
8
1f 00 00 00
00 00 00 00
00 00 00 00
00 00 00 00
A
9
1f 1f 00 00
00 00 00 00
00 00 00 00
00 00 00 00
A
10
00 00 1f 00
00 00 00 00
00 00 00 00
00 00 00 00
A
11
00 00 00 00
00 00 00 00
00 00 00 00
00 00 00 00
A
12
? ? ? ?
00 00 00 00
00 00 00 00
00 00 00 00
C
? ? ? ?
1f 1f 1f 1f
1f 1f 1f 1f
? ? ? ?
Table 5. Key schedule dierence in the AES-192 trail.
K
0
00 3e 3e 3f 3e 01
00 1f 1f 1f 1f 00
00 1f 1f 1f 1f 00
? 21 21 21 21 00
K
1
00 3e 00 3f 01 00
00 1f 00 1f 00 00
00 1f 00 1f 00 00
00 21 00 21 00 00
K
2
00 3e 3e 01 00 00
00 1f 1f 00 00 00
00 1f 1f 00 00 00
00 21 21 00 00 00
K
3
00 3e 00 01 01 01
00 1f 00 00 00 00
00 1f 00 00 00 00
00 21 00 00 00 00
K
4
00 3e 3e 3f 3e 3f
00 1f 1f 1f 1f 1f
00 1f 1f 1f 1f 1f
? ? ? ? ? ?
K
0
? ? ? 3e 3f 3e
? ? ? 1f 1f 1f
? ? ? 1f 1f 1f
? ? ? ? 21 21
K
1
? ? 3f 01 3e 00
? ? 1f 00 1f 00
? ? 1f 00 1f 00
? ? ? 00 21 00
K
2
? 3e 01 00 3e 3e
? 1f 00 00 1f 1f
? 1f 00 00 1f 1f
? ? 00 00 21 21
K
3
3e 00 01 01 3f 01
1f 00 00 00 1f 00
1f 00 00 00 1f 00
? 00 00 00 21 00
K
4
3e 3e 3f 3e 01 00
1f 1f 1f 1f 00 00
1f 1f 1f 1f 00 00
21 21 21 21 00 00
K
5
3e 00 3f 01 00 00
1f 00 1f 00 00 00
1f 00 1f 00 00 00
21 00 21 00 00 00
K
6
3e 3e 01 00 00 00
1f 1f 00 00 00 00
1f 1f 00 00 00 00
21 21 00 00 00 00
K
7
3e 00 01 01 01 01
1f 00 00 00 00 00
1f 00 00 00 00 00
21 00 00 00 00 00
K
8
3e 3e 3f 3e 3f 3e
1f 1f 1f 1f 1f 1f
1f 1f 1f 1f 1f 1f
? ? ? ? ? ?
SubBytes
ShiftRows
MixColumns
RC SB AC
ShiftRows
MixColumns
SB
9
10
11
SubBytes
ShiftRows
MixColumns
RC SB AC
ShiftRows
MixColumns
SB
SubBytes
ShiftRows
MixColumns
RC SB AC
ShiftRows
MixColumns
SB
12
13
SubBytes
SubBytes
ShiftRows
MixColumns
SubBytes
0
4
5
6
14
RC SB AC
ShiftRows
SB
SubBytes
ShiftRows
MixColumns
RC SB AC
ShiftRows
MixColumns
SB
SubBytes
ShiftRows
MixColumns
RC SB AC
ShiftRows
MixColumns
SB
SubBytes
SubBytes
ShiftRows
MixColumns
SubBytes
RC SB AC
ShiftRows
MixColumns
SB
7
6
5
4
3
1
RC SB AC
ShiftRows
MixColumns
SB
8
2
1
2
Switch
2
30
2
18
E
0
E
1
3
4
8
7
SubBytes
SubBytes
SubBytes
SubBytes
Fig. 7. AES-256 E
0
and E
1
trails. Green ovals show an overlap between the two
trails where the switch happens.
ShiftRows
MixColumns
ShiftRows
MixColumns
ShiftRows
MixColumns
ShiftRows
MixColumns
KS
KS
KS
SubBytes
SubBytes
SubBytes
SubBytes
ShiftRows
MixColumns
ShiftRows
SubBytes
SubBytes
ShiftRows
MixColumns
ShiftRows
MixColumns
ShiftRows
MixColumns
ShiftRows
MixColumns
KS
KS
KS
KS
KS
SubBytes
SubBytes
SubBytes
SubBytes
ShiftRows
MixColumns
ShiftRows
MixColumns
ShiftRows
MixColumns
SubBytes
SubBytes
SubBytes
1
2
3
4
5
7
8
9
10
11
12
7
ShiftRows
MixColumns
KS
KS
SubBytes
6
6
Switch
K
0
K
1
K
2
K
3
K
4
K
4
K
5
K
6
K
7
K
8
2
31
2
24
Fig. 8. AES-192 trail.

You might also like