Seminal Papers
Seminal Papers
Abstract—The combination network is one of the simplest and the starting node of the edge. When we consider the function
insightful networks in coding theory. The vector network coding on the edge as a linear combination of the h messages, the
solutions for this network and some of its sub-networks are vector of coding coefficients (for the h messages) is called the
examined. For a fixed alphabet size of a vector network coding
solution, an upper bound on the number of nodes in the network global coding vector. To recover the h messages, a receiver R
is obtained. This bound is an MDS bound for subspaces over a should obtain h global coding vectors whose linear span has
finite field. A family of sub-networks of combination networks dimension h. In other words, the h × h matrix formed by these
is defined. It is proved that for this family of networks, which h global coding vectors should be invertible. This h × h matrix
are minimal multicast networks, there is a gap in the minimum is called a transfer matrix of R. The previous description
alphabet size between vector network coding solutions and scalar
network coding solutions. This gap is obtained for any number constitutes the framework for scalar linear network coding.
of messages and is based on coloring of the q-Kneser graph and The framework for vector network coding was presented in [9].
a new hypergraph generalization for it. Each message and each packet is a vector of length t and
the coding coefficients are t × t matrices. The global coding
I. I NTRODUCTION vectors, on the edges, consist of h matrices of size t × t, which
Network coding has been attracting increased attention together form t × (ht) matrices. W.l.o.g., we assume that each
for almost two decades since the seminal papers [1], [16]. t × (ht) matrix of a global coding vector is a generator matrix
Multicast networks have received most of this attention. An of a t-subspace of F htq . To recover the h messages, a receiver
recent survey on the foundation of multicast network coding R should have on its ℓ incoming edges, ℓ > h, h such global
can be found in [13]. The multicast network-coding problem coding vectors which form together an (ℓt) × (ht) transfer
can be formulated as follows: given a network with one source matrix of rank ht.
which has h messages, for each edge find a function of the The field size of the solution is an important parameter that
packets received at the starting node of the edge, such that each directly influences the complexity of the calculations at the
receiver can recover all the messages from its received packets. network nodes. It is known that any field size q > N suffices
Such an assignment of a function to each edge is called a for a solution. However, it is conjectured that the smallest
solution for the network. Therefore, the received packets on an field size allowing a solution is much smaller [12], [13]. An
edge can be expressed as functions of the source messages. If efficient algorithm to find such a field size and the related
these functions are linear, we obtain a linear network coding network code was given in [15]. It is conjectured that the
solution, otherwise we have a nonlinear solution. In linear minimum alphabet size is much smaller, but this was proved
network coding, each linear function on an edge consists of only for two messages [12]. For this purpose we distinguish
coding coefficients for each incoming packet. If the coding between the smallest alphabet size required for each one of the
coefficients and the packets are scalars, it is called a scalar three types of network coding solutions. Given a network N ,
network coding solution. If the messages and the packets are we define qs (N ) to be the smallest field size q for which N
vectors and the coding coefficients are matrices then it is has a scalar linear solution. Similarly, qn (N ) is the smallest
called a vector network coding solution. A network which alphabet size q (q not necessarily a prime power) for which
has a solution is called a solvable network. It is well-known N has a scalar nonlinear solution, and qv (N ) is the smallest
that a multicast network with one source, h messages, and value qt , q a prime power, such that N has a vector solution
N receivers, is solvable if and only if the min-cut between the over F tq . By definition, qs (N ) > qv (N ) > qn (N ), and we
source and each receiver is at least h [13]. define the vector gap by
The functions on the edges of the network form the network
gapv (N ) , qs (N ) − qv (N ).
code. The coding coefficients form the network coding vectors
on the edges. The vector of coding coefficients is called Two other gaps (qs (N ) − qn (N ) and qv (N ) − qn (N )) are
the local coding vector when the function on the edge is defined similarly, but this paper will be mostly devoted to the
considered as a linear combination of the packets received at vector gap.
One of the most celebrated families of networks is the networks called minimal multicast networks, for any number of
family of combination networks [22], which were used for messages. The gap is increasing with the number of messages.
various topics in network coding. The Nh,r,s combination This also proves the existence of a gap for two messages which
network, where s > h, is shown in Fig. 1. The network has was left open in [11]. The networks which will be used for this
three layers: the first layer consists of a single source with h purpose are sub-networks of the combination networks. The
messages. The source transmits r packets to the r nodes of the proof will be based on the chromatic number of the q-Kneser
middle layer. Any s nodes in the middle layer are connected graph and a generalized version of it, the q-Kneser hypergraph,
to a receiver, and each one of the (rs) receivers demands all which was not defined before. The coloring problem raises
the h messages. It was proved in [22] that a solution for such an intriguing combinatorial problem which has independent
a network exists if and only if a related error-correcting code intellectual merit. Several more related problems will be
exists. This network was also generalized to compare scalar presented in Section IV and will be considered in the full
and vector network coding [11]. Its sub-networks were used version of this paper. The same is true for some proofs of
to prove that finding the minimum required field size of a claims in the paper.
(linear or nonlinear) scalar network code for a certain multicast
II. V ECTOR S OLUTION AND B OUND FOR MDS C ODES
network is NP-complete [21].
In this section, we first describe the three types of solutions
x1 , . . . , x h for the Nh,r,s combination network. The key result is the
following theorem proved in [22]. Let (r, qh , r − s + 1)q
denote a code over F q of length r with qh codewords and
minimum Hamming distance r − s + 1. If this code is linear,
...... r nodes it is denoted by [r, h, r − s + 1]q .
s edges Theorem 1. ( [22]) The Nh,r,s combination network is solvable
over F q if and only if there exists an (r, qh , r − s + 1)q code.
(rs) receivers
In view of Theorem 1, what are the functions on the edges of
the Nh,r,s combination network in the three types of solutions?
Figure 1. The N h,r,s combination network: it has an edge from the source to
each of the r nodes in the middle layer. Each of the (rs) receivers is connected 1) For the scalar nonlinear solution, an (r, qh , r − s + 1)q
to a unique set of s middle-layer nodes, and demands all of the h source code, each coordinate in a codeword is a function of
messages.
h information symbols which are represented by the
The goal of this work is to consider two problems which are h messages. The function for the ith symbol of a
related to vector coding solutions for combination networks codeword is the function on the link from the source
and their sub-networks. In Section II, we describe network to the ith node in the middle layer.
coding solutions (vector, scalar, linear and nonlinear) for the 2) For the scalar linear solution, an [r, h, r − s + 1]q code
Nh,r,s combination network. In particular, we consider the is required. It has an r × h generator matrix and the
Nh,r,h combination network and the maximum number of h entries of its ith column are the coding coefficients of
nodes in the middle layer for such a network. This number is the linear function on the link from the source to the ith
related to the largest length of certain MDS codes. While there node in the middle layer.
exists a proof on the upper bound of such length for linear and In both cases, the nodes of the middle layer transmit their
nonlinear codes, we are not aware on any proof based on the information to the related receivers. Each receiver obtains s
properties of the subspaces. These codes are also MDS array symbols from the middle-layer nodes, each one has the same
codes which were considered in the past for storage [5] and global coding vector on its incoming and outgoing edges.
are very popular today as distributed-storage codes, e.g., see Since the minimum Hamming distance of the code is r − s + 1,
[8], [24] and references therein. In Section III, the vector gap it follows that for each two different sets of h messages, each
is considered. Such vector gaps, which are very large, were receiver obtains a different s-tuple of symbols from the middle
considered in [11] for any number of messages h > 2. The layer nodes. Hence, it can recover the h messages.
networks which were used for the proof are generalizations of For the vector network coding solution, the h matrices of
the combination networks in which for each receiver there are size t × t on the edges from the source to the middle-layer
some redundant edges on the paths between the source and the nodes form together a t × (ht) matrix which has dimension t,
receiver. The extra edges were used to distribute the (ht)-space i.e., it represents a t-subspace of F ht
q . Now, to have a solution
formed by the h vector messages of length t on more than h for the Nh,r,s combination network, each s subspaces, related
edges. This enables some edges to transmit only a fraction of to the edges between the source and the middle-layer nodes,
a one-dimensional space. However, a similar idea cannot be span the (ht)-space defined by the messages of the source.
used for scalar linear network coding. The question whether A fundamental combinatorial structure that underpins some
such gaps can be obtained if there are no such redundant edges of the generalized combination networks is a structure we call
remained open. In Section III, we give a positive answer to a (t; h, α)q -independent configuration. We use [Vt ] to denote
this question and prove that there exists a vector gap in such all the t-dimensional subspaces of a vector space V, and [ba]
to denote the Gaussian coefficient (where the field size q is that receiver has a full rank (ht) × (ht) transfer matrix from
understood from context). which it can recover the h messages.
Definition 2. Let q be a prime power, t, h, α be positive integers, Lemmas 3 and 4 form a generalization for an upper bound
α 6 h, and denote V = F ht on the length of MDS code (use α = h in Lemma 3). The
q . A ( t; h, α ) q -independent configu-
related results for (scalar) linear codes are given in [18].
ration (IC) is a set C = {V1 , V2 , . . . , Vm } ⊆ [Vt ], such that for
Corollary 7 [18, p. 321] asserts that for an [n, k, n − k + 1]q
all 1 6 i1 < i2 < · · · < iα 6 m,
MDS code, we have that n 6 q + k − 1. This result is
dim(Vi1 + Vi2 + · · · + Viα ) = αt. strengthened in Theorem 11 [18, p. 326] by using a more
complicated proof based on projective geometry. The theorem
We say |C| = m is the size of the IC. asserts that if k > 3 and q is odd then n 6 q + k − 2. A more
Lemma 3. Let C be a (t; h, α)q -IC. If α > 2 then complicated proof for the same result is given for nonlinear
codes in [20, pp. 12-13].
q ( h − α +2 ) t − 1 Lemmas 3 and 4 can be generalized for a family of net-
|C| 6 + α − 2.
qt − 1 works which generalize the combination network [11]. Some
Proof: If α = 2 the claim is immediate by considering interesting consequences implied by this generalization will
the size of a t-spread [10]. be discussed in the full version of this paper.
Assume now α > 2, and denote V , F ht q . Let us write
We can use Lemma 4 to upper bound the vector gap
C = {V1 , V2 , . . . , Vm }, and define in the Nh,r,h combination networks. For this we will use
Bertrand’s postulate (e.g., see [2]) that the interval [n, 2n]
W1 , V1 + V2 + · · · + Vα−2 , contains a prime power for any integer n; and that the interval
where dim(W1 ) = (α − 2)t. By the definition of an IC, F ht [ x, x + x21/40 ] contains a prime for all large enough x [3].
q =
This implies the following result.
W1 + W2 , where W2 ∈ [( h−αV+2) t]. It follows that any vector
v ∈ Vj , α − 1 6 j 6 m, may be written uniquely as v = Theorem 5. For all positive integers h and r, let N denote the
v1 + v2 , where v1 ∈ W1 and v2 ∈ W2 . We now define Nh,r,h combination network. Then gapv (N ) 6 r + h − 3, and
for all large enough r, gapv (N ) 6 (r − 1)21/40 + h − 2.
Vj′ , v2 : v1 + v2 ∈ Vj , v1 ∈ W1 , v2 ∈ W2 ,