Abstract—We study shaping codes for noiseless finite-state channels with cost and i.i.d. sources. We establish a relationship between the code rate and the minimum average symbol cost. We then determine the rate that minimizes the average cost per source symbol (total cost). An equivalence is established between codes minimizing average symbol cost and codes minimizing total cost, and a separation theorem is proved, showing that optimal shaping can be achieved by a concatenation of optimal compression and optimal shaping for a uniform i.i.d. source.

I. INTRODUCTION

Shaping codes are used to encode information for use on channels with symbol costs under an average cost constraint. They find application in data transmission with a power constraint, where constellation shaping is achieved by addressing into a suitably designed multidimensional constellation or, equivalently, by incorporating, either explicitly or implicitly, some form of non-equiprobable signaling. More recently, shaping codes have been proposed for use in data storage applications: coding for flash memory to reduce device wear [17], and coding for efficient DNA synthesis in DNA-based storage [14]. Motivated by these applications, [18] investigated information-theoretic properties and design of rate-constrained fixed-to-variable length shaping codes for memoryless noiseless channels with cost and general i.i.d. sources. In this paper, we extend the results in [18] to rate-constrained shaping codes for finite-state noiseless channels with cost and general i.i.d. sources.

Finite-state noiseless channels with cost trace their conceptual origins to Shannon's 1948 paper that launched the study of information theory [23]. In that paper, Shannon considered the problem of transmitting information over a telegraph channel. The telegraph channel is a finite-state graph, and the channel symbols – dots and dashes – have different time durations, which can be interpreted as integer transmission costs. Shannon defined the combinatorial capacity of this channel and gave an explicit formula. He also determined the symbol probabilities that maximize the entropy per unit cost, and showed the equivalence of this probabilistic definition of capacity to the combinatorial capacity. In [4], this result was generalized to arbitrary non-negative symbol costs. In [11], a new proof technique for deriving the combinatorial capacity was introduced for non-integer costs, and another proof of the equivalence of the combinatorial and probabilistic definitions of capacity was given. In [2] and [3], a generating function approach was used to extend the equivalence to a larger class of constrained systems.

We refer to the problem of designing codes that achieve the capacity, i.e., that maximize the information rate per unit cost, or, equivalently, that minimize the cost per information bit, as the type-II coding problem. In [22], an arithmetic coding technique for finite-state noiseless channels was introduced. Several works extend coding algorithms for memoryless channels to finite-state channels. In [2], a finite-state graph was transformed into its memoryless representation M, and a normalized geometric Huffman code was used to design an asymptotically capacity-achieving code on M. In [8], the author extended the dynamic programming algorithm introduced in [7] to finite-state channels. The proposed algorithm finds locally optimal codes for each starting state, but it does not guarantee global optimality. In [6], an iterative algorithm that can find globally optimal codes was proposed.

The concepts of combinatorial capacity and probabilistic capacity can be generalized to the setting where there is a constraint on the average cost per transmitted channel symbol. The probabilistic capacity was determined in [20] and [9], where the entropy-maximizing stationary Markov chain satisfying the average cost constraint was found. The relationship between cost-constrained combinatorial capacity and probabilistic capacity was also addressed in [10]. The equivalence of the two definitions of cost-constrained capacity was proved in [25], and an alternative proof was recently given in [15], where methods of analytic combinatorics in several variables were used to directly evaluate the cost-constrained combinatorial capacity.

We refer to the problem of designing codes that achieve the cost-constrained capacity as the type-I coding problem. This problem has also been addressed by several authors. In [10], an asymptotically optimal block code was introduced by considering codewords that start and end at the same state. In [12], the authors construct fixed-to-fixed length and variable-to-fixed length codes based on state-splitting methods [1] for magnetic recording and constellation shaping applications. Other constructions can be found in [13], [24], and [26].

In this paper, we address the problem of designing shaping codes for noiseless finite-state channels with cost and general i.i.d. sources. We systematically study the fundamental properties of these codes from the perspective of symbol distribution, average cost, and entropy rate using the theory of finite-state word-valued sources. We derive fundamental bounds relating these quantities and establish an equivalence between optimal type-I and type-II shaping codes. A generalization of Varn coding [28] is shown to provide an asymptotically optimal type-II shaping code for uniform i.i.d. sources. Finally, we prove separation theorems showing that optimal shaping for a general i.i.d. source can be achieved by a concatenation of optimal lossless compression with an optimal shaping code for a uniform i.i.d. source.
In Section II, we define finite-state channels with cost and review the combinatorial and probabilistic capacities associated with the type-I and type-II coding problems. In Section III, we define finite-state variable-length shaping codes for channels with cost and characterize properties of the codeword process using the theory of finite-state word-valued sources. In Section IV, we analyze shaping codes for a fixed code rate, which we call type-I shaping codes. We develop a theoretical bound on the trade-off between the rate – or, more precisely, the corresponding expansion factor – and the average cost of a type-I shaping code. We then study shaping codes that minimize the average cost per source symbol (total cost). We refer to this class of shaping codes as type-II shaping codes. We derive the relationship between the code expansion factor and the total cost and determine the optimal expansion factor. In Section V, we consider the problem of designing optimal shaping codes. We prove an equivalence theorem showing that both type-I and type-II shaping codes can be realized using a type-II shaping code for a channel with modified edge costs. Using a generalization of Varn coding [28], we propose an asymptotically optimal type-II shaping code on this modified channel for a uniform i.i.d. source. We then extend our construction to arbitrary i.i.d. sources by introducing a separation theorem, which states that optimal shaping can be achieved by a concatenation of lossless compression and optimal shaping for a uniform i.i.d. source.

Due to space constraints, we must omit many detailed proofs, which can be found in [16]. However, we remark that several new proof techniques are required to extend the results on block shaping codes for memoryless channels in [18] to the corresponding results on finite-state shaping codes for finite-state channels in this paper.

II. NOISELESS FINITE-STATE COSTLY CHANNEL

Let H = (V, E) be an irreducible finite directed graph, with vertices V and edges E. A finite-state costly channel is a noiseless channel with cost associated with H, where each edge e ∈ E is assigned a non-negative cost w(e) ≥ 0. We assume that between any pair of vertices (v_i, v_j) ∈ V × V there is at most one edge. If not, we can always convert H to another graph that satisfies this condition by state splitting [19]. An example of such a channel is given in Example 1.

Example 1. In SLC NAND flash memory, cells are arranged in a grid and programming a cell affects its neighbors. One example of this phenomenon is inter-cell interference (ICI) [27]. Cells have two states: programmed, corresponding to bit 1, and erased, corresponding to bit 0. Due to ICI, programming a cell will damage its neighboring cells. Each length-3 sequence has a cost associated with the damage to the middle bit, as shown in Table I. We can convert this table into a directed graph with vertices V = {00, 01, 10, 11}, as shown in Fig. 1.

TABLE I: Flash memory channel cost

Fig. 1: Flash memory channel

A. Channel capacity with cost constraint

The cost-constrained combinatorial capacity of the channel is defined as

C_{I,comb}(W) ≜ limsup_{n→∞} (1/n) log2 |K_n(W)|.   (1)

We also refer to this definition as the type-I combinatorial capacity. Let E be a stationary Markov process with entropy rate H(E) and average cost A(E). The probabilistic capacity for a given average cost constraint W, or cost-constrained probabilistic capacity, is

C_{I,prob}(W) ≜ sup_{E : A(E) ≤ W} H(E).   (2)

The maxentropic Markov chain for a given W was derived in [9] and [20]. The result relies on the one-step cost-enumerator matrix D(S), where S ≥ 0, with entries

d_{ij}(S) = 2^{−S w(e_{ij})} if there is an edge e_{ij} between (v_i, v_j), and d_{ij}(S) = 0 if the edge e_{ij} does not exist.   (3)

Denote by λ(S) its Perron root and by E_L = [P_{v_i}/ρ_i] and E_R = [ρ_i]^T the corresponding left and right eigenvectors, normalized such that E_L E_R = 1. Given an average cost constraint W(S), the maxentropic Markov chain has transition probabilities

P_{ij}(S) = (2^{−S w(e_{ij})} / (ρ_i λ(S))) ρ_j,   (4)

such that

W(S) = (1/λ(S)) ∑_{ij} P_{v_i} 2^{−S w(e_{ij})} (ρ_j/ρ_i) w(e_{ij}),   (5)

and the type-I probabilistic capacity of this channel is

C_{I,prob}(W(S)) = log2 λ(S) + S W(S).   (6)

It was shown in [25], [15] that C_{I,comb}(W) = C_{I,prob}(W).

B. Channel capacity without cost constraint

Denote by K(W) the number of distinct sequences e* with cost equal to W. The combinatorial capacity, or the type-II combinatorial capacity, of this channel is defined as

C_{II,comb} ≜ limsup_{W→∞} (1/W) log2 K(W).   (7)
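The quantities in equations (3)–(6) can be checked numerically. The sketch below uses an assumed two-state toy channel (not the flash channel of Example 1, whose cost table is given above): it forms D(S), obtains the Perron root λ(S) and right eigenvector ρ by power iteration, builds the maxentropic transition probabilities of eq. (4), and verifies that the chain's entropy rate equals log2 λ(S) + S·W(S), as in eq. (6).

```python
import math

# Assumed toy 2-state channel with illustrative costs; None marks a missing edge.
w = [[1.0, 2.0],
     [1.0, None]]
S = 0.5            # cost parameter S >= 0
n = len(w)

# One-step cost-enumerator matrix D(S), eq. (3): d_ij = 2^{-S w(e_ij)}, 0 if no edge.
D = [[0.0 if w[i][j] is None else 2.0 ** (-S * w[i][j]) for j in range(n)]
     for i in range(n)]

# Perron root lambda(S) and right eigenvector rho via power iteration.
rho = [1.0] * n
norm = 1.0
for _ in range(500):
    nxt = [sum(D[i][j] * rho[j] for j in range(n)) for i in range(n)]
    norm = max(nxt)
    rho = [x / norm for x in nxt]
lam = norm  # dominant eigenvalue after convergence

# Maxentropic transition probabilities, eq. (4): P_ij = rho_j 2^{-S w_ij} / (rho_i lam).
P = [[D[i][j] * rho[j] / (rho[i] * lam) for j in range(n)] for i in range(n)]
assert all(abs(sum(row) - 1.0) < 1e-9 for row in P)  # rows are stochastic

# Stationary distribution pi of the chain (left power iteration).
pi = [1.0 / n] * n
for _ in range(500):
    pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]

# Average cost W(S) and entropy rate H of the maxentropic chain.
W = sum(pi[i] * P[i][j] * w[i][j]
        for i in range(n) for j in range(n) if w[i][j] is not None)
H = -sum(pi[i] * P[i][j] * math.log2(P[i][j])
         for i in range(n) for j in range(n) if P[i][j] > 0)

# Check the capacity formula, eq. (6): H = log2 lambda(S) + S * W(S).
assert abs(H - (math.log2(lam) + S * W)) < 1e-9
```

The final assertion holds for any S and any irreducible cost matrix, since the ρ terms in log2 P_{ij} telescope under the stationary distribution; only the toy costs above are assumptions.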
We call shaping codes that achieve the minimum average cost for a given expansion factor optimal type-I shaping codes. We solve the following optimization problem:

minimize over {P̂_{e_{ij}}}:  ∑_{ij} P̂_{e_{ij}} w(e_{ij})

subject to  H(Ê) = −∑_{ij} P̂_{e_{ij}} log2 ( P̂_{e_{ij}} / ∑_j P̂_{e_{ij}} ) ≥ H(X)/f,   (29)

∑_j P̂_{e_{ji}} = ∑_j P̂_{e_{ij}},  and  ∑_{ij} P̂_{e_{ij}} = 1.

In [15], the authors discuss cost-diverse and cost-uniform graphs. A graph is cost-diverse if it has at least one pair of equal-length paths with different costs that connect the same pair of vertices. Otherwise it is called cost-uniform. It can be proved that the edge costs w(e_{ij}) of a cost-uniform graph can be expressed as w(e_{ij}) = −μ_i + μ_j − α. The following theorem gives the achievable minimum average cost of a finite-state shaping code.

Theorem 5. On a cost-diverse graph, the average cost of a type-I shaping code φ : V × X^q → E* with expansion factor f is lower bounded by

A_min(f) = ∑_{ij} P̂_{e_{ij}} w(e_{ij}) = H(X)/(S f) − log2 λ(S)/S,   (30)

where P̂_{e_{ij}} = (P̂_{v_i} 2^{−S w(e_{ij})} / (ρ_i λ(S))) ρ_j, λ(S) is the Perron root of the matrix D(S), and E_L = [P̂_{v_i}/ρ_i], E_R = [ρ_i]^T are the corresponding left and right eigenvectors such that E_L E_R = 1.

For an optimal type-II shaping code (Theorem 6), S* is the constant such that λ(S*) = 1, and E_L = [P̂*_{v_i}/ρ_i] and E_R = [ρ_i]^T are the corresponding eigenvectors such that E_L E_R = 1. The corresponding expansion factor f* is

f* = H(X) / ( −∑_{ij} P̂*_{e_{ij}} log2 ( P̂*_{e_{ij}} / P̂*_{v_i} ) ) = H(X) / ( S* ∑_{ij} P̂*_{e_{ij}} w(e_{ij}) ).   (35)

If there is a cost-0 cycle in H, the total cost is a decreasing function of f. □

V. OPTIMAL SHAPING CODE DESIGN

In this section, we consider the problem of designing optimal type-I and type-II shaping codes.

A. Equivalence Theorem

We consider the channel with modified edge costs

w′(e_{ij}) = −log2 ( P̂*_{e_{ij}} / P̂*_{v_i} ),   (36)

where P̂*_{e_{ij}} and P̂*_{v_i} are given in Theorem 6. It is easy to check that the optimal type-II shaping codes on this channel are also optimal on the original channel, in the sense that the symbol occurrence probabilities {P̂_{e_{ij}}} are identical on both channels. We can prove the following lemma.

Lemma 7. Given a noiseless finite-state costly channel with edge costs {w(e_{ij})}. If there is a shaping code φ : V × X^q → E* such that

| f ∑_{ij} P̂_{e_{ij}} w′(e_{ij}) − H(X) | < δ,   (37)
where w′(e_{ij}) = −log2 ( P̂*_{e_{ij}} / P̂*_{v_i} ) = S* w(e_{ij}) + log2 ρ_i − log2 ρ_j, for some δ > 0, then the total cost of this code satisfies

| f ∑_{ij} P̂_{e_{ij}} w(e_{ij}) − H(X)/S* | < δ/S*.   (38) □

The next two theorems establish the equivalence between type-I and type-II shaping codes.

Theorem 8. Given a noiseless finite-state costly channel with edge costs {w(e_{ij})}. For any γ, η > 0, there exists a δ > 0 such that if there exists a shaping code φ : V × X^q → E* with expansion factor f′ such that

| f′ ∑_{ij} P̂′_{e_{ij}} w′(e_{ij}) − H(X) | < δ,   (39)

where w′(e_{ij}) = −log2 ( P̂*_{e_{ij}} / P̂*_{v_i} ) = S* w(e_{ij}) + log2 ρ_i − log2 ρ_j, then the average cost of this code satisfies

| ∑_{ij} P̂′_{e_{ij}} w(e_{ij}) − ( H(X)/(S f) − log2 λ(S)/S ) | < γ,   (40)

and the expansion factor f′ of this code satisfies |f′ − f| < η.

Theorem 9. Given a noiseless finite-state costly channel that does not contain a cost-0 cycle. Denote by S* the constant such that λ(S*) = 1 and by f* the expansion factor of an optimal type-II shaping code. For any γ > 0, there exist δ, η > 0 such that if a shaping code φ : V × X^q → E* with expansion factor f′ satisfies

| ∑_{ij} P̂′_{e_{ij}} w(e_{ij}) − A_min(f′) | < δ  and  |f′ − f*| < η,   (41)

then the total cost of this code satisfies

| f′ ∑_{ij} P̂′_{e_{ij}} w(e_{ij}) − H(X)/S* | < γ.   (42)

B. Generalized Varn Code

We now describe an asymptotically optimal type-II shaping code for uniform i.i.d. sources based on a generalization of Varn coding [28]. Given a uniform i.i.d. input source X, a generalized Varn code on the noiseless finite-state costly channel is a collection of tree-based variable-length mappings φ : V × X^q → E*. Denote by Y_k the set of codewords starting from state v_k, namely

Y_k = { φ(v_k, x^q) | x^q ∈ X^q }.   (43)

Codewords in Y_k are generated according to the following steps.
• Set state v_k ∈ V as the root of the tree.
• Expand the root node. The edge costs {w′(e_{kl})} are the modified costs defined in Lemma 7. The cost of a leaf node is the cost of the path from the root node to the leaf node.
• Expand the leaf node that has the lowest cost.
• Repeat the previous steps until the total number of leaf nodes M ≥ |X|^q. Delete the leaf nodes that have the largest cost until the number of leaf nodes equals |X|^q.
Each path from the root node v_k to a leaf node represents one codeword in Y_k.

The following lemma gives an upper bound on the total cost of a generalized Varn code.

Lemma 10. The total cost of a generalized Varn code φ : V × X^q → E* is upper bounded by

T(φ) ≤ (log2 M)/q + (max_{ij} w′(e_{ij}))/q,   (44)

which tends to log2 |X| as q → ∞.

Remark 3. By extending some leaf nodes to states that are not visited by the original code, we can make the graph G′ complete. Then we can choose any state as the starting state. This operation only adds a constant to the cost of a codeword and therefore does not affect the asymptotic performance of the generalized Varn code.

Example 2. For the channel introduced in Example 1, the optimal symbol distributions that minimize the total cost are shown in Table II. Based on this distribution, we can design a generalized Varn code on the channel with the modified edge costs shown in Table III. The total cost as a function of codebook size is shown in Fig. 3.

TABLE II: Probabilities for the SLC flash channel that minimize total cost.
u:    000     001     010     011     100     101     110     111
P̂_u:  0.4318  0.1323  0.1135  0.0593  0.1323  0.0405  0.0593  0.0310

TABLE III: Modified cost for the flash memory channel.
u:     000     001     010     011     100     101     110     111
C(u):  0.3805  2.0923  0.6068  1.5423  0.3855  2.0923  0.6068  1.5423

Fig. 3: The total cost of a generalized Varn code on the SLC flash channel

C. Separation Theorem

We now present a separation theorem for shaping codes. It states that the minimum total cost can be achieved by a concatenation of optimal lossless compression with an optimal shaping code for a uniform i.i.d. source.

Theorem 11. Given an i.i.d. source X and a noiseless finite-state costly channel with edge costs {w(e_{ij})}, the minimum total cost can be achieved by a concatenation of an optimal lossless compression code with a binary optimal type-II shaping code for a uniform i.i.d. source.

Theorem 12. Given the i.i.d. source X, the noiseless finite-state costly channel with edge costs {w(e_{ij})}, and the expansion factor f, the minimum average cost can be achieved by a concatenation of an optimal lossless compression code with a binary optimal type-I shaping code for a uniform i.i.d. source and expansion factor f′ = f / H(X).

By Theorem 9, the optimal type-I shaping code for a uniform i.i.d. source in Theorem 12 can be replaced by a suitable optimal type-II shaping code for a uniform i.i.d. source. □
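The tree-growing steps of the generalized Varn construction above can be sketched in code. The Python below is illustrative only: the channel and its modified edge costs w′ are assumed toy values (not the flash-channel costs of Table III), and `varn_codebook` is a hypothetical helper, not an implementation from this paper. It repeatedly expands the cheapest leaf using a heap, then deletes the costliest surplus leaves.

```python
import heapq

# Assumed toy channel: w_mod[state] maps next_state -> modified edge cost w'.
# Hypothetical values chosen for illustration; every state has out-degree 2,
# so each expansion strictly increases the number of leaves.
w_mod = {
    0: {0: 1.0, 1: 1.0},
    1: {0: 0.5, 1: 1.5},
}

def varn_codebook(root, num_codewords):
    """Grow a code tree from `root`, always expanding the cheapest leaf,
    until at least num_codewords leaves exist; then delete the costliest
    extra leaves, as in the generalized Varn construction."""
    # Each heap entry is (path cost, current state, tuple of edges taken).
    heap = [(0.0, root, ())]
    while len(heap) < num_codewords:
        cost, state, path = heapq.heappop(heap)          # cheapest leaf
        for nxt, c in sorted(w_mod[state].items()):      # expand it
            heapq.heappush(heap, (cost + c, nxt, path + (nxt,)))
    # Keep the num_codewords cheapest leaves (delete the largest-cost ones).
    return sorted(heap)[:num_codewords]

# Codebook for q = 3 binary source symbols: |X|^q = 8 codewords from state 0.
book = varn_codebook(0, 8)
```

Because leaves are expanded cheapest-first, the surviving leaf costs differ by at most the largest modified edge cost; normalized by q, that gap is the slack term in the bound of Lemma 10.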
REFERENCES
[1] R. Adler, D. Coppersmith, and M. Hassner, “Algorithms for sliding block
codes,” IEEE Trans. Inf. Theory, vol. IT-29, no. 1, pp. 5–22, Jan. 1983.
[2] G. Böcherer, “Capacity-Achieving Probabilistic Shaping for Noisy and
Noiseless Channels”, Ph.D. dissertation, RWTH Aachen University, 2012.
[3] G. Böcherer, R. Mathar, V. C. da Rocha Jr., and C. Pimentel, “On the
capacity of constrained systems,” in Proc. Int. ITG Conf. Source Channel
Coding (SCC), 2010.
[4] I. Csiszár, “Simple proofs of some theorems on noiseless channels”, Inf.
Contr., vol. 14, pp. 285–298, 1969.
[5] R. Durrett, Probability: Theory and Examples, 3rd ed. Belmont, CA:
Duxbury, 2004.
[6] R. Fujita, K. Iwata, and H. Yamamoto, “An Iterative Algorithm to
Optimize the Average Performance of Markov Chains with Finite States,”
in Proc. IEEE Int. Symp. Inf. Theory (ISIT), Paris, France, 2019, pp.
1902–1906.
[7] M. J. Golin and G. Rote, “A dynamic programming algorithm for
constructing optimal prefix-free codes with unequal letter costs,” IEEE
Trans. Inf. Theory, vol. 44, no. 5, pp. 1770–1781, Sep. 1998.
[8] K. Iwata and T. Koyama, “A prefix-free coding for finite-state noiseless
channels with small coding delay,” in Proc. 2010 Int. Symp. Inf. Theory
& its Applications, Taichung, Taiwan, Oct. 2010, pp. 473–477.
[9] J. Justesen and T. Høholdt, “Maxentropic Markov chains,” IEEE Trans.
Inf. Theory, vol. IT-30, no. 4, pp. 665–667, Jul. 1984.
[10] R. Karabed, D. L. Neuhoff, A. Khayrallah, The Capacity of Costly
Noiseless Channels, Research report, IBM Research Division, 1988.
[11] A. Khandekar, R. J. McEliece, and E. Rodemich, “The Discrete Noiseless
Channel Revisited,” in Proc. 1999 Int. Symp. Communication Theory and
Applications, pp. 115-137, 1999.
[12] A. S. Khayrallah and D. L. Neuhoff, “Coding for channels with cost
constraints,” IEEE Trans. Inf. Theory, vol. 42, pp. 854-867, May 1996.
[13] V. Y. Krachkovsky, R. Karabed, S. Yang, and B. A. Wilson, “On
modulation coding for channels with cost constraints,” in Proc. IEEE
Int. Symp. Inf. Theory, Honolulu, HI, Jun.–Jul. 2014, pp. 421–425.
[14] A. Lenz, Y. Liu, C. Rashtchian, P. H. Siegel, A. Wachter-Zeh, and E.
Yaakobi, “Coding for efficient DNA synthesis,” in Proc. IEEE Int. Symp.
Inf. Theory, Los Angeles, CA, Jun. 2020, pp. 2885-2890.
[15] A. Lenz, S. Melczer, C. Rashtchian, and P. H. Siegel, “Multivariate
Analytic Combinatorics for Cost Constrained Channels and Subsequence
Enumeration”, arXiv:2111.06105 [cs.IT], Nov. 2021.
[16] Y. Liu, “Coding Techniques to Extend the Lifetime of Flash Mem-
ories”, Ph.D. dissertation, University of California, San Diego, 2020.
https://escholarship.org/uc/item/43k8v2hz
[17] Y. Liu and P. H. Siegel, “Shaping codes for structured data,” in Proc.
IEEE Globecom, Washington, D.C., Dec. 4-8, 2016, pp. 1–5.
[18] Y. Liu, P. Huang, A. W. Bergman, P. H. Siegel, “Rate-constrained shaping
codes for structured sources”, IEEE Trans. Inf. Theory, vol. 66, no. 8,
pp. 5261–5281, Aug. 2020.
[19] B. H. Marcus, R.M. Roth, and P.H. Siegel, An Introduction to Cod-
ing for Constrained Systems, Lecture Notes, 2001, available online at:
ronny.cswp.cs.technion.ac.il/wp-content/uploads/sites/54/2016/05/chapters1-9.pdf
[20] R. J. McEliece and E. R. Rodemich, “A maximum entropy Markov chain”, in Proc.
17th Conf. Inf. Sciences and Systems, Johns Hopkins University, Mar. 1983, pp.
245-248.
[21] M. Nishiara and H. Morita, “On the AEP of word-valued sources,” IEEE Trans. Inf.
Theory, vol. 46, no. 3, pp. 1116–1120, May 2000.
[22] S. A. Savari and R. G. Gallager, “Arithmetic coding for finite-state noiseless
channels,” IEEE Trans. Inf. Theory, vol. 40, no. 1, pp. 100–107, Jan. 1994.
[23] C. E. Shannon, “A mathematical theory of communication, Part I, Part II,” Bell Syst.
Tech. J, vol. 27, pp. 379–423, 1948.
[24] J. B. Soriaga and P. H. Siegel, “On distribution shaping codes for partial-response
channels,” in Proc. 41st Annual Allerton Conference on Communication, Control,
and Computing, Monticello, IL, USA, Oct. 2003, pp. 468–477.
[25] J. B. Soriaga and P. H. Siegel, “On the design of finite-state shaping encoders
for partial-response channels,” in Proc. 2006 Inf. Theory and Applications
Workshop (ITA 2006), San Diego, CA, USA, Feb. 2006.
[26] J. B. Soriaga and P. H. Siegel, “Near-capacity coding systems for partial-response
channels,” in Proc. IEEE Int. Symp. Inf. Theory, Chicago, IL, USA, Jun. 2004,
p. 267.
[27] V. Taranalli, H. Uchikawa, and P. H. Siegel, “Error analysis and inter-cell interference
mitigation in multi-level cell flash memories,” in Proc. IEEE Int. Conf. Commun.
(ICC), London, UK, Jun. 2015, pp. 271–276.
[28] B. Varn, “Optimal variable length codes (arbitrary symbol cost and equal code word
probability),” Inform. Contr., vol. 19, pp. 289–301, 1971.