An Efficient Image Compression Technique Using Tcheb
An Efficient Image Compression Technique Using Tcheb
PII: S0030-4026(17)30899-9
DOI: http://dx.doi.org/doi:10.1016/j.ijleo.2017.08.007
Reference: IJLEO 59470
To appear in:
Please cite this article as: Ferda Ernawan, Nomani Kabir, Kamal Zuhairi
Zamli, An efficient image compression technique using Tchebichef bit allocation,
<![CDATA[Optik - International Journal for Light and Electron Optics]]> (2017),
http://dx.doi.org/10.1016/j.ijleo.2017.08.007
This is a PDF file of an unedited manuscript that has been accepted for publication.
As a service to our customers we are providing this early version of the manuscript.
The manuscript will undergo copyediting, typesetting, and review of the resulting proof
before it is published in its final form. Please note that during the production process
errors may be discovered which could affect the content, and all legal disclaimers that
apply to the journal pertain.
An Efficient Image Compression Technique Using
Tchebichef Bit Allocation
t
Ferda Ernawan, Nomani Kabir and Kamal Zuhairi Zamli
ip
IBM Centre of Excellence, Faculty of Computer Systems & Software Engineering,
Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Malaysia
cr
us
Abstract
an
suing image analysis and image reconstruction. The psychovisual threshold can
be utilized to find the optimal bits-budget for image signals. The psychovisual
system is developed based on noticeable distortion of the compressed image
M
from an original image in frequency order. This paper proposes an image com-
pression technique using Tchebichef psychovisual threshold for generating an
optimal bits-budget of image signals. The bits-budget is designed to replace
d
results show that the proposed bits-budget technique can improve the visual
quality of image output by 42% of JPEG compression. The visual image qual-
p
ity of Thcebichef bits allocation produces less artifact effect and distortion of
the image pixels. A set of bits-budgets gives excellent improvement in the image
ce
quality at a low average bit length of Huffman code than the image coding using
quantization tables.
Ac
∗ FerdaErnawan
Email address: [email protected] or [email protected] (Ferda Ernawan, Nomani
Kabir and Kamal Zuhairi Zamli)
Preprint submitted to International Journal for Light and Electron Optics August 2, 2017
Page 1 of 32
1. Introduction
Nowdays, bilions of new images are uploaded to the internet everyday [1]
t
and they require large storage to store the image data. Image compression
ip
provides a compact storage and a fast transmission for bandwidth-limited ap-
5 plications of the image data [2]. Over the last two decades, researchers have
cr
developed efficient image compression techniques to improve the compression
rates and the quality of images. JPEG [3] and JPEG2000 [4] are the most
us
popular standard image compression techniques along with many other non-
standard compression algorithms. JPEG compression technique implements
the sequential block coding which is easy to implement on most digital cam-
an
10
not come without price [6]. JPEG2000 has high computational complexity than
the standard JPEG compression [7]. Moreover, some parts of JPEG2000 are
te
patented and thus they are not found as open source, hindering straightforward
experimentation for performance comparison.
p
the our proposed Tchebichef bit allocation technique rather than JPEG 2000.
JPEG 2000 is not widely supported in web browsers, and it is not generally
used on the World Wide Web, while the standard JPEG compression has been
Ac
Page 2 of 32
the optimum bits budget based on psychovisual thresholds to replace the main
role of quantization tables in image compression. In the past, psychovisual ex-
periments have been conducted for subjective measurement test [10] [11]. The
t
ip
subjective evaluation is laborious, since the results need to be obtained by many
35 observers through repeated viewing sessions [12]. The results from the subjec-
cr
tive quality assessment depend on the test condition of human visual systems.
The image is judged for suitability by visual inspection, however, the differ-
us
ence in observer conditions may give significant influences to the testing results
[13]. Some researchers worked on the psychovisual threshold based on quanti-
40 tative experiment to improve the image compression performance [14],[15],[16].
an
Recently, the psychovisual experiment has become an important issue and a
big challenge in image compression. Crandall et. al.[17] proposed a psycho-
visual image compression technique by discarding psychovisually unnecessary
M
bits from the image pixels. The psychovisual image compression techniques
45 compress pixel data of an image by a fixed compression ratio such that small or
no perceptual loss of the image information occurs [17].
d
Image signals require the certain amount of bits at a time in order to fully
te
store the image data. Redundant frequency image signals not only require
amount of bits, but also make no impact on the quality of image reconstruc-
p
Page 3 of 32
change the role of the quantization process in TMT image compression. An op-
timal bit allocation is generated based on the principle of psychoacoustic model.
The psychoacoustic model has been widely implemented in audio coding such as
t
ip
65 MPEG-Audio Layer-3 (mp3), Dolby Digital and Advanced Audio Coding [21].
In psychoacoustics, the sound can only be heard at certain or higher sound
cr
pressure levels (SPL) across frequency order [22]. A strong signal makes a weak
signal inaudible and the weak signal is imperceptible when occurs under ideal
us
listening conditions. The slope of masking threshold is steeper towards lower
70 frequencies. In audio coding [23], a strong signal at the local frequency image
signals is identified as the local spectral peaks and the surrounding of the peak
an
frequency (weak signals) as the noise components. Psychoacoustics indicates
that human hearing sensation has a selective sensitivity to different frequencies
[24]. Similar to audio coding, in our technique, the frequency image signals are
M
75 categorized as strong signals (peak) and weak signals. The bits of weak image
signals are reduced by the presence of peak signals without introducing image
distortion on the image quality. Visual masking occurs when visibility of target
d
is used to reduce bits of the weak image signals without distorting the image
80 quality. A set of bits-budget with the optimized frequency image signal is finally
p
sample image block [26]. The orthonormal Tchebichef moment has its own
85 advantage when used for the large sample data. The orthonormal Tchebichef
moment is applied when the moment order becomes large and it can reduce
the accumulation of numerical errors [26]. The discrete orthonormal Tchebichef
moment has energy compactness properties of a large image block. Tchebichef
Moments for a set {tn (x)} of input values (image intensity values) of size M ×N
Page 4 of 32
90 with an order m + n are given as follows:
M −1 N −1
X X tm (x) tn (y)
Tmn = f (x, y) (1)
ρ(m, M ) ρ(n, N )
t
x=0 y=0
ip
f (x, y) denotes the intensity value of an image at the pixel position x and y.
tn (x) is defined using the following recursive relation:
cr
tn (x) = α1 xtn−1 (x) + α2 tn−1 (x) + α3 tn−2 (x) (2)
us
for n = 2, 3, ..., N − 1 and x = 0, 1, ..., N − 1. Where,
r
2 4n2 − 1
α1 = (3)
n N 2 − n2
an
r
1−N 4n2 − 1
α2 = (4)
n N 2 − n2
M
95 r
r
n−1 2n + 1 N 2 − (n − 1)2
α3 = (5)
n 2n − 3 N 2 − n2
The start values for the above recursion are obtained from the equations as
d
follows:
1
te
t0 (x) = √ (6)
N
s
p
3
t1 (x) = (2x + 1 − N ) (7)
N (N 2 − 1)
ce
The recurrence relation to compute the polynomial value for tn (x) is given by:
r r
N − n 2n + 1
tn (0) = tn−1 (0) (8)
N + n 2n − 1
Ac
100
n(1 + n)
tn (1) = {1 + }tn (0) (9)
1−N
tn (x) = γ 1 tn (x − 1) + γ 2 tn (x − 2) (10)
Page 5 of 32
(x + 1)(x − N − 1)
γ2 = (12)
x(N − x)
t
The above definition uses the following scale factor for the polynomial of degree
ip
105 n: r
N (N 2 − 1)(N 2 − 22 ) . . . (N 2 − n2 )
β(n, N ) = (13)
2n + 1
cr
The set tn (x) has a squared-norm given by
N
X −1
{tn (i)}2 = 1.0
us
ρ(n, N ) = (14)
i=0
an
M = N = 256 is shown in Figure 1 for visual purposes. Image reconstruction
M
d
p te
ce
Ac
Figure 1: The first four discrete Tchebichef polynomials tn (x) for x = 0, 1, 2 and 3 .
Page 6 of 32
For m, n = 0, 1, 2, ..., N − 1. Where f (x, y) denotes the reconstructed image
intensity and M, N denote the maximum orders of moments. Based on the
discrete orthogonal moments above, a compact representation of the moment
t
ip
ti (x)
coefficient K ∈ R(S×S) considering ki (x) = ρ(i,M ) is given by:
k0 (0) k1 (0) ... kS−1 (0)
cr
k0 (1) k1 (1) ... kS−1 (1)
K = k0 (2) k1 (2) ... kS−1 (2)
us
.. .. ..
..
. . . .
k0 (S − 1) k1 (S − 1) . . . kS−1 (S − 1)
an
115 Image block matrix F ∈ R(S×S) with f (x, y) denotes the intensity values of the
image pixels for each color component:
M
f (0, 0) f (0, 1) ... f (0, S − 1)
f (1, 0) f (1, 1) ... f (1, S − 1)
F = f (2, 0) f (2, 1) ... f (2, S − 1)
.. .. ..
..
d
. . . .
f (S − 1, 0) f (S − 1, 1) ... f (S − 1, S − 1)
te
T = KT F K (16)
ce
This process is repeated for every block in the original image to generate coef-
ficient of discrete orthogonal TMT. The inverse moment is used to reconstruct
Ac
120 the image block from the above moments and is presented as follows:
G = KT K T (17)
Page 7 of 32
125 as:
d(0,0) d(0,1) ... d(0,S−1)
d(1,0) d(1,1) ... d(1,S−1)
t
D = d(2,0) d(2,1) ... d(2,S−1)
ip
.. .. ..
..
. . . .
cr
d(S−1,0) d(S−1,1) ... d(S−1,S−1)
The order zero d(0,0) on moment coefficient represents the average intensity
us
of an image. The first order on moment coefficients are represented by d(1,0)
and d(0,1) . The second order moment coefficients are denoted by d(2,0) , d(1,1)
and d(0,2) . Psychovisual threshold on M × N TMT is to support practically
an
130 local bit allocation in TMT image compression. The psychovisual threshold
on large TMT is developed based on the contribution of TMT coefficients to
the reconstruction error for each frequency order. The psycho visual thresholds
M
on 256 × 256 TMT coefficients for luminance and chrominance are shown in
Figure 2. The ideal curve of average reconstruction error for a given frequency
d
p te
ce
Ac
135 order 0 to frequency order 510 for luminance channel is visualized by a solid
curve. For chrominance channel, an ideal curve of average reconstruction error
is depicted by a blue curve. The finer curves above of average reconstruction
Page 8 of 32
Table 1: polynomial coefficients
i p q
t
0 0.0303 0.0303
ip
1 −0.0000506 −0.0000576
2 0.00001672 0.0000189
cr
3 0.000000014256 0.0000000162
4 −0.00000000024827 −0.0000000002815
us
5 0.000000000000305848 0.00000000000034721
error are the primitive of psychovisual threshold on 256 × 256 TMT. A just
an
noticeable difference of the compressed image from original image known as
140 psychovisual threshold on TMT luminance and chrominance. These curves are
then interpolated by polynomial functions for luminance fM L and chrominance
M
fM R as follows:
fM L (x) = p5 x5 + p4 x4 + p3 x3 + p2 x2 + p1 x + p0 (18)
d
te
fM R (x) = q5 x5 + q4 x4 + q3 x3 + q2 x2 + q1 x + q0 (19)
for x = 0, 1, 2, ..., 510 which represents frequency order under regular 256 × 256
p
145 discrete transform. Coefficient values p and q are given in Table 1. Furthermore,
ce
these thresholds are used as references for the maximum impact of assigning bit
allocation for local frequency blocks on each frequency order. The impact of bit
allocation for each local frequency block is measured by the reconstruction error
Ac
per pixel. The reconstruction error of assigning bit allocation should be less than
150 psycho visual threshold. Next section explains the generation of bit allocation
based on psycho visual threshold for various local frequency block-sizes in image
compression.
Page 9 of 32
3. Experimentation with Tchebichef Bit Allocation
A color image has a high correlation between the different colors [27]. There-
t
155 fore, an RGB image is converted to a YUV image to reduce the correlation
ip
between the color components. The YUV image can be computed directly from
8 bit RGB as follows:
cr
Y 0.299 0.587 0.114 R 0
U = 0.299 0.587 0.114 · G + 128
us
V 0.299 0.587 0.114 B 128
an
R 1 0 1.4021 Y 0
G = 1 −0.34414 −0.71414 · U − 128
M
B 1 1.7718 0 V 128
The luminance (Y) and chrominances (U and V) components are very much
160 decor-related between each other. The most energy of the transformed YUV
d
165 color information. The bits can be reduced to eliminate the redundancy. Now we
ce
investigate the optimum bits budget for each channel using TMT psychovisual
threshold.
First, an image is subdivided by 256 × 256 image block and it is transformed
Ac
into image signals using Tchebichef transform. The frequency image signals
170 under regular 256 × 256 Tchebichef Moment contain 65535 AC coefficients and
there is only one DC coefficient on each image block. The DC coefficient is
encoded by a 14-bit number since the average minimum value is about 3565
and the maximum average value is about 16384. Nevertheless, AC coefficients
are listed a traversing array with a zigzag pattern. A linear array of frequency
175 of AC coefficients is divided into various local frequency block-sizes. Then we
10
Page 10 of 32
investigate an optimal amount of bits assigned to each local frequency blocks
with 8, 16 and 32 coefficients. Frequencies of AC coefficients on each local
frequency block are classified as peak frequencies and non-peak frequencies of
t
ip
image signals.
180 Referring to the audio coding, the tonal signal on audio coding produces the
cr
most energy of an audio signal which contains the important information to hu-
man auditory [23]. Thus, we assume that the peak frequency of AC coefficients
us
on each local frequency block provides a significant contribution to the image
quality. The local peak frequency is determined based on the maximum peak
185 of each local frequency block. The amount of peak frequency signals is main-
an
tained as the original peak frequency signals by rounding into integer values.
AC coefficients on the surrounding of peak signal on each local frequency block
are called the non-peak frequency signals. The non-peak frequency signals of
M
AC coefficients are assigned to get an ideal amount of bit allocation as follows:
C(x)
B = round 2n (20)
P
d
190 where B is the frequency masking, P is the local peak frequency signal, x is
te
allocation can be done by increasing bit allocation by one at a time. The effect
of incremental bit allocation is measured by the Average Reconstruction Error
ce
M N R i=0 j=0
k=0
where, the original image of size M × N × R contains the three RGB colors.
The average reconstruction error from incremental bit one at a time is always
set less than psychovisual error threshold for its frequency order. The inverse
of non-peak frequency coding is computed by:
B
C(x) = P (22)
2n
11
Page 11 of 32
200 Several experiments are conducted to evaluate the performance of the pro-
posed bits budget in image compression. 80 images including 40 color images
and 40 grayscale images are used to evaluate the proposed TMT bit allocation in
t
ip
image compression. This proposed bits-budget is tested on 40 color images from
a database [? ] and 40 grayscale images from another database [30] to prove
cr
205 the effectiveness of Tchebichef bits-budget in image compression. The sample
Lena and Baboon images selected from 40 color images in order to analyze the
us
quality output of the compressed image are shown Figure 3. For grayscale im-
ages, zelda and women images chosen as samples from 40 grayscale images are
depicted in Figure 4.
an
M
d
Figure 3: Original Lena and Baboon color image, 512 × 512, 24 bits/pixel.
p te
ce
Ac
Figure 4: Original Zelda (left) and Women (right) grayscale image, 512 × 512, 8 bits/pixel.
210 As a part of the evaluation process, the average bit length of Huffman code,
compression ratio and quality on image reconstruction have been chosen to mea-
sure the effectiveness of the TMT bit allocation in image compression. Average
bit length of Huffman code is used to predict that the number of bits on the
compressed image.
12
Page 12 of 32
215 4. Experimental Results and Analysis
t
role of quantization process. The proposed TMT bits-budget in luminance chan-
ip
nel is designed to have greater bit allocation than chrominance channel. The
average reconstruction errors of the proposed TMT bit allocation on each fre-
cr
220 quency order for local blocks of 8, 16 and 32 coefficients are shown in Figure 5,
Figure 6 and Figure 7, respectively.
us
an
M
d
Figure 5: Average reconstruction error of TMT bits-budget on each frequency order for local
te
block of 8 coefficients
p
ce
Ac
Figure 6: Average reconstruction error of TMT bits-budget on each frequency order for local
block of 16 coefficients
13
Page 13 of 32
t
ip
cr
us
Figure 7: Average reconstruction error of TMT bits-budget on each frequency order for local
block of 32 coefficients
an
has slightly higher error than luminance. The error with low frequency order
225 is about 0.03 which increases rapidly to about 0.5 to 0.6 with higher frequency
M
order. The error then declines with further frequency order. Notice that the
error reduction is higher with increasing Tchebichef coefficients.
Experimental results are provided by the histogram of frequency distribu-
d
230
The frequency distributions from JPEG compression and TMT compression are
ce
zeros for the AC coefficients than that of JPEG compression. In the proposed
TMT bit allocation, the frequency distribution of image signals is divided into
two parts, peak signals and non-peak signals as shown in Figure 9. Since peak
240 frequency of AC coefficients provides the significant contribution to the image
quality, the peak of AC coefficients is considered as the original frequency signals
by rounding into integer values. A set of bits-budgets is addressed to the non-
peak frequency of AC coefficients. From the histograms on the left of Figure 9,
14
Page 14 of 32
the extreme distributions of the peak AC coefficients are shown in Figure 10.
t
ip
cr
us
Figure 8: Histogram of AC coefficients from JPEG compression (left) and TMT compression
(right) for 40 images.
an
M
Figure 9: Histogram of TMT bit allocation on AC coefficients for 40 images.
d
p te
ce
Ac
245 Referring to Figure 10, a red curve represents generalized extreme distribu-
tion at each frequency. Thus, these results of generalized extreme distribution
are used to calculate the probability density function as shown in Figure 11.
For non-peak AC coefficients, the normal distribution is applied to estimate the
probability of AC coefficients from assigning TMT bits-budget which simplifies
15
Page 15 of 32
t
ip
cr
us
Figure 11: Probability density function on peak of AC coefficients.
an
250 the computation of the encoding process. Huffman tree is not built repeatedly
to compute the probability of the frequency distribution when the new input
images come to be processed. The probability density function will cover the
M
zeros on the frequency distribution. The total area under the red graph of the
probability distribution in Figure 11 is equal to one. The probability density
function (pdf) estimates the probability distribution of the frequency image sig-
d
255
nals. Using probability density functions, we do not require to run the encoding
te
process repetitively for new images. The probability density function is utilized
to estimate the probability distribution in Huffman coding rapidly.
p
Huffman coding makes shorter code for frequently occurring symbols in the
AC coefficients. Each symbol is encoded with a variable length code from the
ce
260
Huffman table [32]. The Huffman codes are written in the Huffman tables.
Huffman tables used during the compression process are stored as header infor-
Ac
mation in the compressed image file in order to decode during the decompression
process.
265 The image quality is measured by Absolute Reconstruction Error (ARE),
Means Square Error (MSE), Peak Signal to Noise Ratio (PSNR) and Structural
Similarity Index Metric (SSIM) to validate the effectiveness of TMT bit alloca-
tion technique in image compression. MSE calculates the average of the squares
16
Page 16 of 32
of the errors defined by:
M −1 N −1 R−1
1 X X X 2
M SE = [(i, j, k) − f (i, j, k)] (23)
M N R i=0 j=0
t
k=0
ip
270 The standard PSNR is calculated to obtain the quality of image reconstruction.
A higher PSNR means that the image reconstruction has more similarity with
cr
the original image [33]. PSNR is defined as:
M axi
P SN R = 20log10 √ (24)
us
M SE
where, M axi is the maximum possible pixel value of the image. Another
measurement approach of image quality is Structural Similarity Index Met-
an
275 ric (SSIM), which captures similarity between original image and compressed
image [34] defined as follows:
α β γ
M
SSIM (x, y) = [l(x, y)] · [c(x, y)] · [s(x, y)] (25)
where, α, β, γ > 0, are the parameters to adjust the relative importance of the
three components. A detail description can be found in [34].
d
280 16 coefficients and 32 coefficients on 256 × 256 TMT for luminance and chromi-
nance as listed in Table 2, Table 3 and Table 4, respectively. The average bit
p
lengths of Huffman code after assigning bit allocation on short, medium and
long frequency block respectively are given in Table 5 and Table 6. The average
ce
compression ratios from bit allocation technique for 40 color and 40 grayscale
285 images are listed in Table 7.
Experimental results show that the larger local frequency block-size produces
Ac
the minimum bit size of the compressed output. We observe that the assigned
bits allocation on local frequency block of 32 coefficients produce slightly better
compression ratio than local frequency blocks of 8 and 16 coefficients. Local
290 frequency blocks of 8, 16 and 32 coefficients produce almost similar average
compression ratio scores. Besides, the reconstruction errors from bit allocation
technique show significant improvement which is about 42% of JPEG compres-
sion and 39.5% of TMT image compression.
17
Page 17 of 32
Table 2: A Set of TMT Bit Allocation for Local Frequency Blocks of 8 Coefficients on M ×
N − T MT
Luminancec Chrominance
t
ip
Block No Freq Order Bit Block No Freq Order Bit
1 1-3 10
cr
2-3 4-6 9
4-15 7-14 8 1 1-3 8
us
16-41 15-25 7 2-3 4-6 7
42-93 26-38 6 4-13 7-13 6
94-173 39-52 5 14-32 14-22 5
an
174-338 53-73 4 33-66 23-32 4
339-645 74-101 3 67-130 33-45 3
646-1288 102-143 2 131-227 46-59 2
M
1289-4478 144-267 1 228-446 60-83 1
4479-8191 268-510 0 447-8191 84-510 0
d
Table 3: A Set of TMT Bit Allocation for Local Frequency Blocks of 16 coefficients on
te
M × N − T MT
Luminancec Chrominance
Block No Freq Order Bit Block No Freq Order Bit
p
1 1-5 10
ce
2-3 6-9 9
4-10 10-17 8 1 1-5 8
11-27 18-28 7 2-3 6-9 7
Ac
18
Page 18 of 32
t
Table 4: A Set of TMT Bit Allocation for Local Frequency Blocks of 32 Coefficients on
ip
M × N − T MT
Luminancec Chrominance
Block No Freq Order Bit Block No Freq Order Bit
cr
1 1-7 10
2 8-11 9
us
3-6 12-19 8 1 1-7 8
7-14 20-29 7 2 8-11 7
an
15-33 30-45 6 3-6 12-19 6
34-67 46-64 5 7-12 20-27 5
68-117 65-86 4 13-22 28-37 4
M
118-227 87-120 3 23-45 38-53 3
228-512 121-180 2 46-82 54-71 2
513-1641 181-350 1 83-178 72-106 1
d
Table 5: Average Bit Length of Huffman Code for 40 Real and 40 Graphic Color Images
Real Graphic
Method ACY ACU ACV ACY ACU ACV
Ac
19
Page 19 of 32
Table 6: Average Bit Length of Huffman Code for 40 Grayscale Images
t
Method AC
ip
JPEG Compression 2.934
TMT Compression [35] [36] 1.729
cr
Bit Allocation on short block 2.888
Bit Allocation on medium block 2.670
us
Bit Allocation on long block 2.520
an
Table 7: Average Compression Ratio Score for 40 Real Color Images, 40 Graphic Color Images
and 40 Grayscale Images
M
Method Real Graphic Grayscale
JPEG compression 3.324 2.972 2.685
TMT compression [35][36] 5.559 3.893 4.500
d
Table 8: Average Quality of Image Reconstruction for 40 Real and 40 Graphic Color Images
Real Graphic
Ac
20
Page 20 of 32
Table 9: Average Quality of Image Reconstruction for 40 Grayscale Images
Method ARE MSE PSNR SSIM
t
JPEG Compression 4.635 45.425 32.380 0.980
ip
TMT Compression [35] [36] 4.474 38.133 32.696 0.971
Bit Allocation on short block 2.983 18.032 37.191 0.986
cr
Bit Allocation on medium block 3.062 18.975 36.931 0.985
Bit Allocation on long block 3.062 19.098 36.713 0.984
us
Table 10: Average Bit Size of Compressed Image from 40 Real Color Images, 40 Graphic
Color Images and 40 Grayscale Images
an
Method Real Graphic Grayscale
JPEG compression 230.997 Kb 258.394 Kb 95.331 Kb
TMT compression [35][36] 138.144 Kb 197.283 Kb 56.888 Kb
M
Bit allocation on short block 249.575 Kb 264.630 Kb 94.593 Kb
Bit allocation on medium block 247.824 Kb 251.851 Kb 85.458 Kb
Bit allocation on long block 248.636 Kb 246.774 Kb 80.672 Kb
d
In Table 8 and Table 9, we notice that TMT bit allocation provides statis-
te
295 tically higher image quality in image reconstruction. The large number of peak
frequency signals leads to improvement of the quality of image output. The
p
The visual image outputs from TMT bit allocation on short, medium and
long frequency block of image signals are shown in Figure 12 and Figure 13. In
300 order to observe the visual quality of the output image, the right eye of Lena
Ac
image is zoomed in to 200%. The image output from a set of bit allocation
on image compression produces a rich texture of the original image. The bit
allocation on shorter local frequency blocks produces relatively similar image
output as the original one. Figure 13 presents that the large local frequency
305 blocks of 32 coefficients produce significant texture color pixels than the local
frequency blocks of 8 and 16 coefficients. It implies that the reconstructed
image pixels are relatively too far from the original pixels in Lena image. A
21
Page 21 of 32
t
ip
cr
Figure 12: Original cropped baboon image (left) and TMT bit allocation on short block
us
(right) with zoom in 200%.
an
M
Figure 13: TMT bit allocation on medium block (left) and TMT bit allocation on long block
(right) with zoom in 200%.
d
large local frequency block produces less quality of the image output than a
te
short local frequency block. The local frequency block of 32 coefficients under
310 regular 256×256 discrete transform provides 2047 peak frequencies of 65535 AC
p
medium local block, the peak frequency contributes 6.24%. Otherwise, short
local block has more peak frequencies, which are 12.49%. The number of peak
signals provides significant contribution in the image quality. Compression of
Ac
315 the rate distortion of the proposed scheme is shown in Figure 14. Our proposed
TMT bit allocation exhibits higher PSNR values (bit rates 2-5) than JPEG
compression and TMT compression.
In this part, Baboon, Lena, Women and Zelda images are selected to analyse
and compare the image quality of the proposed TMT bit allocation technique to
320 JPEG compression and TMT compression. The samples color and grayscale im-
ages are zoomed in to 200% as depicted in Figure 15 and Figure 16, respectively.
22
Page 22 of 32
t
ip
cr
us
Figure 14:
an
Comparison of the rate distortion among the TMT bit allocation, JPEG com-
pression and TMT compression of compressed Lena image.
M
The experimental results show that a set of bit allocations on short local im-
age signals produces better improvement on the quality of image reconstruction
d
than JPEG compression and TMT compression. In addition, the image output
te
325 from a set of TMT bit allocation is almost the same as the original image.
p
5. Conclusion
ce
Digital images require a significant amount of bits to store the image data.
Image data consists of redundant bits in the frequency image signals. The re-
dundant bits do not provide any contribution on the image quality. This work
Ac
330 has presented a new bit allocation technique using Tchebichef image compres-
sion. This technique can be used to replace the critical role of the quantization
process in image compression. Bit allocation technique is based on noticeable
distortion by assigning bits of the image signals. A set of bit allocations on
local frequency signals is considered to reduce the redundant number of bits on
335 frequency image signals. The goal of this study is to achieve an optimal bits-
budget on the frequency image signals. Experimental results reveal that this
23
Page 23 of 32
technique performs better than the quantization process in image compression.
Moreover, the technique produces less distortion in the image pixels. The pro-
posed Tchebichef bit allocation provides a significant improvement on the image
t
ip
340 quality at relatively low bit rates. The technique shows the improvement of im-
age quality by approximately 42% of JPEG compression and 39.5% of TMT
cr
image compression.
6. Acknowledgments
us
The authors would like to express very special thanks to Universiti Malaysia
345 Pahang, Malaysia for providing financial support of this research project by
an
UMP Research Grant Scheme (RDU170399).
References
M
[1] H. Averbuch-Elor, Y. Wang, Y. Qian, M. Gong, J. Kopf, H. Zhang,
D. Cohen-Or, Distilled collections from textual image queries, Computer
d
24
Page 24 of 32
//dx.doi.org/10.1016/j.image.2011.08.002.
365 URL http://www.sciencedirect.com/science/article/pii/
S092359651100097X
t
ip
[6] P. Bagheri Zadeh, A. Sheikh Akbari, T. Buggy, J. Soraghan, Multiresolu-
tion hvs and statistically based image coding scheme, Multimedia Tools and
cr
Applications 49 (2) (2010) 347–370. doi:10.1007/s11042-009-0371-z.
370 URL http://dx.doi.org/10.1007/s11042-009-0371-z
us
[7] K. Liu, Y. Zhou, Y. S. Li, J. F. Ma, A high performance {MQ} encoder
architecture in {JPEG2000}, Integration, the {VLSI} Journal 43 (3) (2010)
an
305 – 317. doi:http://dx.doi.org/10.1016/j.vlsi.2010.01.001.
URL http://www.sciencedirect.com/science/article/pii/
375 S0167926010000027
M
[8] J. M. Pascual, H. M. Mora, A. F. Guill, J. A. Lpez, Adjustable
compression method for still {JPEG} images, Signal Process-
d
ing: Image Communication 32 (2015) 16 – 32. doi:http:
//dx.doi.org/10.1016/j.image.2015.01.004.
te
25
Page 25 of 32
10.1111/1467-8659.t01-1-00561.
URL http://dx.doi.org/10.1111/1467-8659.t01-1-00561
t
[12] X. Fei, L. Xiao, Y. Sun, Z. Wei, Perceptual image quality as-
ip
395 sessment based on structural similarity and visual masking, Sig-
nal Processing: Image Communication 27 (7) (2012) 772 – 783.
cr
doi:http://dx.doi.org/10.1016/j.image.2012.04.005.
URL http://www.sciencedirect.com/science/article/pii/
us
S092359651200094X
400 [13] K. Gu, G. Zhai, X. Yang, W. Zhang, A new psychovisual paradigm for
an
image quality assessment: from differentiating distortion types to discrimi-
nating quality conditions, Signal, Image and Video Processing 7 (3) (2013)
423–436. doi:10.1007/s11760-013-0445-2.
M
URL http://dx.doi.org/10.1007/s11760-013-0445-2
URL http://www.ingentaconnect.com/content/asp/asl/2014/
00000020/00000001/art00006
p
uri?eid=2-s2.0-84880144758&partnerID=40&md5=
415 7014cb632362199c012136dd1275ec3b
26
Page 26 of 32
420 visual image compression, uS Patent 8,891,894 (Nov. 18 2014).
URL https://www.google.com/patents/US8891894
t
[18] N. A. Abu, F. Ernawan, N. Suryana, A generic psychovisual error threshold
ip
for the quantization table generation on jpeg image compression, in: Sig-
nal Processing and its Applications (CSPA), 2013 IEEE 9th International
cr
425 Colloquium on, 2013, pp. 39–43. doi:10.1109/CSPA.2013.6530010.
us
tization using psychovisual threshold for image compression, Advanced
Science Letters 20 (1) (2014) 70–74. doi:doi:10.1166/asl.2014.5316.
an
URL http://www.ingentaconnect.com/content/asp/asl/2014/
430 00000020/00000001/art00015
S0167839605000415
tual coding of high-quality digital audio, Proceedings of the IEEE 101 (9)
ce
27
Page 27 of 32
ference on Acoustics, Speech and Signal Processing, 2010, pp. 341–344.
doi:10.1109/ICASSP.2010.5495863.
t
450 [25] V. Adzic, H. Kalva, B. Furht, Exploring visual temporal masking for video
ip
compression, in: 2013 IEEE International Conference on Consumer Elec-
tronics (ICCE), 2013, pp. 590–591. doi:10.1109/ICCE.2013.6487030.
cr
[26] R. Mukundan, Some computational aspects of discrete orthonormal mo-
ments, IEEE Transactions on Image Processing 13 (8) (2004) 1055–1059.
us
455 doi:10.1109/TIP.2004.828430.
an
representation for lossy compression of cmyk prepress images, Computer
Graphics Forum 19 (1) (2000) 3–12. doi:10.1111/1467-8659.00384.
URL http://dx.doi.org/10.1111/1467-8659.00384
M
460 [28] F. Douak, R. Benzid, N. Benoudjit, Color image compression algorithm
based on the {DCT} transform combined to an adaptive block scanning,
d
URL http://www.sciencedirect.com/science/article/pii/
S1434841110000695
p
465
for tone mapping, Computer Graphics Forum 28 (2) (2009) 193–202. doi:
10.1111/j.1467-8659.2009.01358.x.
Ac
URL http://dx.doi.org/10.1111/j.1467-8659.2009.01358.x
470 [30] C. V. Group, Dataset of standard 512x512 grayscale test images (Jun 2003).
URL http://decsai.ugr.es/cvg/cg/base.htm
28
Page 28 of 32
[32] K. Vidyavathi, R. S. Sabeenian, Motion-compensated video compression
using variable length huffman coding, in: Machine Vision and Image
Processing (MVIP), 2012 International Conference on, 2012, pp. 77–80.
t
ip
doi:10.1109/MVIP.2012.6428765.
480 [33] Y.-K. Chen, F.-C. Cheng, P. Tsai, A gray-level clustering reduction
cr
algorithm with the least {PSNR}, Expert Systems with Applications
38 (8) (2011) 10183 – 10187. doi:http://dx.doi.org/10.1016/j.eswa.
us
2011.02.071.
URL http://www.sciencedirect.com/science/article/pii/
S0957417411002776
an
485
URL http://dx.doi.org/10.1134/S1054661810040115
29
Page 29 of 32
Ac
ce
p
original lena croped JPEG compression TMT compression TMT Bit Allocation
te
d
30
M
an
original baboon croped JPEG compression TMT compression TMT Bit Allocation
us
Figure 15: Visual quality of Lena and Baboon color image, 512 × 512, 24 bits/pixel
cr
ip
t
Page 30 of 32
Ac
ce
p
original Zelda croped JPEG compression TMT compression TMT Bit Allocation
te
d
31
M
an
original Women croped JPEG compression TMT compression TMT Bit Allocation
us
Figure 16: Visual quality of Zelda and Women grayscale image, 512 × 512, 8 bits/pixel
cr
ip
t
Page 31 of 32
t
ip
cr
us
an
Dr. Ferda Ernawan is currently a Senior Lecturer at the Faculty of Computer
Systems & Software Engineering, Universiti Malaysia Pahang. He received his
Master in Software Engineering and Intelligence and Ph.D in image process-
M
ing from Faculty of Information and Communication Technology, Universiti
Teknikal Malaysia Melaka in 2011 and 2014 respectively. His research interests
include image compression, digital watermarking and steganography.
d
p te
ce
Ac
32
Page 32 of 32