0% found this document useful (0 votes)

38 views10 pages

Linear Algebra in Image Compression - SVD and DCT

Linear algebra techniques like SVD and DCT are commonly used for lossy image compression. SVD decomposes an image into three matrices, and smaller values in the diagonal matrix can be removed to compress the image while retaining most information. DCT transforms image blocks and quantization matrices are used to round values, resulting in many zeros and compressed data size. Both techniques effectively compress images to 15-25% of original size before quality severely degrades. The SVD breaks images into blurry lines while DCT results in a more blocky appearance at high compression levels.

Uploaded by

ĐẠT NGUYỄN HOÀNG TẤN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views10 pages

Linear Algebra in Image Compression - SVD and DCT

Uploaded by

ĐẠT NGUYỄN HOÀNG TẤN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Linear Algebra in Image Compression: SVD and DCT

By: Andrew Fraser

Image compression is a vital tool in sending and receiving images across the web. Its first
major use came in during the 1960s when satellites used it to transfer images from space to
Earth. It has become even more useful since the implementation of the internet, as smaller sizes
became much more important to a media that demanded instant access. The most common use
today is in streaming websites like Youtube, Netflix, and Hulu, where 60 images or more are
sent in a single second over the internet. Without image compression, ridiculous internet speeds
would be needed to do this, but compression allows for up to 10 times less data to be sent in
order to get the same picture.

Processes of Image Compression

One of the simplest ways to store an image is in the Raster format, which is essentially an
mxn matrix storing the values of each pixel, where m and n are the length and width of the
image. For a colored image, the same is done but with three matrices, each one holding the red,
green, or blue pixel values for the RGB format. The main linear algebra compression algorithms
use this common method of storing images, as it is quite nice to have a matrix when dealing in
linear algebra.

One important aspect of image compression is whether it is lossy or lossless. A lossy

compression results in some information being lost in the compression, but allows for a more
effective compression. A lossless compression stores all of the same information, but just in a
compressed state. Usually, the lossless compressions mostly rely on storing bytes differently and
don’t really apply linear algebra. However, lossy compressions apply multiple linear algebra
techniques, including the entire process of SVD and various matrix transformations in the DCT.

One example of a common lossless compression is used in the PNG file format. PNG
files are mostly compressed using binary storage methods, with no data lost. One example of a
common lossy compression is used in the JPEG file format. JPEG files use a form of the DCT to
perform a lossy compression, which is much more effective than the PNG format, but results in a
loss in data.

Singular Value Decomposition

SVD, or Singular Value Decomposition, is a matrix factorization in Linear Algebra

where A = UΣVT. It is fairly similar to the AP = PD factorization, but it instead uses the square
root of the eigenvalues of ATA, so it can be used on any matrix. To calculate this factorization,
the following steps should be performed on a matrix A of size mxn:

Singular values σ1-σn: Found by taking the square root of each eigenvalue of ATA.
U = A matrix with its columnspace containing the columnspace of A and nullspace of AT as its
columns. All orthogonalized, mxm.
Σ = A diagonal matrix each singular value at its diagonals, from largest at 1,1, to smallest at nxn.
All other values are zeros. Mxn
V = A matrix with its columnspace comprised of the eigenvectors of ATA. Also happens to be
the rowspace of A and nullspace of A all orthogonalized. Nxn.
VT = Transpose of V

This can be used in compression by utilizing the fact that the Σ matrix values go from
greatest to least. By removing smaller values from this matrix, most of the information in the
image is kept, while the values needed to be stored is reduced. As more and more of these
smaller values are set to zero, columns from the U and rows from the VT matrices can be set to
zero as well, as they would simply be multiplied by zeros from the Σ matrix anyway. Thus, for
each small value in the SVD that is set to zero, both U and VT also lose an entire row/column, so
much less needs to be stored entirely.

Discrete Cosine Transform

The DCT, or Discrete Cosine Transform, is a transformation that uses matrix

multiplications to compress matrix data. First, the image is split into many NxN matrices, with
the ideal number for N usually being 8. Then, an NxN transformation matrix is created using a
cosine formula based on the i, j, positions in the matrix. Each NxN square of the image is then
left multiplied by this matrix, and right multiplied by the transpose of the matrix.

D = TMTT

After this process, there are different NxN quantization matrices ranging from
compression percentages 0-100% that can be used to compress each transformed matrix. For
example, the 50% percent matrix creates a reasonably compressed matrix by turning a decent
amount of values into zeros. A 10% matrix would result in large amounts of compression with a
lower quality image, while a 90% would perform little compression, but keep almost all of the
data. For each zero that results in this compression, the image becomes more and more
compressed. Each i,j value in the NxN matrix of data is divided by the i,j value in the
quantization matrix, then rounded to the nearest whole number. It is this division that results in
many zeros in the matrix, allowing for much less data to be stored overall.

By storing all of these NxN matrices with many zeros each, the matrices become much
less compressed. Then, when a user wants to view the image, the reverse process is performed by
multiplying each i,j value in the quantization by each value in the data, then left multiplying by
the DCT matrix and right multiplying by the . Finally each NxN block is recombined into the
full raster image block, which can be viewed like a normal image.

M = TTDT
Applying the SVD

For performing the SVD, I decided to use Maple because it is what I was most familiar
with, and it contains all of the necessary tools to perform an SVD compression. Here is a text
copy of the code I wrote:

with(LinearAlgebra):
with(ImageTools): # Necessary to read images as matrices and manipulate them
img:= ToGrayscale(Read("/u/class/f/c-fras2/Downloads/Robot.jpg")): # Reads the black and
white image
Write("/u/class/f/c-fras2/Pictures/Initial Robot.JPG", img):
U:= LinearAlgebra[SingularValues](img, 'output = U'):
S:= LinearAlgebra[SingularValues](img, 'output = S'):
Vt:= LinearAlgebra[SingularValues](img, 'output = Vt'):

C := 5/100:
for i from (round((RowDimension(S) * C) + 1)) to RowDimension(S) do
S[i] := 0:
end do:
DiagS:= DiagonalMatrix(S, RowDimension(img), ColumnDimension(img)):
CompressedImage:= U.DiagS.Vt:
Write((cat("/u/class/f/c-fras2/Pictures/", convert(round(C * 100), string), "% Robot SVD.png")),
CompressedImage):

This code first reads an image in grayscale, then finds the full SVD decomposition of the
matrix. Then, it turns a set amount of the smaller Σ values to zeros, allowing U and VT to store
that many less values as well. The value C is used to easily set what percentage of compression
is desired, a smaller percentage meaning that more compression occurs but more data is also lost.
Then, the image is turned back into its original form and written to a file to be viewed.

Applying the DCT

I decided to use Matlab to perform the DCT because it includes many more tools for
performing the DCT on a matrix than Maple. Matlab included a method that automatically split a
matrix into 8x8 chunks, then performed the compression itself. It also included a method to
automatically decompress them. The only missing portion was the quantization matrices, which I
had to perform manually. Here is the code that I wrote to do this:
A = im2double(imread('/u/class/f/c-fras2/Pictures/Initial Robot.JPG'));
D = dct2(A);
D(abs(D) < .1) = 0;
count = 0;
for m=1: size(D,1)
for n=1:size(D,2)
if D(m, n) == 0
count = count + 1;
end
end
end

percent = round((1 - (count/(size(D,1) * size(D,2)))) * 100);

R = idct2(D);
filepath = strcat('/u/class/f/c-fras2/Pictures/', num2str(percent), '% Robot DCT.png');
imwrite(R, filepath);

First, the image is loaded in and the dct2 method is used to create the DCT compressed
matrix for A. Then, each value below a certain amount (here it is set to .1) is set to zero. This sets
the smaller, less useful values in the stored matrix to zero. Then, the number of zeros in the
matrix is counted to determine the percentage of compression. Finally, the D matrix is inverted
back into a normal raster matrix, then is written to a file.

Effectiveness of Compression Techniques

The SVD, DCT, and other compression techniques tend to have a golden ratio of
compression where the image is still very distinguishable, yet a large amount of compression is
performed. For the SVD and DCT in particular, an average curve looks somewhat like this:
Based upon this curve, it is clear that the compression starts off by removing many values
that are unnecessary and barely matter to the visibility of the piece. However, around 15% of the
remaining data, much more important information starts to be lost. Picture quality starts to
decrease rapidly. By 10%, things start to get visibly blurry, but still visible. By 1%, the image
becomes difficult to distinguish. Thus, the golden ratio of compression tends to lie between 15%
and 25%, depending on how high quality the image needs to be. Below that point, it is often not
worth the extra bit of compression for such a lower quality image.

Results - Bridge
100%

75% SVD 69% DCT

50% SVD 47% DCT

30% SVD 35% DCT

20% SVD 20% DCT

10% SVD 12% DCT

1% SVD 2% DCT

It is interesting to see how the images get lower in quality in different ways. As you can
see with the SVD images at 10% and 1%, blurry lines start to appear in the image. It is as if
entire lines of the image are stripped away. This makes sense in relation to the transformation
because as each singular value is removed, an entire eigenvector is removed, so lines or
“vectors” are removed from the image at a time.

In contrast, the DCT becomes spottier in the entire picture, which makes sense because
values are taken out of the matrix one at a time based on value, not vector by vector. This is
visible in the sky at the 20% mark, and becomes more visible in other spots at 10% and 2%. The
image is still recognizable at 2%, which is quite a feat, but the quality is still poor enough that
the sign can’t be read.

Overall, these results show why the JPEG format uses the DCT. The DCT retains more
quality overall because it affects the whole image the same way, so everything stays somewhat
visible even at single percentages of data remaining. However, since the SVD removes entire
vectors, parts of the image become quite messed up, such as the streaks in the sky at 10%,
whereas other parts are less affected.

Results - Robot
100%
75% SVD 76% DCT

50% SVD 57% DCT

30% SVD 30% DCT

15% SVD 15% DCT

10% SVD 11% DCT

5% SVD 4% DCT
1% SVD 1% DCT

These results further illustrate why the DCT is used in the JPEG algorithm. Even at the
level of 10% data remaining for the robot, everything is still fairly visible to the viewer. In fact, it
is possible that the golden ratio for the DCT may even drop into around 10%, as it still retains
reasonable quality at that point where the SVD does not. Of course, higher percentages are still
needed if quality is important, but otherwise 10% is actually quite viable.

Conclusion

The SVD and DCT techniques are both very useful for data compression, and can easily
compress an image to 30% of its original size with almost no visual difference. However, based
on these results, the DCT appears to be more effective because it takes its losses throughout the
entire photo evenly, instead of removing entire vectors of data at a time. To create a high quality
image that loses almost no important data, the 30% mark seems to be about the spot to stay, as
very little differences can be seen between that point and the original image. For images that
don’t worry about quality, about 15% for the SVD and 10% DCT is where the line should likely
be drawn, as quality becomes too poor past those points to be worth it.

Sources

https://www.math.cuhk.edu.hk/~lmlui/dct.pdf
http://videocodecs.blogspot.com/2007/05/image-coding-fundamentals_08.html
http://www.mvnet.fi/index.php?osio=Tutkielmat&luokka=Yliopisto&sivu=Image_compression
https://ntrs.nasa.gov/search.jsp?R=19920024689
https://www.sitepoint.com/gif-png-jpg-which-one-to-use/

Solid Works Training BASIC
100% (4)
Solid Works Training BASIC
176 pages
Image Compression
No ratings yet
Image Compression
89 pages
Image Database
No ratings yet
Image Database
16 pages
Image Compression
No ratings yet
Image Compression
17 pages
IP Exercises 2024 Ex5
No ratings yet
IP Exercises 2024 Ex5
7 pages
Project 15 April Paperpublished
No ratings yet
Project 15 April Paperpublished
4 pages
Iii Jpeg
No ratings yet
Iii Jpeg
32 pages
A Comparative Study of Image Compression Techniques: Kamalesh Acharya, Shruti Bijawat
No ratings yet
A Comparative Study of Image Compression Techniques: Kamalesh Acharya, Shruti Bijawat
6 pages
Multimedia Assignment
No ratings yet
Multimedia Assignment
12 pages
Data Analytics Certification
No ratings yet
Data Analytics Certification
10 pages
Data Science Training in Mumbai
No ratings yet
Data Science Training in Mumbai
10 pages
9146 MMVR Assignment1 Group3
No ratings yet
9146 MMVR Assignment1 Group3
10 pages
Jpeg
No ratings yet
Jpeg
28 pages
Ijet V3i3p39
No ratings yet
Ijet V3i3p39
8 pages
College of Information Science and Engineering. Central South University. Changsha, Hunan, 410083, P.R China
100% (2)
College of Information Science and Engineering. Central South University. Changsha, Hunan, 410083, P.R China
37 pages
Image Compression Using SVD PDF
No ratings yet
Image Compression Using SVD PDF
7 pages
CMR Engineering College: Lossless Image Compression Using Matlab
No ratings yet
CMR Engineering College: Lossless Image Compression Using Matlab
7 pages
Jpeg Compressor Using Matlab
No ratings yet
Jpeg Compressor Using Matlab
6 pages
Image Compression Report
No ratings yet
Image Compression Report
9 pages
Image and Video Compression
No ratings yet
Image and Video Compression
8 pages
Fast Solvers in Image Compression: Federico Jean Valentin
No ratings yet
Fast Solvers in Image Compression: Federico Jean Valentin
22 pages
Comparative Analysis Between DCT & DWT Techniques of Image Compression
No ratings yet
Comparative Analysis Between DCT & DWT Techniques of Image Compression
9 pages
SVD
No ratings yet
SVD
6 pages
DCT Image Compression: D. Bhavsingh EC94001 M.Tech, E.I
No ratings yet
DCT Image Compression: D. Bhavsingh EC94001 M.Tech, E.I
38 pages
An Approach For Compressing Digital Images by Using Run Length Encoding
No ratings yet
An Approach For Compressing Digital Images by Using Run Length Encoding
3 pages
Lecture 6
No ratings yet
Lecture 6
38 pages
Presented by:-ABHISHEK SENGAR M.Sc. 4 Semester ROLL NO.: - 02
No ratings yet
Presented by:-ABHISHEK SENGAR M.Sc. 4 Semester ROLL NO.: - 02
15 pages
JPEG Compression Standard
No ratings yet
JPEG Compression Standard
23 pages
Comparative Analysis Between DCT & DWT Techniques of Image Compression
No ratings yet
Comparative Analysis Between DCT & DWT Techniques of Image Compression
10 pages
Image Compression Using SVD
No ratings yet
Image Compression Using SVD
3 pages
Introduction To Conventional Compression Solutions
No ratings yet
Introduction To Conventional Compression Solutions
8 pages
Multimedia System: Safeen H. Rasool
No ratings yet
Multimedia System: Safeen H. Rasool
22 pages
Call - For - PaperDDiscrete Cosine Transform For Image Compression
No ratings yet
Call - For - PaperDDiscrete Cosine Transform For Image Compression
7 pages
Image Compression Approaches: A Comprehensive Study: Akul S, Kavitha S N
No ratings yet
Image Compression Approaches: A Comprehensive Study: Akul S, Kavitha S N
3 pages
Image Compression
No ratings yet
Image Compression
114 pages
Image Compression and Its Implementation in Real Life: Shreyansh Tripathi, Vedant Bonde, Yatharth Rai
No ratings yet
Image Compression and Its Implementation in Real Life: Shreyansh Tripathi, Vedant Bonde, Yatharth Rai
36 pages
BTL DSRR
No ratings yet
BTL DSRR
11 pages
Discrete Cosine Transform
No ratings yet
Discrete Cosine Transform
40 pages
An Efficient DCT Compression Technique Using Strassen's Matrix Multiplication Algorithm
No ratings yet
An Efficient DCT Compression Technique Using Strassen's Matrix Multiplication Algorithm
6 pages
DCT Image Compression: D. Bhavsingh EC94001 M.Tech, E.I
No ratings yet
DCT Image Compression: D. Bhavsingh EC94001 M.Tech, E.I
39 pages
Image Compression 2
No ratings yet
Image Compression 2
24 pages
Assignment
No ratings yet
Assignment
10 pages
Image Compression Using Discrete Cosine Transform: Data Compression Digital Images Lossy Lossless
No ratings yet
Image Compression Using Discrete Cosine Transform: Data Compression Digital Images Lossy Lossless
9 pages
"Lossy Image Compression": Reyes Espinoza Kevin
No ratings yet
"Lossy Image Compression": Reyes Espinoza Kevin
2 pages
Minor Project Report On Image Compression
No ratings yet
Minor Project Report On Image Compression
8 pages
JPEG Image Compression: Matt Marcus June 1, 2014
No ratings yet
JPEG Image Compression: Matt Marcus June 1, 2014
5 pages
Jpeg & BNP: Image Compression Implemented in Matlab
No ratings yet
Jpeg & BNP: Image Compression Implemented in Matlab
13 pages
Compression Using Huffman Coding
No ratings yet
Compression Using Huffman Coding
9 pages
Compression: Image Data Compression, Discrete Cosine Transform, and The JPEG Format
No ratings yet
Compression: Image Data Compression, Discrete Cosine Transform, and The JPEG Format
14 pages
Image Compression Techniques
No ratings yet
Image Compression Techniques
5 pages
Information Theory and Coding: Submitted by
No ratings yet
Information Theory and Coding: Submitted by
12 pages
A Philosopher's Understanding of Quantum Mechanics (Vermaas)
100% (1)
A Philosopher's Understanding of Quantum Mechanics (Vermaas)
308 pages
DCT
No ratings yet
DCT
32 pages
Imagemanipulation PDF
No ratings yet
Imagemanipulation PDF
8 pages
Project 1 - Image Compression
No ratings yet
Project 1 - Image Compression
13 pages
JPEG Image Compression
No ratings yet
JPEG Image Compression
54 pages
Volume of Cube and Rectangular Prism in Cubic CM and M
100% (1)
Volume of Cube and Rectangular Prism in Cubic CM and M
49 pages
Compression of A Filtered Image Using DCT Technique
No ratings yet
Compression of A Filtered Image Using DCT Technique
8 pages
Image Compression Using DCT
100% (1)
Image Compression Using DCT
10 pages
Implementation of Image and Audio Compression Techniques Using
No ratings yet
Implementation of Image and Audio Compression Techniques Using
26 pages
Chap 4 MNGT Acctng PDF
No ratings yet
Chap 4 MNGT Acctng PDF
4 pages
Standard Form of Quadratic Equation
No ratings yet
Standard Form of Quadratic Equation
2 pages
General Physics Lesson 3
No ratings yet
General Physics Lesson 3
10 pages
RFIC Inductor Toolkit
No ratings yet
RFIC Inductor Toolkit
39 pages
Fourier Transform and Fast Fourier Transform Algorithms
No ratings yet
Fourier Transform and Fast Fourier Transform Algorithms
42 pages
Chap 6
No ratings yet
Chap 6
24 pages
OptiTekServices Mining
No ratings yet
OptiTekServices Mining
3 pages
SAT Equivalent-Expressions
No ratings yet
SAT Equivalent-Expressions
79 pages
Breakthrough Trading Formulas
100% (1)
Breakthrough Trading Formulas
7 pages
Chapter II Risk Management
No ratings yet
Chapter II Risk Management
36 pages
RS Aggrawal Solutions For Class 6 Maths Chapter 16 Triangles
No ratings yet
RS Aggrawal Solutions For Class 6 Maths Chapter 16 Triangles
8 pages
S5 M2 Mock Paper 3
No ratings yet
S5 M2 Mock Paper 3
12 pages
Lecture 2 Numeric Solution of Differential Equations
No ratings yet
Lecture 2 Numeric Solution of Differential Equations
45 pages
Understanding The Statistical Tests in Your Study
No ratings yet
Understanding The Statistical Tests in Your Study
9 pages
2021 Test 3 Graphs and Networks
No ratings yet
2021 Test 3 Graphs and Networks
9 pages
Applied Mechanics Ii
No ratings yet
Applied Mechanics Ii
3 pages
Sec. 3
No ratings yet
Sec. 3
8 pages
Design of An Improved Interval Type-2 Controller Using FCM and Supervised Clustering Algorithms
No ratings yet
Design of An Improved Interval Type-2 Controller Using FCM and Supervised Clustering Algorithms
10 pages
GAN-based Synthetic Medical Image Augmentation
No ratings yet
GAN-based Synthetic Medical Image Augmentation
10 pages
Defining Computational Aesthetics Hoenig
No ratings yet
Defining Computational Aesthetics Hoenig
6 pages
Normalizer Free Networks
No ratings yet
Normalizer Free Networks
22 pages
Lambert Conformal Conic Projection For India
No ratings yet
Lambert Conformal Conic Projection For India
4 pages
SignExplainer An Explainable AI-Enabled Framework For Sign Language Recognition With Ensemble Learning
No ratings yet
SignExplainer An Explainable AI-Enabled Framework For Sign Language Recognition With Ensemble Learning
10 pages
DLL 4TH Quarter
No ratings yet
DLL 4TH Quarter
11 pages
The Multiple Classical Linear Regression Model (CLRM) : Specification and Assumptions
No ratings yet
The Multiple Classical Linear Regression Model (CLRM) : Specification and Assumptions
19 pages
To Predict The Bead Geometry Parameters and Shape Relationships in MIG Welding of Stainless Steel 301 by Mathematical Modelling
No ratings yet
To Predict The Bead Geometry Parameters and Shape Relationships in MIG Welding of Stainless Steel 301 by Mathematical Modelling
10 pages
Stanford E14 PSET 1 Solutions
No ratings yet
Stanford E14 PSET 1 Solutions
18 pages
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
Raster Graphics Editor: Transforming Visual Realities: Mastering Raster Graphics Editors in Computer Vision
From Everand
Raster Graphics Editor: Transforming Visual Realities: Mastering Raster Graphics Editors in Computer Vision
Fouad Sabry
No ratings yet
Digital Raster Graphic: Unveiling the Power of Digital Raster Graphics in Computer Vision
From Everand
Digital Raster Graphic: Unveiling the Power of Digital Raster Graphics in Computer Vision
Fouad Sabry
No ratings yet

Linear Algebra in Image Compression - SVD and DCT

Uploaded by

Linear Algebra in Image Compression - SVD and DCT

Uploaded by

Linear Algebra in Image Compression: SVD and DCT

By: Andrew Fraser

Processes of Image Compression

One important aspect of image compression is whether it is lossy or lossless. A lossy

Singular Value Decomposition

SVD, or Singular Value Decomposition, is a matrix factorization in Linear Algebra

Discrete Cosine Transform

The DCT, or Discrete Cosine Transform, is a transformation that uses matrix

Applying the DCT

percent = round((1 - (count/(size(D,1) * size(D,2)))) * 100);

Effectiveness of Compression Techniques

75% SVD 69% DCT

30% SVD 35% DCT

20% SVD 20% DCT

10% SVD 12% DCT

50% SVD 57% DCT

30% SVD 30% DCT

10% SVD 11% DCT

You might also like