Adama Science and Technology University: Advanced Digital Signal Processing Project Presentation

ADAMA SCIENCE AND TECHNOLOGY UNIVERSITY
DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING

Advanced Digital Signal Processing
Project presentation

Speech Compression Using DCT

By Geleta Aman
ID No.: PGR 35870/16
ABSTRACT
Speech compression is a fundamental part of modern communication
systems, enabling efficient transmission and storage of audio data.
The Discrete Cosine Transform (DCT) has emerged as a powerful tool in
speech compression because it concentrates signal energy into a small
set of coefficients. This paper presents an analysis of speech
compression using the DCT, focusing on the mathematical underpinnings
and practical implementation aspects. The trade-off between compression
ratio and quality is examined with respect to parameters such as the
threshold and the quantization step size.
Evaluation metrics including Signal-to-Noise Ratio (SNR) and Mean
Squared Error (MSE) are utilized to assess the fidelity of the
reconstructed speech signal. Through mathematical analysis and
experimental validation, this study highlights the efficacy of DCT-based
speech compression in achieving significant compression ratios while
preserving perceptual quality. The findings contribute to the
understanding and optimization of speech compression techniques,
paving the way for enhanced audio communication systems in various
domains.
INTRODUCTION

The objective of speech is communication, whether face to face or cell phone to
cell phone. The large amount of data involved is a problem for transmission and storage.
Speech compression is the technology of converting human speech into an
efficiently encoded representation that can later be decoded to produce a
close approximation of the original signal. The major objective of speech
compression is to represent speech with as few bits as possible while
retaining an acceptable level of quality.

A signal can be compressed by removing the redundancy between neighboring samples. In this paper
the compression technique is implemented in two steps. In the first step, a transform is applied
to the speech signal to obtain a new set of data with smaller values and more repetition. The
second step is the coding (compression) step, which represents the data set in its minimal form
using encoding techniques such as run-length encoding, Huffman encoding, or run-length encoding
followed by Huffman encoding. The performance measures compression factor (CF), signal-to-noise
ratio (SNR), peak signal-to-noise ratio (PSNR), normalized root mean square error (NRMSE), and
retained signal energy (RSE) are computed for the reconstructed speech obtained from the
DCT-based compression technique.
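
The coding step can be illustrated with a minimal Python sketch of run-length encoding followed by Huffman coding. The input values are hypothetical quantized coefficients chosen for illustration, and the helper names (run_length_encode, huffman_code) are not from the presentation; this is one possible arrangement of the two encoders, not necessarily the one used in the project.

```python
import heapq
from collections import Counter
from itertools import groupby


def run_length_encode(values):
    """Collapse consecutive repeats into (value, run_length) pairs."""
    return [(v, len(list(group))) for v, group in groupby(values)]


def huffman_code(symbols):
    """Build a prefix code (symbol -> bit string) from symbol frequencies."""
    freq = Counter(symbols)
    if len(freq) == 1:  # a single distinct symbol still needs one bit
        return {next(iter(freq)): "0"}
    # Heap entries: (weight, tie_breaker, {symbol: partial_code}).
    # The tie breaker keeps Python from ever comparing the dictionaries.
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        w1, _, c1 = heapq.heappop(heap)
        w2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]


# Hypothetical quantized coefficients: long zero runs, as left by thresholding.
quantized = [12, 12, -3, 0, 0, 0, 0, 0, 0, 5, 0, 0, 0, 0, 0, 0]

runs = run_length_encode(quantized)          # step 1: run-length encoding
code = huffman_code(runs)                    # step 2: Huffman code over the runs
bitstream = "".join(code[r] for r in runs)   # encoded representation

print(runs)
print(len(bitstream), "bits for", len(quantized), "coefficients")
```

Run-length encoding pays off only when the transform and thresholding leave long runs of identical values, which is why it is applied after the transform step rather than to the raw samples.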
Objectives
Here are four specific objectives of speech compression using DCT:
• Enhancing data storage efficiency by reducing the size of speech signals
• Minimizing bandwidth requirements for speech transmission
• Mitigating storage and transmission costs
• Preserving essential speech features while reducing redundancy, enabling
efficient utilization of communication resources in various applications
Statement of the Problem:

Speech compression is a critical aspect of various applications, including
telecommunications, multimedia streaming, and storage systems.
Efficient compression techniques are essential to reduce the storage
requirements and bandwidth usage while maintaining acceptable audio
quality. In this context, the utilization of the Discrete Cosine Transform
(DCT) for speech compression presents a promising approach.
SYSTEM DESIGN AND MATHEMATICAL ANALYSIS
Methodology for compression of speech signal

In this paper we implement a speech compression technique based on the DCT. With the DCT,
speech is represented in terms of its DCT coefficients, so data operations can be performed on
the coefficients alone. The transform and thresholding do not by themselves compress the signal;
they expose information about the signal that allows it to be compressed by standard encoding
techniques. Compression is achieved by treating small coefficients as insignificant and
discarding them, and then applying a quantization and encoding scheme to the remaining
coefficients.
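
The idea of discarding small coefficients can be sketched in a few lines of Python. The test frame, the 5% threshold, and the function names are assumptions made for illustration; the DCT is computed directly from its definition so that the example needs only the standard library (a library routine such as scipy.fft.dct with norm="ortho" would normally be used instead).

```python
import math


def dct_ortho(x):
    """Orthonormal DCT-II of a sequence, computed directly (O(N^2))."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        out.append(scale * sum(
            x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
            for i in range(n)
        ))
    return out


def hard_threshold(coeffs, fraction):
    """Zero every coefficient smaller than fraction * max |coefficient|."""
    limit = fraction * max(abs(c) for c in coeffs)
    return [c if abs(c) >= limit else 0.0 for c in coeffs]


# Hypothetical 64-sample frame made of two low-frequency tones.
frame = [math.sin(2 * math.pi * 3 * i / 64) + 0.5 * math.sin(2 * math.pi * 7 * i / 64)
         for i in range(64)]

coeffs = dct_ortho(frame)
kept = hard_threshold(coeffs, fraction=0.05)  # 5% is an assumed threshold

nonzero = sum(1 for c in kept if c != 0.0)
retained_energy = sum(c * c for c in kept) / sum(c * c for c in coeffs)
print(f"{nonzero} of {len(kept)} coefficients kept, "
      f"{100 * retained_energy:.1f}% of the energy retained")
```

Because the transform is orthonormal, the energy lost by thresholding equals the energy of the discarded coefficients, so the retained signal energy can be read directly in the coefficient domain.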
Steps in Speech Compression using DCT:
• Segmentation: Divide the speech signal into small segments or frames. Each frame typically
consists of a few milliseconds of audio data.
• DCT Transformation: Apply DCT to each frame of the speech signal.
• Quantization: Quantize the DCT coefficients by rounding them to a smaller number of bits or
by using a quantization matrix. This step reduces the precision of the coefficients.
• Entropy Coding: Apply entropy coding techniques (e.g., Huffman coding) to further compress
the quantized coefficients.
• Transmission/Storage: Transmit or store the compressed coefficients along with necessary
side information (e.g., frame size, quantization parameters) to reconstruct the speech signal.
• Reconstruction: At the decoder side, reverse the compression process by applying the inverse
steps: entropy decoding, dequantization, inverse DCT, and frame concatenation.
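
The steps above can be sketched end to end in Python, leaving out entropy coding for brevity. The frame length (64 samples), the uniform quantization step (0.05), and the test signal are assumed values, and the function names are hypothetical; the presentation does not state which parameters were actually used.

```python
import math


def _basis(n):
    """Rows are the orthonormal DCT-II basis vectors for length n."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def dct_ortho(x):
    return [sum(b * v for b, v in zip(row, x)) for row in _basis(len(x))]


def idct_ortho(c):
    basis = _basis(len(c))
    return [sum(c[k] * basis[k][i] for k in range(len(c))) for i in range(len(c))]


def compress(signal, frame_len, step):
    """Segmentation, DCT, and uniform quantization to integer indices."""
    frames = [signal[i:i + frame_len] for i in range(0, len(signal), frame_len)]
    return [[round(c / step) for c in dct_ortho(f)] for f in frames]


def decompress(quantized_frames, step):
    """Dequantization, inverse DCT, and frame concatenation."""
    out = []
    for q in quantized_frames:
        out.extend(idct_ortho([v * step for v in q]))
    return out


FRAME_LEN = 64   # assumed; 8 ms at an 8 kHz sampling rate
STEP = 0.05      # assumed quantization step size

# Hypothetical 200-sample test signal (the last frame is shorter than FRAME_LEN).
signal = [math.sin(2 * math.pi * 5 * i / 200) for i in range(200)]

encoded = compress(signal, FRAME_LEN, STEP)
restored = decompress(encoded, STEP)
mse = sum((a - b) ** 2 for a, b in zip(signal, restored)) / len(signal)
print(f"{len(encoded)} frames, MSE = {mse:.2e}")
```

The frame length and step size are the side information the decoder needs; a larger step gives more zero indices for the entropy coder at the cost of a larger reconstruction error.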
[Figure: System block diagram]
MATHEMATICAL ANALYSIS
Mathematical model
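
For reference, one common form of the transform pair is the orthonormal DCT-II and its inverse for a frame x(n) of N samples. The normalization shown here is an assumption; the presentation does not state which convention it uses.

```latex
\begin{aligned}
X(k) &= w(k) \sum_{n=0}^{N-1} x(n) \cos\!\left(\frac{\pi (2n+1) k}{2N}\right), \qquad k = 0, 1, \dots, N-1 \\
x(n) &= \sum_{k=0}^{N-1} w(k)\, X(k) \cos\!\left(\frac{\pi (2n+1) k}{2N}\right), \qquad n = 0, 1, \dots, N-1 \\
w(k) &= \begin{cases} \sqrt{1/N}, & k = 0 \\ \sqrt{2/N}, & 1 \le k \le N-1 \end{cases}
\end{aligned}
```

With this normalization the transform is orthogonal, so the signal energy is preserved between the two domains and the error introduced by discarding coefficients equals the energy of the discarded coefficients.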
RESULT AND DISCUSSION

Performance evaluation

To evaluate the overall performance of the proposed audio compression
scheme, several objective tests were made. The quality of the
reconstructed signal is measured using the compression factor,
signal-to-noise ratio (SNR), peak signal-to-noise ratio (PSNR), and
mean square error (MSE).

Signal to Noise Ratio (SNR)

$$\mathrm{SNR} = 10 \log_{10}\left(\frac{\sigma_x^2}{\sigma_e^2}\right)$$

where $\sigma_x^2$ is the mean square of the speech signal and $\sigma_e^2$ is the mean
square difference between the original and reconstructed speech signals.

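Peak Signal to Noise Ratio (PSNR)

$$\mathrm{PSNR} = 10 \log_{10}\left(\frac{N X^2}{\lVert x - x' \rVert^2}\right)$$

where $N$ is the length of the reconstructed signal, $X$ is the maximum
absolute value of the signal $x$ (so $X^2$ is its maximum squared value), and
$\lVert x - x' \rVert^2$ is the energy of the difference between the original
signal $x$ and the reconstructed signal $x'$.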

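Normalized Root Mean Square Error (NRMSE)

$$\mathrm{NRMSE} = \sqrt{\frac{\sum_n \left(x(n) - x'(n)\right)^2}{\sum_n \left(x(n) - \mu_x\right)^2}}$$

where $x(n)$ is the speech signal, $x'(n)$ is the reconstructed speech signal, and
$\mu_x$ is the mean of the speech signal.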
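
The three measures can be computed with a short Python sketch. It assumes the usual definitions, SNR = 10 log10(σx²/σe²), PSNR = 10 log10(N·X²/‖x − x'‖²) with X taken as the maximum absolute sample value, and NRMSE as the root of the error energy divided by the energy of the mean-removed signal. The sample values are made up for illustration.

```python
import math


def snr_db(x, y):
    """10*log10(mean square of x / mean square of the error x - y)."""
    n = len(x)
    signal_ms = sum(v * v for v in x) / n
    error_ms = sum((a - b) ** 2 for a, b in zip(x, y)) / n
    return 10 * math.log10(signal_ms / error_ms)


def psnr_db(x, y):
    """10*log10(N * X^2 / ||x - y||^2), with X = max |x(n)| (assumed reading)."""
    peak = max(abs(v) for v in x)
    error_energy = sum((a - b) ** 2 for a, b in zip(x, y))
    return 10 * math.log10(len(x) * peak ** 2 / error_energy)


def nrmse(x, y):
    """sqrt(sum (x - y)^2 / sum (x - mean(x))^2)."""
    mean = sum(x) / len(x)
    error_energy = sum((a - b) ** 2 for a, b in zip(x, y))
    spread = sum((a - mean) ** 2 for a in x)
    return math.sqrt(error_energy / spread)


# Hypothetical original and reconstructed samples.
original = [1.0, -2.0, 3.0, -4.0]
reconstructed = [1.1, -1.9, 2.9, -4.1]

print(f"SNR   = {snr_db(original, reconstructed):.2f} dB")
print(f"PSNR  = {psnr_db(original, reconstructed):.2f} dB")
print(f"NRMSE = {nrmse(original, reconstructed):.4f}")
```

PSNR is never smaller than SNR under these definitions, because N·X² is an upper bound on the signal energy.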

Results

Table 1 summarizes the error, PSNR, RMSE, and signal size before compression
and after decompression for the speech signal using the DCT-based compression.

Table 1. Results of DCT-based speech compression.

No.  Error (MSE)  PSNR (dB)  RMSE      Size before compression  Size after decompression
1    3.0587e+04   21.8790    174.8914  110033                   110033
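
The "Error" and RMSE values in the table appear to be related by a square root, which suggests that the "Error" column is the mean square error. A quick check under that assumption:

```python
import math

error = 3.0587e4          # "Error" value reported in the table
rmse_reported = 174.8914  # RMSE value reported in the table

# If "Error" is the mean square error, its square root should match the RMSE.
rmse_from_error = math.sqrt(error)
print(f"sqrt(Error) = {rmse_from_error:.2f}, reported RMSE = {rmse_reported:.2f}")
```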

CONCLUSION

In conclusion, speech signal compression can be achieved through
various methods, but one of the simplest and most effective approaches is
the Discrete Cosine Transform (DCT). By applying the DCT, we can identify
the coefficients of the speech signal that fall below a threshold, discard
them, and thereby reduce the size of the signal.

While numerous other transforms and techniques exist for speech signal
compression, the DCT stands out as a simple and widely adopted method.
Its effectiveness lies in its ability to represent the signal compactly in
the frequency domain, enabling significant reductions in data size while
preserving the essential information in the speech signal.
Thank you
