Speech Coding Techniques

The document discusses various speech coding techniques, emphasizing their importance for VoIP and the balance between quality and bandwidth. It covers concepts such as voice quality measurement using the Mean Opinion Score (MOS), speech production mechanisms, and different quantization methods. Additionally, it highlights specific codecs like G.711 and CELP, detailing their functionalities and performance metrics.

Uploaded by

Bala Murugan


Introduction
• Efficient speech-coding techniques
  • Advantages for VoIP
  • Digital streams of ones and zeros
  • The lower the bandwidth, the lower the quality
  • RTP payload types
• Processing power
  • Better quality (for a given bandwidth) requires a more complex algorithm
  • A balance between quality and cost
Voice Quality
• Bandwidth is easily quantified
• Voice quality is subjective
• MOS, Mean Opinion Score
  • ITU-T Recommendation P.800
  • Excellent – 5
  • Good – 4
  • Fair – 3
  • Poor – 2
  • Bad – 1
  • A minimum of 30 people
  • Listeners rate voice samples or take part in conversations
• P.800 recommendations
  • The selection of participants
  • The test environment
  • Explanations to listeners
  • Analysis of results
• Toll quality
  • A MOS of 4.0 or higher
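As a small illustration of how a MOS is obtained, the sketch below averages listener scores on the five-point scale and checks the result against the toll-quality threshold of 4.0. The 30 ratings are made up for the example and the function name is hypothetical; P.800 itself specifies much more (participant selection, environment, analysis) than this arithmetic.

```python
from statistics import mean

# Hypothetical ratings from 30 listeners (5 = Excellent ... 1 = Bad).
# The values are invented purely for illustration.
ratings = [5] * 8 + [4] * 16 + [3] * 5 + [2] * 1


def mean_opinion_score(scores):
    """Average listener scores; reject anything outside the 1-5 scale."""
    if any(s not in (1, 2, 3, 4, 5) for s in scores):
        raise ValueError("scores must be integers from 1 to 5")
    return mean(scores)


mos = mean_opinion_score(ratings)
is_toll_quality = mos >= 4.0  # toll quality: a MOS of 4.0 or higher
print(f"listeners={len(ratings)} MOS={mos:.2f} toll quality={is_toll_quality}")
```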
About Speech
• Speech
  • Air pushed from the lungs past the vocal cords and along the vocal tract
  • The basic vibrations: vocal cords
  • The sound is altered by the disposition of the vocal tract (tongue and mouth)
• Model the vocal tract as a filter
  • The shape changes relatively slowly
• The vibrations at the vocal cords
  • The excitation signal
Speech sounds
• Voiced sound
  • The vocal cords vibrate open and close
  • Quasi-periodic pulses of air
  • The rate of the opening and closing: the pitch
• Unvoiced sounds
  • Forcing air at high velocities through a constriction
  • Noise-like turbulence
  • Show little long-term periodicity
  • Short-term correlations still present
• Plosive sounds
  • A complete closure in the vocal tract
  • Air pressure is built up and released suddenly
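Voice Sampling
• Discrete-time LTI systems: the convolution sum

\[
x[n] = \sum_{k=-\infty}^{+\infty} x[k]\,\delta[n-k], \qquad
y[n] = \sum_{k=-\infty}^{+\infty} x[k]\,h[n-k]
\]

[Figure: example with h[n] = 1 for n = 0, 1, 2; x[0] = 0.5, x[1] = 2; resulting y[n] = 0.5, 2.5, 2.5, 2 for n = 0, 1, 2, 3]

• Nyquist sampling theorem

\[
s(t) = \sum_{n=-\infty}^{\infty} \delta(t - nT), \qquad
x_s(t) = x_c(t)\,s(t) = x_c(t) \sum_{n=-\infty}^{\infty} \delta(t - nT)
\]

\[
S(j\Omega) = \frac{2\pi}{T} \sum_{k=-\infty}^{\infty} \delta(\Omega - k\Omega_s)
\]

[Figure: spectrum X_c(jΩ), band-limited to |Ω| ≤ Ω_N, and its copies centered at multiples of Ω_s after sampling; the lower edge of the first copy lies at Ω_s − Ω_N]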
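The convolution sum can be checked numerically with a few lines of code. The sketch below is a direct implementation for finite-length sequences; the input values (h[n] = 1 for n = 0, 1, 2 and x = 0.5, 2) are a reading of the small plots on this slide and should be treated as an assumption, and the function name is hypothetical.

```python
def convolve(x, h):
    """Convolution sum y[n] = sum_k x[k] * h[n - k] for finite sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for k, xk in enumerate(x):
        for m, hm in enumerate(h):
            # The term x[k] * h[m] contributes to output index n = k + m.
            y[k + m] += xk * hm
    return y


x = [0.5, 2.0]        # x[0], x[1] (assumed from the slide's plot)
h = [1.0, 1.0, 1.0]   # h[0], h[1], h[2] (assumed from the slide's plot)
y = convolve(x, h)
print(y)  # [0.5, 2.5, 2.5, 2.0]
```

Each input sample launches a scaled, shifted copy of h[n], and the output is the sum of those copies, which is why the middle samples reach 2.5.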
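Quantization (Scalar Quantization)
[Figure: axis from $m_0 = -A$ to $m_L = A$ with boundaries $m_1, m_2, \ldots, m_k, m_{k+1}, \ldots, m_{L-1}$; representative values $v_1, v_2, \ldots, v_{k+1}, \ldots, v_L$ lie between the boundaries; interval $J_{k+1}$ is marked]
• Assume $|x[n]| \le A$
• Divide the range $[-A, A]$ into $L$ quantization levels $\{J_1, J_2, \ldots, J_k, \ldots, J_L\}$, with $J_k : [m_{k-1}, m_k]$ and $L = 2^R$
• Each quantization level $J_k$ is represented by a value $v_k$
• $S = \bigcup_k J_k$, $V = \{v_1, v_2, \ldots, v_k, \ldots, v_L\}$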
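A minimal sketch of a scalar quantizer along these lines is shown below. It assumes equal-width intervals and takes the mid-point of each interval as its representative value; the slide only says that each level is represented by some value, so both choices, as well as the function name and defaults, are assumptions.

```python
def uniform_quantize(x, A=1.0, R=3):
    """Return (k, v_k): the interval index k in 1..L and its mid-point value."""
    L = 2 ** R                            # number of levels for R bits
    step = 2 * A / L                      # width of each interval J_k
    x = max(-A, min(A, x))                # clip to the assumed range [-A, A]
    k = min(int((x + A) / step) + 1, L)   # x == A falls in the last interval
    v_k = -A + (k - 0.5) * step           # mid-point of [m_{k-1}, m_k]
    return k, v_k


for sample in (-1.0, 0.1, 1.0):
    print(sample, uniform_quantize(sample))
```

With mid-point values the quantization error never exceeds half a step, whatever the size of the input, which is the property non-uniform schemes trade away to favor small signals.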
Non-Uniform Quantization
[Figure: quantization boundaries $m_0 = -A, m_1, m_2, \ldots, 0, \ldots, m_L = A$]
• Concept: small quantization intervals for small x, large quantization intervals for large x
• Goal: constant $SNR_Q$ for all x
Companding
[Block diagram: x[n] → compressor F(x) → uniform quantization → …1101… → uniform decoder → expander $F^{-1}(x)$ → $\hat{x}[n]$]
• Compressor + Expander → Compander
• F(x) specifies the non-uniform quantization characteristics
Non-Uniform Quantization
• μ-law

\[
|F(x)| = \frac{\log(1 + \mu|x|)}{\log(1 + \mu)}, \qquad 0 \le |x| \le 1
\]

• A-law

\[
|F(x)| =
\begin{cases}
\dfrac{A|x|}{1 + \ln A}, & 0 \le |x| \le \dfrac{1}{A} \\[2ex]
\dfrac{1 + \ln(A|x|)}{1 + \ln A}, & \dfrac{1}{A} \le |x| \le 1
\end{cases}
\]

• Typical values in practice: μ = 255, A = 87.6

Codec types: waveform codecs, source codecs (also known as vocoders), and hybrid codecs.
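To see what companding buys, the sketch below applies the μ-law compressor with μ = 255, an 8-bit uniform quantizer, and the matching expander, and compares the relative error with plain 8-bit uniform quantization. It uses the continuous μ-law formula on inputs normalized to [-1, 1]; this is an illustration of the principle, not the segmented bit-exact encoding of a real codec, and all function names are hypothetical.

```python
import math

MU = 255.0


def mu_compress(x, mu=MU):
    """F(x) = sign(x) * ln(1 + mu*|x|) / ln(1 + mu), for |x| <= 1."""
    return math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)


def mu_expand(y, mu=MU):
    """Inverse of mu_compress: sign(y) * ((1 + mu)**|y| - 1) / mu."""
    return math.copysign(math.expm1(abs(y) * math.log1p(mu)) / mu, y)


def quantize_uniform(y, bits=8):
    """Uniform mid-point quantizer on [-1, 1] with 2**bits levels."""
    levels = 2 ** bits
    step = 2.0 / levels
    k = min(int((y + 1.0) / step), levels - 1)
    return -1.0 + (k + 0.5) * step


def compand(x, bits=8):
    """Compress, quantize uniformly, then expand."""
    return mu_expand(quantize_uniform(mu_compress(x), bits))


for x in (0.01, 0.1, 0.9):
    rel_uniform = abs(quantize_uniform(x) - x) / x
    rel_companded = abs(compand(x) - x) / x
    print(f"x={x}: relative error uniform={rel_uniform:.4f} companded={rel_companded:.4f}")
```

For small inputs the companded path should show a much smaller relative error than the uniform one, which is the sense in which the scheme aims at a roughly constant signal-to-quantization-noise ratio.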
Speech Source Model and Source Coding
[Block diagram: a random sequence generator (unvoiced) or a periodic pulse train generator with period N (voiced) is selected by the v/u switch and scaled by the gain G to form the excitation u[n], which drives the vocal tract model G(z), G(ω), g[n] to produce x[n]]

\[
G(z) = \frac{1}{1 - \sum_{k=1}^{P} a_k z^{-k}}
\]

Excitation parameters (excitation signal u[n])
• v/u: voiced/unvoiced
• N: pitch for voiced
• G: signal gain
Vocal tract parameters (formant structure of speech signals)
• {a_k}: LPC coefficients
A good approximation, though not precise enough
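The source model can be sketched in a few lines: an excitation u[n] (pulse train for voiced, noise for unvoiced) scaled by a gain and passed through an all-pole filter of the form G(z) = 1 / (1 - Σ a_k z^-k). The coefficient values, the 160-sample frame, and the 80-sample pitch period below are hypothetical choices for illustration, not values from the slides.

```python
import random


def excitation(voiced, n_samples, pitch_period=80, gain=1.0, seed=0):
    """u[n]: gain-scaled pulse train (voiced) or Gaussian noise (unvoiced)."""
    if voiced:
        return [gain if n % pitch_period == 0 else 0.0 for n in range(n_samples)]
    rng = random.Random(seed)
    return [gain * rng.gauss(0.0, 1.0) for _ in range(n_samples)]


def all_pole_filter(u, a):
    """x[n] = u[n] + sum_{k=1..P} a_k * x[n-k], with a[k-1] holding a_k."""
    x = []
    for n, un in enumerate(u):
        acc = un
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                acc += ak * x[n - k]
        x.append(acc)
    return x


a = [1.3, -0.6]  # hypothetical, stable 2nd-order coefficients
voiced_frame = all_pole_filter(excitation(True, 160, pitch_period=80), a)
unvoiced_frame = all_pole_filter(excitation(False, 160), a)
print(voiced_frame[:4])
```

Only the switch position, the pitch period, the gain, and the coefficients describe the frame, which is why a source coder can transmit so few bits.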
LPC Vocoder (Voice Coder)
[Block diagram, transmitter: x[n] → LPC analysis → {a_k}, N, G, v/u → encoder → …11011…]
• N by pitch detection
• v/u by voicing detection
[Block diagram, receiver: …11011… → decoder → {a_k}, N, G, v/u → excitation (Ex) → G(z), g[n] → x[n]]
• {a_k} can be non-uniform or vector quantized to reduce the bit rate further
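The slides do not say how the LPC analysis block computes {a_k}. One common approach, shown here as an assumption rather than as the method of any particular vocoder, is the autocorrelation method solved with the Levinson-Durbin recursion. The test signal is the impulse response of a known all-pole filter with hypothetical coefficients, so the recovered values can be compared with the true ones.

```python
def autocorrelation(x, max_lag):
    """r[k] = sum_n x[n] * x[n + k] for k = 0..max_lag."""
    return [sum(x[n] * x[n + k] for n in range(len(x) - k)) for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the normal equations for a_1..a_P in x[n] ~ sum_k a_k * x[n-k]."""
    a = [0.0] * (order + 1)            # a[0] is unused; a[k] holds a_k
    err = r[0]                         # prediction error energy so far
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err                  # reflection coefficient of stage i
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


# Test signal: impulse response of 1 / (1 - 1.3 z^-1 + 0.6 z^-2).
true_a = [1.3, -0.6]
x = []
for n in range(400):
    sample = 1.0 if n == 0 else 0.0
    for k, ak in enumerate(true_a, start=1):
        if n - k >= 0:
            sample += ak * x[n - k]
    x.append(sample)

r = autocorrelation(x, 2)
a_est, residual_energy = levinson_durbin(r, 2)
gain = residual_energy ** 0.5          # estimate of the excitation gain G
print(a_est, gain)
```

In a vocoder the same analysis would run once per frame, with pitch and voicing estimated separately.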
G.711
• The most commonplace codec
  • Used in the circuit-switched telephone network
• PCM, Pulse-Code Modulation
• If uniform quantization were used
  • 12 bits × 8 k samples/sec = 96 kbps
• Non-uniform quantization
  • 8 bits × 8 k samples/sec = 64 kbps, the DS0 rate
• μ-law
  • North America
• A-law
  • Other countries, a little friendlier to lower signal levels
• An MOS of about 4.3
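The bit-rate figures for PCM follow from bits per sample times samples per second. The short sketch below works through the arithmetic, assuming the 8 kHz sampling rate from the slide and the 8-bit companded samples that G.711 uses, which give the 64 kbps DS0 rate; the helper name is hypothetical.

```python
SAMPLE_RATE_HZ = 8000  # 8 k samples per second


def bit_rate_kbps(bits_per_sample, sample_rate_hz=SAMPLE_RATE_HZ):
    """Bit rate in kbps for fixed-size samples at a given sampling rate."""
    return bits_per_sample * sample_rate_hz / 1000


uniform_kbps = bit_rate_kbps(12)    # uniform quantization at 12 bits/sample
companded_kbps = bit_rate_kbps(8)   # companded PCM at 8 bits/sample
saving = 1 - companded_kbps / uniform_kbps
print(f"uniform={uniform_kbps} kbps, companded={companded_kbps} kbps, saving={saving:.0%}")
```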

LD-CELP (G.728)
• 1024 vectors in the code book
• 10-bit pointer (index)
• 16 kbps
• CELP encoder
  • Minimizes a frequency-weighted mean-square error
• LD-CELP decoder
• An MOS of about 3.9
• One-quarter of the G.711 bandwidth
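The idea of sending a 10-bit pointer into a 1024-entry code book can be sketched as an exhaustive search for the closest code vector. This is a deliberately simplified illustration: it uses a random code book and a plain squared error on the vector itself, whereas a CELP encoder passes each candidate through the synthesis filter and minimizes a frequency-weighted error. The vector length of 5 samples is inferred from the slide's numbers (10 bits per vector at 16 kbps and 8000 samples per second) and all names are hypothetical.

```python
import random

CODEBOOK_SIZE = 1024   # from the slide: 1024 vectors
INDEX_BITS = 10        # 2**10 == 1024
VECTOR_LEN = 5         # assumption: 8000 samples/s * 10 bits / 16000 bit/s

rng = random.Random(1)
codebook = [[rng.uniform(-1, 1) for _ in range(VECTOR_LEN)] for _ in range(CODEBOOK_SIZE)]


def squared_error(u, v):
    """Plain (unweighted) squared error between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def best_index(target, book):
    """Exhaustive search for the code vector with the smallest squared error."""
    return min(range(len(book)), key=lambda i: squared_error(target, book[i]))


target = [0.2, -0.1, 0.4, 0.0, -0.3]
index = best_index(target, codebook)
decoded = codebook[index]   # the decoder needs only the 10-bit index

bit_rate_kbps = INDEX_BITS * (8000 / VECTOR_LEN) / 1000
print(index, bit_rate_kbps, bit_rate_kbps / 64)
```

The last line shows the ratio to the 64 kbps of G.711, which is where the one-quarter figure comes from.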
