New Speech Coding Techniques: Mr. L.Ramesh Ap/Ece

The document discusses new speech coding techniques that provide efficient digital encoding of voice signals. It covers various coding methods like ADPCM, LPC vocoding, analysis-by-synthesis codecs, and standardized codecs including G.711, G.728, G.723.1, and G.729. These techniques aim to balance voice quality and bandwidth usage by modeling the vocal tract and compressing excitation signal parameters.

Uploaded by

Ramesh L


New Speech Coding Techniques

Mr. L.Ramesh
AP/ECE
Introduction

Efficient speech-coding techniques

Advantages for VoIP
Digital streams of ones and zeros
The lower the bandwidth, the lower the quality
RTP payload types
Processing power
Better quality for a given bandwidth requires a more complex algorithm
A balance between quality and cost
Voice Quality

Bandwidth is easily quantified

Voice quality is subjective
MOS, Mean Opinion Score
ITU-T Recommendation P.800
 Excellent – 5
 Good – 4
 Fair – 3
 Poor – 2
 Bad – 1
A minimum of 30 people
Listeners rate voice samples or take part in conversations
 P.800 recommendations
 The selection of participants
 The test environment
 Explanations to listeners
 Analysis of results
 Toll quality
 A MOS of 4.0 or higher
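
The MOS arithmetic described above is just an average of listener ratings on the five-point scale. A minimal sketch, assuming made-up ratings from 30 listeners (the function name and the data are illustrative, not taken from P.800):

```python
from statistics import mean

# Five-point opinion scale as listed on the slide.
SCALE = {5: "Excellent", 4: "Good", 3: "Fair", 2: "Poor", 1: "Bad"}

def mean_opinion_score(ratings):
    """Average the listeners' 1-5 ratings into a MOS."""
    if any(r not in SCALE for r in ratings):
        raise ValueError("ratings must be integers from 1 to 5")
    return mean(ratings)

# Hypothetical ratings from 30 listeners (illustration data only).
ratings = [5] * 8 + [4] * 16 + [3] * 6
mos = mean_opinion_score(ratings)
is_toll_quality = mos >= 4.0   # toll quality threshold from the slide
print(f"MOS = {mos:.2f}, toll quality: {is_toll_quality}")
```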
About Speech

Speech
Air pushed from the lungs past the vocal
cords and along the vocal tract
The basic vibrations – vocal cords
The sound is altered by the disposition of the vocal tract (tongue and mouth)
Model the vocal tract as a filter
The shape changes relatively slowly
The vibrations at the vocal cords
The excitation signal
Speech sounds
Voiced sound
 The vocal cords vibrate, opening and closing
 Quasi-periodic pulses of air
 The rate of opening and closing determines the pitch
Unvoiced sounds
 Forcing air at high velocities through a constriction
 Noise-like turbulence
 Show little long-term periodicity
 Short-term correlations still present
Plosive sounds
 A complete closure in the vocal tract
 Air pressure is built up and released suddenly
Voice Sampling
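Discrete Time LTI Systems: The Convolution Sum

$$x[n] = \sum_{k=-\infty}^{\infty} x[k]\,\delta[n-k]$$

$$y[n] = \sum_{k=-\infty}^{\infty} x[k]\,h[n-k]$$

[Figure: stem plots of h[n] for n = 0, 1, 2 (amplitude label 1), x[n] for n = 0, 1 (amplitude labels 0.5 and 2) and the output y[n] for n = 0 to 3 (amplitude labels 0.5, 2.5 and 2)]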
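
The convolution sum can be checked numerically for finite sequences. A minimal sketch; the input values are an assumption, chosen to match what the amplitude labels of the slide's figure appear to show (x[0] = 0.5, x[1] = 2, h[n] = 1 for n = 0, 1, 2):

```python
def convolve(x, h):
    """Convolution sum y[n] = sum_k x[k] h[n-k] for finite sequences starting at n = 0."""
    y = [0.0] * (len(x) + len(h) - 1)
    for k, xk in enumerate(x):
        for m, hm in enumerate(h):
            y[k + m] += xk * hm   # x[k] h[m] contributes to y[k + m]
    return y

# Assumed example values (read off the figure's amplitude labels).
x = [0.5, 2.0]        # x[0], x[1]
h = [1.0, 1.0, 1.0]   # h[0], h[1], h[2]
y = convolve(x, h)
print(y)  # [0.5, 2.5, 2.5, 2.0]
```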
 Nyquist sampling theorem
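$$s(t) = \sum_{n=-\infty}^{\infty} \delta(t - nT)$$

$$x_s(t) = x_c(t)\,s(t) = x_c(t) \sum_{n=-\infty}^{\infty} \delta(t - nT)$$

$$S(j\Omega) = \frac{2\pi}{T} \sum_{k=-\infty}^{\infty} \delta(\Omega - k\Omega_s), \qquad \Omega_s = \frac{2\pi}{T}$$

[Figure: spectrum $X_c(j\Omega)$, band-limited to $-\Omega_N \le \Omega \le \Omega_N$, and the sampled spectrum $X_s(j\Omega)$ with copies centered at multiples of $\Omega_s$; the first copy begins at $\Omega_s - \Omega_N$]

The copies do not overlap when $\Omega_s - \Omega_N > \Omega_N$, that is, $\Omega_s > 2\Omega_N$.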
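
What happens when the sampling rate is too low can be shown with two tones. A small sketch, assuming the 8 kHz telephony sampling rate used later in the deck (the tone frequencies are illustrative): a 5 kHz tone lies above half the sampling rate and produces exactly the same samples as a 3 kHz tone, so the two cannot be told apart after sampling.

```python
import math

def sample_tone(freq_hz, fs_hz, n_samples):
    """Samples of cos(2*pi*f*t) taken at t = n / fs."""
    return [math.cos(2 * math.pi * freq_hz * n / fs_hz) for n in range(n_samples)]

fs = 8000.0                              # sampling rate; half of it is 4 kHz
below = sample_tone(3000.0, fs, 16)      # below fs/2: represented correctly
above = sample_tone(5000.0, fs, 16)      # above fs/2: aliases onto 3 kHz

max_diff = max(abs(a - b) for a, b in zip(below, above))
print(f"largest difference between the two sample sets: {max_diff:.2e}")
```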
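Quantization (Scalar Quantization)

[Figure: the axis from $m_0 = -A$ to $m_L = A$ with boundaries $m_1, m_2, \ldots, m_k, m_{k+1}, \ldots, m_{L-1}$; the interval $J_{k+1}$ lies between $m_k$ and $m_{k+1}$ and is represented by $v_{k+1}$]

· Assume $|x[n]| \le A$

divide the range $[-A, A]$ into $L$ quantization levels

$\{J_1, J_2, \ldots, J_k, \ldots, J_L\}$, with $J_k : [m_{k-1}, m_k]$ and $L = 2^R$ ($R$ bits per sample)

each quantization level $J_k$ is represented by a value $v_k$

$S = \bigcup_k J_k$, $V = \{v_1, v_2, \ldots, v_k, \ldots, v_L\}$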
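
A uniform scalar quantizer with L = 2^R intervals on [-A, A] can be sketched as follows. The slide does not say how each representative value is chosen; taking the midpoint of the interval is an assumption made here, and the function names are illustrative.

```python
def make_uniform_quantizer(A, R):
    """Boundaries m_0..m_L and values v_1..v_L for L = 2**R equal intervals on [-A, A]."""
    L = 2 ** R
    step = 2 * A / L
    m = [-A + k * step for k in range(L + 1)]
    v = [(m[k] + m[k + 1]) / 2 for k in range(L)]   # midpoint rule (assumption)
    return m, v

def quantize(x, A, R):
    """Return (0-based interval index, representative value) for a sample x."""
    L = 2 ** R
    step = 2 * A / L
    x = max(-A, min(A, x))                 # enforce the assumption |x| <= A
    k = min(int((x + A) / step), L - 1)    # x == A belongs to the last interval
    _, v = make_uniform_quantizer(A, R)
    return k, v[k]

m, v = make_uniform_quantizer(A=1.0, R=2)
print(m)                            # [-1.0, -0.5, 0.0, 0.5, 1.0]
print(v)                            # [-0.75, -0.25, 0.25, 0.75]
print(quantize(0.3, A=1.0, R=2))    # (2, 0.25)
```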
Non-Uniform Quantization

[Figure: the axis from $m_0 = -A$ through 0 to $m_L = A$, with non-uniformly spaced boundaries]

Concept: small quantization levels for small x

large quantization levels for large x

Goal: constant $SNR_Q$ for all x

Companding

[Figure: block diagram. $x[n]$ → compressor $F(x)$ → uniform quantization → bit stream …1101… → uniform decoder → expandor $F^{-1}(x)$ → $\hat{x}[n]$]

Compressor + Expandor → Compandor

F(x) specifies the non-uniform quantization characteristics
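
The compressor, uniform quantizer and expandor chain can be sketched in a few lines. This sketch takes F(x) to be the μ-law curve with μ = 255, one common choice; the 8-bit mid-rise quantizer and the function names are assumptions for illustration. The loop compares the error of plain uniform quantization with the companded path for small and large inputs.

```python
import math

MU = 255.0

def compress(x):
    """mu-law compressor F(x) for -1 <= x <= 1, applied symmetrically to the sign."""
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)

def expand(y):
    """Expandor: the inverse of compress()."""
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)

def uniform_round(y, bits):
    """Uniform mid-rise quantizer on [-1, 1] with 2**bits levels (assumed design)."""
    levels = 2 ** bits
    step = 2.0 / levels
    k = min(int((y + 1.0) / step), levels - 1)
    return -1.0 + (k + 0.5) * step

def compand(x, bits=8):
    """Compress, quantize uniformly, then expand."""
    return expand(uniform_round(compress(x), bits))

for x in (0.01, 0.1, 0.9):
    plain = abs(uniform_round(x, 8) - x)
    companded = abs(compand(x, 8) - x)
    print(f"x = {x}: uniform error {plain:.5f}, companded error {companded:.5f}")
```

For the small input the companded error is the smaller of the two, which is the point of spending finer levels on small x.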
Non-Uniform Quantization

μ-law

$$F(x) = \frac{\log(1 + \mu x)}{\log(1 + \mu)}, \quad 0 \le x \le 1$$

A-law

$$F(x) = \begin{cases} \dfrac{Ax}{1 + \ln A}, & 0 \le x \le \dfrac{1}{A} \\[1ex] \dfrac{1 + \ln(Ax)}{1 + \ln A}, & \dfrac{1}{A} \le x \le 1 \end{cases}$$

Typical values in practice

μ = 255, A = 87.6
Types of Speech Codecs
Waveform codecs, source codecs (also known as vocoders), and hybrid codecs.
Speech Source Model and Source Coding

[Figure: block diagram. A random sequence generator (unvoiced) or a periodic pulse train generator with pitch N (voiced) is selected by the v/u switch and scaled by the gain G to form the excitation u[n]; u[n] drives the vocal tract model G(z), G(ω), g[n], whose output is x[n]]

Vocal Tract Model

$$G(z) = \frac{1}{1 - \sum_{k=1}^{P} a_k z^{-k}}$$

Excitation parameters (excitation signal u[n])
v/u : voiced/unvoiced
N : pitch for voiced
G : signal gain

Vocal Tract parameters
{a_k} : LPC coefficients
formant structure of speech signals

A good approximation, though not precise enough
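
The source model amounts to an excitation generator feeding an all-pole filter, x[n] = G·u[n] + Σ a_k·x[n-k]. A minimal sketch; the second-order coefficients, the pitch period of 80 samples and the gains are hypothetical values chosen only so the filter is stable.

```python
import random

def excitation(voiced, n_samples, pitch_period=80, seed=0):
    """u[n]: unit pulse train with period N if voiced, else white Gaussian noise."""
    if voiced:
        return [1.0 if n % pitch_period == 0 else 0.0 for n in range(n_samples)]
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n_samples)]

def vocal_tract(u, a, gain):
    """All-pole filter: x[n] = G*u[n] + sum_{k=1..P} a_k * x[n-k], zero initial state."""
    x = []
    for n, un in enumerate(u):
        acc = gain * un
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                acc += ak * x[n - k]
        x.append(acc)
    return x

# Hypothetical 2nd-order coefficients (poles inside the unit circle), illustration only.
a = [1.3, -0.8]
voiced = vocal_tract(excitation(True, 160), a, gain=1.0)
unvoiced = vocal_tract(excitation(False, 160), a, gain=0.1)
print([round(s, 3) for s in voiced[:4]])
```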
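LPC Vocoder (Voice Coder)

[Figure: transmitter. x[n] → LPC analysis → {a_k}, N, G, v/u → encoder → bit stream …11011…]

N by pitch detection
v/u by voicing detection

[Figure: receiver. Bit stream …11011… → decoder → {a_k}, N, G, v/u; N, G and v/u drive the excitation generator (Ex), whose output passes through g[n], G(z) to give x[n]]

{a_k} can be non-uniform or vector quantized to reduce bit rate further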
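
The slides do not say how the LPC analysis block finds {a_k}. One standard choice, assumed here, is the autocorrelation method solved with the Levinson-Durbin recursion. The sketch checks itself on the impulse response of a known all-pole filter (hypothetical coefficients 1.3 and -0.8), whose coefficients the analysis should recover.

```python
def autocorrelation(x, max_lag):
    """r[lag] = sum_n x[n] x[n-lag] for lag = 0..max_lag."""
    return [sum(x[n] * x[n - lag] for n in range(lag, len(x)))
            for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve for a_1..a_P in the predictor x[n] ~ sum_k a_k x[n-k].
    Returns (coefficients, residual prediction error energy)."""
    a = [0.0] * (order + 1)            # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err                  # reflection coefficient of stage i
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= (1.0 - k * k)
    return a[1:], err

# Test signal: impulse response of a known all-pole filter (hypothetical values).
true_a = [1.3, -0.8]
x = []
for n in range(400):
    s = 1.0 if n == 0 else 0.0
    s += sum(ak * x[n - k] for k, ak in enumerate(true_a, start=1) if n - k >= 0)
    x.append(s)

est_a, err = levinson_durbin(autocorrelation(x, 2), 2)
print([round(c, 4) for c in est_a], round(err, 4))
```

The residual energy returned alongside the coefficients is what a vocoder would typically use to derive the gain G.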
G.711

 The most commonplace codec

 Used in circuit-switched telephone network
 PCM, Pulse-Code Modulation
 If uniform quantization
 12 bits * 8 k/sec = 96 kbps
 Non-uniform quantization
 8 bits * 8 k/sec = 64 kbps DS0 rate
 μ-law: North America
 A-law: other countries, a little friendlier to lower signal levels
 An MOS of about 4.3
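
The bit-rate arithmetic is bits per sample times the 8 kHz sampling rate. A small sketch (the function name is illustrative); G.711 uses 8 bits per sample, which gives the 64 kbps DS0 rate, against 96 kbps for 12-bit uniform quantization.

```python
SAMPLE_RATE_HZ = 8000

def bit_rate_kbps(bits_per_sample, sample_rate_hz=SAMPLE_RATE_HZ):
    """PCM bit rate in kbps."""
    return bits_per_sample * sample_rate_hz / 1000

uniform_kbps = bit_rate_kbps(12)   # 12-bit uniform PCM
g711_kbps = bit_rate_kbps(8)       # 8-bit companded PCM
print(uniform_kbps, g711_kbps)     # 96.0 64.0
```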
ADPCM (Adaptive Differential PCM)
DPCM and ADPCM
ADPCM: adaptive prediction and adaptive quantization in DPCM
Adaptive Quantization
 Quantization level (step size) $\Delta$ varies with local signal level
 $\Delta[n] = a\,\sigma_x[n]$
 $\sigma_x[n]$ : locally estimated standard deviation of x[n]

G.721: ADPCM-coded speech at 32 kbps

G.726 (A-law or μ-law)
16, 24, 32, 40 kbps
MOS 4.0 at 32 kbps
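
The idea of coding differences with a step size that follows the local signal level can be shown with a toy coder. This is a sketch under loud assumptions and not the G.721 or G.726 algorithm: the predictor is simply the previous reconstructed sample (fixed, not adaptive), and the step-size rule in `_adapt` is invented for illustration.

```python
import math

def _adapt(step, code, max_code):
    """Assumed adaptation rule (not the G.726 one): grow the step after a large
    code, shrink it after a small one, and keep it within fixed limits."""
    factor = 1.5 if abs(code) > max_code // 2 else 0.9
    return min(0.5, max(0.001, step * factor))

def adpcm_encode(samples, bits=4, step0=0.02):
    max_code = 2 ** (bits - 1) - 1
    step, pred, codes = step0, 0.0, []
    for s in samples:
        code = max(-max_code, min(max_code, round((s - pred) / step)))
        codes.append(code)
        pred += code * step            # same reconstruction the decoder will form
        step = _adapt(step, code, max_code)
    return codes

def adpcm_decode(codes, bits=4, step0=0.02):
    max_code = 2 ** (bits - 1) - 1
    step, pred, out = step0, 0.0, []
    for code in codes:
        pred += code * step
        out.append(pred)
        step = _adapt(step, code, max_code)
    return out

# 200 Hz test tone sampled at 8 kHz (illustration data).
samples = [0.5 * math.sin(2 * math.pi * 200 * n / 8000) for n in range(400)]
codes = adpcm_encode(samples)
decoded = adpcm_decode(codes)
mse = sum((s - d) ** 2 for s, d in zip(samples, decoded)) / len(samples)
power = sum(s * s for s in samples) / len(samples)
print(f"4 bits/sample at 8 kHz = {4 * 8000 // 1000} kbps, error/signal power = {mse / power:.4f}")
```

The decoder repeats the encoder's step-size updates from the codes alone, so no step-size information has to be transmitted.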
Analysis-by-Synthesis (AbS)
Codecs
 Hybrid codec
Fill the gap between waveform and source
codecs
The most successful and commonly used
 Time-domain AbS codecs
 Not a simple two-state, voiced/unvoiced
 Different excitation signals are attempted
 Closest to the original waveform is selected
 MPE, Multi-Pulse Excited
 RPE, Regular-Pulse Excited
 CELP, Code-Excited Linear Predictive
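
The analysis-by-synthesis loop can be sketched as an exhaustive search: each candidate excitation is passed through the synthesis filter and the one whose output is closest to the original is kept. The codebook of 64 random vectors, the first-order filter and the plain squared-error measure are all assumptions for illustration; real codecs weight the error perceptually.

```python
import random

def synthesize(excitation, a):
    """All-pole synthesis filter: y[n] = e[n] + sum_k a_k * y[n-k], zero initial state."""
    y = []
    for n, e in enumerate(excitation):
        y.append(e + sum(ak * y[n - k] for k, ak in enumerate(a, start=1) if n - k >= 0))
    return y

def abs_search(target, codebook, a):
    """Try every codebook vector with its least-squares gain and return
    (index, gain, squared error) of the closest synthesized candidate."""
    best = None
    for index, vector in enumerate(codebook):
        synth = synthesize(vector, a)
        energy = sum(s * s for s in synth)
        gain = sum(t * s for t, s in zip(target, synth)) / energy if energy else 0.0
        error = sum((t - gain * s) ** 2 for t, s in zip(target, synth))
        if best is None or error < best[2]:
            best = (index, gain, error)
    return best

rng = random.Random(1)
codebook = [[rng.gauss(0.0, 1.0) for _ in range(5)] for _ in range(64)]  # hypothetical
a = [0.9]                                                               # hypothetical filter
target = [2.0 * s for s in synthesize(codebook[17], a)]   # built from entry 17, gain 2
index, gain, error = abs_search(target, codebook, a)
print(index, round(gain, 6))
```

Because the target was built from entry 17 with gain 2, the search should return that index and gain; the decoder then only needs the index, the gain and the filter coefficients.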
G.728 LD-CELP
 CELP codecs
 A filter; its characteristics change over time
 A codebook of acoustic vectors
 A vector = a set of elements representing various characteristics of the excitation
 Transmit
 Filter coefficients, gain, a pointer to the vector
chosen
 Low Delay CELP
 Backward-adaptive coder
 Use previous samples to determine filter coefficients
 Operates on five samples at a time
 Delay < 1 ms
 Only the pointer is transmitted
 1024 vectors in the code book
 10-bit pointer (index)
 16 kbps
 LD-CELP encoder
 Minimize a frequency-weighted mean-square error
 LD-CELP decoder

 An MOS score of about 3.9

 One-quarter of G.711 bandwidth
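
The G.728 figures above follow from one another: a 1024-entry codebook needs a 10-bit index, and one index per five samples at 8 kHz gives 16 kbps. A short check, assuming the 8 kHz sampling rate and the 64 kbps G.711 rate (variable names are illustrative):

```python
SAMPLE_RATE_HZ = 8000
VECTOR_SAMPLES = 5
CODEBOOK_SIZE = 1024
G711_KBPS = 64.0

index_bits = (CODEBOOK_SIZE - 1).bit_length()                         # bits per pointer
vector_ms = 1000 * VECTOR_SAMPLES / SAMPLE_RATE_HZ                    # speech per pointer
bit_rate_kbps = index_bits * SAMPLE_RATE_HZ / VECTOR_SAMPLES / 1000   # pointers only

print(index_bits, vector_ms, bit_rate_kbps, bit_rate_kbps / G711_KBPS)
# 10 0.625 16.0 0.25
```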
G.723.1 ACELP
 6.3 or 5.3 kbps
 Both mandatory
 Can change from one to another during a conversation
 The coder
 A band-limited input speech signal
 Sampled at 8 kHz, 16-bit uniform PCM quantization
 Operates on blocks of 240 samples at a time
 A look-ahead of 7.5 ms
 A total algorithmic delay of 37.5 ms + other delays
 A high-pass filter to remove any DC component
 G.723.1 Annex A
 Silence Insertion Descriptor (SID) frames of size four octets
 The two lsbs of the first octet indicate the frame type

 Bits  Frame type  Octets/frame
 00    6.3 kbps    24
 01    5.3 kbps    20
 10    SID frame   4

 An MOS of about 3.8
 At least 37.5 ms delay
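
A receiver can tell the three frame types apart from the two least significant bits of the first octet. A minimal sketch of that lookup, using only the values listed above; the function names are illustrative, and any other bit pattern is simply rejected here.

```python
FRAME_MS = 30.0   # 240 samples at 8 kHz

# Two lsbs of the first octet -> (frame type, frame size in octets).
FRAME_TYPES = {
    0b00: ("6.3 kbps", 24),
    0b01: ("5.3 kbps", 20),
    0b10: ("SID frame", 4),
}

def frame_info(first_octet):
    """Decode the frame type from the two least significant bits."""
    code = first_octet & 0b11
    if code not in FRAME_TYPES:
        raise ValueError(f"frame type bits {code:02b} are not listed on the slide")
    return FRAME_TYPES[code]

def packed_rate_kbps(octets, frame_ms=FRAME_MS):
    """Bit rate of the packed frames, in kbps."""
    return octets * 8 / frame_ms

print(frame_info(0b10110110))   # ('SID frame', 4)
print(packed_rate_kbps(24), round(packed_rate_kbps(20), 2))
```

The packed rates come out slightly above the nominal 6.3 and 5.3 kbps, presumably because the whole-octet frames also carry the frame-type bits.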
G.729
 8 kbps
 Input frames of 10 ms, 80 samples for 8 kHz sampling rate
 5 ms look-ahead
 Algorithmic delay of 15 ms
 An 80-bit frame for 10 ms of speech
 A complex codec
 G.729.A (Annex A), a number of simplifications
 Same frame structure
 Encoder and decoder interoperate between G.729 and G.729.A
 Slightly lower quality
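
The delay and rate figures quoted for the frame-based codecs follow from the frame size and look-ahead. A short check, assuming 8 kHz sampling throughout (the helper names are illustrative):

```python
def algorithmic_delay_ms(frame_samples, lookahead_ms, sample_rate_hz=8000):
    """Frame duration plus look-ahead, in milliseconds."""
    return 1000 * frame_samples / sample_rate_hz + lookahead_ms

def bit_rate_kbps(bits_per_frame, frame_samples, sample_rate_hz=8000):
    """Bit rate for a codec that emits a fixed number of bits per frame."""
    return bits_per_frame * sample_rate_hz / frame_samples / 1000

g729_delay = algorithmic_delay_ms(80, 5.0)      # 10 ms frame + 5 ms look-ahead
g729_rate = bit_rate_kbps(80, 80)               # 80 bits per 10 ms frame
g7231_delay = algorithmic_delay_ms(240, 7.5)    # 30 ms frame + 7.5 ms look-ahead
print(g729_delay, g729_rate, g7231_delay)       # 15.0 8.0 37.5
```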
