S H Li Speech Analysis

The document discusses various techniques for analyzing speech sounds, including waveform analysis, spectrograms, and linear prediction coding (LPC). Waveform analysis looks at changes in sound intensity over time. Spectrograms examine dynamic changes in a speech spectrum and are useful for segmenting phonemes. LPC separates resonant vocal tract characteristics from sound source characteristics to identify formant peaks representing resonances. Examples of applying these techniques to analyze vowels, stops, affricates and fricatives are shown through various figures.

Uploaded by

ilasundaram


Speech Analysis

What is Speech Analysis?

Analysis of speech sounds, taking into consideration their method of production.
The level of processing between the digitised acoustic waveform and the acoustic feature vectors.
The extraction of "interesting" information as an acoustic vector.

Speech Waveforms
A waveform is a two-dimensional representation of a sound. The two dimensions in a waveform display are time and intensity: the horizontal dimension is time and the vertical dimension is intensity. Waveforms are known as time-domain representations of sound because they display changes in intensity over time. The intensity dimension actually displays sound pressure, a measure of the tiny variations in air pressure that we are able to perceive as sound. Intensity in these waveforms is a simple linear scaling of sound pressure (not dB).
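The distinction between linear sound pressure and dB can be illustrated with a small sketch. It assumes the samples are already calibrated in pascals and uses the conventional 20 micropascal reference for dB SPL; neither the calibration nor the function names come from the slides, and an uncalibrated waveform would only give relative levels.

```python
import math

# Assumption: conventional reference pressure for dB SPL in air (20 micropascals).
P_REF = 20e-6

def rms(samples):
    """Root-mean-square value of a list of linear pressure samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def pressure_to_db_spl(p_rms):
    """Convert an RMS sound pressure in pascals to a level in dB SPL."""
    return 20.0 * math.log10(p_rms / P_REF)

# Doubling the linear pressure adds about 6 dB, whatever the starting level.
quiet = pressure_to_db_spl(0.01)
loud = pressure_to_db_spl(0.02)
print(round(loud - quiet, 2))
```

A waveform display plots the linear samples directly, whereas the intensity values reported in dB elsewhere in these slides are on the logarithmic scale.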

Resonances and Formants
Resonances are vibratory characteristics of a resonating body. In the case of an air-filled tube, the resonance characteristics exist even when no sound is being produced. When we produce vowel sounds, the resonances of the vocal tract selectively enhance sound vibrations close to the resonance frequencies and selectively attenuate sound vibrations remote from the resonance frequencies. This results in peaks in the acoustic spectrum of the resulting speech sound. These acoustic spectral peaks are called formants, particularly when they occur in vowels and vowel-like consonants.
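As a rough illustration of tube resonances, a common textbook simplification (not taken from these slides) treats the vocal tract as a uniform tube closed at the glottis and open at the lips, so that resonances fall at odd multiples of c/4L. The tube length of 17.5 cm and the speed of sound of 350 m/s are assumed values chosen only for the example.

```python
def tube_resonances(length_m, n_resonances=3, speed_of_sound=350.0):
    """Resonances (Hz) of a uniform tube closed at one end and open at the other.

    Assumption: idealised quarter-wavelength model, F_n = (2n - 1) * c / (4 * L).
    """
    return [(2 * n - 1) * speed_of_sound / (4.0 * length_m)
            for n in range(1, n_resonances + 1)]

# An assumed 17.5 cm tube gives resonances near 500, 1500 and 2500 Hz.
print([round(f) for f in tube_resonances(0.175)])
```

Real vowels depart from these values because the vocal tract is not uniform; changing its shape moves the resonances, and with them the formant peaks.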

Spectrograms

Spectrograms permit the examination of the dynamic changes in a speech spectrum.
This is particularly useful for the examination of rapidly changing consonants (e.g. stop bursts) and also for vowel transitions (between vowels and consonants and between the targets in diphthongs).
Spectrograms, usually in conjunction with waveforms, are essential during the segmenting and labeling of speech.
Spectrograms usually provide the clearest visual cues to the boundaries between phonemes.
Spectrograms do not, however, provide accurate measurements of vowel formants, as broadband spectrograms have a poor frequency resolution (about 300 Hz), so there is a high degree of intrinsic error in formant measurements taken visually from spectrograms.
That is why we tend to use FFTs and LPCs for the accurate measurement of formant frequencies.
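The trade-off behind the poor frequency resolution of broadband spectrograms can be sketched with a small short-time Fourier analysis. This is a minimal pure-Python illustration, not the tool used for the figures: the 3 ms and 30 ms window lengths, the Hann window, and all function names are assumptions, and the exact analysis bandwidth of a real spectrogram depends on the window shape.

```python
import cmath
import math

def hann(n):
    """Symmetric Hann window of n samples."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]

def frame_spectrum(frame):
    """Magnitude of a direct DFT for bins 0 .. N/2 (slow, but dependency-free)."""
    n = len(frame)
    return [abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n)))
            for k in range(n // 2 + 1)]

def spectrogram(signal, fs, window_s, hop_s):
    """Return (frames, bin_spacing_hz); each frame is one magnitude spectrum."""
    win_len = int(round(window_s * fs))
    hop = max(1, int(round(hop_s * fs)))
    window = hann(win_len)
    frames = []
    for start in range(0, len(signal) - win_len + 1, hop):
        frame = [signal[start + i] * window[i] for i in range(win_len)]
        frames.append(frame_spectrum(frame))
    return frames, fs / win_len

fs = 8000
tone = [math.sin(2 * math.pi * 1000 * n / fs) for n in range(800)]

# Short window: fine time resolution, coarse frequency bins (broadband).
_, broad_spacing = spectrogram(tone, fs, window_s=0.003, hop_s=0.0015)
# Long window: coarse time resolution, fine frequency bins (narrowband).
_, narrow_spacing = spectrogram(tone, fs, window_s=0.03, hop_s=0.015)
print(round(broad_spacing, 1), round(narrow_spacing, 1))
```

With the short window the frequency bins are several hundred hertz apart, which is the same order as the 300 Hz resolution quoted above; the long window resolves frequency far better but smears rapid events such as stop bursts.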

Figure: waveform and broadband spectrogram of the word "heard"

Figure: a narrowband spectrogram of the word "heard"

Figure: This is a broadband spectrogram of the word "hide" with the formant tracks for formants 1 to 5 superimposed over it.

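Figure: waveform of "aam" (time axis in seconds, about 0.014 to 0.490), with segment labels aag, aa1, aa2 and aam.

Figure: waveform of "aayvu" (amplitude -1 to 1; time axis 0 to 0.8455 s), segmented as g, aa, ay, y, yv, v, vu, u. Segment durations marked on the figure: 0.18, 0.2, 0.1, 0.07, 0.04, 0.07, 0.19 s.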

Word/segment  Duration (s)  Intensity (dB)  Pitch (Hz)  F1 (Hz)  F2 (Hz)  F3 (Hz)  F4 (Hz)
aayvu         0.77          80.4            160.2       540.7    1484.6   3750.3   3750.2
g             0.19          62.4            128         900.78   1853.0   2899.3   4078.2
aa            0.2           81.3            137.1       810.4    1181.6   2865.5   3792.2
ay            0.1           84.0            171.1       654.07   1755.3   2599.9   3753.5
y             0.07          80.5            179         362.1    2275.9   2570.3   3878.4
yv            0.04          78.7            174.5       349.3    1928.6   2365.0   3876.5
v             0.07          73.4            162.2       348.7    1154.98  2418.4   3636.0
vu            0.06          78.2            166.5       3636.0   1147.2   2570.8   3568.2
u             0.2           77.8            167.2       387.36   1488.5   2611.5   3693.2

Figures: LPC spectra of the segments of "aayvu" (sound pressure level in dB/Hz against frequency, 0 to 5500 Hz), with the labelled spectral peaks:

LPC of aa in aayvu: 886.4, 1212.5, 2916.7, 3754.0, 4813.6 Hz
LPC of ay in aayvu: 671.6, 1694.1, 2272.1, 3679.9 Hz
LPC of y in aayvu: 352.9, 2323.9, 3939.3, 4939.6 Hz
LPC of v in aayvu: 323.3, 1190.2, 2346.2, 3613.2 Hz
LPC of vu in aayvu: 360.3, 1108.7, 2612.9, 3583.6 Hz
LPC of u in aayvu: 397.4, 1486.3, 2590.7, 3583.6 Hz

Linear Prediction Coefficient (LPC)

Linear Prediction Coefficient (LPC) analysis attempts to predict the poles (related to resonances or formants) that, when combined with the speech source spectrum (the "residual" in LPC analysis), would result in the original waveform.

An LPC analysis separates the analysis of the resonant characteristics of a speech sound from the source characteristics of that sound.

The resulting LPC spectrum is a smoothed spectrum whose peaks represent the formants (resulting from the vocal tract resonances) in the spectrum of a vowel or vowel-like consonant.
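The idea of recovering resonances from a smoothed LPC spectrum can be sketched as follows. This is a minimal illustration under stated assumptions, not the analysis used for the figures: it uses the autocorrelation method with the Levinson-Durbin recursion, a synthetic "vowel" built from two assumed resonances (500 Hz and 1500 Hz) excited by a single impulse, and hypothetical function names throughout.

```python
import cmath
import math

def resonator(freq_hz, fs, radius):
    """Coefficients of 1 - 2r*cos(theta) z^-1 + r^2 z^-2 for one resonance."""
    theta = 2 * math.pi * freq_hz / fs
    return [1.0, -2 * radius * math.cos(theta), radius * radius]

def poly_mul(p, q):
    """Multiply two polynomials in z^-1 (cascades two resonators)."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, pv in enumerate(p):
        for j, qv in enumerate(q):
            out[i + j] += pv * qv
    return out

def all_pole_filter(a, excitation):
    """Pass the source through 1/A(z): y[n] = x[n] - sum_j a[j] * y[n-j]."""
    y = []
    for n, x in enumerate(excitation):
        y.append(x - sum(a[j] * y[n - j]
                         for j in range(1, len(a)) if n - j >= 0))
    return y

def autocorrelation(x, max_lag):
    return [sum(x[i] * x[i + k] for i in range(len(x) - k))
            for k in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve the LPC normal equations; returns A(z) coefficients with a[0] = 1."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        k = -(r[i] + sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a

def lpc_spectrum_db(a, fs, freqs_hz):
    """Smoothed LPC envelope, -20*log10|A(e^jw)|, with the gain term ignored."""
    out = []
    for f in freqs_hz:
        w = 2 * math.pi * f / fs
        big_a = sum(a[k] * cmath.exp(-1j * w * k) for k in range(len(a)))
        out.append(-20 * math.log10(abs(big_a)))
    return out

def spectral_peaks(spectrum, freqs_hz):
    """Frequencies of the interior local maxima of the envelope."""
    return [freqs_hz[i] for i in range(1, len(spectrum) - 1)
            if spectrum[i] > spectrum[i - 1] and spectrum[i] >= spectrum[i + 1]]

def estimate_formants(signal, fs, order, step_hz=5):
    a = levinson_durbin(autocorrelation(signal, order), order)
    freqs = [k * step_hz for k in range(int(fs / 2 / step_hz) + 1)]
    return a, spectral_peaks(lpc_spectrum_db(a, fs, freqs), freqs)

fs = 8000
true_a = poly_mul(resonator(500, fs, 0.97), resonator(1500, fs, 0.97))
vowel_like = all_pole_filter(true_a, [1.0] + [0.0] * 799)
_, formants = estimate_formants(vowel_like, fs, order=4)
print(formants)  # two peaks, close to the assumed 500 Hz and 1500 Hz resonances
```

The filter coefficients describe the resonant characteristics, while whatever the predictor cannot explain (here just the single impulse) plays the role of the source. Real speech needs a higher order, windowed frames and usually pre-emphasis, none of which are shown here.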

Figure: This is an LPC analysis of the vowel in "heard". Note the smooth spectrum clearly showing the positions of the main spectral peaks (formants) of this vowel.

Figure: White noise used as a simplified model of a fricative sound source. Note the random pattern of both the waveform (bottom) and the spectrum (top). Also note that the spectral envelope (LPC spectrum in red) is approximately flat.

Identification of Speech Waveforms

Figure: Three long vowels in an /h_d/ context.

Figure: Three English voiceless oral stops in CV context.

Figure: Three English voiced oral stops in CV context.

Figure: The two English affricates in CV context.

Figure 9: Waveforms of two of the English voiceless fricatives in CV context.
