Digital Representation of Speech Waveform: Sampling of Speech Signals and Review of the Statistical Model for Speech
The document discusses the sampling of speech signals for digital representation. It states that speech is a continuous signal that must be sampled periodically and quantized to a finite set of values. It notes that speech is not inherently band-limited, although its spectrum falls off rapidly at high frequencies, so accurately representing all speech sounds requires a sampling rate greater than 20 kHz. The document also reviews the statistical model of speech, which represents speech as an ergodic random process; the power spectrum of the sampled signal is then an aliased version of the original. Finally, it discusses estimating the probability density, autocorrelation, and power spectrum of speech signals using amplitude histograms, gamma and Laplacian density approximations, and standard time-series analysis.
Sampling of Speech Signals
Review of the Statistical Model for Speech
SAMPLING OF SPEECH SIGNALS

Speech is a continuous function of a continuous time variable. It is sampled periodically in time, with sampling period T, to produce a sequence of samples x(nT). To obtain a digital representation, these sample values must also be quantized to a finite set of values.

Since we are concerned with the digital representation of speech signals, we need to consider the spectral properties of speech. According to steady-state models for the production of vowel and fricative sounds, speech signals are not inherently band-limited, although their spectra tend to fall off rapidly at high frequencies. Thus, accurately representing all speech sounds would require a sampling rate greater than 20 kHz.

REVIEW OF THE STATISTICAL MODEL FOR SPEECH

The speech waveform can be represented by an ergodic random process. Assume that the signal x(t) is a sample function of a continuous-time random process; then the sequence of samples obtained by sampling it can be thought of as a sample sequence of a discrete-time random process.
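The two steps described under SAMPLING OF SPEECH SIGNALS, periodic sampling followed by quantization to a finite set of values, can be sketched as follows. This is only an illustration under stated assumptions: the function names `sample_signal` and `quantize_uniform`, the uniform quantizer, the 8-bit word length, and the 200 Hz test tone standing in for speech are all hypothetical choices that do not come from the slides.

```python
import math

def sample_signal(x, fs, duration):
    """Return the samples x(nT), n = 0..N-1, with sampling period T = 1/fs."""
    T = 1.0 / fs
    n_samples = int(round(duration * fs))
    return [x(n * T) for n in range(n_samples)]

def quantize_uniform(samples, n_bits, x_max):
    """Map each sample to one of 2**n_bits uniformly spaced levels in [-x_max, x_max].

    Assumption: a uniform quantizer whose levels sit at the centres of the
    quantization steps; samples outside the range are clipped.
    """
    levels = 2 ** n_bits
    step = 2.0 * x_max / levels
    lo, hi = -(levels // 2), levels // 2 - 1
    out = []
    for s in samples:
        idx = min(hi, max(lo, math.floor(s / step)))  # clip to the valid range
        out.append((idx + 0.5) * step)
    return out

# Toy stand-in for a speech signal: a 200 Hz tone (not real speech).
tone = lambda t: 0.9 * math.sin(2.0 * math.pi * 200.0 * t)
samples = sample_signal(tone, fs=8000.0, duration=0.01)
quantized = quantize_uniform(samples, n_bits=8, x_max=1.0)
```

For inputs inside the quantizer range, each quantized value differs from its sample by at most half a quantization step, so the word length controls the size of the quantization error.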
The power spectrum of the sampled signal is an aliased version of the power spectrum of the original signal. Averages such as the mean and variance are the same for the samples as for the original signal.
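A small numerical check can make both statements concrete. In the sketch below, a 7 kHz cosine with a DC offset is sampled at 10 kHz (values chosen purely for illustration, not taken from the slides): the spectral peak of the samples appears at the aliased frequency of 3 kHz, while the mean and variance of the samples match those of the continuous signal (the offset and half the squared amplitude). The helper names are hypothetical, and the spectrum is computed with a direct DFT only to keep the example dependency-free.

```python
import cmath
import math

def dft_power(samples):
    """Return |X[k]|^2 / N for k = 0..N//2 (a simple periodogram)."""
    n = len(samples)
    power = []
    for k in range(n // 2 + 1):
        acc = sum(s * cmath.exp(-2j * math.pi * k * i / n)
                  for i, s in enumerate(samples))
        power.append(abs(acc) ** 2 / n)
    return power

def peak_frequency(samples, fs):
    """Frequency in Hz of the largest non-DC periodogram bin."""
    power = dft_power(samples)
    k_peak = max(range(1, len(power)), key=lambda k: power[k])
    return k_peak * fs / len(samples)

fs = 10_000.0      # sampling rate in Hz (assumed for the demo)
f0 = 7_000.0       # tone frequency, above fs / 2, so it must alias
dc, amp = 0.25, 1.0
n = 200            # a whole number of periods of the aliased tone

x = [dc + amp * math.cos(2.0 * math.pi * f0 * i / fs) for i in range(n)]

alias_hz = peak_frequency(x, fs)                        # peak lands at fs - f0
sample_mean = sum(x) / n                                # matches dc
sample_var = sum((v - sample_mean) ** 2 for v in x) / n  # matches amp**2 / 2
```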
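PROBABILITY DENSITY ESTIMATION

The probability density is estimated by computing a histogram of amplitudes over a large number of samples. A good approximation to the measured probability density is the gamma distribution, given as

$$p(x) = \left[\frac{\sqrt{3}}{8\pi\sigma_x |x|}\right]^{1/2} e^{-\frac{\sqrt{3}\,|x|}{2\sigma_x}}$$

A similar approximation is the Laplacian density

$$p(x) = \frac{1}{\sqrt{2}\,\sigma_x}\, e^{-\frac{\sqrt{2}\,|x|}{\sigma_x}}$$

where $\sigma_x$ is the standard deviation of the speech samples.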
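The histogram procedure can be sketched as below and compared against the two closed-form densities. Several things here are assumptions rather than content from the slides: the densities are written in their standard zero-mean forms parameterized by the standard deviation `sigma`, the function names are hypothetical, and, because no speech data accompanies the document, the samples are synthetic Laplacian-distributed values used only as a stand-in.

```python
import math
import random

def laplacian_pdf(x, sigma):
    """Zero-mean Laplacian density with standard deviation sigma."""
    return math.exp(-math.sqrt(2.0) * abs(x) / sigma) / (math.sqrt(2.0) * sigma)

def gamma_pdf(x, sigma):
    """Zero-mean gamma density with standard deviation sigma (singular at x = 0)."""
    ax = abs(x)
    scale = math.sqrt(math.sqrt(3.0) / (8.0 * math.pi * sigma * ax))
    return scale * math.exp(-math.sqrt(3.0) * ax / (2.0 * sigma))

def histogram_density(samples, lo, hi, n_bins):
    """Estimate the density as (count in bin) / (total samples * bin width)."""
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for s in samples:
        if lo <= s < hi:
            counts[min(n_bins - 1, int((s - lo) / width))] += 1
    centers = [lo + (i + 0.5) * width for i in range(n_bins)]
    density = [c / (len(samples) * width) for c in counts]
    return centers, density

# Synthetic stand-in for speech amplitudes: the difference of two exponential
# variates is Laplacian; scale b = sigma / sqrt(2) gives standard deviation sigma.
random.seed(0)
sigma = 1.0
b = sigma / math.sqrt(2.0)
samples = [random.expovariate(1.0 / b) - random.expovariate(1.0 / b)
           for _ in range(50_000)]

centers, density = histogram_density(samples, lo=-4.0, hi=4.0, n_bins=32)
laplacian_fit = [laplacian_pdf(c, sigma) for c in centers]
gamma_fit = [gamma_pdf(c, sigma) for c in centers]  # bin centres avoid x = 0
```

With real speech, `sigma` would be replaced by the standard deviation measured from the samples before evaluating either approximation.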
The autocorrelation function and power spectrum of speech signals can be estimated by standard time-series analysis methods. An estimate of the autocorrelation function can be obtained by computing the time-average autocorrelation function over a long segment of the signal. The power spectrum can be estimated in a variety of ways:
1. By measuring the average output of a set of bandpass filters.
2. By estimating the long-term average power spectrum.
3. By computing the power transfer function of a recursive digital filter.
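The time-average autocorrelation estimate and two of the power spectrum approaches listed above can be sketched as follows. The sketch rests on assumptions that are not in the slides: the long-term average spectrum is approximated by averaging periodograms of non-overlapping segments, the recursive digital filter is a hypothetical first-order one, H(z) = 1 / (1 - a z^-1), and the signal is synthetic white noise passed through that filter rather than real speech. All function names are illustrative.

```python
import cmath
import math
import random

def autocorrelation(x, max_lag):
    """Time-average estimate R[m] = (1/N) * sum_n x[n] * x[n+m], m = 0..max_lag."""
    n = len(x)
    return [sum(x[i] * x[i + m] for i in range(n - m)) / n
            for m in range(max_lag + 1)]

def averaged_periodogram(x, seg_len):
    """Average |DFT|^2 / seg_len over non-overlapping segments (bins 0..seg_len//2)."""
    n_bins = seg_len // 2 + 1
    twiddle = [[cmath.exp(-2j * math.pi * k * i / seg_len) for i in range(seg_len)]
               for k in range(n_bins)]
    n_segs = len(x) // seg_len
    avg = [0.0] * n_bins
    for s in range(n_segs):
        seg = x[s * seg_len:(s + 1) * seg_len]
        for k in range(n_bins):
            acc = sum(v * w for v, w in zip(seg, twiddle[k]))
            avg[k] += abs(acc) ** 2 / seg_len
    return [p / n_segs for p in avg]

def ar1_power_transfer(a, omega):
    """|H(e^{j omega})|^2 for the recursive filter H(z) = 1 / (1 - a z^-1)."""
    return 1.0 / abs(1.0 - a * cmath.exp(-1j * omega)) ** 2

# Synthetic stand-in for speech: unit-variance white noise through the filter.
random.seed(0)
a = 0.9
x, prev = [], 0.0
for _ in range(8192):
    prev = a * prev + random.gauss(0.0, 1.0)
    x.append(prev)

r = autocorrelation(x, max_lag=2)
rho1 = r[1] / r[0]                          # close to a for this model
psd = averaged_periodogram(x, seg_len=64)   # estimate at omega_k = 2*pi*k/64
model = [ar1_power_transfer(a, 2.0 * math.pi * k / 64) for k in range(len(psd))]
```

Comparing `psd` with `model` bin by bin shows how closely the averaged estimate follows the filter's power transfer function; both are large at low frequencies and small near half the sampling rate for this choice of `a`.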