Lecture
Spring 2024
Lecture #10
Speech Recognition
A speech processing system has three main tasks:
Recording
Before an audio signal can be read from a file, it must first be
recorded with a microphone. (You can download the sample.wav file
from bulut.marmara.edu.tr)
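As a minimal sketch of reading a recorded signal from a file, the code
below uses only the Python standard library (wave, struct). It is one
possible approach, not necessarily the one used later in this lecture.
So that it runs without the download, it first writes a short stand-in
tone. The file name stand_in.wav and the helpers write_test_tone and
read_wav are hypothetical names chosen for illustration, and the sketch
assumes a mono 16-bit PCM file.

```python
import math
import os
import struct
import tempfile
import wave


def write_test_tone(path, sample_rate_hz=8000, tone_hz=440.0, duration_s=0.5):
    """Write a mono 16-bit PCM sine tone so read_wav has a file to open."""
    n_frames = int(round(sample_rate_hz * duration_s))
    frames = b"".join(
        struct.pack(
            "<h",
            int(round(32767 * 0.5 * math.sin(2 * math.pi * tone_hz * n / sample_rate_hz))),
        )
        for n in range(n_frames)
    )
    with wave.open(path, "wb") as wav_out:
        wav_out.setnchannels(1)       # mono
        wav_out.setsampwidth(2)       # 2 bytes = 16 bits per sample
        wav_out.setframerate(sample_rate_hz)
        wav_out.writeframes(frames)


def read_wav(path):
    """Return (sample_rate_hz, samples) for a mono 16-bit PCM WAV file."""
    with wave.open(path, "rb") as wav_in:
        # Assumption: mono, 16-bit. Other layouts need different unpacking.
        if wav_in.getnchannels() != 1 or wav_in.getsampwidth() != 2:
            raise ValueError("this sketch only handles mono 16-bit PCM")
        sample_rate_hz = wav_in.getframerate()
        n_frames = wav_in.getnframes()
        raw = wav_in.readframes(n_frames)
    # Each sample is a little-endian signed 16-bit integer.
    samples = list(struct.unpack("<%dh" % n_frames, raw))
    return sample_rate_hz, samples


# Stand-in file (hypothetical name) in a temporary directory.
wav_path = os.path.join(tempfile.mkdtemp(), "stand_in.wav")
write_test_tone(wav_path)
sample_rate_hz, samples = read_wav(wav_path)
print(sample_rate_hz, len(samples))
```

To try it on the downloaded recording, pass the path of sample.wav to
read_wav instead, provided that file is also mono 16-bit PCM.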
Sampling
When recording with a microphone, the signal is stored in digitized
form. To process it, the machine needs it as a sequence of discrete
numeric values. Hence, we sample the signal at a fixed frequency and
convert it into discrete numeric form. If the sampling frequency is
high enough, humans listening to the played-back signal perceive it
as a continuous audio signal.
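The idea of sampling at a fixed frequency can be sketched as follows.
This is an illustration only: the 440 Hz tone, the 8000 Hz sampling
rate, and the helper names sample_sine and quantize_int16 are
assumptions made for the example, not values from the lecture.

```python
import math


def sample_sine(tone_hz, sample_rate_hz, duration_s):
    """Evaluate a sine tone at evenly spaced instants n / sample_rate_hz."""
    n_samples = int(round(sample_rate_hz * duration_s))
    return [
        math.sin(2 * math.pi * tone_hz * n / sample_rate_hz)
        for n in range(n_samples)
    ]


def quantize_int16(samples):
    """Map floats in [-1.0, 1.0] to signed 16-bit integers (clipping outliers)."""
    return [int(round(max(-1.0, min(1.0, s)) * 32767)) for s in samples]


# Assumed example values: a 440 Hz tone sampled at 8000 Hz for 10 ms.
samples = sample_sine(tone_hz=440.0, sample_rate_hz=8000, duration_s=0.01)
pcm = quantize_int16(samples)
print(len(samples))
print(pcm[:5])
```

Raising sample_rate_hz places the sampling instants closer together, so
the list of numbers follows the original waveform more closely.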
Visualizing Audio Signals with Python
We show a stepwise approach, using Python, to analyzing an audio
signal that is stored in a file.