0% found this document useful (0 votes)

112 views17 pages

Spoken Language Processing in Python Chapter1

This document introduces audio data processing in Python. It discusses different audio file formats like mp3 and wav, and how audio is measured in frequency (kHz). It then demonstrates how to open an audio file in Python, convert the soundwave bytes to integers, find the frame rate and timestamps. Finally, it shows how to visualize two sound waves on a single plot to compare them.

Uploaded by

Fgpeqw

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

112 views17 pages

Spoken Language Processing in Python Chapter1

Uploaded by

Fgpeqw

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Introduction to

audio data in Python

S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Dealing with audio les in Python
Different kinds all of audio les
mp3

wav

m4a

Digital sounds measured in frequency (kHz)

1 kHz = 1000 pieces of information per second

SPOKEN LANGUAGE PROCESSING IN PYTHON

Frequency examples
Streaming songs have a frequency of 32 kHz

Audiobooks and spoken language are between 8 and 16 kHz

We can't see audio les so we have to transform them rst

import wave

SPOKEN LANGUAGE PROCESSING IN PYTHON

Opening an audio le in Python
Audio le saved as good-morning.wav

# Import audio file as wave object

good_morning = wave.open("good-morning.wav", "r")

# Convert wave object to bytes

good_morning_soundwave = good_morning.readframes(-1)

# View the wav file in byte form

good_morning_soundwave

b'\xfd\xff\xfb\xff\xf8\xff\xf8\xff\xf7\...

SPOKEN LANGUAGE PROCESSING IN PYTHON

Working with audio is different
Have to convert the audio to something useful

Small sample of audio = large amount of information

SPOKEN LANGUAGE PROCESSING IN PYTHON

Let's practice!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON
Converting sound
wave bytes to
integers
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Converting bytes to integers
Can't use bytes

Convert bytes to integers using numpy

import numpy as np

# Convert soundwave_gm from bytes to integers

signal_gm = np.frombuffer(soundwave_gm, dtype='int16')

# Show the first 10 items

signal_gm[:10]

array([ -3, -5, -8, -8, -9, -13, -8, -10, -9, -11], dtype=int16)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Finding the frame rate
Frequency (Hz) = length of wave object array/duration of audio le (seconds)

# Get the frame rate

framerate_gm = good_morning.getframerate()

# Show the frame rate

framerate_gm

48,000

Duration of audio le (seconds) = length of wave object array/frequency (Hz)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Finding sound wave timestamps
# Return evenly spaced values between start and stop
np.linspace(start=1, stop=10, num=10)

array([ 1., 2., 3., 4., 5., 6., 7., 8., 9., 10.])

# Get the timestamps of the good morning sound wave

time_gm = np.linspace(start=0,
stop=len(soundwave_gm)/framerate_gm,
num=len(soundwave_gm))

SPOKEN LANGUAGE PROCESSING IN PYTHON

Finding sound wave timestamps
# View first 10 time stamps of good morning sound wave
time_gm[:10]

array([0.00000000e+00, 2.08334167e-05, 4.16668333e-05, 6.25002500e-05,

8.33336667e-05, 1.04167083e-04, 1.25000500e-04, 1.45833917e-04,
1.66667333e-04, 1.87500750e-04])

SPOKEN LANGUAGE PROCESSING IN PYTHON

Let's practice!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON
Visualizing sound
waves
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

Daniel Bourke
Machine Learning Engineer/YouTube
Creator
Adding another sound wave
New audio le: good_afternoon.wav

Both are 48 kHz

Same data transformations to all audio les

SPOKEN LANGUAGE PROCESSING IN PYTHON

Setting up a plot
import matplotlib.pyplot as plt

# Initialize figure and setup title

plt.title("Good Afternoon vs. Good Morning")

# x and y axis labels

plt.xlabel("Time (seconds)")
plt.ylabel("Amplitude")

# Add good morning and good afternoon values

plt.plot(time_ga, soundwave_ga, label ="Good Afternoon")
plt.plot(time_gm, soundwave_gm, label="Good Morning",
alpha=0.5)

# Create a legend and show our plot

plt.legend()
plt.show()

SPOKEN LANGUAGE PROCESSING IN PYTHON

SPOKEN LANGUAGE PROCESSING IN PYTHON
Time to visualize!
S P OK EN LAN GUAGE P ROCES S IN G IN P YTH ON

English: Quarter 2 Distinguishing Among Various Types of Viewing Materials
84% (31)
English: Quarter 2 Distinguishing Among Various Types of Viewing Materials
19 pages
Credit Risk Modeling in Python Chapter3
No ratings yet
Credit Risk Modeling in Python Chapter3
35 pages
Designing Machine Learning Workflows in Python Chapter2
No ratings yet
Designing Machine Learning Workflows in Python Chapter2
39 pages
Pro Tools For Breakfast: Get Started Guide For The Most Used Software In Recording Studios: Stefano Tumiati, #2
From Everand
Pro Tools For Breakfast: Get Started Guide For The Most Used Software In Recording Studios: Stefano Tumiati, #2
Stefano Tumiati
No ratings yet
Introduction To Data Visualization With Seaborn Chapter3
100% (1)
Introduction To Data Visualization With Seaborn Chapter3
32 pages
MitraStar GPT-2541GNAC Users Manual
100% (1)
MitraStar GPT-2541GNAC Users Manual
226 pages
Spoken Language Processing in Python Chapter3
No ratings yet
Spoken Language Processing in Python Chapter3
26 pages
Spoken Language Processing in Python Chapter2
No ratings yet
Spoken Language Processing in Python Chapter2
23 pages
Spoken Language Processing in Python Chapter4
No ratings yet
Spoken Language Processing in Python Chapter4
46 pages
Analyzing IoT Data in Python Chapter4
No ratings yet
Analyzing IoT Data in Python Chapter4
34 pages
Designing Machine Learning Workflows in Python Chapter4
No ratings yet
Designing Machine Learning Workflows in Python Chapter4
38 pages
Designing Machine Learning Workflows in Python Chapter3
No ratings yet
Designing Machine Learning Workflows in Python Chapter3
42 pages
Analyzing IoT Data in Python Chapter3
No ratings yet
Analyzing IoT Data in Python Chapter3
30 pages
Data Science in Python - Regression
No ratings yet
Data Science in Python - Regression
234 pages
Online Machine Learning Algorithms For Currency Exchange Prediction
No ratings yet
Online Machine Learning Algorithms For Currency Exchange Prediction
84 pages
Designing Machine Learning Workflows in Python Chapter1
No ratings yet
Designing Machine Learning Workflows in Python Chapter1
32 pages
Introduction To Data Visualization With Seaborn Chapter1
No ratings yet
Introduction To Data Visualization With Seaborn Chapter1
26 pages
A Practical Approach To Linear Regression in Machine Learning - by Ashwin Raj - Towards Data Science
No ratings yet
A Practical Approach To Linear Regression in Machine Learning - by Ashwin Raj - Towards Data Science
20 pages
ch2 Deterministic and Random Signal Analysis
No ratings yet
ch2 Deterministic and Random Signal Analysis
32 pages
LLM Explainable Financial Forecasting
No ratings yet
LLM Explainable Financial Forecasting
13 pages
Knime Anomaly Detection Visualization
No ratings yet
Knime Anomaly Detection Visualization
13 pages
ScipyLectures Simple
No ratings yet
ScipyLectures Simple
670 pages
Python For Machine Learning
No ratings yet
Python For Machine Learning
384 pages
L1 - Machine Learning For Finance
100% (1)
L1 - Machine Learning For Finance
131 pages
771 A18 Lec4
100% (1)
771 A18 Lec4
128 pages
100 Page Chat GPT Generated Python Tutorial
No ratings yet
100 Page Chat GPT Generated Python Tutorial
106 pages
Introduction To Data Visualization With Seaborn Chapter2
No ratings yet
Introduction To Data Visualization With Seaborn Chapter2
38 pages
Flask Restplus
No ratings yet
Flask Restplus
86 pages
Introduction To Data Visualization With Matplotlib Chapter2
No ratings yet
Introduction To Data Visualization With Matplotlib Chapter2
27 pages
Early Stopping in Practice
No ratings yet
Early Stopping in Practice
14 pages
Machine Learning With Python PDF
No ratings yet
Machine Learning With Python PDF
5 pages
Building Chatbots in Python Chapter4
No ratings yet
Building Chatbots in Python Chapter4
20 pages
Introduction To Data Science - Lin and Li
No ratings yet
Introduction To Data Science - Lin and Li
403 pages
Geographic Coordinate Conversion
No ratings yet
Geographic Coordinate Conversion
11 pages
Fundamental Analysis Via Machine Learning
No ratings yet
Fundamental Analysis Via Machine Learning
26 pages
Essentials of Machine Learning Algorithms
No ratings yet
Essentials of Machine Learning Algorithms
15 pages
Linux Log Files Location and How Do I View Logs Files On Linux
No ratings yet
Linux Log Files Location and How Do I View Logs Files On Linux
5 pages
Python Libraries Cheat Sheets
No ratings yet
Python Libraries Cheat Sheets
6 pages
Introduction To Cloud Infrastructure Technologies
No ratings yet
Introduction To Cloud Infrastructure Technologies
11 pages
Data Scientist Certification Study Guide
No ratings yet
Data Scientist Certification Study Guide
7 pages
Python Cheat Sheet
No ratings yet
Python Cheat Sheet
14 pages
Altoros Tensorflow Cheat Sheet
100% (1)
Altoros Tensorflow Cheat Sheet
1 page
Lecture 3 EdgeDetection
No ratings yet
Lecture 3 EdgeDetection
52 pages
Matlab OOP
No ratings yet
Matlab OOP
724 pages
Python DataScience PDF
100% (1)
Python DataScience PDF
9 pages
OceanofPDF - Com Pythonic Quant A Comprehensive Guide - Hayden Van Der Post
100% (2)
OceanofPDF - Com Pythonic Quant A Comprehensive Guide - Hayden Van Der Post
586 pages
Digital Communications Practical File PDF
100% (1)
Digital Communications Practical File PDF
44 pages
Apache Calcite Tutorial
No ratings yet
Apache Calcite Tutorial
83 pages
Pandas - Powerful Python Data Analysis Toolkit
No ratings yet
Pandas - Powerful Python Data Analysis Toolkit
95 pages
Fast Payment Flagship - Final - Nov 1
No ratings yet
Fast Payment Flagship - Final - Nov 1
113 pages
Python Tricks and Tips
No ratings yet
Python Tricks and Tips
84 pages
Nasdaq Data Link Data Fabric
100% (1)
Nasdaq Data Link Data Fabric
12 pages
Conquer Radio Frequency Ebook
No ratings yet
Conquer Radio Frequency Ebook
228 pages
Pandas Tutorial 1: Pandas Basics (Reading Data Files, Dataframes, Data Selection)
No ratings yet
Pandas Tutorial 1: Pandas Basics (Reading Data Files, Dataframes, Data Selection)
15 pages
M5 - Custom Model Building With SQL in BigQuery ML Slides
No ratings yet
M5 - Custom Model Building With SQL in BigQuery ML Slides
32 pages
Test Driven Machine Learning - Sample Chapter
100% (1)
Test Driven Machine Learning - Sample Chapter
25 pages
Data Science Learning Path For 50 Days
No ratings yet
Data Science Learning Path For 50 Days
15 pages
Embuk
No ratings yet
Embuk
36 pages
Machine Learning + Devops Using Azure ML Services
No ratings yet
Machine Learning + Devops Using Azure ML Services
17 pages
Lesson 1 - Course - Introduction
No ratings yet
Lesson 1 - Course - Introduction
9 pages
How To Use GitLab
No ratings yet
How To Use GitLab
8 pages
Chapter 1
No ratings yet
Chapter 1
17 pages
Pydub
No ratings yet
Pydub
26 pages
Lecture
No ratings yet
Lecture
7 pages
Preparing Your Gures To Share With Others: Ariel Rokem
No ratings yet
Preparing Your Gures To Share With Others: Ariel Rokem
35 pages
Chapter3 PDF
No ratings yet
Chapter3 PDF
36 pages
Changing Plot Style and Color: Erin Case
No ratings yet
Changing Plot Style and Color: Erin Case
54 pages
Introduction To Data Visualization With Matplotlib: Ariel Rokem
No ratings yet
Introduction To Data Visualization With Matplotlib: Ariel Rokem
30 pages
Customer Segmentation in Python Chapter3
No ratings yet
Customer Segmentation in Python Chapter3
25 pages
Customer Segmentation in Python Chapter4
No ratings yet
Customer Segmentation in Python Chapter4
37 pages
Cleaning Data With PySpark Chapter3
No ratings yet
Cleaning Data With PySpark Chapter3
25 pages
Credit Risk Modeling in Python Chapter4
100% (1)
Credit Risk Modeling in Python Chapter4
35 pages
Cleaning Data With PySpark Chapter2
100% (1)
Cleaning Data With PySpark Chapter2
25 pages
Cleaning Data With PySpark Chapter4
No ratings yet
Cleaning Data With PySpark Chapter4
23 pages
Cleaning Data With PySpark Chapter1
0% (1)
Cleaning Data With PySpark Chapter1
20 pages
Building Chatbots in Python Chapter2 PDF
No ratings yet
Building Chatbots in Python Chapter2 PDF
41 pages
Analyzing IoT Data in Python Chapter2
No ratings yet
Analyzing IoT Data in Python Chapter2
35 pages
Advanced NLP With Spacy Chapter4
No ratings yet
Advanced NLP With Spacy Chapter4
26 pages
Analyzing IoT Data in Python Chapter1
100% (1)
Analyzing IoT Data in Python Chapter1
27 pages
User Manual - TLL1711082-EN
No ratings yet
User Manual - TLL1711082-EN
7 pages
+@ (Original 18+) Viraly LOL Hindi Clip LOL Viraly Video ...
No ratings yet
+@ (Original 18+) Viraly LOL Hindi Clip LOL Viraly Video ...
4 pages
CDDI
No ratings yet
CDDI
1 page
Gartner MDR Managed Detection Response
No ratings yet
Gartner MDR Managed Detection Response
15 pages
DIR-615 X1 User-Manual 3.0.7 26.02.21 EN
No ratings yet
DIR-615 X1 User-Manual 3.0.7 26.02.21 EN
176 pages
CenRF 900&1800&2100 Digital Triple Band Pico User Manual
No ratings yet
CenRF 900&1800&2100 Digital Triple Band Pico User Manual
31 pages
School Brochure September 2017final
No ratings yet
School Brochure September 2017final
41 pages
History of The Internet of Things
100% (1)
History of The Internet of Things
40 pages
AS 88013 PLC OM 600L93 GB WW 1027-1 Keyence
No ratings yet
AS 88013 PLC OM 600L93 GB WW 1027-1 Keyence
6 pages
Primer On Usb C and PD
No ratings yet
Primer On Usb C and PD
14 pages
Debre Berhan University Collage of Engineering Department of Electrical and Computer Engineering
No ratings yet
Debre Berhan University Collage of Engineering Department of Electrical and Computer Engineering
72 pages
S700 Series Low Level Radar
No ratings yet
S700 Series Low Level Radar
6 pages
WSN 2023
No ratings yet
WSN 2023
3 pages
Using The IR Sensor System With VEX Cortex
No ratings yet
Using The IR Sensor System With VEX Cortex
7 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
212 Chapter 7 - Discussion Questions and Answers:: Telecommunications, Internet, Wireless Technology
No ratings yet
212 Chapter 7 - Discussion Questions and Answers:: Telecommunications, Internet, Wireless Technology
8 pages
BLOCK DIAGRAM - Rio M Block Diagram - 2.4 - 2018 - 09 - 14
No ratings yet
BLOCK DIAGRAM - Rio M Block Diagram - 2.4 - 2018 - 09 - 14
7 pages
07 Workshop Brochure 2
No ratings yet
07 Workshop Brochure 2
4 pages
World Radio 1993 07
No ratings yet
World Radio 1993 07
80 pages
Lecture 4 Interfacing and Communication
No ratings yet
Lecture 4 Interfacing and Communication
43 pages
Memory Organization - II: Unit - 6
No ratings yet
Memory Organization - II: Unit - 6
17 pages
Adaptive Arrays & Smart Antennas
No ratings yet
Adaptive Arrays & Smart Antennas
7 pages
2024 Catalog Digital Services Telkom DWS
No ratings yet
2024 Catalog Digital Services Telkom DWS
26 pages
Analysis and Comparison of UART SPI and I2C
No ratings yet
Analysis and Comparison of UART SPI and I2C
5 pages
Iee Edge Service Automation
No ratings yet
Iee Edge Service Automation
57 pages
Cobham - Aeroflex - IFR GPS-101 Datasheet
No ratings yet
Cobham - Aeroflex - IFR GPS-101 Datasheet
6 pages
RX-V430 530 HTR-5540 5550 Dsp-Ax430 530
100% (1)
RX-V430 530 HTR-5540 5550 Dsp-Ax430 530
7 pages
Quick Guide For SCA-IOT2050
No ratings yet
Quick Guide For SCA-IOT2050
14 pages

Spoken Language Processing in Python Chapter1

Uploaded by

Spoken Language Processing in Python Chapter1

Uploaded by

Introduction to

audio data in Python

Digital sounds measured in frequency (kHz)

SPOKEN LANGUAGE PROCESSING IN PYTHON

Audiobooks and spoken language are between 8 and 16 kHz

We can't see audio les so we have to transform them rst

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Import audio file as wave object

# Convert wave object to bytes

# View the wav file in byte form

SPOKEN LANGUAGE PROCESSING IN PYTHON

Small sample of audio = large amount of information

SPOKEN LANGUAGE PROCESSING IN PYTHON

Convert bytes to integers using numpy

# Convert soundwave_gm from bytes to integers

# Show the first 10 items

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Get the frame rate

# Show the frame rate

Duration of audio le (seconds) = length of wave object array/frequency (Hz)

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Get the timestamps of the good morning sound wave

SPOKEN LANGUAGE PROCESSING IN PYTHON

array([0.00000000e+00, 2.08334167e-05, 4.16668333e-05, 6.25002500e-05,

SPOKEN LANGUAGE PROCESSING IN PYTHON

Both are 48 kHz

Same data transformations to all audio les

SPOKEN LANGUAGE PROCESSING IN PYTHON

# Initialize figure and setup title

# x and y axis labels

# Add good morning and good afternoon values

# Create a legend and show our plot

SPOKEN LANGUAGE PROCESSING IN PYTHON

You might also like