
Python Mini Project – Speech Emotion Recognition with librosa


Speech emotion recognition is one of the best Python mini projects. The best example of it can be seen at call centers. If you have ever noticed, call center employees never talk to customers in the same manner; their way of pitching and talking changes from customer to customer. This happens with ordinary people too, but how is it relevant to call centers? Here is your answer: the employees recognize customers’ emotions from speech, so they can improve their service and convert more people. In this way, they are using speech emotion recognition. So, let’s discuss this project in detail.

Speech emotion recognition is a simple Python mini-project, which you are going to practice with DataFlair.

What is Speech Emotion Recognition?


Speech Emotion Recognition, abbreviated as SER, is the act of attempting
to recognize human emotion and affective states from speech. It
capitalizes on the fact that voice often reflects underlying emotion through
tone and pitch. This is also the phenomenon that animals like dogs and
horses use to understand human emotion.

SER is tough because emotions are subjective and annotating audio is challenging.
What is librosa?
librosa is a Python library for analyzing audio and music. It offers a flat
package layout, standardized interfaces and names, backwards
compatibility, modular functions, and readable code. Further, in this
Python mini-project, we demonstrate how to install it (and a few other
packages) with pip.

What is JupyterLab?
JupyterLab is an open-source, web-based UI for Project Jupyter. It has
all the basic functionality of the Jupyter Notebook, like notebooks, terminals,
text editors, file browsers, rich outputs, and more. However, it also provides
improved support for third-party extensions.

To run code in JupyterLab, you’ll first need to launch it from the command
prompt:
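Assuming JupyterLab is already installed, it is typically launched with:

jupyter lab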

This will open a new session in your browser. Create a new Console
and start typing in your code. JupyterLab can execute multiple lines of code
at once; pressing Enter will not execute your code, you’ll need to press
Shift+Enter for that.

Speech Emotion Recognition – Objective
To build a model to recognize emotion from speech using the librosa and
sklearn libraries and the RAVDESS dataset.
Speech Emotion Recognition – About the Python Mini Project

In this Python mini project, we will use the libraries librosa, soundfile, and
sklearn (among others) to build a model using an MLPClassifier. This will
be able to recognize emotion from sound files. We will load the data, extract
features from it, then split the dataset into training and testing sets. Then,
we’ll initialize an MLPClassifier and train the model. Finally, we’ll calculate
the accuracy of our model.

The Dataset
For this Python mini project, we’ll use the RAVDESS dataset; this is the
Ryerson Audio-Visual Database of Emotional Speech and Song dataset, and
it is free to download. This dataset has 7356 files rated by 247 individuals 10
times on emotional validity, intensity, and genuineness. The entire dataset
is 24.8 GB of recordings from 24 actors.

Prerequisites
You’ll need to install the following libraries with pip:
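A typical install command for the packages used in this project (librosa, soundfile, numpy, scikit-learn; adjust the names to your environment) looks like this:

pip install librosa soundfile numpy scikit-learn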

If you run into issues installing librosa with pip, you can try it with conda.
Steps for the Speech Emotion Recognition Python Project
1. Make the necessary imports:

Screenshot:
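The screenshot is not reproduced here; based on the functions used in the following steps, the imports would look roughly like this:

import glob
import os
import librosa
import soundfile
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.metrics import accuracy_score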
2. Define a function extract_feature to extract the mfcc, chroma, and mel
features from a sound file. This function takes four parameters: the file name
and three Boolean flags, one for each feature:

• mfcc: Mel Frequency Cepstral Coefficients, representing the short-term power spectrum of a sound
• chroma: pertains to the 12 different pitch classes
• mel: Mel Spectrogram Frequency

Open the sound file with soundfile.SoundFile using with-as so it’s
automatically closed once we’re done. Read from it and call it X. Also, get
the sample rate. If chroma is True, get the Short-Time Fourier Transform of
X.

Let result be an empty numpy array. Now, for each of the three features, if it
is requested, make a call to the corresponding function from librosa.feature
(e.g. librosa.feature.mfcc for mfcc), and get the mean value. Call the function
hstack() from numpy with result and the feature value, and store this in
result. hstack() stacks arrays in sequence horizontally (in a columnar
fashion). Then, return the result.
Screenshot:
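The code is only shown as a screenshot in the original; here is a sketch of extract_feature following the description above (the exact parameter values, such as using 40 MFCCs, are illustrative):

def extract_feature(file_name, mfcc, chroma, mel):
    # Open the sound file so it is closed automatically when we're done
    with soundfile.SoundFile(file_name) as sound_file:
        X = sound_file.read(dtype="float32")
        sample_rate = sound_file.samplerate
        if chroma:
            # chroma is computed from the Short-Time Fourier Transform of X
            stft = np.abs(librosa.stft(X))
        result = np.array([])
        if mfcc:
            mfccs = np.mean(librosa.feature.mfcc(y=X, sr=sample_rate, n_mfcc=40).T, axis=0)
            result = np.hstack((result, mfccs))
        if chroma:
            chroma_feature = np.mean(librosa.feature.chroma_stft(S=stft, sr=sample_rate).T, axis=0)
            result = np.hstack((result, chroma_feature))
        if mel:
            mel_feature = np.mean(librosa.feature.melspectrogram(y=X, sr=sample_rate).T, axis=0)
            result = np.hstack((result, mel_feature))
    return result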
3. Now, let’s define a dictionary to map numbers to the emotions
available in the RAVDESS dataset, and a list to hold the ones we want to
observe: calm, happy, fearful, disgust.

Screenshot:
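Based on the emotion codes documented for the RAVDESS dataset, the dictionary and list could look like this:

# Emotion labels encoded in RAVDESS file names
emotions = {
    '01': 'neutral',
    '02': 'calm',
    '03': 'happy',
    '04': 'sad',
    '05': 'angry',
    '06': 'fearful',
    '07': 'disgust',
    '08': 'surprised'
}

# Emotions we actually want to classify
observed_emotions = ['calm', 'happy', 'fearful', 'disgust']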
4. Now, let’s load the data with a function load_data() – this takes in the
relative size of the test set as a parameter. x and y are empty lists; we’ll use
the glob() function from the glob module to get all the pathnames for the
sound files in our dataset. The pattern we use for this is:
“D:\\DataFlair\\ravdess data\\Actor_*\\*.wav”. This is because our
dataset looks like this:

Screenshot:

So, for each such path, get the basename of the file and the emotion by
splitting the name around ‘-’ and extracting the third value:
Screenshot:

Using our emotions dictionary, this number is turned into an emotion, and
our function checks whether this emotion is in our list of
observed_emotions; if not, it continues to the next file. It makes a call to
extract_feature and stores what is returned in ‘feature’. Then, it appends
the feature to x and the emotion to y. So, the list x holds the features and y
holds the emotions. We call the function train_test_split with these, the
test size, and a random state value, and return that.
Screenshot:
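Putting this step together, load_data could look like the sketch below (the path pattern is the example given above, and the random state is just an arbitrary fixed seed for reproducibility):

def load_data(test_size=0.2):
    x, y = [], []
    for file in glob.glob("D:\\DataFlair\\ravdess data\\Actor_*\\*.wav"):
        file_name = os.path.basename(file)
        # The third '-'-separated field of the file name encodes the emotion
        emotion = emotions[file_name.split("-")[2]]
        if emotion not in observed_emotions:
            continue
        feature = extract_feature(file, mfcc=True, chroma=True, mel=True)
        x.append(feature)
        y.append(emotion)
    return train_test_split(np.array(x), y, test_size=test_size, random_state=9)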

5. Time to split the dataset into training and testing sets! Let’s keep the test
set at 25% of the data and use the load_data function for this.

Screenshot:
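In code, this could be:

x_train, x_test, y_train, y_test = load_data(test_size=0.25)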
6. Observe the shape of the training and testing datasets:

Screenshot:
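For example, printing the number of training and testing samples:

print((x_train.shape[0], x_test.shape[0]))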

7. And get the number of features extracted.

Output Screenshot:
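A simple way to do this, given that each row of x_train is one feature vector:

print(f'Features extracted: {x_train.shape[1]}')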

8. Now, let’s initialize an MLPClassifier. This is a Multi-layer Perceptron
Classifier; it optimizes the log-loss function using LBFGS or stochastic
gradient descent. Unlike SVM or Naive Bayes, the MLPClassifier has an
internal neural network for the purpose of classification. This is a
feedforward ANN model.

Screenshot:
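One reasonable configuration is sketched below; the hyperparameter values are illustrative and may differ from those in the original screenshot:

model = MLPClassifier(alpha=0.01, batch_size=256, epsilon=1e-08,
                      hidden_layer_sizes=(300,), learning_rate='adaptive',
                      max_iter=500)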
9. Fit/train the model.

Output Screenshot:
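That is:

model.fit(x_train, y_train)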

10. Let’s predict the values for the test set. This gives us y_pred (the predicted emotions for
the features in the test set).

Screenshot:
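That is:

y_pred = model.predict(x_test)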
11. To calculate the accuracy of our model, we’ll call the accuracy_score()
function we imported from sklearn. Finally, we’ll round the accuracy to 2
decimal places and print it out.

Output Screenshot:
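A sketch of the accuracy calculation:

accuracy = accuracy_score(y_true=y_test, y_pred=y_pred)
print("Accuracy: {:.2f}%".format(accuracy * 100))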

Summary
In this Python mini project, we learned to recognize emotions from speech.
We used an MLPClassifier for this and made use of the soundfile library to
read the sound file, and the librosa library to extract features from it. As
you’ll see, the model delivered an accuracy of 72.4% on this dataset. That’s good enough
for us for now.

Hope you enjoyed this Python mini project.
