This document summarizes research on using machine learning algorithms to detect gender based on voice and speech recognition. A dataset of 3168 labeled voice samples was collected and 21 acoustic features were extracted from each sample. Three machine learning classifiers - Decision Tree, Random Forest, and Logistic Regression - were applied to the dataset, with Random Forest achieving the highest accuracy of 98.5% for gender classification. The research demonstrates that machine learning can effectively classify gender based on voice biometrics, with performance varying based on the specific algorithm and dataset used.
This document summarizes research on using machine learning algorithms to detect gender based on voice and speech recognition. A dataset of 3168 labeled voice samples was collected and 21 acoustic features were extracted from each sample. Three machine learning classifiers - Decision Tree, Random Forest, and Logistic Regression - were applied to the dataset, with Random Forest achieving the highest accuracy of 98.5% for gender classification. The research demonstrates that machine learning can effectively classify gender based on voice biometrics, with performance varying based on the specific algorithm and dataset used.
College of Information Engineering and Communications
Department of Information Engineering and Communications
Gender Detection Based on Voice and Speech Recognition Using
Different Machine Learning Algorithms
By: Mohammed Wahhab Abdulrazzaq Introduction
Each vocal sound has physical characteristics that can be
related to perceptual evaluations of that sound.
Frequency (measured in Hertz, Hz), sound level (measured in
decibels, dB), acoustic characteristics.
Here we will be focused only on the measurement of physical
characteristics of the sound (Acoustic) Natural Sound Generation Dataset In order to analyze gender by voice and speech, a training database is required.
A database was built using 3168 of samples of male and
female voices, each labeled by their gender of male or female.
This Dataset contains 3168 rows and 21 columns (20
columns for each feature and one label column for the classification of male or female). Dataset Cont.. The voice physical characteristics (Acoustic) are obtained using Specan function using Warble package in R language.
for classification and regression. The goal is to create a model that predicts the value of a target variable by learning simple decision rules inferred from the data features.
Decision tree algorithms: ID3 , C4.5 , C5.0, CART
Machine Learning Classification Cont.. Random Forest Classification Random forest, like its name implies, consists of a large number of individual decision trees that operate as an ensemble.
Each individual tree in the random forest
spits out a class prediction and the class with the most votes become our model’s prediction. Machine Learning Classification Cont.. Logistic Regression Classification statistical model (also known as logit model) is often used for classification and predictive the outcome is a probability, the dependent variable is bounded between 0 and 1 Steps of Programming and Running Python language in Anaconda software and Jupyter model 1. Reading the Dataset
2. Checking the Dataset if it has missing values
3. Importing classifier
4. Applying machine training on 80% of the dataset
Steps of Programming and Running
5. Making Prediction for the rest 20% of dataset
6. Measuring the accuracy
7. Drawing the confusion matrix
8. Comparing the results of all the used classifiers
9. Testing the machine using new unseen and unlabeled
dataset Results Decision Tree Accuracy: 97.3186119873817 Confusion Matrix Results Random Forest Accuracy: 98.58044164037855 Confusion Matrix Results Decision Tree Accuracy: 92.74447949526814 Confusion Matrix Conclusions Human can make a decision for something but with limited volume. The machine intervention is required when the data are huge but should be trained perfectly by the human. The supervised machine learning algorithms can be used for doing that purpose through classifying the input data. In this project, three supervised machine learning classifiers (Decision Tree, Random Forest, and Logistic Regression) applied on the same voice dataset and we got different accuracy values. The results sows the best one is the Random Forest algorithm with accuracy 98.5, while the others are 97.7 for the Decision Tree algorithm, and finally 92.7 for the Logistic Regression. The accuracy of classification differs according to the type and size of dataset. THANK YOU
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB