0% found this document useful (0 votes)

34 views4 pages

Hexatalk Using ANN and DNNS

The document presents a speaker recognition system utilizing Long Short-Term Memory (LSTM) neural networks and Mel-Frequency Cepstral Coefficients (MFCCs) for feature extraction. This system addresses limitations of traditional methods like Gaussian Mixture Models (GMMs) and Hidden Markov Models (HMMs) by achieving high accuracy and robustness in noisy environments. The proposed approach demonstrates significant improvements in speaker classification performance, scalability, and noise resilience compared to conventional techniques.

Uploaded by

International Journal of Innovative Science and Research Technology

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views4 pages

Hexatalk Using ANN and DNNS

Uploaded by

International Journal of Innovative Science and Research Technology

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Volume 10, Issue 4, April – 2025 International Journal of Innovative Science and Research Technology

ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/25apr1252

Hexatalk using ANN and DNNS

M Ravi1; Dr. A Obulesu2; CH Vinod Vara Prasad3; N Abhishek4;
N Rithish Reddy5; V Anil Chary6
1,2,3,4,5,6
, Vidya Jyothi Institute of Technology, Hyderabad, Telangana, India

Publication Date: 2025/04/30

Abstract: Speaker recognition is an essential aspect of human-computer interaction, with applications in security,
personalized services, and more. This project proposes an end-to-end speaker recognition system leveraging Long Short-
Term Memory (LSTM) neural networks. Mel-Frequency Cepstral Coefficients (MFCCs) are used as audio features,
processed by an LSTM model to classify speakers with high accuracy. The proposed system demonstrates the efficacy of
LSTM for temporal feature analysis, achieving robust performance in noisy environments.

Keywords: Speaker Recognition, Deep Learning, MFCC, LSTM, Audio Classification.

How to Cite: M Ravi; Dr. A Obulesu Ch Vinod Vara Prasad; N Abhishek; N Rithish Reddy; V Anil Chary (2025). Hexatalk using
ANN and DNNS. International Journal of Innovative Science and Research Technology, 10(4), 1789-1792.
Https://Doi.Org/10.38124/Ijisrt/25apr1252

I. INTRODUCTION II. EXISTING SYSTEMS

Speaker recognition involves identifying or verifying the Traditional speaker recognition systems primarily rely on
identity of a speaker based on audio signals. With the statistical approaches like Gaussian Mixture Models (GMMs)
increasing adoption of smart devices and voice assistants, and Hidden Markov Models (HMMs) for feature extraction
robust speaker recognition systems have become essential for and classification. These methods have been the foundation of
applications like biometric authentication, personalized speaker recognition for decades due to their ability to model
services, and secure communication. speech dynamics and variations effectively. However, they
exhibit several critical limitations
Traditional methods such as Gaussian Mixture Models
(GMMs) and Hidden Markov Models (HMMs) rely on  Dependency on Handcrafted Features
handcrafted features and struggle to model the complex and Traditional systems rely heavily on manually designed
dynamic nature of speech signals. They also face challenges in features, such as spectral or prosodic attributes. These
adapting to noise, speaker variability, and environmental handcrafted features often fail to capture the full complexity
changes, limiting their effectiveness in real-world scenarios. of speech signals, especially under varying conditions.

Recent advancements in deep learning have  Inability to Capture Temporal Relationships

revolutionized speaker recognition by enabling models to Speech signals are inherently sequential and dynamic.
learn intricate patterns in audio data. Architectures like Statistical methods struggle to model long-term dependencies
Convolutional Neural Networks (CNNs) and Recurrent Neural and temporal relationships, which are crucial for accurate
Networks (RNNs), particularly Long Short-Term Memory speaker recognition.
(LSTM) networks, excel at capturing temporal dependencies
in speech.  Reduced Performance in Noisy or Dynamic Conditions
Real-world audio data often includes background noise,
This paper introduces an LSTM-based speaker overlapping speech, and variable recording environments.
recognition system that leverages Mel-Frequency Cepstral Traditional methods lack robustness in such scenarios, leading
Coefficients (MFCCs) for feature extraction. The proposed to significant performance degradation.
approach addresses challenges such as noise robustness and
variability, achieving high performance in diverse conditions, Moreover, while some hybrid approaches have
and advancing the capabilities of speaker recognition integrated machine learning with traditional methods to
technology. improve performance, they remain limited in scalability and
adaptability. For instance, these models often require
extensive preprocessing, feature engineering, and domain
expertise, making them less suitable for real-time applications
or deployment in diverse environments.

IJISRT25APR1252 www.ijisrt.com 1789

Volume 10, Issue 4, April – 2025 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/25apr1252
Recent studies have highlighted the potential of deep LSTM outputs, mapping them to speaker classifications.
learning to address these challenges by automating feature Finally, the output layer applies a softmax activation function,
extraction and leveraging data-driven learning methods. generating probabilities for each speaker class. This structured
However, many existing deep learning-based models are design ensures scalability and adaptability for diverse
either computationally expensive or not optimized to handle applications.
the noise, variability, and scalability demands of real-world
data. These systems often overfit on clean datasets and fail to  Visualization
generalize effectively when exposed to unpredictable The attached architecture diagram illustrates the system
conditions, leaving significant room for improvement in workflow, starting from audio input, progressing through
practical applications. MFCC extraction and LSTM processing, and culminating in
speaker classification. This visualization highlights the key
III. PROPOSED SYSTEM components and their interactions, emphasizing the system’s
scalability and robustness.
The proposed system employs a Long Short-Term
Memory (LSTM) neural network for speaker classification by IV. MODULES
analyzing Mel-Frequency Cepstral Coefficients (MFCCs).
These features encode both spectral and temporal  Feature Extraction Module
characteristics of audio, making them highly effective for The feature extraction module is responsible for
speaker recognition. LSTMs are particularly suitable for this converting raw audio signals into meaningful representations
task due to their ability to model long-term dependencies that can be processed by the machine learning model. This
within sequential data, addressing limitations of traditional module uses Mel-Frequency Cepstral Coefficients (MFCCs),
methods like Gaussian Mixture Models (GMMs) and Hidden which capture the spectral properties of speech signals,
Markov Models (HMMs). This approach ensures robustness mimicking how humans perceive sound. MFCC extraction
in noisy conditions and improves classification accuracy. involves several steps, including framing, applying the Fast
Fourier Transform (FFT), and mapping frequencies to the Mel
 Feature Extraction and Data Preparation scale. Additionally, this module may include preprocessing
The system begins by extracting MFCC features from techniques such as noise reduction, silence removal, and
raw audio inputs, capturing essential speech characteristics. normalization to ensure clean and consistent audio features.
These features are then processed and formatted into These processes enhance the quality of the extracted features,
sequences compatible with LSTM networks. Data making them suitable for downstream tasks like speaker
augmentation techniques, such as adding synthetic noise, are classification.
applied to make the system resilient to environmental
variability. The dataset is labeled and split into training,  Data Preparation Module
validation, and testing subsets, with necessary preprocessing Once the features are extracted, the data preparation
steps like padding or truncating sequences for uniformity. module organizes and structures the data for input into the
Long Short-Term Memory (LSTM) model. This involves
 Model Training and Optimization encoding speaker labels, typically using one-hot encoding, to
During the training phase, the LSTM network learns facilitate classification tasks. The dataset is then split into
temporal patterns unique to each speaker using labeled data. training, validation, and testing subsets to ensure a fair
Regularization techniques, including dropout and batch evaluation of the model's performance. To handle the
normalization, are employed to prevent overfitting. The sequential nature of audio data, input sequences are either
model’s parameters, such as the learning rate and number of padded or truncated to a uniform length, ensuring
hidden layers, are fine-tuned for optimal performance. compatibility with the LSTM's input requirements. Data
Training ensures the network effectively maps MFCC inputs augmentation techniques, such as adding synthetic noise,
to speaker labels, leveraging its sequential learning time-stretching, or pitch-shifting, may also be applied to
capabilities. increase dataset diversity and improve the model's robustness
in real-world scenarios.
 Evaluation and Metrics
Once trained, the model is evaluated using metrics such  Model Building Module
as accuracy, precision, recall, and F1-score. Its performance is The model building module is the core of the system,
tested under both clean and noisy conditions to ensure where the LSTM neural network is constructed and optimized
reliability in real-world applications. The system’s robustness for speaker recognition. The architecture typically includes
is further validated by comparing its results against traditional
methods, highlighting significant improvements in speaker  Input Layer
recognition accuracy. Accepts the preprocessed MFCC feature sequences.

 System Architecture  LSTM Layers

The architecture consists of several key components. The Stacked LSTM layers process the sequential data,
input layer processes MFCC feature sequences, which are learning temporal dependencies and speaker-specific patterns.
passed through stacked LSTM layers to capture temporal
dependencies. Fully connected layers further process the

IJISRT25APR1252 www.ijisrt.com 1790

Volume 10, Issue 4, April – 2025 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/25apr1252
 Fully Connected Layers  Improved Feature Utilization:
These layers transform the learned temporal features into The integration of MFCC features with LSTM networks
speaker classifications. led to significant improvements in recognition accuracy
compared to GMM-HMM systems, particularly in handling
 Output Layer sequential and temporal data.
A softmax activation function generates probabilities for
each speaker class.  Performance Comparison
The following metrics were used to assess the system:
The model is fine-tuned using hyperparameter
optimization, such as adjusting the number of LSTM layers,  Accuracy:
the size of hidden units, and the learning rate. Regularization Demonstrated the system's ability to correctly identify
techniques like dropout and L2 regularization are applied to speakers. The LSTM-based model consistently outperformed
prevent overfitting and enhance the model's generalization traditional approaches, achieving gains of 15-20% in noisy
capabilities. scenarios.

 Evaluation Module  Loss:

The evaluation module ensures the system's Low test loss values indicated that the model generalized
effectiveness and reliability by analyzing its performance on well to unseen data, avoiding overfitting despite noise and
various metrics. Common metrics include accuracy, precision, variability.
recall, F1-score, and confusion matrices, providing a detailed
assessment of the model's classification abilities. This module  Precision and Recall:
also evaluates the model's loss during training and testing These metrics were used to evaluate the balance between
phases to monitor convergence and stability. Robustness is false positives and false negatives, showing the LSTM model's
tested by introducing different noise levels into the evaluation superior reliability in speaker classification tasks.
dataset, simulating real-world conditions. Comparative
analyses with baseline models, such as Gaussian Mixture  Comparative Analysis
Models (GMMs) or Hidden Markov Models (HMMs), are The comparison highlighted substantial gains offered by
conducted to highlight the proposed system's advantages. the LSTM-based system over GMM-HMM methods:

V. SIMULATION RESULTS  Temporal Pattern Recognition:

The LSTM model excelled in capturing temporal
 Simulation Setup dependencies in sequential data, which traditional methods
The system was rigorously tested using a dataset struggled to handle.
comprising multiple speakers under varying noise conditions,
including background chatter, white noise, and environmental  Noise Resilience:
disturbances. The primary objective of the simulation was to The deep learning-based system demonstrated
evaluate the robustness and accuracy of the LSTM-based significantly higher robustness in environments with
model in comparison to traditional methods like Gaussian background noise or overlapping speech.
Mixture Models-Hidden Markov Models (GMM-HMM). The
dataset was split into training, validation, and testing subsets  Scalability:
to ensure unbiased performance evaluation. Synthetic noise The LSTM architecture scaled effectively to datasets
augmentation was applied during training to improve the with a large number of speakers, maintaining high accuracy
model's robustness to real-world conditions. without requiring extensive manual feature engineering.

 Key Results VI. CONCLUSION

 The simulation yielded impressive results, demonstrating This paper proposed an advanced speaker recognition
the efficacy of the LSTM-based speaker recognition system leveraging Long Short-Term Memory (LSTM) neural
system. The key findings include: networks, which effectively addressed the challenges
associated with traditional methods. By utilizing Mel-
 High Accuracy on Clean Data: Frequency Cepstral Coefficients (MFCCs) as input features,
The model achieved an accuracy exceeding 95% on clean the system captured critical spectral and temporal speech
datasets, indicating its effectiveness in speaker identification. characteristics, enabling precise speaker classification. The
LSTM architecture excelled in modeling temporal
 Robustness in Noisy Conditions: dependencies, achieving robust performance across diverse
Even under high noise levels, the model maintained conditions, including noisy and dynamic environments.
robust performance with minimal degradation in accuracy, Compared to conventional approaches like Gaussian Mixture
outperforming traditional methods. Models (GMMs) and Hidden Markov Models (HMMs), the
proposed system demonstrated significant improvements in
accuracy, scalability, and noise resilience.
.

IJISRT25APR1252 www.ijisrt.com 1791

Volume 10, Issue 4, April – 2025 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/25apr1252
REFERENCES

[1]. Yu, D., & Deng, L. Automatic Speech Recognition: A

Deep Learning Approach. Springer, 2015.J. Clerk
Maxwell, A Treatise on Electricity and Magnetism, 3rd
ed., vol. 2. Oxford: Clarendon, 1892, pp.68-73.
[2]. Chollet, F. Deep Learning with Python. Manning
Publications, 2018.
[3]. Hochreiter, S., & Schmidhuber, J. Long Short-Term
Memory. Neural Computation, 1997.

IJISRT25APR1252 www.ijisrt.com 1792

Speaker Recognition, PHD Thesis
No ratings yet
Speaker Recognition, PHD Thesis
231 pages
Exploring The Association Between Attachment and Bullying Among Adolescents Through Bowlbian Perspective
No ratings yet
Exploring The Association Between Attachment and Bullying Among Adolescents Through Bowlbian Perspective
10 pages
Temperature-Energy Relationships and Spatial Distribution Analysis for Nano-Enhanced Phase Change Materials Via Thermal Energy Storage
No ratings yet
Temperature-Energy Relationships and Spatial Distribution Analysis for Nano-Enhanced Phase Change Materials Via Thermal Energy Storage
18 pages
Solid Dispersion-Based Approaches for Improving Oral Bioavailability: Current Progress and Future Perspectives
No ratings yet
Solid Dispersion-Based Approaches for Improving Oral Bioavailability: Current Progress and Future Perspectives
8 pages
Assessment of Caregivers' Knowledge and Acceptance of The Human Papilloma Virus Vaccine in Maihula Community, Bali Lga, Taraba State, Nigeria
No ratings yet
Assessment of Caregivers' Knowledge and Acceptance of The Human Papilloma Virus Vaccine in Maihula Community, Bali Lga, Taraba State, Nigeria
8 pages
Dental Care Flip Model: Dental Health Education To Improve Dental Health Maintenance Behavior of Elementary School Students
No ratings yet
Dental Care Flip Model: Dental Health Education To Improve Dental Health Maintenance Behavior of Elementary School Students
8 pages
Analyzing The Efficiency of Hybrid Explainable AI Models For Feature Extraction and Pattern Recognition in High-Dimensional Data Mining Tasks
No ratings yet
Analyzing The Efficiency of Hybrid Explainable AI Models For Feature Extraction and Pattern Recognition in High-Dimensional Data Mining Tasks
12 pages
Fenton Reagent-Based Advanced Oxidation For The Degradation of Reactive Black 5 and Methylene Blue Dyes
No ratings yet
Fenton Reagent-Based Advanced Oxidation For The Degradation of Reactive Black 5 and Methylene Blue Dyes
17 pages
Cardiovascular Catastrophe in Catastrophic Antiphospholipid Syndrome: A Case Report
No ratings yet
Cardiovascular Catastrophe in Catastrophic Antiphospholipid Syndrome: A Case Report
5 pages
Statistical Machine Learning
No ratings yet
Statistical Machine Learning
28 pages
Hands-On Speech Recognition Wit - Yamin Ren
No ratings yet
Hands-On Speech Recognition Wit - Yamin Ren
223 pages
Parental Participation and Students' Academic Achievement in Selected Government Aided Secondary Schools in Kibaale Town Council, Rakai District, Uganda
No ratings yet
Parental Participation and Students' Academic Achievement in Selected Government Aided Secondary Schools in Kibaale Town Council, Rakai District, Uganda
11 pages
Reviving Chettinad Architecture: A Cultural Legacy of Tamil Nadu
No ratings yet
Reviving Chettinad Architecture: A Cultural Legacy of Tamil Nadu
9 pages
NPAs and Profitability in Indian Private Sector Banks: Evidence from a Panel Study
No ratings yet
NPAs and Profitability in Indian Private Sector Banks: Evidence from a Panel Study
7 pages
Ginkgo Biloba-Derived Flavonoids as Metal Chelators in Alzheimer’s Neurochemistry: A Biochemical Approach
No ratings yet
Ginkgo Biloba-Derived Flavonoids as Metal Chelators in Alzheimer’s Neurochemistry: A Biochemical Approach
7 pages
Methods and Measures Growth Mixture Modeling A Method For Identifying Differences in Longitudinal Change Among Unobserved Groups
No ratings yet
Methods and Measures Growth Mixture Modeling A Method For Identifying Differences in Longitudinal Change Among Unobserved Groups
13 pages
Applsci 15 02924
No ratings yet
Applsci 15 02924
27 pages
Nanospore
No ratings yet
Nanospore
14 pages
Isolated Fallopian Tube Torsion Caused by A Mature Cystic Teratoma: A Rare Case Report
No ratings yet
Isolated Fallopian Tube Torsion Caused by A Mature Cystic Teratoma: A Rare Case Report
6 pages
Seminar Report Parthiv
No ratings yet
Seminar Report Parthiv
58 pages
Msge Desta Review of Speaker Recognition From Spectrogram Images
No ratings yet
Msge Desta Review of Speaker Recognition From Spectrogram Images
5 pages
Machine Learning-Aided Hybrid Technique For Dynamics of Rail Transit Stations Classification A Case Study
No ratings yet
Machine Learning-Aided Hybrid Technique For Dynamics of Rail Transit Stations Classification A Case Study
30 pages
Zlib - Pub - Swarm Intelligence Methods For Statistical Regression
No ratings yet
Zlib - Pub - Swarm Intelligence Methods For Statistical Regression
137 pages
1 s2.0 S0361476X06000543 Main
No ratings yet
1 s2.0 S0361476X06000543 Main
40 pages
Seminar Report Final
No ratings yet
Seminar Report Final
37 pages
2018ac04523 Final Report
No ratings yet
2018ac04523 Final Report
27 pages
Efficacy, Safety, and Feasibility of Verapamil in The Management of Atrial Fibrillation in Emergency Services With Limited Resources: A Systematic Review
No ratings yet
Efficacy, Safety, and Feasibility of Verapamil in The Management of Atrial Fibrillation in Emergency Services With Limited Resources: A Systematic Review
13 pages
Data-Driven Neural Network Based Feature - Phd-Thesis
No ratings yet
Data-Driven Neural Network Based Feature - Phd-Thesis
155 pages
2018ac04523 FR
No ratings yet
2018ac04523 FR
27 pages
Personal-Professional Attributes of Teachers and Learning Competence of Junior High School Students
No ratings yet
Personal-Professional Attributes of Teachers and Learning Competence of Junior High School Students
28 pages
Integrated Method of Deep Learning and Large Language Model in Speech Recognition
No ratings yet
Integrated Method of Deep Learning and Large Language Model in Speech Recognition
6 pages
Sita#1part2 Merged
No ratings yet
Sita#1part2 Merged
61 pages
Search For Binary Companions Around Millisecond Pulsars
No ratings yet
Search For Binary Companions Around Millisecond Pulsars
13 pages
A Novel Scheme For Speaker Recognition Using A Phonetically-Aware
No ratings yet
A Novel Scheme For Speaker Recognition Using A Phonetically-Aware
5 pages
Full Text 01
No ratings yet
Full Text 01
54 pages
An Amalgamation of Integrated Features With Deepspeech2 Architecture and Improved Spell Corrector For Improving Gujarati Language Asr System
No ratings yet
An Amalgamation of Integrated Features With Deepspeech2 Architecture and Improved Spell Corrector For Improving Gujarati Language Asr System
13 pages
2022.lrec-1.542 A Survey of Multilingual Models For Automatic Speech Recognition
No ratings yet
2022.lrec-1.542 A Survey of Multilingual Models For Automatic Speech Recognition
9 pages
Intelligent Speech Recognition Algorithm in Multimedia Visual Interaction Via BiLSTM and Attention Mechanism
No ratings yet
Intelligent Speech Recognition Algorithm in Multimedia Visual Interaction Via BiLSTM and Attention Mechanism
13 pages
Research Method and Presentation (Mini Project Proposal)
No ratings yet
Research Method and Presentation (Mini Project Proposal)
26 pages
6 - Conception of Speaker Recognition Methods A Review
No ratings yet
6 - Conception of Speaker Recognition Methods A Review
6 pages
Learning Augmented Joint-Space Task-Oriented Dynamical Systems: A Linear Parameter Varying and Synergetic Control Approach
No ratings yet
Learning Augmented Joint-Space Task-Oriented Dynamical Systems: A Linear Parameter Varying and Synergetic Control Approach
8 pages
Pamectomy in Lobular Breast Cancer
No ratings yet
Pamectomy in Lobular Breast Cancer
3 pages
Speaker Recognition Based On Deep Learning: An Overview
No ratings yet
Speaker Recognition Based On Deep Learning: An Overview
39 pages
Thesis Bich Ngoc Do
No ratings yet
Thesis Bich Ngoc Do
72 pages
Xiao Guest Lecture ASR
No ratings yet
Xiao Guest Lecture ASR
39 pages
University Libraries and The Use of Open Educational Resources (OERs) in Blended Learning (BL) : Effective Strategies From Nairobi County
No ratings yet
University Libraries and The Use of Open Educational Resources (OERs) in Blended Learning (BL) : Effective Strategies From Nairobi County
7 pages
Digital Transformation in The Judiciary: Evaluating The Impact of Court Case Management Systems On Reducing Case Backlogs and Enhancing Efficiency in Subordinate Courts of Tamil Nadu
No ratings yet
Digital Transformation in The Judiciary: Evaluating The Impact of Court Case Management Systems On Reducing Case Backlogs and Enhancing Efficiency in Subordinate Courts of Tamil Nadu
2 pages
Social Medias Influence On Modern Language and Communication Skills
No ratings yet
Social Medias Influence On Modern Language and Communication Skills
12 pages
Speaker Recognition Thesis
100% (3)
Speaker Recognition Thesis
8 pages
Quantifying, Measuring, and Correlating Socio - Cultural Variables: An Indispensable Technique For Diverse Fields of The Social Sciences
No ratings yet
Quantifying, Measuring, and Correlating Socio - Cultural Variables: An Indispensable Technique For Diverse Fields of The Social Sciences
12 pages
Case Study: Speech Recognition For Virtual Assistants: 1. Problem Identification
No ratings yet
Case Study: Speech Recognition For Virtual Assistants: 1. Problem Identification
8 pages
Study On Speech Recognition Method of Artificial Intelligence Deep Learning
No ratings yet
Study On Speech Recognition Method of Artificial Intelligence Deep Learning
6 pages
Childhood Adversity and Its Echoes in Adult Intimate Relationships
No ratings yet
Childhood Adversity and Its Echoes in Adult Intimate Relationships
9 pages
A Hybrid of Deep Neural Network and EXtreme Gradie
No ratings yet
A Hybrid of Deep Neural Network and EXtreme Gradie
12 pages
Speech Proceesing End Sem Reveiw FINAL
No ratings yet
Speech Proceesing End Sem Reveiw FINAL
16 pages
Project Report
No ratings yet
Project Report
17 pages
Intercalating A Multi-Barreled Approach To Educational and Pedagogical Reform: A Brief Summation of Our Publications On Pedagogy
No ratings yet
Intercalating A Multi-Barreled Approach To Educational and Pedagogical Reform: A Brief Summation of Our Publications On Pedagogy
12 pages
Gastrointestinal Stromal Tumour (GIST)
No ratings yet
Gastrointestinal Stromal Tumour (GIST)
5 pages
Speaker Recognition PHD Thesis
100% (3)
Speaker Recognition PHD Thesis
5 pages
Anabarasi (1) 11
No ratings yet
Anabarasi (1) 11
16 pages
Parental Influence On Aggression and Self-Esteem Among Young Adults: An Indian Context
No ratings yet
Parental Influence On Aggression and Self-Esteem Among Young Adults: An Indian Context
6 pages
1 s2.0 S095741742302746X Main
No ratings yet
1 s2.0 S095741742302746X Main
13 pages
Production Functions - Structural Approaches: Rachel Griffith
No ratings yet
Production Functions - Structural Approaches: Rachel Griffith
59 pages
A Novel Pipeline Leak Detection Approach Independent of Prior Failure Information
No ratings yet
A Novel Pipeline Leak Detection Approach Independent of Prior Failure Information
12 pages
Bayesdll: Bayesian Deep Learning Library: T.Hospedales@Ed - Ac.Uk
No ratings yet
Bayesdll: Bayesian Deep Learning Library: T.Hospedales@Ed - Ac.Uk
13 pages
Potential Wound Healing Activity of Citrus Micrantha Rut. (Biasong) Ethanolic Peel Extract On Excised Cutaneous Wounds in Male Albino Mice
No ratings yet
Potential Wound Healing Activity of Citrus Micrantha Rut. (Biasong) Ethanolic Peel Extract On Excised Cutaneous Wounds in Male Albino Mice
11 pages
Pattern Recognition - Unit - 1&2
100% (1)
Pattern Recognition - Unit - 1&2
41 pages
Lecture 3
No ratings yet
Lecture 3
15 pages
Garimella Thesis
No ratings yet
Garimella Thesis
74 pages
Unpacking Financial Interventions Link To Student Academic Performance in Public Secondary Schools: A Nyamira County Level Analysis, Kenya
No ratings yet
Unpacking Financial Interventions Link To Student Academic Performance in Public Secondary Schools: A Nyamira County Level Analysis, Kenya
11 pages
Piyu Sem Report.5
No ratings yet
Piyu Sem Report.5
30 pages
Lo-Mendell-Rubin Adjusted Likelihood Ratio Test and Wald Test - Statalist
No ratings yet
Lo-Mendell-Rubin Adjusted Likelihood Ratio Test and Wald Test - Statalist
12 pages
A Study To Assess The General Mental Health Among College Students in Selected Colleges at Kannur District
No ratings yet
A Study To Assess The General Mental Health Among College Students in Selected Colleges at Kannur District
5 pages
Concepts and Techniques: - Chapter 11
No ratings yet
Concepts and Techniques: - Chapter 11
103 pages
Enabling Independence Through Sound Recognition
No ratings yet
Enabling Independence Through Sound Recognition
10 pages
An Overview of Text-Independent Speaker Recognitio PDF
No ratings yet
An Overview of Text-Independent Speaker Recognitio PDF
31 pages
Targeted Voice Separation
No ratings yet
Targeted Voice Separation
4 pages
Sound For 3D Cinema and The Sense of Presence
No ratings yet
Sound For 3D Cinema and The Sense of Presence
9 pages
Beyond The Tests: How Portfolios Whisper of Equity and Engagement in Our Classrooms
100% (1)
Beyond The Tests: How Portfolios Whisper of Equity and Engagement in Our Classrooms
2 pages
El 29 2 15
No ratings yet
El 29 2 15
8 pages
Mediating Conflicts: Challenges of School Grievance Committee
No ratings yet
Mediating Conflicts: Challenges of School Grievance Committee
4 pages
AI Speech Recognition Document
No ratings yet
AI Speech Recognition Document
26 pages
Rohit
No ratings yet
Rohit
14 pages
Sample Midterm Exam 6
No ratings yet
Sample Midterm Exam 6
11 pages
Secr Overview
No ratings yet
Secr Overview
20 pages
Medical Applications of Finite Mixture Models Full Digital Edition
100% (13)
Medical Applications of Finite Mixture Models Full Digital Edition
15 pages
KY DSV
No ratings yet
KY DSV
7 pages
Voice Recognition System Using Machine L
No ratings yet
Voice Recognition System Using Machine L
7 pages
Thesis On Speaker Recognition System
100% (2)
Thesis On Speaker Recognition System
4 pages
Safety, Security, and Convenience: The Benefits of Voice Recognition Technology
No ratings yet
Safety, Security, and Convenience: The Benefits of Voice Recognition Technology
5 pages
Literature Review On Speech Recognition System
100% (3)
Literature Review On Speech Recognition System
4 pages
A Review On Image Feature Extraction and Representation Techniques
No ratings yet
A Review On Image Feature Extraction and Representation Techniques
13 pages
Este Es 1 Make 01 00031 PDF
No ratings yet
Este Es 1 Make 01 00031 PDF
17 pages
Speaker Verification Using Convolutional Neural Networks: Hossein Salehghaffari
No ratings yet
Speaker Verification Using Convolutional Neural Networks: Hossein Salehghaffari
6 pages
Speaker Recognation System Srs
No ratings yet
Speaker Recognation System Srs
23 pages
Speaker Recognition: Tony (Mr. T)
No ratings yet
Speaker Recognition: Tony (Mr. T)
10 pages
Speaker Recognition: SRT Project of Signal Processing
No ratings yet
Speaker Recognition: SRT Project of Signal Processing
27 pages
Final Report Complete PDF
No ratings yet
Final Report Complete PDF
26 pages
Autospeech: Neural Architecture Search For Speaker Recognition
No ratings yet
Autospeech: Neural Architecture Search For Speaker Recognition
5 pages
Voice Isolation Using Artificial Neural Network
No ratings yet
Voice Isolation Using Artificial Neural Network
7 pages
Generalized Method of Moments Estimation PDF
No ratings yet
Generalized Method of Moments Estimation PDF
29 pages
(IJCST-V10I3P32) :rizwan K Rahim, Tharikh Bin Siyad, Muhammed Ameen M.A, Muhammed Salim K.T, Selin M
No ratings yet
(IJCST-V10I3P32) :rizwan K Rahim, Tharikh Bin Siyad, Muhammed Ameen M.A, Muhammed Salim K.T, Selin M
6 pages
Regression Examples
No ratings yet
Regression Examples
26 pages
LPCC MFCC GFCC
No ratings yet
LPCC MFCC GFCC
5 pages
System: Power State Estimation
No ratings yet
System: Power State Estimation
7 pages
Probabilistic Matrix Factorization: Ruslan Salakhutdinov and Andriy Mnih
No ratings yet
Probabilistic Matrix Factorization: Ruslan Salakhutdinov and Andriy Mnih
8 pages
A Speaker Verification Method Based On TDNN-LSTMP PDF
No ratings yet
A Speaker Verification Method Based On TDNN-LSTMP PDF
15 pages
Bark08 Ghahramani Samlbb 01
No ratings yet
Bark08 Ghahramani Samlbb 01
26 pages
Kaldi Whitepaper PDF
No ratings yet
Kaldi Whitepaper PDF
4 pages
Speech Recognition Using HMM ANN Hybrid Model
No ratings yet
Speech Recognition Using HMM ANN Hybrid Model
4 pages
Automatic Speaker Recognition Using LPCC and MFCC
No ratings yet
Automatic Speaker Recognition Using LPCC and MFCC
4 pages
Speech Recognition Using Deep Learning Techniques
No ratings yet
Speech Recognition Using Deep Learning Techniques
5 pages
Methodology For Speaker Identification and Recognition System
100% (1)
Methodology For Speaker Identification and Recognition System
13 pages
Economics 2142 Time Series Analysis Syllabus
No ratings yet
Economics 2142 Time Series Analysis Syllabus
3 pages
Performance Improvement of Speaker Recognition System
No ratings yet
Performance Improvement of Speaker Recognition System
6 pages
A Review On Feature Extraction and Noise Reduction Technique
No ratings yet
A Review On Feature Extraction and Noise Reduction Technique
5 pages
Speaker Recognition System - v1
No ratings yet
Speaker Recognition System - v1
7 pages

Hexatalk Using ANN and DNNS

Uploaded by

Hexatalk Using ANN and DNNS

Uploaded by

Volume 10, Issue 4, April – 2025 International Journal of Innovative Science and Research Technology

ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/25apr1252

Hexatalk using ANN and DNNS

Publication Date: 2025/04/30

Keywords: Speaker Recognition, Deep Learning, MFCC, LSTM, Audio Classification.

I. INTRODUCTION II. EXISTING SYSTEMS

Recent advancements in deep learning have  Inability to Capture Temporal Relationships

IJISRT25APR1252 www.ijisrt.com 1789

 System Architecture  LSTM Layers

IJISRT25APR1252 www.ijisrt.com 1790

 Evaluation Module  Loss:

V. SIMULATION RESULTS  Temporal Pattern Recognition:

 Key Results VI. CONCLUSION

IJISRT25APR1252 www.ijisrt.com 1791

[1]. Yu, D., & Deng, L. Automatic Speech Recognition: A

IJISRT25APR1252 www.ijisrt.com 1792

You might also like