IEEE Paper
Abstract—The document discusses the design and implementation of an algorithm to predict the gender of an individual based on features extracted from pre-recorded audio data. The algorithm utilizes pitch and energy as the key features and sets predefined thresholds for classification. The project team has implemented the algorithm using MATLAB and achieved efficiency rates of 81.1% for male voice prediction and 72.5% for female voice prediction. The document provides a flow chart outlining the steps involved in the process, including audio recording, pre-processing and filtering, feature extraction, gender classification, and displaying the predicted gender. The document also suggests exploring more advanced signal processing techniques or machine learning models for better accuracy if the results are not satisfactory. Two external references are provided for further information on gender-based speaker recognition and GMM models.

Keywords— Prediction, efficiency, filtering, audio.

paper provides implementation details, result analysis, and suggestions for further improvement.

II. PROPOSED METHOD

The proposed methodology for the topic of "Gender Recognition from Audio" involves the following steps:

2.1 Audio Recording:

Utilize the audio recorder to capture audio input for a predefined duration (e.g., 5 seconds). Adjust the recording time based on the application requirements.

2.2 Preprocessing:

Apply any necessary preprocessing steps, such as filtering out noise from the recorded audio signal. This may involve techniques like noise reduction or filtering.
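
As a rough illustration of the noise-filtering step, the sketch below smooths a signal with a centered moving-average (low-pass) filter. It is written in Python for illustration only and is not the MATLAB code used in this work. The filter type, the window length, the 8 kHz sampling rate, and the synthetic noisy tone are all assumptions chosen to keep the example self-contained.

```python
import math
import random


def moving_average(signal, window=5):
    """Smooth a signal with a centered moving-average (low-pass) filter.

    Near the edges the window is shortened so the output has the same
    length as the input.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    half = window // 2
    smoothed = []
    for i in range(len(signal)):
        lo = max(0, i - half)
        hi = min(len(signal), i + half + 1)
        smoothed.append(sum(signal[lo:hi]) / (hi - lo))
    return smoothed


def mean_squared_error(a, b):
    """Average squared difference between two equal-length signals."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Assumed values for illustration: 8 kHz sampling, a 120 Hz tone, 0.1 s of audio.
SAMPLE_RATE = 8000
random.seed(0)
clean = [math.sin(2 * math.pi * 120 * n / SAMPLE_RATE) for n in range(800)]
noisy = [x + random.gauss(0, 0.3) for x in clean]
filtered = moving_average(noisy, window=9)

print("error before filtering:", mean_squared_error(noisy, clean))
print("error after filtering: ", mean_squared_error(filtered, clean))
```

A longer window removes more noise but also attenuates higher frequencies, so the window length would need to be tuned against the pitch range of interest.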
Use audio visualization tools to gain insights into the characteristics of the audio signal. The code includes a plot of the audio data, which can be helpful for debugging and analysis.

$$\text{Efficiency} = \frac{\text{Correct Predictions}}{\text{Total Predictions}} \times 100\%$$

Overall efficiency after taking 50 male and 50 female samples:

Efficiency of Male Voice Prediction: 81.1%

Efficiency of Female Voice Prediction: 72.5%

Step 5: Test on Diverse Dataset:
Task: Evaluate the system on a diverse set of audio samples.
Considerations: Include a variety of speakers, accents, and environmental conditions to ensure robustness.

V. CONCLUSION

The presented document details the design and
implementation of an algorithm aimed at predicting the
gender of individuals based on features extracted from
pre-recorded audio data. The algorithm relies on pitch and
energy as primary features and employs predefined
thresholds for classification. Implemented using MATLAB,
the system achieved efficiency rates of 81.1% for male
voice prediction and 72.5% for female voice prediction.
The document outlines a systematic process through a flow
chart, encompassing key stages such as audio recording,
pre-processing and filtering, feature extraction, gender
classification, and displaying the predicted gender. This
structured approach allows for a clear understanding of the
system's workflow.
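
The feature extraction, classification, and efficiency steps of this workflow can be sketched roughly as follows. This is a minimal Python illustration, not the MATLAB implementation described in this paper. The autocorrelation pitch estimator, the 165 Hz pitch threshold, the use of energy as a silence gate, the 8 kHz sampling rate, and the synthetic test tones are all assumptions, since the actual thresholds and the exact role of energy are not stated here.

```python
import math

SAMPLE_RATE = 8000          # Hz (assumed)
PITCH_THRESHOLD_HZ = 165.0  # assumed boundary between male and female pitch
ENERGY_FLOOR = 1e-4         # assumed minimum energy for a usable frame


def short_time_energy(frame):
    """Mean squared amplitude of the frame."""
    return sum(x * x for x in frame) / len(frame)


def estimate_pitch(frame, sample_rate=SAMPLE_RATE, fmin=50.0, fmax=400.0):
    """Estimate pitch from the autocorrelation peak within [fmin, fmax]."""
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(frame) - 1)
    best_lag, best_corr = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        corr = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if corr > best_corr:
            best_lag, best_corr = lag, corr
    return sample_rate / best_lag


def classify_gender(frame):
    """Threshold classifier: energy gates out silence, pitch decides the label."""
    if short_time_energy(frame) < ENERGY_FLOOR:
        return "unknown"
    return "male" if estimate_pitch(frame) < PITCH_THRESHOLD_HZ else "female"


def efficiency(predicted, actual):
    """Efficiency = correct predictions / total predictions * 100%."""
    correct = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * correct / len(actual)


def tone(freq_hz, n_samples=800, amplitude=0.5):
    """Synthetic stand-in for a voiced frame (a pure tone, not real speech)."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * n / SAMPLE_RATE)
            for n in range(n_samples)]


# Hypothetical labeled samples; real recordings would replace these tones.
samples = [(tone(110), "male"), (tone(130), "male"),
           (tone(210), "female"), (tone(240), "female")]
predicted = [classify_gender(frame) for frame, _ in samples]
actual = [label for _, label in samples]

print(predicted)
print(efficiency(predicted, actual))
```

Pure tones are far easier to classify than real speech, so the efficiency this sketch prints says nothing about the 81.1% and 72.5% figures reported for the recorded samples.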
While the implemented algorithm demonstrates reasonable efficiency, the document suggests potential avenues for improvement. Specifically, the exploration of more advanced signal processing techniques or machine learning models is recommended, particularly if the achieved results fall short of expectations. This forward-thinking approach reflects an acknowledgment of the evolving nature of audio-based gender prediction systems and the continuous pursuit of enhanced accuracy.

Furthermore, the inclusion of external references on gender-based speaker recognition and Gaussian Mixture Models (GMMs) [3] demonstrates a commitment to leveraging existing knowledge and methodologies within the field.

ACKNOWLEDGMENTS

I would like to express my deepest gratitude to my college professors, Dr. Kiran TALELE and Dr. Reena Sonkusare, for their unwavering support and guidance throughout the development of this research project. Their expertise, encouragement, and patience have been instrumental in shaping my understanding of “Audio steganography using LBC, ECB, and Wavelet Transform”.

I am particularly grateful to Dr. Kiran TALELE for their invaluable mentorship and insightful feedback, which helped me refine my research methodology and strengthen the overall quality of my work. Their dedication to teaching and their passion for research have been a constant source of inspiration for me.

I also extend my sincere thanks to Dr. Reena Sonkusare for their encouragement and willingness to share their knowledge and expertise. Their guidance has been instrumental in helping me overcome obstacles and reach new levels of understanding in “Audio steganography using LBC, ECB, and Wavelet Transform”.

The insights and support of both professors have been invaluable throughout this research journey. I am truly honoured to have had the opportunity to learn from such dedicated and passionate educators.

REFERENCES

[1] B. Jena, A. Mohanty, and S. K. Mohanty, "Gender Recognition of Speech Signal using KNN and SVM," in International Conference on Smart Data Intelligence (ICSMDI 2021), 2021. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3852607

[2] P. Kumar, N. Jakhanwal, A. Bhowmick, and M. Chandra, "Gender Classification Using Pitch and Formants," in Proceedings of the IEEE International Conference on Pattern Recognition, December 2008, pp. 1-4. https://www.researchgate.net/publication/220846517_Gender_classification_using_pitch_and_formants

[3] M. Gupta, S. S. Bharti, and S. Agarwal, "Gender-based speaker recognition from speech signals using GMM model," Modern Physics Letters B, vol. 33, no. 35, Art. no. 1950438, November 2019. DOI: 10.1142/S0217984919504384. https://www.researchgate.net/publication/337227020_Gender-based_speaker_recognition_from_speech_signals_using_GMM_model