Corresponding Author:
Tobiloba Emmanuel Somefun
Department of Electrical and Information Engineering
Covenant University
Canaan Land, KM 10, Idiroko Road, P. M. B. 1023, Ota, Ogun State, Nigeria
Email: [email protected]
1. INTRODUCTION
Speech is the primary means of human communication, and its production involves many processes. Several body parts aid in the production of speech beyond the commonly known ones such as the tongue, mouth and lips: the lungs, trachea, larynx, vocal cords, oral cavity and nasal cavity are all highly involved [1, 2]. Human speech is produced by the flow of air from the lungs through the larynx and out through the nasal and oral cavities. Vowel sounds are produced when air flowing from the lungs through the vocal cords makes them vibrate [3]. Consonants are produced when air is pressed through a constriction in the vocal tract, resulting in turbulent airflow. Sounds are thus produced by the vibration of the vocal cords [4], and each sound, word or speech vibrates differently; the frequency of this vibration is called pitch. Reference [5] introduced the source-filter theory of speech production, which explains how speech is produced. According to [5], speech production occurs in two stages. In the first stage, air flows through the vocal cords to produce a basic signal. This basic signal is known as the source signal.
Speaker recognition is the process of recognising a speaker from the unique information present in the speech waveform. This technique uses the speaker's voice to verify the identity of the speaker and to control access to services such as voice dialling, security, information services, remote access to a computer, purchases and so on. Many physically challenged (e.g., blind or lame) and aged persons in society have a limited capacity to perform certain tasks due to their physical and environmental conditions [6, 7]. Most often they require human help in several of their activities, which usually costs a huge sum if the helper is not a family member, and persons who render such services are very few [8, 9]. This work seeks to help physically challenged or disabled individuals to perform the most basic tasks, such as opening doors, turning electrical devices on/off, calling a mobile line, automating activities and much more, through the use of voice. It acts like a telecommunication service that attends to the needs of the disabled via automation [10, 11]. With the recent trend of automation as a means of control in different areas [12-16], this work deems it fit to integrate automation to meet some of the needs of disabled individuals. The proposed model in this study is limited to the sound or speech recognition mode of authentication. Although other authentication modes exist for gaining access, such as RFID [17-19], biometrics [20-22], PIN [23, 24] or a combination of these [25-27], this study focuses on voice recognition.
3. DESIGN SPECIFICATIONS
The design specification of the speech recognition module for access control deals with the conditions necessary for the module to function optimally. For this work, two types of design specifications are considered, namely hardware and software specifications.
where the index n refers to time nT, which means that X_n(ω) = X(nT, ω). By the inverse Fourier transform, the speech signal x(t) is recovered as shown in (2):

x_n = \frac{1}{2\pi w_0} \int_{-\pi}^{\pi} X_n(\omega) \, e^{j\omega n} \, d\omega \qquad (2)

where w_0 is the value of the analysis window at the origin.
Since X_n(ω) is a function of time and changes as time changes, it is sampled at a rate that allows the speech signal x(t) to be reconstructed. With the bandwidth B_x of the speech signal x(t) being approximately equal to 5 kHz, the sampling frequency F_s is therefore 10 kHz. For the Hamming window w_n of length N = 100, using (3), the bandwidth B is found to be:

B = \frac{2 F_s}{N} \qquad (3)

B = \frac{20{,}000}{100} = 200 \text{ Hz}
The Nyquist rate for the short-time Fourier transform is twice the bandwidth B; therefore, the Nyquist rate equals 400 Hz. Hence, at F_s = 10,000 Hz, a value of X_n(ω_k) is required every 25 samples. Since N = 100, the windows should overlap by 75% [28].
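To make these framing parameters concrete, the short Python/NumPy sketch below (our illustration, not part of the original design) windows a 10 kHz signal with a length-100 Hamming window and a hop of 25 samples, i.e., the 75% overlap derived above.

```python
import numpy as np

FS = 10_000   # sampling frequency (Hz), from B_x ≈ 5 kHz
N = 100       # Hamming window length, so B = 2*FS/N = 200 Hz
HOP = 25      # Nyquist rate 2B = 400 Hz, so FS/400 = 25 samples (75% overlap)

def stft_frames(x):
    """Return the short-time spectra X_n(w): one windowed spectrum per frame."""
    window = np.hamming(N)
    n_frames = 1 + (len(x) - N) // HOP
    frames = np.stack([x[i * HOP : i * HOP + N] * window for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

# Example: one second of a 440 Hz tone sampled at 10 kHz
t = np.arange(FS) / FS
X = stft_frames(np.sin(2 * np.pi * 440 * t))
print(X.shape)   # (number of frames, N//2 + 1)
```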
Multivariate data are observations made on more than one variable. In (4), P(x) is the probability density function, μ is the mean vector (a d × 1 matrix) and Σ is the covariance matrix (a d × d matrix) of the normally distributed random variable X. The mean vector (expected vector) is as shown in (5):

\mu \triangleq E(X) \triangleq \int_{-\infty}^{\infty} x \, P(x) \, dx \qquad (5)
where N is the number of samples and X_i are the mel-cepstral feature vectors.
The expression for the variance-covariance matrix of a multi-dimensional random variable is described in (7):

\Sigma = \frac{1}{N-1} \sum_{i=1}^{N} (X_i - \mu)(X_i - \mu)^T = \frac{1}{N-1}\left[S_{xx} - N\mu\mu^T\right] \qquad (7)
where the sample mean μ is obtained from (5) and the second-order sum matrix S_xx is as shown in (8) [28].
S_{xx} = \sum_{i=1}^{N} X_i X_i^T \qquad (8)
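As a numerical illustration of (5), (7) and (8), the following sketch (our own, assuming the mel-cepstral feature vectors are stacked as rows of a NumPy array; the variable names are illustrative) computes the sample mean, the second-order sum matrix and the covariance matrix, and checks the result against the direct definition.

```python
import numpy as np

def gaussian_stats(X):
    """Sample mean, second-order sum matrix S_xx and covariance of feature vectors.

    X: (N, d) array with one mel-cepstral feature vector per row.
    """
    N = X.shape[0]
    mu = X.mean(axis=0)                              # sample estimate of the mean vector (5)
    S_xx = X.T @ X                                   # second-order sum matrix (8)
    cov = (S_xx - N * np.outer(mu, mu)) / (N - 1)    # covariance matrix (7)
    # Sanity check against the direct (N-1)-normalised covariance definition
    assert np.allclose(cov, np.cov(X, rowvar=False))
    return mu, cov

mu, cov = gaussian_stats(np.random.randn(500, 13))   # e.g. 500 frames of 13 MFCCs
print(mu.shape, cov.shape)                           # (13,) (13, 13)
```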
Once the training data has been processed and the speaker-independent model, saved as the prior statistics, has been assembled, data from many speakers is used to refine the Gaussian parameters and coefficients using standard procedures, for example, maximum likelihood estimation (MLE), maximum a posteriori (MAP) adaptation and maximum likelihood linear regression (MLLR). The framework is then ready for enrollment. Enrollment is completed by taking a sample of the target speaker's voice and adapting the model so that it best fits this sample. This guarantees that the probabilities returned when matching a similar sample against the adapted model are maximised.
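A minimal sketch of this enrollment-and-verification idea is given below, using scikit-learn's GaussianMixture as a stand-in for the speaker model described here; the MFCC extraction with librosa, the file names and the decision threshold are our assumptions rather than details from this work.

```python
import numpy as np
import librosa
from sklearn.mixture import GaussianMixture

def mfcc_features(path, sr=10_000, n_mfcc=13):
    """Mel-cepstral feature vectors (frames x coefficients) for one utterance."""
    y, _ = librosa.load(path, sr=sr)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).T

# Enrollment: fit a Gaussian mixture to the target speaker's voice sample(s)
enroll = np.vstack([mfcc_features(p) for p in ["enroll_1.wav", "enroll_2.wav"]])
speaker_model = GaussianMixture(n_components=8, covariance_type="diag").fit(enroll)

# Verification: a test utterance is accepted if its average log-likelihood
# under the enrolled speaker model exceeds a chosen threshold.
test = mfcc_features("test.wav")
score = speaker_model.score(test)                 # mean log-likelihood per frame
print("accept" if score > -45.0 else "reject")    # threshold is illustrative only
```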
Q = q_1 q_2 … q_N, a set of N states
\sum_{j=1}^{N} a_{ij} = 1 \quad \forall i

\sum_{i=1}^{N} \pi_i = 1
The probability that the Markov chain will begin in state i is π_i. The flow chart for the process is given in Figure 5.
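The two stochastic constraints above can be verified directly. The sketch below builds a small toy Markov chain (the values are purely illustrative, not taken from this work), checks that every row of A and the vector π sum to one, and draws a state sequence from it.

```python
import numpy as np

# Toy 3-state chain (illustrative values only)
A = np.array([[0.7, 0.2, 0.1],     # a_ij: transition probabilities, each row sums to 1
              [0.3, 0.5, 0.2],
              [0.2, 0.3, 0.5]])
pi = np.array([0.6, 0.3, 0.1])     # pi_i: initial state probabilities, sums to 1

assert np.allclose(A.sum(axis=1), 1.0)   # sum_j a_ij = 1 for every state i
assert np.isclose(pi.sum(), 1.0)         # sum_i pi_i = 1

def sample_state_sequence(A, pi, length, rng=np.random.default_rng(0)):
    """Draw a state sequence q_1 ... q_T from the Markov chain defined by (A, pi)."""
    states = [rng.choice(len(pi), p=pi)]
    for _ in range(length - 1):
        states.append(rng.choice(len(pi), p=A[states[-1]]))
    return states

print(sample_state_sequence(A, pi, length=10))
```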
Figure 6. Results of tests carried out on the same speaker under different conditions
where
A = A condition of crowded place with background noise;
B = A condition of silent place with little or no background noise;
C = A condition such that the speaker’s voice was low; and
D = A condition such that the speaker’s voice was loud
Table 1 shows the accuracy of samples taken.
ACKNOWLEDGEMENTS
The authors acknowledge Covenant University for her financial support.
REFERENCES
[1] D. Blischak, et al., "Use of speech-generating devices: In support of natural speech," Augmentative and Alternative Communication, vol. 19, no. 1, pp. 29-35, 2003.
[2] M. Mills, "Aid for speech therapy and a method of making same," Google Patents, 1984.
[3] K. N. Stevens and A. S. House, "An acoustical theory of vowel production and some of its implications," Journal of
Speech and Hearing Research, vol. 4, pp. 303-320, 1961.
[4] K. Nishikawa, et al., "Speech planning of an anthropomorphic talking robot for consonant sounds production," in
Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No. 02CH37292), Washington,
DC, USA, vol. 2, 2002, pp. 1830-1835.
[5] G. Fant, "The source filter concept in voice production," STL-QPSR, vol. 1, pp. 21-37, 1981.
[6] A. Ismail, S. Abdlerazek, and I. M. El-Henawy, "Development of Smart Healthcare System Based on Speech
Recognition Using Support Vector Machine and Dynamic Time Warping," Sustainability, vol. 12, no. 6, p. 2403, 2020.
[7] R. Gonzalez, et al., "Voice Recognition System to Support Learning Platforms Oriented to People with Visual Disabilities," in International Conference on Universal Access in Human-Computer Interaction, 2016, pp. 65-72.
[8] T. Gomi and A. Griffith, "Developing intelligent wheelchairs for the handicapped," in Assistive Technology and
Artificial Intelligence, Springer, pp. 150-178, 1998.
[9] R. C. Handel, "The role of the advocate in securing the handicapped child's right to an effective minimal
education," Ohio State University, vol. 36, p. 349, 1975.
[10] T. E. Somefun, C. O. A. Awosope, and C. Sika, "Development of a research project repository," TELKOMNIKA
Telecommunication, Computing, Electronics and Control, vol. 18, no. 1, pp. 156-165, 2020.
[11] A. Ademola, T. Somefun, A. Agbetuyi, and A. Olufayo, "Web based fingerprint roll call attendance management
system," International Journal of Electrical and Computer Engineering (IJECE), vol. 9, no. 5, pp. 4364-4371, 2019.
[12] Y. Yamazaki and J. Maeda, "The SMART system: an integrated application of automation and information
technology in production process," Computers in Industry, vol. 35, no. 1, pp. 87-99, 1998.
[13] L. Kocúrová, I. S. Balogh, and V. Andruch, "Solvent microextraction: a review of recent efforts at automation,"
Microchemical Journal, vol. 110, pp. 599-607, 2013.
[14] S. E. Shladover and C. Systematics, "Recent international activity in cooperative vehicle-highway automation
systems," United States. Federal Highway Administration. Office of Corporate Research, pp. 1-95, 2012.
[15] C. von Altrock and J. Gebhardt, "Recent successful fuzzy logic applications in industrial automation," in
Proceedings of IEEE 5th International Fuzzy Systems, New Orleans, LA, USA, vol. 3, pp. 1845-1851, 1996.
[16] L. Kamelia, S. A. Noorhassan, M. Sanjaya, and W. E. Mulyana, "Door-automation system using bluetooth-based
android for mobile phone," ARPN Journal of Engineering and Applied Sciences, vol. 9, no. 10, pp. 1759-1762, 2014.
[17] A. Abdulkareem, I. U. Dike, and F. Olowononi, "Development of a radio frequency identification based attendance
management application with a pictorial database framework," International Journal of Research in Information
Technology (IJRIT), vol. 2, no. 4, pp. 621-628, 2014.
[18] A. Juels, "RFID security and privacy: A research survey," IEEE Journal on Selected Areas in Communications, vol. 24, no. 2, pp. 381-394, 2006.
[19] A. Abdulkareem, C. Awosope, and A. Tope-Ojo, "Development and implementation of a miniature RFID system in
a shopping mall environment," International Journal of Electrical and Computer Engineering (IJECE), vol. 9,
no. 2, pp. 1374-1378, 2019.
[20] M. Lourde and D. Khosla, "Fingerprint Identification in Biometric Security Systems," International Journal of Computer and Electrical Engineering, vol. 2, no. 5, pp. 852-855, 2010.
[21] D. Bhattacharyya, R. Ranjan, F. Alisherov, and M. Choi, "Biometric authentication: A review," International
Journal of u-and e-Service, Science and Technology, vol. 2, no. 3, pp. 13-28, 2009.
[22] N. L. Clarke, S. M. Furnell, and P. L. Reynolds, "Biometric authentication for mobile devices," IEEE Security &
Privacy, vol. 13, pp. 70-73, 2015.
[23] T. Van Nguyen, N. Sae-Bae, and N. Memon, "DRAW-A-PIN: Authentication using finger-drawn PIN on touch
devices," computers & security, vol. 66, pp. 115-128, 2017.
[24] J. Saville, "Authentication of PIN-Less Transactions," Google Patents, 2008.
[25] W. Shatford, "Biometric based authentication system with random generated PIN," Google Patents, 2006.
[26] F. Okumura, A. Kubota, Y. Hatori, K. Matsuo, M. Hashimoto, and A. Koike, "A study on biometric authentication
based on arm sweep action with acceleration sensor," in 2006 International Symposium on Intelligent Signal
Processing and Communications, Tottori, 2006, pp. 219-222.
[27] Y. Li, J. Yang, M. Xie, D. Carlson, H. G. Jang, and J. Bian, "Comparison of PIN-and pattern-based behavioral
biometric authentication on mobile devices," in MILCOM 2015-2015 IEEE Military Communications Conference,
Tampa, FL, 2015, pp. 1317-1322.
[28] S. E. Levinson, "Mathematical models for speech technology," Wiley Online Library, 2005.
[29] S. K. Patel, J. M. Dhodiya, and D. C. Joshi, "Mathematical Model Based on Human Speech Recognition and Body
Recognition," International Journal of Engineering Research & Technology (IJERT), vol. 1, no. 4, pp. 1-5, 2012.
[30] L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proceedings
of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.