0% found this document useful (0 votes)
26 views1 page

Speech Recognition: Jump To Navigationjump To Search

Speech recognition is a field that develops technologies to recognize and translate spoken language into text. Some systems require training a specific speaker's voice to increase accuracy for that person, while others are speaker independent. Speech recognition has applications in voice interfaces, search, data entry, document preparation, and aircraft controls. It incorporates knowledge from computer science, linguistics, and computer engineering.

Uploaded by

Blessen Thomas
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
26 views1 page

Speech Recognition: Jump To Navigationjump To Search

Speech recognition is a field that develops technologies to recognize and translate spoken language into text. Some systems require training a specific speaker's voice to increase accuracy for that person, while others are speaker independent. Speech recognition has applications in voice interfaces, search, data entry, document preparation, and aircraft controls. It incorporates knowledge from computer science, linguistics, and computer engineering.

Uploaded by

Blessen Thomas
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

Speech recognition

From Wikipedia, the free encyclopedia

Jump to navigationJump to search

For the human linguistic concept, see Speech perception.

"Speech to text" redirects here. For the human role, see Speech-to-text reporter.

Speech recognition is an interdisciplinary subfield of computer science and computational


linguistics that develops methodologies and technologies that enable the recognition
and translation of spoken language into text by computers. It is also known as automatic speech
recognition (ASR), computer speech recognition or speech to text (STT). It incorporates
knowledge and research in the computer science, linguistics and computer engineering fields.
Some speech recognition systems require "training" (also called "enrollment") where an individual
speaker reads text or isolated vocabulary into the system. The system analyzes the person's specific
voice and uses it to fine-tune the recognition of that person's speech, resulting in increased
accuracy. Systems that do not use training are called "speaker independent"[1] systems. Systems
that use training are called "speaker dependent".
Speech recognition applications include voice user interfaces such as voice dialing (e.g. "call
home"), call routing (e.g. "I would like to make a collect call"), domotic appliance control, search key
words (e.g. find a podcast where particular words were spoken), simple data entry (e.g., entering a
credit card number), preparation of structured documents (e.g. a radiology report), determining
speaker characteristics,[2] speech-to-text processing (e.g., word processors or emails),
and aircraft (usually termed direct voice input).
The term voice recognition[3][4][5] or speaker identification[6][7] refers to identifying the speaker, rather
than what they are saying. Recognizing the speaker can simplify the task of translating speech in
systems that have been trained on a specific person's voice or it can be used to authenticate or
verify the identity of a speaker as part of a security process.
From the technology perspective, speech recognition has a long history with several waves of major
innovations. Most recently, the field has benefited from advances in deep learning and big data. The
advances are evidenced not only by the surge of academic papers published in the field, but more
importantly by the worldwide industry adoption of a variety of deep learning methods in designing
and deploying speech recognition systems.

You might also like