PHP Voice

The document discusses voice recognition systems and how they work by converting speech to text or commands. It describes the technology used including VoiceXML, which is an XML language for building voice applications. Potential applications of web-based voice recognition systems are also outlined.

Uploaded by

Sidharth Choubey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

49 views6 pages

PHP Voice

Uploaded by

Sidharth Choubey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Problem Statement: Web Based Voice recognition System .

Introduction & working of Voice recognition system :

Today, when we call most large companies, a person doesn't usually answer the phone.
Instead, an automated voice recording answers and instructs you to press buttons to move
through option menus. Many companies have moved beyond requiring you to press buttons,
though. OIten you can just speak certain words (again, as instructed by a recording) to get
what you need. The system that makes this possible is a type oI speech recognition program
-- an automated phone system.
You an also use speech recognition soItware in homes and businesses. A range oI soItware
products allows users to dictate to their computer and have their words converted to text in a
word processing or e-mail document. You can access Iunction commands, such as opening
Iiles and accessing menus, with voice instructions. Some programs are Ior speciIic business
settings, such as medical or legal transcription.
People with disabilities that prevent them Irom typing have also adopted speech-recognition
systems. II a user has lost the use oI his hands, or Ior visually impaired users when it is not
possible or convenient to use a Braille keyboard, the systems allow personal expression
through dictation as well as control oI many computer tasks. Some programs save users'
speech data aIter every session, allowing people with progressive speech deterioriation to
continue to dictate to their computers.
Current programs Iall into two categories:
Small-vocabulary/many-users
These systems are ideal Ior automated telephone answering. The users can speak with a great
deal oI variation in accent and speech patterns, and the system will still understand them most
oI the time. However, usage is limited to a small number oI predetermined commands and
inputs, such as basic menu options or numbers.
Large-vocabulary/limited-users
These systems work best in a business environment where a small number oI users will work
with the program. While these systems work with a good degree oI accuracy (85 percent or
higher with an expert user) and have vocabularies in the tens oI thousands, you must train
them to work best with a small number oI primary users. The accuracy rate will Iall
drastically with any other user.
Speech recognition systems made more than 10 years ago also Iaced a choice between
discrete and continuous speech. It is much easier Ior the program to understand words when
we speak them separately, with a distinct pause between each one. However, most users
preIer to speak in a normal, conversational speed. Almost all modern systems are capable oI
understanding continuous speech.
Speech to Data
To convert speech to on-screen text or a computer command, a computer has to go through
several complex steps. When you speak, you create vibrations in the air. The analog-to-
digital converter (ADC) translates this analog wave into digital data that the computer can
understand. To do this, it samples, or digitizes, the sound by taking precise measurements oI
the wave at Irequent intervals. The system Iilters the digitized sound to remove unwanted
noise, and sometimes to separate it into diIIerent bands oI frequency (Irequency is the
wavelength oI the sound waves, heard by humans as diIIerences in pitch). It also normalizes
the sound, or adjusts it to a constant volume level. It may also have to be temporally aligned.
People don't always speak at the same speed, so the sound must be adjusted to match the
speed oI the template sound samples already stored in the system's memory.

An ADC translates the analog waves of your voice into digital data by sampling the
sound. The higher the sampling and precision rates, the higher the quality.

ext the signal is divided into small segments as short as a Iew hundredths oI a second, or
even thousandths in the case oI plosive consonant sounds -- consonant stops produced by
obstructing airIlow in the vocal tract -- like "p" or "t." The program then matches these
segments to known phonemes in the appropriate language. A phoneme is the smallest
element oI a language -- a representation oI the sounds we make and put together to Iorm
meaningIul expressions. There are roughly 40 phonemes in the English language (diIIerent
linguists have diIIerent opinions on the exact number), while other languages have more or
Iewer phonemes.

The next step seems simple, but it is actually the most diIIicult to accomplish and is the is
Iocus oI most speech recognition research. The program examines phonemes in the context oI
the other phonemes around them. It runs the contextual phoneme plot through a complex
statistical model and compares them to a large library oI known words, phrases and
sentences. The program then determines what the user was probably saying and either outputs
it as text or issues a computer command.

Tecbnology USED :
PHP :

M?SCL

VUICE XML Introduction
InO, (IO,) ls Lhe W3Cs sLandard xML formaL for speclfylng lnLeracLlve volce dlalogues
beLween a human and a compuLer lL allows volce appllcaLlons Lo be developed and deployed ln an
analogous way Lo P1ML for vlsual appllcaLlons !usL as P1ML documenLs are lnLerpreLed by a vlsual
web browser volcexML documenLs are lnLerpreLed by a volce browser
Many commercial VoiceXML applications have been deployed, processing millions oI
telephone calls per day. These applications include: order inquiry, package tracking, driving
directions, emergency notiIication, wake-up, Ilight tracking, voice access to email, customer
relationship management, prescription reIilling, audio news magazines, voice dialing, real-
estate inIormation and national directory assistance applications.
VoiceXML has tags that instruct the voice browser to provide speech synthesis, automatic
speech recognition, dialog management, and audio playback. The Iollowing is an example oI
a VoiceXML document:
<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml"
<form
<block
<prompt
Hello world!
</prompt
</block
</form
</vxml
When interpreted by a VoiceXML interpreter this will output "Hello world" with synthesized
speech.
Typically, HTTP is used as the transport protocol Ior Ietching VoiceXML pages. Some
applications may use static VoiceXML pages, while others rely on dynamic VoiceXML page
generation using an application server like Tomcat, Weblogic, IIS, or WebSphere.
Historically, VoiceXML platIorm vendors have implemented the standard in diIIerent ways,
and added proprietary Ieatures. But the VoiceXML 2.0 standard, adopted as a W3C
Recommendation on 16 March 2004, clariIied most areas oI diIIerence. The VoiceXML
Forum, an industry group promoting the use oI the standard, provides a conIormance testing
process that certiIies vendors' implementations as conIormant.

Problem statement can be solved by the Iollowing strategy.

Applications :
1. Web based IVRS .
2. Technique can be used to make voice based secure login system.
3. Voice based search engines.
4. Voice based Online home automation system over IP networks.
Etc..

(Ebook) Effective Team Management with VSTS and TFS: A Guide for Scrum Masters by Chandrasekara, Chaminda, Yapa, Sanjaya ISBN 9781484235577, 1484235576 instant download
100% (5)
(Ebook) Effective Team Management with VSTS and TFS: A Guide for Scrum Masters by Chandrasekara, Chaminda, Yapa, Sanjaya ISBN 9781484235577, 1484235576 instant download
58 pages
02 Govt Sa Telugu 16.06.2024
No ratings yet
02 Govt Sa Telugu 16.06.2024
2 pages
Sunrise Model
100% (5)
Sunrise Model
3 pages
Battleship Past Simple Past Continuous
No ratings yet
Battleship Past Simple Past Continuous
1 page
auditory memory span
No ratings yet
auditory memory span
13 pages
Past Simple and Present Perfect
No ratings yet
Past Simple and Present Perfect
3 pages
NCP Pedia
40% (5)
NCP Pedia
2 pages
ER Diagram Advanced Practices
No ratings yet
ER Diagram Advanced Practices
4 pages
Creative Writing
No ratings yet
Creative Writing
135 pages
English8 Q1M1
No ratings yet
English8 Q1M1
7 pages
Peserta Belum Pilih Tilok CPPPK T.A. 2022
No ratings yet
Peserta Belum Pilih Tilok CPPPK T.A. 2022
21 pages
Believe in Revealed Books MS, Muhammad Arsalan Hussaini
No ratings yet
Believe in Revealed Books MS, Muhammad Arsalan Hussaini
5 pages
Rish SI-101 CONFIGURATION SETTING FOR INPUT SELECTION
No ratings yet
Rish SI-101 CONFIGURATION SETTING FOR INPUT SELECTION
9 pages
Present S-Form Progressive Simple Past Participle
No ratings yet
Present S-Form Progressive Simple Past Participle
14 pages
SFF24 - Program - Guide-SUNDANCE 2024
No ratings yet
SFF24 - Program - Guide-SUNDANCE 2024
53 pages
Bridge To Terabithia Study Guide HHE
No ratings yet
Bridge To Terabithia Study Guide HHE
7 pages
Syphilis Is A Sexually Transmitted Disease
No ratings yet
Syphilis Is A Sexually Transmitted Disease
5 pages
Teaching Plan - Dental Hygiene
100% (1)
Teaching Plan - Dental Hygiene
5 pages
Instructional Module 1 in Purposive Communication
No ratings yet
Instructional Module 1 in Purposive Communication
8 pages
Annulment and Divorce
100% (1)
Annulment and Divorce
3 pages
Placing
No ratings yet
Placing
2 pages
Benefits of Registering
No ratings yet
Benefits of Registering
3 pages
PPPP P P PPPPP PP P PP PPPP PPPPP P PPPPPPP PPPPPPPP P P PPP PPP PP PP
No ratings yet
PPPP P P PPPPP PP P PP PPPP PPPPP P PPPPPPP PPPPPPPP P P PPP PPP PP PP
4 pages
Pollination
No ratings yet
Pollination
5 pages
Difference Between RAM and ROM
100% (3)
Difference Between RAM and ROM
7 pages
Inaugural Address
No ratings yet
Inaugural Address
7 pages
Technical Report: TCP, UDP, and Sockets: The Service-Level Specification
No ratings yet
Technical Report: TCP, UDP, and Sockets: The Service-Level Specification
305 pages
P P PPPPPPPPPPPPPPPPP P P P: PP PPP PPPPP PP!
No ratings yet
P P PPPPPPPPPPPPPPPPP P P P: PP PPP PPPPP PP!
9 pages
Intel GFX
No ratings yet
Intel GFX
12 pages
Baptism Symbols
No ratings yet
Baptism Symbols
2 pages
Chapter 1
No ratings yet
Chapter 1
7 pages
Explain The Difference Between HRM and HRD
No ratings yet
Explain The Difference Between HRM and HRD
3 pages
Chapter 1
No ratings yet
Chapter 1
3 pages
Info of Album
No ratings yet
Info of Album
3 pages
1.1 SRS Version
No ratings yet
1.1 SRS Version
13 pages
Commitment
No ratings yet
Commitment
14 pages
Narrative Cinder Ella Story
No ratings yet
Narrative Cinder Ella Story
5 pages
Investments Avenues 3
No ratings yet
Investments Avenues 3
6 pages
Ethical Issues Regarding Abortion
No ratings yet
Ethical Issues Regarding Abortion
2 pages
Company Secretary
No ratings yet
Company Secretary
7 pages
The Curse
No ratings yet
The Curse
31 pages
Ethics
No ratings yet
Ethics
3 pages
Budget
No ratings yet
Budget
41 pages
1 Respiration
No ratings yet
1 Respiration
11 pages
Resume Writing Articles
No ratings yet
Resume Writing Articles
5 pages
Agamoddharak Shri Ghasilalji Maharaj 250418 STD
No ratings yet
Agamoddharak Shri Ghasilalji Maharaj 250418 STD
4 pages
FNCP
No ratings yet
FNCP
6 pages
SI Output
No ratings yet
SI Output
7 pages
Definition of Vector
No ratings yet
Definition of Vector
4 pages
CPP PP PPP PPPPP P
No ratings yet
CPP PP PPP PPPPP P
1 page
PPE
No ratings yet
PPE
3 pages
Jomon Mathew SAP C
No ratings yet
Jomon Mathew SAP C
3 pages
P PPPP P
No ratings yet
P PPPP P
2 pages
PP PPPPP PPPPP PP P (
No ratings yet
PP PPPPP PPPPP PP P (
4 pages
Research and Methodology
No ratings yet
Research and Methodology
5 pages
C M D
No ratings yet
C M D
13 pages
Windows Explorer
No ratings yet
Windows Explorer
3 pages
Saudi Arabia Electronic Invoicing SAP Integration Suite (SAP ERP, SAP S4HANA) - Cloud Foundry
No ratings yet
Saudi Arabia Electronic Invoicing SAP Integration Suite (SAP ERP, SAP S4HANA) - Cloud Foundry
28 pages
Decision Tree
No ratings yet
Decision Tree
7 pages
Unit 52 Verb + To
No ratings yet
Unit 52 Verb + To
5 pages
Journal On Teachable Moments
No ratings yet
Journal On Teachable Moments
2 pages
Benefits of FDI
No ratings yet
Benefits of FDI
4 pages
Informe 1 Tulua
No ratings yet
Informe 1 Tulua
6 pages
Driving Without Wheels
No ratings yet
Driving Without Wheels
14 pages
Mock P1
No ratings yet
Mock P1
5 pages
Difference Between Soluble Fiber and Insoluble Fiber
No ratings yet
Difference Between Soluble Fiber and Insoluble Fiber
5 pages
Computer
No ratings yet
Computer
5 pages
Fema 2000
No ratings yet
Fema 2000
5 pages
Gandhi
No ratings yet
Gandhi
8 pages
Elements of Poetry
No ratings yet
Elements of Poetry
5 pages
Dragon's Breath: Mastering Voice Recognition in the Digital Age
From Everand
Dragon's Breath: Mastering Voice Recognition in the Digital Age
Pasquale De Marco
No ratings yet
NCP Fracture
0% (1)
NCP Fracture
3 pages
Scoliosis
No ratings yet
Scoliosis
4 pages
PP PP P PPP PP
No ratings yet
PP PP P PPP PP
5 pages
Introduction To Psychologyt
No ratings yet
Introduction To Psychologyt
32 pages
PPPPPPPP PPPP PP P P PPPPPPPPPPPPPPPP
No ratings yet
PPPPPPPP PPPP PP P P PPPPPPPPPPPPPPPP
10 pages
For FinalExam
No ratings yet
For FinalExam
65 pages
English Assignment 1
No ratings yet
English Assignment 1
13 pages
Omens in Alchemist
0% (1)
Omens in Alchemist
2 pages
Da Lo Radius
No ratings yet
Da Lo Radius
8 pages
Computer Science Sample Paper
No ratings yet
Computer Science Sample Paper
16 pages
Literary Works in History of English
No ratings yet
Literary Works in History of English
30 pages
Action Plan Catch-Up Fridays
No ratings yet
Action Plan Catch-Up Fridays
6 pages
Critical Care Nursing
No ratings yet
Critical Care Nursing
3 pages
Struts 2 Framework Tutorial: Filterdispatcher Respectively
No ratings yet
Struts 2 Framework Tutorial: Filterdispatcher Respectively
26 pages
Dawar Shoe Recruitment and Selection
No ratings yet
Dawar Shoe Recruitment and Selection
131 pages
Speech Generating Device: Fundamentals and Applications
From Everand
Speech Generating Device: Fundamentals and Applications
Fouad Sabry
No ratings yet
Une Parole Circule No13
No ratings yet
Une Parole Circule No13
8 pages
Speech Recognition: Fundamentals and Applications
From Everand
Speech Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Natural Language User Interface: Fundamentals and Applications
From Everand
Natural Language User Interface: Fundamentals and Applications
Fouad Sabry
No ratings yet

PHP Voice

Uploaded by

PHP Voice

Uploaded by

Problem Statement: Web Based Voice recognition System .

Introduction & working of Voice recognition system :

You might also like