Speech Processing System
Speech Processing System
Dr P. Mythili Sharan
Architect(Speech)
Nuronics Labs
Agenda • Natural language understanding
• Natural language generation for dialogue
systems
• Use of speech synthesizers in dialogue
systems
• Dialogue system evaluation
• Stochastic approach to dialogue
• Dialogue policy design and training
• MDP-POMDP Reinforcement learning
• Simulated users
• Microsoft SDK
• Case studies related to it
Natural language understanding (NLU) is a branch of artificial
intelligence that uses computer software to understand input in the
form of sentences using text or speech.
Example
▪ The space that belongs to you has green ambient lighting;
▪ You are driving through a green traffic signal;
▪ You no longer have to wait to continue driving;
▪ You are permitted to proceed in a non-driving context;
You have a green light ▪ Your body is cast in a greenish glow;
▪ You possess a light source which radiates green; or
“it is difficult to infer meaning without
knowing the context, the identity of the ▪ You possess a light with a green surface.
speaker or the speaker's intent.”
How does natural language understanding work
Example
With rule-based frameworks, the developer is Provide browser-based consoles which assist
responsible for implementing the core logic to developers in uploading and annotating training
interpret incoming messages and return helpful examples. They also streamline the task of
responses. This logic generally consists of a series of launching a cloud-based web service to handle and
rules that specify which scripted response to return parse natural language requests.
for a message that matches a specified pattern. NLP services currently available are Amazon Lex ,
Limitations: Narrow vocabulary of commands Google’s Dialogflow, Facebook’s wit.ai, Microsoft
and fails in handling many corner as it enters into LUIS and IBM Watson Assistant.
production Limitations: Cost of implementation and security
Natural language generation for dialogue systems
▪ NLG is conceptualized as a process leading from a
high-level communicative goal to a sequence of
communicative acts which accomplish this
communicative goal.
cp = nltk.RegexpParser(pattern)
cs = cp.parse(sent)
print(cs)
iob_tagged = tree2conlltags(cs)
pprint(iob_tagged)
ne_tree = ne_chunk(pos_tag(word_tokenize(ex)))
print(ne_tree)