
Effective Chatbots Using Deep Learning and Natural Language Processing

John Ivanov,¹ Prajval Sharma,² Yarwin Liu³
¹Tesoro High School, ²Cupertino High School, ³Aliso Niguel High School

SRA Track #9 - Machine Learning and Optimization


SRA Capstone Seminar
7/23
Presentation Outline
1. Introduction to Chatbots
2. Research Question & Goals
3. Significance
4. Literature Review
5. Methodology
6. Analysis
7. Conclusion
8. Acknowledgements
9. References

Introduction to Chatbots
● Software used to communicate with humans
● ELIZA, the first chatbot, was written with a predefined script
● Today, chatbots utilize complex deep neural networks to remove the constraints that come with predefining output responses
● Our research focused on generative models, where the model formulates its own response

Figure 1. Classification of Chatbot Types


Research Question & Goals
How can chatbots be trained to have more natural conversations?

Goals:
● Improve on current chatbot design
● Promote the use of bots in communication-focused departments
○ by building a conversational chatbot

Image 1. Cartoon Chatbot

Significance
● Bridges the gap between human and machine intelligence
● Great versatility: can be used for customer accommodation, assistance, and emotional companionship
● A stepping stone toward allowing human thoughts to be understood by machines
● Few departments specializing in human interaction, such as human resources, currently use chatbots

Figure 2. Use of Chatbots by Department


Literature Review
● Creating a model requires both Natural Language Understanding (NLU) and Natural Language Generation (NLG)
● Good datasets include corpora of dialogues and data from communication applications
● Cleaning the data dramatically reduces the sentence error rate
● The sequence-to-sequence model is the state of the art in this field

Methodology: Dataset
● 220,000 lines from the Cornell Movie Dialogs Corpus and 700,000 lines from a Twitter dataset (a parsing sketch follows the figure captions below)

Figure 7. Twitter Dataset
Figure 8. Cornell Dataset
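As an illustration, a minimal sketch of loading the Cornell corpus. The slides do not show any loading code, so the movie_lines.txt file name and the " +++$+++ " field separator are assumptions based on the corpus's published format:

```python
# Minimal sketch: read utterances from the Cornell Movie Dialogs Corpus.
# Assumes the corpus's published movie_lines.txt format, where fields are
# separated by " +++$+++ " and the utterance text is the last field.
def load_cornell_lines(path="movie_lines.txt", encoding="iso-8859-1"):
    lines = []
    with open(path, encoding=encoding) as f:
        for row in f:
            fields = row.strip().split(" +++$+++ ")
            if len(fields) == 5:          # lineID, characterID, movieID, character, text
                lines.append(fields[4])   # keep only the utterance text
    return lines

utterances = load_cornell_lines()
print(len(utterances))
```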

Methodology: Data Preprocessing
● Before the textual input is fed into the model, it needs to be preprocessed into a readable format
○ Punctuation and capitalization were removed
○ Uncommon words and repeated lines were removed
○ Words were padded so that all of them have the same length

Example: [‘hello’, ‘everyone’] -> [‘hello000’, ‘everyone’]


● Each word is added to a dictionary mapping it to a unique integer index (a preprocessing sketch appears below)
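A minimal sketch of this pipeline, assuming plain Python; the '0'-padding mirrors the slide's example, and the function name and thresholds are illustrative:

```python
import string
from collections import Counter

def preprocess(sentences, min_count=2):
    """Lowercase, strip punctuation, drop rare words and duplicate lines, pad words."""
    table = str.maketrans("", "", string.punctuation)
    cleaned, seen = [], set()
    for s in sentences:
        s = s.lower().translate(table)          # remove capitalization and punctuation
        if s and s not in seen:                 # drop repeated lines
            seen.add(s)
            cleaned.append(s.split())
    counts = Counter(w for s in cleaned for w in s)
    cleaned = [[w for w in s if counts[w] >= min_count] for s in cleaned]  # drop uncommon words
    longest = max((len(w) for s in cleaned for w in s), default=0)
    padded = [[w.ljust(longest, "0") for w in s] for s in cleaned]         # 'hello' -> 'hello000'
    vocab = {w: i for i, w in enumerate(sorted({w for s in padded for w in s}))}
    return padded, vocab

padded, vocab = preprocess(["Hello, everyone!", "Hello, everyone!"], min_count=1)
print(padded)   # [['hello000', 'everyone']]
```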

Methodology: Word Embeddings vs One Hot Encoding
● One-hot encoding is used to convert each word input into vector form (see the sketch below)
○ Examples:
■ Outstanding = [1,0,0,0]
■ Amazing = [0,1,0,0]
■ Awesome = [0,0,1,0]
■ Great = [0,0,0,1]
● The problem with this approach is that synonyms are encoded as unrelated vectors, so the model treats them as entirely different words
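A minimal sketch of this encoding, assuming Python with NumPy; the vocabulary is the slide's four-word example:

```python
import numpy as np

vocab = ["outstanding", "amazing", "awesome", "great"]
index = {w: i for i, w in enumerate(vocab)}

def one_hot(word):
    # Each word maps to a vector with a single 1 at its vocabulary index.
    v = np.zeros(len(vocab), dtype=int)
    v[index[word]] = 1
    return v

print(one_hot("amazing"))                       # [0 1 0 0]
# All pairs of distinct words are orthogonal, so similarity information is lost:
print(one_hot("amazing") @ one_hot("awesome"))  # 0
```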

Methodology: Word Embeddings vs One Hot Encoding, continued

● Word Embeddings
○ A representation of text where words that have similar meanings have similar representations
○ Most commonly used in an embedding layer
■ A word embedding learned jointly with a neural network model on a specific NLP task such as language modeling
■ Input to the layer must be a unique integer representation of each word
■ Initialized with random weights; learns an embedding for all the words in the dataset
■ Arguments to be specified include the input dimension, output dimension, and the input length (see the sketch below)
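The three arguments named above match the Embedding layer in Keras; a minimal sketch under that assumption, with illustrative sizes (note that newer Keras versions infer input_length automatically):

```python
# Minimal sketch of an embedding layer (assuming TensorFlow/Keras; sizes are illustrative).
import numpy as np
import tensorflow as tf

embedding = tf.keras.layers.Embedding(
    input_dim=8000,    # vocabulary size: number of unique integer word indices
    output_dim=128,    # dimension of each learned word vector
    input_length=20,   # length of each (padded) input sequence
)

# Input: integer word indices, as produced by the dictionary from preprocessing.
ids = np.zeros((1, 20), dtype=int)
ids[0, :4] = [4, 17, 391, 2]
vectors = embedding(ids)
print(vectors.shape)   # (1, 20, 128): one 128-d vector per word, learned during training
```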

Methodology: Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM)

● Advantages
○ Avoid the vanishing gradient problem commonly associated with plain recurrent networks
○ Better at carrying relevant information across long sequences
● Main Idea
○ Gates filter out unnecessary information/words from a sentence that do not match the intent
○ Example: This show is amazing. It is not bad. I need to buy tickets immediately.
■ The words the LSTM picks up are the ones that match the intent

Methodology: Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM), continued

● Structure
○ LSTM
■ 3 gates
● Input gate
● Forget gate
● Output gate
○ GRU
■ 2 gates
● Reset gate
● Update gate

Figure 3. Comparison of GRU and LSTM
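This structural difference shows up directly in parameter counts; a minimal sketch, assuming TensorFlow/Keras with illustrative sizes:

```python
# Minimal sketch (assuming TensorFlow/Keras): the LSTM's extra gate means
# roughly 4 weight sets versus the GRU's 3, visible in the parameter counts.
import tensorflow as tf

inputs = tf.keras.Input(shape=(20, 64))          # 20 timesteps, 64 features (illustrative)
lstm_out = tf.keras.layers.LSTM(128)(inputs)     # input, forget, output gates + cell candidate
gru_out = tf.keras.layers.GRU(128)(inputs)       # reset, update gates + candidate state

print(tf.keras.Model(inputs, lstm_out).count_params())  # 4 * 128 * (64 + 128 + 1)
print(tf.keras.Model(inputs, gru_out).count_params())   # 3 * 128 * (64 + 128 + 2)
```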

Methodology: Sequence to Sequence Model
● Used a sequence-to-sequence model (a sketch follows the figure caption below)
○ A 3-layer neural network
○ Includes an encoder and a decoder
○ Both encoder and decoder are recurrent neural networks

Figure 4. Sequence to Sequence Model
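A minimal encoder-decoder sketch, assuming TensorFlow/Keras with GRU units and illustrative sizes (the slides do not specify the framework or layer sizes):

```python
import tensorflow as tf
from tensorflow.keras import layers

vocab_size, embed_dim, units = 8000, 128, 256   # illustrative sizes

# Encoder: embeds the input sentence and compresses it into a final hidden state.
enc_in = tf.keras.Input(shape=(None,))
enc_emb = layers.Embedding(vocab_size, embed_dim)(enc_in)
_, enc_state = layers.GRU(units, return_state=True)(enc_emb)

# Decoder: generates the reply token by token, seeded with the encoder's state.
dec_in = tf.keras.Input(shape=(None,))
dec_emb = layers.Embedding(vocab_size, embed_dim)(dec_in)
dec_seq = layers.GRU(units, return_sequences=True)(dec_emb, initial_state=enc_state)
probs = layers.Dense(vocab_size, activation="softmax")(dec_seq)

model = tf.keras.Model([enc_in, dec_in], probs)
model.summary()
```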


Methodology: Encoder

Formula 1. Describes how each hidden state in the encoder is calculated
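In the standard formulation for a vanilla RNN encoder (cf. [3]), this recurrence is:

$$h_t = f\left(W^{(hh)} h_{t-1} + W^{(hx)} x_t\right)$$

where $x_t$ is the embedded input word at step $t$, $h_{t-1}$ is the previous hidden state, and $f$ is the recurrent unit's activation.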

Figure 5. Encoder Computation Graph [6]

Methodology: Decoder

Formula 2. Describes how each hidden state is calculated

Formula 3. How the output of the decoder is calculated
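In the same standard formulation (cf. [3]), the decoder's hidden state evolves from the previous state alone, and each output word distribution is a softmax over the vocabulary:

$$h_t = f\left(W^{(hh)} h_{t-1}\right), \qquad y_t = \mathrm{softmax}\left(W^{(S)} h_t\right)$$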


Figure 6. Decoder Computation Graph [6]

Analysis: Results
● The table on the left shows examples of outputs that correctly responded to each input, while the table on the right shows examples of outputs that did not respond properly
● The chatbot was able to respond correctly 72.2% of the time

Input                          Output
Where are you                  We have one before the sun very badly
Are you funny                  Do you think youll make sure I want one
Do you want to be president    I didnt get the message

Figure 9. Positive example inputs and outputs from the chatbot
Figure 10. Negative example inputs and outputs from the chatbot

Analysis: Learning Rate
● Initially, we set the learning rate higher and noticed that the model produced a higher loss despite a higher accuracy
○ One possibility is that the model was overconfident in its predictions
● We settled on a learning rate of 1e-4 (a configuration sketch appears below)
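A minimal sketch of that setting, reusing the model from the sequence-to-sequence sketch; only the 1e-4 learning rate comes from the slides, so the Adam optimizer and cross-entropy loss are assumptions:

```python
import tensorflow as tf

# Only the 1e-4 learning rate is specified on the slide; Adam and the
# sparse cross-entropy loss are assumed for this sketch.
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
    loss="sparse_categorical_crossentropy",
    metrics=["accuracy"],
)
```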

Analysis: Learning Loss and Accuracy
● Loss decreased from 1.3 (epoch 1) to 0.4 (epoch 50)
● Accuracy grew from 38.9% (epoch 1) to 72.2% (epoch 50)

Figure 11. Loss vs. Epoch Graph
Figure 12. Accuracy vs. Epoch Graph

Conclusion
● Due to the nature of our dataset, slang was incorporated
○ The bot understood popular topics such as the presidential election
● Ended with 72.2% accuracy
● Could not respond to personal questions
○ We plan on building a “personal information document” saved in memory
○ Then, if a question is classified as personal, a retrieval-based model can answer from that document
● Has great potential for providing emotional-support services
● An improved encoder could read between the lines

Acknowledgements
We would like to thank Ryan Solgi, Laboni Sarker, S. Shailja, Dr. Lina Kim, and the
SRA staff for their support.

References
[1] Ayanouz, Soufyane, et al. “A Smart Chatbot Architecture Based NLP and Machine Learning for Health Care Assistance.” ResearchGate, Association for Computing Machinery, 31 Mar. 2020, www.researchgate.net/publication/340678278_A_Smart_Chatbot_Architecture_based_NLP_and_Machine_Learning_for_Health_Care_Assistance.

[2] Brain. “Chatbot Report 2019: Global Trends and Analysis.” Medium, Chatbots Magazine, 19 Apr. 2019, chatbotsmagazine.com/chatbot-report-2019-global-trends-and-analysis-a487afec05b.

[3] Chablani, Manish. “Sequence to Sequence Model: Introduction and Concepts.” Medium, Towards Data Science, 23 June 2017, towardsdatascience.com/sequence-to-sequence-model-introduction-and-concepts-44d9b41cd42.

[4] “Chatbot Tutorial.” PyTorch Tutorials 1.9.0+cu102 Documentation, PyTorch, pytorch.org/tutorials/beginner/chatbot_tutorial.html?highlight=chatbot+tutorial.

[5] Fang, Hao, et al. “Sounding Board: A User-Centric and Content-Driven Social Chatbot.” arXiv, Cornell University, 26 Apr. 2018, arxiv.org/abs/1804.10202.

[6] Jwala, K., Sirisha, G. N. V. G., & Raju, G. V. P. “Developing a Chatbot Using Machine Learning.” June 2019, www.ijrte.org/wp-content/uploads/papers/v8i1S3/A10170681S319.pdf.

[7] Liu, Bing, et al. “Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems.” ACL Anthology, Association for Computational Linguistics, June 2018, aclanthology.org/N18-1187/.

[8] Mazaré, Pierre-Emmanuel, et al. “Training Millions of Personalized Dialogue Agents.” arXiv, Cornell University, 6 Sept. 2018, arxiv.org/abs/1809.01984.

[9] Siddhant, Aditya, et al. “Unsupervised Transfer Learning for Spoken Language Understanding in Intelligent Agents.” arXiv, Carnegie Mellon University, 13 Nov. 2018, arxiv.org/pdf/1811.05370.pdf.

[10] Suta, P., Lang, X., Wu, B., Mongkolnam, P., & Chan, J. H. “An Overview of Machine Learning in Chatbots.” 4 Apr. 2020, www.ijmerr.com/uploadfile/2020/0312/20200312023706525.pdf.

Questions
Contact Information: [email protected], [email protected], [email protected]
