
ML | Natural Language Processing using Deep Learning

Last Updated : 04 Aug, 2022


Machine Comprehension is a very interesting but challenging task in both Natural
Language Processing (NLP) and artificial intelligence (AI) research. There are several
approaches to natural language processing tasks. With recent breakthroughs in deep
learning algorithms, hardware, and user-friendly APIs like TensorFlow, some tasks
have become feasible up to a certain accuracy. This article contains information about
TensorFlow implementations of various deep learning models, with a focus on
problems in natural language processing. The purpose of this article is to help machines
understand the meaning of sentences, which improves the efficiency of machine translation,
and to enable interaction with computing systems to obtain useful information from them.

Understanding Natural Language Processing:

Our ability to evaluate the relationship between sentences is essential for tackling a
variety of natural language challenges, such as text summarization, information
extraction, and machine translation. This challenge is formalized as the natural
language inference task of Recognizing Textual Entailment (RTE), which involves
classifying the relationship between two sentences as one of entailment, contradiction,
or neutrality. For instance, the premise “Garfield is a cat”, naturally entails the
statement “Garfield has paws”, contradicts the statement “Garfield is a German
Shepherd”, and is neutral to the statement “Garfield enjoys sleeping”.
Natural language processing is the ability of a computer program to understand human
language as it is spoken. NLP is a component of artificial intelligence that deals with
the interactions between computers and human languages, in particular the processing
and analysis of large amounts of natural language data. Natural language processing
can perform several different tasks by processing natural language data through
efficient means. These tasks include:
- Answering questions about anything (what Siri, Alexa, and Cortana can do).
- Sentiment analysis (determining whether the attitude is positive, negative, or neutral).
- Image-to-text mappings (creating captions for an input image).
- Machine translation (translating text into different languages).
- Speech recognition.
- Part-of-speech (POS) tagging.
- Entity identification.
The traditional approach to NLP involved a great deal of domain knowledge from linguistics
itself.
Deep learning, at its most basic level, is all about representation learning. With
convolutional neural networks (CNNs), compositions of different filters are used to
classify objects into categories. Taking a similar approach, this article creates
representations of words from large datasets.

Conversational AI: Natural Language Processing Features

- Natural Language Processing (NLP)
- Text Mining (TM)
- Computational Linguistics (CL)
- Machine Learning on Text Data (ML on Text)
- Deep Learning approaches for Text Data (DL on Text)
- Natural Language Understanding (NLU)
- Natural Language Generation (NLG)
Conversational AI has seen several amazing advances in recent years, with
significant improvements in automatic speech recognition (ASR), text to speech
(TTS), and intent recognition, as well as the significant growth of voice assistant
devices like the Amazon Echo and Google Home.
Deep learning techniques work efficiently on NLP-related problems. This
article uses backpropagation and stochastic gradient descent (SGD) as the training
algorithms for the NLP models.
Computing the loss over every element of the training set is compute-intensive, which is
especially true for NLP problems because the data sets are large. Since gradient descent is
iterative, it must take many steps, which means passing over the data hundreds or thousands
of times. Instead, SGD estimates the loss from a small data set sampled at random from the
larger one, computes the derivative for that sample, and assumes that this derivative points
in the right direction for gradient descent. Any single step might even increase the loss
rather than reduce it, so SGD compensates by taking very small steps many times. Each step
is cheap to compute, and overall the procedure produces better performance. The SGD algorithm
is at the core of deep learning.
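The following is a minimal sketch of this mini-batch procedure on a toy linear model; the data, model, batch size, and learning rate are illustrative assumptions, not part of the article's actual NLP models.

```python
import numpy as np

# Minimal SGD sketch: estimate the loss gradient from a small random
# sample of a large data set and take many cheap, small steps.
rng = np.random.default_rng(0)
X = rng.normal(size=(10_000, 5))                    # large toy "data set"
true_w = np.array([1.0, -2.0, 0.5, 3.0, 0.0])
y = X @ true_w + rng.normal(scale=0.1, size=10_000)

w = np.zeros(5)            # start from a simple initialization
learning_rate = 0.01
batch_size = 32            # small random sample per step

for step in range(2_000):
    idx = rng.integers(0, len(X), size=batch_size)  # random mini-batch
    Xb, yb = X[idx], y[idx]
    # Gradient of the mean squared error, estimated on the mini-batch only
    grad = 2 * Xb.T @ (Xb @ w - yb) / batch_size
    w -= learning_rate * grad                       # one small, cheap step

print("estimated weights:", np.round(w, 2))
```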

Word Vectors:
Words need to be represented as input to machine learning models; one mathematical way
to do this is to use vectors. There are an estimated 13 million words in the English
language, but many of these are related.
The goal is to find an N-dimensional vector space (where N << 13 million) that is sufficient
to encode all the semantics of our language. To do this, there needs to be an understanding
of the similarities and differences between words. The concept of vectors and the distances
between them (cosine, Euclidean, etc.) can be exploited to find similarities and
differences between words.
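As a simple illustration, the sketch below compares a few hand-made toy word vectors using cosine similarity and Euclidean distance; the 4-dimensional values are invented for the example and are not trained embeddings.

```python
import numpy as np

# Toy word vectors (made-up values, not real embeddings).
vectors = {
    "hotel": np.array([0.9, 0.1, 0.8, 0.2]),
    "motel": np.array([0.8, 0.2, 0.7, 0.1]),
    "cat":   np.array([0.1, 0.9, 0.0, 0.7]),
}

def cosine_similarity(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def euclidean_distance(a, b):
    return float(np.linalg.norm(a - b))

print(cosine_similarity(vectors["hotel"], vectors["motel"]))  # close to 1.0
print(cosine_similarity(vectors["hotel"], vectors["cat"]))    # much smaller
print(euclidean_distance(vectors["hotel"], vectors["motel"])) # small distance
```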

How Do We Represent the Meaning of Words?

If a separate vector is used for each of the 13+ million words in the English vocabulary,
several problems occur. First, the vectors will be very large, with many 'zeroes' and a
single 'one' (in a different position for each word). This is known as one-hot encoding.
Second, when searching for a phrase such as "hotels in New Jersey" in Google, we expect
results pertaining to "motel", "lodging", and "accommodation" in New Jersey to be returned
as well. With one-hot encoding, these words have no natural notion of similarity. Ideally,
the dot products (since we are dealing with vectors) of the vectors for synonyms or similar
words would be high, so that similar words surface among the expected results.
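The sketch below illustrates the problem on a toy vocabulary: the dot product between the one-hot vectors of any two distinct words is always zero, so "hotel" and "motel" look completely unrelated. The vocabulary is made up for the example.

```python
import numpy as np

# One-hot encoding over a tiny toy vocabulary.
vocab = ["hotel", "motel", "lodging", "cat"]
word_to_index = {word: i for i, word in enumerate(vocab)}

def one_hot(word):
    vec = np.zeros(len(vocab))
    vec[word_to_index[word]] = 1.0     # a single 'one', the rest 'zeroes'
    return vec

hotel, motel = one_hot("hotel"), one_hot("motel")
print(hotel)          # [1. 0. 0. 0.]
print(hotel @ motel)  # 0.0 -- "hotel" and "motel" look unrelated
```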
Word2vec is a group of models that helps derive the relations between a word and its
contextual words. Beginning with a small, random initialization of the word vectors, the
predictive model learns the vectors by minimizing a loss function. In Word2vec, this
happens with a feed-forward neural network and optimization techniques such as the
SGD algorithm. There are also count-based models, which build a co-occurrence count
matrix of the words in the corpus: a large matrix with a row for each "word" and a column
for each "context". The number of "contexts" is of course large, since it is essentially
combinatorial in size. To overcome this size issue, singular value decomposition can be
applied to the matrix, reducing its dimensions while retaining maximum information.
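The sketch below illustrates the count-based approach on a toy corpus: it builds a word-word co-occurrence matrix with a one-word window and reduces it with a truncated SVD. The corpus, window size, and number of retained dimensions are illustrative assumptions.

```python
import numpy as np

# Toy corpus for the co-occurrence example.
corpus = [
    "i like deep learning",
    "i like nlp",
    "i enjoy flying",
]
tokens = sorted({w for sent in corpus for w in sent.split()})
index = {w: i for i, w in enumerate(tokens)}

# Co-occurrence counts within a window of one word on each side.
co_occurrence = np.zeros((len(tokens), len(tokens)))
for sent in corpus:
    words = sent.split()
    for i, w in enumerate(words):
        for j in range(max(0, i - 1), min(len(words), i + 2)):
            if i != j:
                co_occurrence[index[w], index[words[j]]] += 1

# Truncated SVD: keep the top k singular directions as low-dimensional
# word vectors while retaining as much information as possible.
U, S, Vt = np.linalg.svd(co_occurrence)
k = 2
word_vectors = U[:, :k] * S[:k]
print({w: np.round(word_vectors[index[w]], 2) for w in tokens})
```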
Software and Hardware:
The programming language used is Python 3.5.2, with Intel Optimization for
TensorFlow as the framework. For training and computation, the Intel AI
DevCloud powered by Intel Xeon Scalable processors was used. For the right application
and use case, the Intel AI DevCloud can provide a significant performance bump over the
host CPU, since it has 50+ cores and its own memory, interconnect, and operating
system.

Training Models for NLP: Langmod_nn and Memn2n-master


Langmod_nn Model –
The Langmod_nn model builds a three-layer Forward Bigram Model neural network
consisting of an embedding layer, a hidden layer, and a final softmax layer, where the
goal is to use a given word in a corpus to predict the next word.
The input is passed in as a one-hot encoded vector of dimension 5000.
Input:
A word in a corpus. Because the vocabulary size can get very large, the vocabulary is
limited to the top 5000 words in the corpus, and the rest of the words are replaced with
the UNK symbol. Each sentence in the corpus is also double-padded with stop symbols.
Output:
The following word in the corpus, also one-hot encoded in a vector the size of the
vocabulary.

Layers –

The model consists of the following three layers:


Embedding Layer: Each word corresponds to a unique embedding vector, a
representation of the word in some embedding space. Here, the embeddings all have
dimension 50. The embedding for a given word is found by doing a matrix multiply
(essentially a table lookup) with an embedding matrix that is trained during regular
backpropagation.
Hidden Layer: A fully connected feed-forward layer with a hidden layer size of 100
and rectified linear unit (ReLU) activation.
Softmax Layer: A fully connected feed-forward layer with a layer size equal to the
vocabulary size, where each element of the output vector (the logits) corresponds to the
probability of that word in the vocabulary being the next word.
Loss – The normal cross-entropy loss between the logits and the true labels is used as the
model’s cost.
Optimizer –
A normal SGD optimizer with a learning rate of 0.05.
Each epoch (around 480,000 examples) takes about 10 minutes to train on the CPU.
The test log-likelihood after epoch five is -846,493.44.
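A comparable three-layer bigram model can be sketched in tf.keras as shown below. The original Langmod_nn code is a separate TensorFlow project, so the layer wiring and API calls here are assumptions that follow the description above (embedding dimension 50, hidden size 100, softmax over a 5000-word vocabulary, cross-entropy loss, SGD with learning rate 0.05), not its actual source.

```python
import tensorflow as tf

VOCAB_SIZE = 5000   # top 5000 words, everything else mapped to UNK
EMBED_DIM = 50      # embedding dimension from the article
HIDDEN_DIM = 100    # hidden layer size from the article

model = tf.keras.Sequential([
    tf.keras.Input(shape=(1,), dtype="int32"),       # one word index in
    tf.keras.layers.Embedding(input_dim=VOCAB_SIZE,
                              output_dim=EMBED_DIM), # embedding lookup
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(HIDDEN_DIM, activation="relu"),  # hidden layer
    tf.keras.layers.Dense(VOCAB_SIZE),                # logits over the vocabulary
])

model.compile(
    optimizer=tf.keras.optimizers.SGD(learning_rate=0.05),
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
)
# Training would pair each word with the next word in the corpus, e.g.:
# model.fit(current_word_ids, next_word_ids, epochs=5)
```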
Memn2n-master:
Memn2n-master is a neural network with a recurrent attention model over a possibly
large external memory. The architecture is a form of memory network, but unlike earlier
memory networks it is trained end-to-end and hence requires significantly less
supervision during training, making it more generally applicable in realistic settings.
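The core idea, a soft attention read over an external memory, can be sketched in a few lines of NumPy; the dimensions and the random "memory" contents below are illustrative assumptions, and a real end-to-end memory network would learn the embeddings and may use several memory hops.

```python
import numpy as np

rng = np.random.default_rng(0)
num_memories, dim = 20, 50

memory_keys = rng.normal(size=(num_memories, dim))    # embedded story sentences
memory_values = rng.normal(size=(num_memories, dim))  # second embedding of the same sentences
question = rng.normal(size=dim)                       # embedded question

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Attention weights: how relevant each memory slot is to the question.
weights = softmax(memory_keys @ question)

# Read vector: weighted sum over the memory, combined with the question;
# this would feed a final softmax over candidate answers.
read = weights @ memory_values
answer_features = read + question

print(np.round(weights[:5], 3))
```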
Input data –
This directory includes the first set of 20 tasks for testing text understanding and
reasoning in the bAbI project. The motivation behind these 20 tasks is that each one
tests a unique aspect of text understanding and reasoning, and hence probes a different
ability of the trained models.
There are 1,000 questions each for training and testing. However, not all of this data
was used, as it might not be of much use.
The results of this model were a testing accuracy of 99.6%, a training accuracy of
97.6%, and a validation accuracy of 88%.
The TensorFlow framework has shown good results for training neural network
models, with the NLP models showing good accuracy. The training, testing, and loss
results have been strong: the Langmod_nn model and the memory network both achieved
good accuracy with low loss and error values. The flexibility of the memory model allows
it to be applied to tasks as diverse as question answering and language modeling.

Conclusion:

As shown, NLP provides a wide set of techniques and tools that can be applied in
many areas of life. By learning the models and using them in everyday interactions,
quality of life can improve significantly. NLP techniques help to improve
communications, reach goals, and improve the outcomes of every interaction.
NLP helps people use the tools and techniques that are already available to them.
By learning NLP techniques properly, people can achieve goals and overcome obstacles.
In the future, NLP will move beyond both statistical and rule-based systems toward a
natural understanding of language. Some improvements have already been made by the
tech giants. For example, Facebook has tried to use deep learning to understand text
without parsing, tags, named-entity recognition (NER), etc., and Google is trying to
convert language into mathematical expressions. Endpoint detection using grid long
short-term memory networks (by Google) and end-to-end memory networks on the bAbI
tasks (by Facebook) show the advances that can still be made in NLP models.
