
Natural Language Processing

NLP (Natural Language Processing) is dedicated to making it possible for computers to comprehend and process human languages. It is a subfield of linguistics, computer science, information engineering, and artificial intelligence that studies how computers interact with human (natural) languages, and in particular how to train computers to handle and analyse massive volumes of natural language data.

Applications of NLP

Most people use NLP applications every day without realising it:

Automatic Summarization – Automatic summarization is useful for gathering data from social media and other online sources, as well as for summarizing the meaning of documents and other written materials.

Sentiment Analysis – Businesses use natural language processing techniques such as sentiment analysis to understand what internet users are saying about their goods and services, and hence what customers require.

Indicators of reputation – Sentiment analysis goes beyond establishing simple polarity; it analyses sentiment in context to help understand what lies behind an expressed opinion. This is very important for understanding and influencing purchasing decisions.

Text classification – Text classification enables you to assign a document to a category and organise it, making it easier to find the information you need or to carry out certain tasks. Spam screening in email is one example of text classification in use.

Virtual Assistants – These days, digital assistants like Google Assistant, Cortana, Siri, and Alexa play a significant role in our lives. Not only can we communicate with them, but they can also make our lives easier.

Chatbots
A chatbot is one of the most widely used NLP applications. Many chatbots on the market today employ a similar approach. Here are a few you can try:

• Mitsuku Bot
https://www.pandorabots.com/mitsuku/

• CleverBot
https://www.cleverbot.com/

• Jabberwacky
http://www.jabberwacky.com/

• Haptik
https://haptik.ai/contact-us

• Rose
http://ec2-54-215-197-164.us-west-1.compute.amazonaws.com/speech.php

• Ochatbot
https://www.ometrics.com/blog/list-of-fun-chatbots/

There are two types of chatbots: script bots and smart-bots.

Script bot                                                    | Smart-bot
Script bots are easy to make                                  | Smart-bots are flexible and powerful
Script bots work around a script which is programmed in them  | Smart-bots work on bigger databases and other resources directly
Mostly they are free and are easy to integrate into a         | Smart-bots learn with more data
messaging platform                                            |
No or little language processing skills                       | Coding is required to take this up on board
Limited functionality                                         | Wide functionality

Human Language vs Computer Language

Humans need language to communicate, and we process it constantly. Our brain continuously processes the sounds it hears around us and works to make sense of them. Even as the teacher is delivering the lesson in the classroom, our brain continuously processes and stores everything.

Computers, on the other hand, understand computer language. All input must be converted to numbers before being sent to the machine, and if a single error is made while typing, the machine throws an error and does not process that part. Machines communicate only in extremely simple and elementary forms.
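The point that all input must become numbers can be seen directly in Python (used here purely for illustration): every character a computer stores is ultimately a number.

```python
text = "Hi"
# Unicode code points that represent each character
print([ord(ch) for ch in text])       # [72, 105]
# The bytes actually stored when the text is UTF-8 encoded
print(list(text.encode("utf-8")))     # [72, 105]
```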

Data Processing

Data processing is the manipulation of data: the conversion of raw data into meaningful, machine-readable information.

Since human languages are complex, we first need to simplify them so that a machine can understand them. Text normalisation helps clean up textual data and brings it down to a level where its complexity is lower than that of the raw data. Let us go through text normalisation in detail.

Text Normalisation

The process of converting a text into a canonical (standard) form is known as text normalisation. For instance, the words “gooood” and “gud” can both be reduced to their canonical form “good.” Another illustration is the reduction of nearly identical terms such as “stopwords,” “stop-words,” and “stop words” to just “stopwords.”

Sentence Segmentation

Under sentence segmentation, the whole corpus is divided into sentences. Each sentence is then treated as a separate piece of data, so the whole corpus is reduced to a list of sentences.
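As a minimal sketch (Python used for illustration), sentences can be segmented by splitting on sentence-ending punctuation; real segmenters also handle complications such as abbreviations like “Dr.” or “Rs.”.

```python
import re

def segment_sentences(corpus):
    # Split after ., ! or ? when followed by whitespace.
    sentences = re.split(r'(?<=[.!?])\s+', corpus.strip())
    return [s for s in sentences if s]

corpus = "Our class is learning NLP. It is fun! Do you like it?"
print(segment_sentences(corpus))
# ['Our class is learning NLP.', 'It is fun!', 'Do you like it?']
```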

Tokenisation

After segmentation, each sentence is further divided into tokens. A token is any word, number, or special character that appears in a sentence. Tokenisation treats each word, number, and special character as a separate entity and creates a token for each of them.
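A simple tokeniser in this spirit can be sketched with a regular expression that keeps words, numbers, and single special characters as separate tokens (one illustrative approach among many):

```python
import re

def tokenize(sentence):
    # \w+ matches runs of letters/digits; [^\w\s] matches one special character.
    return re.findall(r"\w+|[^\w\s]", sentence)

print(tokenize("Raj's cycle costs Rs. 500!"))
# ['Raj', "'", 's', 'cycle', 'costs', 'Rs', '.', '500', '!']
```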

Removing Stopwords, Special Characters and Numbers

In this step, tokens that are not necessary are removed from the token list. Which words might we not require?
Stopwords are words that occur very frequently in a corpus but add little meaning. Humans use grammar to make their sentences clear and understandable to the other person, but such grammatical terms fall under the category of stopwords because they do not add significance to the information that the statement conveys. Stopwords include a, an, and, or, for, it, is, etc.
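A minimal sketch of stopword removal, assuming a small hand-made stopword list (real toolkits such as NLTK ship much longer lists):

```python
# Illustrative stopword list; real lists contain well over a hundred words.
STOPWORDS = {"a", "an", "and", "or", "for", "it", "is", "the", "to", "of"}

def remove_stopwords(tokens):
    return [t for t in tokens if t.lower() not in STOPWORDS]

print(remove_stopwords(["NLP", "is", "a", "field", "of", "AI"]))
# ['NLP', 'field', 'AI']
```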

Converting text to a common case

After eliminating the stopwords, we convert the whole text to a common case, preferably lower case. This ensures that the machine’s case-sensitivity does not treat similar terms as different solely because of varied case usage.
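In code, case folding is a one-line step; without it, a machine would count “NLP”, “Nlp”, and “nlp” as three different tokens:

```python
tokens = ["NLP", "Nlp", "nlp"]
lowered = [t.lower() for t in tokens]
print(lowered)            # ['nlp', 'nlp', 'nlp']
print(len(set(lowered)))  # 1 distinct term instead of 3
```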

Stemming

The remaining words are reduced to their root words in this step. In other words, stemming is the process of stripping words of their affixes; note that the resulting stem is not always a meaningful word.
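A crude illustrative stemmer that strips a fixed list of suffixes (real systems use carefully designed rule sets such as the Porter stemmer; notice that the stems need not be dictionary words):

```python
def stem(word, suffixes=("ing", "ly", "ed", "es", "s")):
    # Strip the first matching suffix, keeping at least a 3-letter stem.
    for suf in suffixes:
        if word.endswith(suf) and len(word) > len(suf) + 2:
            return word[: -len(suf)]
    return word

for w in ["caring", "studies", "healed"]:
    print(w, "->", stem(w))
# caring -> car, studies -> studi, healed -> heal
```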

Lemmatization

Stemming and lemmatization are alternative techniques to one another, because both work by removing affixes. However, lemmatization differs in that the word resulting from the removal of the affix (known as the lemma) is always meaningful.
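Lemmatization can be sketched as a dictionary lookup, using a toy lemma table (real lemmatizers, e.g. those based on WordNet, rely on full morphological lexicons):

```python
# Toy lemma dictionary; the entries here are illustrative.
LEMMAS = {"studies": "study", "tried": "try", "caring": "care"}

def lemmatize(word):
    # Unlike a stem, the returned lemma is always a meaningful word.
    return LEMMAS.get(word, word)

print([lemmatize(w) for w in ["studies", "tried", "caring"]])
# ['study', 'try', 'care']
```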

Bag of Words

A bag of words is a representation of text that describes the occurrence of words within a document, disregarding word order. It has two components: a vocabulary of known words, and a measure of the presence of those known words.

Bag of Words is a Natural Language Processing model that helps extract features from text which machine learning techniques can then use. We collect the occurrences of each term from the bag of words and build the corpus’s vocabulary from them.
Bag of Words is used in:

• Document Classification – helps in classifying the type and genre of a document.
• Topic Modelling – helps in predicting the topic for a corpus.
• Information Retrieval System – to extract the important information out of a corpus.
• Stop word filtering – helps in removing the unnecessary words out of a text body.

Here is the step-by-step approach to implement the bag of words algorithm:

1. Text Normalisation: collect the data and pre-process it.
2. Create Dictionary: make a list of all the unique words occurring in the corpus (the vocabulary).
3. Create document vectors: for each document in the corpus, count how many times each word from the unique list occurs in it.
4. Repeat step 3 to create document vectors for all the documents.
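The four steps above can be sketched as follows (Python used for illustration; the example sentences are made up):

```python
def bag_of_words(documents):
    # Step 1: normalise - lowercase each document and split into tokens.
    docs_tokens = [doc.lower().split() for doc in documents]
    # Step 2: dictionary of unique words (the vocabulary).
    vocab = sorted({tok for tokens in docs_tokens for tok in tokens})
    # Steps 3-4: one count vector per document, in vocabulary order.
    vectors = [[tokens.count(word) for word in vocab] for tokens in docs_tokens]
    return vocab, vectors

docs = ["aman and anil are stressed", "aman went to a therapist"]
vocab, vectors = bag_of_words(docs)
print(vocab)
# ['a', 'aman', 'and', 'anil', 'are', 'stressed', 'therapist', 'to', 'went']
print(vectors)
# [[0, 1, 1, 1, 1, 1, 0, 0, 0], [1, 1, 0, 0, 0, 0, 1, 1, 1]]
```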

Term Frequency

The measurement of how frequently a term appears in a document is called term frequency. The simplest calculation is to count the instances of each word. However, there are ways to adjust that value, for example based on the length of the document or on the frequency of the term that appears most often.
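A minimal term-frequency computation, here normalised by document length (one of the adjustments mentioned above):

```python
def term_frequency(tokens):
    # Count each term, then divide by the document length.
    counts = {}
    for tok in tokens:
        counts[tok] = counts.get(tok, 0) + 1
    length = len(tokens)
    return {tok: c / length for tok, c in counts.items()}

print(term_frequency("the cat sat on the mat".split()))
# 'the' appears 2 times out of 6 tokens -> 2/6
```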

Inverse Document Frequency

A term’s inverse document frequency measures how common or rare it is across a corpus of documents. It is calculated by dividing the total number of documents in the corpus by the number of documents that contain the term.
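A sketch of the ratio described above (many toolkits additionally take the logarithm of this ratio, i.e. log(N/df)):

```python
def inverse_document_frequency(term, documents):
    # Ratio of total documents to documents containing the term.
    containing = sum(1 for doc in documents if term in doc.split())
    return len(documents) / containing if containing else 0.0

docs = ["aman and anil are stressed",
        "aman went to a therapist",
        "anil went to download a health chatbot"]
print(inverse_document_frequency("aman", docs))     # 3/2 = 1.5 (common word)
print(inverse_document_frequency("chatbot", docs))  # 3/1 = 3.0 (rare word)
```

Note how a term appearing in fewer documents gets a larger value, marking it as more distinctive.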
