0% found this document useful (0 votes)

2 views3 pages

Assignment - 7: Import Import Import Import

The document outlines a series of Python code snippets demonstrating Natural Language Processing (NLP) techniques using libraries like NLTK and Scikit-learn. It includes tokenization, part-of-speech tagging, stop words removal, stemming, lemmatization, and TF-IDF vectorization. Additionally, it showcases a simple bar plot of TF-IDF scores for visualization.

Uploaded by

princethakur545454

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views3 pages

Assignment - 7: Import Import Import Import

Uploaded by

princethakur545454

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

ASSIGNMENT - 7

In [10]: import numpy

import scipy
import sklearn
import nltk

In [3]: document = "Natural Language Processing is a fascinating field of AI. NLP he

In [4]: from nltk.tokenize import word_tokenize

tokens = word_tokenize(document)
print("Tokenized Words:", tokens)

Tokenized Words: ['Natural', 'Language', 'Processing', 'is', 'a', 'fascinati

ng', 'field', 'of', 'AI', '.', 'NLP', 'helps', 'machines', 'understand', 'hu
man', 'language', '.']

In [5]: pos_tags = nltk.pos_tag(tokens)

print("POS Tags:", pos_tags)

POS Tags: [('Natural', 'JJ'), ('Language', 'NNP'), ('Processing', 'NNP'),

('is', 'VBZ'), ('a', 'DT'), ('fascinating', 'JJ'), ('field', 'NN'), ('of',
'IN'), ('AI', 'NNP'), ('.', '.'), ('NLP', 'NNP'), ('helps', 'VBZ'), ('machin
es', 'NNS'), ('understand', 'JJ'), ('human', 'JJ'), ('language', 'NN'),
('.', '.')]

In [6]: from nltk.corpus import stopwords

stop_words = set(stopwords.words('english'))
filtered_tokens = [word for word in tokens if word.lower() not in stop_words
print("After Stop Words Removal:", filtered_tokens)

After Stop Words Removal: ['Natural', 'Language', 'Processing', 'fascinatin

g', 'field', 'AI', '.', 'NLP', 'helps', 'machines', 'understand', 'human',
'language', '.']

In [7]: from nltk.stem import PorterStemmer, WordNetLemmatizer

stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()

stemmed = [stemmer.stem(word) for word in filtered_tokens]

lemmatized = [lemmatizer.lemmatize(word) for word in filtered_tokens]

print("Stemmed Words:", stemmed)

print("Lemmatized Words:", lemmatized)

Stemmed Words: ['natur', 'languag', 'process', 'fascin', 'field', 'ai', '.',

'nlp', 'help', 'machin', 'understand', 'human', 'languag', '.']
Lemmatized Words: ['Natural', 'Language', 'Processing', 'fascinating', 'fiel
d', 'AI', '.', 'NLP', 'help', 'machine', 'understand', 'human', 'language',
'.']
In [8]: from sklearn.feature_extraction.text import TfidfVectorizer

# Using the same doc twice just to simulate multiple documents for IDF
documents = [
"Natural Language Processing is a fascinating field of AI. NLP helps mac
"Natural Language Processing is a fascinating field of AI. NLP helps mac
]

tfidf_vectorizer = TfidfVectorizer()
tfidf_matrix = tfidf_vectorizer.fit_transform(documents)

# Print TF-IDF scores

feature_names = tfidf_vectorizer.get_feature_names_out()
dense = tfidf_matrix.todense()
denselist = dense.tolist()

import pandas as pd
df = pd.DataFrame(denselist, columns=feature_names)
print(df)

ai fascinating field helps human is language machines natural

\
0 0.25 0.25 0.25 0.25 0.25 0.25 0.5 0.25 0.25
1 0.25 0.25 0.25 0.25 0.25 0.25 0.5 0.25 0.25

nlp of processing understand

0 0.25 0.25 0.25 0.25
1 0.25 0.25 0.25 0.25

In [11]: import matplotlib.pyplot as plt

import numpy as np

# Example TF-IDF scores

terms = ['term1', 'term2', 'term3', 'term4', 'term5']
tfidf_scores = [0.75, 0.85, 0.95, 0.65, 0.80]

# Sort terms based on TF-IDF scores in descending order

sorted_indices = np.argsort(tfidf_scores)[::-1]
sorted_terms = np.array(terms)[sorted_indices]
sorted_scores = np.array(tfidf_scores)[sorted_indices]

# Plotting
plt.bar(sorted_terms, sorted_scores, color='skyblue')
plt.xlabel('Terms')
plt.ylabel('TF-IDF Score')
plt.title('Top 5 TF-IDF Scores')
plt.show()
In [ ]:

This notebook was converted with convert.ploomber.io

NLP Lab Manual
No ratings yet
NLP Lab Manual
21 pages
Breville BES920XL
100% (3)
Breville BES920XL
17 pages
Acceptance Test Engineering Guide Vol I RC1 Full 102609 PDF
No ratings yet
Acceptance Test Engineering Guide Vol I RC1 Full 102609 PDF
251 pages
Dokumen - Pub - Natural Language Processing Practical Using Transformers With Python
No ratings yet
Dokumen - Pub - Natural Language Processing Practical Using Transformers With Python
275 pages
Extra Feature NLP
No ratings yet
Extra Feature NLP
5 pages
Chapter - 4 Education and Human Value
100% (1)
Chapter - 4 Education and Human Value
16 pages
NLP 1 Week Tutorial NLTK
No ratings yet
NLP 1 Week Tutorial NLTK
15 pages
Tutorial 3 - 206009L
No ratings yet
Tutorial 3 - 206009L
34 pages
Sahil NLP
No ratings yet
Sahil NLP
16 pages
Parts of Speech Tagger
No ratings yet
Parts of Speech Tagger
12 pages
NLP Record
No ratings yet
NLP Record
16 pages
C24064 - NLP - Lab Manual
No ratings yet
C24064 - NLP - Lab Manual
28 pages
NLP Crecord Mid2
No ratings yet
NLP Crecord Mid2
36 pages
Ir Practical Manual 2
No ratings yet
Ir Practical Manual 2
24 pages
(E-Book) Scheduling in Real-Time Systems
100% (5)
(E-Book) Scheduling in Real-Time Systems
284 pages
Surge Arrestor Testing
100% (1)
Surge Arrestor Testing
31 pages
Fake News Detection
No ratings yet
Fake News Detection
15 pages
NLP Assignment (917722H031)
No ratings yet
NLP Assignment (917722H031)
18 pages
DS 7
No ratings yet
DS 7
3 pages
Report On - Social Media Research Topic Modeling
No ratings yet
Report On - Social Media Research Topic Modeling
26 pages
EX1
No ratings yet
EX1
6 pages
Aped For Fake News
No ratings yet
Aped For Fake News
6 pages
Deep Learning Questions 1701781891
No ratings yet
Deep Learning Questions 1701781891
17 pages
Thermodynamics Project PDF
No ratings yet
Thermodynamics Project PDF
32 pages
3
No ratings yet
3
5 pages
NLP Manual
No ratings yet
NLP Manual
21 pages
Assignment No - 7
No ratings yet
Assignment No - 7
4 pages
Module III
No ratings yet
Module III
42 pages
Exp No 5
No ratings yet
Exp No 5
5 pages
Natural Language Processing
No ratings yet
Natural Language Processing
17 pages
NLP Lab Programs
No ratings yet
NLP Lab Programs
18 pages
DSBD 7 Ass
No ratings yet
DSBD 7 Ass
9 pages
SK NLP Practical (FS)
No ratings yet
SK NLP Practical (FS)
22 pages
Lab2 IR
No ratings yet
Lab2 IR
16 pages
NLP Lab - Manual
No ratings yet
NLP Lab - Manual
33 pages
CSE 3652 Lab Record Format - PDF
No ratings yet
CSE 3652 Lab Record Format - PDF
13 pages
Self Evaluation Exercises
No ratings yet
Self Evaluation Exercises
12 pages
DSBA+Master+Codebook+ +Text+Mining+&+TSF
No ratings yet
DSBA+Master+Codebook+ +Text+Mining+&+TSF
11 pages
ASTW RA03 PracticalManual
No ratings yet
ASTW RA03 PracticalManual
18 pages
Assignment 2 IR
No ratings yet
Assignment 2 IR
6 pages
NLP - Practical List
No ratings yet
NLP - Practical List
14 pages
Final NLP Lab File
No ratings yet
Final NLP Lab File
28 pages
Bag of Words
No ratings yet
Bag of Words
19 pages
Sumati
No ratings yet
Sumati
10 pages
Methodology
No ratings yet
Methodology
9 pages
NLP Record 2
No ratings yet
NLP Record 2
18 pages
NLP Lab Assignment 8
No ratings yet
NLP Lab Assignment 8
14 pages
NLP Final
No ratings yet
NLP Final
26 pages
NLP 9
No ratings yet
NLP 9
44 pages
Dsbda 7
No ratings yet
Dsbda 7
1 page
Rajeev Mishra 20 SCSE1180087
No ratings yet
Rajeev Mishra 20 SCSE1180087
29 pages
Python Assignment 3
No ratings yet
Python Assignment 3
3 pages
SMA (TASK1 AND 2) ... HARDCOPY (Final) ..Pranchal..
No ratings yet
SMA (TASK1 AND 2) ... HARDCOPY (Final) ..Pranchal..
11 pages
NLP Final Review
No ratings yet
NLP Final Review
32 pages
Motivation Video: Mitsuku Vs Cleverbot - AI (Artificial Intelligence)
No ratings yet
Motivation Video: Mitsuku Vs Cleverbot - AI (Artificial Intelligence)
45 pages
Tinywow Pythass3 77951173
No ratings yet
Tinywow Pythass3 77951173
17 pages
Reading KSTN Charts
100% (3)
Reading KSTN Charts
6 pages
03 Maxilift 17 Ws
No ratings yet
03 Maxilift 17 Ws
25 pages
SL-3 - Assignment No 7
No ratings yet
SL-3 - Assignment No 7
14 pages
CSE508: Information Retrieval Assignment 2: Question 1 - (40 Points) Scoring and Term-Weighting
No ratings yet
CSE508: Information Retrieval Assignment 2: Question 1 - (40 Points) Scoring and Term-Weighting
3 pages
Week 6: Introduction To Natural Language Processing
No ratings yet
Week 6: Introduction To Natural Language Processing
18 pages
Shubham Jade MSC It 31031420010 NLP Practical Journal
No ratings yet
Shubham Jade MSC It 31031420010 NLP Practical Journal
17 pages
EPL Futsal Centre Online System
100% (1)
EPL Futsal Centre Online System
26 pages
Technical Writing A Practical Guide For Engineers ... - (Chapter 1 The Nature of Technical Writing)
100% (1)
Technical Writing A Practical Guide For Engineers ... - (Chapter 1 The Nature of Technical Writing)
10 pages
Report - Casa Chen - Actualizado - 16.10.2019 - Opcion 2
No ratings yet
Report - Casa Chen - Actualizado - 16.10.2019 - Opcion 2
11 pages
NLP - Short Assignments
No ratings yet
NLP - Short Assignments
8 pages
Itmf 2013 11 Sperling Enu
No ratings yet
Itmf 2013 11 Sperling Enu
5 pages
CSDM2-Text Preprocessing For NL Data - 011050
No ratings yet
CSDM2-Text Preprocessing For NL Data - 011050
6 pages
IKEA Brochure Bath en
No ratings yet
IKEA Brochure Bath en
19 pages
Natural Language Processing - NOTES
No ratings yet
Natural Language Processing - NOTES
4 pages
Comandos Sics
No ratings yet
Comandos Sics
64 pages
Machine Learning NLP LAB Sayak Mallick
No ratings yet
Machine Learning NLP LAB Sayak Mallick
4 pages
Differential Amplifiers: Syed Asif Eqbal
No ratings yet
Differential Amplifiers: Syed Asif Eqbal
22 pages
Module 4 CT
No ratings yet
Module 4 CT
7 pages
Descripción de Un City Gate
No ratings yet
Descripción de Un City Gate
18 pages
Marika SENG Research Presentation v1
No ratings yet
Marika SENG Research Presentation v1
27 pages
Commercializing Boron Nitride NanoTubes (BNNTS) For The Advanced Engineering Materials Industry An Interview With Jerome Pollak
No ratings yet
Commercializing Boron Nitride NanoTubes (BNNTS) For The Advanced Engineering Materials Industry An Interview With Jerome Pollak
6 pages
0701 Mechanical General Provision
No ratings yet
0701 Mechanical General Provision
13 pages
Irr Eo 801
No ratings yet
Irr Eo 801
9 pages
Solution Overview. Motorola Solutions Dimetra IP Micro Automatic Failover PDF
No ratings yet
Solution Overview. Motorola Solutions Dimetra IP Micro Automatic Failover PDF
8 pages
Client Side Document Scoping
No ratings yet
Client Side Document Scoping
22 pages
Salman Et Al 2008 "The Changing Role of CAAD in The Architectural Design Studio"
No ratings yet
Salman Et Al 2008 "The Changing Role of CAAD in The Architectural Design Studio"
15 pages
Rozee CV 6280316 Sumaiya Iftekhar
No ratings yet
Rozee CV 6280316 Sumaiya Iftekhar
2 pages
Compendium of Biomedical Instrumentation, 3 Volume Set Raghbir Singh Khandpur PDF Download
100% (1)
Compendium of Biomedical Instrumentation, 3 Volume Set Raghbir Singh Khandpur PDF Download
58 pages
0803
No ratings yet
0803
29 pages
Vapour Pressure
No ratings yet
Vapour Pressure
4 pages
G026a-Pv-15001 String Wiring Layout Block 1-15001
No ratings yet
G026a-Pv-15001 String Wiring Layout Block 1-15001
1 page
FSR PD 8
No ratings yet
FSR PD 8
2 pages
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
50 Python Concepts Every Developer Should Know
From Everand
50 Python Concepts Every Developer Should Know
Hernando Abella
No ratings yet

Assignment - 7: Import Import Import Import

Uploaded by

Assignment - 7: Import Import Import Import

Uploaded by

ASSIGNMENT - 7

In [10]: import numpy

In [3]: document = "Natural Language Processing is a fascinating field of AI. NLP he

In [4]: from nltk.tokenize import word_tokenize

Tokenized Words: ['Natural', 'Language', 'Processing', 'is', 'a', 'fascinati

In [5]: pos_tags = nltk.pos_tag(tokens)

POS Tags: [('Natural', 'JJ'), ('Language', 'NNP'), ('Processing', 'NNP'),

In [6]: from nltk.corpus import stopwords

After Stop Words Removal: ['Natural', 'Language', 'Processing', 'fascinatin

In [7]: from nltk.stem import PorterStemmer, WordNetLemmatizer

stemmed = [stemmer.stem(word) for word in filtered_tokens]

print("Stemmed Words:", stemmed)

Stemmed Words: ['natur', 'languag', 'process', 'fascin', 'field', 'ai', '.',

# Print TF-IDF scores

ai fascinating field helps human is language machines natural

nlp of processing understand

In [11]: import matplotlib.pyplot as plt

# Example TF-IDF scores

# Sort terms based on TF-IDF scores in descending order

This notebook was converted with convert.ploomber.io

You might also like