Assignment-10 (NLP-part-2)

Lab Assignment 10 for UCS420 Cognitive Computing focuses on Natural Language Processing (NLP) using Python. It includes tasks such as text preprocessing, feature extraction, sentiment analysis, and text generation through various techniques like tokenization, stemming, and similarity metrics. Students are required to apply libraries like NLTK, TextBlob, and Keras to analyze and generate text based on their own inputs.

Uploaded by

skaushal1be23

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Assignment-10 (NLP-part-2)

Uploaded by

skaushal1be23

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Lab Assignment 10

UCS420 Cogni ve Compu ng

Assignment Title: NLP using Python-II

(Feature extrac on from text, sen ment analysis and text genera on)

Q1. Write a unique paragraph (5-6 sentences) about your favorite topic (e.g., sports,
technology, food, books, etc.).

1. Convert text to lowercase and remove punctua on using re.

2. Tokenize the text into words and sentences.
3. Split using split() and word_tokenize() and compare how Python split and NLTK’s
word_tokenize() diﬀer.
4. Remove stopwords (using NLTK's stopwords list).
5. Display word frequency distribu on (excluding stopwords).

Q2. Using the same paragraph from Q1:

1. Extract all words with only alphabets using re.ﬁndall()
2. Remove stop words using NLTK’s stopword list
3. Perform stemming with PorterStemmer
4. Perform lemma za on with WordNetLemma zer
5. Compare the stemmed and lemma zed outputs and explain when you’d prefer one over
the other.

Q3. Choose 3 short texts of your own (e.g., diﬀerent news headlines, product reviews).

1. Use CountVectorizer to generate the Bag of Words representa on.

2. Use TﬁdfVectorizer to compute TF-IDF scores.
3. Print and interpret the top 3 keywords from each text using TF-IDF.

Q4. Write 2 short texts (4–6 lines each) describing two diﬀerent technologies (e.g., AI vs
Blockchain).

1. Preprocess and tokenize both texts.

2. Calculate:
a. Jaccard Similarity using sets
b. Cosine Similarity using TﬁdfVectorizer + cosine_similarity()
c. Analyze which similarity metric gives be er insights in your case.

Q5. Write a short review for a product or service.

1. Use TextBlob or VADER to ﬁnd polarity & subjec vity for each review.
2. Classify reviews into Posi ve / Nega ve / Neutral.
3. Create a word cloud using the wordcloud library for all posi ve reviews.

Q6. Choose your own paragraph (~100 words) as training data.

1. Tokenize text using Tokenizer() from keras.preprocessing.text
2. Create input sequences and build a simple LSTM or Dense model
3. Train the model and generate 2–3 new lines of text star ng from any seed word you
provide.

Approaching Almost Any NLP
No ratings yet
Approaching Almost Any NLP
118 pages
Nlp Lab Manual
No ratings yet
Nlp Lab Manual
21 pages
NLP_Assignment2 proper RNN working
No ratings yet
NLP_Assignment2 proper RNN working
3 pages
NLP_Assignment2
No ratings yet
NLP_Assignment2
7 pages
AI Lab Manual aktu
No ratings yet
AI Lab Manual aktu
11 pages
NLP Manual
No ratings yet
NLP Manual
21 pages
Laboratory Manual: Faculty of Engineering and Technology Bachelor of Technology
No ratings yet
Laboratory Manual: Faculty of Engineering and Technology Bachelor of Technology
10 pages
NLP LAB_MANUAL (1)
No ratings yet
NLP LAB_MANUAL (1)
33 pages
NLP - Practical List
No ratings yet
NLP - Practical List
14 pages
p4
No ratings yet
p4
10 pages
Sumati
No ratings yet
Sumati
10 pages
ASTW RA03 PracticalManual
No ratings yet
ASTW RA03 PracticalManual
18 pages
Combine PDF
No ratings yet
Combine PDF
124 pages
NLP Assignment 2
No ratings yet
NLP Assignment 2
3 pages
NLP Lab
No ratings yet
NLP Lab
18 pages
NLP - Short Assignments
No ratings yet
NLP - Short Assignments
8 pages
AIML_P4
No ratings yet
AIML_P4
12 pages
unit4 (1)
No ratings yet
unit4 (1)
23 pages
Rajeev Mishra 20 SCSE1180087
No ratings yet
Rajeev Mishra 20 SCSE1180087
29 pages
Deep DL Manual Deep
No ratings yet
Deep DL Manual Deep
8 pages
Next Word Prediction With NLP and Deep Learning
No ratings yet
Next Word Prediction With NLP and Deep Learning
13 pages
Practicle 7-notes
No ratings yet
Practicle 7-notes
2 pages
NLP Smitpatel
No ratings yet
NLP Smitpatel
32 pages
Deep DL Manual Nainish
No ratings yet
Deep DL Manual Nainish
8 pages
SMA (TASK1 AND 2) ... HARDCOPY (Final) ..Pranchal..
No ratings yet
SMA (TASK1 AND 2) ... HARDCOPY (Final) ..Pranchal..
11 pages
NLP - Cheatsheet
No ratings yet
NLP - Cheatsheet
10 pages
Jal Patel NLP
No ratings yet
Jal Patel NLP
32 pages
NLP Final Review
No ratings yet
NLP Final Review
32 pages
basenlp
No ratings yet
basenlp
5 pages
NLP Tushar
No ratings yet
NLP Tushar
21 pages
CSDM2-Text Preprocessing For NL Data - 011050
No ratings yet
CSDM2-Text Preprocessing For NL Data - 011050
6 pages
NLP Soc
No ratings yet
NLP Soc
15 pages
NLP LAB MANUAL
No ratings yet
NLP LAB MANUAL
17 pages
Assignment-9 (NLP)
No ratings yet
Assignment-9 (NLP)
2 pages
17 Practicals
No ratings yet
17 Practicals
7 pages
NLP Previous Sem-4-5
No ratings yet
NLP Previous Sem-4-5
2 pages
Batch 2
No ratings yet
Batch 2
13 pages
Natural Language Processing
No ratings yet
Natural Language Processing
17 pages
AI Practical No 9-13
No ratings yet
AI Practical No 9-13
5 pages
NLP Final
No ratings yet
NLP Final
26 pages
Homework 2
No ratings yet
Homework 2
4 pages
Gen Ai Lab
No ratings yet
Gen Ai Lab
3 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
15 pages
COMP 4650 6490 Assignment 3 2023-v1.1
No ratings yet
COMP 4650 6490 Assignment 3 2023-v1.1
6 pages
Sahil NLP
No ratings yet
Sahil NLP
16 pages
Natural Language Processing
No ratings yet
Natural Language Processing
22 pages
Web and Social Media Analytics Lab
No ratings yet
Web and Social Media Analytics Lab
34 pages
Question Bank
No ratings yet
Question Bank
3 pages
Question Bank
No ratings yet
Question Bank
2 pages
Deep_Learning_Questions_1701781891
No ratings yet
Deep_Learning_Questions_1701781891
17 pages
NLP MTE syllabus and Practice Problems (2)
No ratings yet
NLP MTE syllabus and Practice Problems (2)
2 pages
NLP
No ratings yet
NLP
2 pages
AIML_LAB_Week9_2
No ratings yet
AIML_LAB_Week9_2
3 pages
Unit2 Full
No ratings yet
Unit2 Full
28 pages
Aped For Fake News
No ratings yet
Aped For Fake News
6 pages
NLP Programs
No ratings yet
NLP Programs
13 pages
TextFeatureEnginerring-NLP lec2
No ratings yet
TextFeatureEnginerring-NLP lec2
60 pages
NLP Preprocessing Steps
No ratings yet
NLP Preprocessing Steps
20 pages
A Beginner's guide to Python
From Everand
A Beginner's guide to Python
Steven Mcananey
No ratings yet
Beginning Swift Programming
From Everand
Beginning Swift Programming
Wei-Meng Lee
No ratings yet

Assignment-10 (NLP-part-2)

Uploaded by

Assignment-10 (NLP-part-2)

Uploaded by

Lab Assignment 10

UCS420 Cogni ve Compu ng

Assignment Title: NLP using Python-II

1. Convert text to lowercase and remove punctua on using re.

Q2. Using the same paragraph from Q1:

1. Use CountVectorizer to generate the Bag of Words representa on.

1. Preprocess and tokenize both texts.

Q5. Write a short review for a product or service.

Q6. Choose your own paragraph (~100 words) as training data.

You might also like