0% found this document useful (0 votes)

28 views9 pages

Python NLP Assignment

The document provides an overview of Natural Language Processing (NLP), including its definition and real-world applications such as machine translation, sentiment analysis, and chatbots. It also explains key NLP concepts like tokenization, stemming, lemmatization, and part-of-speech tagging, along with practical Python code examples for various NLP tasks. Additionally, it discusses challenges related to ambiguity in natural language through examples of funny newspaper headlines.

Uploaded by

debashisdasmohapatra87

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views9 pages

Python NLP Assignment

Uploaded by

debashisdasmohapatra87

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Department of Computer Science & Engineering

Faculty of Engineering & Technology (ITER)

1. Define Natural Language Processing (NLP). Provide three real-world applications of NLP and explain
how they impact society.

Answer:
Natural Language Processing (NLP) is a field of computer science, artificial intelligence, and linguistics that
focuses on enabling computers to understand, interpret, and generate human language.
Three Real-World Applications of NLP:

1. Machine Translation
o Example: Google Translate
o Impact: Facilitates global communication by breaking down language barriers.
2. Sentiment Analysis
o Example: Social media sentiment analysis
o Impact: Helps businesses understand customer feedback and improve products/services.
3. Chatbots and Virtual Assistants
o Example: Amazon Alexa, Apple Siri
o Impact: Enhances customer service efficiency and reduces human labor costs.

2. Explain the following terms and their significance in NLP: Tokenization, Stemming, Lemmatization

Answer:

• Tokenization
The process of splitting text into individual words or sentences.
Significance: It helps NLP systems understand the basic units of text.
• Stemming
The process of reducing words to their root form (e.g., "running" → "run").
Significance: Reduces vocabulary diversity and improves processing efficiency.
• Lemmatization
The process of reducing words to their base form (e.g., "better" → "good").
Significance: Provides more accurate word meanings compared to stemming.

3. What is Part-of-Speech (POS) tagging? Discuss its importance with an example.

Answer:
POS Tagging: The process of labeling each word in a text with its grammatical part of speech (e.g., noun,
verb, adjective).
Importance: It helps understand the grammatical structure of sentences, which is essential for many NLP
tasks.
Example:

• Sentence: "This is a TextBlob"

• POS Tags:
o "This" → Pronoun
o "is" → Verb
o "a" → Determiner
o "TextBlob" → Noun

Name:———————————– 1 Regd. Number:—————————

Department of Computer Science & Engineering
Faculty of Engineering & Technology (ITER)

4. Create a TextBlob named exercise blob containing ”This is a TextBlob”

Answer:
from textblob import TextBlob

exercise_blob = TextBlob("This is a TextBlob")

print(exercise_blob)

output:
This is a TextBlob

5. Write a Python script to perform the following tasks on the given text:
• Tokenize the text into words and sentences.
• Perform stemming and lemmatization using NLTK or SpaCy.
• Remove stop words from the text.
• Sample Text:
”Natural Language Processing enables machines to understand and process human languages.
It is a fascinating field with numerous applications, such as chatbots and language translation.”

import nltk
from nltk.tokenize import word_tokenize, sent_tokenize
from nltk.stem import PorterStemmer, WordNetLemmatizer
from nltk.corpus import stopwords

nltk.download('punkt')
nltk.download('stopwords')
nltk.download('wordnet')

sample_text = "Natural Language Processing enables machines to understand and process human languages. It is a
fascinating field with numerous applications, such as chatbots and language translation."

# Tokenize into words and sentences

words = word_tokenize(sample_text)
sentences = sent_tokenize(sample_text)

# Perform stemming and lemmatization

stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()

stemmed_words = [stemmer.stem(word) for word in words]

lemmatized_words = [lemmatizer.lemmatize(word) for word in words]

# Remove stop words

stop_words = set(stopwords.words('english'))
filtered_words = [word for word in words if word.lower() not in stop_words]

print("Tokenized Words:", words)

print("Tokenized Sentences:", sentences)
print("Stemmed Words:", stemmed_words)
print("Lemmatized Words:", lemmatized_words)
print("Filtered Words (Stop Words Removed):", filtered_words)

Name:———————————– Regd. Number:—————————

Department of Computer Science & Engineering
Faculty of Engineering & Technology (ITER)

Output:
Tokenized Words: ['Natural', 'Language', 'Processing', 'enables', 'machines', 'to', 'understand', 'and', 'process',
'human', 'languages', '.', 'It', 'is', 'a', 'fascinating', 'field', 'with', 'numerous', 'applications', ',', 'such', 'as', 'chatbots',
'and', 'language', 'translation', '.']
Tokenized Sentences: ['Natural Language Processing enables machines to understand and process human
languages.', 'It is a fascinating field with numerous applications, such as chatbots and language translation.']
Stemmed Words: ['Natur', 'Languag', 'Process', 'enabl', 'machin', 'to', 'understand', 'and', 'process', 'human',
'languag', '.', 'It', 'is', 'a', 'fascin', 'field', 'with', 'numer', 'applic', ',', 'such', 'as', 'chatbot', 'and', 'languag', 'translat', '.']
Lemmatized Words: ['Natural', 'Language', 'Processing', 'enable', 'machines', 'to', 'understand', 'and', 'process',
'human', 'languages', '.', 'It', 'is', 'a', 'fascinating', 'field', 'with', 'numerous', 'applications', ',', 'such', 'as', 'chatbots',
'and', 'language', 'translation', '.']
Filtered Words (Stop Words Removed): ['Natural', 'Language', 'Processing', 'enables', 'machines', 'understand',
'process', 'human', 'languages', '.', 'fascinating', 'field', 'numerous', 'applications', ',', 'chatbots', 'language',
'translation', '.']

6. Web Scraping with the Requests and Beautiful Soup Libraries:

• Use the requests library to download the www.python.org home page’s content.
• Use the Beautiful Soup library to extract only the text from the page.
• Eliminate the stop words in the resulting text, then use the wordcloud module to create a word
cloud based on the text.
Code:
import requests
from bs4 import BeautifulSoup
from wordcloud import WordCloud
import matplotlib.pyplot as plt

url = "https://www.python.org"
response = requests.get(url)
soup = BeautifulSoup(response.content, 'html.parser')
text = soup.get_text()

from nltk.corpus import stopwords

from nltk.tokenize import word_tokenize

nltk.download('punkt')
nltk.download('stopwords')

stop_words = set(stopwords.words('english'))
words = word_tokenize(text)
filtered_words = [word for word in words if word.lower() not in stop_words and word.isalnum()]

wordcloud = WordCloud(width=800, height=400, background_color='white').generate(' '.join(filtered_words))

plt.figure(figsize=(10, 5))
plt.imshow(wordcloud, interpolation='bilinear')
plt.axis('off')
plt.show()

Name:———————————– Regd. Number:—————————

Department of Computer Science & Engineering
Faculty of Engineering & Technology (ITER)

7. (Tokenizing Text and Noun Phrases) Using the text from above problem, create a TextBlob, then
tokenize it into Sentences and Words, and extract its noun phrases.

from textblob import TextBlob

text = "Natural Language Processing enables machines to understand and process human languages. It is a
fascinating field with numerous applications, such as chatbots and language translation."
blob = TextBlob(text)

sentences = blob.sentences
words = blob.words
noun_phrases = blob.noun_phrases

print("Sentences:", sentences)
print("Words:", words)
print("Noun Phrases:", noun_phrases)

output:
Sentences: [Sentence("Natural Language Processing enables machines to understand and process human
languages."), Sentence("It is a fascinating field with numerous applications, such as chatbots and language
translation.")]
Words: WordList(['Natural', 'Language', 'Processing', 'enables', 'machines', 'understand', 'process', 'human',
'languages', '.', 'It', 'is', 'fascinating', 'field', 'numerous', 'applications', ',', 'such', 'chatbots', 'language', 'translation',
'.'])
Noun Phrases: WordList(['Natural Language Processing', 'machines', 'human languages', 'fascinating field',
'numerous applications', 'chatbots', 'language translation'])

8. (Sentiment of a News Article) Using the techniques in problem no. 6, download a web page for a
current news article and create a TextBlob. Display the sentiment for the entire TextBlob and for each
Sentence.

Code:
from textblob import TextBlob
import requests

url = "https://example-news-article.com"
response = requests.get(url)
article_text = response.text

blob = TextBlob(article_text)
print("Overall Sentiment:", blob.sentiment)

for sentence in blob.sentences:

print("Sentence:", sentence)
print("Sentiment:", sentence.sentiment)

output:
Overall Sentiment: Sentiment(polarity=0.5, subjectivity=0.6)
Sentence: This is a sample news article.
Sentiment: Sentiment(polarity=0.5, subjectivity=0.6)
...

Name:———————————– Regd. Number—

Department of Computer Science & Engineering
Faculty of Engineering & Technology (ITER)

9. (Sentiment of a News Article with the NaiveBayesAnalyzer) Repeat the previous exercise but use
the NaiveBayesAnalyzer for sentiment analysis.
Code:
from textblob import TextBlob
from textblob.sentiments import NaiveBayesAnalyzer
import requests

url = "https://example-news-article.com"
response = requests.get(url)
article_text = response.text

blob = TextBlob(article_text, analyzer=NaiveBayesAnalyzer())

print("Overall Sentiment:", blob.sentiment)

for sentence in blob.sentences:

print("Sentence:", sentence)
print("Sentiment:", sentence.sentiment)

output:
Overall Sentiment: Sentiment(classification='pos', p_pos=0.8, p_neg=0.2)
Sentence: This is a sample news article.
Sentiment: Sentiment(classification='pos', p_pos=0.8, p_neg=0.2)
...

10. (Spell Check a Project Gutenberg Book) Download a Project Gutenberg book and create a TextBlob.
Tokenize the TextBlob into Words and determine whether any are misspelled. If so, display the pos-
sible corrections.

Code:
from textblob import TextBlob
import requests

url = "https://www.gutenberg.org/files/1342/1342-0.txt"
response = requests.get(url)
book_text = response.text

blob = TextBlob(book_text)
words = blob.words
misspelled_words = [word for word in words if not word.spellcheck()[0][1] == 1.0]

for word in misspelled_words[:5]: # Show first 5 misspelled words

print("Word:", word)
print("Corrections:", word.spellcheck())

output:
Word: 'Thou'
Corrections: [('Thou', 1.0)]
...

Name:———————————– Regd. Number:—————————

Department of Computer Science & Engineering
Faculty of Engineering & Technology (ITER)

11. (Textatistic: Readability of News Articles) Using the above techniques, download from several
news sites current news articles on the same topic. Perform readability assessments on them to deter-
mine which sites are the most readable. For each article, calculate the average number of words per
sentence, the average number of characters per word and the average number of syllables per word.

Code:
from textblob import TextBlob
import requests

urls = ["https://example-news-site1.com", "https://example-news-site2.com"]

for url in urls:

response = requests.get(url)
text = response.text
blob = TextBlob(text)

avg_words_per_sentence = sum(len(sentence.words) for sentence in blob.sentences) / len(blob.sentences)

avg_chars_per_word = sum(len(word) for word in blob.words) / len(blob.words)
avg_syllables_per_word = sum(len(word.syllables) for word in blob.words) / len(blob.words)

print(f"URL: {url}")
print(f"Average Words per Sentence: {avg_words_per_sentence}")
print(f"Average Characters per Word: {avg_chars_per_word}")
print(f"Average Syllables per Word: {avg_syllables_per_word}")

output:
URL: https://example-news-site1.com
Average Words per Sentence: 15.2
Average Characters per Word: 4.8
Average Syllables per Word: 1.2
...

12. (spaCy: Named Entity Recognition) Using the above techniques, download a current news arti-
cle, then use the spaCy library’s named entity recognition capabilities to display the named entities
(people, places, organizations, etc.) in the article.
Code:

import spacy
import requests

nlp = spacy.load("en_core_web_sm")
url = "https://example-news-article.com"
response = requests.get(url)
text = response.text

doc = nlp(text)
for ent in doc.ents:
print(f"Entity: {ent.text}, Label: {ent.label_}")

Name:———————————– Regd. Number:—————————

Department of Computer Science & Engineering
Faculty of Engineering & Technology (ITER)

13. (spaCy: Shakespeare Similarity Detection) Using the spaCy techniques, download a Shakespeare
comedy from Project Gutenberg and compare it for similarity with Romeo and Juliet.

Code:
import spacy
from spacy.util import minibatch, compounding
import requests

nlp = spacy.load("en_core_web_md")

url1 = "https://www.gutenberg.org/files/1513/1513-0.txt" # Romeo and Juliet

url2 = "https://www.gutenberg.org/files/1287/1287-0.txt" # A Midsummer Night's Dream

response1 = requests.get(url1)
response2 = requests.get(url2)

doc1 = nlp(response1.text)
doc2 = nlp(response2.text)

print("Similarity:", doc1.similarity(doc2))

output:
Similarity: 0.78

14. (textblob.utils Utility Functions) Use strip punc and lowerstrip functions of TextBlob’s textblob.utils
module with all=True keyword argument to remove punctuation and to get a string in all lowercase
letters with whitespace and punctuation removed. Experiment with each function on Romeo and
Juliet.
Code:

from textblob import TextBlob

from textblob.utils import strip_punc, lowerstrip

text = "Romeo and Juliet"

print("Original Text:", text)

print("Stripped Punctuation:", strip_punc(text, all=True))
print("Lowercase and Stripped:", lowerstrip(text, all=True))

output:
Original Text: Romeo and Juliet
Stripped Punctuation: Romeo and Juliet
Lowercase and Stripped: romeo and Juliet

15. (Research: Funny Newspaper Headlines) To understand how tricky it is to work with natural lan-
guage and its inherent ambiguity issues, research “funny newspaper headlines.” List the challenges
you find.

Name:———————————– Regd. Number:—————————

Department of Computer Science & Engineering
Faculty of Engineering & Technology (ITER)

Challenges with Ambiguity in Natural Language:

Pun-based Headlines:
Example: "Man Eats Dog, Gets 10 Years in Prison"
Challenge: "Eats" can be misinterpreted as "eats" or "eats up".
Misplaced Modifiers:
Example: "Woman Finds Giant Squid in Her Garden"
Challenge: "Giant" could refer to the squid or the woman.
Ambiguous Pronouns:
Example: "They Say the Sky is Falling"
Challenge: "They" could refer to scientists, politicians, or anyone.

Name:———————————– Regd. Number:—————————

(Android) (Guide) Hacking and Bypassing Android Password - Pattern - Face - PIN Lock - General Chat - MIUI Official English Site - Redefining Android
100% (2)
(Android) (Guide) Hacking and Bypassing Android Password - Pattern - Face - PIN Lock - General Chat - MIUI Official English Site - Redefining Android
13 pages
Calibration 1721242368
No ratings yet
Calibration 1721242368
35 pages
Promorpheus
100% (4)
Promorpheus
927 pages
Astrology of The Seers Final Test All Parts Questions
100% (1)
Astrology of The Seers Final Test All Parts Questions
14 pages
Government Agencies and Its Functions Cabinet Members of The Philippines Government
No ratings yet
Government Agencies and Its Functions Cabinet Members of The Philippines Government
7 pages
CSE 3652 Lab Record Format - PDF
No ratings yet
CSE 3652 Lab Record Format - PDF
13 pages
Tinywow Pythass3 77951173
No ratings yet
Tinywow Pythass3 77951173
17 pages
NLP Smitpatel
No ratings yet
NLP Smitpatel
32 pages
Jal Patel NLP
No ratings yet
Jal Patel NLP
32 pages
Exp 5
No ratings yet
Exp 5
2 pages
NLP Manual (1-12)
No ratings yet
NLP Manual (1-12)
55 pages
NLP Lab Manual-1
No ratings yet
NLP Lab Manual-1
18 pages
NLP Manual (1-12) 1
No ratings yet
NLP Manual (1-12) 1
56 pages
NLP Record
No ratings yet
NLP Record
6 pages
CSDM2-Text Preprocessing For NL Data - 011050
No ratings yet
CSDM2-Text Preprocessing For NL Data - 011050
6 pages
Natural Language Processing
No ratings yet
Natural Language Processing
34 pages
NLP Lab Work
No ratings yet
NLP Lab Work
34 pages
NLP Lab - Manual
No ratings yet
NLP Lab - Manual
33 pages
NLP Manual (1-12)
No ratings yet
NLP Manual (1-12)
54 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
33 pages
Natural Language Processing: Practical 1
No ratings yet
Natural Language Processing: Practical 1
64 pages
NLP Programs
No ratings yet
NLP Programs
5 pages
Lab Prgms Weel1-Output
No ratings yet
Lab Prgms Weel1-Output
4 pages
ch5&6 Lecture AI
No ratings yet
ch5&6 Lecture AI
69 pages
Shubham Jade MSC It 31031420010 NLP Practical Journal
No ratings yet
Shubham Jade MSC It 31031420010 NLP Practical Journal
17 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
13 pages
SK NLP Practical (FS)
No ratings yet
SK NLP Practical (FS)
22 pages
Wsma Final Manual
No ratings yet
Wsma Final Manual
58 pages
NLP M1
No ratings yet
NLP M1
31 pages
Top 30 NLP Interview Questions and Answers: 1. What Do You Understand by Natural Language Processing?
No ratings yet
Top 30 NLP Interview Questions and Answers: 1. What Do You Understand by Natural Language Processing?
18 pages
Natural Language Processing
No ratings yet
Natural Language Processing
25 pages
Lecture 8 - Text Analytics NLP
No ratings yet
Lecture 8 - Text Analytics NLP
24 pages
Rajeev Mishra 20 SCSE1180087
No ratings yet
Rajeev Mishra 20 SCSE1180087
29 pages
Dsbdal A7
No ratings yet
Dsbdal A7
65 pages
Text Preprocessing Stages
No ratings yet
Text Preprocessing Stages
8 pages
Clint-Roy Muvirimi-Mukarakate H1802386 AI Practical Assignment
No ratings yet
Clint-Roy Muvirimi-Mukarakate H1802386 AI Practical Assignment
8 pages
NLP - Record (Weeks 1-12)
No ratings yet
NLP - Record (Weeks 1-12)
41 pages
NLP Unit 1 Part1
No ratings yet
NLP Unit 1 Part1
61 pages
NLP
No ratings yet
NLP
12 pages
Natural Language Processing
No ratings yet
Natural Language Processing
12 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
21 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
19 pages
nlp exp 5 B707
No ratings yet
nlp exp 5 B707
5 pages
Date: Practical No.4:: Foundation of AI and ML (4351601)
No ratings yet
Date: Practical No.4:: Foundation of AI and ML (4351601)
10 pages
NLP Final Review
No ratings yet
NLP Final Review
32 pages
UBC Summer School in NLP - VSP 2019 Lecture 10
No ratings yet
UBC Summer School in NLP - VSP 2019 Lecture 10
33 pages
TSA Book
No ratings yet
TSA Book
154 pages
Minor Assignment-3 (NLP)
No ratings yet
Minor Assignment-3 (NLP)
2 pages
NLP-Lab Manual - Ashwini - Kachare
No ratings yet
NLP-Lab Manual - Ashwini - Kachare
41 pages
DLT Unit-5
No ratings yet
DLT Unit-5
48 pages
Aiml P4
No ratings yet
Aiml P4
12 pages
NLP Lab1
No ratings yet
NLP Lab1
6 pages
Unraveling The Power of Natural Language Processing
No ratings yet
Unraveling The Power of Natural Language Processing
11 pages
Natural Language Processing
No ratings yet
Natural Language Processing
17 pages
Text Modication Methods For Natural Language Generation: Universitat Autònoma de Barcelona
No ratings yet
Text Modication Methods For Natural Language Generation: Universitat Autònoma de Barcelona
44 pages
Natural Language Processing
No ratings yet
Natural Language Processing
17 pages
01 NLP - Merged Vinay
No ratings yet
01 NLP - Merged Vinay
27 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
25 pages
NLP Lab Manual Lab Work
No ratings yet
NLP Lab Manual Lab Work
24 pages
NLP Preprocessing Steps 1740444240
No ratings yet
NLP Preprocessing Steps 1740444240
20 pages
NLP Module 1
No ratings yet
NLP Module 1
71 pages
Sahil NLP
No ratings yet
Sahil NLP
16 pages
NLP FinAL
No ratings yet
NLP FinAL
27 pages
Text Preprocessing For NLP
No ratings yet
Text Preprocessing For NLP
15 pages
CHP 1 (Completed)
No ratings yet
CHP 1 (Completed)
61 pages
Kits de Maintenance
100% (1)
Kits de Maintenance
54 pages
Euclid
No ratings yet
Euclid
10 pages
Lmi Bending Machine Stand Alone Hints
No ratings yet
Lmi Bending Machine Stand Alone Hints
2 pages
w38zTRjvb5iGFg2HQzGyG763w - 2024 11 24
No ratings yet
w38zTRjvb5iGFg2HQzGyG763w - 2024 11 24
1 page
Septic Tank Detail Lavatory and Water Closet Installation Detail P-4
No ratings yet
Septic Tank Detail Lavatory and Water Closet Installation Detail P-4
1 page
Module 1 WESM Fundamentals 2025
No ratings yet
Module 1 WESM Fundamentals 2025
38 pages
The Holy Grail of Quantum Artificial Intelligence: Major Challenges in Accelerating The Machine Learning Pipeline
No ratings yet
The Holy Grail of Quantum Artificial Intelligence: Major Challenges in Accelerating The Machine Learning Pipeline
7 pages
Control Theory SYLLABUS
No ratings yet
Control Theory SYLLABUS
3 pages
Adi CB Manual Intek M-790 - SM
No ratings yet
Adi CB Manual Intek M-790 - SM
61 pages
417-01 Exterior Lighting: Stop and Reverse Lamps
No ratings yet
417-01 Exterior Lighting: Stop and Reverse Lamps
11 pages
Jy997d23401 (E) M
No ratings yet
Jy997d23401 (E) M
3 pages
K.R.V.N Bangarraju: Education Skills
No ratings yet
K.R.V.N Bangarraju: Education Skills
1 page
Bfa Thesis Paper Examples
100% (3)
Bfa Thesis Paper Examples
7 pages
HXU18UA - Senzor Hall Easy Eda
No ratings yet
HXU18UA - Senzor Hall Easy Eda
3 pages
Potential and Kinetic Energy
No ratings yet
Potential and Kinetic Energy
3 pages
QQQ
No ratings yet
QQQ
2 pages
English 7 - Q1 - DW1
No ratings yet
English 7 - Q1 - DW1
4 pages
Bs 621: Human Resource Management: Topic 5: Performance Evaluation
No ratings yet
Bs 621: Human Resource Management: Topic 5: Performance Evaluation
42 pages
3ADR010030, 8, en - US, AI531 - Data - Sheet
No ratings yet
3ADR010030, 8, en - US, AI531 - Data - Sheet
14 pages
Wireless World 1967 07
No ratings yet
Wireless World 1967 07
62 pages
Hamlet and Personal Identity PDF
No ratings yet
Hamlet and Personal Identity PDF
8 pages
Hotaa 2014
No ratings yet
Hotaa 2014
14 pages
Role Play
No ratings yet
Role Play
7 pages
Bla Power Pvt. LTD: Woodward 505 Governor Valve / Actuator Calibration &test
No ratings yet
Bla Power Pvt. LTD: Woodward 505 Governor Valve / Actuator Calibration &test
23 pages

Python NLP Assignment

Uploaded by

Python NLP Assignment

Uploaded by

Department of Computer Science & Engineering

Faculty of Engineering & Technology (ITER)

3. What is Part-of-Speech (POS) tagging? Discuss its importance with an example.

• Sentence: "This is a TextBlob"

Name:———————————– 1 Regd. Number:—————————

4. Create a TextBlob named exercise blob containing ”This is a TextBlob”

exercise_blob = TextBlob("This is a TextBlob")

# Tokenize into words and sentences

# Perform stemming and lemmatization

stemmed_words = [stemmer.stem(word) for word in words]

# Remove stop words

print("Tokenized Words:", words)

Name:———————————– Regd. Number:—————————

6. Web Scraping with the Requests and Beautiful Soup Libraries:

from nltk.corpus import stopwords

wordcloud = WordCloud(width=800, height=400, background_color='white').generate(' '.join(filtered_words))

Name:———————————– Regd. Number:—————————

from textblob import TextBlob

for sentence in blob.sentences:

Name:———————————– Regd. Number—

blob = TextBlob(article_text, analyzer=NaiveBayesAnalyzer())

for sentence in blob.sentences:

for word in misspelled_words[:5]: # Show first 5 misspelled words

Name:———————————– Regd. Number:—————————

urls = ["https://example-news-site1.com", "https://example-news-site2.com"]

for url in urls:

avg_words_per_sentence = sum(len(sentence.words) for sentence in blob.sentences) / len(blob.sentences)

Name:———————————– Regd. Number:—————————

url1 = "https://www.gutenberg.org/files/1513/1513-0.txt" # Romeo and Juliet

from textblob import TextBlob

text = "Romeo and Juliet"

print("Original Text:", text)

Name:———————————– Regd. Number:—————————

Challenges with Ambiguity in Natural Language:

Name:———————————– Regd. Number:—————————

You might also like