
Week 8: Assignment 8: Assignment Submitted On 2023-09-14, 15:02 IST

The document is a quiz for an online course on business analytics and text mining modeling using Python. It consists of 10 multiple choice questions testing students' understanding of key concepts relating to text transformation, preprocessing, feature extraction, vectorization, and classification modeling. Topics covered include tokenization, lemmatization, stemming, vector space models, part-of-speech tagging, model tuning, and applying text mining techniques like topic modeling to social media data.

Uploaded by

nidhuraji

Business Analytics & Text Mining Modeling Using Python - - Unit 10 - Week 8 21/09/23, 10:46 AM

Week 8 : Assignment 8
The due date for submitting this assignment has passed.
Due on 2023-09-20, 23:59 IST.
Assignment submitted on 2023-09-14, 15:02 IST
1) The process of transforming unstructured text involves the following steps 1 point

Cleaning Text
Tokenization
Stemming or lemmatization
Feature Extraction and Selection
All

Yes, the answer is correct.
Score: 1
Accepted Answers:
All
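
The steps listed in question 1 can be sketched as a small pipeline. This is a minimal illustration using only the Python standard library. The function names (`clean_text`, `tokenize`, `crude_stem`, `extract_features`, `transform`) and the suffix list are assumptions made for this example, and the suffix stripping is a toy stand-in for a real stemmer or lemmatizer.

```python
import re
from collections import Counter

# Hypothetical suffix list for illustration only; real stemmers use far richer rules.
SUFFIXES = ("ing", "ed", "s")

def clean_text(text):
    """Lowercase the text and replace runs of non-alphanumeric characters with a space."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()

def tokenize(text):
    """Split cleaned text into word tokens on whitespace."""
    return text.split()

def crude_stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token

def extract_features(tokens):
    """Represent a document as term-frequency counts."""
    return Counter(tokens)

def transform(text):
    """Run cleaning, tokenization, stemming, and feature extraction in order."""
    tokens = tokenize(clean_text(text))
    stems = [crude_stem(t) for t in tokens]
    return extract_features(stems)

features = transform("Cats jumped! The cat is jumping.")
```

Here `features` maps each stemmed term to its count, so "Cats" and "cat" are counted together, as are "jumped" and "jumping".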

2) A large collection of text data in text mining is known as 1 point

Text database
Data Collection
Corpus
Data Store

Yes, the answer is correct.
Score: 1
Accepted Answers:
Corpus

3) The primary assumption that sentence tokenization has already been performed is necessary in 1 point

Treebank Word Tokenizer
Word Tokenizer
Punctuation Sentence Tokenizer
None of the above

Yes, the answer is correct.
Score: 1
Accepted Answers:
Treebank Word Tokenizer

4) Pattern = r'[^a-zA-Z0-9]' extracts from text 1 point

Characters
Words
Alphanumeric Characters
None of the above

Yes, the answer is correct.
Score: 1
Accepted Answers:
Alphanumeric Characters
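
Question 4's pattern can be checked directly with Python's `re` module. In this sketch the sample string and variable names are assumptions for illustration. Because the character class begins with `^`, the pattern itself matches characters that are not letters or digits; substituting those matches away is what leaves the alphanumeric characters behind, which appears to be the sense of the accepted answer.

```python
import re

# The pattern from question 4: a negated character class.
pattern = r"[^a-zA-Z0-9]"

# Hypothetical sample text for illustration.
text = "Hello, World! 2023 #NLP"

# Characters the pattern matches directly: everything that is NOT a letter or digit.
matched = re.findall(pattern, text)

# Replacing those matches with a space leaves only alphanumeric characters.
cleaned = re.sub(pattern, " ", text)
words = cleaned.split()
```

After this runs, `matched` holds only punctuation and spaces, while `words` holds the alphanumeric tokens that remain.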

5) The process of converting a word to its base form or root form is called 1 point

Tokenization
Lemmatization
Stemming
Both b and c

Yes, the answer is correct.
Score: 1
Accepted Answers:
Both b and c
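
Both techniques in question 5 reduce a word toward a base form, but they work differently. The sketch below contrasts them with toy code: `toy_stem` and `toy_lemmatize` are hypothetical names, and `LEMMA_TABLE` is a made-up three-entry dictionary standing in for the lexical resource a real lemmatizer would consult.

```python
# Hypothetical lookup table standing in for a lemmatizer's dictionary.
LEMMA_TABLE = {"studies": "study", "better": "good", "running": "run"}

def toy_stem(word):
    """Rule-based: chop a common suffix without checking the result is a real word."""
    for suffix in ("ies", "ing", "es", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word

def toy_lemmatize(word):
    """Dictionary-based: return the known base form, or the word unchanged."""
    return LEMMA_TABLE.get(word, word)
```

With these definitions, `toy_stem("studies")` yields the non-word `"stud"`, while `toy_lemmatize("studies")` yields `"study"`. Irregular forms such as "better" are only handled by the dictionary lookup.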

https://onlinecourses.nptel.ac.in/noc23_mg104/unit?unit=73&assessment=115 Page 1 of 2

6) Which of the following stemmers is iterative and has rule-based steps for removing affixes to obtain stems? 1 point

Porter Stemmer
Snowball Stemmer
Lancaster Stemmer
All

Yes, the answer is correct.


Score: 1
Accepted Answers:
Lancaster Stemmer

7) Transforming and representing text documents as numeric vectors of specific terms, which form the vector dimensions, is known as 1 point

Vector space model


Term space model
Both a and b
None of the above

No, the answer is incorrect.


Score: 0
Accepted Answers:
Both a and b
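
The representation described in question 7 can be sketched in a few lines. This is a minimal standard-library illustration, not a production vectorizer: the function names and the two sample documents are assumptions, tokenization is a plain whitespace split, and raw term counts are used as the vector weights.

```python
import math
from collections import Counter

def build_vocabulary(documents):
    """Sorted list of distinct terms; each term is one vector dimension."""
    return sorted({term for doc in documents for term in doc.split()})

def to_vector(document, vocabulary):
    """Term-frequency vector of a document over the vocabulary."""
    counts = Counter(document.split())
    return [counts[term] for term in vocabulary]

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

# Hypothetical sample documents for illustration.
docs = ["text mining with python", "text mining text analytics"]
vocab = build_vocabulary(docs)
vectors = [to_vector(d, vocab) for d in docs]
similarity = cosine_similarity(vectors[0], vectors[1])
```

Once documents are vectors over a shared vocabulary, they can be compared numerically, here with cosine similarity.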

8) Which of the following features can be used for accuracy improvement of a classification model? 1 point

Frequency count of terms


Part of Speech Tag
Dependency Grammar
All of these

Yes, the answer is correct.


Score: 1
Accepted Answers:
All of these

9) What is the right order for text classification model components? 1 point
1. Text cleaning
2. Text annotation
3. Gradient descent
4. Model tuning
5. Text to predictors

12345
13425
12534
13452

Yes, the answer is correct.


Score: 1
Accepted Answers:
12534

10) Social Media platforms are the most intuitive form of text data. You are given a corpus of complete social media data of tweets. How can you create a model that suggests the hashtags? 1 point

Perform Topic Models to obtain most significant words of the corpus


Train a Bag of N grams model to capture top n-grams – words and their combinations
Train a word2vec model to learn repeating contexts in the sentences
All of these

Yes, the answer is correct.


Score: 1
Accepted Answers:
All of these
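
Of the approaches in question 10, the bag of n-grams option is the simplest to sketch. The code below is a frequency-only illustration under stated assumptions: `ngrams` and `suggest_hashtags` are hypothetical names, tokenization is a basic regular expression, and no stop-word filtering is applied, so a realistic version would need more preprocessing.

```python
import re
from collections import Counter

def ngrams(tokens, n):
    """Return all contiguous n-token tuples from a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def suggest_hashtags(tweets, max_n=2, top_k=3):
    """Count 1..max_n-grams across tweets and turn the most frequent into hashtags."""
    counts = Counter()
    for tweet in tweets:
        tokens = re.findall(r"[a-z0-9]+", tweet.lower())
        for n in range(1, max_n + 1):
            counts.update(ngrams(tokens, n))
    return ["#" + "".join(gram) for gram, _ in counts.most_common(top_k)]

# Hypothetical sample tweets for illustration.
tweets = ["machine learning rocks", "machine learning again", "learning machine"]
suggestions = suggest_hashtags(tweets)
```

Frequent single words and word pairs both become candidates, so a repeated phrase such as "machine learning" can surface as one combined hashtag.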
