Big Data Computing - Assignment 6
Big Data Computing - Assignment 6
Assessment submitted.
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
X
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Big Data Computing (course)
Course
Thank you for taking the Week - 6
outline : Assignment-6.
How does an
NPTEL online
course work?
Week - 6 : Assignment-6
Week-0 Your last recorded submission was on 2021-09-27, 21:32 Due date: 2021-10-06, 23:59 IST.
IST
Week-1
1) Which of the following is required by K-means clustering ? 1 point
Week-2
Defined distance metric
Number of clusters
Week-3
Initial guess as to cluster centroids
Week-4
All of the mentioned
2) Identify the correct statement in context of Regressive model of Machine Learning. 1 point
Week-5
Regressive model predicts a numeric value instead of category.
Week-6
Regressive model organizes similar item in your dataset into groups.
Big Data
Regressive model comes up with a set of rules to capture associations between items or
Machine events.
Learning (Part-
None of the Mentioned
I) (unit?
unit=59&lesson=60) 3) Which of the following tasks can be best solved using Clustering ? 1 point
Big Data
Predicting the amount of rainfall based on various cues
Machine
Learning (Part-
Training a robot to solve a maze
II) (unit?
Detecting fraudulent credit card transactions
unit=59&lesson=61)
All of the mentioned
Machine
4) Identify the correct method for choosing the value of ‘k’ in k-means algorithm ? 1 point
Learning
Algorithm K-
Dimensionality reduction
means using
Map Reduce
Elbow method
for Big Data
Both Dimensionality reduction and Elbow method
Analytics
Data partitioning
https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=59&assessment=97 1/3
9/27/21, 9:32 PM Big Data Computing - - Unit 8 - Week-6
X Statement I: The idea of Pre-pruning is to stop tree induction before a fully grown tree is built,
Parallel K-
means using that perfectly fits the training data.
Map Reduce
on Big Data Statement II: The idea of Post-pruning is to grow a tree to its maximum size and then remove
Cluster the nodes using a top-bottom approach.
Analysis (unit?
unit=59&lesson=63)
Only Statement I is true
Week-6:
Only Statement II is true
Lecture
Both Statements are true
material (unit?
Both Statements are false
unit=59&lesson=64)
Quiz: Week -
6) Which of the following options is/are true for K-fold cross-validation ?
1 point
6:
Assignment-6 1. Increase in K will result in higher time required to cross validate the result.
(assessment? 2. Higher values of K will result in higher confidence on the cross-validation result as
name=97) compared to lower value of K.
3. If K=N, then it is called Leave one out cross validation, where N is the number of
Text Transcripts observations.
Books
1 and 2
2 and 3
1 and 3
1, 2 and 3
7) Imagine you are working on a project which is a binary classification problem. You 1 point
trained a model on training dataset and get the below confusion matrix on validation dataset.
Based on the above confusion matrix, choose which option(s) below will give you correct
predictions ?
1. Accuracy is ~0.91
1 and 3
2 and 4
2 and 3
1 and 4
https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=59&assessment=97 2/3
9/27/21, 9:32 PM Big Data Computing - - Unit 8 - Week-6
X Statement I: In supervised approaches, the target that the model is predicting is unknown or
unavailable. This means that you have unlabeled data.
Statement II: In unsupervised approaches the target, which is what the model is predicting, is
provided. This is referred to as having labeled data because the target is labeled for every
sample that you have in your data set.
Only Statement I is true
Only Statement II is true
Both Statements are false
Both Statements are true
You may submit any number of times before the due date. The final submission will be
considered for grading.
Submit Answers
https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=59&assessment=97 3/3