0% found this document useful (0 votes)
168 views

Big Data Computing - Assignment 6

This document is a summary of an online NPTEL course assessment for Big Data Computing - Unit 8 Week 6. It provides 8 multiple choice questions related to machine learning algorithms and evaluations. The assessment is due on October 6, 2021 and allows multiple submissions before the due date, with the final submission used for grading.

Uploaded by

VarshaMega
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
168 views

Big Data Computing - Assignment 6

This document is a summary of an online NPTEL course assessment for Big Data Computing - Unit 8 Week 6. It provides 8 multiple choice questions related to machine learning algorithms and evaluations. The assessment is due on October 6, 2021 and allows multiple submissions before the due date, with the final submission used for grading.

Uploaded by

VarshaMega
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

9/27/21, 9:32 PM Big Data Computing - - Unit 8 - Week-6

Assessment submitted.

(https://swayam.gov.in)      

(https://swayam.gov.in/nc_details/NPTEL)
X

[email protected]

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Big Data Computing (course)

Course
Thank you for taking the Week - 6
outline : Assignment-6.
How does an
NPTEL online
course work?
Week - 6 : Assignment-6
Week-0 Your last recorded submission was on 2021-09-27, 21:32 Due date: 2021-10-06, 23:59 IST.
IST
Week-1
1) Which of the following is required by K-means clustering ? 1 point

Week-2
Defined distance metric

Number of clusters
Week-3

Initial guess as to cluster centroids
Week-4
All of the mentioned

2) Identify the correct statement in context of Regressive model of Machine Learning. 1 point
Week-5

Regressive model predicts a numeric value instead of category.
Week-6
Regressive model organizes similar item in your dataset into groups.

Big Data
Regressive model comes up with a set of rules to capture associations between items or
Machine events.
Learning (Part-
None of the Mentioned
I) (unit?
unit=59&lesson=60) 3) Which of the following tasks can be best solved using Clustering ? 1 point

Big Data
Predicting the amount of rainfall based on various cues
Machine
Learning (Part-

Training a robot to solve a maze
II) (unit?
Detecting fraudulent credit card transactions
unit=59&lesson=61)

All of the mentioned
Machine
4) Identify the correct method for choosing the value of ‘k’ in k-means algorithm ? 1 point
Learning
Algorithm K-

Dimensionality reduction
means using
Map Reduce
Elbow method
for Big Data
Both Dimensionality reduction and Elbow method
Analytics

Data partitioning

https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=59&assessment=97 1/3
9/27/21, 9:32 PM Big Data Computing - - Unit 8 - Week-6

(unit? 5) Identify the correct statement(s) in context of overfitting in decision trees:


1 point
Assessment submitted.
unit=59&lesson=62)

X Statement I: The idea of Pre-pruning is to stop tree induction before a fully grown tree is built,
Parallel K-
means using that perfectly fits the training data.

Map Reduce

on Big Data Statement II: The idea of Post-pruning is to grow a tree to its maximum size and then remove
Cluster the nodes using a top-bottom approach.
Analysis (unit?
unit=59&lesson=63)
Only Statement I is true
Week-6:
Only Statement II is true
Lecture
Both Statements are true
material (unit?

Both Statements are false
unit=59&lesson=64)

Quiz: Week -
6) Which of the following options is/are true for K-fold cross-validation ?
1 point
6:

Assignment-6 1. Increase in K will result in higher time required to cross validate the result.

(assessment? 2. Higher values of K will result in higher confidence on the cross-validation result as
name=97) compared to lower value of K.

3. If K=N, then it is called Leave one out cross validation, where N is the number of
Text Transcripts observations.

Books
1 and 2

2 and 3
 

1 and 3

1, 2 and 3

7) Imagine you are working on a project which is a binary classification problem. You 1 point
trained a model on training dataset and get the below confusion matrix on validation dataset.

Based on the above confusion matrix, choose which option(s) below will give you correct
predictions ?

1. Accuracy is ~0.91

2. Misclassification rate is ~ 0.91

3. False positive rate is ~0.95

4. True positive rate is ~0.95


1 and 3

2 and 4

2 and 3

1 and 4

https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=59&assessment=97 2/3
9/27/21, 9:32 PM Big Data Computing - - Unit 8 - Week-6

8) Identify the correct statement(s) in context of machine learning approaches:


1 point
Assessment submitted.

X Statement I: In supervised approaches, the target that the model is predicting is unknown or
unavailable. This means that you have unlabeled data.

Statement II: In unsupervised approaches the target, which is what the model is predicting, is
provided. This is referred to as having labeled data because the target is labeled for every
sample that you have in your data set.


Only Statement I is true

Only Statement II is true

Both Statements are false

Both Statements are true

You may submit any number of times before the due date. The final submission will be
considered for grading.
Submit Answers

https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=59&assessment=97 3/3

You might also like