
Sessional – 2, April 2023

Machine Learning (DSE 2254), IV Sem, DSE

Date: 19/04/2023 Max. Marks: 15


Duration: 1 hr

Instructions to Candidates
- Answer ALL the questions.
- Use of a calculator is allowed. Use of a mobile phone is NOT allowed.

Type: MCQ

Q1. What is the goal of Principal Component Analysis (PCA)? (0.5)


1. **To transform the original features into a new set of uncorrelated features.
2. To find the optimal decision boundary between classes.
3. To identify the most important features in a dataset.
4. To reduce the number of features in a dataset.
Q2. The ____________ probability is one of the quantities involved in Bayes' rule. It is the
conditional probability of a given event, computed after observing a second event whose conditional
and unconditional probabilities were known in advance. It is computed by revising the prior
probability. (0.5)
1. Prior.
2. Zero.
3. **Posterior.
4. None of these.
Q3. How is the number of principal components chosen in PCA? (0.5)
1. Based on the number of features in the dataset.
2. **Based on the amount of variance explained by each component.
3. Based on the correlation between each pair of features.
4. None of these
Q4. What is the relationship between principal components and original features in PCA?(0.5)
1. Each principal component represents a single original feature.
2. Each original feature is a linear combination of all the principal components.
3. ** Each principal component is a linear combination of all the original features.
4. There is no relationship between principal components and original features.
Q5. What is the primary goal of linear discriminant analysis? (0.5)
1. To reduce the dimensionality of a dataset
2. **To find the decision boundary that maximizes class separation
3. To identify the most important features in a dataset
4. To fit a linear regression model to the data
Q6. Which of the following is a use case for linear discriminant analysis? (0.5)
1. **Identifying fraudulent credit card transactions
2. Predicting the price of a house
3. Segmenting customers based on demographics
4. All of these
Q7. Eigenvectors are defined only for a ________ matrix. (0.5)
1. Identity
2. ** Square
3. Orthogonal
4. Diagonal
Q8. Which of the following is false about non-linear SVM? (0.5)
1. It can only handle linearly separable data
2. It always ignores the outliers in data
3. It is only applicable to binary classification problems
4. **All these are false
Q9. Which of the following is true about SVM? (0.5)
1. It is only applicable to binary classification problems
2. **It can handle high-dimensional data
3. It cannot handle nonlinear data
4. It is a type of unsupervised learning algorithm
Q10. Suppose that an individual is extracted at random from a population of men. The
probability of extracting a married individual is 50%. The probability of extracting a childless
individual is 40%. The conditional probability that an individual is childless given that he is
married is equal to 20%. If the individual we extract at random from the population turns out
to be childless, what is the conditional or posterior probability that he is married? (0.5)
1. ** 1/4
2. 2/3
3. 3/8
4. 1/2
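The posterior in Q10 follows directly from Bayes' rule, P(married | childless) = P(childless | married) * P(married) / P(childless). A quick numerical check (variable names are illustrative):

```python
# Bayes' rule check for Q10.
p_married = 0.5                    # P(married)
p_childless = 0.4                  # P(childless)
p_childless_given_married = 0.2    # P(childless | married)

# Posterior: revise the prior P(married) after observing "childless".
p_married_given_childless = p_childless_given_married * p_married / p_childless
print(p_married_given_childless)   # 0.25, i.e. 1/4
```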
Type: DES

Q11. Explain the role of the Kernel function in SVM. Discuss different types of Kernel functions. (2)
Role of the Kernel in SVM: 0.5 Marks
At least 3 different kernel functions with a one-line definition each: 0.5 * 3 = 1.5 Marks
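As a hedged illustration (pure Python; the function names and hyperparameter defaults are my own, chosen to mirror common library conventions), three widely used kernel functions can be sketched as:

```python
import math

# The kernel trick: replace the inner product <x, z> with K(x, z) so the SVM
# can learn non-linear boundaries without an explicit feature mapping.

def linear_kernel(x, z):
    # K(x, z) = <x, z>
    return sum(a * b for a, b in zip(x, z))

def polynomial_kernel(x, z, degree=3, coef0=1.0):
    # K(x, z) = (<x, z> + c)^d
    return (linear_kernel(x, z) + coef0) ** degree

def rbf_kernel(x, z, gamma=0.5):
    # K(x, z) = exp(-gamma * ||x - z||^2)  (Gaussian / RBF kernel)
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-gamma * sq_dist)

x, z = [1.0, 2.0], [2.0, 0.0]
print(linear_kernel(x, z))       # 2.0
print(polynomial_kernel(x, z))   # (2 + 1)^3 = 27.0
print(rbf_kernel(x, z))          # exp(-0.5 * 5) = exp(-2.5) ≈ 0.0821
```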

Q12. Explain any FOUR (4) dimensionality reduction techniques. (2)

- Only list FOUR DR methods: 0.5 marks
- Explain FOUR DR methods: 2.0 marks (0.5 each)

Dimensionality reduction techniques are used to reduce the number of features or variables
in a dataset while still retaining the important information. This is particularly useful when
dealing with high-dimensional data where the number of variables is much larger than the
number of observations.
Principal Component Analysis (PCA): PCA is a linear dimensionality reduction technique that
aims to find a new set of uncorrelated variables, known as principal components, that
capture the maximum amount of variance in the original data. The first principal component
captures the direction of maximum variance in the data, the second captures the direction
of the maximum remaining variance, and so on. PCA is commonly used in data visualization,
feature extraction, and data compression.
t-Distributed Stochastic Neighbor Embedding (t-SNE): t-SNE is a nonlinear dimensionality
reduction technique that maps high-dimensional data onto a low-dimensional space
(typically 2D or 3D) by preserving the pairwise similarities between data points. It uses a
probabilistic approach to model the similarity between points in high-dimensional space and
low-dimensional space, with a focus on preserving the structure of the data. t-SNE is
particularly useful for visualizing high-dimensional data, as it can reveal the underlying
structure and relationships between the data points.
Uniform Manifold Approximation and Projection (UMAP): UMAP is a nonlinear
dimensionality reduction technique that is similar to t-SNE, but uses a different approach to
construct the low-dimensional representation. UMAP works by constructing a high-
dimensional graph of the data points and then using a smooth function to map the points
onto a low-dimensional space. This smooth function is designed to preserve the local
structure of the data, which makes UMAP particularly useful for preserving the cluster
structure of the data.
Locally Linear Embedding (LLE): LLE is a nonlinear dimensionality reduction technique that
works by finding a low-dimensional representation of the data that preserves the local
structure of the data. LLE constructs a graph of the data points and then finds a low-
dimensional representation that minimizes the difference between the distances in the high-
dimensional space and the distances in the low-dimensional space. LLE is particularly useful
for preserving the local structure of the data, which makes it useful for data visualization and
anomaly detection.
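The PCA procedure described above (find uncorrelated directions of maximum variance and project onto the top few) can be sketched numerically. This is a minimal NumPy illustration under my own assumptions (random data, covariance eigen-decomposition), not part of the answer key:

```python
import numpy as np

# Minimal PCA sketch via eigen-decomposition of the covariance matrix.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
X = X - X.mean(axis=0)                      # centre the data

cov = np.cov(X, rowvar=False)               # 3x3 covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov)      # eigh returns ascending eigenvalues
order = np.argsort(eigvals)[::-1]           # sort components by variance
components = eigvecs[:, order]

explained = eigvals[order] / eigvals.sum()  # fraction of variance per component
X_reduced = X @ components[:, :2]           # project onto the top-2 components

print(explained.round(3), X_reduced.shape)
```

The `explained` ratios are what Q3 refers to: components are kept until enough cumulative variance is captured.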

Q13. Predict whether the tuple (1.8, 2.1) belongs to Class A or Class B using the principles of
Maximum Likelihood Estimation. (3)
- Formulae: 0.5 marks
- Correct steps: 2.0 marks
- Correct answer: 0.5 marks

              µx       µy      σx      σy
Class A     -0.19     5.03    4.12    1.78
Class B     -2.18    -2.84    2.04    0.85

likelihood_A = (1 / (2 * pi * 4.12 * 1.78)) * exp(-((1.8 - (-0.19)) / 4.12)**2 / 2)
             * exp(-((2.1 - 5.03) / 1.78)**2 / 2) ≈ 0.00498
likelihood_B = (1 / (2 * pi * 2.04 * 0.85)) * exp(-((1.8 - (-2.18)) / 2.04)**2 / 2)
             * exp(-((2.1 - (-2.84)) / 0.85)**2 / 2) ≈ 6.3e-10
Since likelihood_A is greater than likelihood_B, the tuple (1.8, 2.1) belongs to Class A.
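The likelihood comparison can be verified with a short script that evaluates the bivariate Gaussian density (with independent dimensions, the product of two 1-D normal densities; the function name is illustrative):

```python
import math

def gaussian_likelihood(x, y, mu_x, mu_y, sigma_x, sigma_y):
    # Bivariate Gaussian with independent dimensions:
    # product of two 1-D normal densities.
    norm = 1.0 / (2 * math.pi * sigma_x * sigma_y)
    ex = math.exp(-((x - mu_x) / sigma_x) ** 2 / 2)
    ey = math.exp(-((y - mu_y) / sigma_y) ** 2 / 2)
    return norm * ex * ey

# Parameters from the Q13 table.
lA = gaussian_likelihood(1.8, 2.1, -0.19, 5.03, 4.12, 1.78)
lB = gaussian_likelihood(1.8, 2.1, -2.18, -2.84, 2.04, 0.85)

print(lA, lB)
print("Class A" if lA > lB else "Class B")
```

Evaluating the density directly gives likelihood_A on the order of 5e-3 and likelihood_B on the order of 6e-10, so the tuple is assigned to Class A.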
Q14. Answer the following questions: (3)
A) Explain different ways of measuring impurity in data using Decision tree model.
3 methods: Gain Ratio, Entropy and Gini Index – 0.5 * 3 = 1.5 Marks
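A hedged sketch of two of these impurity measures for a list of class probabilities at a node (gain ratio builds on entropy via split information and is omitted here):

```python
import math

def entropy(probs):
    # Entropy: -sum(p * log2(p)); 0 for a pure node, max for a uniform split.
    return -sum(p * math.log2(p) for p in probs if p > 0)

def gini(probs):
    # Gini index: 1 - sum(p^2); 0 for a pure node.
    return 1.0 - sum(p ** 2 for p in probs)

# A node with a 50/50 class split is maximally impure for two classes.
print(entropy([0.5, 0.5]))  # 1.0
print(gini([0.5, 0.5]))     # 0.5
print(entropy([1.0]))       # 0.0 (pure node)
```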

B) Explain different ways of fitting the decision tree model to avoid over-fitting.


Ans: Overfitting is the phenomenon in which the trained model fits the training data too closely and
does not perform well on test or unseen data instances, violating the principle of
Generalization. Definition: 0.5 M
Overfitting can be avoided by pre-pruning (halting tree growth early, e.g. by limiting tree depth or
requiring a minimum number of tuples per node) or by post-pruning (growing the full tree and then
removing branches that do not improve accuracy on held-out data). Removing redundant or irrelevant
attributes from the original data also helps.
2 methods with a one-line explanation each: 0.5 * 2 = 1 M
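As an illustrative sketch (scikit-learn classes assumed to be available; the dataset and hyperparameters are invented for the example), limiting tree depth, a simple form of pre-pruning, looks like:

```python
# Depth limiting as pre-pruning with scikit-learn's DecisionTreeClassifier.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# An unrestricted tree memorizes the training set (train accuracy 1.0).
full = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)
# A depth-limited tree trades training accuracy for better generalization.
pruned = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_tr, y_tr)

print("full:  ", full.score(X_tr, y_tr), full.score(X_te, y_te))
print("pruned:", pruned.score(X_tr, y_tr), pruned.score(X_te, y_te))
```

Post-pruning is also available in scikit-learn via cost-complexity pruning (the `ccp_alpha` parameter).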
