We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 6
[This question paper contains 6 printed pages.)
Your Roll No......sssseess
Sr. No. of Question Paper; 1262 A
Unique Paper Code : 32347607
Name of the Paper : Machine Learning
Name of the Course : B.Se. (Hons.) Computer Science
(LOCF)
(Admission of 2019)
Semester : VI
Duration : 3 Hours Maximum Marks : 75
Instructions for Candidates
1. Write your Roll No. on the top immediately on receipt of
this question paper.
we
Section A is compulsory.
3. Attempt any 4 (four) questions from Section B.
4, Use of scientific calculator is allowed.
SECTION A
2 G) Distinguish between supervised learning and
unsupervised learning. Illustrate with an example.
> G6)
abot
WF Detine Concept Learning. How the concept learning
’
can be viewed as the task of searching? (5)
P.T.O.1262
Git)
2
Calculate the output y of a single layer neural network
with three input neuron and one output neuron. The
input feature vector is (xl, x2,x3) = (0.8, 0.6, 0.4)
and weight values are [w1, w2, w3] = (0.2, 0.1, -0.3]
and bias = 0.35. Use binary Sigmoid function as
activation function. (3)
(iy Distinguish between overfitting and underfitting. How
(v)
(vi)
Gi
it can affect model generalization? (5)
Using an example discuss how new features can
be constructed by forming Cartesian production of
existing features. What are the implications of this
approach? (5)
Suppose that the probability of five events are P(first)
= 0.5 and P(second) = P(third) = P(fourth) = P(Fifth)
= 0.125. Calculate its entropy. (5)
Use K-means clustering to cluster the following data
into two groups:
{2, 4, 10, 12, 3, 20, 30, 11, 25}
Assume cluster centroid are ml=2 and m2=4, The
distance function used is Euclidean distance. (5)1262
2 Ww
Gi)
a i)
Gi)
3
SECTION B
Find the least square regression line for the given
dataset using the normal equation method. Show
computation at each step.
xl x2 y
1 9 1%
2 1 7
3 2 12]
4 3 16 -
5 4 20 (4)
Consider the dataset given below having two input
variables x1, x2 and onc output variable y. Update the
coefficients 0, 0, and 0, using gradient descent for
the logistic regression model. Assume the learning
rate = 0.3 and the initial values of coefficients as
6, = -0.5, 0, = 1, and 6, = —I. Perform one iteration
of gradient descent. (6)
x1 1 3 8 6
xz] 2 5 3 2
y 0 0 1 1
State Bayes Theorem. (2)
Consider the training data in the following table where
Play is a class attribute.
P.T.O.1262 4
Humidity Sunny Wind | ‘Play
L N s N
H N wi ¥
H N Ww. N
H Y Ss Y
H N _W ¥
L Y s N
Build a Naive Bayes Classifier using the above data.
Estimate the class label for day (Humidity=L, Sunny=N,
Wind=W) using the above classifier. (8)
4. (i) Differentiate between Standard and Stochastic
Gradient Descent. (4)
(ii) Identify the first splitting attribute for decision tree
with the following dataset using ID3 algorithm:
Field Experience Hired
IT. Coding No
IT. Coding _| No
IT Administration Yes
IT Administration Yes
Business Codin,
Business. Coding Yes
Business Administration No
Business Administration No
(6)1262 5 .
5, Consider the following neural network with initial weights
biases, and training input / outputs as mentioned
bto35 b2060
Given the inputs il = 0.05, i2 = 0.10, determine the values
of output nodes 01 and 02. Also calculate the prediction
error E,,,,, if the actual output values ol and 02 are 0.01
and 0.99 respectively. Use Sigmoid as the activation function
for the hidden as well as the output layers. (10)
6. (i) What is the'difference between K-means clustering
and K-Nearest Neighbor classifier? (4)
(ii) Discuss steps used by Principal Component Analysis
to extract important features. (6)
oy (i) Given the set of values X = (3, 9, 11, 5, 2)" and
y = (1, 8, 11, 4, 3)’, Evaluate the regression
coefficients using ordinary least square method.
@)
P.T.O.1262
qi)
Gi)
6
Assume a total of 1,000 patients are tested fo,
influenza; 900 are found to be healthy, while 106 are
found to be sick. A test resulted in 60 being positive
and 40 being negative for the sick persons. The same
lest was positive for 120, and negative for 789 in
healthy adults, Construct a confusion matrix for the
data and determine precision and recall. (6)
What is regularization? What is the effect of the
following on the model?
(a) The regularization parameter (A) is zero
(b) The regularization parameter (4) is very large
@)
State the mathematical formulation of the SVM
problem, Give an outline of the method used for solving
the classification problem using SVM. (6)
(1100)