ML 2022

ML Data mining PYQ

Uploaded by

Ayush
[This question paper contains 6 printed pages.]

Your Roll No. ...............

Sr. No. of Question Paper : 1262 A
Unique Paper Code         : 32347607
Name of the Paper         : Machine Learning
Name of the Course        : B.Sc. (Hons.) Computer Science (LOCF) (Admission of 2019)
Semester                  : VI
Duration : 3 Hours                                   Maximum Marks : 75

Instructions for Candidates

1. Write your Roll No. on the top immediately on receipt of this question paper.
2. Section A is compulsory.
3. Attempt any 4 (four) questions from Section B.
4. Use of scientific calculator is allowed.

SECTION A

1. (i) Distinguish between supervised learning and unsupervised learning. Illustrate with an example. (5)

   (ii) Define Concept Learning. How can concept learning be viewed as the task of searching? (5)

   (iii) Calculate the output y of a single layer neural network with three input neurons and one output neuron. The input feature vector is (x1, x2, x3) = (0.8, 0.6, 0.4), the weight values are (w1, w2, w3) = (0.2, 0.1, -0.3) and bias = 0.35. Use the binary Sigmoid function as the activation function. (3)

   (iv) Distinguish between overfitting and underfitting. How can they affect model generalization? (5)

   (v) Using an example, discuss how new features can be constructed by forming the Cartesian product of existing features. What are the implications of this approach? (5)

   (vi) Suppose that the probabilities of five events are P(first) = 0.5 and P(second) = P(third) = P(fourth) = P(fifth) = 0.125. Calculate the entropy. (5)

   (vii) Use K-means clustering to cluster the following data into two groups: {2, 4, 10, 12, 3, 20, 30, 11, 25}. Assume the initial cluster centroids are m1 = 2 and m2 = 4. The distance function used is Euclidean distance. (5)

SECTION B

2. (i) Find the least square regression line for the given dataset using the normal equation method. Show computation at each step. (4)

       x1    x2    y
       1     9     1%
       2     1     7
       3     2     12
       4     3     16
       5     4     20

   (ii) Consider the dataset given below having two input variables x1, x2 and one output variable y.
   Update the coefficients θ₀, θ₁ and θ₂ using gradient descent for the logistic regression model. Assume the learning rate = 0.3 and the initial values of the coefficients as θ₀ = -0.5, θ₁ = 1 and θ₂ = -1. Perform one iteration of gradient descent. (6)

       x1    1    3    8    6
       x2    2    5    3    2
       y     0    0    1    1

3. (i) State Bayes Theorem. (2)

   (ii) Consider the training data in the following table, where Play is the class attribute.

       Humidity    Sunny    Wind    Play
       L           N        S       N
       H           N        W       Y
       H           N        W       N
       H           Y        S       Y
       H           N        W       Y
       L           Y        S       N

   Build a Naive Bayes Classifier using the above data. Estimate the class label for the day (Humidity=L, Sunny=N, Wind=W) using the above classifier. (8)

4. (i) Differentiate between Standard and Stochastic Gradient Descent. (4)

   (ii) Identify the first splitting attribute for a decision tree with the following dataset using the ID3 algorithm: (6)

       Field       Experience        Hired
       IT          Coding            No
       IT          Coding            No
       IT          Administration    Yes
       IT          Administration    Yes
       Business    Coding
       Business    Coding            Yes
       Business    Administration    No
       Business    Administration    No

5. Consider the following neural network with initial weights, biases, and training inputs / outputs as mentioned.

   [Figure: neural network diagram; legible labels: b1 = 0.35, b2 = 0.60]

   Given the inputs i1 = 0.05, i2 = 0.10, determine the values of output nodes o1 and o2. Also calculate the prediction error E_total if the actual output values o1 and o2 are 0.01 and 0.99 respectively. Use Sigmoid as the activation function for the hidden as well as the output layers. (10)

6. (i) What is the difference between K-means clustering and the K-Nearest Neighbor classifier? (4)

   (ii) Discuss the steps used by Principal Component Analysis to extract important features. (6)

7. (i) Given the set of values X = (3, 9, 11, 5, 2)ᵀ and y = (1, 8, 11, 4, 3)ᵀ, evaluate the regression coefficients using the ordinary least square method. (4)

   (ii) Assume a total of 1,000 patients are tested for influenza; 900 are found to be healthy, while 100 are found to be sick. A test resulted in 60 being positive and 40 being negative for the sick persons.
   The same test was positive for 120 and negative for 780 in healthy adults. Construct a confusion matrix for the data and determine precision and recall. (6)

8. (i) What is regularization? What is the effect of the following on the model?
       (a) The regularization parameter (λ) is zero
       (b) The regularization parameter (λ) is very large
       (4)

   (ii) State the mathematical formulation of the SVM problem. Give an outline of the method used for solving the classification problem using SVM. (6)

(1100)
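
The numeric parts of Section A, namely (iii), (vi) and (vii), can be cross-checked with a short script. What follows is only a minimal sketch in plain Python, not a model answer. The function names `neuron_output`, `entropy_bits` and `kmeans_1d` are hypothetical helpers introduced for illustration. The sketch also makes three assumptions that the paper does not state: the "binary Sigmoid" is the logistic function 1 / (1 + e^(-z)), entropy is measured in bits (log base 2), and a point equidistant from both centroids goes to the first one.

```python
import math


def sigmoid(z):
    """Logistic (binary sigmoid) activation, assumed here to be 1 / (1 + e^-z)."""
    return 1.0 / (1.0 + math.exp(-z))


def neuron_output(x, w, bias):
    """Output of a single neuron: sigmoid of the weighted input sum plus bias."""
    net = sum(xi * wi for xi, wi in zip(x, w)) + bias
    return sigmoid(net)


def entropy_bits(probs):
    """Shannon entropy in bits (log base 2 is an assumption)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def kmeans_1d(points, centroids, max_iter=100):
    """K-means on scalar data; in 1-D the Euclidean distance is |p - c|.

    Ties are assigned to the lowest-index centroid (an assumption).
    """
    centroids = list(centroids)
    clusters = [[] for _ in centroids]
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            dists = [abs(p - c) for c in centroids]
            clusters[dists.index(min(dists))].append(p)
        # Update step: each centroid moves to the mean of its cluster.
        new_centroids = [
            sum(c) / len(c) if c else old
            for c, old in zip(clusters, centroids)
        ]
        if new_centroids == centroids:  # no movement, so converged
            break
        centroids = new_centroids
    return clusters, centroids


# Section A (iii): net input = 0.16 + 0.06 - 0.12 + 0.35 = 0.45
y = neuron_output((0.8, 0.6, 0.4), (0.2, 0.1, -0.3), 0.35)
print(f"(iii) y = {y:.4f}")  # about 0.6106

# Section A (vi): 0.5 * 1 bit + 4 * 0.125 * 3 bits = 2 bits
h = entropy_bits([0.5, 0.125, 0.125, 0.125, 0.125])
print(f"(vi) entropy = {h} bits")

# Section A (vii): initial centroids m1 = 2, m2 = 4
clusters, centroids = kmeans_1d([2, 4, 10, 12, 3, 20, 30, 11, 25], [2, 4])
print("(vii) clusters:", clusters, "centroids:", centroids)
```

Under these assumptions the clustering settles on {2, 3, 4, 10, 11, 12} with centroid 7 and {20, 25, 30} with centroid 25. Sending the tied point 3 to the second centroid in the first pass leads to the same final grouping.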
