Question Bank MLT
QUESTION BANK
Programme & Branch B.E & Electronics and Communication Engineering Semester VI
Course Code & Name 60 EC 604 & Machine Learning Techniques
Unit - I
2 a Compare Lasso regression and Ridge Regression in terms of bias and variance. 6 An CO2 PO1
b Suppose you have a dataset with two classes that are not linearly separable. 10 Un CO2 PO2
How would you use a support vector machine (SVM) to classify this data?
Discuss the use of kernel functions and the choice of parameters in this scenario.
3 a Study the relationship between monthly e-commerce sales and online 10 Ap CO2 PO2
advertising costs. The survey results for 7 online stores for the last year are
given. Your task is to find the equation of the straight line that best fits the data
and plot the regression line for the sample values. The following table represents
the survey results from the 7 online stores.
b What is the purpose of regularization in machine learning? How do regularization 6 Ap CO2 PO1
parameters, such as lambda in L1 and L2 regularization, influence the trade-off
between bias and variance in machine learning models? Explain.
4 a Explain the fundamental principles of logistic regression and analyze how it 8 Ap CO2 PO1
compares to other classification algorithms (e.g., decision trees, support vector
machines) regarding performance, interpretability, and scalability. Give examples
where logistic regression is preferred over other methods.
b Given the positively labeled data points (4,1), (4,-1), (7,1), (7,-1) and the negatively 8 Ap CO2 PO3
labeled data points (2,0), (1,1), (1,-1), (0,0) in R^2, identify the support vectors and
find a simple SVM that accurately discriminates the positive and negative
classes.
(MAY – 2020)
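A hand-derived answer to the SVM question above can be checked by computing the functional margin y(w.x + b) of every point. The sketch below assumes a candidate separator w = (1, 0), b = -3 (the vertical line x1 = 3); that candidate and the helper names `functional_margins` and `support_vectors` are illustrative and are not given in the question. Points whose margin equals 1 are the support vectors of the candidate.

```python
# Candidate hard-margin separator (an assumption to be checked, not part of the question).
W = (1.0, 0.0)
B = -3.0

POSITIVE = [(4, 1), (4, -1), (7, 1), (7, -1)]
NEGATIVE = [(2, 0), (1, 1), (1, -1), (0, 0)]


def functional_margins(w, b, positive, negative):
    """Return {point: y * (w.x + b)} for every labelled point."""
    margins = {}
    for label, pts in ((+1, positive), (-1, negative)):
        for x in pts:
            margins[x] = label * (w[0] * x[0] + w[1] * x[1] + b)
    return margins


def support_vectors(margins, tol=1e-9):
    """Points lying exactly on the margin (functional margin equal to 1)."""
    return sorted(p for p, m in margins.items() if abs(m - 1.0) < tol)


margins = functional_margins(W, B, POSITIVE, NEGATIVE)
print(all(m >= 1.0 for m in margins.values()))  # every point is on or outside the margin
print(support_vectors(margins))
```

Every margin is at least 1, and only (2,0), (4,1) and (4,-1) meet it with equality, which is consistent with x1 = 3 being the maximum-margin line for this data.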
5 a Give decision trees to represent the following Boolean functions: 10 Ap CO2 PO2
(a) A ∧ ¬B
(b) A ∨ [B ∧ C]
(c) A XOR B
(d) [A ∧ B] ∨ [C ∧ D]
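A decision tree for a Boolean function can be read as nested tests, one attribute per node. The sketch below encodes one possible tree for (a) and one for (c) and compares each against its truth table. The function names are illustrative, and other attribute orderings give equally valid trees.

```python
from itertools import product


def tree_a_and_not_b(a, b):
    """Tree for A AND (NOT B): the root tests A; only its True branch tests B."""
    if not a:
        return False      # leaf: A is False, so the function is False
    return not b          # second level: test B


def tree_a_xor_b(a, b):
    """Tree for A XOR B: the root tests A; both branches must still test B."""
    if a:
        return not b
    return b


# Compare each tree with the Boolean expression on all four input rows.
for a, b in product([False, True], repeat=2):
    assert tree_a_and_not_b(a, b) == (a and not b)
    assert tree_a_xor_b(a, b) == (a != b)
print("both trees match their truth tables")
```

The XOR tree needs a test on B under both branches of A, which is why XOR cannot be represented with fewer than three internal nodes when each node tests a single attribute.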
b Discuss the advantages and limitations of Bayesian models compared to traditional 6 Un CO2 PO1
methods.
6 a A company manufactures an electronic device to be used in a very wide 8 Un CO2 PO2
temperature range. The company knows that increased temperature shortens the
lifetime of the device, and a study is therefore performed in which the lifetime is
determined as a function of temperature. Express the lifetime as a linear function of
temperature. The following data is found:
Temperature (C)    10   20   30   40   50   60   70   80   90
Lifetime (hours)   420  365  285  220  176  117  68   34   5
(MAY – 2022)
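The lifetime question above can be worked with ordinary least squares. The sketch below fits lifetime = intercept + slope * temperature to the nine tabulated points using the textbook formulas slope = Sxy / Sxx and intercept = mean(y) - slope * mean(x). The helper name `fit_line` is illustrative.

```python
temperature = [10, 20, 30, 40, 50, 60, 70, 80, 90]    # degrees C, from the table
lifetime = [420, 365, 285, 220, 176, 117, 68, 34, 5]  # hours, from the table


def fit_line(xs, ys):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    s_xy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    s_xx = sum((x - x_mean) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = y_mean - slope * x_mean
    return slope, intercept


slope, intercept = fit_line(temperature, lifetime)
print(f"lifetime = {intercept:.2f} + ({slope:.3f}) * temperature")
```

For this data Sxx = 6000 and Sxy = -31900, so the fitted line is roughly lifetime = 453.61 - 5.317 * temperature: each additional degree shortens the expected lifetime by a little over 5 hours.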
Person Height Weight
1 167 55
2 120 32
3 113 33
4 175 76
5 108 25
b Discuss the core principles underlying Support Vector Machines (SVM) in 8 Un CO2 PO1
machine learning and how they enable classification in high-dimensional spaces.
UNIT - III
2 a Discuss the role of feature selection and dimensionality reduction techniques in 10 Un CO3 PO1
preprocessing data for K-means clustering. How do these preprocessing steps
influence the clustering results and improve clustering accuracy?
b Explain the different types of linkages used to cluster the objects with the help of 6 Un CO3 PO1
a neat diagram.
3 a Consider a dataset with 5 objects and 2 features (height & weight) for each object. 10 Ap CO3 PO3
Assume k = 2 and cluster the objects based on distance metrics. Choose person 2
and person 3 as the two initial centroids.
5 a Consider a database shown below which has 5 transactions. Analyze the database 10 Ap CO3 PO3
given and find the transaction ID under each cluster using k-means algorithm. Use
K=2 and assume A and C are the initial means. Justify your reasoning at each step,
including how the centroids are updated and how the final clusters are formed.
Transaction ID X1 X2
A 1 1
B 1 0
C 0 2
D 2 4
E 3 5
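The k-means steps asked for above (assign each point to the nearest mean, recompute the means, repeat until nothing changes) can be traced with a short script. This is a minimal sketch assuming Euclidean distance, which the question does not specify; the function name `kmeans` is illustrative.

```python
import math

points = {"A": (1, 1), "B": (1, 0), "C": (0, 2), "D": (2, 4), "E": (3, 5)}


def kmeans(points, initial_centroids, max_iter=100):
    """Plain k-means: returns ({name: cluster index}, final centroids)."""
    centroids = list(initial_centroids)
    assignment = {}
    for _ in range(max_iter):
        # Assignment step: nearest centroid by Euclidean distance.
        new_assignment = {
            name: min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for name, p in points.items()
        }
        if new_assignment == assignment:
            break  # no point changed cluster, so the algorithm has converged
        assignment = new_assignment
        # Update step: each centroid becomes the mean of its members.
        for k in range(len(centroids)):
            members = [points[n] for n, c in assignment.items() if c == k]
            if members:
                centroids[k] = tuple(sum(v) / len(members) for v in zip(*members))
    return assignment, centroids


# Initial means are A and C, as stated in the question.
assignment, centroids = kmeans(points, [points["A"], points["C"]])
print(assignment)
print(centroids)
```

Under these assumptions the first pass groups {A, B} and {C, D, E}; after the means move to (1, 0.5) and about (1.67, 3.67), C switches cluster, and the run settles on {A, B, C} and {D, E} with means of about (0.67, 1) and (2.5, 4.5).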
b Discuss how the factor analysis model assesses goodness of fit. 6 Un CO3 PO1
6 a The feature vectors depicting the yearly rainfall and runoff rate for 5 years are 10 Ap CO3 PO2
given.
Year           1   2   3   4   5
Rainfall (cm)  10  15  13  90  5
Runoff (cm)    42  46  26  39  29
b Illustrate the hierarchy of clustering with the help of a dendrogram. 6 Un CO3 PO1
UNIT - IV
2 a The state transition diagram for a flea hopping around on a triangle is given. At 10 Ap CO4 PO2
each step it moves to one of the other two vertices at random. What is the
expected time taken for the flea to get from vertex 1 to vertex 2?
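The flea question can be approached with first-step analysis. The sketch below assumes "at random" means each of the other two vertices is chosen with probability 1/2. Writing h1 and h3 for the expected number of hops to reach vertex 2 from vertices 1 and 3, one hop gives h1 = 1 + (1/2) h3 and h3 = 1 + (1/2) h1, a 2x2 linear system solved here exactly with Cramer's rule.

```python
from fractions import Fraction

half = Fraction(1, 2)  # assumed probability of hopping to each of the other two vertices

# First-step equations, rearranged into A * [h1, h3] = b:
#    h1 - (1/2) h3 = 1
#   -(1/2) h1 + h3 = 1
a11, a12, b1 = Fraction(1), -half, Fraction(1)
a21, a22, b2 = -half, Fraction(1), Fraction(1)

det = a11 * a22 - a12 * a21       # determinant of A
h1 = (b1 * a22 - a12 * b2) / det  # expected hops from vertex 1 to vertex 2
h3 = (a11 * b2 - a21 * b1) / det  # expected hops from vertex 3 to vertex 2

print(h1, h3)
```

By symmetry h1 = h3, so the system collapses to h = 1 + h/2, giving an expected time of 2 hops.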
b Explain the relationship between a Markov Model and a Markov Chain. 6 Un CO4 PO1
3 a Describe how Bayesian Networks can be used to model complex probabilistic 8 Un CO4 PO2
relationships. Provide an example of a real-world application.
b Will an ensemble help to improve classification accuracy? What are the 8 Un CO4 PO1
benefits of an ensemble model? Explain.
4 a Discuss the importance of Markov Chains in the context of Markov Chain Monte 10 Ap CO4 PO2
Carlo (MCMC) methods. How do they ensure proper sampling?
b Discuss the advantages of using ensemble learning techniques over single models. 6 Un CO4 PO1
5 a Analyze the process and importance of tracking methods in sequential data 8 Un CO4 PO3
analysis. Compare and contrast how models like Hidden Markov Models (HMMs)
and Kalman filters are used for state estimation and prediction in dynamic systems.
Provide examples from areas such as signal processing or navigation, and discuss
the strengths and limitations of each method in these applications.
b Explain the AdaBoost algorithm, covering its key concepts, components, and the 8 Un CO4 PO1
overall algorithmic process.
6 a Describe the stacking method in ensemble learning. Explain how it works, and 8 Un CO4 PO1
provide examples of its application in improving model performance.
b What is bagging in the context of machine learning? How does bagging help to 8 Un CO4 PO1
improve the performance of machine learning models? Explain it in detail.
UNIT - V
2 a Explain the applications and real-world challenges of reinforcement learning in 10 Un CO5 PO1
robotics.
b Write the characteristics of a reinforcement learning problem. 6 Un CO5 PO1
3 a How does the reinforcement learning problem differ from other function approximation 10 Un CO5 PO2
tasks? Explain.
b Illustrate the temporal difference learning model. 6 Un CO5 PO1
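As one concrete instance of temporal difference learning, the tabular TD(0) rule moves the value of the current state towards the one-step target r + gamma * V(s'). The sketch below applies a single update; the state names, reward, step size `alpha` and discount `gamma` are made-up illustration values, not taken from the question.

```python
def td0_update(values, state, reward, next_state, alpha, gamma):
    """Apply one tabular TD(0) update in place and return the TD error."""
    td_target = reward + gamma * values[next_state]
    td_error = td_target - values[state]
    values[state] += alpha * td_error
    return td_error


# Hypothetical two-state value table.
values = {"s0": 0.0, "s1": 1.0}
delta = td0_update(values, "s0", reward=0.5, next_state="s1", alpha=0.1, gamma=0.9)
print(delta, values["s0"])
```

Here the target is 0.5 + 0.9 * 1.0 = 1.4, so the TD error is 1.4 and V(s0) moves one tenth of the way there, to 0.14. The estimate is updated from a single observed transition, without waiting for the end of the episode.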
4 a What is reinforcement learning? Explain the applications and real-world challenges 10 Un CO5 PO1
of reinforcement learning in healthcare applications.
b Discuss the different applications, challenges and limitations of policy search in 6 Un CO5 PO1
reinforcement learning.
5 a You are asked to solve the problem of identifying fake bloggers. A fake blogger is 10 Ap CO5 PO3
one who has posted at least 10% of fake blogs. As you are aware, blogs can be in
multiple languages and individual blogs can be in a mix of languages. You have
been given the identifiers of nearly 1000 bloggers and you have been told at least
20% of the bloggers post fake blogs. You decided to solve the problem using a
machine learning approach. Answer the following questions in this context:
1. What are the features that you would consider? Name them and justify their
relevance.
2. What approach would you use to extract these features? Discuss how you would store
them.
3. Determine the appropriate machine learning algorithm and construct a flow
diagram depicting how you would determine the fake bloggers.
4. Discuss the metrics used to verify your algorithm.
(May 2022)
b Describe the concept of discount factor (gamma) in reinforcement learning and its 6 Un CO5 PO1
influence on the agent's behaviour.
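The effect of the discount factor can be seen by computing the discounted return G = r0 + gamma * r1 + gamma^2 * r2 + ... for the same reward sequence under different values of gamma. The reward sequence below is a made-up example with a large delayed reward; the function name `discounted_return` is illustrative.

```python
def discounted_return(rewards, gamma):
    """Sum of rewards weighted by gamma ** t, where t is the time step."""
    return sum(r * gamma ** t for t, r in enumerate(rewards))


rewards = [1, 1, 1, 10]  # hypothetical: small immediate rewards, large delayed reward

for gamma in (0.0, 0.5, 1.0):
    print(gamma, discounted_return(rewards, gamma))
```

With gamma = 0 only the immediate reward counts (return 1), with gamma = 0.5 the delayed reward of 10 contributes just 1.25 (return 3), and with gamma = 1 all rewards count equally (return 13). A small gamma therefore makes the agent short-sighted, while a gamma close to 1 makes it value long-term reward.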
6 a List the differences between reinforcement learning, model-based learning 10 Un CO5 PO1
and temporal difference learning. (May 2022)
b Is it necessary to perform generalisation in reinforcement learning? Why? 6 Un CO5 PO1