Midterm Exam (CSE 321 (Data Mining and Machine Learning), A (Day), Fall 2021)
Midterm Exam (CSE 321 (Data Mining and Machine Learning), A (Day), Fall 2021)
Answer all of the following questions. Figures in the right-hand margin indicate full marks.
1. The COVID-19 pandemic has been causing huge losses of life and sufferings around the
world since the beginning of this year. Although, its intensity has diminished to some
extent, its outbreak is still carrying on. To live a healthy and safe life during this pandemic
period, World Health Organization (WHO) and some other organizations/researchers
have found out the factors causing the infection of COVID-19. A data set has been
collected based on some prominent factors, which is as follows:
Page No. 1 of 2
Now, answer the following questions based on the above data set:
a) Write your plan to apply a suitable data mining technique for the given problem and 5
justify your answer.
b) Prepare a Bayesian classification model to classify the following test record X: 10
X = (51, Male, Medium, Low).
a) Can regression be directly applied on the data set in Question 1? Justify your answer. 1
b) What is the necessity of using scaling in k-NN? What can be the minimum value of k? 1
g) Can sin x be a good choice of the function for attribute transformation? Why? 1
h) According to the definition, what can the possible values of the pth percentile? 1
i) If some researchers want to find the major causes of smoking in the context of 1
Bangladesh based of data, which type of algorithms of different data mining tasks is
suitable here?
j) Given: 1
– If a patient has toothache, the probability that he/she has cavity is 50%.
Page No. 2 of 2