Assignment DMW
Assignment DMW
UNIT 1
Q.1 (a). Define data mining and explain its significance in extracting valuable information from
large datasets.
(b). Enumerate and briefly describe at least three key functionalities of data mining. Provide an
example for each functionality.
Q.2 (a). Outline the steps involved in the data mining process. Explain the importance of each
step in discovering hidden patterns and knowledge from data.
(b). Differentiate between data cleaning and data transformation in the context of the data mining
process. Why are these steps crucial for successful data mining?
Q.3 (a). Classify data mining systems based on their functionalities. Provide a detailed
explanation of at least two types of data mining systems.
(b). Discuss the advantages and limitations of each type of data mining system you
mentioned in part (a).
Q.4 (a). Identify and explain three major issues or challenges faced in the field of data mining.
How do these issues impact the accuracy and reliability of the mining results?
(b). Propose potential solutions or strategies to address one of the issues identified in part (a).
Q.5 (a). Provide an overview of data preprocessing and its significance in the context of data
mining.
(b). Explain the concepts of data cleaning, data transformation, and data discretization in the
data preprocessing phase. How do these techniques contribute to improving the quality of data
for mining purposes?
UNIT 2
Q.1 (a). Explain the general approach to classification in predictive modeling. Highlight the key
steps involved and discuss why classification is a fundamental task in data analysis.
(b). Provide an example scenario where classification is applicable, and describe how it can be
beneficial in making predictions or decisions.
Q.2 (a). Define decision tree induction and discuss its role in predictive modeling. Explain how
decision trees are constructed and how they make predictions
(b). Compare and contrast the advantages and disadvantages of decision tree induction with other
classification techniques.
Q.3 (a). Explain the fundamental principles of Bayes classification methods. Discuss how
probability theory is applied in these methods to make predictions.
(b). Provide an example of a real-world application where Bayes classification methods could be
effectively used. Discuss the assumptions and considerations associated with these methods.
Q.4 (a). Describe the concepts of Bayesian Belief Networks in the context of predictive
modeling. Discuss the advantages and potential challenges associated with using Bayesian Belief
Networks for classification.
(b). Explain the back propagation algorithm in the context of neural networks for classification.
Discuss how back propagation works and its role in training neural networks.
(a). Define Support Vector Machines (SVM) and explain how they are used in predictive
modeling. Discuss the concept of hyper planes and the importance of kernel functions in SVM.
(b). Discuss the characteristics of lazy learners in the context of classification. Provide examples
of lazy learners and explain when they might be preferred over other classification methods.