Unit 1
Unit 1
1. KDD
2. 3 Schemas
3. Data warehouse implementation
4. OLAP vs OLTP
5. What kind of data to be mined
6. What kind of patterns mined/ Functionalities
7. Data Cube/Multi Dimensional Data model
8. Data warehouse architecture
9. OLAP Operations
10. technologies used for data mining?
11. CONCEPT HIERARCHIES
UNIT 2
1. Data Cleaning
2. Data Integration
3. Data Reduction
4. Data Transformation
5. Data Discretization
6. Problems on Nomalization(3 types)
7. Problems on partitioning
UNIT 3
1. Basics of classification
2. Decision tree induction Algorithm, gini index, information gain
3. Attribute selection measures
4. Tree Pruning
5. Scalability and Decision Tree Induction
6.
UNIT 4
1. Apriori Algorithm/problems
2. FP Growth Algorithm/problems
3. Rule Generation
UNIT 5
What are the categories of
major clustering methods?
Explain.
● Explain K-means Clustering
Algorithm with diagram?
● Explain K- Medoids method
with an exampl
1. Categories of clustering
2. K-means clustering
3.K-medoidsclustering
4.Limitations of K-means
5. Bisecting K-Means