Introduction to Data Mining
Unit-7
SCOPE OF DATA MINING
SAMPLING
• Sampling is a technique used to select a subset of data for analysis.
• Working on a representative sample reduces computational cost and speeds up analysis of large datasets.
• Common sampling methods:
    • Simple random sampling
    • Stratified sampling
    • Systematic sampling
    • Cluster sampling
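
The four sampling methods listed above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a reference implementation: the function names, the `fraction` and `step` parameters, and the example records are all assumptions made for this sketch.

```python
import random


def simple_random_sample(data, k, seed=0):
    """Pick k items so that every item has the same chance of selection."""
    rng = random.Random(seed)
    return rng.sample(data, k)


def systematic_sample(data, step, start=0):
    """Take every step-th item, beginning at index start."""
    return data[start::step]


def stratified_sample(data, key, fraction, seed=0):
    """Sample the same fraction from each stratum (group) defined by key."""
    rng = random.Random(seed)
    strata = {}
    for item in data:
        strata.setdefault(key(item), []).append(item)
    sample = []
    for members in strata.values():
        # Keep at least one item per stratum so small groups are not lost.
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))
    return sample


def cluster_sample(clusters, n_clusters, seed=0):
    """Pick whole clusters at random and keep every item inside them."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(clusters), n_clusters)
    return [item for name in chosen for item in clusters[name]]


# Hypothetical data: 80 records labelled "a" and 20 labelled "b".
records = [("a", i) for i in range(80)] + [("b", i) for i in range(20)]
subset = stratified_sample(records, key=lambda r: r[0], fraction=0.1)
print(len(subset))  # 10: eight "a" records and two "b" records
```

Stratified sampling preserves the 80/20 class proportions in the subset, which simple random sampling only does on average.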
DATA VISUALIZATION
• Accuracy
• Precision and Recall
• F1-score
• Confusion matrix
• ROC curve and AUC
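
As a rough illustration of how the metrics listed above relate to each other, the sketch below derives them from the four confusion-matrix counts for a binary problem. The function names, the choice of `1` as the positive label, and the example labels and scores are assumptions made for this example. AUC is computed here with the pairwise-ranking formulation (the fraction of positive/negative pairs in which the positive instance receives the higher score), which is one of several equivalent ways to obtain it.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (TP, FP, FN, TN) for a binary classification problem."""
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive and actual == positive:
            tp += 1
        elif predicted == positive:
            fp += 1
        elif actual == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 from the confusion counts."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


def auc_from_scores(y_true, scores, positive=1):
    """AUC as the share of (positive, negative) pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    # A tie between a positive and a negative score counts as half a win.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical labels: 3 TP, 1 FP, 1 FN, 3 TN.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]
print(classification_metrics(y_true, y_pred))
# {'accuracy': 0.75, 'precision': 0.75, 'recall': 0.75, 'f1': 0.75}
```

Accuracy alone can be misleading on imbalanced data, which is why precision, recall and F1 are usually reported alongside it.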
ACCURACY
• Once trained, the classification model is used to predict new, unseen data.
• The model assigns a label to each new data instance based on learned patterns.
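
The train-then-predict workflow described above can be illustrated with a deliberately simple model. The sketch below uses a nearest-centroid classifier, chosen here only because it fits in a few lines; it is not a technique named in the slides, and the function names and training data are hypothetical.

```python
import math


def fit_centroids(X, y):
    """Training step: learn one mean feature vector (centroid) per label."""
    sums, counts = {}, {}
    for features, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc]
            for label, acc in sums.items()}


def predict(centroids, features):
    """Prediction step: assign the label of the closest learned centroid."""
    return min(centroids,
               key=lambda label: math.dist(centroids[label], features))


# Hypothetical training data: two features per instance, two classes.
X_train = [[1, 1], [1, 2], [8, 8], [9, 8]]
y_train = ["low", "low", "high", "high"]
model = fit_centroids(X_train, y_train)

# New, unseen instances are labelled using only what was learned in training.
print(predict(model, [2, 1]))  # low
print(predict(model, [7, 9]))  # high
```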
CLASSIFICATION TECHNIQUES