Supervised_Learning_Notes
Supervised_Learning_Notes
A: Supervised learning is a type of machine learning where the model learns from labeled data
A: Email spam detection, fraud detection, medical diagnosis, and credit scoring.
A: Supervised uses labeled data to predict outcomes; unsupervised finds patterns in unlabeled data.
A: Classification (e.g., spam or not) and regression (e.g., predicting house prices).
Intermediate Level
A: Linear Regression, Logistic Regression, Decision Trees, Random Forest, KNN, SVM.
A: Overfitting is when a model learns the noise of training data instead of the pattern, performing
A: By using cross-validation, regularization, pruning (in trees), early stopping, and gathering more
data.
Q: What is cross-validation?
A: Cross-validation is a technique to split data into training and validation sets multiple times to
Advanced Level
A: It is the balance between a model's complexity and its ability to generalize. High bias underfits,
A: ROC curve shows the trade-off between sensitivity and specificity; AUC measures the area under
this curve.
A: Bagging combines predictions from multiple models to reduce variance; boosting combines weak
A: Regularization adds a penalty to the loss function to prevent overfitting (L1: Lasso, L2: Ridge).
A: Use techniques like oversampling the minority class, undersampling the majority class, or using