Machine Learning
Machine Learning
Underfitting:
• Definition: Model is too simplistic, fails to capture data complexity.
• Implications: Poor performance on both training and test data.
• Strategies: Increase Model Complexity, Feature Engineering, Ensemble
Methods, Collect More Data.
Visualizations:
• Overfitting: Complex model fits training data too closely, includes noise.
• Underfitting: Simple model fails to capture underlying trend of data.
Overfitting and Underfitting
Correctness and Evaluation Metrics
• Accuracy
• Precision
• Recall (Sensitivity)
• Specificity
• F1 Score
Bias-Variance Trade-off
Bias:
Implications: High bias models tend to be too simplistic and may overlook
important patterns in the data. They often result in underfitting, where the
model fails to capture the underlying structure of the data.
Variance:
Implications: High variance models are overly complex and may capture
noise or random fluctuations in the training data. They often result in
overfitting, where the model performs well on the training data but fails to
generalize to unseen data.
Bias-Variance Trade-off
Example: Visual Representation with Dart Analogy
Feature Extraction and Selection
Feature Extraction:
• Feature extraction involves transforming raw data into a
set of features that are more informative and suitable for
modeling. It aims to reduce the dimensionality of the data
while preserving relevant information.
Feature Selection:
• Feature selection involves choosing a subset of the most
relevant features from the original set of features. It aims
to eliminate irrelevant, redundant, or noisy features that
may negatively impact model performance.
Importance of Feature Extraction
and Selection:
Improved Model •By extracting relevant features or selecting informative features,
machine learning models can focus on the most important aspects of the
Performance: data, leading to better predictive performance.
Computational •By reducing the dimensionality of the feature space, feature extraction
and selection techniques can significantly reduce the computational
Efficiency: resources required for model training and inference.
Why Machine
Learning is Popular?
Big Data
Factors Contributing
to the Popularity of
Machine Learning
Development of
Computational
Better
Resources
Algorithms