Module -01 Machine Learning(BCS602)
Module -01 Machine Learning(BCS602)
Module-1
Chapter – 01 - Introduction
Knowledge Pyramid:
o Machine learning helps businesses process large amounts of data to make informed
decisions.
o It aids in designing new products, improving business processes, and developing
effective decision support systems.
Arthur Samuel's Definition: "Machine learning gives computers the ability to learn
without being explicitly programmed."
Focus: Systems learn autonomously by analyzing data without manual
programming.
Conventional Programming:
Limitations:
o Forms of models:
1. Mathematical equations.
2. Relational diagrams (e.g., trees, graphs).
3. Logical rules (if/else).
4. Clusters (groupings).
Example:
Human Analogy:
Machine Analogy:
o Machines "learn" from data, generalize patterns, and form rules (heuristics) to solve
tasks.
Goal: Learn the function f from data to accurately predict outcomes for new inputs.
Limitations of Heuristics:
Machine learning is a sub-branch of AI, which aims to develop intelligent agents (e.g.,
robots, humans, or autonomous systems).
AI originally focused on logic and reasoning but evolved into data-driven systems for
finding relationships in data.
Machine learning focuses on extracting patterns for prediction, while deep learning, a
sub-branch of machine learning, uses neural networks modeled on human neurons.
Machine learning starts with data and focuses on analyzing patterns for prediction.
Data Mining: Unearths hidden patterns in data (similar to machine learning but
focuses on pattern discovery).
Data Analytics: Extracts useful knowledge from raw data. Predictive data analytics is
closely related to machine learning.
Pattern Recognition:
An engineering field that uses machine learning algorithms for feature extraction,
pattern analysis, and classification.
Statistics:
Requires a strong theoretical background and often involves complex equations and
assumptions.
Machine Learning:
Machine learning bridges AI, data science, statistics, and pattern recognition.
Deep learning, big data, and data analytics significantly influence its development.
Machine learning involves learning through the interaction of a program with its
environment.
The types of learning are categorized based on the data used and the supervision
provided during the learning process.
Labelled Data:
For example, the Iris dataset includes features (e.g., sepal length, petal width) with a
target label (class of Iris flower).
Unlabelled Data:
Supervised Learning
Involves a teacher or supervisor providing labelled data for training and testing.
Classification:
Algorithms:
Decision Trees
Random Forest
Support Vector Machines (SVM)
Naïve Bayes
Artificial Neural Networks (e.g., CNNs for deep learning)
Classification models:
Regression:
Algorithm:
Unsupervised Learning
Methods:
Cluster Analysis:
Example:
Algorithms:
k-means algorithm
Hierarchical algorithms
Dimensionality Reduction:
Semi-Supervised Learning
Combines labelled and unlabelled data, leveraging small labelled datasets and larger
unlabelled datasets.
Dimensionality Reduction
The goal is to simplify the dataset by reducing the number of features without losing its
essential information.
Applications:
Principal Component Analysis (PCA): Projects data into components that explain
the maximum variance.
Linear Discriminant Analysis (LDA): Finds the linear combinations of features that
best separate classes.
Semi-Supervised Learning
Process:
The pseudo-labelled data is combined with the labelled data to train the model.
Applications:
Reinforcement Learning
An agent interacts with the environment, perceives its state, takes actions, and receives
rewards or penalties.
The agent aims to maximize cumulative rewards by learning the best actions to take
over time.
Key Concepts:
No labelled data: The agent learns through interaction instead of relying on labelled
datasets.
Sequential decision-making: The agent must take multiple steps to achieve the
goal.
Machine learning, while powerful, faces several challenges that stem from its reliance
on data quality, computational resources, and algorithmic efficiency.
Quality of Data
The growth of Big Data and complex machine learning tasks (e.g., deep learning)
demands significant computational resources:
Time Complexity: Training sophisticated models can take hours, days, or even weeks.
Algorithmic Complexity
Bias/Variance Tradeoff
Bias: Occurs when the model oversimplifies the problem, leading to underfitting.
Variance: Occurs when the model overfits the training data, failing to generalize to
new data.
Striking the right balance between bias and variance—called the bias-variance
tradeoff—is a persistent challenge.
Solution:
Ensemble methods like bagging (e.g., Random Forests) and boosting (e.g., XGBoost)
improve generalization.
Many machine learning models, particularly deep learning, are considered black
boxes, making it difficult to explain their predictions or decisions.
Solution:
Employ explainability tools like SHAP (SHapley Additive exPlanations) or LIME (Local
Interpretable Model-agnostic Explanations).
Biases present in training data can lead to biased models, perpetuating unfair decisions
(e.g., in hiring, lending, or law enforcement).
Scalability
Scaling machine learning models to work with massive datasets across distributed
systems is a technical challenge.
Solution: Use scalable tools and frameworks like Apache Spark, Dask, or cloud-based
machine learning services.
It is widely adopted because of its versatility across industries. Below are the six steps
of the CRISP-DM process:
Objective: Grasp the business goals and define the problem statement.
Activities:
Objective: Explore and understand the data available for solving the problem.
Activities:
Data Preparation
Activities:
Importance: Data quality directly impacts the performance of the machine learning
model.
Modeling
Activities:
Evaluation
Objective: Assess the performance of the model using statistical and visualization
techniques.
Activities:
Challenges: Tasks like email classification may require expert domain knowledge for
proper evaluation.
Deployment
Activities:
Sentiment Analysis
Use Case: Natural Language Processing (NLP) techniques analyze text to determine
sentiments (e.g., happy, sad, angry).
Example: Movie and product reviews can be analyzed to assign ratings (e.g., stars)
automatically.
Recommendation Systems
Example:
Voice Assistants
Use Case: Use speech recognition and natural language understanding to perform
tasks.
Navigation Systems
Use Case: Optimize routes and reduce travel time using machine learning-based
algorithms.