
INTRODUCTION & OVERVIEW:

Machine Learning is a subset of Artificial Intelligence that enables
systems to learn and improve from experience without being
explicitly programmed. It's about finding patterns in data and using
those patterns to make predictions or decisions.
An Illustrative Learning Task: Email Spam Filtering
Let's consider a common ML problem: email spam filtering. The goal
is to classify emails as either "spam" or "not spam".
Data:
✓ A dataset of emails, each labeled as "spam" or "not spam".
✓ Each email can be represented as a set of features, such as:
• The presence of certain words (e.g., "free", "money", "urgent")
• The length of the subject line
• The email sender
• The presence of links
• The use of exclamation marks
Learning Task:
The machine learning model needs to learn to identify patterns in
these features that correlate with spam emails. Once trained, it should
be able to classify new, unseen emails as spam or not spam.
Approaches to the Problem
1. Supervised Learning
➢ Approach: The model is trained on a dataset where each email is
labeled as spam or not spam. The algorithm learns to map input
features (email characteristics) to output labels (spam or not
spam).
➢ Algorithms:
• Naive Bayes: Assumes features are conditionally independent given the class.
• Support Vector Machines (SVM): Finds the best hyperplane to
separate spam and non-spam emails.
• Decision Trees: Creates a tree-like model of decisions and their
possible consequences.
• Random Forest: An ensemble of decision trees.
• Logistic Regression: Estimates the probability of an email being
spam.
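As a concrete illustration of the supervised approach, here is a minimal
Python sketch that trains a Naive Bayes spam classifier with scikit-learn.
The four emails and their labels are invented for illustration only.

# Train a Naive Bayes spam classifier on a tiny hand-labeled dataset.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

emails = [
    "free money urgent click now",          # spam
    "meeting rescheduled to friday",        # not spam
    "win a free prize, act now",            # spam
    "project report attached for review",   # not spam
]
labels = [1, 0, 1, 0]  # 1 = spam, 0 = not spam

# Turn raw text into word-count features (bag of words).
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(emails)

# Naive Bayes treats the word counts as independent given the label.
model = MultinomialNB()
model.fit(X, labels)

# Classify a new, unseen email.
new_email = vectorizer.transform(["urgent: claim your free money"])
print(model.predict(new_email))        # expect [1] (spam)
print(model.predict_proba(new_email))  # class probabilities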
2. Unsupervised Learning
➢ Approach: The model is trained on unlabeled data and tries to
find patterns or clusters without explicit guidance. While not
directly applicable to spam filtering here (since we have labeled
data), it can be useful for pre-processing or feature engineering.
➢ Algorithms:
• Clustering: Groups similar emails together (e.g., clustering by
sender, subject length).
• Dimensionality Reduction: Reduces the number of features in
the data.
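A minimal sketch of the unsupervised idea: clustering emails by invented
numeric features (subject length, link count, exclamation marks) with
scikit-learn's KMeans, using no labels at all.

# Group emails into clusters based on feature similarity alone.
import numpy as np
from sklearn.cluster import KMeans

# Each row is one email: [subject length, links, exclamation marks].
X = np.array([
    [12, 0, 0],
    [15, 1, 0],
    [45, 5, 8],
    [50, 6, 7],
])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
print(kmeans.fit_predict(X))  # e.g. [0 0 1 1]: two groups emerge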
3. Reinforcement Learning
➢ Approach: The model learns by interacting with an environment
and receiving rewards or penalties for its actions. While not a
typical approach for spam filtering, it could be used in more
dynamic scenarios, like adaptive spam filtering based on user
feedback.
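As a toy illustration of the reward-and-penalty idea (not a standard
spam-filtering technique), the sketch below adapts a spam-score threshold
from user feedback; all names and numbers are invented.

# Nudge the spam threshold whenever user feedback penalizes a mistake.
threshold = 0.5
learning_rate = 0.05

def update_threshold(spam_score, user_says_spam):
    """Raise the threshold after a false positive, lower it after a miss."""
    global threshold
    flagged = spam_score >= threshold
    if flagged and not user_says_spam:    # penalty: good mail was blocked
        threshold += learning_rate
    elif not flagged and user_says_spam:  # penalty: spam got through
        threshold -= learning_rate

update_threshold(spam_score=0.55, user_says_spam=False)
print(threshold)  # 0.55: threshold moved up after the false positive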
Key Considerations
• Data Quality: The quality of the training data is crucial for
model performance.
• Feature Engineering: Creating relevant features from raw
data can significantly impact model accuracy.
• Model Evaluation: Metrics like accuracy, precision, recall,
and F1-score are used to assess model performance (see the
sketch after this list).
• Overfitting: The model should generalize well to new, unseen
data rather than memorizing the training data.
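A minimal sketch of the four evaluation metrics above, computed with
sklearn.metrics on invented true and predicted labels.

# Compare a model's predictions against the actual labels.
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 0, 1, 0]  # actual labels (1 = spam)
y_pred = [1, 0, 0, 1, 0, 1, 1, 1]  # model's predictions

print("accuracy :", accuracy_score(y_true, y_pred))   # fraction correct overall
print("precision:", precision_score(y_true, y_pred))  # of flagged, how many were spam
print("recall   :", recall_score(y_true, y_pred))     # of spam, how many were caught
print("F1-score :", f1_score(y_true, y_pred))         # harmonic mean of the two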

Algorithms:
Algorithms are the heart of machine learning. They are the
computational procedures that enable computers to learn from data,
identify patterns, and make predictions or decisions without
explicit programming.
Theory
➢ Computational Complexity: Understanding the theoretical limits
of algorithms helps in selecting appropriate algorithms for
different problem sizes and computational resources.
➢ Optimization: Optimization theories provide the foundation for
training machine learning models efficiently, finding optimal
parameters, and minimizing errors.
➢ Probability and Statistics: These theoretical frameworks
underpin many machine learning algorithms, from Bayesian
methods to hypothesis testing.
Experiment
➢ Algorithm Design and Evaluation: Experimentation with
different algorithms on various datasets leads to the
development of new algorithms and improvements in existing
ones.
➢ Hyperparameter Tuning: Experimental approaches such as grid
search are used to find hyperparameters that maximize model
performance (see the sketch after this list).
➢ Model Selection: Experimentation helps in selecting the best
model for a specific problem based on performance metrics.
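A minimal sketch of experimental hyperparameter tuning, using
scikit-learn's GridSearchCV with an SVM on a synthetic dataset.

# Try every hyperparameter combination and keep the best performer.
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, random_state=0)

# Candidate hyperparameters, evaluated with 5-fold cross-validation.
param_grid = {"C": [0.1, 1, 10], "kernel": ["linear", "rbf"]}
search = GridSearchCV(SVC(), param_grid, cv=5)
search.fit(X, y)

print(search.best_params_)  # e.g. {'C': 1, 'kernel': 'rbf'}
print(search.best_score_)   # mean cross-validated accuracy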
Biology
➢ Neural Networks: Inspired by the human brain, artificial neural
networks have been a cornerstone of deep learning.
➢ Evolutionary Algorithms: Genetic algorithms and other
evolutionary approaches are used for optimization and feature
selection in machine learning.
➢ Biologically Inspired Algorithms: Techniques like swarm
intelligence (inspired by ant colonies) and particle swarm
optimization find applications in machine learning.
Psychology
➢ Cognitive Science: Understanding human cognition helps in
designing algorithms that mimic human decision-making
processes, such as reinforcement learning.
➢ Human-Computer Interaction: Insights from psychology are
used to create user-friendly machine learning interfaces and
explainable AI models.
➢ Behavioral Economics: Understanding human biases and
decision-making under uncertainty contributes to developing
robust machine learning models.

Linear Regression:
Linear regression is a statistical method used to model the relationship
between a dependent variable (target variable) and one or more
independent variables (predictor variables). It assumes a linear
relationship between the variables.
Simple Linear Regression: Involves one independent variable.
Multiple Linear Regression: Involves multiple independent variables.
The linear regression model can be represented as:
y = b0 + b1*x1 + b2*x2 + ... + bn*xn + ε
Where:
* y is the dependent variable
* b0 is the intercept
* b1, b2, ..., bn are the coefficients for the independent variables x1,
x2, ..., xn
* ε is the error term
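In practice, the coefficients are typically estimated by ordinary least
squares. Below is a minimal sketch of simple linear regression with NumPy;
the house-price numbers are invented for illustration.

# Fit y = b0 + b1*x by least squares and predict a new value.
import numpy as np

square_feet = np.array([1000, 1500, 2000, 2500, 3000])       # x
price = np.array([200000, 270000, 340000, 410000, 480000])   # y

# np.polyfit(x, y, 1) returns the slope (b1) and intercept (b0)
# of the least-squares line.
b1, b0 = np.polyfit(square_feet, price, 1)
print("intercept b0:", b0)
print("slope b1    :", b1)

# Predict the price of a new 1,800 sq ft house.
print(b0 + b1 * 1800)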
Examples of Linear Regression
Simple Linear Regression:
➢ Predicting house prices based on square footage:
i. Dependent variable: House price
ii. Independent variable: Square footage
➢ Predicting student grades based on study hours:
i. Dependent variable: Grades
ii. Independent variable: Study hours
Multiple Linear Regression:
➢ Predicting car prices based on mileage, age, and horsepower:
i. Dependent variable: Car price
ii. Independent variables: Mileage, age, horsepower
➢ Predicting sales based on advertising expenditure, price, and
competition:
i. Dependent variable: Sales
ii. Independent variables: Advertising expenditure, price,
competition
Applications of Linear Regression
Linear regression is widely used in various fields, including:
• Finance: Predicting stock prices, portfolio returns
• Economics: Forecasting GDP, inflation
• Marketing: Predicting sales, customer churn
• Healthcare: Modeling disease progression, predicting patient
outcomes
• Social Sciences: Analyzing relationships between social factors
Key Considerations
➢ Assumptions: Linear regression makes certain assumptions about
the data, such as linearity, independence, normality, and
homoscedasticity. Violating these assumptions can affect the
model's accuracy.
➢ Outliers: Outliers can significantly impact the regression line. It's
essential to identify and handle outliers appropriately.
➢ Multicollinearity: When independent variables are highly
correlated, it can affect the model's stability and interpretation.

Multiple Regression
Multiple regression is an extension of simple linear regression
that involves more than one independent variable to predict a
continuous dependent variable.
Example:
• Predicting house prices based on multiple factors like square
footage, number of bedrooms, location, age, etc.
Model:
y = b0 + b1*x1 + b2*x2 + ... + bn*xn + ε
Where:
• y: Dependent variable (house price)
• x1, x2, ..., xn: Independent variables (square footage, number of
bedrooms, etc.)
• b0: Intercept
• b1, b2, ..., bn: Coefficients
• ε: Error term
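A minimal sketch of multiple regression with scikit-learn; the house data
and features are invented for illustration.

# Fit one coefficient per independent variable, plus an intercept.
import numpy as np
from sklearn.linear_model import LinearRegression

# Each row is one house: [square footage, bedrooms, age in years].
X = np.array([
    [1400, 3, 20],
    [1800, 4, 15],
    [2400, 4, 5],
    [3000, 5, 1],
])
y = np.array([240000, 305000, 420000, 540000])  # sale prices

model = LinearRegression()
model.fit(X, y)

print(model.intercept_)  # b0
print(model.coef_)       # b1, b2, b3 (one per independent variable)

# Predict the price of a 2,000 sq ft, 3-bedroom, 10-year-old house.
print(model.predict([[2000, 3, 10]]))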
Logistic Regression
Logistic regression is used for classification problems where the
dependent variable is categorical (usually binary, like yes/no or 0/1). It
estimates the probability of an event occurring.
Example:
* Predicting whether a customer will churn (leave a company) based
on factors like tenure, contract type, usage, etc.
Model:
* Uses a logistic function to map linear combinations of predictors to
probabilities.
* Output is a probability between 0 and 1.
* A threshold (often 0.5) is used to classify instances.
Logistic Function
The logistic function (or sigmoid function) is used in logistic
regression to map any real number to a value between 0 and 1.
Formula:
* f(x) = 1 / (1 + e^(-x))
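A minimal sketch of the logistic function in Python, and of how logistic
regression turns a linear score into a probability; the churn coefficients
below are invented for illustration.

# The logistic (sigmoid) function squashes any real number into (0, 1).
import math

def logistic(x):
    """f(x) = 1 / (1 + e^(-x))."""
    return 1.0 / (1.0 + math.exp(-x))

print(logistic(-4))  # ~0.018 -> very likely the negative class
print(logistic(0))   # 0.5    -> the decision boundary
print(logistic(4))   # ~0.982 -> very likely the positive class

# In logistic regression, z = b0 + b1*x1 + ... + bn*xn, and logistic(z)
# is the predicted probability of the positive class (here, churn).
b0, b1 = 2.0, -0.08          # invented coefficients
tenure_months = 30
p_churn = logistic(b0 + b1 * tenure_months)
print(p_churn)               # ~0.40
print(p_churn >= 0.5)        # apply the 0.5 threshold: False (no churn)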

Graph: the logistic function traces an S-shaped (sigmoid) curve that rises from 0 toward 1, crossing 0.5 at x = 0.
Example:
* In logistic regression, the output of the linear combination of
predictors is passed through the logistic function to obtain the
probability of the positive class.
Key Differences
* Multiple regression predicts a continuous value, while logistic
regression predicts a probability.
* Multiple regression uses a linear equation, while logistic regression
uses a logistic function.
