0% found this document useful (0 votes)

31 views21 pages

Fundamentals of ML Recap

Uploaded by

kayforts

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views21 pages

Fundamentals of ML Recap

Uploaded by

kayforts

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

IDEAS Emerging Technology

Skills Scholarship Program

RECAP OF FUNDAMENTALS OF
MACHINE LEARNING
Presented by: Khadijah Saad Mohammed
Content
Recap
Additional Code
We'll review the key concepts and
Recap techniques we've covered in this module,
ensuring you have a solid foundation in
machine learning.
Introduction
Types of Machine Learning
Common Algorithms
Preparing Data and Engineering Features
Evaluating Model Performance
Machine Learning

Machine learning is a subset of artificial

intelligence that involves training algorithms to
predict outcomes, identify patterns, and make
decisions based on data.

CULMINATECH
Types of Machine Learning

Supervised Learning: Where the model learns from labeled data.

Unsupervised Learning: Where the model identifies patterns in
unlabeled data.
Reinforcement Learning: Where an agent learns to make decisions by
performing actions and receiving rewards.

CULMINATECH
Types of Machine Learning
Supervised Learning
Where the model learns from labeled data.
In supervised learning, the model uses input-output pairs to
learn a function that can predict outcomes for new data.
Classification: Predicting categorical labels (e.g., spam
or not spam). Apple Banana

Regression: Predicting continuous values (e.g., price of a

house).
.

CULMINATECH
Types of Machine Learning
Unsupervised Learning

Unsupervised learning involves models that infer patterns

. The model has to make sense of the
from unlabeled data without reference to known or labeled
data on its own, finding patterns and
outcomes. structures that we might not
Clustering: Grouping a set of objects in such a way immediately see.
that objects in the same group are more similar to
each other than to those in other groups.
Association: Discovering rules that describe
portions of your data, such as people that buy X also
tend to buy Y.

CULMINATECH
Types of Machine Learning

Reinforcement Learning
:Where an agent learns to make decisions by performing actions and receiving rewards.

CULMINATECH
Preparing Data and Engineering Features

Data Pre-processing
Cleaning: Removing errors and inconsistencies from the data.
Transformation: Scaling or normalizing features to ensure uniformity.
Encoding: Converting categorical variables into numerical
representations.
Feature Engineering: Creating new features or transforming existing
ones to improve model performance.

CULMINATECH
Model Evaluation
Why Evaluate a Model?
After training a model, you need to know how well it predicts
new data

Basic Metrics In classification, the data is split

For Classifying Data (e.g., spam or not spam): Look at into training sets for learning and
Accuracy (how many predictions were correct). testing sets for evaluating model
For Predicting Values (e.g., house prices): Use MSE (Mean performance
Squared Error), which tells you how far off your predictions
are on average.
Model Evaluation
Evaluating a model's accuracy and performance is crucial for verifying its effectiveness.
Key Metrics
For Classification: Accuracy, Precision, Recall, and F1 Score.
For Regression: Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and
Mean Absolute Error (MAE).
Validation Techniques
Train/Test Split
Cross-Validation: Using parts of the data to train and validate the model to ensure
reliability.
Overfitting and Underfitting
What are These?
Overfitting: Imagine memorizing answers for a test without understanding the
concepts. You might fail on different questions. Overfitting is similar; the model
performs well on training data but poorly on unseen data.
Underfitting: This is like not studying enough. Underfitting means the model is
too simple to learn the underlying pattern of the data.
Overfitting and Underfitting
Overfitting: Occurs when a model learns the training data too well, including the
noise, which hampers its performance on new data.
Underfitting: Occurs when a model is too simple to capture the underlying data
patterns, resulting in poor performance on both training and new data.
Solutions:
Regularization: Techniques like L1 (Lasso) and L2 (Ridge) that help reduce
overfitting by penalizing large coefficients.
Pruning: Used in decision trees to reduce the size of the tree and improve model
simplicity.
Regularization Techniques
What is Regularization?

Think of regularization as a way to prevent your model from studying "too hard" and just
memorizing data. It gently nudges the model to be more general.
Best Practices in Machine Learning
Data Quality: High-quality data is critical for building robust models.
Algorithm Selection: There is no one-size-fits-all algorithm. Testing different algorithms based
on the problem context is essential.
MORE CODING
The next slides contain snippets of more coding examples.
from sklearn.linear_model import LogisticRegression
model = LogisticRegression()
model.fit(X_train, y_train)

predictions = model.predict(X_test)
from sklearn.metrics import accuracy_score
print("Accuracy:", accuracy_score(y_test, predictions))
More Data Processing
# Create a new column 'is_child' to indicate whether the passenger is a child
df['is_child'] = (df['Age'] < 18).astype(int)

# Fill missing values

df['Age'].fillna(df['Age'].median(), inplace=True)
df['Embarked'].fillna(df['Embarked'].mode()[0], inplace=True)
# Extract title from the Name column
df['Title'] = df['Name'].str.extract(' ([A-Za-z]+)\.', expand=False)

# Convert categorical variables using one-hot encoding

df = pd.get_dummies(df, columns=['Sex', 'Embarked', 'Title'], drop_first=True)

# Drop columns that are not needed

df.drop(['PassengerId', 'Name', 'Ticket', 'Cabin'], axis=1, inplace=True)
# Creating a new feature 'FamilySize'
titanic['FamilySize'] = titanic['SibSp'] + titanic['Parch'] + 1
Other Algorithms
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
Accuracy with other
# Logistic Regression
log_reg = LogisticRegression()
Algorithms
log_reg.fit(X_train, y_train)
log_reg_preds = log_reg.predict(X_test)
print("Logistic Regression Accuracy:", accuracy_score(y_test, log_reg_preds))

# Random Forest Classifier

rf_clf = RandomForestClassifier()
rf_clf.fit(X_train, y_train) # no need to scale data for tree-based models
rf_preds = rf_clf.predict(X_test)
print("Random Forest Accuracy:", accuracy_score(y_test, rf_preds))

# Support Vector Machine

svm = SVC()
svm.fit(X_train, y_train)
svm_preds = svm.predict(X_test)
print("SVM Accuracy:", accuracy score(y test, svm preds))
THANK YOU
Q&A

Philippine Skills Framework For Contact Center and Business Process Management
100% (2)
Philippine Skills Framework For Contact Center and Business Process Management
320 pages
System Design
50% (2)
System Design
58 pages
ML 1 PPT Unit 1
No ratings yet
ML 1 PPT Unit 1
93 pages
Practice Questions - Sign Convention - Spherical Mirrors DONEE
0% (1)
Practice Questions - Sign Convention - Spherical Mirrors DONEE
2 pages
Unit-1 ML
No ratings yet
Unit-1 ML
19 pages
Quarter 3 - Module 8: The Power (Positivity, Optimism and Resiliency) To Cope
100% (1)
Quarter 3 - Module 8: The Power (Positivity, Optimism and Resiliency) To Cope
3 pages
Unit I 2
No ratings yet
Unit I 2
78 pages
Machine Learning Models: by Mayuri Bhandari
No ratings yet
Machine Learning Models: by Mayuri Bhandari
48 pages
Unit 1,2,3
No ratings yet
Unit 1,2,3
30 pages
Automatic Transfer Switch - Ats 22 Manual
No ratings yet
Automatic Transfer Switch - Ats 22 Manual
38 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
16 pages
Aptitude Test 2 - Questions
No ratings yet
Aptitude Test 2 - Questions
5 pages
Artificial Intelligence - Machine Learning Fundamentals
No ratings yet
Artificial Intelligence - Machine Learning Fundamentals
31 pages
2019 Genes Ejercicio
No ratings yet
2019 Genes Ejercicio
543 pages
AV 50 Terzan PDF
No ratings yet
AV 50 Terzan PDF
47 pages
Bilal Ahmed Shaik Data Mining
No ratings yet
Bilal Ahmed Shaik Data Mining
88 pages
Introduction Class
No ratings yet
Introduction Class
134 pages
Case Study - Churn Mdel Prediction
No ratings yet
Case Study - Churn Mdel Prediction
77 pages
ML COMPLETE (Pure Sem Ka)
No ratings yet
ML COMPLETE (Pure Sem Ka)
347 pages
Quiz 1 Materials
No ratings yet
Quiz 1 Materials
159 pages
Machine - Learning - Unit - 1
No ratings yet
Machine - Learning - Unit - 1
70 pages
RDZ Search Options
No ratings yet
RDZ Search Options
74 pages
ML Lectures Summary 2
No ratings yet
ML Lectures Summary 2
52 pages
Machine Learning
No ratings yet
Machine Learning
57 pages
Module3 DS PPT
No ratings yet
Module3 DS PPT
68 pages
Introduction To ML
No ratings yet
Introduction To ML
55 pages
Social Media Analytics Techniques
No ratings yet
Social Media Analytics Techniques
77 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
Introduction To ML
No ratings yet
Introduction To ML
31 pages
OU Diary-2020 Informatica PDF
No ratings yet
OU Diary-2020 Informatica PDF
75 pages
Types of Machine Learning
No ratings yet
Types of Machine Learning
63 pages
Vaishali Bujad Project..2
No ratings yet
Vaishali Bujad Project..2
54 pages
ML Unit1
No ratings yet
ML Unit1
25 pages
Chapter 3 - Static Performance Characterstics
No ratings yet
Chapter 3 - Static Performance Characterstics
29 pages
Machine Learning - Brief
No ratings yet
Machine Learning - Brief
12 pages
Unit 5 Intro To Machine Learning
No ratings yet
Unit 5 Intro To Machine Learning
25 pages
ML and Deploying It Using Flask and Docker.
No ratings yet
ML and Deploying It Using Flask and Docker.
30 pages
Housekeeping Operations - Chapter 4 Guestroom Cleaning & Maintenance
No ratings yet
Housekeeping Operations - Chapter 4 Guestroom Cleaning & Maintenance
34 pages
ChatGPT - Machine Learning Overview
No ratings yet
ChatGPT - Machine Learning Overview
34 pages
Lecture 4 Machine Learning - BCSC
No ratings yet
Lecture 4 Machine Learning - BCSC
45 pages
Machine Learning Notes ?
No ratings yet
Machine Learning Notes ?
64 pages
Unit III - I
No ratings yet
Unit III - I
15 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
Model Evaluation
No ratings yet
Model Evaluation
39 pages
The Machine Learning Landscape
No ratings yet
The Machine Learning Landscape
30 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
13 pages
AI Unit 1
No ratings yet
AI Unit 1
30 pages
Module 2 - ML
No ratings yet
Module 2 - ML
53 pages
Learning Progress Review Week 10
No ratings yet
Learning Progress Review Week 10
35 pages
Data in ML
No ratings yet
Data in ML
26 pages
HSBC Digital Starter Kit Masterbrand HBPH
No ratings yet
HSBC Digital Starter Kit Masterbrand HBPH
27 pages
Class10-Introduction To ML
No ratings yet
Class10-Introduction To ML
32 pages
Aiya Session 4
No ratings yet
Aiya Session 4
42 pages
Fam Question Bank CT
No ratings yet
Fam Question Bank CT
14 pages
Lecture 2
No ratings yet
Lecture 2
36 pages
Module 3 - Introduction To ML
No ratings yet
Module 3 - Introduction To ML
45 pages
Unit 4 - Question Bank and Answers
No ratings yet
Unit 4 - Question Bank and Answers
23 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
ML Unit 1
No ratings yet
ML Unit 1
21 pages
MLE
No ratings yet
MLE
15 pages
Unit1 ML
No ratings yet
Unit1 ML
15 pages
Tyagi Wang Wen Zuo
No ratings yet
Tyagi Wang Wen Zuo
17 pages
Mu Checker - 2215 1
No ratings yet
Mu Checker - 2215 1
20 pages
Monthly RE Generation Report April 2025
No ratings yet
Monthly RE Generation Report April 2025
28 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
32 pages
FM11SB 7.8
No ratings yet
FM11SB 7.8
9 pages
Machine Learning
No ratings yet
Machine Learning
12 pages
ML Unit 1
No ratings yet
ML Unit 1
9 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Machine Learning
No ratings yet
Machine Learning
9 pages
Lecture 5
No ratings yet
Lecture 5
26 pages
Machine Learning
No ratings yet
Machine Learning
16 pages
Eyongand Akpa Publication 2
No ratings yet
Eyongand Akpa Publication 2
13 pages
Flatlined - Study Notes
No ratings yet
Flatlined - Study Notes
27 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
8 pages
Politeness On Instagram Comment Section
No ratings yet
Politeness On Instagram Comment Section
12 pages
TransCAD - An Overview of A Transportation Planning and Analysis Software Significance Part 3
No ratings yet
TransCAD - An Overview of A Transportation Planning and Analysis Software Significance Part 3
10 pages
Final ML
No ratings yet
Final ML
2 pages
Questionnaire Employee Name: Designation: Academic Qualification: Experience
No ratings yet
Questionnaire Employee Name: Designation: Academic Qualification: Experience
4 pages
AWS SAA Notes
No ratings yet
AWS SAA Notes
18 pages
L8.2: Interfacing Digital Temperature and Humidity Sensor With Microcontroller
No ratings yet
L8.2: Interfacing Digital Temperature and Humidity Sensor With Microcontroller
6 pages
Introduction To Machine Learning Top-Down Approach - Towards Data Science
No ratings yet
Introduction To Machine Learning Top-Down Approach - Towards Data Science
6 pages
Bio++data Mukul++Vaghela
No ratings yet
Bio++data Mukul++Vaghela
2 pages
AIML105
No ratings yet
AIML105
5 pages
Futo Digital Bootcamp 2024 Timetable
No ratings yet
Futo Digital Bootcamp 2024 Timetable
3 pages
Nitish Bnkassociate
No ratings yet
Nitish Bnkassociate
2 pages
Report General Chejj
No ratings yet
Report General Chejj
3 pages
MMT Bus E-Ticket Nu 25147911932077 Hyderabad-Pune
No ratings yet
MMT Bus E-Ticket Nu 25147911932077 Hyderabad-Pune
2 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet

Fundamentals of ML Recap

Uploaded by

Fundamentals of ML Recap

Uploaded by

IDEAS Emerging Technology

Skills Scholarship Program

Machine learning is a subset of artificial

Supervised Learning: Where the model learns from labeled data.

Regression: Predicting continuous values (e.g., price of a

Unsupervised learning involves models that infer patterns

Basic Metrics In classification, the data is split

# Fill missing values

# Convert categorical variables using one-hot encoding

# Drop columns that are not needed

# Random Forest Classifier

# Support Vector Machine

You might also like