Week 3 A

Uploaded by

eshaasif005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views18 pages

Week 3 A

Uploaded by

eshaasif005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 18

INTRODUCTION TO MACHINE LEARNING

BY AATIQA BINT E GHAZALI

FALL 2024
Revision
 Q/A session from the previous lecture.
 Check the home-work given.
Tasks of learning examples
 Supervised: classification,regression analysis,
 Unsupervised : anomaly detection , dimensionality reduction,
 Reinforcement: many robots implement Reinforcement Learning algorithms
to learn how to walk, gaming chess players etc,
Main Challenges of Machine Learning
 Insufficient Quantity of Training Data
 Non-representative Training Data
 Poor quality data
 Irrelevant features
 Overfitting the Training Data
 Under-fitting the Training Data
Testing and Validating
 Putting the model in production for testing is bad.
 splitting the data into two sets: the training set and the test set : better
option
 80:20 ratio for train and test
THE ML TOOLBOX
 Data
 Infrastructure
 Algorithms
 Visualizations
Machine Learning Pipeline
 a series of interconnected data processing and modeling steps
 designed to automate, standardize and streamline the process of building,
training, evaluating and deploying machine learning models.
stages of a machine learning pipeline
1. Data collection
2. Data preprocessing
3. Feature engineering
4. Model selection
5. Model training
6. Model evaluation
7. Model deployment
8. Monitoring and maintenance
Data Collection
 new data is collected from various data sources, such as
databases, APIs , or files
 often involves raw data which may require preprocessing to be
useful.
 Common sources of data : Kaggle , UCI
Data preprocessing

 involves cleaning, transforming and preparing input data for modeling.

 Common preprocessing steps include handling missing values, encoding
categorical variables, scaling numerical features and splitting the
data into training and testing sets.
Feature engineering & Model
selection
 Feature engineering
 creating new features or selecting relevant features from the
data that can improve the model's predictive power.
 This step often requires domain knowledge and creativity.
 Model selection
 choose the appropriate machine learning algorithm(s) based on the problem
type (e.g., classification, regression), data characteristics, and performance
requirements.
Model training & Model evaluation
 Model training
 The selected model(s) are trained on the training dataset using the
chosen algorithm(s).
 This involves learning the underlying patterns and relationships within
the training data.
 Pre-trained models can also be used, rather than training a new model.
 Model evaluation
 We will be assessing the model's performance using a separate testing
dataset or through cross-validation.
 Common evaluation metrics depend on the specific problem but may
include accuracy, precision, recall, F1-score, mean squared error or
others.
Model deployment & Maintenance
 Model deployment
 Once a satisfactory model is developed and evaluated, it can be deployed to
a production environment where it can make predictions on new, unseen
data.
 Maintenance
 After deployment, it's important to continuously monitor the model's
performance and retrain it as needed to adapt to changing data patterns.
 This step ensures that the model remains accurate and reliable in a real-
world setting.
 Lets do some practical implementation from the pipeline discussed
Titanic Dataset Collection
 Kaggle holds a wide range of datasets of various types
 One of the most common and beginner datasets / competition is titanic
dataset
 This dataset is used to predict the survivals in Titanic
 Download the dataset
 Upload it on drive
 Explore it
 https://kaggle.com/c/titanic/data
Pandas and NumPy
 pandas and NumPy are very useful libraries in Python
 Pandas is a very popular library for working with data . DataFrames are at
the center of pandas. A DataFrame is structured like a table or spreadsheet.
The rows and the columns both have indexes, and you can perform
operations on rows or columns separately.
 NumPy is an open-source Python library that facilitates efficient numerical
operations on large quantities of data.
 Pandas is built on the top of numpy
 If you are working on anaconda use !pip install numpy and pandas
 Pip is made for installing things in colab , anaconda etc
 After installation import numpy and pandas into your code
Matplotlip and seaborn
 For visualization purposes
 Matplotlib is primarily used for basic chart plotting,
 while Seaborn offers many default themes and a wide variety of schemes for
statistical visualization.
 Import these two libraries in colab
Loading dataset
 Read the csv file using pandas
 And display the dataset in notebook
 Read first few rows of data

Paper 2 English
No ratings yet
Paper 2 English
8 pages
CS 2 3 4 Aml
No ratings yet
CS 2 3 4 Aml
70 pages
Personal Mandala Rubric
No ratings yet
Personal Mandala Rubric
2 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
L2 - Machine Learning Process
No ratings yet
L2 - Machine Learning Process
17 pages
ML Notion 1
No ratings yet
ML Notion 1
18 pages
Lecture Notes 1 2 Intro Python
No ratings yet
Lecture Notes 1 2 Intro Python
13 pages
5.1 Large Scale ML
No ratings yet
5.1 Large Scale ML
10 pages
Approaching (Almost) Any Machine Learning Problem - Abhishek Thakur - No Free Hunch
No ratings yet
Approaching (Almost) Any Machine Learning Problem - Abhishek Thakur - No Free Hunch
22 pages
Module 5.pptx - 20250608 - 201231 - 0000
No ratings yet
Module 5.pptx - 20250608 - 201231 - 0000
43 pages
Module 3 Data Science Machine Learning
No ratings yet
Module 3 Data Science Machine Learning
53 pages
Lecture 2 Unit 1
No ratings yet
Lecture 2 Unit 1
60 pages
Machine Learning Part: Domain Overview
No ratings yet
Machine Learning Part: Domain Overview
20 pages
AI Project Report: By: Neha Kalra (17csu122) and Prerna Pathak (17csu143)
No ratings yet
AI Project Report: By: Neha Kalra (17csu122) and Prerna Pathak (17csu143)
22 pages
Workflow of A Machine Learning Project
No ratings yet
Workflow of A Machine Learning Project
12 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
EXAMPLE ML in Real Life
No ratings yet
EXAMPLE ML in Real Life
6 pages
Session 4 Machine Learning Process
No ratings yet
Session 4 Machine Learning Process
28 pages
Lecture - 2 Classification (Machine Learning Basic and KNN)
No ratings yet
Lecture - 2 Classification (Machine Learning Basic and KNN)
90 pages
ML Da
No ratings yet
ML Da
55 pages
Module - 1
No ratings yet
Module - 1
9 pages
Isml 3
No ratings yet
Isml 3
9 pages
Research Trends in Machine Learning: Muhammad Kashif Hanif
No ratings yet
Research Trends in Machine Learning: Muhammad Kashif Hanif
80 pages
Unit III - I
No ratings yet
Unit III - I
15 pages
Fall2024 W4995 Lecture1
No ratings yet
Fall2024 W4995 Lecture1
110 pages
MDCM Sagar Assignment
No ratings yet
MDCM Sagar Assignment
15 pages
Air Quality Prediction Using Machine Learning
No ratings yet
Air Quality Prediction Using Machine Learning
29 pages
ML and Deploying It Using Flask and Docker.
No ratings yet
ML and Deploying It Using Flask and Docker.
30 pages
AI-Lecture 8 (Machine Learning Overview)
No ratings yet
AI-Lecture 8 (Machine Learning Overview)
42 pages
Semi Supervised Learning
No ratings yet
Semi Supervised Learning
86 pages
Activity Log
No ratings yet
Activity Log
23 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
6 pages
Machine Learning Engineer Interview Preparation Guide
No ratings yet
Machine Learning Engineer Interview Preparation Guide
14 pages
ML in Simple Words: in Python, The Function Is Used To Display Output On The Screen or Other Standard Output Device
No ratings yet
ML in Simple Words: in Python, The Function Is Used To Display Output On The Screen or Other Standard Output Device
30 pages
Part 2 Introduction To ML
No ratings yet
Part 2 Introduction To ML
13 pages
OceanofPDF - Com Hands-On Machine Learning From Scratch - Venelin Valkov
No ratings yet
OceanofPDF - Com Hands-On Machine Learning From Scratch - Venelin Valkov
119 pages
Silver Oak College of Computer Application: Subject:Machine Learning
No ratings yet
Silver Oak College of Computer Application: Subject:Machine Learning
15 pages
ML Workshop
No ratings yet
ML Workshop
78 pages
ML Training PDF
No ratings yet
ML Training PDF
6 pages
AIML
No ratings yet
AIML
5 pages
End To End Project
No ratings yet
End To End Project
21 pages
Unit 1
No ratings yet
Unit 1
43 pages
Mooc Presentation
No ratings yet
Mooc Presentation
13 pages
Lecture 17&18 - Introduction To Machine Learning
No ratings yet
Lecture 17&18 - Introduction To Machine Learning
51 pages
ML - 1 - Sovan - Introduction To ML
No ratings yet
ML - 1 - Sovan - Introduction To ML
83 pages
ML Lectures Summary 2
No ratings yet
ML Lectures Summary 2
52 pages
1694266379-Unit1 Machine Learning Introduction CU 2.0
No ratings yet
1694266379-Unit1 Machine Learning Introduction CU 2.0
58 pages
ML Checklist PDF
No ratings yet
ML Checklist PDF
4 pages
Unit 2
No ratings yet
Unit 2
19 pages
UNIT2
No ratings yet
UNIT2
20 pages
Data Science and Machine Learning
No ratings yet
Data Science and Machine Learning
30 pages
ML Interactively
No ratings yet
ML Interactively
273 pages
Machine Learning 3
No ratings yet
Machine Learning 3
30 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
BDA Lec11
No ratings yet
BDA Lec11
32 pages
Intro To Machine Learning With Python
100% (1)
Intro To Machine Learning With Python
55 pages
Algorithmeknn 121213175830 Phpapp02
No ratings yet
Algorithmeknn 121213175830 Phpapp02
52 pages
Machine Learning
No ratings yet
Machine Learning
7 pages
Chapter 2 Preparing To Model
No ratings yet
Chapter 2 Preparing To Model
49 pages
Unit 1-1
No ratings yet
Unit 1-1
10 pages
Manual Data
No ratings yet
Manual Data
13 pages
Cse3001 Ai ML m2
No ratings yet
Cse3001 Ai ML m2
118 pages
Lecture 14
No ratings yet
Lecture 14
19 pages
Lecture 9 Part 2
No ratings yet
Lecture 9 Part 2
15 pages
Lecture 13
No ratings yet
Lecture 13
22 pages
Lecture 11 Part 1
No ratings yet
Lecture 11 Part 1
26 pages
Lecturer 6 Initial Forty Years of Mohammad's Life
100% (1)
Lecturer 6 Initial Forty Years of Mohammad's Life
33 pages
Week 11 Code
No ratings yet
Week 11 Code
1 page
Grasp Pattern
No ratings yet
Grasp Pattern
35 pages
MVC Framework - Architecture Patterns
No ratings yet
MVC Framework - Architecture Patterns
14 pages
Central University of Haryana: Temporary Camp Office: Govt. B.Ed. College Building, Narnaul (Distt. Mahendergarh) Haryana
No ratings yet
Central University of Haryana: Temporary Camp Office: Govt. B.Ed. College Building, Narnaul (Distt. Mahendergarh) Haryana
7 pages
Choose The BEST Answer.: Practice Test 2 - Assessment of Learning Multiple Choice
100% (1)
Choose The BEST Answer.: Practice Test 2 - Assessment of Learning Multiple Choice
6 pages
Ithm 605 Global Foodservice and Lodging Operations Syllabus
No ratings yet
Ithm 605 Global Foodservice and Lodging Operations Syllabus
16 pages
Edu 402 Quiz Solved: Brutal Facts
No ratings yet
Edu 402 Quiz Solved: Brutal Facts
7 pages
Stats 101 Assignment 1
No ratings yet
Stats 101 Assignment 1
9 pages
ISO 9001 Internal Auditor Training
100% (3)
ISO 9001 Internal Auditor Training
7 pages
Intelligent Motion Control Design For An Omnidirectional Conveyor System
No ratings yet
Intelligent Motion Control Design For An Omnidirectional Conveyor System
11 pages
Industrial Training Report 2
No ratings yet
Industrial Training Report 2
3 pages
Nova Southeastern Dissertation Guide
100% (2)
Nova Southeastern Dissertation Guide
4 pages
SAP S4 Hana Syllabus
No ratings yet
SAP S4 Hana Syllabus
3 pages
Fluid Mechanics Lab Report: STUDY OF PRESSURE DISTRIBUTION ON A CYLINDER
No ratings yet
Fluid Mechanics Lab Report: STUDY OF PRESSURE DISTRIBUTION ON A CYLINDER
11 pages
Bo de Thi Tieng Anh Lop 4 Hoc Ki 1 Co Dap An
No ratings yet
Bo de Thi Tieng Anh Lop 4 Hoc Ki 1 Co Dap An
60 pages
Fluid Statics Examples
No ratings yet
Fluid Statics Examples
14 pages
PythonProgrammingTutorial Day01
No ratings yet
PythonProgrammingTutorial Day01
6 pages
Entrepreneurship: Quarter 1 - Module 1
No ratings yet
Entrepreneurship: Quarter 1 - Module 1
23 pages
Liveloud Lyrics 2021
No ratings yet
Liveloud Lyrics 2021
606 pages
Light and Electricity in Fishing
No ratings yet
Light and Electricity in Fishing
19 pages
Democracy in Athens
No ratings yet
Democracy in Athens
2 pages
Basics of Essay Writing
No ratings yet
Basics of Essay Writing
20 pages
Auditing Theory 2013
No ratings yet
Auditing Theory 2013
28 pages
Kamuli District DDP III 2020 - 2025 - 0
No ratings yet
Kamuli District DDP III 2020 - 2025 - 0
233 pages
5 Principles of Presentation Design-18
No ratings yet
5 Principles of Presentation Design-18
27 pages
Array: Intermediate Level Questions
No ratings yet
Array: Intermediate Level Questions
3 pages
JD - Lead Salesforce Developer-2
No ratings yet
JD - Lead Salesforce Developer-2
2 pages
Corex Delivery
No ratings yet
Corex Delivery
37 pages
Indian Mathematicians
No ratings yet
Indian Mathematicians
12 pages
English Project
No ratings yet
English Project
22 pages

Week 3 A

Uploaded by

Week 3 A

Uploaded by

INTRODUCTION TO MACHINE LEARNING

BY AATIQA BINT E GHAZALI

 involves cleaning, transforming and preparing input data for modeling.

You might also like