0% found this document useful (0 votes)

11 views57 pages

W1 - Introduction To ML

The document outlines the syllabus for a Machine Learning course (CS-245) taught by Dr. Mehwish Fatima at SEECS-NUST, focusing on foundational concepts, types of machine learning, and practical applications. It covers supervised and unsupervised learning, the ML pipeline, and challenges in machine learning. The course aims to equip students with skills in various algorithms and tools relevant to AI and data science.

Uploaded by

rimahmood2020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views57 pages

W1 - Introduction To ML

Uploaded by

rimahmood2020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 57

Spring 2025

CS-245: Machine
Learning
Dr. Mehwish Fatima
Assistant Professor,
AI & DS Department,
SEECS-NUST, Pakistan
WEEK 1:
INTRODUCTION TO MACHINE
LEARNING
AGENDA 3

01 Introduction to course
04 Types of machine
learning

02 Artiﬁcial intelligence
05 ML pipeline

03 What is machine
learning 06 Challenges in ML
INTRODUCTION TO COURSE
Course
● This course introduces the foundational concepts of machine learning
(ML) with a focus on understanding core algorithms, performance
evaluation, and practical implementation.
○ Supervised, and unsupervised paradigms
○ Build, analyze, and evaluate ML models.
○ Regression, classiﬁcation, clustering, dimensionality reduction, and debugging ML
systems
○ Learning with a project that applies ML techniques to solve practical problems.
Instructor
● Phd:
○ Ruprecht-Karls-Universität Heidelberg, Germany (2018–2024)
● Experience:
○ Industry & Academia (10+years)

● Research:
○ https://scholar.google.com/citations?user=zEyTPkMAAAAJ&hl=en
● Research Area:
○ Generative AI (GenAI) & Natural Language Processing (NLP)
○ Computational Linguistics (CL)
WHAT IS
○ Machine Learning (ML) & Deep Learning (DL)

● GENERATIVE
Practical Skills: AI?
○ Languages & Frameworks: Python, PyTorch, TensorFlow, CUDA, C++/Java
○ Tools: DeepSpeed, Docker, Kubernetes, LangChain, AWS, Google Colab, GitHub,
MultiGPU server deployments
ARTIFICIAL INTELLIGENCE
Artiﬁcial Intelligence Artificial Intelligence

Machine Learning
The basic goal of AI is to develop
intelligent machines. Deep Learning

GenAI
This consists of many sub-goals:
• Perception
• Reasoning
• Control / Motion /
Manipulation
• Planning
• Communication
• Creativity
• Learning
Artiﬁcial Intelligence Artificial Intelligence

Machine Learning
The basic goal of AI is to develop
intelligent machines. Deep Learning

GenAI
This consists of many sub-goals:
• Perception
• Reasoning
• Control / Motion /
Manipulation
• Planning
• Communication
• Creativity
• Learning
Artiﬁcial
Artiﬁcial Intelligence Artificial Intelligence
Intelligence

Intelligence
Machine Learning
The basic goal of AI is to develop Learning
intelligent machines. Deep Learning
Learning
This consists of many sub-goals: GenAI
GenAI

• Perception
• Reasoning
• Control / Motion /
Manipulation
• Planning
• Communication
• Creativity
• Learning
Artiﬁcial
Artiﬁcial Intelligence Artificial Intelligence
Intelligence

Intelligence
Machine Learning
The basic goal of AI is to develop Learning
intelligent machines. Deep Learning
Learning
This consists of many sub-goals: GenAI
GenAI

• Perception
• Reasoning
• Control / Motion / Manipulation
• Planning
• Communication
• Creativity
• Learning
Artiﬁcial
Artiﬁcial Intelligence Artificial Intelligence
Intelligence

Intelligence
Machine Learning
The basic goal of AI is to develop Learning
intelligent machines. Deep Learning
Learning
This consists of many sub-goals: GenAI
GenA
I
• Perception
• Reasoning
• Control / Motion /
Manipulation
• Planning
• Communication
• Creativity
• Learning
Artiﬁcial
Artiﬁcial Intelligence
Artificial Intelligence
Intelligence

Intelligence
Machine Learning
The basic goal of AI is to develop Learning
intelligent machines. Deep Learning
Learning
This consists of many sub-goals: GenAI
GenA
I
• Perception
• Reasoning
• Control / Motion /
Manipulation
Planning
•• Communication
• Creativity
• Learning
Artiﬁcial
Artiﬁcial Intelligence Artificial Intelligence
Intelligence

1
“Deep Style” from https://deepdreamgenerator.com/#gallery 0
Artiﬁcial
Artiﬁcial Intelligence Artificial Intelligence
Intelligence

Intelligence
Machine Learning
The basic goal of AI is to develop Learning
intelligent machines. Deep Learning
Learning
This consists of many sub-goals: GenAI
GenA
I
• Perception
• Reasoning
• Control / Motion /
Manipulation
• Planning
• Communication
Creativity
• Learning
History
WHAT IS MACHINE LEARNING?
Machine Learning
● Machine Learning (ML) is a subset of AI that enables models to learn
patterns from data and make decisions without being explicitly
programmed.

● Mathematically
○ A model 𝑓 learns a function that maps input 𝑋 to output 𝑌,
○ 𝑌 = 𝑓(𝑋)+ϵ where ϵ represents the error or noise in predictions.
Machine Learning
● Logically
○ The goal of ML is to generalize from past data (training data) to make accurate
predictions on unseen data (test data).

● Example
○ Predicting house prices using features like square footage, location, and number of
rooms.
Learning in Humans
Learning in Humans
● Imagine you're teaching a child to recognize
different types of animals, like dogs and cats.

○ You show them pictures of 100 different

dogs and 100 different cats,
explaining which is which.

○ This is their training data —

the pictures you've already shown them,
and they’ve learned from.
Learning in Humans
Learning in Humans
● Now, if you show them a new picture of a dog
they've never seen before,

○ you expect them to correctly say,

"That’s a dog!"

○ This is them generalizing their learning

to new situations.

○ Even though they’ve never seen this exact dog before,

they recognize it based on what they learned from the
other dog pictures.
Machine Learning
● Training Data is like showing the system
lots of examples
○ like pictures of dogs and cats with
correct labels this is a "dog", this is a "cat"

● Test Data is like showing the system a

brand-new picture and asking it to guess
what it is.

● The better the system is at generalizing,

the better it will be at making accurate predictions on new data it hasn’t
seen before.
Machine Learning
● Generalization
○ The goal is not just to memorize the training data
■ like remembering each dog individually

○ but to learn patterns

■ dogs have certain features like four legs, a tail, fur, etc.

○ that can apply to new, unseen examples.

This is what machine learning models aim to do when we say they are
"generalizing".
Machine Learning
● Generalization
○ A model that has generalized well can handle new data and make correct
predictions,
■ like predicting house prices in a new neighborhood based on the
features (size, number of rooms, etc.) it learned during training.

○ A model that doesn't generalize well might only work on the data it has seen
before and fail when presented with new data, which is called overﬁtting.
Traditional Programming Approach Vs. ML
Traditional Programming
Approach Vs. ML
ML CLASSIFICATION
Types of Machine Learning
Types of Machine Learning
● There are so many different types of ML systems that it is useful to
classify them in broad categories based on:

○ Whether or not they are trained with human supervision (supervised, unsupervised,
semi-supervised, and reinforcement learning)

○ Whether or not they can learn incrementally on the ﬂy (online versus batch learning)

○ Whether they work by simply comparing new data points to known data points, or
instead detect patterns in the training data and build a predictive model, much like
scientists do (instance-based versus model-based learning)
Types of Machine Learning
Types of Machine Learning
Machine Learning systems can be classiﬁed according to the amount and type
of supervision they get during training.
Supervised Learning
Supervised Learning
● The training data you feed to the
algorithm includes the desired solutions,
called labels

● Mathematically
○ Learning a function that maps input 𝑋 to output 𝑌, where labels are provided.

● Use cases
○ Spam detection (classiﬁcation), house price prediction (regression).
Supervised Learning
Supervised Learning
● Classiﬁcation
○ predictive model that approximates a
mapping function from input variables to
identify discrete output variables
■ labels or categories

○ The mapping function of classiﬁcation algorithms is responsible for predicting the

label or category of the given input variables.

○ A classiﬁcation algorithm can have both discrete and real-valued variables, but it
requires that the examples be classiﬁed into one of two or more classes.
Supervised Learning
Supervised Learning
● Regression
○ predict a continuous value based on the
input variables.

○ The main goal of regression problems is to estimate a mapping function based on the
input and output variables.

○ If your target variable is a quantity like income, scores, height or weight, or the
probability of a binary category (like the probability of rain in particular regions), then
you should use the regression model.

○ Ex: Customer segmentation, anomaly detection.

Supervised Learning
Supervised Learning
● Classiﬁcation vs. Regression
○ Regression helps predict a continuous quantity

○ Classiﬁcation predicts discrete class labels

● Overlap
○ A regression algorithm can predict a discrete value which is in the form of an integer
quantity

○ A classiﬁcation algorithm can predict a continuous value if it is in the form of a class

label probability
Some Popular Algorithms
● k-Nearest Neighbors

● Linear Regression

● Logistic Regression

● Support Vector Machines (SVMs)

● Decision Trees and Random Forests

● Neural networks
Unsupervised Learning
● As you might guess, the training data is
unlabeled. The system tries to learn without
a teacher.

● Mathematically
○ Learning patterns in the data without any labels by either minimizing or maximizing the
objective function.

● Use cases
○ Customer segmentation, anomaly detection.
Unsupervised Learning
● Clustering
○ The goal is to ﬁnd natural groups or clusters
in a feature space and interpret the input data.

○ To divide the data points in a way that each data point falls into a group that is similar
to other data points in the same group based on a predeﬁned similarity or distance
metric in the feature space.

○ Ex: determining customer segments in marketing data.

■ different segments of customers helps marketing teams approach these
customer segments in unique ways.
● Think of features like gender, location, age, education, income bracket, and so on.
Unsupervised Learning
● Dimensionality reduction
○ the goal is to reduce the number of random
variables under consideration.

○ To reduce the complexity of a problem by projecting the feature space to a

lower-dimensional space so that less correlated variables are considered in a
machine learning system.

○ Ex: Visualization algorithms try to preserve as much structure as they can

■ (e.g., trying to keep separate clusters in the input space from overlapping in the
visualization),
○ to understand how the data is organized and perhaps identify unsuspected patterns.
Unsupervised Learning
● Feature extraction
○ The goal is to simplify the data without losing too much
information.

○ One way to do this is to merge several correlated

features into one.

○ Ex: a car’s mileage may be very correlated with its age,

so the dimensionality reduction algorithm will merge them into one feature that
represents the car’s wear and tear.
Unsupervised Learning
● Clustering
○ K-Means
○ Hierarchical Cluster Analysis (HCA)

● Anomaly detection and novelty detection

○ One-class SVM
○ Isolation Forest

● Visualization and dimensionality reduction

○ Principal Component Analysis (PCA)
○ Locally-Linear Embedding (LLE)
Unsupervised Learning
● Deal with partially labeled training data,
usually a lot of unlabeled data and a little
bit of labeled data.

○ Ex: Google Photos-you upload all your family photos, it automatically recognizes that
the same person A shows up in photos 1, 5, and 11, while another person B shows up in
photos 2, 5, and 7.
■ This is the unsupervised part of the algorithm (clustering).

○ Now all the system needs is for you to tell it who these people are.
■ Just one label per person, and it is able to name everyone in every photo, which is
useful for searching photos.
Unsupervised Learning
● The learning system—called an agent
in this context
○ can observe the environment,
○ select and perform actions, and
○ get rewards in return
○ or penalties in the form of negative
rewards.

○ It must then learn by itself what is the best strategy/ policy

to get the most reward over time.

○ A policy deﬁnes what action the agent should choose when it is in a given situation.
ML PIPELINE
ML Pipeline
ML Pipeline
ML Pipeline
ML Pipeline
ML Pipeline
● A real estate company wants to predict house prices based on various
factors. They use a machine learning model to help estimate the price of a
house based on its features.

○ Features: These are the characteristics or input variables of each house that are used
to predict the price.
■ Ex: Square footage, number of bedrooms, and age of the house.

○ Labels: This is the target value the model is trying to predict, which in this case is the
house price.
■ Ex: The actual sale price of the house, like $350,000.
ML Pipeline: Predicting House Prices
● A real estate company wants to predict house prices based on various
factors. They use a machine learning model to help estimate the price of a
house based on its features.

○ Training: The process model learns from historical data, where both the features
(house characteristics) and labels (house prices) are known.
■ The company uses past house sales data to train the model so it can learn the
relationship between features and the house price.
ML Pipeline: Predicting House Prices
● A real estate company wants to predict house prices based on various
factors. They use a machine learning model to help estimate the price of a
house based on its features.

○ Testing: The model is tested on unseen data to check how accurately it predicts house
prices for new examples.
■ The model is tested on new houses, where it predicts the price, and the
predictions are compared with the actual prices.
ML Pipeline: Predicting House Prices
● A real estate company wants to predict house prices based on various
factors. They use a machine learning model to help estimate the price of a
house based on its features.

○ Evaluation Metrics: These are measures used to assess how well the model performs.
■ Ex: Mean Squared Error (MSE) can measure how far the predicted house prices
are from the actual prices. Lower error indicates better accuracy.
Challenges in ML
Data Challenges in ML
The two things that can go wrong are “bad algorithm” and “bad data.” Let’s
start with examples of bad data.

● Insufficient Quantity of Training Data

○ ML requires large amounts of data, unlike
a toddler who can quickly learn concepts
with just a few examples, as even simple
ML tasks often need thousands of examples
and complex ones may need millions.
Data Challenges in ML
● Nonrepresentative Training Data
○ To ensure good generalization, training data must be representative of the cases you
want to predict, as missing or biased data can lead to poor model performance.
Data Challenges in ML
● Poor-Quality Data
○ If your training data contains errors, outliers, or noise, it will hinder pattern detection
and reduce system performance, making data cleaning a critical step in building
effective models.
Data Challenges in ML
● Irrelevant Features
○ As the saying goes, "garbage in, garbage out."
○ Your machine learning system's performance depends heavily on having relevant
training data features, making feature engineering—selecting, extracting, and creating
useful features—a crucial aspect of any successful ML project.
Model Challenges in ML
● Overfitting the Training Data
○ Overgeneralization parallels the concept of overfitting in ML, where a model may excel
on training data yet fail to generalize to new data, raising questions about its predictive
trustworthiness.
Model Challenges in ML
● Underfitting the Training Data
○ The opposite of overfitting, occurs when a model is too simplistic to capture the
underlying complexity of the data and can be addressed by selecting a more powerful
model, improving feature engineering, or reducing constraints on the model.
ML Vs. DL
ML Vs. DL
56

Questions?
THANK YOU

21CSC305P ML - Unit 1-E
No ratings yet
21CSC305P ML - Unit 1-E
137 pages
Personal Factors CPALE
No ratings yet
Personal Factors CPALE
113 pages
Machine Learning
No ratings yet
Machine Learning
17 pages
Fractions - Lesson Plan
100% (1)
Fractions - Lesson Plan
6 pages
What Is Yoga Nidra
100% (1)
What Is Yoga Nidra
5 pages
Fundamentals of Machine Learning II
No ratings yet
Fundamentals of Machine Learning II
13 pages
Ai - Foundations of Machine Learning I
No ratings yet
Ai - Foundations of Machine Learning I
39 pages
Machine Learning.
No ratings yet
Machine Learning.
50 pages
UNit 1 Introduction To ML
No ratings yet
UNit 1 Introduction To ML
225 pages
Machine Learning Notes - Concepts, Algorithms
No ratings yet
Machine Learning Notes - Concepts, Algorithms
171 pages
ML - Unit I - Final
No ratings yet
ML - Unit I - Final
132 pages
Module 1-Basics of ML
No ratings yet
Module 1-Basics of ML
142 pages
Machine Learning Unit 1
100% (7)
Machine Learning Unit 1
112 pages
ML Maths Full Notes
No ratings yet
ML Maths Full Notes
120 pages
Unit - 2 Machine Learning
No ratings yet
Unit - 2 Machine Learning
45 pages
U1 ML Intro and Applications
No ratings yet
U1 ML Intro and Applications
123 pages
ML Key Concepts
No ratings yet
ML Key Concepts
139 pages
Unit 1 ML
No ratings yet
Unit 1 ML
96 pages
2024 Machine Learning Intro
No ratings yet
2024 Machine Learning Intro
50 pages
Uvuhiihijno
No ratings yet
Uvuhiihijno
14 pages
AIDS Module 1 Notes Draft
No ratings yet
AIDS Module 1 Notes Draft
30 pages
Machine Learning
No ratings yet
Machine Learning
39 pages
1993 Aaron Antonovxy - Coherence Scale
100% (1)
1993 Aaron Antonovxy - Coherence Scale
9 pages
Presentation On ML
No ratings yet
Presentation On ML
469 pages
B.Tech V MLT KCS055 Unit1 2
No ratings yet
B.Tech V MLT KCS055 Unit1 2
9 pages
Ai - Foundations of Machine Learning I
No ratings yet
Ai - Foundations of Machine Learning I
40 pages
CE880 Lecture5 Slides
No ratings yet
CE880 Lecture5 Slides
32 pages
Thinking Sills - Robert Fisher
No ratings yet
Thinking Sills - Robert Fisher
21 pages
ML Notes
No ratings yet
ML Notes
18 pages
Module 1
No ratings yet
Module 1
175 pages
Bca & Bscit Sem-6 SUBJECT:-Machine Learning With Python CH:-1 Introduction To Machine Learning
No ratings yet
Bca & Bscit Sem-6 SUBJECT:-Machine Learning With Python CH:-1 Introduction To Machine Learning
19 pages
UNIT I-Part 1
No ratings yet
UNIT I-Part 1
52 pages
MLUnit - 1 Share
No ratings yet
MLUnit - 1 Share
162 pages
Lec-1 Introduction
No ratings yet
Lec-1 Introduction
65 pages
Mlfa Autumn 22 Lec 01
No ratings yet
Mlfa Autumn 22 Lec 01
43 pages
MLUnit 1
No ratings yet
MLUnit 1
131 pages
Management and Leadership 6
75% (4)
Management and Leadership 6
72 pages
Intro To ML - 1
No ratings yet
Intro To ML - 1
29 pages
1 Introduction
No ratings yet
1 Introduction
24 pages
DA Chap2
No ratings yet
DA Chap2
14 pages
Lecture 1
No ratings yet
Lecture 1
47 pages
Introduction To Machine Learning Basics
No ratings yet
Introduction To Machine Learning Basics
12 pages
Mlintro 4
No ratings yet
Mlintro 4
28 pages
Unit 1&2
No ratings yet
Unit 1&2
270 pages
Ch7 Introduction To Machine Learning
No ratings yet
Ch7 Introduction To Machine Learning
29 pages
Mlintro 2
No ratings yet
Mlintro 2
28 pages
Machine Learning: Professional CORE (CET3006B) T. Y. B.Tech CSE
No ratings yet
Machine Learning: Professional CORE (CET3006B) T. Y. B.Tech CSE
106 pages
UNIT-1 Machine Learning
No ratings yet
UNIT-1 Machine Learning
43 pages
Module 1
No ratings yet
Module 1
22 pages
Machine Learning and Soft Computing: CSCC53 Mca V Sem 2020
No ratings yet
Machine Learning and Soft Computing: CSCC53 Mca V Sem 2020
33 pages
Module1 - Deep Learning
No ratings yet
Module1 - Deep Learning
26 pages
Unit 1 ML
No ratings yet
Unit 1 ML
70 pages
Lecture 1
No ratings yet
Lecture 1
65 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
68 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
CE469 - Introduction To Machine Learning: Lecturer Contact
No ratings yet
CE469 - Introduction To Machine Learning: Lecturer Contact
33 pages
DETAILED Lesson Plan Emglish 3
No ratings yet
DETAILED Lesson Plan Emglish 3
4 pages
ML Unit-1
No ratings yet
ML Unit-1
34 pages
Formulating A Thesis Statement
No ratings yet
Formulating A Thesis Statement
3 pages
Mlintro 3
No ratings yet
Mlintro 3
28 pages
A PDF
No ratings yet
A PDF
26 pages
Esp 2ND Quarter
No ratings yet
Esp 2ND Quarter
2 pages
Internship Report
No ratings yet
Internship Report
31 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
Matrixde
No ratings yet
Matrixde
182 pages
Intro Machine Learning
No ratings yet
Intro Machine Learning
4 pages
Project On: Employee Motivation and Empowerment
No ratings yet
Project On: Employee Motivation and Empowerment
63 pages
Lesson Plan n1
No ratings yet
Lesson Plan n1
6 pages
Introduction To AI and ML - Day 1: Gururajan Narasimhan Erode
No ratings yet
Introduction To AI and ML - Day 1: Gururajan Narasimhan Erode
39 pages
Introduction To National 5 English Ruae
No ratings yet
Introduction To National 5 English Ruae
20 pages
UNIT 4 Skills in Counselling
No ratings yet
UNIT 4 Skills in Counselling
73 pages
Checklist: Skill Competency
No ratings yet
Checklist: Skill Competency
2 pages
Science Matter Lesson Plan
100% (1)
Science Matter Lesson Plan
2 pages
A Report - Exercises
No ratings yet
A Report - Exercises
3 pages
Session One
No ratings yet
Session One
26 pages
Example of Group Activities
No ratings yet
Example of Group Activities
9 pages
I Learn Smart World 10 - Unit 3 - Grammar
No ratings yet
I Learn Smart World 10 - Unit 3 - Grammar
4 pages
Informational Influence in Organizations: An Integrated Approach To Knowledge Adoption
No ratings yet
Informational Influence in Organizations: An Integrated Approach To Knowledge Adoption
26 pages
Semi Detailed Lesson Plan in Mtb-Mle
No ratings yet
Semi Detailed Lesson Plan in Mtb-Mle
7 pages
Day1 (3) Toefl Prep
No ratings yet
Day1 (3) Toefl Prep
7 pages
Penerapan Algoritma Convolutional Neural Network Dalam Klasifikasi Telur Ayam Fertil Dan Infertil Berdasarkan Hasil Candling
No ratings yet
Penerapan Algoritma Convolutional Neural Network Dalam Klasifikasi Telur Ayam Fertil Dan Infertil Berdasarkan Hasil Candling
9 pages
Axiological Linguistics - 2023-1-Part 1.2
No ratings yet
Axiological Linguistics - 2023-1-Part 1.2
45 pages
Cori Method2 PDF
No ratings yet
Cori Method2 PDF
21 pages
Rae Lectura Metaconocimiento
No ratings yet
Rae Lectura Metaconocimiento
4 pages
Why Do We Laugh
No ratings yet
Why Do We Laugh
2 pages
Effectiveness of Implementation of Blended Learning and Flipped Classroom Methods in Higher Education Institutions
No ratings yet
Effectiveness of Implementation of Blended Learning and Flipped Classroom Methods in Higher Education Institutions
6 pages
Lesson Plan (10.6.2025)
No ratings yet
Lesson Plan (10.6.2025)
2 pages
Artificial Intelligence Class 7: Skill Education for Class 7th, Code (417)
From Everand
Artificial Intelligence Class 7: Skill Education for Class 7th, Code (417)
Geeta Zunjani
No ratings yet
AI Training Navigator: What to Ask When Choosing the Right AI Training for Your Team
From Everand
AI Training Navigator: What to Ask When Choosing the Right AI Training for Your Team
Ben Jones
No ratings yet