0% found this document useful (0 votes)
163 views18 pages

Machine Learning (CSC052P6G, CSC033U3M, CSL774, EEL012P5E) : Dr. Shaifu Gupta

This document provides information about a machine learning course taught by Dr. Shaifu Gupta. The course will cover topics such as data preprocessing, supervised learning algorithms like linear regression and neural networks, unsupervised learning algorithms like k-means clustering, and dimensionality reduction techniques. The course structure includes lectures, labs involving implementing machine learning algorithms from scratch, and evaluations including quizzes, midterms and finals. Prerequisites for the course include knowledge of Python, probability, statistics, and linear algebra.

Uploaded by

Soubhav Chaman
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
163 views18 pages

Machine Learning (CSC052P6G, CSC033U3M, CSL774, EEL012P5E) : Dr. Shaifu Gupta

This document provides information about a machine learning course taught by Dr. Shaifu Gupta. The course will cover topics such as data preprocessing, supervised learning algorithms like linear regression and neural networks, unsupervised learning algorithms like k-means clustering, and dimensionality reduction techniques. The course structure includes lectures, labs involving implementing machine learning algorithms from scratch, and evaluations including quizzes, midterms and finals. Prerequisites for the course include knowledge of Python, probability, statistics, and linear algebra.

Uploaded by

Soubhav Chaman
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 18

Machine Learning

(CSC052P6G, CSC033U3M,
CSL774, EEL012P5E )
Dr. Shaifu Gupta
[email protected]
Content for the course

Data Preprocessing. Evaluation metrics. Supervised learning


algorithms: Linear and Logistic Regression, Gradient Descent, Support
Vector Machines, Kernels, Artificial Neural Networks, Decision Trees,
ML and MAP Estimates, K-Nearest Neighbor, Naive Bayes, Introduction
to Bayesian Networks. Unsupervised learning algorithms: K-Means
clustering, Gaussian Mixture Models, Expectation Maximization.
Dimensionality Reduction and Principal Component Analysis. Bias
Variance Trade-off. Model Selection and Feature Selection.
Regularization. Applications. Advanced Topics.

2
Course İnformation

Course structure: 3:0:2

Prerequisite: COL 773


(Python, probability and statistics, linear
algebra, optimization)

Revision will be helpful!

3
Labs/ mini projects

Linear and Logistic Regression, Support Vector Machines, Artificial


Neural Networks, Decision Trees, K-Nearest Neighbor, Bayesian
models, K-Means clustering, Gaussian Mixture Models, Principal
Component Analysis etc.

Write the code yourself (rather than using inbuilt libraries)!!

4
Mode of Evaluation CSL 774, CSP 774 CSC033U3M,
( % Weight )
CSC052P6G, EEL012P5E
% Weight
Surprise class quiz 20 15

Mid Sem 30 25

Class Test 2 20 15

End Sem 30 25

Lab Evaluation 100 (CSL 774) 20

Total % 200 100

5
Reference material

Other reference materials will also be shared from time to time!

7
INTRODUCTION
What is machine learning?

Herbert Simon (1970)


Any process by which a system improves its performance
Tom Mitchell (1990)
A computer program that improves its performance at some task through
experience
Wikipedia
Machine learning (ML) is the study of computer algorithms that improve
automatically through experience - by the use of data.

9
Big Data
10

● Widespread use of personal computers, social networks, web search etc..


leads to generation of “big data”
● We are both producers and consumers of data
● Data is not random, it has structure, e.g., customer behavior
● We need mechanism to extract that structure from data for
(a) Understanding the process
(b) Making predictions for the future

Example in retail: Customer transactions to consumer behavior:


People who bought “Blink” also bought “Outliers” [*books by Malcolm
Gladwell] (www.amazon.com)
Data Mining
11

● Retail: Market basket analysis, Customer relationship management (CRM)


● Finance: Credit scoring, fraud detection
● Manufacturing: Control, robotics, troubleshooting
● Medicine: Medical diagnosis
● Telecommunications: Spam filters, intrusion detection
● Bioinformatics: Motifs, alignment
● Web mining: Search engines
● ...
Learning paradigms
12

● Supervised Learning
○ Classification
○ Regression
● Unsupervised Learning
Classification
13

● Example: Credit scoring (financial metric used by


money lenders)
● Differentiating between low-risk and high-risk
customers from their income and savings

Discriminant: IF income > θ1 AND savings > θ2


THEN low-risk ELSE high-risk
Classification: Applications
14

● Face recognition: Pose, lighting, occlusion (glasses, beard),


makeup, hair style
● Character recognition: Different handwriting styles.
● Speech recognition: Temporal dependency.
● Medical diagnosis: From symptoms to illnesses
● Biometrics: Recognition/authentication using physical and/or
behavioral characteristics: Face, iris, signature, etc
● Outlier/novelty detection
● Fault diagnosis
Face Recognition
15

Training examples of a person

Test images

ORL dataset,
AT&T Laboratories, Cambridge UK
Regression

● Example: Price of a used car


● x : car attributes
y : price y = wx+w0
y = g (x | θ )
g ( ) model,
θ parameters

16
Regression applications

● Predict water given to soil based on weather conditions


● Predict future resource usage of server/data center
● Predict weather conditions (temperature, humidity etc. )
● Predict mobility of public based on active covid cases
● Predict sales based on season, customer interest, quality of product etc.

17
Unsupervised Learning
18

● Clustering: Grouping similar instances


● Example applications
○ Customer segmentation in CRM
○ Grouping sensors in an organization
○ Grouping of movies based on reviews, genre etc.
Other learning paradigms (Meta learning, EXplainable AI)
19

● Transfer learning
○ Transfer of knowledge between multiple domains
● Active learning
○ Learning algorithms interactively queries an oracle to obtain desired
outputs for new data
● Online learning
○ Learning on the fly
○ zero shot learning
● Representation learning
○ Learning representation from data
○ Embeddings
● Reinforcement learning
○ Learn to act in an environment
○ Actions: rewards and penalties

You might also like