COMP323 - Topic C - Introduction To Machine Learning
Why “Learn”?
Machine learning is programming computers to optimize a
performance criterion using example data or past experience.
There is no need to “learn” to calculate payroll
Learning is used when:
Human expertise does not exist (navigating on Mars),
Humans are unable to explain their expertise (speech recognition)
Solution changes in time (routing on a computer network)
Solution needs to be adapted to particular cases (user biometrics)
What We Talk About When We Talk About “Learning”
Learning general models from data of particular examples
Data is cheap and abundant (data warehouses, data marts);
knowledge is expensive and scarce.
Example in retail: Customer transactions to consumer
behavior:
People who bought “Da Vinci Code” also bought “The Five People
You Meet in Heaven” (www.amazon.com)
Build a model that is a good and useful approximation to the
data.
Data Mining/KDD
Definition: “KDD is the non-trivial process of
identifying valid, novel, potentially useful, and
ultimately understandable patterns in data” (Fayyad)
Applications:
Retail: Market basket analysis, Customer relationship
management (CRM)
Finance: Credit scoring, fraud detection
Manufacturing: Optimization, troubleshooting
Medicine: Medical diagnosis
Telecommunications: Quality of service optimization
Bioinformatics: Motifs, alignment
Web mining: Search engines
...
What is Machine Learning?
Machine Learning
Study of algorithms that
improve their performance
at some task
with experience
Optimize a performance criterion using example data or past
experience.
Role of Statistics: Inference from a sample
Role of Computer science: Efficient algorithms to
solve the optimization problem
represent and evaluate the model for inference
Growth of Machine Learning
Machine learning is the preferred approach to
Speech recognition, Natural language processing
Computer vision
Medical outcomes analysis
Robot control
Computational biology
This trend is accelerating
Improved machine learning algorithms
Improved data capture, networking, faster computers
Software too complex to write by hand
New sensors / IO devices
Demand for self-customization to user, environment
It turns out to be difficult to extract knowledge from human experts; this
contributed to the failure of expert systems in the 1980s.
Alpydin & Ch. Eick: ML Topic1
Applications
Association Analysis
Supervised Learning
Classification
Regression/Prediction
Unsupervised Learning
Reinforcement Learning
Learning Associations
Basket analysis:
P(Y | X): the probability that a customer who buys X also buys Y,
where X and Y are products/services.
Market-Basket transactions
TID Items
1 Bread, Milk
2 Bread, Diaper, Beer, Eggs
3 Milk, Diaper, Beer, Coke
4 Bread, Milk, Diaper, Beer
5 Bread, Milk, Diaper, Coke
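The conditional probability above can be estimated directly from such a transaction table. A minimal sketch in Python, using the five baskets listed (the function and variable names are illustrative, not from any particular library):

```python
# Estimate P(Y | X): of the baskets containing X, what fraction also contain Y?
# Transactions copied from the table above (TIDs 1-5).
transactions = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]

def confidence(x, y, baskets):
    """Estimate P(y in basket | x in basket) from the data."""
    with_x = [b for b in baskets if x in b]
    if not with_x:
        return 0.0
    return sum(y in b for b in with_x) / len(with_x)

# 3 of the 4 baskets containing Diaper also contain Beer -> 0.75
print(confidence("Diaper", "Beer", transactions))
```

In association-rule terms this estimate is the *confidence* of the rule X → Y.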
Classification
Example: Credit
scoring
Differentiating
between low-risk and
high-risk customers
from their income and
savings
[Figure: a model (discriminant) separating low-risk from high-risk customers in the income-savings plane]
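A classifier of this kind can be as simple as a learned rule over the two attributes. A minimal sketch (the threshold values below are invented for illustration, not learned from real data):

```python
# Illustrative discriminant for the credit-scoring example: a rule over
# income and savings. The thresholds are hypothetical placeholders.
THETA_INCOME = 30_000   # hypothetical income threshold
THETA_SAVINGS = 10_000  # hypothetical savings threshold

def credit_risk(income, savings):
    """Classify a customer as 'low-risk' or 'high-risk'."""
    if income > THETA_INCOME and savings > THETA_SAVINGS:
        return "low-risk"
    return "high-risk"

print(credit_risk(45_000, 20_000))  # low-risk
print(credit_risk(25_000, 5_000))   # high-risk
```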
Classification: Applications
Also known as pattern recognition
Face recognition: Pose, lighting, occlusion (glasses, beard),
make-up, hair style
Character recognition: Different handwriting styles.
Speech recognition: Temporal dependency.
Use of a dictionary or the syntax of the language.
Sensor fusion: Combine multiple modalities, e.g., visual (lip image) and
acoustic signals for speech
Medical diagnosis: From symptoms to illnesses
Web advertising: Predict whether a user will click on an ad on the
Internet.
Face Recognition
Training examples of a person
Test images
Prediction: Regression
Example: Price of a used car
x: car attributes
y: price
Linear model: y = w x + w0
General model: y = g(x | θ), where g(·) is the model and θ its parameters
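The parameters θ = (w, w0) of the linear model can be fit by least squares. A minimal sketch with made-up age/price data for the used-car example (real car attributes would be multivariate):

```python
# Least-squares fit of the linear model y = w*x + w0.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]   # e.g. car age in years (invented data)
ys = [9.0, 7.1, 5.2, 2.9, 1.1]   # price in some currency unit (invented)

n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n
# Closed-form least-squares estimates for slope w and intercept w0.
w = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / \
    sum((x - mean_x) ** 2 for x in xs)
w0 = mean_y - w * mean_x

def g(x):
    """The fitted model g(x | theta), theta = (w, w0)."""
    return w * x + w0

print(round(w, 2), round(w0, 2))  # slope is negative: price drops with age
```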
Supervised Learning: Uses
Example: decision trees are tools that create rules
Prediction of future cases: Use the rule to predict the output
for future inputs
Knowledge extraction: The rule is easy to understand
Compression: The rule is simpler than the data it explains
Outlier detection: Exceptions that are not covered by the rule,
e.g., fraud
Unsupervised Learning
Learning “what normally happens”
No output
Clustering: Grouping similar instances
Other applications: Summarization, Association Analysis
Example applications
Customer segmentation in CRM
Image compression: Color quantization
Bioinformatics: Learning motifs
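Clustering such as K-means (used, e.g., for color quantization) alternates an assignment step and an update step. A minimal 1-D sketch (the data points and starting centers are invented):

```python
# Minimal 1-D k-means: alternate assignment and update until done.
def kmeans_1d(points, centers, iters=20):
    for _ in range(iters):
        # Assignment step: each point joins its nearest center's cluster.
        clusters = [[] for _ in centers]
        for p in points:
            i = min(range(len(centers)), key=lambda j: abs(p - centers[j]))
            clusters[i].append(p)
        # Update step: each center moves to the mean of its cluster
        # (an empty cluster keeps its old center).
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers

data = [1.0, 1.2, 0.8, 9.0, 9.5, 8.5]
print(kmeans_1d(data, centers=[0.0, 10.0]))  # centers converge near 1.0 and 9.0
```

With color quantization, the "points" would be pixel colors and the k centers the reduced palette.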
Reinforcement Learning
Topics:
Policies: what actions should an agent take in a particular situation
Utility estimation: how good is a state (used by policy)
No supervised output but delayed reward
Credit assignment problem (what was responsible for the
outcome)
Applications:
Game playing
Robot in a maze
Multiple agents, partial observability, ...
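Delayed reward and credit assignment can be illustrated with tabular Q-learning on a tiny corridor world: the reward arrives only at the final state, and the update rule propagates value backwards to earlier states. A sketch (the world, rewards, and hyperparameters are invented for illustration):

```python
import random

# Tiny corridor: states 0..3, reward only on reaching state 3 (delayed
# reward); actions: 0 = left, 1 = right.
N, GOAL = 4, 3
ALPHA, GAMMA, EPS = 0.5, 0.9, 0.2
Q = [[0.0, 0.0] for _ in range(N)]  # Q[state][action]

def step(s, a):
    s2 = max(0, min(N - 1, s + (1 if a == 1 else -1)))
    return s2, (1.0 if s2 == GOAL else 0.0)

random.seed(0)
for _ in range(500):  # episodes
    s = 0
    while s != GOAL:
        # Epsilon-greedy action selection: mostly greedy, sometimes random.
        a = random.randrange(2) if random.random() < EPS else Q[s].index(max(Q[s]))
        s2, r = step(s, a)
        # Credit assignment: bootstrap from the next state's estimated value.
        Q[s][a] += ALPHA * (r + GAMMA * max(Q[s2]) - Q[s][a])
        s = s2

policy = [Q[s].index(max(Q[s])) for s in range(N - 1)]
print(policy)  # the learned policy should move right in every non-goal state
```

The learned Q-values play the role of the utility estimates above, and the greedy policy is read off from them.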
Resources: Datasets
UCI Repository: http://www.ics.uci.edu/~mlearn/MLRepository.html
UCI KDD Archive:
http://kdd.ics.uci.edu/summary.data.application.html
Statlib: http://lib.stat.cmu.edu/
Delve: http://www.cs.utoronto.ca/~delve/
Resources: Journals
Journal of Machine Learning Research www.jmlr.org
Machine Learning
IEEE Transactions on Neural Networks
IEEE Transactions on Pattern Analysis and Machine
Intelligence
Annals of Statistics
Journal of the American Statistical Association
...
Resources: Conferences
International Conference on Machine Learning (ICML)
European Conference on Machine Learning (ECML)
Neural Information Processing Systems (NIPS)
Conference on Computational Learning Theory (COLT)
International Joint Conference on Artificial Intelligence (IJCAI)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD)
IEEE Int. Conf. on Data Mining (ICDM)
Summary COSC 6342
Introductory course that covers a wide range of machine learning
techniques—from basic to state-of-the-art.
More theoretical/statistics-oriented than the other courses I teach; it may
require continuous work not “to get lost”.
You will learn about the methods you have heard about: Naïve Bayes, belief
networks, regression, nearest-neighbor (kNN), decision trees, support vector
machines, learning ensembles, over-fitting, regularization, dimensionality reduction
& PCA, error bounds, parameter estimation, mixture models, comparing models,
density estimation, clustering (centering on K-means, EM, and DBSCAN), and active
and reinforcement learning.
Covers algorithms, theory and applications
It’s going to be fun and hard work
Which Topics Deserve More Coverage
—if we had more time?
Graphical Models/Belief Networks (just ran out of time)
More on Adaptive Systems
Learning Theory
More on Clustering and Association Analysis (covered by the Data
Mining course)
More on Feature Selection, Feature Creation
More on Prediction
Possibly: More depth coverage of optimization techniques, neural
networks, hidden Markov models, how to conduct a machine
learning experiment, comparing machine learning algorithms,…