COMP2050 - Lecture 22: Machine Learning

Machine Learning

COMP2050 - Artificial Intelligence


KHOA D. DOAN
[email protected]

Book Office Hour | Course Website


Slides adapted from/based on UC Berkeley CS 188, 2022
What is Learning?
● Learning is the process of acquiring some expertise from experience

What breed is it?

2
Why Machine Learning?

[Figures: learning estimation; learning structure. Caption: "Where does it come from?"]

3
Types of Learning
● Supervised Learning: correct answers for each training instance

[Example plots: Sale Price vs. Square Meters; Gene Y's Expression vs. Gene X's Expression]

4
Types of Learning
● Supervised Learning: correct answers for each training instance
● Unsupervised Learning: find interesting patterns in data

5
Types of Learning
● Supervised Learning: correct answers for each training instance
● Unsupervised Learning: find interesting patterns in data
● Reinforcement learning: reward sequence, no correct answers

6
What is Learning?
● Learning is the process of acquiring some expertise from experience

What breed is it?

● Most central problem?

7
What is Learning?
● Learning is the process of acquiring some expertise from experience

What breed is it?

● Most central problem: generalization


○ How to abstract from “training” examples to “test” examples.
○ Analogy with human learning?

8
Training and Testing

9
Example: Spam Filter
● Input: an email
● Output: spam/ham
● Setup:
○ Get a large collection of example emails, each labeled “spam” or “ham”
○ Note: someone has to hand-label all this data!
○ Want to learn to predict labels of new, future emails
● Features: the attributes used to make the ham / spam decision (see the sketch below)
○ Words: FREE!
○ Text Patterns: $dd, CAPS
○ Non-text: SenderInContacts, WidelyBroadcast
○ …
10
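A minimal sketch (not from the slides; the function and feature names are hypothetical) of turning one email into features of these kinds:

```python
import re

def extract_features(email_text, sender, contacts):
    """Sketch of the feature types above: word presence, simple text patterns, non-text signals."""
    words = set(re.findall(r"[A-Za-z']+", email_text.lower()))
    return {
        "contains_free": "free" in words,                                    # word feature
        "has_dollar_amount": bool(re.search(r"\$\d\d", email_text)),         # text pattern: $dd
        "has_all_caps_word": bool(re.search(r"\b[A-Z]{3,}\b", email_text)),  # text pattern: CAPS
        "sender_in_contacts": sender in contacts,                            # non-text feature
    }

print(extract_features("FREE offer, only $19!", "[email protected]", {"[email protected]"}))
```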
Model-Based Classification
● Model-based approach
○ Build a model (e.g. a Bayes’ net) where both the label and features are random variables
○ Instantiate any observed features
○ Query for the distribution of the label conditioned on the features

● Challenges
○ What structure should the BN have?
○ How should we learn its parameters?

11
Naïve Bayes for Text
● Bag-of-words Naïve Bayes:
○ Features: Wi is the word at position i
○ As before: predict label conditioned on feature variables (spam vs. ham)
○ As before: assume features are conditionally independent given label
● Generative model: P(Y, W1, …, Wn) = P(Y) ∏i P(Wi | Y)

12
Naïve Bayes for Text
● Bag-of-words Naïve Bayes:
○ Features: Wi is the word at position i
○ As before: predict label conditioned on feature variables (spam vs. ham)
○ As before: assume features are conditionally independent given label
● Generative model: P(Y, W1, …, Wn) = P(Y) ∏i P(Wi | Y)

● Prediction: y* = argmaxy P(Y = y) ∏i P(Wi = wi | Y = y) (see the sketch below)

13
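A minimal sketch of this prediction rule under hypothetical probability tables; the numbers and the tiny probability floor for unseen words are made up for illustration (the lecture's actual fix is the smoothing discussed later):

```python
import math

# Hypothetical parameters: P(Y) and P(W | Y) over a tiny vocabulary.
prior = {"spam": 0.5, "ham": 0.5}
word_given_label = {
    "spam": {"free": 0.05, "money": 0.04, "meeting": 0.001},
    "ham":  {"free": 0.005, "money": 0.003, "meeting": 0.03},
}

def predict(words):
    """Return argmax_y P(y) * prod_i P(w_i | y), computed in log space for numerical stability."""
    scores = {}
    for y in prior:
        score = math.log(prior[y])
        for w in words:
            score += math.log(word_given_label[y].get(w, 1e-6))  # tiny floor for unseen words
        scores[y] = score
    return max(scores, key=scores.get)

print(predict(["free", "money"]))  # "spam" under these made-up numbers
print(predict(["meeting"]))        # "ham"
```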
Naïve Bayes for Text: Parameters
● Model

● What are the parameters?

14
Naïve Bayes for Text: Parameters
● Model: P(Y, W1, …, Wn) = P(Y) ∏i P(Wi | Y)

● What are the parameters? The probability tables P(Y) and P(Wi | Y): a prior over labels and, for each label, a distribution over dictionary words

15
Parameter Estimation

16
Parameter Estimation with Maximum Likelihood
● Estimating the distribution of a random variable
● Empirically: use training data (learning!)
○ E.g.: for each outcome x, look at the empirical rate of that value: P_ML(x) = count(x) / N, where N is the total number of samples (see the example below)

○ This is the estimate that maximizes the likelihood of the data

17
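A tiny illustration of this counting estimate, with made-up samples:

```python
from collections import Counter

# Hypothetical training observations of a discrete random variable.
samples = ["red", "red", "blue", "red", "blue"]

counts = Counter(samples)
total = len(samples)
p_ml = {x: c / total for x, c in counts.items()}  # P_ML(x) = count(x) / N
print(p_ml)  # {'red': 0.6, 'blue': 0.4}
```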
General Case: n outcomes
● P(Heads) = q, P(Tails) = 1-q

● Flips are i.i.d.:


○ Independent events
○ Identically distributed according to unknown distribution
○ Sequence D of 𝛂H Heads and 𝛂T Tails

18
Parameter Estimation with Maximum Likelihood
● Data: Observed set D of 𝛂H Heads and 𝛂T Tails
● Hypothesis space: Binomial distributions
● Learning: finding q is an optimization problem
○ What’s the objective function?

● MLE: Choose q to maximize probability of D

19
Parameter Estimation with Maximum Likelihood

● Set derivative to zero, and solve!

20
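For reference, the standard coin-flip calculation this slide sets up:

P(D | q) = q^𝛂H (1 - q)^𝛂T
log P(D | q) = 𝛂H log q + 𝛂T log(1 - q)
d/dq log P(D | q) = 𝛂H/q - 𝛂T/(1 - q) = 0
⇒ qMLE = 𝛂H / (𝛂H + 𝛂T)

E.g., 𝛂H = 8 heads and 𝛂T = 2 tails give qMLE = 8/10 = 0.8.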
Maximum Likelihood for Naïve Bayes Spam Classifier
● Model:
○ Random variable Fi = 1 if i’th dictionary word is present in email
○ Random variable Y is in {spam, ham} depending on email label
● Data D:
○ N emails with NH “hams” and NS “spams”
○ fi(j) = 1 if i’th word appeared in email j
● Parameters:
○ Probability tables P(Y) and P(Fi | Y)
○ Collectively call them both θ
● MLE: Choose θ to maximize probability of D

21
Maximum Likelihood for Naïve Bayes Spam Classifier*
● Let’s find a single parameter P(Fi | Y = ham) (this will be our θ):
○ Denote L(θ) = P(D | θ) for ease of notation

22
Maximum Likelihood for Naïve Bayes Spam Classifier*

23
Maximum Likelihood for Naïve Bayes Spam Classifier*

P(Fi | Y = ham) = (number of ham emails containing word i) / NH (see the sketch below)
24
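A minimal counting sketch of this result; the toy data and helper name are made up:

```python
# Each email is (set_of_words_present, label).
emails = [
    ({"meeting", "tomorrow"}, "ham"),
    ({"free", "money"}, "spam"),
    ({"meeting", "free"}, "ham"),
]

def mle_word_given_label(word, label, data):
    """P_MLE(F_word = 1 | Y = label) = (# emails of that label containing word) / (# emails of that label)."""
    labeled = [words for words, y in data if y == label]
    return sum(word in words for words in labeled) / len(labeled)

print(mle_word_given_label("meeting", "ham", emails))  # 2/2 = 1.0
print(mle_word_given_label("free", "ham", emails))     # 1/2 = 0.5
```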
Parameter Estimation with Maximum Likelihood
● How do we estimate the conditional probability tables?
○ Maximum Likelihood, which corresponds to counting
● Need to be careful, though… let’s see what can go wrong.

25
Underfitting and Overfitting

26
Example: Overfitting
P(features, C=spam)                          P(features, C=ham)

P(C=spam) = 0.5                              P(C!=spam) = 0.5
P(“we’ve” | C=spam) = 0.1                    P(“we’ve” | C!=spam) = 0.8
P(“updated” | C=spam) = 0.2                  P(“updated” | C!=spam) = 0.7
…
P(“Google” | C=spam) = 0.3                   P(“Google” | C!=spam) = 0.0

Email: “We’ve updated our login credential policy. Please confirm your account by logging into Google Docs.”

What went wrong?
27
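A quick numeric check, with the probabilities above, of why the single zero is fatal (the helper is just for illustration):

```python
# Conditional probabilities from the slide for three of the words.
spam_probs = {"we've": 0.1, "updated": 0.2, "google": 0.3}
ham_probs = {"we've": 0.8, "updated": 0.7, "google": 0.0}

def joint_score(word_probs, prior=0.5):
    """Naive Bayes joint score: prior times the product of the word likelihoods."""
    score = prior
    for p in word_probs.values():
        score *= p
    return score

print(joint_score(spam_probs))  # 0.003 -> spam "wins"
print(joint_score(ham_probs))   # 0.0   -> one never-seen word zeroes out the ham score
```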
Generalization and Overfitting
● Problems with relative-frequency parameters
○ Unlikely to see occurrences of every word in training data.
○ Likely to see occurrences of a word for only one class in training data.

● What exactly is learning?

● Learning is to generalize
○ Want a classifier which does well on test data
○ Overfitting: fitting the training data very closely, but not doing well on test data
○ Underfitting: fits the training set poorly

28
Smoothing

29
Laplace Smoothing
● Laplace’s estimate:
○ Pretend you saw every outcome once more than you actually did: P_LAP(x) = (count(x) + 1) / (N + |X|)

○ Can derive this estimate with Dirichlet priors

30
Laplace Smoothing
● Laplace’s estimate (extended):
○ Pretend you saw every outcome k extra times: P_LAP,k(x) = (count(x) + k) / (N + k|X|)

○ What’s Laplace with k = 0? Just the maximum-likelihood (relative-frequency) estimate

○ k is the strength of the prior

● Laplace for conditionals:
○ Smooth each condition independently: P_LAP,k(x | y) = (count(x, y) + k) / (count(y) + k|X|) (see the sketch below)
31
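A minimal sketch of the add-k estimate, with hypothetical counts:

```python
def laplace_conditional(count_xy, count_y, num_outcomes, k=1):
    """P_LAP,k(x | y) = (count(x, y) + k) / (count(y) + k * |X|)."""
    return (count_xy + k) / (count_y + k * num_outcomes)

# A word never seen in ham emails no longer gets probability exactly zero:
print(laplace_conditional(count_xy=0, count_y=50, num_outcomes=2, k=1))   # 1/52 ≈ 0.019
# k = 0 recovers the plain maximum-likelihood (relative-frequency) estimate:
print(laplace_conditional(count_xy=10, count_y=50, num_outcomes=2, k=0))  # 0.2
```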
Course Conclusion

32
Applications of Deep Reinforcement Learning: Go

33
Applications of Deep Reinforcement Learning: Go
Just Minimax Search?

34
Exhaustive Search?

35
Reducing depth with value network

36
Value network

37
Reducing breadth with policy network

38
Policy network

39
AlphaGo: neural network training pipeline

40
Robotics

41
AI Ethics Ever More Important
● Why?

42
AI Ethics Ever More Important
● Why?
○ AI is making decisions, at scale
○ Any kind of issue (e.g. bias or malicious use) could significantly affect people
● Many open questions:
○ Who is responsible?
○ How to diagnose and prevent?

43
Some Key AI Ethics Topics
● Disinformation
● Bias and fairness
● Privacy and surveillance
● Metrics
● Algorithmic colonialism

44
What will be AI’s impact in the future?
● You get to determine that!
● As you apply AI
● As researchers / developers
● As auditors and regulators
● As informed public voices

45
Where to Go Next?
● Machine Learning: COMP3020
● Data Mining: COMP4040
● Several online resources
○ The Batch: https://www.deeplearning.ai/thebatch/
○ Import AI: https://jack-clark.net/
○ AI Ethics course: ethics.fast.ai
○ The Robot Brains Podcast: https://therobotbrains.ai
○ Computer Vision, NLP, Optimization, Reinforcement Learning, Neural Science, Cognitive Modeling…
● UROP Projects

46
THANK YOU!

Good luck on the exam/projects and have a nice summer!


See you around!

47
