
Advanced Topics in Machine Learning

Lecture 1 – Introduction

CS/CNS/EE 253
Andreas Krause
Learning from massive data
• Many applications require gaining insights from massive, noisy data sets
• Science
  - Physics (LHC, …), Astronomy (sky surveys, …), Neuroscience (fMRI, micro-electrode arrays, …), Biology (high-throughput microarrays, …), Geology (sensor arrays, …), …
  - Social science, economics, …
• Commercial / civil applications
  - Consumer data (online advertising, viral marketing, …)
  - Health records (evidence-based medicine, …)
• Security / defense related applications
  - Spam filtering / intrusion detection
  - Surveillance, …
Web-scale machine learning
• Predict relevance of search results from click data
• Personalization
• Online advertising
• Machine translation
• Learning to index
• Spam filtering
• Fraud detection
• …
>21 billion indexed web pages
[Images: L. Brouwer, T. Riley]
Analyzing fMRI data [Mitchell et al., Science, 2008]
• Predict activation patterns for nouns
• Google's trillion-word corpus used to measure co-occurrence
Monitoring transients in astronomy [Djorgovski]
[Image panels: Novae, Cataclysmic Variables; Supernovae; Gamma-Ray Bursts; Gravitational Microlensing; Accretion to SMBHs]


Data-rich astronomy [Djorgovski]
• A typical digital sky survey now generates ~10 - 100 TB, plus a comparable amount of derived data products
• PB-scale data sets are on the horizon
• Astronomy today has ~1 - 2 PB of archived data, and generates a few TB/day
• Both data volumes and data rates grow exponentially, with a doubling time of ~1.5 years
• Even more important is the growth of data complexity
• For comparison:
  - Human memory: ~a few hundred MB
  - Human genome: <1 GB
  - 1 TB: ~2 million books
  - Library of Congress (print only): ~30 TB
How is data-rich science different? [Djorgovski]
• The information volume grows exponentially
  - Most data will never be seen by humans
  - ⇒ The need for data storage, network, and database-related technologies, standards, etc.
• Information complexity is also increasing greatly
  - Most data (and data constructs) cannot be comprehended by humans directly
  - ⇒ The need for data mining, KDD, data understanding technologies, hyperdimensional visualization, AI/machine-assisted discovery, …
• We need to create a new scientific methodology for 21st-century, computationally enabled, data-rich science
• ML and AI will be essential components of this new scientific toolkit
Data volume in scientific and industrial applications [Meiron et al.]
[Figure: data volumes across scientific and industrial applications]
How can we gain insight from massive, noisy data sets?
Key questions
• How can we deal with data sets that don't fit in the main memory of a single machine?
  ⇒ Online learning
• Labels are expensive. How can we obtain the most informative labels at minimum cost?
  ⇒ Active learning
• How can we adapt the complexity of classifiers to large data sets?
  ⇒ Nonparametric learning
Overview
• Research-oriented advanced topics course
• 3 main topics
  - Online learning (from streaming data)
  - Active learning (for gathering the most useful labels)
  - Nonparametric learning (for model selection)
• Both theory and applications
• Handouts etc. on the course webpage
  http://www.cs.caltech.edu/courses/cs253/
Overview
• Instructors:
  Andreas Krause ([email protected]) and
  Daniel Golovin ([email protected])
• Teaching assistant:
  Deb Ray ([email protected])
• Administrative assistant:
  Sheri Garcia ([email protected])
Background & Prerequisites
• Formal requirement:
  CS/CNS/EE 156a or instructor's permission
Coursework
• Grading based on
  - 3 homework assignments (one per topic) (50%)
  - Course project (40%)
  - Scribing (10%)
• 3 late days
• Discussing assignments is allowed, but everybody must turn in their own solutions
• Start early!
Course project
• "Get your hands dirty" with the course material
• Implement an algorithm from the course or from a paper you read, and apply it to some data set
• Ideas on the course website (soon)
• Applying techniques you learnt to your own research is encouraged
• Must be something new (e.g., not work done last term)
Project: Timeline and grading
• Small groups (2-3 students)
• January 20: Project proposals due (1-2 pages); feedback by instructor and TA
• February 10: Project milestone
• March ~10: Poster session (TBA)
• March 15: Project report due
• Grading based on quality of the poster (20%), milestone report (20%), and final report (60%)
• We will have a Best Project Award!
Course overview
• Online learning from massive data sets
• Active learning to gather the most informative labels
• Nonparametric learning to adapt model complexity
This lecture: quick overview of all these topics
Traditional classification task
[Figure: spam (+) and ham (-) examples separated by a linear decision boundary]
• Input: labeled data set with positive (+) and negative (-) examples
• Output: decision rule (e.g., a linear separator)
Main memory vs. disk access
• Main memory: fast, random access, expensive
• Secondary memory (hard disk): ~10^4 times slower, sequential access, inexpensive
• Massive data ⇒ sequential access
How can we learn from streaming data?
Online classification task
[Figure: spam/ham examples arriving one at a time; X marks a classification error]
• Data arrives sequentially
• Need to classify one data point at a time
• Use a different decision rule (e.g., linear separator) each time
• Can't remember all data points!
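To make the streaming constraint concrete, here is a minimal sketch of one classical online linear classifier, the perceptron, which keeps only a weight vector in memory and updates it after each mistake. It illustrates learning from a stream; it is not the specific algorithm analyzed in this course, and the toy stream below is an illustrative assumption.

```python
import numpy as np

def online_perceptron(stream, dim):
    """Process (x, y) pairs one at a time; y in {-1, +1}.

    Keeps only a weight vector in memory -- no data points are stored.
    Returns the final weights and the number of online mistakes.
    """
    w = np.zeros(dim)
    mistakes = 0
    for x, y in stream:                    # each example is seen exactly once
        y_hat = 1 if w @ x >= 0 else -1    # predict before seeing the label
        if y_hat != y:                     # mistake: move the separator
            w += y * x
            mistakes += 1
    return w, mistakes

# Toy usage: a linearly separable stream generated on the fly.
rng = np.random.default_rng(0)
def toy_stream(n=1000, dim=5):
    w_true = rng.normal(size=dim)
    for _ in range(n):
        x = rng.normal(size=dim)
        yield x, (1 if w_true @ x >= 0 else -1)

w, mistakes = online_perceptron(toy_stream(), dim=5)
print("online mistakes:", mistakes)
```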


Model: Prediction from expert advice

Losses over time (rows = experts, columns = time steps):

Experts   ℓ_1  ℓ_2  ℓ_3   …  ℓ_T
e_1        0    1    0    1    1
e_2        0    0    1    0    0
e_3        1    1    0    1    0
…
e_n        1    0    0    0    0

Total loss of the chosen experts:  ∑_t ℓ(t, i_t) → min

Expert = someone with an opinion (not necessarily someone who knows something)
Think of an expert as a decision rule (e.g., a linear separator)
Performance metric: Regret
• Best expert in hindsight: i* = argmin_i ∑_t ℓ(t, i)
• Let i_1, …, i_T be the sequence of experts selected
• Instantaneous regret at time t: r_t = ℓ(t, i_t) - ℓ(t, i*)
• Total regret: R_T = ∑_{t=1}^{T} r_t
• Typical goal: want a selection strategy that guarantees R_T / T → 0 as T → ∞
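As a concrete illustration of these definitions, the short sketch below computes the total regret from a loss matrix and a sequence of chosen experts; the names `losses` and `chosen` and the toy numbers are placeholders for this example only.

```python
import numpy as np

# losses[t, i] = loss of expert i at time t; chosen[t] = expert picked at time t
losses = np.array([[0, 1, 1],
                   [1, 0, 1],
                   [0, 0, 1],
                   [1, 0, 0]])
chosen = np.array([1, 1, 2, 0])

T = losses.shape[0]
incurred = losses[np.arange(T), chosen].sum()    # ∑_t ℓ(t, i_t)
best_expert_loss = losses.sum(axis=0).min()      # min_i ∑_t ℓ(t, i)
total_regret = incurred - best_expert_loss
print(total_regret)   # here: (1 + 0 + 1 + 1) - min(2, 1, 3) = 3 - 1 = 2
```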
Expert selection strategies
• Pick an expert (classifier) uniformly at random?
• Always pick the best expert?
Randomized weighted majority
Input:
• Learning rate η ∈ (0, 1)
Initialization:
• Associate weight w_{1,s} = 1 with every expert s
For each round t:
• Choose expert s with probability proportional to its weight: p_{t,s} = w_{t,s} / ∑_{s'} w_{t,s'}
• Obtain losses ℓ(t, s)
• Update weights: w_{t+1,s} = w_{t,s} · (1 - η)^{ℓ(t,s)}
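A minimal sketch of how this update can be implemented, assuming the classical multiplicative (1 - η)^loss rule written above (other variants, e.g. exp(-η·loss), are also common; the toy loss matrix is an illustrative assumption):

```python
import numpy as np

def randomized_weighted_majority(loss_matrix, eta=0.1, seed=0):
    """Randomized Weighted Majority over a T x n matrix of expert losses in [0, 1].

    Each round an expert is drawn with probability proportional to its weight,
    its loss is incurred, and all weights are scaled down multiplicatively.
    Returns the total loss incurred and the regret against the best fixed expert.
    """
    rng = np.random.default_rng(seed)
    T, n = loss_matrix.shape
    w = np.ones(n)                            # w_{1,s} = 1 for every expert s
    incurred = 0.0
    for t in range(T):
        p = w / w.sum()                       # p_{t,s} proportional to w_{t,s}
        s = rng.choice(n, p=p)                # randomized choice of expert
        incurred += loss_matrix[t, s]
        w *= (1 - eta) ** loss_matrix[t]      # w_{t+1,s} = w_{t,s} (1 - eta)^loss
    best_fixed = loss_matrix.sum(axis=0).min()
    return incurred, incurred - best_fixed

# Toy run: 1000 rounds, 5 experts with random 0/1 losses.
rng = np.random.default_rng(1)
losses = rng.integers(0, 2, size=(1000, 5)).astype(float)
total_loss, regret = randomized_weighted_majority(losses)
print(f"total loss {total_loss:.0f}, regret {regret:.0f}")
```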
Guarantees for RWM
Theorem: For an appropriately chosen learning rate, Randomized Weighted Majority obtains sublinear regret: E[R_T] = O(√(T log n)), where n is the number of experts.

Note: no assumption is made about how the loss vectors ℓ are generated!
Practical problems
• In many applications, the number of experts (classifiers) is infinite
  ⇒ Online optimization (e.g., online convex programming)
• Often, only partial feedback is available (e.g., we obtain the loss only for the chosen classifier)
  ⇒ Multi-armed bandits, sequential experimental design
• Many practical problems are high-dimensional
  ⇒ Dimension reduction, sketching
Course overview
• Online learning from massive data sets
• Active learning to gather the most informative labels
• Nonparametric learning to adapt model complexity
This lecture: quick overview of all these topics
Spam or Ham?
[Figure: a few labeled spam (+) and ham (-) examples among many unlabeled examples (o)]
• Labels are expensive (we need to ask an expert)
• Which labels should we obtain to maximize classification accuracy?
Learning binary thresholds
• Input domain: D = [0, 1]
• True concept c:
  c(x) = +1 if x ≥ t
  c(x) = -1 if x < t
  [Figure: points on [0, 1], labeled - to the left of threshold t and + to the right]
• Samples x_1, …, x_n ∈ D drawn uniformly at random
Passive learning
• Input domain: D = [0, 1]
• True concept c:
  c(x) = +1 if x ≥ t
  c(x) = -1 if x < t
• Passive learning: acquire all labels y_i ∈ {+, -}
Active learning
• Input domain: D = [0, 1]
• True concept c:
  c(x) = +1 if x ≥ t
  c(x) = -1 if x < t
• Passive learning: acquire all labels y_i ∈ {+, -}
• Active learning: decide which labels to obtain
Classification error
• After obtaining n labels, D_n = {(x_1, y_1), …, (x_n, y_n)}, the learner outputs a hypothesis h consistent with the labels in D_n
  [Figure: threshold hypothesis on [0, 1]]
• Classification error: R(h) = E_{x~P}[h(x) ≠ c(x)]
Statistical active learning protocol
• Data source P (produces inputs x_i)
• Active learner assembles data set D_n = {(x_1, y_1), …, (x_n, y_n)} by selectively obtaining labels
• Learner outputs hypothesis h
• Classification error: R(h) = E_{x~P}[h(x) ≠ c(x)]
How many labels do we need to ensure that R(h) ≤ ε?
Label complexity for passive learning
[To reach classification error ε on the threshold problem, passive learning needs on the order of 1/ε labels; see the comparison below.]

Label complexity for active learning
[By binary searching for the threshold, active learning needs only on the order of log(1/ε) labels; see the comparison below.]
Comparison

                    Labels needed to learn with classification error ε
Passive learning    Ω(1/ε)
Active learning     O(log 1/ε)

Active learning can exponentially reduce the number of required labels! (See the sketch below.)
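To illustrate where the exponential gap comes from in the 1D threshold example, here is a minimal sketch of an active learner that binary-searches for the threshold over a pool of unlabeled points, querying only about log(1/ε) labels. The pool-based setup and function names are assumptions made for this illustration, not the course's formal protocol.

```python
import numpy as np

def active_threshold_learner(xs, label_oracle, eps):
    """Estimate a 1D threshold by binary search over a sorted pool of inputs.

    xs: unlabeled inputs from [0, 1]; label_oracle(x) returns c(x) in {-1, +1}.
    Assumes the pool contains points on both sides of the true threshold.
    """
    xs = np.sort(xs)
    lo, hi = 0, len(xs) - 1              # bracket that contains the threshold
    labels_used = 0
    while xs[hi] - xs[lo] > eps and hi - lo > 1:
        mid = (lo + hi) // 2
        labels_used += 1
        if label_oracle(xs[mid]) == 1:   # mid is already to the right of t
            hi = mid
        else:                            # mid is still to the left of t
            lo = mid
    t_hat = (xs[lo] + xs[hi]) / 2
    return t_hat, labels_used

# Toy usage with a hidden threshold t = 0.37.
rng = np.random.default_rng(0)
t_true = 0.37
oracle = lambda x: 1 if x >= t_true else -1
xs = rng.uniform(0, 1, size=10_000)
t_hat, n_labels = active_threshold_learner(xs, oracle, eps=0.001)
print(f"estimate {t_hat:.4f} using only {n_labels} labels")
```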
Key questions
• For which classification tasks can we provably reduce the number of labels?
• Can we do worse by active learning?
• Can we implement active learning efficiently?
Course overview
• Online learning from massive data sets
• Active learning to gather the most informative labels
• Nonparametric learning to adapt model complexity
This lecture: quick overview of all these topics
Nonlinear classification
[Figure: a small set of + and - examples requiring a nonlinear decision boundary]
• How should we adapt the classifier complexity to the growing data set size?
Nonlinear classification
[Figure: the same task with more data points]
• How should we adapt the classifier complexity to the growing data set size?
Nonlinear classification
[Figure: the same task with even more data points]
• How should we adapt the classifier complexity to the growing data set size?
Linear classification

Linear classification: learn a weight vector w by minimizing

  min_w  ∑_t loss(y_t, w·x_t) + λ ||w||²

where the first term is the loss function (e.g., the hinge loss max(0, 1 - y_t w·x_t)) and the second term is the complexity penalty.
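A minimal sketch of this trade-off in code, assuming the standard L2-regularized hinge-loss objective written above; the Pegasos-style step sizes and the toy data are illustrative choices, not taken from the slides.

```python
import numpy as np

def train_linear_svm(X, y, lam=0.01, epochs=20, seed=0):
    """Minimize (1/n) * sum_t max(0, 1 - y_t w.x_t) + lam/2 * ||w||^2
    with stochastic subgradient descent (Pegasos-style step sizes)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    t = 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            t += 1
            eta = 1.0 / (lam * t)          # decaying step size
            margin = y[i] * (w @ X[i])
            grad = lam * w                 # gradient of the penalty term
            if margin < 1:                 # hinge loss is active
                grad -= y[i] * X[i]
            w -= eta * grad
    return w

# Toy usage on a linearly separable 2D problem.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 2))
y = np.where(X[:, 0] + X[:, 1] >= 0, 1, -1)
w = train_linear_svm(X, y)
print(f"training accuracy: {np.mean(np.sign(X @ w) == y):.2f}")
```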
From linear to nonlinear classification

Linear classification:     min_w  ∑_t loss(y_t, w·x_t) + λ ||w||²

Nonlinear classification:  min_f  ∑_t loss(y_t, f(x_t)) + λ ||f||²

What is an appropriate complexity penalty ||f|| for a function f??
1D Example
[Figure: nonlinear classification of 1D data; a smooth function f takes values near +1 at the + examples and near -1 at the - examples]
Representation of function f

The solution of  min_f ∑_t loss(y_t, f(x_t)) + λ ||f||²

can be written as  f(x) = ∑_t α_t k(x_t, x)

for an appropriate choice of ||f|| (Representer Theorem).

Here, k( · , · ) is called a kernel function (associated with || · ||).
Examples of kernels
[Figure panels: squared exponential kernel, exponential kernel, finite-dimensional kernel]
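For concreteness, below are common textbook forms of these kernels. The exact parameterizations on the slide are not shown, so the length-scale h and the specific formulas are assumptions for this sketch.

```python
import numpy as np

def squared_exponential_kernel(x, y, h=1.0):
    """k(x, y) = exp(-||x - y||^2 / h^2): very smooth sample functions."""
    return float(np.exp(-np.sum((x - y) ** 2) / h ** 2))

def exponential_kernel(x, y, h=1.0):
    """k(x, y) = exp(-||x - y|| / h): rougher, non-differentiable sample functions."""
    return float(np.exp(-np.linalg.norm(x - y) / h))

def linear_kernel(x, y):
    """k(x, y) = x.y: recovers ordinary finite-dimensional linear classification."""
    return float(np.dot(x, y))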
Nonparametric solution

The solution of  min_f ∑_t loss(y_t, f(x_t)) + λ ||f||²

can be written as  f(x) = ∑_t α_t k(x_t, x)

• The function f has one parameter α_t for each data point x_t!
• No finite-dimensional representation ⇒ "nonparametric"
• Large data set ⇒ huge number of parameters!!
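A minimal sketch of such a kernel expansion, fitting one coefficient α_t per training point. Kernel ridge regression on the ±1 labels is used here only as a simple way to obtain the α_t; it is an assumption for this illustration, not necessarily how the course fits them.

```python
import numpy as np

def fit_kernel_predictor(X, y, kernel, lam=0.1):
    """Fit f(x) = sum_t alpha_t k(x_t, x) by kernel ridge regression:
    alpha = (K + lam * I)^{-1} y, so there is one alpha_t per training point."""
    n = len(X)
    K = np.array([[kernel(X[i], X[j]) for j in range(n)] for i in range(n)])
    alpha = np.linalg.solve(K + lam * np.eye(n), y)
    def f(x):
        return sum(a * kernel(xt, x) for a, xt in zip(alpha, X))
    return f

# Toy 1D example, as in the slide: -1 labels left of a threshold, +1 to the right.
X = np.linspace(0, 1, 20).reshape(-1, 1)
y = np.where(X[:, 0] >= 0.4, 1.0, -1.0)
sq_exp = lambda a, b: float(np.exp(-np.sum((a - b) ** 2) / 0.01))
f = fit_kernel_predictor(X, y, sq_exp)
print(np.sign(f(np.array([0.3]))), np.sign(f(np.array([0.7]))))  # -1.0 1.0
```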
Key questions
• How can we determine the right tradeoff between function expressiveness (number of parameters) and computational complexity?
• How can we control model complexity in an online fashion?
• How can we quantify uncertainty in nonparametric learning?
Course overview
[Diagram: the three course topics and their connections: Online Learning, Active Learning, and Nonparametric Learning, linked via bandit optimization, response surface methods, and active set selection]
