Helsenki - Intro To ML
Helsenki - Intro To ML
1,
Introduction
2,
What is machine learning?
I Definition:
machine = computer, computer program (in this course)
learning = improving performance on a given task, based
on experience / examples
I In other words
I instead of the programmer writing explicit rules for how to
solve a given problem, the programmer instructs the computer
how to learn from examples
I in many cases the computer program can even become better
at the task than the programmer is!
3,
Example 1: tic-tac-toe
I How to program the computer to play tic-tac-toe?
5,
Example 2: spam filter
From: [email protected]
Subject: viagra spam
cheap meds...
From: [email protected]
Subject: important information
non-spam
here’s how to ace the test...
.. ..
. .
From: [email protected]
Subject: you need to see this ?
how to win $1,000,000...
6, 6,
Example 3: face recognition
I Programmer writes rules: “If short dark hair, big nose, then it
is Mikko” (impossible! how do we judge the size of the nose?!)
...
7,
Problem setup
I So we need to separate
I Task: What is the problem that the program is solving?
8,
Related scientific disciplines (1)
I Pattern recognition
I Recognizing objects and identifying people in controlled or
uncontrolled settings, from images, audio, etc. Such tasks
typically require machine learning techniques.
9,
Availability of data
11 ,
Related scientific disciplines (2)
I Data mining
I Trying to identify interesting and useful associations and
patterns in huge datasets
I Focus on scalable algorithms
I Example: shopping basket analysis
I Statistics
I historically, introductory courses on statistics tend to focus on
hypothesis testing and some other basic problems
I however there’s a lot more to statistics than hypothesis testing
I there is a lot of interaction between research in machine
learning, data mining and statistics
12 ,
Example 4
13 ,
Example 5
14 ,
Example 6
15 ,
Example 7
I Self-driving cars:
I Sensors (radars,
cameras) superior to
humans
I How to make the
computer react
appropriately to the
sensor data?
16 ,
Example 8
I Character recognition:
I Automatically sorting
mail (handwritten
characters)
I Digitizing old books
and newspapers into
easily searchable
format (printed
characters)
17 ,
Example 9
I Recommendation systems
(‘collaborative filtering’):
I Amazon: ”Customers
who bought X also
bought Y ”...
I Netflix: ”Based on
tar
ens
your movie ratings, you
en
go
Ava
Leo
Ali
Sev
Far
might enjoy...”
Challenge: One million Linda 4 5 5 1 2
dollars ($1,000,000) 3 4 3
prize money recently Jack 1 4 1 5 1
awarded! Bill ? 4 1 ?
Lucy 2 1 1 5
John 1 1 4 5
4 5 5
2 3 3
18 ,
Example 10
I Machine translation:
I Traditional approach: Dictionary and explicit grammar
I More recently, statistical machine translation based on
example data is increasingly being used
19 ,
Example 11
20 ,
Example 12
21 ,
Example 13
22 ,
Example 14
23 ,
What about privacy?
24 ,
Course outline
I Introduction
I Supervised learning
I classification
I regression
I evaluation and model selection
I Unsupervised learning
I clustering
I matrix decompositions
25 ,
Related courses
26 ,
Practical details (1)
I Lectures:
I November 1st (today) – December 16th
I Tuesdays and Fridays at 10:15–12:00 in Exactum CK112
I Lecturer: Teemu Roos
(Exactum A322, [email protected])
I Language: English
I Based on parts of the course textbook (next slide)
I (previous instances of this course have used different
textbooks)
27 ,
Practical details (2)
I Textbook:
I authors: Gareth James, Daniela Witten,
Trevor Hastie and Robert Tibshirani
I title: An Introduction to Statistical
Learning – with Applications in R
I publisher: Springer (2013, first edition)
I web page:
www-bcf.usc.edu/~gareth/ISL/
28 ,
Practical details (3)
I Lecture material
I this set of slides (by Hoyer/Kivinen/Roos) is intended for use
as part of the actual lectures, together with the blackboard etc.
I we will cover some topics in more detail than the textbook
(and some less)
I in particular some additional detail is needed for homework
problems
I both the selected parts of the textbook as well as additional
material indicated on the course homepage are required
material for the exam
29 ,
Practical details (4)
I Exercises (Thu 14:15, Fri 12:15):
I Two kinds:
I mathematical exercises (pen-and-paper)
I computer exercises (support given in R but Python is a good
choice too)
I Problem set handed out every Friday, focusing on topics from
that week’s lectures
I Solutions returned at the exercise sessions
I For those without an exercise group: crash the party in either
group until we either create a new group or vacancy is created
otherwise
I If necessary, solutions can be turned in by email
([email protected]) but pity the poor TA who
checks piles of exercises weekly)
I Language of exercise sessions: English
I Exercise points make up 40% of your total grade, must get at
least half the points to be eligible for the course exam.
30 ,
Practical details (5)
31 ,
Practical details (6)
I Course exam:
I December 20th at 8:00am (sorry, not my choice)
I Constitutes 60% of your course grade
I Must get a minimum of half the points of the exam to pass
the course
I Pen-and-paper problems, similar style as in exercises (also
‘essay’ or ‘explain’ problems)
32 ,
Practical details (8)
I Prerequisites:
I Mathematics: Basics of probability theory and statistics, linear
algebra (i.e., vectors and matrices) and real analysis (i.e.,
derivatives, etc.)
I Computer science: Good programming skills (but no previous
familiarity with R necessary)
33 ,
Practical details (9)
I Course material:
I Webpage (public information about the course):
www.cs.helsinki.fi/en/courses/582631/2016/s/k/1
I Help?
I Ask the assistants/lecturer at exercises/lectures
I Contact assistants/lecturer separately
34 ,
We’re in this together. Let’s do it!
35 ,