0% found this document useful (0 votes)

5 views45 pages

Machine Learning Notes22

The course on Applied Machine Learning, led by Richard Johansson, covers various machine learning models, their implementation, and practical applications, emphasizing real-world contexts and ethical considerations. Students will engage in interactive lectures, complete five compulsory programming assignments using Python, and participate in a take-home exam. The course also includes resources such as literature, exercise sheets, and online quizzes to support learning.

Uploaded by

hewan bekele

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views45 pages

Machine Learning Notes22

Uploaded by

hewan bekele

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 45

Applied Machine Learning

Introduction to the Course

Richard Johansson
[email protected]
Welcome to the course!

· Machine learning is increasingly popular among students

· our courses take increasing volumes
· many thesis projects develop or apply ML models
· …and in industry, public sector
· many companies come to us looking for students
· joint research projects
· Why the fuss and why now?
Success stories: image recognition
Success stories: machine translation
Data

[source]
[source]
Applications…

[source]
Topics covered in the course

· The usual “zoo”: a selection of machine learning models

· what’s the idea behind them?
· how are they implemented? (at least on a high level)
· what are the use cases?
· how can we apply them practically?
· But hopefully also the “real-world context”:
· extended “messy” practical assignments requiring that you
think of what you’re doing
· invited talks from industry and/or the healthcare sector
· annotation of data, evaluation
· ethical and legal issues, interpretability
Overview

Practical issues about the course

Fundamental concepts in machine learning

Machine learning libraries in Python

Course webpage

· The official course webpage is the Canvas page

https://chalmers.instructure.com/courses/33104
People involved in the course

· Richard: examiner, responsible for the course

· Anton, Jack, Newton, Selma, Philipp, Laleh, Styrbjörn: helping
you with the assignments
Structure of teaching

· Lecture discussions Tuesdays and Fridays 13–15

· we will use a flipped classroom format with pre-recorded
lectures you are expected to have watched before the session
· summary and discussion of the content of the recorded
lectures
· interactive coding
· solving a few exercises when we have time
· feel free to ask questions before the session!
· Assistance sessions Thursdays 13–17
· our TAs help you work on your assignments
· please let me know if it’s too crowded
· in a computer lab room (with possibly additional remote
sessions)
Assignments

· Five compulsory assignments:

PA 1 intro to the ML workflow, decision trees
PA 2 random forests
PA 3 text classification
PA 4 neural network software
PA 5 medical image classification
· We will use the Python programming language
· Please refer to the course PM for details about grading
· Assignments are done in groups
Programming assignment 1

· Warmup lab exercise: quick tour of the scikit-learn library

· Introduction to decision trees
· For a high grade: implement decision tree regression
· Assistance sessions this Thursday
· Submission deadline: January 27
Literature

· We won’t follow a book closely, but we’ll give pointers to

reading material in this book:
· Machine Learning: A course for engineers and scientists by
Lindholm et al: http://smlbook.org/
· And additional papers to read for some topics
· Some notes to complement the lectures
· Example code will be posted on the course page
Additional material along the way

· Exercise sheets, old exams

· Online quizzes
Exam, mid-March

· This is a take-home exam: a written assignment

· Will be available during the whole exam period
· Two-part structure:
1. a first compulsory part about basic concepts: you need to
answer most of these questions correctly to pass
2. a second optional part that requires more insight: answer these
questions for a higher grade
Student representatives

· If you’re interested in being a student representative, please

send me an email!
· The workload is light and there will be a small reward…
Overview

Practical issues about the course

Fundamental concepts in machine learning

Machine learning libraries in Python

Predictive models

· Given some object, make a prediction

· is this patient diabetic?
· what animal does this image show?
· what is the market value of this apartment?
· what are the phonemes contained in this speech signal?
Predictive models

· Given some object, make a prediction

· is this patient diabetic?
· what animal does this image show?
· what is the market value of this apartment?
· what are the phonemes contained in this speech signal?
· The goal of machine learning is to build the predictive models
by observing data
Predictive models

· Given some object, make a prediction

[source]
Why machine learning?

Why would we want to “learn” the function from data instead of

just implementing it?
· Usually because we don’t really know how to write down the
function by hand
· speech recognition
· image classification
· machine translation
·…
· Might not be necessary for limited tasks where we know
· What is more expensive in your case? knowledge or data?
Don’t forget your domain expertise!

ML makes some tasks automatic, but we still need our brains:

· defining the tasks, terminology, evaluation metrics
· annotating (hand-labeling) training and testing data
· designing features
· error analysis
Example: is the patient diabetic?

In order to predict, we make some measurements of

properties we believe will be useful: these are called the
features
Example: is the patient diabetic?

· In order to predict, we make some measurements of

properties we believe will be useful: these are called the
features
More terminology: what is the output?

· Classification: learning to output a category label

· spam/non-spam; positive/negative; …
· Regression: learning to guess a number
· value of a share; number of stars in a review; …
How is the training signal provided?

· In supervised learning, the training set consists of

input–output pairs
· our goal is to learn to produce the outputs
Types of supervision: alternatives

· Unsupervised learning: we are given “unorganized” data

· our goal is to discover some structure
15 15

10 10

5 5

0 0

5 5

10 10
7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0

· Reinforcement learning: our problem is formalized as a game

· an agent carries out actions and receives rewards
Example: Fisher’s iris data

versicolor
2.4 virginica

2.2

2.0

petal_width
1.8

1.6

1.4

1.2

1.0
3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0
petal_length
Approach 1: linear separator

if 0.85 · petal_length + 2.42 · petal_width ≥ 8.34:

return virginica
else
return versicolor

2.4

2.2

2.0
petal_width

1.8

1.6

1.4

1.2

1.0
3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5
petal_length
Approach 2: if/then/else tree

2.4

2.2

2.0
petal_width

1.8

1.6

1.4

1.2

1.0
3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5
petal_length
Basic supervised machine learning workflow
Basic ML methodology: evaluation

· Select an evaluation procedure (a “metric”) such as

· classification accuracy: proportion correct classifications?
· mean squared error often used in regression
· or some domain-specific metric
· Compare to one or more baselines
· trivial solution
· rule-based solution
· existing solution
· Apply your model to a held-out test set and evaluate
· the test set must be different from the training set
· also: don’t optimize on the test set; use a development set or
cross-validation!
Managing your data
Managing your data
Managing your data
Managing your data for evaluation and cross-validation

[source]
Overview

Practical issues about the course

Fundamental concepts in machine learning

Machine learning libraries in Python

Use cases for machine learning

· Standard use cases: standard

solutions are available

· Special cases: we may need to tailor

our own solutions
The Python machine learning ecosystem (selection)
Machine learning software: a small sample

· General-purpose software, large collections of algorithms:

· scikit-learn: http://scikit-learn.org
▶ Python library – will be used in this course
· Weka: http://www.cs.waikato.ac.nz/ml/weka
▶ Java library with nice user interface
· Special-purpose software, small collections of algorithms:
· Keras, PyTorch, TensorFlow, JAX for neural networks
· LibSVM/LibLinear for support vector machines
· XGboost, lightgbm for tree ensembles
·…
· large-scale learning in distributed architectures:
· Spark MLLib
· H2O
Scikit-learn toy example

See also
https://scikit-learn.org/stable/getting_started.html
Up next

· Thursday: lab sessions for programming assignment 1

· Topic of Friday’s discussion:
· decision trees
· ensembles and random forests
· generalization, under/overfitting
· Please prepare for assignment 1 by reading my code and the
extra reading on decision trees

R22 Machine Learning Digital Notes Final
No ratings yet
R22 Machine Learning Digital Notes Final
143 pages
ML - Unit I - Final
No ratings yet
ML - Unit I - Final
132 pages
COS324 Course Notes
No ratings yet
COS324 Course Notes
256 pages
SEM 5 Syllabus
No ratings yet
SEM 5 Syllabus
28 pages
CS230
No ratings yet
CS230
6 pages
Machine Learning: Martin Jaggi & Nicolas Flammarion
No ratings yet
Machine Learning: Martin Jaggi & Nicolas Flammarion
52 pages
2024 Machine Learning Intro
No ratings yet
2024 Machine Learning Intro
50 pages
1694266379-Unit1 Machine Learning Introduction CU 2.0
No ratings yet
1694266379-Unit1 Machine Learning Introduction CU 2.0
58 pages
Fall2024 W4995 Lecture1
No ratings yet
Fall2024 W4995 Lecture1
110 pages
ML Key Concepts
No ratings yet
ML Key Concepts
139 pages
MAchine Learning
No ratings yet
MAchine Learning
120 pages
100 Projects
No ratings yet
100 Projects
25 pages
MLUnit 1
No ratings yet
MLUnit 1
131 pages
ABES Presentation
No ratings yet
ABES Presentation
91 pages
01 ML Basics
No ratings yet
01 ML Basics
61 pages
SEng5305-chap-1-Introduction To ML
No ratings yet
SEng5305-chap-1-Introduction To ML
85 pages
ML Short U1-4
No ratings yet
ML Short U1-4
60 pages
Intro To ML - 1
No ratings yet
Intro To ML - 1
29 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
81 pages
1725629890-Unit1 Machine Learning Introduction CU 3.0
No ratings yet
1725629890-Unit1 Machine Learning Introduction CU 3.0
38 pages
Machine Learning
No ratings yet
Machine Learning
25 pages
Unit 1
No ratings yet
Unit 1
43 pages
Intro Slides
No ratings yet
Intro Slides
31 pages
8 VHVHV
100% (1)
8 VHVHV
3 pages
Specimen Paper 1 Phy
No ratings yet
Specimen Paper 1 Phy
23 pages
01 Lecture1
No ratings yet
01 Lecture1
36 pages
Lecture 01 - Introduction To AML-Jan24
No ratings yet
Lecture 01 - Introduction To AML-Jan24
66 pages
01 Introduction
No ratings yet
01 Introduction
23 pages
Applied Machine Learning
No ratings yet
Applied Machine Learning
49 pages
Lecture 3
No ratings yet
Lecture 3
36 pages
Lec1 Intro To p556
No ratings yet
Lec1 Intro To p556
29 pages
01 - Introduction
No ratings yet
01 - Introduction
35 pages
Nursing Masters Thesis Examples
100% (4)
Nursing Masters Thesis Examples
4 pages
Introduction To Machine Learning: Pekka Parviainen
No ratings yet
Introduction To Machine Learning: Pekka Parviainen
39 pages
Topic 1 - Introduction
No ratings yet
Topic 1 - Introduction
30 pages
Basic Principles of Pneumatics and Electropneumatics Textbook 573030
No ratings yet
Basic Principles of Pneumatics and Electropneumatics Textbook 573030
24 pages
Lec 01
No ratings yet
Lec 01
28 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
Ship Parts
100% (1)
Ship Parts
34 pages
ML Notes
No ratings yet
ML Notes
25 pages
EM 538 - ISE 489 Syllabus
No ratings yet
EM 538 - ISE 489 Syllabus
11 pages
CP4252 ML Syllabus
No ratings yet
CP4252 ML Syllabus
4 pages
Lecture1 PDF
No ratings yet
Lecture1 PDF
37 pages
Grade 9 Subject Choice Information Booklet
No ratings yet
Grade 9 Subject Choice Information Booklet
29 pages
Document 1
No ratings yet
Document 1
6 pages
Online MachineLearningUsing Python
No ratings yet
Online MachineLearningUsing Python
3 pages
Latihan Soal Revisi - 1
75% (4)
Latihan Soal Revisi - 1
93 pages
ML Resources CW 2025
No ratings yet
ML Resources CW 2025
5 pages
0MLwP Workshop Brochure
No ratings yet
0MLwP Workshop Brochure
8 pages
1 Lecture 1: Introduction To Machine Learning
No ratings yet
1 Lecture 1: Introduction To Machine Learning
12 pages
Being Artifex - ML Ai
No ratings yet
Being Artifex - ML Ai
5 pages
ML Syllabus - 1
No ratings yet
ML Syllabus - 1
5 pages
EContent 7 2025 01 31 11 08 21 01IT0610pdf 2023 12 17 20 26 49pdf 2025 01 16 07 59 27
No ratings yet
EContent 7 2025 01 31 11 08 21 01IT0610pdf 2023 12 17 20 26 49pdf 2025 01 16 07 59 27
3 pages
Machine Learning
No ratings yet
Machine Learning
3 pages
Introduction To Machine Learning With Python
No ratings yet
Introduction To Machine Learning With Python
2 pages
ML Course Outline
No ratings yet
ML Course Outline
4 pages
Syl3 ML
No ratings yet
Syl3 ML
5 pages
956 - BSC DataScience Semester 4 DSC D ML Paper 4
No ratings yet
956 - BSC DataScience Semester 4 DSC D ML Paper 4
3 pages
Machine Learning Specialization CloudxLab PDF
No ratings yet
Machine Learning Specialization CloudxLab PDF
12 pages
Machine Learning-Updated
No ratings yet
Machine Learning-Updated
4 pages
INF385T IMLsyllabus
No ratings yet
INF385T IMLsyllabus
4 pages
M.L.CSE Syllabus
No ratings yet
M.L.CSE Syllabus
3 pages
Course Outline ML
No ratings yet
Course Outline ML
3 pages
Welcome
No ratings yet
Welcome
1 page
Arab Tamil
No ratings yet
Arab Tamil
64 pages
Java ML
No ratings yet
Java ML
7 pages
Food Atm Project
100% (3)
Food Atm Project
27 pages
Andhra Pradesh Integrated Clean Energy Policy - 30oct2024 (FINAL)
No ratings yet
Andhra Pradesh Integrated Clean Energy Policy - 30oct2024 (FINAL)
98 pages
Chapter 13 - Statement of Cash Flows
100% (1)
Chapter 13 - Statement of Cash Flows
31 pages
tb1 ch13
No ratings yet
tb1 ch13
50 pages
Introduction
No ratings yet
Introduction
4 pages
Assignment Cover Sheet
No ratings yet
Assignment Cover Sheet
77 pages
(English (Auto-Generated) ) 2015 World Champion - 'The Power of Words' Mohammed Qahtani, Toastmasters International (DownSub - Com)
No ratings yet
(English (Auto-Generated) ) 2015 World Champion - 'The Power of Words' Mohammed Qahtani, Toastmasters International (DownSub - Com)
4 pages
Maths GR12 Session 1 14 Ssip LHS
No ratings yet
Maths GR12 Session 1 14 Ssip LHS
47 pages
1887 18480-Krispijn Iconea2008
No ratings yet
1887 18480-Krispijn Iconea2008
34 pages
BUS ROUTES 1 Shift 11.07.24
No ratings yet
BUS ROUTES 1 Shift 11.07.24
6 pages
Cost Accounting - Bcom - Module 3
No ratings yet
Cost Accounting - Bcom - Module 3
8 pages
Electricity and Magnetism II - Jackson Homework 6
No ratings yet
Electricity and Magnetism II - Jackson Homework 6
4 pages
Acfrogacgstlhqv4lelnp 1pj5n9eaagtj4xtazdet1ldqae4eoit604mf3wkwznjkjsv9u8biuqk E44cauk9hp8by63nycwc8wcqm1l Ortz93qzclo2dh94sdsbpi9n 20rwyhbbshj4axex44bqrbzrlj1ka Nnuxd9llw
No ratings yet
Acfrogacgstlhqv4lelnp 1pj5n9eaagtj4xtazdet1ldqae4eoit604mf3wkwznjkjsv9u8biuqk E44cauk9hp8by63nycwc8wcqm1l Ortz93qzclo2dh94sdsbpi9n 20rwyhbbshj4axex44bqrbzrlj1ka Nnuxd9llw
6 pages
ITIL - A Guide To Event Management PDF
No ratings yet
ITIL - A Guide To Event Management PDF
5 pages
Industrial Training Report 1
No ratings yet
Industrial Training Report 1
16 pages
Enrichr - AUGMENT (3D 2D Customer Engagement) - Product Features
No ratings yet
Enrichr - AUGMENT (3D 2D Customer Engagement) - Product Features
6 pages
Production and Operations Management: A Life Cyde Approadi
No ratings yet
Production and Operations Management: A Life Cyde Approadi
10 pages
Tahir Qarayev Accounting Imtahan Suallari
No ratings yet
Tahir Qarayev Accounting Imtahan Suallari
6 pages
Nrha Reiner
No ratings yet
Nrha Reiner
5 pages
Recent Cyclones in India
No ratings yet
Recent Cyclones in India
3 pages
Organizational Chart Tritech
No ratings yet
Organizational Chart Tritech
1 page
Separation of Ethanol-Water Using Benzene As Entrainer: Background
No ratings yet
Separation of Ethanol-Water Using Benzene As Entrainer: Background
2 pages
Business Plan Example Hostel
100% (1)
Business Plan Example Hostel
11 pages