Week1 UDL CM20315 01 Intro
Week1 UDL CM20315 01 Intro
Prof. Simon Prince, Dr. Georgios Exarchakis and Dr. Andrew Barnes
1. Introduction
This is a VERY large lecture theatre. Please leave the back five rows empty!
CM20315 - Machine Learning
Prof. Simon Prince, Dr. Georgios Exarchakis and Dr. Andrew Barnes
1. Introduction
Semester 1
Book
http://udlbook.com
Supervised learning
• Define a mapping from input to output
• Learn this mapping from paired input/output data examples
Regression
• Goal: take actions to change the state so that you receive rewards
• You don’t receive any data – you have to explore the environment
yourself to gather data as you go
Example: chess
• States are valid states of the chess board
• Actions at a given time are valid possible moves
• Positive rewards for taking pieces, negative rewards for losing them
Example: chess
• States are valid states of the chess board
• Actions at a given time are valid possible moves
• Positive rewards for taking pieces, negative rewards for losing them
Why is this difficult?
• Stochastic
• Make the same move twice, the opponent might not do the same thing
• Rewards also stochastic (opponent does or doesn’t take your piece)
• Temporal credit assignment problem
• Did we get the reward because of this move? Or because we made good
tactical decisions somewhere in the past?
• Exploration-exploitation trade-off
• If we found a good opening, should we use this?
• Or should we try other things, hoping for something better?
Landmarks in Deep Learning
• 1958 Perceptron (Simple `neural’ model)
• 1986 Backpropagation (Practical Deep Neural networks)
• 1989 Convolutional networks (Supervised learning)
• 2012 AlexNet Image classification (Supervised learning)
• 2014 Generative adversarial networks (Unsupervised learning)
• 2014 Deep Q-Learning -- Atari games (Reinforcement learning)
• 2016 AlphaGo (Reinforcement learning)
• 2017 Machine translation (Supervised learning)
• 2019 Language models ((Un)supervised learning)
• 2022 Dall-E2 Image synthesis from text prompts ((Un)supervised learning)
• 2022 ChatGPT ((Un)supervised learning)
• 2023 GPT4 Multimodal model ((Un)supervised learning)
2018 Turing award winners
This course
Deep neural networks
How to train them
How to measure their performance
How to make that performance better
This course