
COMP GI22/MI22

Deep Learning Lecture 1


Thore Graepel & Guest Lecturers from DeepMind
[email protected]
Overview
● Team and Structure of the Course
● Guest Lectures and Lecturers
● DeepMind approach to AI
● Why Deep Learning?
● Deep Reinforcement Learning at work
○ Learning to Play Atari Games with Deep RL
○ AlphaGo - Learning to play Go at master level
● Extra revision material (supervised learning)
The DeepMind/UCL Team

Matteo Hessel, Diana Borsa (TA Leads)
Koray Kavukcuoglu (Co-Lead DL), Hado van Hasselt (Co-Lead RL)
Marie Mulville, Alex Davies (PgM)

Teaching Assistants:

● Zach Eaton-Rosen
● Lewis Moffat
● Michael Jones
● Raza Habib
● Thomas Gaudelet
Format and Coursework
● Format: Two streams, both streams mandatory
○ Tuesdays: Deep Learning taught by a selection of fantastic guest lecturers from DeepMind
○ Thursdays: Reinforcement Learning taught by Hado van Hasselt (also DeepMind)
○ Some exceptions, check the timetable at https://timetable.ucl.ac.uk/ and on Moodle (for topics)
● Assessment: 100% through Coursework
○ There are four deep learning and four reinforcement learning assignments
○ Each of the eight assignments will be weighted equally, i.e., each counts for 12.5%
○ Coursework is a mixture of programming assignments and questions
○ Framework for coursework will be Colab, a Jupyter notebook environment that requires no
setup to use and runs entirely in the cloud.
○ Machine Learning algorithms will be implemented in TensorFlow through Colab.
○ You can find more information about the assessment on Moodle.
○ Todo: Set up Google account with address: "[email protected]",
where XXXXXXXX is your (numerical) student number
● Support: Use Moodle forum and Moodle direct messages
TensorFlow - What is it?
(Presented by Hado or Diana)
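To give a flavour of what the Colab coursework code will look like, here is a minimal sketch of building and running a small computation in TensorFlow 1.x (the graph-and-session style in use at the time); the values are purely illustrative.

import tensorflow as tf

# Build a small computation graph: a 2x2 matrix times a 2x1 vector.
a = tf.constant([[1.0, 2.0],
                 [3.0, 4.0]])
b = tf.constant([[1.0],
                 [0.5]])
y = tf.matmul(a, b)

# Nothing runs until the graph is executed inside a session.
with tf.Session() as sess:
    print(sess.run(y))  # [[2.], [5.]]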
Warning: Lots of work and prior knowledge required!
● Last year, many people complained that it was too much work!
● If you do not know how to code in Python this may not be right for you!
● A lot of preliminary knowledge required - see quiz!
● Deep Learning lectures are delivered by top researchers in the field and will
stretch towards the current research frontier → brace yourselves!
● Check out the Self-Assessment Quiz on Moodle
DeepMind
Guest Lecturers
Introduction to TensorFlow
● Lecture topics:
○ Introduction to TensorFlow principles
○ Practical work-through examples in Colab
● Guest Lecturer: Matteo Hessel
○ Joined DeepMind in 2015.
○ Masters in Machine Learning from UCL
○ Master of Engineering, Politecnico di Milano
● Guest Lecturer: Alex Davies
○ Joined DeepMind in 2017
○ PhD in Machine Learning at Cambridge
○ Worked with a team of international scientists to build the world's first machine-learned musical.
Neural Nets, Backprop, Automatic Differentiation
● Lecture topics:
○ Neural nets
○ Multi-class classification and softmax loss
○ Modular backprop
○ Automatic differentiation
● Guest Lecturer: Simon Osindero
○ Joined DeepMind in 2016.
○ Undergrad/Masters in Natural Sciences/Physics at University of Cambridge.
○ PhD in Computational Neuroscience from UCL (2004). Supervisor: Peter Dayan.
○ Postdoc at University of Toronto with Geoff Hinton. (Deep belief nets, 2006).
○ Started an A.I. company, LookFlow, in 2009. Sold to Yahoo in 2013.
○ Current research topics: deep learning, RL agent architectures and algorithms,
memory, continual learning.
Convolutional Neural Networks
● Lecture topics:
○ Convolutional networks
○ Large-scale image recognition
○ ImageNet models
● Guest Lecturer: Karen Simonyan
○ Joined DeepMind in 2014
○ DPhil (2013) and Postdoc (2014) at the University of Oxford
with Andrew Zisserman
○ Research topics: deep learning, computer vision
■ VGGNets, two-stream ConvNets, ConvNet visualisation, etc.
■ https://scholar.google.co.uk/citations?user=L7lMQkQAAAAJ
Temporal Hierarchies

Recurrent Nets and Sequence Generation


● Lecture topics:
○ Recurrent Neural Networks
○ Long Short-Term Memory (LSTM)
○ (Conditional) Sequence Generation
● Guest Lecturer: Oriol Vinyals
○ Joined DeepMind in 2016.
○ Worked in Google Brain from 2013 to 2016.
○ PhD in Artificial Intelligence from UC Berkeley (2009-13). Supervisor: Darrell / Morgan.
○ Current research topics: deep learning, sequence modeling, generative models,
distillation, RL/Starcraft, one shot learning.

[Figures: sequence prediction, seq2seq, recurrent architectures]


End-To-End and Energy-Based Learning
● Lecture topics:
○ End-to-end learning
○ Energy based learning
○ Ranking
○ Embeddings
○ Triplet loss
● Guest Lecturer: Raia Hadsell
○ PhD from NYU, postdoc at CMU’s Robotics Institute
○ Senior Scientist and Tech Manager at SRI International
○ Now leading a research team at DeepMind
○ Research in Deep Learning, Robotics, Navigation, Life-Long Learning
Optimisation
● Lecture topics:
○ First-order methods
○ Second-order methods
○ Stochastic methods
○ Some convergence theory
● Guest Lecturer: James Martens
○ Joined DeepMind in Sept 2016
○ PhD from University of Toronto under Geoff Hinton & Rich Zemel in
2015
○ Undergrad from Waterloo in Math and Computer Science
○ Working on: second-order optimization for neural nets,
characterizing expressive power/efficiency of neural nets, generative
models / unsupervised learning
Attention and Memory Models
● Lecture topics:
○ Neural attention models
○ Recurrent neural networks with external memory
○ Neural Turing Machines / Differentiable Neural Computers
● Guest Lecturer: Alex Graves
○ Joined DeepMind in 2013
○ Undergrad Theoretical Physics, Univ. of Edinburgh
○ Masters Mathematics and Theoretical Physics, Univ. of Cambridge
○ PhD Artificial Intelligence TU Munich, supervisor Jürgen Schmidhuber
○ CIFAR Junior fellow with Geoff Hinton, Univ. of Toronto
○ Research focuses on sequence learning with recurrent neural networks:
memory, attention, sequence generation, model compression
Deep Learning for Natural Language Processing
● Lecture topics:
○ Deep Learning for Natural Language Processing
○ Neural word embeddings
○ Neural machine translation
● Guest Lecturer: Ed Grefenstette
○ DPhil from Oxford
○ Co-Founder of Dark Blue Labs (acquired by DeepMind)
○ Research in Machine Learning, Computational Linguistics
Unsupervised Learning and Deep Generative Models
● Lecture topics:
○ Density estimation and unsupervised learning.
○ Deep Generative Models: latent variable and implicit models.
○ Approximate inference and variational inference.
○ Stochastic optimisation
● Guest Lecturer: Shakir Mohamed
○ Joined DeepMind in 2013.
○ PhD in Statistical Machine Learning, St John’s College, University of Cambridge. Supervisor: Zoubin
Ghahramani.
○ CIFAR Junior Research Fellow at the University of British Columbia with Nando de Freitas.
○ Research topics: probabilistic thinking, approximate Bayesian inference, unsupervised learning and density
estimation, deep learning, reinforcement learning.
○ Undergrad in electrical engineering. From Johannesburg, South Africa.
Reinforcement Learning Stream (Hado)

● Introduction to Reinforcement Learning


● Markov Decision Processes
● Planning by Dynamic Programming
● Model-Free Prediction
● Model-Free Control

● Value Function Approximation (Deep RL)


● Policy Gradient Methods
● Integrating Learning and Planning
● Exploration and Exploitation
● Case Study: AlphaGo
Case Study: AlphaGo (TBC)
● Lecture topics:
○ The story behind AlphaGo
○ Deep RL applied to Classical Board Games
○ Combining Tree Search and Neural Networks
○ Evaluation against machines and humans
● Guest Lecturer: David Silver
○ Computer Science at Cambridge, PhD from Alberta
○ Co-Founder/CTO of Elixir Studios
○ Faculty member at UCL (on leave at DeepMind)
○ Joined DeepMind in 2013
○ Research in deep reinforcement learning, integration
of learning and planning, games
Case Study: Practical Deep RL (TBC)
● Lecture topics:
○ Learning to play Atari games: DQN in Detail
○ Faster Agents through parallel training
○ Better data efficiency through unsupervised RL
○ Some practical advice
● Guest Lecturer: Volodymyr Mnih
○ PhD in Machine Learning at the University of Toronto
○ Early DeepMind pioneer
○ Legendary work on Deep RL for playing Atari, published in Nature
DeepMind founded 2010 (joined Google 2014)
Mission: “Solve Intelligence”

An Apollo Programme for AI (150+ scientists)

A new approach to organizing science

General Artificial Intelligence


General-Purpose Learning Algorithms

Learn automatically from raw inputs - not pre-programmed

General - same system can operate across a wide range of tasks

Artificial ‘General’ Intelligence (AGI) – flexible, adaptive, inventive

‘Narrow’ AI – hand-crafted, special-cased, brittle


Reinforcement Learning
[Diagram: an agent interacts with its environment, receiving observations and a goal/reward signal and emitting actions]
○ General Purpose Framework for AI


○ Agent interacts with the environment
○ Select actions to maximise long-term reward
○ Encompasses supervised and unsupervised learning as special cases

Deep Learning
What is intelligence?
Intelligence measures an agent’s ability to achieve
goals in a wide range of environments

Measure of intelligence: Υ(π) = Σ_μ 2^(−K(μ)) · V_μ^π
(a sum over environments μ of the value achieved by the agent, weighted by a complexity penalty)

Universal Intelligence: A Definition of Machine Intelligence, Legg & Hutter 2007


Multi-Agent and AI
Grounded Cognition
A true thinking machine has to be grounded in a rich sensorimotor reality
Games are the perfect platform for developing and testing AI algorithms
Unlimited training data, no testing bias, parallel testing, measurable progress
‘End-to-end’ learning agents: from pixels to actions
Thanks to Koray for DL slides
Why Deep Learning?
● Enables End-To-End Training
○ Optimise for the end loss
○ Don’t engineer your inputs
○ Learn good representations
● Versatile: Can be applied to images, text, audio, video
● Modular design of systems (modular backprop)
● Represent weak prior knowledge (e.g., convolutions)
● Now computationally feasible at scale (GPUs)

Deep Learning
Supervised Learning
○ Convolutional Networks on MNIST

[ LeCun et al. ]

○ Convolutional Networks on ImageNet

[ Krizhevsky et al. ]

Deep Learning
Supervised Learning
○ Convolutional Networks on Text

[ Zhang et al. ]

○ Convolutional Networks on Video

[ Collobert et al. ]
[ Simonyan et al. ]

Deep Learning
Supervised Learning
○ End-to-End Training
○ Optimize for the end loss
○ No engineered inputs
○ With enough data, learn a big non-linear function
○ Learn good representations of data
■ Rich enough supervised labeling is enough to train transferable representations
■ Best feature extractor
■ Karpathy; Razavian et al.; Yosinski et al.; Donahue et al.
○ Large labeled dataset + big/deep neural network + GPUs
○ Ever more sophisticated modules → Differentiable Programming

Deep Learning
Supervised Learning
○ Innovation continues
■ Inception
■ Ladder Nets
■ Residual Connections
■ …
○ Performance is continuously improving
○ Architectures for easier optimization [ Rasmus et al. ]
■ Batchnorm

[ Szegedy et al. ] [ He et al. ]

Deep Learning
Unsupervised Learning
○ Unsupervised Learning/Generative Models
■ RBM
■ Auto-encoders
■ PCA, ICA, Sparse Coding
[ Hinton et al. ]
■ VAE
■ NADE - and all variants
■ GANs
○ How to evaluate/rank different algorithms?
○ Quantitative approach or visual quality?
■ How can we trust the evaluation if the input domain itself is not interpretable?
○ How can unsupervised learning help a task?
[ Larochelle & Murray ]

Deep Learning
Sequence Modeling
○ Almost all data are sequences
■ Text
■ Video [ Hochreiter and Schmidhuber ]
■ Audio
■ Image [ NADE, PixelRNN ]
■ Multi-modal (caption → image, image → caption)

[ Vinyals et al. ]
[ Sutskever et al. ]

Deep Learning
Human-level control
through deep
reinforcement learning
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G.
Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig
Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran,
Daan Wierstra, Shane Legg, Demis Hassabis

Google DeepMind
(Mnih et al. Nature 2015)
ATARI Games
● Designed to be challenging and
interesting for humans
● Provides a good platform for sequential
decision making
● Widely adopted RL benchmark for
evaluating agents (Bellemare’13)
● Many different games emphasize
control, strategy, …
● Provide a rich visual domain

Deep Learning
End-to-End Reinforcement Learning
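For reference, the end-to-end objective in the DQN paper above trains the Q-network directly on the temporal-difference error; roughly, in the notation of Mnih et al. (2015), reproduced here from memory:

L_i(\theta_i) = \mathbb{E}_{(s,a,r,s') \sim U(\mathcal{D})} \Big[ \big( r + \gamma \max_{a'} Q(s', a'; \theta_i^{-}) - Q(s, a; \theta_i) \big)^2 \Big]

where \mathcal{D} is the replay memory of past transitions and \theta_i^{-} are the parameters of a periodically updated target network.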

DeepMind Lab - Challenging RL Problems in 3D

General Artificial Intelligence


Mastering the game of Go with deep
neural networks and tree search
David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den
Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot,
Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy
Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel & Demis Hassabis

Google DeepMind
(Silver, Huang, et al 2016)
#3 most downloaded
academic paper this month
Why is Go hard for computers to play?

Game tree complexity = b^d (branching factor b, game depth d; for Go, roughly b ≈ 250 and d ≈ 150)

Brute force search intractable:

1. Search space is huge


2. “Impossible” for computers
to evaluate who is winning
Value network: maps a board position s to an evaluation v(s)
Policy network: maps a board position s to move probabilities p(a|s)
Reducing depth with value network
Reducing breadth with policy network
Evaluating current AlphaGo against computers
[Chart: Elo ratings of Go programs against a human-rank calibration scale running from beginner kyu (k) through amateur dan (d) to professional dan (p). GnuGo, Fuego, Pachi, Zen, and Crazy Stone sit in the amateur range; AlphaGo (Nature v13) reaches professional level, and the current AlphaGo (v18) is rated higher still.]

● v13 scored 494/495 against computer opponents
● v18 beats v13 even at a 3-4 stone handicap
● CAUTION: ratings based on self-play results

● DeepMind challenge match (Mar 2016): AlphaGo beats Lee Sedol (9p), top player of the past decade, 4-1
● Nature match (Oct 2015): AlphaGo beats Fan Hui (2p), 3-times reigning European Champion, 5-0
● Crazy Stone and Zen beat amateur humans on KGS
Extra revision material (Supervised Learning)
• Review of concepts from supervised learning
• Generalisation, overfitting, underfitting
• Learning curves
• Stochastic gradient descent
• Linear regression
• Cost function
• Gradients
• Logistic regression
• Cost function
• Gradients
Supervised Learning Problem
Given a set of input/output pairs (a training set), we wish to compute the
functional relationship between the input and the output.

Example 1 (people detection): given an image, we wish to say whether it depicts a
person or not. The output is one of two possible categories.

Example 2 (pose estimation): we wish to predict the pose of a face image. The
output is a continuous number (here, a real number describing the face
rotation angle).

In both problems the input is a high-dimensional vector x representing pixel
intensity/colour.
Example: People Detection
Example: People Detection (cont.)
Supervised Learning Model

Supervised Learning Problem: compute a function which best describes the I/O relationship
Learning Algorithm

• Example Algorithms:
• Linear Regression
• Logistic Regression
• Neural Networks
• Decision Trees
• In this lecture, we will revise linear and logistic regression
Key Questions for the ML Practitioner

• How is the data collected? (need assumptions!)


• How do we represent the inputs? (may require pre-processing step)
• How accurate is the learned function on new data (study of
generalization error)?
• Many algorithms may exist for a task. How do we choose?
• How “complex” is a learning task? (computational complexity,
sample complexity)
Important Challenges for ML
• New inputs differ from the ones in the training set (look up tables do
not work!)
• Inputs are measured with noise
• Output is not deterministically obtained by the input
• Input is often high dimensional but some components/variables may
be irrelevant
• How can we incorporate prior knowledge?
Generalisation
Most important idea of machine learning:
Train models such that they correctly predict on unseen data
(from the same distribution)
• Empirical risk minimization: minimise error on the training sample
• Validation: hold out data for testing to obtain an unbiased estimate of generalisation error

• When data is scarce, use cross-validation (a sketch follows below)


Cross Validation
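As a concrete illustration of hold-out validation and cross-validation (an illustrative NumPy sketch, not taken from the slides; the fit and evaluate callables are placeholders for whatever training and error functions are used):

import numpy as np

def k_fold_cv(X, y, fit, evaluate, k=5, seed=0):
    """Estimate generalisation error by averaging validation error over k folds."""
    indices = np.random.RandomState(seed).permutation(len(X))
    folds = np.array_split(indices, k)
    errors = []
    for i in range(k):
        val_idx = folds[i]
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        model = fit(X[train_idx], y[train_idx])                 # train on k-1 folds
        errors.append(evaluate(model, X[val_idx], y[val_idx]))  # error on the held-out fold
    return float(np.mean(errors))

Each example is held out exactly once, so the averaged error gives a usable estimate of generalisation even when data is scarce.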
Underfitting and Overfitting
Underfitting:
• Error driven by approximation error
• High bias / low variance
• What to do? Use more features, use a more complex model, reduce regularization, train for longer

Overfitting:
• Error driven by generalization error
• Low bias / high variance
• What to do? Use fewer features, use a simpler model, increase regularization, stop training early
More Data versus Better Algorithm
• In high-variance, overfitting situations
more data helps
• Example: Confusion Set Disambiguation
• Banko and Brill 2001, “Scaling to Very
Very Large Corpora for Natural
Language Disambiguation”
• See also: “The Unreasonable Effectiveness of Data”, Halevy, Norvig, and Pereira
Real-World Learning Curves: Underfitting
[Plot: training error and validation error curves]

Real-World Learning Curves: Overfitting
[Plot: training error and validation error curves, with the early-stopping point marked]

Real-World Learning Curves: Just Right
[Plot: training error and validation error curves]
Generalisation in Deep Learning
• “Understanding Deep Learning requires rethinking generalization”, Zhang, S. Bengio, Hardt,
Recht, Vinyals
• Deep Neural Networks easily fit random labels
• Generalization error varies from 0 to 90% without changes in model
• Deep NNs can even (rote) learn to classify random images
(Stochastic) Gradient Descent
Generalisation from Stochastic Gradient Descent
Linear Regression
Linear Regression Cost Function
• Model:

• Example-wise loss function:

• Total loss function:

• Minimising the squared error is equivalent to assuming Gaussian noise in a maximum-likelihood estimation (a reconstruction of the standard equations is sketched below)
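The equations on this slide were rendered as images and did not survive extraction; a hedged reconstruction of the standard formulation the bullets refer to (notation assumed) is:

f_w(x) = w^\top x + b                                   (model)
\ell_n(w, b) = \tfrac{1}{2} (f_w(x_n) - y_n)^2           (example-wise squared-error loss)
L(w, b) = \frac{1}{N} \sum_{n=1}^{N} \ell_n(w, b)        (total loss over N training examples)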
Stochastic gradient descent for regression
• Total loss gradient:

• Loss gradient:

• Model gradient:

• Put together:
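Putting the pieces together, a minimal stochastic-gradient-descent loop for linear regression might look like this (an illustrative NumPy sketch under the formulation above, not the coursework code):

import numpy as np

def sgd_linear_regression(X, y, lr=0.01, epochs=100, seed=0):
    """Fit y ≈ X @ w + b by stochastic gradient descent on the squared error."""
    rng = np.random.RandomState(seed)
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    for _ in range(epochs):
        for i in rng.permutation(n):      # visit examples in random order
            err = X[i] @ w + b - y[i]     # loss gradient w.r.t. the model output
            w -= lr * err * X[i]          # chain rule: model gradient dz/dw = x
            b -= lr * err                 # chain rule: model gradient dz/db = 1
    return w, b

Batch gradient descent would instead average these gradients over the whole training set before each update.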
Batch and stochastic gradient descent

Regularisation
Non-linear Basis Functions
Regression with polynomial basis functions

[Plots: polynomial fits of degree 0 through 5 to the same data]


Polynomial Fit for different degrees
• Training error goes down with
increasing degree (better fit)
• Test error is optimal at degree 2,
and deteriorates for higher
degrees
• Note the similarity to learning
curves discussed earlier. The
effective hypothesis class of
neural networks becomes more
complex with longer training
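To reproduce the behaviour described above, one can fit polynomials of increasing degree to synthetic data and compare training and test error (an illustrative NumPy sketch; the data-generating function is assumed, not from the slides):

import numpy as np

rng = np.random.RandomState(0)
true_fn = lambda x: 1.0 - 2.0 * x + 3.0 * x ** 2           # ground truth is quadratic
x_train, x_test = rng.uniform(-1, 1, 30), rng.uniform(-1, 1, 30)
y_train = true_fn(x_train) + 0.3 * rng.randn(30)
y_test = true_fn(x_test) + 0.3 * rng.randn(30)

for degree in range(6):
    coeffs = np.polyfit(x_train, y_train, degree)           # least-squares polynomial fit
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {degree}: train MSE {train_mse:.3f}, test MSE {test_mse:.3f}")

Training error can only fall as the degree grows, while test error typically bottoms out near the true degree and then rises.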
Logistic Regression for classification

• Generalized linear model for binary classification
• Used, e.g., in click-through-rate prediction for search engine advertising
• Find a linear hyperplane to separate the data
• Predict the probability of each class
Logistic Regression Cost Function
• Linear model:

• (Inverse) Link function:

• Cross entropy loss:

• The regression loss is a composition of these three functions, aggregated over the training examples (a reconstruction of the pieces is sketched below)
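A hedged reconstruction of the three pieces named above (the rendered equations did not survive extraction; notation assumed):

z = w^\top x + b                                                  (linear model)
\hat{y} = \sigma(z) = \frac{1}{1 + e^{-z}}                         (inverse link: logistic sigmoid)
\ell(y, \hat{y}) = -y \log \hat{y} - (1 - y) \log(1 - \hat{y})      (cross-entropy loss, y \in \{0, 1\})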
Logistic (Inverse) Link Function

By Michaelg2015 (Own work) [CC BY-SA 4.0 (http://creativecommons.org/licenses/by-sa/4.0)], via Wikimedia Commons
Cross Entropy
Logistic Regression Cost Function
Modular Gradients for Logistic Regression
• Total Gradient:

• Loss gradient:

• Link gradient:

• Model gradient:
Putting the gradient back together

• Similarly, the backpropagation algorithm works through the layers of deeper neural networks to calculate error gradients w.r.t. the weights
• Simon’s lecture will give more details
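As an illustration of how the modular gradients compose for a single example (a sketch under the assumed notation above, mirroring the loss/link/model decomposition):

import numpy as np

def logistic_regression_grad(x, y, w, b):
    """Chain the loss, link, and model gradients for one example (x, y)."""
    z = w @ x + b                        # model: linear score
    y_hat = 1.0 / (1.0 + np.exp(-z))     # inverse link: sigmoid
    dloss_dz = y_hat - y                 # cross-entropy gradient composed with the sigmoid gradient
    grad_w = dloss_dz * x                # model gradient: dz/dw = x
    grad_b = dloss_dz                    # model gradient: dz/db = 1
    return grad_w, grad_b

Backpropagation applies exactly this pattern layer by layer in deeper networks.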
