Lec 1 Intro Course Overview
Lec 1 Intro Course Overview
Option 1:
Understand the problem, design a solution
Option 2:
Set it up as a machine learning problem
data
supervised
learning
Deep Reinforcement Learning, Decision Making,
and Control
CS 285
Instructor: Sergey Levine
UC Berkeley
data
reinforcement
learning
What is reinforcement learning?
What is reinforcement learning?
• i.i.d. data
• known ground truth outputs in training
decisions (actions)
Cathy Wu
Why should we care about deep
reinforcement learning?
How do we build intelligent machines?
Intelligent machines must be able to adapt
Deep learning helps us handle unstructured
environments
Reinforcement learning provides a formalism for
behavior
decisions (actions)
consequences
observations
rewards
Levine*, Finn*, et al. ‘16
What is deep RL, and why should we care?
standard
features mid-level features classifier
computer
(e.g. HOG) (e.g. DPM) (e.g. SVM)
vision
Felzenszwalb ‘08
end-to-end training
deep
learning
standard
reinforcement
learning
features
? more features
? linear policy
or value func.
action
Action
(run away)
action
sensorimotor loop
Action
(run away)
Example: robotics
robotic state
modeling & low-level
control observations estimation
prediction
planning
control
controls
pipeline (e.g. vision)
tiny, highly specialized tiny, highly specialized
“visual cortex” “motor cortex”
decisions (actions)
Tesauro, 1995
L.-J. Lin, “Reinforcement learning for robots using neural networks.” 1993
Why should we study this now?
predicted real
Auditory
Cortex
observations
education one would obtain the
actions
adult brain.
- Alan Turing
environment