Deep Reinforcement Learning Nanodegree Program Syllabus
Overview
The Deep Reinforcement Learning Nanodegree program is designed to enhance students’ existing machine learning and
deep learning skills with the addition of reinforcement learning theory and programming techniques. This program will grow
students’ deep learning and reinforcement learning expertise and give them the skills they need to understand the most recent
advancements in deep reinforcement learning and to build and implement their own algorithms.
Prerequisites
• Ability to describe backpropagation and knowledge of neural network architectures (like a CNN for image classification).
Required Hardware/Software
Learners need access to a computer running a 64-bit operating system with at least 8GB of RAM, along with administrator
account permissions sufficient to install programs including Anaconda with Python 3.6 and supporting packages.
*The length of this program is an estimate of the total hours the average student may take to complete all required
coursework, including lecture and project time. If you spend about 5-10 hours per week working through the program, you
should finish within the time provided. Actual hours may vary.
Lesson 1: Introduction to RL
• A friendly introduction to reinforcement learning.
Lesson 2
Lesson 3: The RL Framework: The Solution
• Learn about policies and value functions.
• Derive the Bellman equations.
Lesson 6: Temporal-Difference Methods
• Learn the difference between the Sarsa, Q-Learning, and Expected Sarsa algorithms.
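As a rough illustration of how the three algorithms named in this lesson differ, the sketch below computes the update target each one would use for a single transition. It is a minimal example, not course material: the function name `td_targets`, its parameters, and the assumption of an epsilon-greedy policy are all chosen here for illustration.

```python
def td_targets(reward, next_q, next_action, gamma=0.99, epsilon=0.1):
    """Return (Sarsa, Q-Learning, Expected Sarsa) targets for one transition.

    reward:      reward received after taking the action (assumed scalar)
    next_q:      list of action-value estimates Q(s', a) for the next state
    next_action: action actually selected in s' by the behavior policy
    epsilon:     exploration rate of the assumed epsilon-greedy policy
    """
    n = len(next_q)
    greedy = max(range(n), key=lambda a: next_q[a])

    # Action probabilities under an epsilon-greedy policy (an assumption here).
    probs = [epsilon / n] * n
    probs[greedy] += 1.0 - epsilon

    # Sarsa bootstraps from the action that was actually taken next.
    sarsa = reward + gamma * next_q[next_action]
    # Q-Learning bootstraps from the best next action, whatever was taken.
    q_learning = reward + gamma * max(next_q)
    # Expected Sarsa bootstraps from the policy-weighted average over actions.
    expected_sarsa = reward + gamma * sum(p * q for p, q in zip(probs, next_q))
    return sarsa, q_learning, expected_sarsa
```

In each case the agent would then move its estimate Q(s, a) a small step toward the chosen target; only the bootstrapped term differs between the three methods.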
Lesson 8: RL in Continuous Spaces
• Learn how to adapt traditional algorithms to work with continuous spaces.
Course 2
Value-Based Methods
Leverage neural networks to train an agent that learns intelligent behaviors from sensory data.
Course Project
Navigation
Leverage neural networks to train an agent to navigate a virtual world and collect as many yellow bananas
as possible while avoiding blue bananas.
Deep Q-Learning
• Learn how to implement a Deep Q-Network (DQN), along with Double-DQN, Dueling-DQN, and Prioritized Replay.
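To give a sense of what separates DQN from Double-DQN, the sketch below compares the learning targets the two variants are commonly described as using. It is a simplified illustration under stated assumptions: plain Python lists stand in for the outputs of the online and target networks, and the function names are hypothetical rather than taken from the course.

```python
def dqn_target(reward, next_q_target, gamma=0.99, done=False):
    """DQN target: the target network both selects and evaluates the next action."""
    if done:
        return reward  # no bootstrapping from a terminal state
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double-DQN target: the online network selects the next action,
    and the target network evaluates it."""
    if done:
        return reward
    best = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best]
```

Splitting action selection from action evaluation is the usual explanation for why Double-DQN tends to overestimate action values less than plain DQN.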
Lesson 3: Deep RL for Robotics
• Learn from experts at NVIDIA how to use value-based methods in real-world robotics.
Course 3
Policy-Based Methods
Learn the theory behind evolutionary algorithms and policy-gradient methods. Design your own algorithm to train a simulated
robotic arm to reach target locations.
Course Project
Continuous Control
Train a robotic arm to reach target locations. For an extra challenge, train a four-legged virtual creature
to walk.
Lesson 4: Deep RL for Financial Trading
• Learn from experts at NVIDIA how to use actor-critic methods to generate optimal financial trading strategies.
Course 4
Course Project
Lesson 2: Case Study: AlphaZero
• Master the skills behind DeepMind’s AlphaZero.
Alexis Cook
Curriculum Lead
Alexis is an applied mathematician with a master’s in computer science from Brown University and
a master’s in applied mathematics from the University of Michigan. She was formerly a National
Science Foundation Graduate Research Fellow.
Arpan Chakraborty
Computer Scientist
Arpan is a computer scientist with a PhD from North Carolina State University. He teaches at
Georgia Tech (within the Master of Computer Science program), and is a coauthor of the book
Practical Graph Mining with R.
Mat Leonard
Instructor
Mat is a former physicist, research neuroscientist, and data scientist. He completed his PhD
and postdoctoral fellowship at the University of California, Berkeley.
Luis Serrano
Instructor
Luis was formerly a machine learning engineer at Google. He holds a PhD in mathematics
from the University of Michigan and completed a postdoctoral fellowship at the University of Quebec at
Montreal.
Cezanne is an expert in computer vision with a master’s in electrical engineering from Stanford
University. As a former researcher in genomics and biomedical imaging, she’s applied computer
vision and deep learning to medical diagnostic applications.
Dana Sheahen
Electrical Engineer
Dana is an electrical engineer with a master’s in computer science from Georgia Tech. Her work
experience includes software development for embedded systems in the automotive group at
Motorola, where she was awarded a patent for an onboard operating system.
Chhavi Yadav
Content Developer
Chhavi is a computer science graduate student at New York University where she researches
machine learning algorithms. She is also an electronics engineer and has worked on wireless
systems.
Juan Delgado
Computational Physicist
Juan is a computational physicist with a master’s in astronomy. He is finishing his PhD in biophysics.
He previously worked at NASA developing space instruments and writing software to analyze large
amounts of scientific data using machine learning techniques.
Miguel Morales
Content Developer
• Project review cycle creates a feedback loop with multiple opportunities for
improvement—until the concept is mastered.
• Project reviewers leverage industry best practices and provide pro tips.
• Unlimited access to mentors means help arrives when it’s needed most.
• An average question response time of 2 hours or less ensures that skills development stays on track.
Empower job-readiness.
• Access to a GitHub portfolio review that can give you an edge by highlighting your
strengths and demonstrating your value to employers.*
• Get help optimizing your LinkedIn and establishing your personal brand so your profile
ranks higher in searches by recruiters and hiring managers.
Mentor Network
• Mentors work across more than 30 different industries and have often completed a Nanodegree
program themselves.
12.22.22 | V1.0