0% found this document useful (0 votes)

235 views5 pages

CS 7642, Reinforcement Learning and Decision Making: General Information

This document provides information about the CS 7642 Reinforcement Learning and Decision Making course offered in Spring 2021. It outlines the course instructors, communication methods, objectives, prerequisites, resources, academic honesty policy, schedule, assignments, projects, exams, grading, and policies. The primary goals of the course are to provide a broad survey of reinforcement learning techniques and approaches, develop a deeper understanding of major topics, and skills to build reinforcement learning systems and conduct research in the field.

Uploaded by

Mohamed Fawzy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

235 views5 pages

CS 7642, Reinforcement Learning and Decision Making: General Information

Uploaded by

Mohamed Fawzy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

CS 7642, Reinforcement Learning and

Decision Making
Spring 2021
Instructor of Record:

Charles Isbell, [email protected]

259, College of Computing Building

Creators of Online Material:

Prof. Charles Isbell, [email protected]

Prof. Michael Littman, [email protected]

Head TAs:

Tim Bail, [email protected]

Miguel Morales, [email protected]

Piazza:

Piazza will be our primary source of communication and discussion.

Office Hours:

Check Piazza for weekly announcements.

General Information
Reinforcement Learning and Decision Making is a three-credit course on, well,
Reinforcement Learning and Decision Making. Reinforcement Learning is a
subarea of Machine Learning, that area of Artificial Intelligence that is concerned
with computational artifacts that modify and improve their performance through
experience. This course focuses on automated computational decision making
through a combination of classic papers and more recent work. It examines
efficient algorithms, where they exist, for single-agent and multiagent planning
as well as approaches to learning near-optimal decisions from experience.
Topics include Markov decision processes; stochastic and repeated games;
partially observable Markov decision processes; reinforcement learning; and
interactive reinforcement learning. The class is particularly interested in issues
of generalization, exploration, and representation.

Objectives
There are four primary objectives for the course:

● To provide a broad survey of approaches and techniques in RLDM

● To develop a deeper understanding of several major topics in RLDM
● To develop the design and programming skills that will help you to
build RLDM systems
● To develop the basic skills necessary to pursue research in RLDM

As you will see in the next section, we assume that you are already familiar with
machine learning techniques and have some comfort with doing empirical work
in machine learning. As a result, we emphasize the more computational aspects
of developing decision-making systems. Having said that, our concern with
research is expressed by having students replicate results in published papers
in the area.

Prerequisites
The official prerequisite for this course is an introductory course in machine
learning at the graduate level. While having taken such a course is not strictly
necessary, you will find that the lectures make constant call-backs to material
covered in graduate machine learning courses (and the course offered by the
creators of this material in particular). Of course, having said all that, the most
important prerequisite for enjoying and doing well in this class is your interest in
the material. I say that every semester and in every course, but it's true. In the
end, it will be your own motivation to understand the material that gets you
through it more than anything else. If you are not sure whether this class is for
you, please talk to me.
Resources
● Readings. We use research paper readings, and those will be
provided for you. We also use Sutton and Barto's Reinforcement
Learning book (see:
http://www.incompleteideas.net/book/the-book-2nd.html
● Computing. You will have access to CoC clusters for your
assignments, I suppose, but you won't need them. You are required to
use Python for all assignments, and you can leverage many of the
libraries available to you. However, you are not allowed to use any
reinforcement learning library. All reinforcement learning related code
must be your own. If in doubt, ask.
● Web. We will use Canvas Announcements and Piazza to post
last-minute announcements, so check it early and often. You are
responsible for keeping up with class announcements.

Statement of Academic Honesty

At this point in your academic careers, I feel that it would be impolite to harp on
cheating, so I won't. You are all adults, more or less, and are expected to follow
the university's code of academic conduct (you know, the honor code).
Furthermore, at least some of you are researchers-in-training, and I expect that
you understand proper attribution and the importance of intellectual honesty.

This is not CS 7641. Do not assume anything you read on that syllabus applies
to this in any way, shape, or form. Note that unauthorized use of any previous
semester course materials, such as tests, quizzes, homework, projects, videos,
and any other coursework, is prohibited in this course. You are not to use code
from previous or current students, you must submit your own work. Using these
materials will be considered a direct violation of academic policy and will be
dealt with according to the GT Academic Honor Code.

Furthermore, I do not allow copies of my exams out in the ether (so there should
not be any out there for you to use anyway). Just as you are not to use the
previous material you are not to share current material—including lecture
material—with others either now or in the future. My policy on that is strict. If you
violate the policy in any shape, form, or fashion you will be dealt with according
to the GT Academic Honor Code. I also have several... friends... from Texas
who will help me personally deal with you. They are on retainer from my
Machine Learning course and they've tasted blood.
Readings and Lectures
The online lectures are meant to summarize the readings and stress the
important points. You are expected to critically read any assigned material. Your
active participation in the material, the lectures, and various forums are crucial in
making the course successful. This is less about my teaching than about your
learning. My role is to merely assist you in the process of learning more about
the area.

To help you to pace yourself, I have provided a nominal schedule (check the
Calendar page in Canvas) that tells you when we would be covering material if
we were meeting once a week for three hours during the term. I recommend you
try to keep that pace. More to the point, there are ~weekly assignments that
correspond to the reading material and it will be difficult to do those without at
least passing familiarity with the material.

Grading
Your final grade is divided into three components: homework, projects, and a
final exam.

● Homework. There will be six short homework assignments involving

programming. You will be provided Jupyter Notebooks and will submit
your solution to Gradescope.
● Projects. Students will be asked to replicate results from relevant
papers from the literature. Each of the three projects will consist of a
short write up and submission of your code (Python is required).
● Exams. There will be one written, closed-book final exam scheduled
for our class's final exam. Although I'm told I have a reputation for
creative exams, these exams are meant to be a walk in the park if you
follow and read the material.
When you upload files on Canvas, make sure that all the answers are
clearly visible and the files shown are the ones you want to be
graded. Upon submitting you acknowledge that you are aware that
illegible or incorrect PDFs will receive 0 and you will not be able to
submit for a regrade.

Due Dates
All graded assignments are due by the time and date indicated on Canvas. We
do not accept late submissions for homework assignments. No exceptions
whatsoever. We do accept late project assignments for a 20 point per day
penalty, a max of 5 days, or a 0 grade. The only exceptions to late project
assignment penalties will require: a note from the appropriate authority and
immediate notification of the problem when it arises. Naturally, your excuse
must be acceptable. If an alien parasite that thrives on electronic assignments
gets into your computer and erases all copies of your work from existence, I will
need a signed note from the relevant galactic authorities who have
investigated... in English. We only accept submissions 1 week after the due
date, including any exceptional cases. After that week, you will automatically get
a 0 for that assignment, with no change for a makeup. For cases that require
longer than a week, we suggest dropping the course or asking for an incomplete
semester.

Numbers
Component

Homework (6) 30%

Projects (3) 45%

Exams (1) 25%

In the spirit of mechanism design, the grading scheme is set up so that one can't
blow off reading the material and still earn an A. Similarly, one can't blow off a
project either. Not that you would do either of those things, but it's all about
incentives, people.

Disclaimer
I reserve the right to modify any of these plans as need be during the course of
the class; however, I won't do anything capriciously, anything I do change won't
be too drastic, and you'll be informed as far in advance as possible.

PLE Rubric
No ratings yet
PLE Rubric
1 page
15-922 University Calendar - Fall2016Spring2017 Update 6-16
No ratings yet
15-922 University Calendar - Fall2016Spring2017 Update 6-16
2 pages
Syllabus Harvard Machine Learning Advanced
No ratings yet
Syllabus Harvard Machine Learning Advanced
5 pages
UT Dallas Syllabus For cs6375.002 05f Taught by Yu Chung NG (Ycn041000)
No ratings yet
UT Dallas Syllabus For cs6375.002 05f Taught by Yu Chung NG (Ycn041000)
3 pages
Machine Learning 1
100% (1)
Machine Learning 1
245 pages
L01 Course Overview
No ratings yet
L01 Course Overview
21 pages
CSE 571: Artificial Intelligence (Fall 2018) : Warning
No ratings yet
CSE 571: Artificial Intelligence (Fall 2018) : Warning
7 pages
CSE 571: Artificial Intelligence (Spring 2020) : Warning
No ratings yet
CSE 571: Artificial Intelligence (Spring 2020) : Warning
8 pages
ITMD513 SP2023 Syllabus Shamsuddin PDF
No ratings yet
ITMD513 SP2023 Syllabus Shamsuddin PDF
7 pages
Course Student Manual
No ratings yet
Course Student Manual
7 pages
DS 669-102 - Reinforcement Learning New Jersey Institute
No ratings yet
DS 669-102 - Reinforcement Learning New Jersey Institute
4 pages
Syllabus EECE3324 Fall2021
No ratings yet
Syllabus EECE3324 Fall2021
7 pages
Zoom Links For Classe, Lab, and Office Hours Are Available Online Through The Zoom Canvas Module
No ratings yet
Zoom Links For Classe, Lab, and Office Hours Are Available Online Through The Zoom Canvas Module
4 pages
GTL Peru 2025 - Week 0 Instructions
No ratings yet
GTL Peru 2025 - Week 0 Instructions
2 pages
Course DIT822: Software Engineering For AI Systems
No ratings yet
Course DIT822: Software Engineering For AI Systems
34 pages
Deep Learning - SP24
No ratings yet
Deep Learning - SP24
6 pages
Iuc Aml
No ratings yet
Iuc Aml
3 pages
MJ Cs303e
No ratings yet
MJ Cs303e
10 pages
Syllabus CSCI 8000 Efficient Deep Learning 2024fall
No ratings yet
Syllabus CSCI 8000 Efficient Deep Learning 2024fall
4 pages
Se 801
No ratings yet
Se 801
3 pages
Syllabus Ee541 22sp
No ratings yet
Syllabus Ee541 22sp
7 pages
Handout: Course Information: CS 229 Machine Learning
No ratings yet
Handout: Course Information: CS 229 Machine Learning
4 pages
CS7643 Deep Learning Syllabus and Schedule - v2
No ratings yet
CS7643 Deep Learning Syllabus and Schedule - v2
9 pages
Syllabus
No ratings yet
Syllabus
3 pages
COL333/671: Introduction To AI
No ratings yet
COL333/671: Introduction To AI
17 pages
1 Introduction
No ratings yet
1 Introduction
77 pages
Machine Learning
No ratings yet
Machine Learning
3 pages
6040 Syllabus
No ratings yet
6040 Syllabus
7 pages
Sum 2025 CS7643 Deep Learning Syllabus and Schedule - v2
No ratings yet
Sum 2025 CS7643 Deep Learning Syllabus and Schedule - v2
11 pages
Syllabus
No ratings yet
Syllabus
10 pages
CS-UY 1114 Fall 2020 Syllabus
No ratings yet
CS-UY 1114 Fall 2020 Syllabus
6 pages
CS378 - Generative Visual Computing
No ratings yet
CS378 - Generative Visual Computing
4 pages
Syllabus For Computational Engineering
No ratings yet
Syllabus For Computational Engineering
10 pages
Syllabi-CS 7643 2024-1
No ratings yet
Syllabi-CS 7643 2024-1
11 pages
Robot Learning FRI II Fall 2024
No ratings yet
Robot Learning FRI II Fall 2024
4 pages
ECE597MS Syllabus
No ratings yet
ECE597MS Syllabus
4 pages
Problem Solving With Computers Syllabus
No ratings yet
Problem Solving With Computers Syllabus
4 pages
CS320 UW Madison
No ratings yet
CS320 UW Madison
5 pages
CS 229 Machine Learning Handout #1: Course Information: Teaching Staff and Contact Info
No ratings yet
CS 229 Machine Learning Handout #1: Course Information: Teaching Staff and Contact Info
4 pages
BIT 1400 - CourseOutline - Fall 2023
No ratings yet
BIT 1400 - CourseOutline - Fall 2023
7 pages
Lec0 Logistics
No ratings yet
Lec0 Logistics
40 pages
Class Assessments: Artificial Intelligence - CS-6601-O01
No ratings yet
Class Assessments: Artificial Intelligence - CS-6601-O01
5 pages
Syllabus+CSE 6363 003 Fall 1
No ratings yet
Syllabus+CSE 6363 003 Fall 1
26 pages
ECE 579 Murphey W25
No ratings yet
ECE 579 Murphey W25
5 pages
CS 5720 Neural Network & Deep Learning - Fall24 - Syllabus
No ratings yet
CS 5720 Neural Network & Deep Learning - Fall24 - Syllabus
10 pages
EE306 Telang F19 Descriptor
No ratings yet
EE306 Telang F19 Descriptor
8 pages
Cs 7638 Syllabus and Schedule 2023-2
No ratings yet
Cs 7638 Syllabus and Schedule 2023-2
6 pages
Course Information
No ratings yet
Course Information
4 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
3 pages
Handout #1: Course Information: CS 229 Machine Learning
No ratings yet
Handout #1: Course Information: CS 229 Machine Learning
3 pages
Applied Data Analytics With Python
No ratings yet
Applied Data Analytics With Python
14 pages
CSC 520 AI 2018 Spring Syllabus
No ratings yet
CSC 520 AI 2018 Spring Syllabus
7 pages
Syllabi-CS 6515 2024-1
No ratings yet
Syllabi-CS 6515 2024-1
3 pages
INF385T IMLsyllabus
No ratings yet
INF385T IMLsyllabus
4 pages
Boston University QS 108
No ratings yet
Boston University QS 108
4 pages
Spring 2022 CS7643 Deep Learning Syllabus and Schedule - v5.1
No ratings yet
Spring 2022 CS7643 Deep Learning Syllabus and Schedule - v5.1
11 pages
Artificial Intelligence Undergraduate Curriulum
No ratings yet
Artificial Intelligence Undergraduate Curriulum
53 pages
AA Syllabus 2024 25
No ratings yet
AA Syllabus 2024 25
4 pages
Amath301 Syllabus
No ratings yet
Amath301 Syllabus
7 pages
Masters Thesis
No ratings yet
Masters Thesis
58 pages
Rat Model 1
No ratings yet
Rat Model 1
1 page
Math Action Plan
No ratings yet
Math Action Plan
4 pages
6 Bài Task 2 2023 Kien Luyen
No ratings yet
6 Bài Task 2 2023 Kien Luyen
6 pages
Worksheet Unit 1 Class 11
0% (1)
Worksheet Unit 1 Class 11
4 pages
Canton Celebrates Program-2
No ratings yet
Canton Celebrates Program-2
3 pages
Listening Mark Sheet
No ratings yet
Listening Mark Sheet
2 pages
Measurement of Intelligence
No ratings yet
Measurement of Intelligence
28 pages
Critical Reflection Name: Nabilah Binti Mansor I/C No: 920122-04-5138 MATRIC NO: 0610/1863
No ratings yet
Critical Reflection Name: Nabilah Binti Mansor I/C No: 920122-04-5138 MATRIC NO: 0610/1863
3 pages
SCCCS Startalk課程教學一覽表
No ratings yet
SCCCS Startalk課程教學一覽表
1 page
Educational Administration
No ratings yet
Educational Administration
6 pages
Chapter 3: Market Integration Semi-Final
No ratings yet
Chapter 3: Market Integration Semi-Final
8 pages
Rudiments of A Seminar
No ratings yet
Rudiments of A Seminar
7 pages
Leadership and Managerial Capabilities of Secondary School Heads in The Division of Negros Occidental Basis For Enhanced Capability Training Program
No ratings yet
Leadership and Managerial Capabilities of Secondary School Heads in The Division of Negros Occidental Basis For Enhanced Capability Training Program
9 pages
Educ 312 Prelim Module
100% (1)
Educ 312 Prelim Module
32 pages
Sw-1738323277-02. Arusha Region Socio-Economic Profile
No ratings yet
Sw-1738323277-02. Arusha Region Socio-Economic Profile
345 pages
Chapter 2
No ratings yet
Chapter 2
24 pages
Prospectus Session 2016 17 PDF
No ratings yet
Prospectus Session 2016 17 PDF
120 pages
155-Article Text-153-1-10-20100319
No ratings yet
155-Article Text-153-1-10-20100319
14 pages
Unit 1
No ratings yet
Unit 1
4 pages
4 Oralcom - Nov For Student
No ratings yet
4 Oralcom - Nov For Student
16 pages
DLP 3rd Quarter
100% (2)
DLP 3rd Quarter
92 pages
Adnan CV
No ratings yet
Adnan CV
4 pages
Message From Gerard Deeb: 28Th May Derby Day
No ratings yet
Message From Gerard Deeb: 28Th May Derby Day
6 pages
1726117202943
No ratings yet
1726117202943
7 pages
Engineering
No ratings yet
Engineering
8 pages
The 5th Sustainable Development Goal
No ratings yet
The 5th Sustainable Development Goal
1 page

CS 7642, Reinforcement Learning and Decision Making: General Information

Uploaded by

CS 7642, Reinforcement Learning and Decision Making: General Information

Uploaded by

CS 7642, Reinforcement Learning and

Charles Isbell, ​[email protected]

Creators of Online Material:

Prof. Charles Isbell, ​[email protected]

Tim Bail, ​[email protected]

Piazza will be our primary source of communication and discussion.

Check Piazza for weekly announcements.

● To provide a broad survey of approaches and techniques in RLDM

Statement of Academic Honesty

● Homework.​ There will be six short homework assignments involving

Homework (6) 30%

Projects (3) 45%

Exams (1) 25%

You might also like

Charles Isbell, [email protected]

Prof. Charles Isbell, [email protected]

Tim Bail, [email protected]

● Homework. There will be six short homework assignments involving