Syllabus
Syllabus
Inference
1 Course Information
In the information age, we are surrounded with data, much of which contains an
element of randomness, or "noise". Taking advantage of the abundance of informa-
tion requires the ability to make principled inferences and predictions under noisy
conditions.
A wide variety of inference and prediction can be understood in one way or another
as applications of conditional probability. In the first part of the course, we will
develop some of the mathematical tools we need before we can study conditional
probability rigorously. These include proof by induction, combinatorics, set algebra,
and probability spaces. In the middle section, we develop some theory of probability
and discrete random variables, with an emphasis on conditional probability and
Bayes rule. In the final segment, we look at some of the ways conditional probability
can be applied to study random environments. Topics in this section include basic
information theory, discrete state Markov chains (a kind of random process), and
basic elements of Bayesian statistics and decision theory.
1
1.2 Prerequisites
Grade of B or better in any one of ISTA 116, MATH 163, MATH 263, or equiva-
lent. One of CS 245, LING/MATH/PHIL 202, MATH 215 or MATH 243. These
requirements (particularly the second one) may be waived for students with strong
academic records with permission of the instructor. Students who do not meet the
formal course requirements should be very comfortable with mathematical functions
of one and two variables, should have at least some familiarity with basic concepts of
probability, and should be comfortable constructing logical arguments with explicit
justification of steps.
1.5 Readings
I havent found a single textbook that covers everything we need at the right level.
Therefore, the primary readings will be my lecture notes, which I will update as we
go along.
I also recommend the following book chapters, which have complementary cover-
age:
Chapter 1 from:
Jaynes, E.T. (2003). Probability Theory: The Logic of Science. Cambridge
University Press. (Available for free electronically from UA Library)
2
Chapters 4-8 from:
Bolstad, William M. (2007). Introduction to Bayesian Statistics. Hoboken, New
Jersey: John Wiley and Sons.
Chapters 1-6 and 10 from:
Applebaum, David (2008). Probability and Information: An Integrated Approach,
2nd Edition. Cambridge University Press.
This course has two main purposes: First, after taking this course, students should
be accustomed to reasoning probabilistically, incorporating uncertainty in every day
inferences, and should be able to translate real-world problems into mathemati-
cal language. Second, students should become comfortable with the structure of
mathematical arguments, and should be able to assess whether conclusions follow
deductively from definitions and premises. After completing this course, students
will be prepared for 400-level classes in probability, statistics, artificial intelligence
and machine learning.
The required meetings consist of two 75-minute lectures and one 50-minute discussion
/ problem-solving session per week. New topics will be presented in lectures. In
discussion/problem sections we will work through examples, and students will present
their solutions to homework exercises.
The word lecture has its roots in the word reading. As such it is a bit of a
misnomer in an age when students have access to reading materials on their own.
Class meetings should not be a mechanism for the one-way delivery of information
thats what the reading is for. Therefore students are expected to do the relevant
reading before each class, so that class time can be spent interactively: reinforcing
the reading, clarifying difficult concepts, and discussing subtleties.
3
2 Course and Grading Policies
Most people find the material in this class quite challenging. Practicing using con-
cepts and techniques with homework problems is critical for internalizing the ma-
terial; but it is not enough on its own, particularly since it is sometimes possible
to complete homework mechanically, with only a shallow understanding of whats
going on. A deeper understanding occurs when one has to try to explain a concept
or problem to someone else. The goal of the discussion sections is for students to
engage with each other in a way that allows the kind of understanding which is
only possible through explaining to take hold. Ideally this would occur naturally;
however, sometimes concrete incentives are needed. As such 20% of the final grade
comes from attending and participating during class and discussion sections. Stu-
4
dents may miss up to 3 lectures and 3 discussion sections without adversely affecting
their grade.
Late Policy. Assignments are due at the end of class or discussion section on the
posted date. Limited extensions will be entertained, but must be requested before
the assignment is due. The first request by a given student will automatically be
granted; the probability of rejecting an extension increases with each subsequent
request.
2.4 Exams
Three exams will be given at roughly equal intervals. The first two will be takehome
exams, to be completed individually over a short time period (about 48 hours) with-
out collaboration or use of outside sources. The final exam will be in class on the
designated day during finals week.
All exams will be similar in format to the homework: some straightforward calcula-
tion problems, some problems that require representing a scenario mathematically,
and some relatively simple proofs.
5
Make-up Policy. Exams may only be made up in case of a serious, unan-
ticipated emergency. The student or an appointed proxy must contact the in-
structor in advance of the exam. Clearly worded documentation, from a doctor,
dean, etc., demonstrating physical inability to take the exam at the appointed time,
will be required to grant a make-up appointment. No make-ups will be granted
for personal reasons, including travel or personal hardship.
Students have the option of completing a term project at the end of the semester. The
grade on the project may be used to substitute for one of the two takehome midterm
exams, or as a 50 point extra credit assignment, whichever is more beneficial.
There are two options for this project. The first consists of identifying 8-10 exercises
that were neither assigned for a grade nor discussed in recitation, and writing them
up in the manner of a long homework assignment. Problems identified with asterisks
as challenging count double.
The second option involves identifying a real-world phenomenon that can repre-
sented using a probabilistic model with at least one variable that cannot be directly
observed, and at least one that can, such that the distribution of the latter depends
on the value of the former. For example, the unobserved variable could be a pa-
rameter in the distribution of the observed variable; it may be part of the transition
matrix in a Markov chain; or it may simply correspond to which in a set of compet-
ing hypotheses accurately describes the phenomenon. The goal is to (a) define the
relevant variables and explain how they relate, (b) identify a set of questions about
the problem that can be addressed with data, (c) gather or find some data relevant
to the chosen questions, and (d) attempt to answer the question either analytically
(solving equations on paper) or by computer simulation.
For those who took ISTA 116 with me, the second option is similar to the paper
assigned in that class, but taking a Bayesian perspective.
3 University Policies
Classroom Behavior. Students are expected to behave respectfully toward each
other and to the instructor. Disrespectful behavior includes the use of cell phones or
6
other electronic devices in the classroom during class hours.
The Arizona Board of Regents Student Code of Conduct is here: http://dos.web.
arizona.edu/uapolicies/scc5308abcd.html#sccphilosophy
ABOR Policy 5-308, prohibits threats of physical harm to any member of the Uni-
versity community, including to oneself. See: http://policy.web.arizona.edu/
~policy/threaten.shtml.
7
4 Approximate Schedule of Lecture Topics
8
5 Discussion Schedule and Deadlines