
CS 726

Advanced Machine Learning


Course Overview
Sunita Sarawagi
Spring 2025
Scope of the course
Learning to represent, generate, and reason about objects:
○ High-dimensional x = {x1, ..., xn}; the space of x is large
○ Inter-dependent components
Examples:
○ Image
○ Video
○ Time-series
○ Text
Examples of high dimensional spaces

A 1024 x 1024 RGB image is very high-dimensional: it lives in a 1024 * 1024 * 3 ≈ 3 million dimensional real space
Words in a sentence

"If you ask a question, you are a fool only once. If you do not ask, you are a fool forever."

Assume a vocabulary size of 50K.

With each word one-hot encoded, this sentence of 25 words lives in a 25 * 50K = 1.25 million dimensional discrete space
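As a quick sanity check of these sizes, a minimal sketch in Python (the 1024x1024 image, 50K vocabulary, and 25-word sentence are the numbers quoted on these slides):

```python
# Dimensionality of the example objects from these slides.

# A 1024 x 1024 RGB image: one real value per pixel per channel.
image_dims = 1024 * 1024 * 3
print(image_dims)        # 3_145_728, i.e. roughly 3 million real dimensions

# A 25-word sentence with a 50K vocabulary, one-hot encoding each word.
sentence_dims = 25 * 50_000
print(sentence_dims)     # 1_250_000, i.e. 1.25 million discrete dimensions
```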
Different task settings
Given training data D, train a model M that can be used for

● Generation
○ Unconditional: Generate a sample X that is representative of D
○ Conditional: Given an input prompt X, generate a likely sample Y.

● Density estimation:
○ What is the probability that a given sample X comes from the training distribution D?

● Other forms of reasoning:


○ Causality, counterfactual reasoning, recourse on predictions.
Text to text generation
● Write a poem

● Translation

● Text-to-tree generation
Translation
Input: x Predicted sequence: y

• Each token in the output is a random variable, and there is inter-dependence among the output tokens.

• We want to output a probability with the output translation, and not just produce one translation.

• We cannot predict the whole sentence in one shot but need to decompose it into parts.
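One standard way to do this decomposition is the autoregressive chain rule, P(y | x) = prod_t P(y_t | y_1, ..., y_{t-1}, x), which also yields a probability for the whole translation. A minimal sketch, where `next_token_probs` is a hypothetical stand-in for a trained translation model:

```python
import numpy as np

def next_token_probs(x_tokens, y_prefix, vocab_size=5):
    # Hypothetical stand-in for a trained translation model: returns a
    # distribution over the next output token given the input x and the
    # output prefix generated so far. A real model (e.g. a transformer)
    # would go here.
    seed = abs(hash((tuple(x_tokens), tuple(y_prefix)))) % 2**32
    p = np.random.default_rng(seed).random(vocab_size)
    return p / p.sum()

def sequence_log_prob(x_tokens, y_tokens):
    # Chain rule: log P(y | x) = sum_t log P(y_t | y_1..y_{t-1}, x).
    log_p = 0.0
    for t, y_t in enumerate(y_tokens):
        p = next_token_probs(x_tokens, y_tokens[:t])
        log_p += np.log(p[y_t])
    return log_p

# Probability of one candidate translation y for an input x (token ids).
print(sequence_log_prob(x_tokens=[3, 1, 4], y_tokens=[2, 0, 1]))
```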
Text to image generation
● Imagen
● Stable diffusion
Topics for Generation
Goal: Output a distribution P_θ(y | x) over a structured output y = (y1, ..., yn), optionally conditioned on an input x.
● Representation/Modeling: Form of P_θ; how to represent P(y) over a high-dimensional y for easy learnability and efficient inference.
● Training or learning: How to parameterize the distribution and learn the parameters.
● Inference: How to efficiently generate?
Key insight from the course

Decompose high-dimensional objects into smaller, manageable sub-parts.
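As one concrete illustration (an assumed toy setting, not from the slides): factoring a joint distribution over n binary variables into local conditionals, here a simple Markov-chain factorization, shrinks the representation from 2^n numbers to a handful per variable:

```python
import numpy as np

n = 30   # number of binary variables x1, ..., xn

# Storing the full joint P(x1, ..., xn) needs one number per configuration.
full_table_entries = 2 ** n                 # ~1.07 billion numbers

# A Markov-chain factorization P(x) = P(x1) * prod_i P(x_i | x_{i-1})
# needs one 2-entry table plus (n - 1) tables of shape 2 x 2.
chain_entries = 2 + (n - 1) * 4             # 118 numbers
print(full_table_entries, chain_entries)

# Evaluating P(x) under the factorization is just a product of local terms.
p_x1 = np.array([0.6, 0.4])                 # P(x1); illustrative values
p_cond = np.full((n - 1, 2, 2), 0.5)        # P(x_i | x_{i-1}); illustrative values
x = np.random.randint(0, 2, size=n)         # some configuration
p = p_x1[x[0]]
for i in range(1, n):
    p *= p_cond[i - 1, x[i - 1], x[i]]
print(p)
```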
Representation
● With observed variables

● With latent variables

Can we make the dependency graph simpler via factorization?


Representation
● Represent the rate of change of a random
variable (stochastic differential equations)
Learning
● How to parameterize the joint distribution for
sample-efficient learning

● How to efficiently learn the parameters θ of the distribution
○ Training data (conditional): D = {(x1, y1), ..., (xN, yN)}
○ Training data (unconditional): D = {x1, x2, ..., xN}
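A minimal sketch of maximum-likelihood learning of θ from unconditional data, using a one-parameter Bernoulli model as an assumed toy example:

```python
import numpy as np

# Unconditional training data D = {x1, ..., xN}: binary outcomes of a coin.
D = np.array([1, 0, 1, 1, 0, 1, 1, 1, 0, 1])

def neg_log_likelihood(theta, data):
    # NLL of a Bernoulli(theta) model: -sum_i log P(x_i; theta).
    return -np.sum(data * np.log(theta) + (1 - data) * np.log(1 - theta))

# The maximum-likelihood estimate minimizes the NLL; for a Bernoulli model
# it has the closed form theta* = sample mean of D.
theta_mle = D.mean()
print(theta_mle, neg_log_likelihood(theta_mle, D))
```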
Adapting trained distributions

● In-context learning for regression, time-series, and language tasks
● Parameter-efficient fine-tuning
Inference
● Given an x, how to efficiently find the most likely y1, ..., yn: MAP inference.
● How to generate multiple representative examples from the estimated model: Sampling
○ Generate examples that are representative of the distribution
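A toy sketch contrasting the two inference modes. Here `step_probs` is a hypothetical per-step conditional standing in for a trained model, and greedy decoding is only a cheap approximation to exact MAP inference:

```python
import numpy as np

rng = np.random.default_rng(0)

def step_probs(prefix, vocab_size=5):
    # Hypothetical conditional P(y_t | y_1..y_{t-1}); a trained model would go here.
    p = np.arange(1, vocab_size + 1, dtype=float) ** (len(prefix) + 1)
    return p / p.sum()

def greedy_decode(length):
    # Greedy decoding: pick the locally most likely token at each step
    # (a cheap approximation to MAP inference).
    y = []
    for _ in range(length):
        y.append(int(np.argmax(step_probs(y))))
    return y

def sample_decode(length):
    # Sampling: draw each token from its conditional distribution, so repeated
    # calls give multiple representative sequences.
    y = []
    for _ in range(length):
        p = step_probs(y)
        y.append(int(rng.choice(len(p), p=p)))
    return y

print(greedy_decode(5))                        # always the same sequence
print([sample_decode(5) for _ in range(3)])    # diverse sequences
```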
Density estimation
Given D = {x1, x2, ..., xN}, learn P(x) so that, given a new x, we can efficiently calculate the probability of x.

Applications: Out-of-distribution detection, outlier detection, classification

Density estimator
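A minimal sketch of the idea, assuming a single-Gaussian density estimator (real applications would use richer models): fit P(x) to D, then score a new x by its log-density, with very low values flagging outliers or out-of-distribution samples:

```python
import numpy as np
from scipy.stats import multivariate_normal

rng = np.random.default_rng(0)

# Training data D = {x1, ..., xN}, here 2-dimensional points for illustration.
D = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(500, 2))

# Fit the density estimator: P(x) = N(x; mu, Sigma).
mu = D.mean(axis=0)
Sigma = np.cov(D, rowvar=False)
density = multivariate_normal(mean=mu, cov=Sigma)

# Score new points: a very low log-density suggests an outlier or an
# out-of-distribution sample.
print(density.logpdf([0.1, -0.2]))   # near the training data: relatively high
print(density.logpdf([8.0, 8.0]))    # far from the training data: very low
```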
Course contents

Representation of P(X) or P(Y|X)


● Probabilistic graphical models: Bayesian Networks and Markov
Random Fields
○ Exact, efficient, but limited capacity
○ But, important to understand them to build a framework for
probabilistic reasoning
○ Intuitive and easy to incorporate prior knowledge and biases
○ Special Graphical models
■ Gaussian processes: special structure that allows trivial computation of marginals
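To make the "trivial marginals" point concrete (a sketch with an assumed RBF kernel): under a zero-mean GP prior, the marginal over function values at any finite set of inputs X is simply the Gaussian N(0, K(X, X)), with no integration required:

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=1.0):
    # Squared-exponential (RBF) kernel k(a, b) for 1-D inputs.
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

# Marginal of a zero-mean GP prior at a finite set of inputs X:
# the function values f(X) are jointly N(0, K(X, X)).
X = np.array([0.0, 0.5, 1.0])
K = rbf_kernel(X, X)
print(K)       # covariance matrix of the 3-dimensional Gaussian marginal

# Marginal at a single input x: the 1-D Gaussian N(0, k(x, x)).
print(rbf_kernel(np.array([0.3]), np.array([0.3]))[0, 0])   # variance = 1.0
```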
Representation (continued)

● Deep latent variable models:
○ VAEs, GANs, discrete diffusion models – the technology behind the latest image generation models such as Imagen
● Representation via variable transformation: Normalizing flows
● Stochastic differential equations: P(Y|X) where X is time and the distribution is represented via its rate of change → continuous-time diffusion models
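A 1-D sketch of the variable-transformation idea behind normalizing flows (the affine transform is an illustrative toy): push a simple base variable z through an invertible map f and obtain the density of x = f(z) from the change-of-variables formula log p_x(x) = log p_z(f^{-1}(x)) - log |f'(f^{-1}(x))|:

```python
import numpy as np

# Base density: standard normal over z.
def log_p_z(z):
    return -0.5 * (z ** 2 + np.log(2 * np.pi))

# Invertible transform x = f(z) = a * z + b (a toy 1-D "flow").
a, b = 2.0, 1.0
f_inv = lambda x: (x - b) / a        # z = f^{-1}(x)

def log_p_x(x):
    # Change of variables: log p_x(x) = log p_z(f^{-1}(x)) - log |f'(z)|,
    # and for the affine map f'(z) = a everywhere.
    return log_p_z(f_inv(x)) - np.log(abs(a))

# Exact density of x under the transformed distribution, no sampling needed.
print(log_p_x(1.0))
```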
Course contents
Learning
● Parameterization (model architectures for efficient learning)
○ Feature-based like in CRFs
○ Deep neural methods, e.g., transformers
● Training algorithms
○ Maximum likelihood learning
○ Generalized Expectation Maximization: Variational Autoencoders, diffusion models for images
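A tiny numerical check of the evidence lower bound (ELBO) that underlies generalized EM and VAEs, log p(x) >= E_q[log p(x|z)] - KL(q(z|x) || p(z)), on an assumed toy model with a single binary latent variable:

```python
import numpy as np

# Assumed toy model with one binary latent z:
p_z = np.array([0.5, 0.5])            # prior p(z)
p_x1_given_z = np.array([0.9, 0.2])   # likelihood p(x = 1 | z)

# Exact evidence for the observation x = 1.
log_p_x = np.log(np.sum(p_z * p_x1_given_z))

# Any approximate posterior q(z | x) gives a lower bound on log p(x):
# ELBO(q) = E_q[log p(x | z)] - KL(q(z | x) || p(z)).
q = np.array([0.7, 0.3])
elbo = np.sum(q * np.log(p_x1_given_z)) - np.sum(q * np.log(q / p_z))

print(log_p_x, elbo)   # ELBO <= log p(x); equality when q is the true posterior
```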
Learning (continued)
● Advanced topics from deep learning:
○ In-context learning in foundation models
○ Parameter-efficient fine-tuning
○ Model editing
Course contents

Inference
● Boolean queries on conditional inference
● Marginalization queries: P(Xi), max_x P(x)
○ Sum-product and max-product Inference in Graphical Models
● Sampling
○ Classical methods of sampling in tractable models: forward sampling, importance weighted sampling, Markov Chain Monte Carlo (MCMC) sampling
○ Recent methods usable in deep learning: Monte-Carlo with Langevin dynamics
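A minimal sketch of (unadjusted) Langevin-dynamics sampling for a 1-D standard-normal target, whose score is d/dx log p(x) = -x; the step size and burn-in below are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)

def grad_log_p(x):
    # Score of the standard-normal target: d/dx log p(x) = -x.
    return -x

# Unadjusted Langevin dynamics:
# x_{t+1} = x_t + (eps / 2) * grad log p(x_t) + sqrt(eps) * noise
eps, steps = 0.1, 10_000
x = 5.0                               # start far from the target mode
samples = []
for _ in range(steps):
    x = x + 0.5 * eps * grad_log_p(x) + np.sqrt(eps) * rng.standard_normal()
    samples.append(x)

samples = np.array(samples[1_000:])   # drop burn-in
print(samples.mean(), samples.std())  # close to 0 and 1 for the N(0, 1) target
```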
Inference (Continued)
● Inference challenges in modern LLMs (a special Bayesian network)
○ Limitations of greedy decoding
○ Sampling multiple generations
○ Grammar constrained decoding
○ Speculative decoding

● Other forms of Inference
○ Causal effects
○ Algorithmic recourse
Who should take the course
● Students who are interested in doing research in machine learning
● Students who want to learn to think about learning from a probabilistic
perspective in the context of modern deep learning
● Students who want to model learning tasks in a manner that cuts across
applications.
○ The course will cite applications in NLP, vision, time-series, event sequences, and speech
when relevant but it is not primarily about any of these applications.
Mode of running the course
● Two 85-minute slots per week
● SAFE/Moodle quiz on the material covered in the prior week
○ 20 minute duration at a pre-announced time.
○ Grading will be based on the top n-2 out of n quizzes. No compensation for missed quizzes.
○ First quiz on Jan 15th on probability and ML basics
● All materials will be uploaded on Moodle, announcements via Moodle,
questions on Moodle or [email protected]
○ Forum for each topic for discussions and questions.
Evaluation
Approximate credit structure
• 15% In-class Quizzes
• 20% 4-6 graded programming and paper homeworks (in teams of 3)
• 25% Mid-semester exam
• 35% End semester exam
• 3% Scribing
• 2% Attendance and class participation

Course calendar: https://www.cse.iitb.ac.in/~sunita/cs726/
