Lecture 2 Autoregressive Models

This document outlines a lecture on autoregressive models for deep unsupervised learning. It discusses the limitations of simple generative models like histograms in high dimensions and introduces parameterized distributions and maximum likelihood training as solutions. Autoregressive models are presented as modern neural network approaches that decompose complex joint distributions into conditional distributions. Recurrent neural networks and masking-based models are given as examples of autoregressive models.


CS294-158 Deep Unsupervised Learning

Lecture 2 Likelihood Models: Autoregressive Models

Pieter Abbeel, Xi (Peter) Chen, Jonathan Ho, Aravind Srinivas, Alex Li, Wilson Yan
UC Berkeley
Outline
- Motivation
- Simple generative models: histograms
- Modern neural autoregressive models
- Parameterized distributions and maximum likelihood
- Autoregressive Models
- Recurrent Neural Nets
- Masking-based Models

Likelihood-based models
Problems we’d like to solve:
- Generating data: synthesizing images, videos, speech, text
- Compressing data: constructing efficient codes
- Anomaly detection
Likelihood-based models: estimate pdata from samples x(1), …, x(n) ~ pdata(x)
The goal is to learn a distribution p that allows:
- Computing p(x) for arbitrary x
- Sampling x ~ p(x)
Today: discrete data

Desiderata
We want to estimate distributions of complex, high-dimensional data
- A 128x128x3 image lies in a ~50,000-dimensional space
We also want computational and statistical efficiency
- Efficient training and model representation
- Expressiveness and generalization
- Sampling quality and speed
- Compression rate and speed

Learning: Estimate frequencies by counting
Recall: the goal is to estimate pdata from samples
x(1), …, x(n) ~ pdata(x)
Suppose the samples take on values in a finite set
{1, …, k}
The model: a histogram
- (Redundantly) described by k nonnegative
numbers: p1, …, pk
- To train this model: count frequencies
pi = (# times i appears in the dataset) /
(# points in the dataset)

Inference and Sampling
Inference (querying pi for arbitrary i): simply a lookup into the array p1, …, pk

Sampling (lookup into the inverse cumulative distribution function)


1. From the model probabilities p1, …, pk, compute the cumulative
distribution
Fi = p1 + ⋯ + pi for all i ∈ {1, …, k}
2. Draw a uniform random number u ~ [0, 1]
3. Return the smallest i such that u ≤ Fi
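
A minimal sketch of this counting-and-sampling recipe (my own illustration in NumPy, not from the slides; the function names are mine):

import numpy as np

def fit_histogram(data, k):
    # data: array of integers in {0, ..., k-1}; counting frequencies is the MLE
    counts = np.bincount(data, minlength=k)
    return counts / counts.sum()                 # p_1, ..., p_k

def sample(p, num_samples, rng=np.random.default_rng()):
    F = np.cumsum(p)                             # F_i = p_1 + ... + p_i
    u = rng.uniform(size=num_samples)            # u ~ Uniform[0, 1]
    return np.searchsorted(F, u)                 # smallest i with u <= F_i

p = fit_histogram(np.array([0, 1, 1, 3, 2, 1]), k=4)
print(p, sample(p, 5))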

Are we done?

Failure in high dimensions
No, because of the curse of dimensionality. Counting fails when
there are too many bins.
- (Binary) MNIST: 28x28 images, each pixel in {0, 1}
- There are 2^784 ≈ 10^236 probabilities to estimate
- Any reasonable training set covers only a tiny fraction of this
- Each image influences only one parameter. No generalization
whatsoever!

Problematic even for single variable

learned histogram = training data distribution

→ often poor generalization

Parameterized Distributions

Fitting a parameterized distribution often generalizes better

Status
- Issues with histograms
- High dimensions: won’t work
- Even 1-d: if many values in the domain, prone to overfitting

- Solution: function approximation. Instead of storing each probability, store a parameterized function pθ(x)

Likelihood-based generative models
Recall: the goal is to estimate pdata from x(1), …, x(n) ~ pdata(x)

Now we introduce function approximation: learn θ so that pθ(x) ≈ pdata(x).


- How do we design function approximators to effectively represent
complex joint distributions over x, yet remain easy to train?
- There will be many choices for model design, each with different tradeoffs
and different compatibility criteria.

Designing the model and the training procedure go hand-in-hand.

Fitting distributions
- Given data x(1), …, x(n) sampled from a “true” distribution pdata
- Set up a model class: a set of parameterized distributions pθ
- Pose a search problem over parameters: arg minθ loss(θ, x(1), …, x(n))

- Want the loss function + search procedure to:


- Work with large datasets (n is large, say millions of training examples)
- Yield θ such that pθ matches pdata — i.e. the training algorithm works. Think of
the loss as a distance between distributions.
- Note that the training procedure can only see the empirical data distribution,
not the true data distribution: we want the model to generalize.

Maximum likelihood
- Maximum likelihood: given a dataset x(1), …, x(n), find θ by solving the optimization
problem
      arg minθ (1/n) Σi −log pθ(x(i))

- Statistics tells us that if the model family is expressive enough and enough data
is given, then solving the maximum likelihood problem will recover the parameters
that generated the data
- Equivalent to minimizing KL divergence between the empirical data distribution
and the model
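
A one-line check of that equivalence (standard derivation, not spelled out on the slide): writing p̂data for the empirical data distribution,

      KL(p̂data ‖ pθ) = Ex~p̂data[ log p̂data(x) − log pθ(x) ] = −H(p̂data) − Ex~p̂data[ log pθ(x) ]

and the entropy term does not depend on θ, so minimizing the KL divergence is the same as maximizing the expected log-likelihood.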

Stochastic gradient descent
Maximum likelihood is an optimization problem. How do we solve it?
Stochastic gradient descent (SGD).
- SGD minimizes expectations: for f a differentiable function of θ, it solves
      minθ Ex[ f(θ, x) ]

- With maximum likelihood, the optimization problem is
      minθ Ex~pdata[ −log pθ(x) ]

- Why maximum likelihood + SGD? It works with large datasets and is compatible
with neural networks.
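
A toy sketch of what maximum likelihood + SGD looks like for the simplest parameterized model, a categorical distribution over k values with logits θ (my own example in NumPy; the target probabilities are made up, and the NLL gradient softmax(θ) − onehot(x) is written out by hand rather than computed by autodiff):

import numpy as np

rng = np.random.default_rng(0)
k, lr = 4, 0.1
theta = np.zeros(k)                                  # logits parameterizing p_theta
data = rng.choice(k, size=10000, p=[0.1, 0.5, 0.2, 0.2])

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

for step in range(1000):
    batch = rng.choice(data, size=64)                # minibatch from the dataset
    p = softmax(theta)
    onehot_mean = np.bincount(batch, minlength=k) / len(batch)
    grad = p - onehot_mean                           # gradient of the average NLL w.r.t. theta
    theta -= lr * grad                               # SGD step
print(softmax(theta))                                # approaches [0.1, 0.5, 0.2, 0.2]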

Designing the model
- Key requirement for maximum likelihood + SGD: efficiently compute log p(x) and
its gradient
- We will choose models pθ to be deep neural networks, which work in the regime
of high expressiveness and efficient computation (assuming specialized hardware)
- How exactly do we design these networks?
- Any setting of θ must define a valid probability distribution over x: pθ(x) ≥ 0 and Σx pθ(x) = 1

- log pθ(x) should be easy to evaluate and differentiate with respect to θ


- This can be tricky to set up!

Bayes nets and neural nets
Main idea: place a Bayes net structure (a directed acyclic graph) over the variables in
the data, and model the conditional distributions with neural networks.
Reduces the problem to designing conditional likelihood-based models for single
variables. We know how to do this: the neural net takes variables being conditioned on
as input, and outputs the distribution for the variable being predicted.

Autoregressive models
- First, given a Bayes net structure, setting the conditional distributions to neural
networks will yield a tractable log likelihood and gradient. Great for maximum
likelihood training!

- But is it expressive enough? Yes, assuming a fully expressive Bayes net structure:
any joint distribution can be written as a product of conditionals
      p(x) = ∏i p(xi | x1, …, xi−1)

- This is called an autoregressive model. So, an expressive Bayes net structure with
neural network conditional distributions yields an expressive model for p(x) with
tractable maximum likelihood training.

A toy autoregressive model
Two variables: x1, x2
Model: p(x1, x2) = p(x1) p(x2|x1)
- p(x1) is a histogram
- p(x2|x1) is a multilayer perceptron
- Input is x1
- Output is a distribution over x2 (logits, followed by softmax)
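
A forward-pass sketch of this two-variable model (my own illustration in NumPy; the weights are random here and would be fit by maximum likelihood + SGD as above):

import numpy as np
rng = np.random.default_rng(0)
K, H = 4, 16                                     # values per variable, hidden units

p_x1 = np.full(K, 1.0 / K)                       # p(x1): a histogram (uniform to start)
W1, b1 = rng.normal(size=(K, H)), np.zeros(H)    # MLP parameters for p(x2 | x1)
W2, b2 = rng.normal(size=(H, K)), np.zeros(K)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def p_x2_given_x1(x1):
    h = np.tanh(np.eye(K)[x1] @ W1 + b1)         # one-hot input -> hidden
    return softmax(h @ W2 + b2)                  # logits -> distribution over x2

def log_p(x1, x2):
    return np.log(p_x1[x1]) + np.log(p_x2_given_x1(x1)[x2])

def sample():
    x1 = rng.choice(K, p=p_x1)
    x2 = rng.choice(K, p=p_x2_given_x1(x1))
    return x1, x2

print(log_p(2, 1), sample())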

One function approximator per conditional
Does this extend to high dimensions?
- Somewhat. For d-dimensional data, O(d) parameters
- Much better than O(exp(d)) in tabular case
- What about text generation where d can be arbitrarily large?
- Limited generalization
- No information sharing among different conditionals
Solution: share parameters among conditional distributions. Two
approaches:
- Recurrent neural networks

- Masking

RNN autoregressive models - char-rnn

[Figure: the input is the sequence of characters seen so far; the output is a distribution over the character at the ith position]

[Karpathy, 2015]
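
A bare-bones sketch of the idea (not Karpathy's code; a single-layer RNN forward pass in NumPy with made-up sizes) that maps a character prefix to a distribution over the next character:

import numpy as np
rng = np.random.default_rng(0)
V, H = 65, 128                                   # vocabulary size, hidden size

Wxh = rng.normal(0, 0.01, (V, H))                # input-to-hidden
Whh = rng.normal(0, 0.01, (H, H))                # hidden-to-hidden, shared across time steps
Why = rng.normal(0, 0.01, (H, V))                # hidden-to-output logits

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def next_char_distribution(prefix_ids):
    # p(x_{i+1} | x_1, ..., x_i): run the RNN over the prefix, read out the logits
    h = np.zeros(H)
    for c in prefix_ids:
        h = np.tanh(np.eye(V)[c] @ Wxh + h @ Whh)
    return softmax(h @ Why)

print(next_char_distribution([4, 8, 15, 16]).shape)   # (65,)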

MNIST
■ Handwritten digits
■ 28x28
■ 60,000 train
■ 10,000 test

■ Original: greyscale
■ “Binarized MNIST” -- 0/1 (black/white)

RNN on MNIST

RNN with Pixel Location Appended on MNIST
■ Append (x,y) coordinates of pixel in the image as input to RNN

Outline
- Motivation
- Simple generative models: histograms
- Modern neural autoregressive models
- Parameterized distributions and maximum likelihood
- Autoregressive Models
- Recurrent Neural Nets
- Masking-based Models
  - MADE
  - Masked Convolutions
  - WaveNet
  - PixelCNN (+variations)

Masking-based autoregressive models
Second major branch of neural AR models
■ Key property: parallelized computation of all conditionals
■ Masked MLP (MADE)
■ Masked convolutions & self-attention
■ Also share parameters across time

Masked Autoencoder for Distribution Estimation (MADE)

Masked Autoencoder for Distribution Estimation (MADE)
General principle

MADE on MNIST

Masked Autoencoder for Distribution Estimation (MADE)

# param: an ordinary fully connected weight matrix, shape [in_size, out_size]
# x: layer input, shape [batch, in_size]
# y: autoregressive pre-activations
mask = get_linear_ar_mask(in_size, out_size)
# creates a strictly upper-triangular 0/1 mask, e.g. for 3x3:
# array([[0., 1., 1.],
#        [0., 0., 1.],
#        [0., 0., 0.]], dtype=float32)
# so output column j only receives input from dimensions i < j
y = tf.matmul(x, param * mask)
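
For completeness, one possible implementation of the masking helper used above (my own sketch assuming NumPy and TensorFlow; the course's actual helper may differ, and MADE's hidden-layer masks additionally use per-unit degree assignments):

import numpy as np
import tensorflow as tf

def get_linear_ar_mask(in_size, out_size):
    # Entry (i, j) is 1 iff j > i, so output j can only see inputs i < j.
    mask = np.triu(np.ones((in_size, out_size), dtype=np.float32), k=1)
    return tf.constant(mask)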

MADE results

MADE -- Different Orderings
All orderings achieve roughly the same bits per dim, but samples are different

[Samples under different orderings: random permutation; even then odd indices; rows (raster scan); columns; top to middle, then bottom to middle]

MADE: Multiple Orderings

Masked Temporal (1D) Convolution
p(xi+1| x<=i)

● Easy to implement, masking part of the conv kernel
● Constant parameter count for
variable-length distribution!
● Efficient to compute, convolution has
hyper-optimized implementations on
all hardware

However
● Limited receptive field, linear in
number of layers


WaveNet

● Improved receptive field: dilated convolution, with exponential dilation
● Better expressivity: Gated Residual blocks, Skip connections
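
A rough sketch of the dilation pattern (my own illustration, assuming a recent TensorFlow with tf.nn.conv1d and left padding; gating, residual and skip connections are omitted):

import tensorflow as tf

def causal_dilated_conv(x, kernel, dilation):
    # x: [batch, time, channels]; kernel: [width, in_channels, out_channels]
    k = kernel.shape[0]
    pad = (k - 1) * dilation                     # left-pad so output t only sees inputs <= t
    x = tf.pad(x, [(0, 0), (pad, 0), (0, 0)])
    return tf.nn.conv1d(x, kernel, stride=1, padding='VALID', dilations=dilation)

x = tf.random.normal([1, 32, 8])
for d in [1, 2, 4, 8]:                           # receptive field grows exponentially with depth
    x = tf.nn.relu(causal_dilated_conv(x, tf.random.normal([2, 8, 8]), d))
print(x.shape)                                   # (1, 32, 8)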

WaveNet on MNIST

WaveNet with Pixel Location Appended on MNIST
■ Append (x,y) coordinates of pixel in the image as input to
WaveNet

Masked Temporal (1D) Convolution
# More efficient implementation possible by padding instead of masking kernels
# k: size of kernel along the time dimension
# kernel: convolution weights
padded_x = tf.pad(x, [
    (0, 0), (k - 1, 0),   # left-pad time with k-1 zeros -> causal ("masked") convolution
    (0, 0), (0, 0)
])
y = tf.nn.conv2d(padded_x, kernel, strides=1, padding='VALID')

Masked Spatial (2D) Convolution - PixelCNN
■ Images can be flattened into 1D vectors, but they are fundamentally 2D
■ We can use a masked variant of ConvNet to exploit this knowledge
■ First, we impose an autoregressive ordering on 2D images:

This is called raster scan ordering. (Different orderings are possible; more on this later.)

PixelCNN
■ Design question: how to design a masking method to obey
that ordering?
■ One possibility: PixelCNN (2016)
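
To make the idea concrete, here is one way to build such a mask for a 2D kernel (my own NumPy sketch following the raster-scan rule; PixelCNN distinguishes 'type A' masks, which exclude the current pixel, from 'type B' masks, which include it):

import numpy as np

def pixelcnn_mask(kh, kw, mask_type='A'):
    # kh x kw kernel; 1 = connection allowed, 0 = masked out
    mask = np.ones((kh, kw), dtype=np.float32)
    ch, cw = kh // 2, kw // 2                    # kernel center = current pixel
    mask[ch, cw + (1 if mask_type == 'B' else 0):] = 0.0   # pixels to the right (and center, for type A)
    mask[ch + 1:, :] = 0.0                       # all rows below the current one
    return mask

print(pixelcnn_mask(3, 3, 'A'))
# [[1. 1. 1.]
#  [1. 0. 0.]
#  [0. 0. 0.]]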

Softmax Sampling

[Animation, slides by Aaron van den Oord: the network outputs a softmax distribution over the 256 pixel values (0 to 255) for the current pixel; a value is sampled from that distribution, fed back into the model, and the process repeats for the next pixel.]

PixelCNN
■ PixelCNN-style masking has one problem: blind spot in
receptive field

Gated PixelCNN
■ Gated PixelCNN (2016) introduced a fix by combining two
streams of convolutions
How?

This is easy, we know how to do 1D masked conv

Gated PixelCNN
■ Vertical stack: through padding, activations at ith row only
depend on input before ith row

Gated PixelCNN
■ Improved ConvNet architecture: Gated ResNet Block

Gated PixelCNN
■ Better receptive field + more expressive architecture = better
performance

PixelCNN++
■ Moving away from the 256-way softmax: nearby pixel values (e.g. 127 and 128) should get
similar probability, but a softmax treats them as unrelated categories

Recap: Logistic distribution

[pdf and cdf of the logistic distribution; cdf(x) = sigmoid((x - mu) / scale)]

Mixture of Logistics -- Discrete Distribution
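
For reference, the discretized mixture-of-logistics likelihood used by PixelCNN++ assigns each integer pixel value the probability mass of a unit-length bin under a mixture of K logistics (standard form from the paper; the edge values 0 and 255 absorb the remaining tails):

      P(x | π, μ, s) = Σi πi [ σ((x + 0.5 − μi) / si) − σ((x − 0.5 − μi) / si) ],   where σ is the sigmoid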

Ex. Training Mixture of Logistics

PixelCNN++
■ Capture long dependencies efficiently by downsampling

PixelCNN++

Masked Attention
■ A recurring problem for convolution: limited receptive field ->
hard to capture long-range dependencies
■ (Self-)Attention: an alternative that has
■ unlimited receptive field!!
■ also O(1) parameter scaling w.r.t. data dimension
■ parallelized computation (versus RNN)

Attention

Self-attention when qi also generated from x

Self-Attention

[Figure: convolution vs. self-attention]

Masked Attention

Attention logits are computed as usual, then a large constant is subtracted from positions that must be hidden: score(q, ki) − masked(ki, q) · 10^10, so masked positions receive essentially zero weight after the softmax.
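
A minimal sketch of causal (masked) self-attention over a 1D sequence using exactly this trick (my own NumPy illustration; a single head, with no output projection):

import numpy as np
rng = np.random.default_rng(0)
T, D = 6, 8                                      # sequence length, model dimension
x = rng.normal(size=(T, D))

Wq, Wk, Wv = (rng.normal(size=(D, D)) for _ in range(3))
Q, K, V = x @ Wq, x @ Wk, x @ Wv

scores = Q @ K.T / np.sqrt(D)                    # [T, T] attention logits
causal = np.tril(np.ones((T, T)))                # 1 where attending is allowed (j <= i)
scores = scores - (1.0 - causal) * 1e10          # push future positions toward -infinity

weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights = weights / weights.sum(axis=-1, keepdims=True)
out = weights @ V                                # position i only mixes values from j <= i
print(out.shape)                                 # (6, 8)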

Masked Attention
■ Much more flexible than masked convolution. We can design
any autoregressive ordering we want
■ An example:

Zigzag ordering
- How to implement with masked
conv?
- Trivial to do with masked attention!

Masked Attention + Convolution

Masked Attention + Convolution

Gated PixelCNN PixelCNN++ PixelSNAIL

Multi-Head Self-Attention on MNIST

Masked Attention + Convolution

Sample Quality

Which set of samples are generated by a GAN versus an AR model?

AR models can have good samples
■ Good samples can be achieved by conditioning generation on a selectively chosen subset of bits
■ Grayscale PixelCNN
■ Subscale Pixel Network

Class-Conditional PixelCNN

How to condition?

IN: a one-hot encoding of the class label

THEN: in each convolutional layer it is multiplied by a different learned weight
matrix and added as a channel-wise bias, broadcast spatially over all positions
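
Schematically, the conditioning looks like this (my own sketch assuming a TF-style conv layer; V is the learned class-embedding matrix for one layer):

import tensorflow as tf

def conditional_conv(x, kernel, V, class_onehot):
    # x: [batch, H, W, C_in], kernel: [kh, kw, C_in, C_out]
    # V: [num_classes, C_out], class_onehot: [batch, num_classes]
    y = tf.nn.conv2d(x, kernel, strides=1, padding='SAME')
    bias = class_onehot @ V                      # one bias vector per example, [batch, C_out]
    return y + bias[:, None, None, :]            # broadcast spatially over H and W

x = tf.random.normal([2, 8, 8, 16])
kernel = tf.random.normal([3, 3, 16, 32])
V = tf.random.normal([10, 32])
labels = tf.one_hot([3, 7], depth=10)
print(conditional_conv(x, kernel, V, labels).shape)   # (2, 8, 8, 32)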

Hierarchical Autoregressive Models with Auxiliary Decoders

De Fauw, Jeffrey, Sander Dieleman, and Karen Simonyan. "Hierarchical autoregressive image models with auxiliary decoders." arXiv
preprint arXiv:1903.04933 (2019).
Image Super-Resolution with PixelCNN
■ A PixelCNN is conditioned on 7 x 7 subsampled MNIST images to generate the corresponding 28 x 28 image

Pixel Recursive Super Resolution

Hierarchy: Grayscale PixelCNN
■ Design an autoregressive model
architecture that takes
advantage of the structure of
data
■ Learn a PixelCNN on binary
images, and a PixelCNN
conditioned on binary images to
generate colored images

PixelCNN Models with Auxiliary Variables for Natural Image Modeling

Neural autoregressive models: the good
Best in class modelling performance:
■ expressivity - autoregressive factorization is general

■ generalization - meaningful parameter sharing has good inductive bias

-> State of the art models on multiple datasets, modalities

Masked autoregressive models: the bad
● Sampling each pixel = 1 forward pass!
● 11 minutes to generate 16 32-by-32 images on a Tesla K40 GPU
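
To see where the cost comes from, here is the shape of the sampling loop (schematic Python of my own; `model` stands in for any masked AR model that returns per-pixel softmax probabilities for the whole image):

import numpy as np

def sample_image(model, H=32, W=32):
    x = np.zeros((H, W), dtype=np.int64)
    for i in range(H):
        for j in range(W):
            probs = model(x)[i, j]               # one full forward pass per pixel
            x[i, j] = np.random.choice(256, p=probs)
    return x                                     # H*W = 1024 forward passes for a 32x32 image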

Speedup by caching activations
https://github.com/PrajitR/fast-pixel-cnn

Speedup by breaking autoregressive pattern
■ O(d) -> O(log(d)) sampling steps by parallelizing within groups {2, 3, 4}
■ Cannot capture dependencies within each group: this is fine if all pixels in a group
really are conditionally independent
■ Most often they are not; then you trade expressivity for sampling speed

Multiscale PixelCNN

Improved sampling speed

More limited modelling capacity

Scaling Autoregressive Video Models

[Dirk Weissenborn, Oscar Tackstrom, Jakob Uszkoreit. “Scaling Autoregressive Video Models.” arXiv 1906.02634 (2019)]

Scaling Autoregressive Video Models -- BAIR Robot Pushing
Large Spatiotemporal Subscaling Small Spatiotemporal Subscaling

[Dirk Weissenborn, Oscar Tackstrom, Jakob Uszkoreit. “Scaling Autoregressive Video Models.” arXiv 1906.02634 (2019)]

Scaling Autoregressive Video Models -- Kinetics
Cooking (left-to-right by likelihood) Full Kinetics (left-to-right by likelihood)

[Dirk Weissenborn, Oscar Tackstrom, Jakob Uszkoreit. “Scaling Autoregressive Video Models.” arXiv 1906.02634 (2019)]

Natural Image Manipulation for Autoregressive Models using Fisher Scores

■ Main challenge:
■ How to get a latent representation from PixelCNN?
■ Why hard? The randomness enters per pixel at sampling time, so there is no single latent vector to read off

■ Proposed solution
■ Use Fisher score

Note: applicable to any likelihood model
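
For reference, the Fisher score of a data point is the gradient of its log-likelihood with respect to the model parameters,

      s(x) = ∇θ log pθ(x),

which is defined for any likelihood-based model and is used here as a latent-like representation of x.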


[Wilson Yan, Jonathan Ho, Pieter Abbeel. "Natural Image Manipulation for Autoregressive Models using Fisher Scores." arXiv 1912.05015 (2019)]

Bibliography
char-rnn: http://karpathy.github.io/2015/05/21/rnn-effectiveness/
MADE: Germain, Mathieu, et al. "Made: Masked autoencoder for distribution estimation." International Conference on Machine Learning. 2015.
WaveNet: Oord, Aaron van den, et al. "Wavenet: A generative model for raw audio." arXiv preprint arXiv:1609.03499 (2016).
PixelCNN: Oord, Aaron van den, Nal Kalchbrenner, and Koray Kavukcuoglu. "Pixel recurrent neural networks." arXiv preprint arXiv:1601.06759 (2016).
Gated PixelCNN: Van den Oord, Aaron, et al. "Conditional image generation with pixelcnn decoders." Advances in Neural Information Processing Systems. 2016.
PixelCNN++: Salimans, Tim, et al. "Pixelcnn++: Improving the pixelcnn with discretized logistic mixture likelihood and other modifications." arXiv preprint arXiv:1701.05517 (2017)
Self-attention: Vaswani, Ashish, et al. "Attention is all you need." Advances in Neural Information Processing Systems. 2017.
PixelSNAIL: Chen, Xi, et al. "Pixelsnail: An improved autoregressive generative model." arXiv preprint arXiv:1712.09763 (2017)
Fast PixelCNN++: Ramachandran, Prajit, et al. "Fast generation for convolutional autoregressive models." arXiv preprint arXiv:1704.06001(2017).
Multiscale PixelCNN: Reed, Scott, et al. "Parallel multiscale autoregressive density estimation." Proceedings of the 34th International Conference on Machine Learning-Volume 70.
JMLR. org, 2017.
Grayscale PixelCNN: Kolesnikov, Alexander, and Christoph H. Lampert. "PixelCNN models with auxiliary variables for natural image modeling." Proceedings of the 34th International
Conference on Machine Learning-Volume 70. JMLR. org, 2017.
Subscale Pixel Network: Menick, Jacob, and Nal Kalchbrenner. "Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling." arXiv preprint
arXiv:1812.01608(2018)
Dirk Weissenborn, Oscar Tackstrom, Jakob Uszkoreit. “Scaling Autoregressive Video Models.” arXiv 1906.02634 (2019)
Sparse Attention: Rewon Child, Scott Gray, Alec Radford, Ilya Sutskever. “Generating Long Sequences with Sparse Transformers.” arXiv 1904.10509
Wilson Yan, Jonathan Ho, Pieter Abbeel. “Natural Image Manipulation for Autoregressive Models using Fisher Scores.” arXiv 1912.05015
PixelCNN Super Resolution: Dahl, Ryan, Mohammad Norouzi, and Jonathon Shlens. "Pixel recursive super resolution." Proceedings of the IEEE International Conference on
Computer Vision. 2017.

Colab

