Latent Dirichlet Allocation: An Example of A Graphical Model

The document describes Latent Dirichlet Allocation, a statistical model that allows sets of observations to be explained by unobserved groups that explain why some parts of the data are similar. It discusses how LDA can be used to discover topics in a text corpus and describes the generative process and inference methods for LDA such as Gibbs sampling.

Uploaded by

Yuvraj Pardeshi


Latent Dirichlet Allocation:

An example of a graphical model

CS775
LDA: discovering topics in a text corpus
• Why map knowledge?
– Quickly grasp important themes in a new field
– Synthesize content of an existing field
– Discover targets for funding and research
– Understand a corpus of text
1. A generative model for documents
2. Discovering topics with Gibbs sampling
3. Results
– Topics and classes
– Topic dynamics
A generative model for documents
• Each document a mixture of topics
• Each word chosen from a single topic

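• Topic of each word drawn from parameters θ (the document's mixture weights)
• Word drawn from parameters φ (the chosen topic's word distribution)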

(Blei, Ng, & Jordan, 2003)

A generative model for documents
w             P(w|z = 1) = φ^(1)    P(w|z = 2) = φ^(2)
HEART         0.2                   0.0
LOVE          0.2                   0.0
SOUL          0.2                   0.0
TEARS         0.2                   0.0
JOY           0.2                   0.0
SCIENTIFIC    0.0                   0.2
KNOWLEDGE     0.0                   0.2
WORK          0.0                   0.2
RESEARCH      0.0                   0.2
MATHEMATICS   0.0                   0.2
              topic 1               topic 2
Choose mixture weights for each document, generate “bag of words”
θ = {P(z = 1), P(z = 2)}   generated words
{0, 1}         MATHEMATICS KNOWLEDGE RESEARCH WORK MATHEMATICS RESEARCH WORK SCIENTIFIC MATHEMATICS WORK
{0.25, 0.75}   SCIENTIFIC KNOWLEDGE MATHEMATICS SCIENTIFIC HEART LOVE TEARS KNOWLEDGE HEART
{0.5, 0.5}     MATHEMATICS HEART RESEARCH LOVE MATHEMATICS WORK TEARS SOUL KNOWLEDGE HEART
{0.75, 0.25}   WORK JOY SOUL TEARS MATHEMATICS TEARS LOVE LOVE LOVE SOUL
{1, 0}         TEARS LOVE JOY SOUL LOVE TEARS SOUL SOUL TEARS JOY
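
The two-topic example above can be reproduced with a few lines of code. The following is a minimal sketch, assuming fixed mixture weights per document and the two uniform five-word topics from the table; the names `PHI` and `generate_document` are hypothetical and not part of the original slides.

```python
import random

LOVE_WORDS = ["HEART", "LOVE", "SOUL", "TEARS", "JOY"]
SCIENCE_WORDS = ["SCIENTIFIC", "KNOWLEDGE", "WORK", "RESEARCH", "MATHEMATICS"]

# PHI[j] maps each word to P(w | z = j); each topic is uniform over five words.
PHI = {
    1: {w: 0.2 for w in LOVE_WORDS},
    2: {w: 0.2 for w in SCIENCE_WORDS},
}


def generate_document(theta, n_words=10, rng=None):
    """Generate a bag of words given theta = (P(z = 1), P(z = 2))."""
    rng = rng or random.Random()
    words = []
    for _ in range(n_words):
        # Pick a topic for this word according to the mixture weights.
        z = rng.choices([1, 2], weights=theta)[0]
        # Pick the word from the chosen topic's word distribution.
        vocab = list(PHI[z].keys())
        probs = list(PHI[z].values())
        words.append(rng.choices(vocab, weights=probs)[0])
    return words


if __name__ == "__main__":
    for theta in [(0, 1), (0.25, 0.75), (0.5, 0.5), (0.75, 0.25), (1, 0)]:
        print(theta, " ".join(generate_document(theta, rng=random.Random(0))))
```

With θ = {0, 1} every word comes from topic 2, and with θ = {1, 0} every word comes from topic 1; intermediate weights mix the two vocabularies.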
A generative model for documents

[Figure: graphical model in which each latent topic assignment z generates an observed word w]

• Called Latent Dirichlet Allocation (LDA)

• Introduced by Blei, Ng, and Jordan (2003), reinterpretation of PLSI (Hofmann, 2001)
Dirichlet Distributions
• In the LDA model, we would like to say that the topic
mixture proportions for each document are drawn from
some distribution.
• So, we want to put a distribution on multinomials. That
is, k-tuples of non-negative numbers that sum to one.
• The space of all of these multinomials has a nice
geometric interpretation as a (k-1)-simplex, which is just
a generalization of a triangle to (k-1) dimensions.
• Criteria for selecting our prior:
– It needs to be defined for a (k-1)-simplex.
– Algebraically speaking, we would like it to play nice with the
multinomial distribution.
Dirichlet Examples
Topic Model: Geometric Representation
Dirichlet Distributions

• Useful Facts:
– This distribution is defined over a (k-1)-simplex. That is, it
takes k non-negative arguments which sum to one.
Consequently it is a natural distribution to use over
multinomial distributions.
– In fact, the Dirichlet distribution is the conjugate prior to
the multinomial distribution. (This means that if our
likelihood is multinomial with a Dirichlet prior, then the
posterior is also Dirichlet!)
– The Dirichlet parameter α_i can be thought of as a prior
count of the ith class.
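
The conjugacy and "prior count" facts above can be illustrated numerically. This is a minimal sketch using only the Python standard library; the helper names (`sample_dirichlet`, `posterior_alpha`, `dirichlet_mean`) and the example prior and counts are hypothetical, chosen only for illustration.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw one point on the simplex by normalizing Gamma(alpha_i, 1) variates."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [g / total for g in draws]


def posterior_alpha(alpha, counts):
    """Conjugate update: add the observed class counts to the prior counts."""
    return [a + c for a, c in zip(alpha, counts)]


def dirichlet_mean(alpha):
    """Mean of a Dirichlet distribution: alpha_i divided by the sum of alpha."""
    total = sum(alpha)
    return [a / total for a in alpha]


prior = [1.0, 1.0, 1.0]   # hypothetical symmetric prior over k = 3 classes
counts = [5, 2, 0]        # hypothetical observed multinomial counts
posterior = posterior_alpha(prior, counts)

theta = sample_dirichlet(posterior, random.Random(0))
print("posterior parameters:", posterior)
print("posterior mean:", dirichlet_mean(posterior))
print("one draw from the posterior:", theta)
```

The posterior is again a Dirichlet whose parameters are the prior parameters plus the observed counts, which is why each α_i behaves like a count seen before any data.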
The LDA Model

[Figure: unrolled graphical model for three documents, each with topic assignments z1 to z4 generating words w1 to w4]

• For each document,
• Choose θ ~ Dirichlet(α)
• For each of the N words w_n:
– Choose a topic z_n ~ Multinomial(θ)
– Choose a word w_n from p(w_n | z_n, β), a multinomial
probability conditioned on the topic z_n.
The LDA Model

[Figure: plate diagram of LDA over z and w, with plates K, N, and M]

For each document,

• Choose θ ~ Dirichlet(α)
• For each of the N words w_n:
– Choose a topic z_n ~ Multinomial(θ)
– Choose a word w_n from p(w_n | z_n, β), a multinomial
probability conditioned on the topic z_n.
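
The generative process above can be sketched as follows. This is an illustrative sketch, not the authors' code: it assumes `beta` is given as K rows, each a probability distribution over V word ids, and the function names are hypothetical.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw theta ~ Dirichlet(alpha) by normalizing Gamma(alpha_k, 1) variates."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [g / total for g in draws]


def generate_corpus(alpha, beta, doc_lengths, rng):
    """Generate one document per entry of doc_lengths.

    alpha: K Dirichlet parameters.
    beta:  K rows; beta[k][v] = p(w = v | z = k).
    Returns a list of (theta, [(z_n, w_n), ...]) pairs.
    """
    n_topics = len(alpha)
    vocab_size = len(beta[0])
    corpus = []
    for n_words in doc_lengths:
        theta = sample_dirichlet(alpha, rng)          # per-document topic mixture
        tokens = []
        for _ in range(n_words):
            z = rng.choices(range(n_topics), weights=theta)[0]      # topic
            w = rng.choices(range(vocab_size), weights=beta[z])[0]  # word
            tokens.append((z, w))
        corpus.append((theta, tokens))
    return corpus


if __name__ == "__main__":
    beta = [[0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 0.5, 0.5]]  # hypothetical topics
    for theta, tokens in generate_corpus([0.5, 0.5], beta, [4, 6], random.Random(1)):
        print([round(t, 2) for t in theta], tokens)
```

Only the words w_n would be observed in practice; θ and the assignments z_n are the hidden variables that inference has to recover.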
Inference

[Figure: LDA plate diagram]

• The inference problem in LDA is to compute the posterior of the
hidden variables given a document and corpus parameters α
and β. That is, compute p(θ, φ, z | w, α, β).
• Unfortunately, exact inference is intractable, so we turn to
alternatives…
The LDA equations

[Figure: LDA plate diagram]

$$w_{m,n} \mid z_n, \phi_{z_n} \sim \text{Discrete}(\phi_{z_n})$$
$$\phi \sim \text{Dirichlet}(\beta)$$
$$z_i \mid \theta_{d_i} \sim \text{Discrete}(\theta_{d_i})$$
$$\theta \sim \text{Dirichlet}(\alpha)$$

M: # documents, K: # topics, V: # terms in vocabulary, N: document length
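Joint factorization

[Figure: LDA plate diagram; the Dirichlet priors generate the Discrete distributions over z and w]

$$p(\theta_d, \phi, z, w \mid \alpha, \beta) = p(\phi \mid \beta)\, p(\theta_d \mid \alpha) \prod_{n=1}^{N} p(z_n \mid \theta_d)\, p(w_n \mid z_n, \phi)$$

$$p(z, w \mid \alpha, \beta) = \iint p(\phi \mid \beta)\, p(\theta_d \mid \alpha) \prod_{n=1}^{N} p(z_n \mid \theta_d)\, p(w_n \mid z_n, \phi)\, d\theta_d\, d\phi$$

$$p(w \mid \alpha, \beta) = \iint p(\phi \mid \beta)\, p(\theta_d \mid \alpha) \prod_{n=1}^{N} \sum_{z_n=1}^{K} p(z_n \mid \theta_d)\, p(w_{m,n} \mid z_n, \phi)\, d\theta_d\, d\phi$$

$$p(D \mid \alpha, \beta) = \int p(\phi \mid \beta) \prod_{m=1}^{M} \left[ \int p(\theta_d \mid \alpha) \prod_{n=1}^{N} \sum_{z_n=1}^{K} p(z_n \mid \theta_d)\, p(w_{m,n} \mid z_n, \phi)\, d\theta_d \right] d\phi$$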
Intractability

[Figure: LDA plate diagram]

$$p(z \mid w, \alpha, \beta) = \frac{p(z, w \mid \alpha, \beta)}{\sum_{z} p(z, w \mid \alpha, \beta)}$$

Problems
Denominator does not factorize
Denominator represents a summation over $O(K^V)$ terms
Gibbs sampling

For variables z = z_1, z_2, …, z_n

Draw z_i^(t+1) from P(z_i | z_-i, w)
z_-i = z_1^(t+1), z_2^(t+1), …, z_{i-1}^(t+1), z_{i+1}^(t), …, z_n^(t)
Gibbs sampling

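• Need full conditional distributions for variables
• Since we only sample z we need

$$P(z_i = j \mid z_{-i}, w) \propto \frac{n^{(w_i)}_{-i,j} + \beta}{n^{(\cdot)}_{-i,j} + W\beta} \cdot \frac{n^{(d_i)}_{-i,j} + \alpha}{n^{(d_i)}_{-i,\cdot} + T\alpha}$$

where $n^{(w_i)}_{-i,j}$ is the number of times word w is assigned to topic j and $n^{(d_i)}_{-i,j}$ is the number of times topic j is used in document d, both counted without token i; W is the vocabulary size and T the number of topics.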
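
A sampler built from the two counts named above might look like the following. This is a minimal sketch of collapsed Gibbs sampling for LDA, assuming symmetric scalar hyperparameters `alpha` and `beta` and documents given as lists of integer word ids; the function name `gibbs_lda` and its defaults are hypothetical.

```python
import random


def gibbs_lda(docs, n_topics, vocab_size, alpha=0.1, beta=0.01, n_iters=200, seed=0):
    """Collapsed Gibbs sampling over topic assignments z.

    docs: list of documents, each a list of word ids in range(vocab_size).
    Returns (z, n_wt, n_dt): assignments, word-topic and document-topic counts.
    """
    rng = random.Random(seed)
    n_wt = [[0] * n_topics for _ in range(vocab_size)]  # word w assigned to topic j
    n_dt = [[0] * n_topics for _ in docs]               # topic j used in document d
    n_t = [0] * n_topics                                # tokens assigned to topic j

    # Random initial assignment of every token to a topic.
    z = []
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            j = rng.randrange(n_topics)
            z_d.append(j)
            n_wt[w][j] += 1
            n_dt[d][j] += 1
            n_t[j] += 1
        z.append(z_d)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove token i from the counts (the "-i" in z_-i).
                j = z[d][i]
                n_wt[w][j] -= 1
                n_dt[d][j] -= 1
                n_t[j] -= 1
                # Unnormalized P(z_i = k | z_-i, w). The document-side
                # denominator is the same for every k, so it is omitted.
                weights = [
                    (n_wt[w][k] + beta) / (n_t[k] + vocab_size * beta)
                    * (n_dt[d][k] + alpha)
                    for k in range(n_topics)
                ]
                j = rng.choices(range(n_topics), weights=weights)[0]
                # Add the token back under its newly sampled topic.
                z[d][i] = j
                n_wt[w][j] += 1
                n_dt[d][j] += 1
                n_t[j] += 1
    return z, n_wt, n_dt


if __name__ == "__main__":
    toy_docs = [[0, 1, 0, 1], [2, 3, 2, 3], [0, 1, 2, 3]]  # hypothetical word ids
    z, n_wt, n_dt = gibbs_lda(toy_docs, n_topics=2, vocab_size=4, n_iters=50)
    print(z)
```

Each sweep resamples every token's topic in turn while holding all other assignments fixed, so only the count tables need to be stored, not θ or φ.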

Gibbs sampling
i w_i d_i z_i (iteration 1)
1 MATHEMATICS 1 2
2 KNOWLEDGE 1 2
3 RESEARCH 1 1
4 WORK 1 2
5 MATHEMATICS 1 1
6 RESEARCH 1 2
7 WORK 1 2
8 SCIENTIFIC 1 1
9 MATHEMATICS 1 2
10 WORK 1 1
11 SCIENTIFIC 2 1
12 KNOWLEDGE 2 1
. . . .
. . . .
. . . .
50 JOY 5 2
Gibbs sampling
i w_i d_i z_i (iteration 1) z_i (iteration 2)
1 MATHEMATICS 1 2 ?
2 KNOWLEDGE 1 2
3 RESEARCH 1 1
4 WORK 1 2
5 MATHEMATICS 1 1
6 RESEARCH 1 2
7 WORK 1 2
8 SCIENTIFIC 1 1
9 MATHEMATICS 1 2
10 WORK 1 1
11 SCIENTIFIC 2 1
12 KNOWLEDGE 2 1
. . . .
. . . .
. . . .
50 JOY 5 2
Gibbs sampling
i w_i d_i z_i (iteration 1) z_i (iteration 2)
1 MATHEMATICS 1 2 2
2 KNOWLEDGE 1 2 ?
3 RESEARCH 1 1
4 WORK 1 2
5 MATHEMATICS 1 1
6 RESEARCH 1 2
7 WORK 1 2
8 SCIENTIFIC 1 1
9 MATHEMATICS 1 2
10 WORK 1 1
11 SCIENTIFIC 2 1
12 KNOWLEDGE 2 1
. . . .
. . . .
. . . .
50 JOY 5 2
Gibbs sampling
i w_i d_i z_i (iteration 1) z_i (iteration 2)
1 MATHEMATICS 1 2 2
2 KNOWLEDGE 1 2 1
3 RESEARCH 1 1 ?
4 WORK 1 2
5 MATHEMATICS 1 1
6 RESEARCH 1 2
7 WORK 1 2
8 SCIENTIFIC 1 1
9 MATHEMATICS 1 2
10 WORK 1 1
11 SCIENTIFIC 2 1
12 KNOWLEDGE 2 1
. . . .
. . . .
. . . .
50 JOY 5 2
Gibbs sampling
i w_i d_i z_i (iteration 1) z_i (iteration 2)
1 MATHEMATICS 1 2 2
2 KNOWLEDGE 1 2 1
3 RESEARCH 1 1 1
4 WORK 1 2 ?
5 MATHEMATICS 1 1
6 RESEARCH 1 2
7 WORK 1 2
8 SCIENTIFIC 1 1
9 MATHEMATICS 1 2
10 WORK 1 1
11 SCIENTIFIC 2 1
12 KNOWLEDGE 2 1
. . . .
. . . .
. . . .
50 JOY 5 2
Gibbs sampling
i w_i d_i z_i (iteration 1) z_i (iteration 2)
1 MATHEMATICS 1 2 2
2 KNOWLEDGE 1 2 1
3 RESEARCH 1 1 1
4 WORK 1 2 2
5 MATHEMATICS 1 1 ?
6 RESEARCH 1 2
7 WORK 1 2
8 SCIENTIFIC 1 1
9 MATHEMATICS 1 2
10 WORK 1 1
11 SCIENTIFIC 2 1
12 KNOWLEDGE 2 1
. . . .
. . . .
. . . .
50 JOY 5 2
Gibbs sampling
i w_i d_i z_i (iteration 1) z_i (iteration 2) … z_i (iteration 1000)
1 MATHEMATICS 1 2 2 2
2 KNOWLEDGE 1 2 1 2
3 RESEARCH 1 1 1 2
4 WORK 1 2 2 1
5 MATHEMATICS 1 1 2 2
6 RESEARCH 1 2 2 2
7 WORK 1 2 2 2
8 SCIENTIFIC 1 1 1 1
9 MATHEMATICS 1 2 2 2
10 WORK 1 1 2 2
11 SCIENTIFIC 2 1 1 2
12 KNOWLEDGE 2 1 2 2
. . . . . .
. . . . . .
. . . . . .
50 JOY 5 2 1 1
Example of Gibbs Sampling
• Assign word tokens randomly to topics
(●=topic 1; ●=topic 2)

[Figure: documents 1 to 16 (rows) against the words River, Stream, Bank, Money, Loan (columns); each word token is marked with its current topic]

Slide Credit: Padhraic Smyth, UC Irvine

After 1 iteration
• Apply sampling equation to each word token

[Figure: documents 1 to 16 (rows) against the words River, Stream, Bank, Money, Loan (columns); topic assignments after 1 iteration]

Slide Credit: Padhraic Smyth, UC Irvine

After 4 iterations

[Figure: documents 1 to 16 (rows) against the words River, Stream, Bank, Money, Loan (columns); topic assignments after 4 iterations]

Slide Credit: Padhraic Smyth, UC Irvine

After 32 iterations
topic 1        topic 2
stream .40     bank .39
bank .35       money .32
river .25      loan .29

[Figure: documents 1 to 16 (rows) against the words River, Stream, Bank, Money, Loan (columns); topic assignments after 32 iterations]

Slide Credit: Padhraic Smyth, UC Irvine

A visual example: Bars

sample each pixel from a mixture of topics

pixel = word
image = document
Corpus preprocessing

• Used all D = 28,154 abstracts from 1991-2001
• Used any word occurring in at least five
abstracts, not on “stop” list (W = 20,551)
• Segmentation by any delimiting character,
total of n = 3,026,970 word tokens in corpus
• Also, PNAS class designations for 2001
(thanks to Kevin Boyack)
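
The vocabulary rule above (keep words that occur in at least five abstracts and are not on the stop list) could be implemented roughly as follows. This is a sketch under stated assumptions: "any delimiting character" is taken to mean any non-alphanumeric character, words are upper-cased, and the function name `build_vocabulary` is hypothetical.

```python
import re
from collections import Counter


def build_vocabulary(abstracts, stop_words, min_docs=5):
    """Tokenize abstracts and keep words found in at least min_docs of them.

    abstracts:  list of raw text strings.
    stop_words: set of upper-case words to exclude.
    Returns (vocab, corpus): the sorted vocabulary and the filtered token lists.
    """
    doc_freq = Counter()
    tokenized = []
    for text in abstracts:
        # Assumption: every non-alphanumeric character is a delimiter.
        tokens = [t for t in re.split(r"[^A-Za-z0-9]+", text.upper()) if t]
        tokenized.append(tokens)
        doc_freq.update(set(tokens))  # count each word once per abstract

    vocab = sorted(
        w for w, df in doc_freq.items() if df >= min_docs and w not in stop_words
    )
    keep = set(vocab)
    corpus = [[t for t in tokens if t in keep] for tokens in tokenized]
    return vocab, corpus


if __name__ == "__main__":
    docs = ["the gene, the protein", "The gene; a cell", "protein-gene"]
    vocab, corpus = build_vocabulary(docs, {"THE", "A"}, min_docs=2)
    print(vocab)
    print(corpus)
```

In the notation of the slide, `len(vocab)` corresponds to W and the total number of tokens left in `corpus` corresponds to n.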
Topics and classes
• PNAS authors provide class designations
– major: Biological, Physical, Social Sciences
– minor: 33 separate disciplines*
• Find topics diagnostic of classes
– validate “reality” of classes
– show topics pick out meaningful structure
(classes, and the relations between them)
Topic 210: SYNAPTIC, NEURONS, POSTSYNAPTIC, HIPPOCAMPAL, SYNAPSES, LTP, PRESYNAPTIC, TRANSMISSION, POTENTIATION, PLASTICITY, EXCITATORY, RELEASE, DENDRITIC, PYRAMIDAL, HIPPOCAMPUS, DENDRITES, CA1, STIMULATION, TERMINALS, SYNAPSE
Topic 201: RESISTANCE, RESISTANT, DRUG, DRUGS, SENSITIVE, MDR, MULTIDRUG, SUSCEPTIBLE, SELECTED, GLYCOPROTEIN, SENSITIVITY, PGP, AGENTS, CONFERS, MDR1, CYTOTOXIC, CONFERRED, CHEMOTHERAPEUTIC, EFFLUX, INCREASED
Topic 280: SPECIES, SELECTION, EVOLUTION, GENETIC, POPULATIONS, POPULATION, VARIATION, NATURAL, EVOLUTIONARY, FITNESS, ADAPTIVE, RATES, THEORY, TRAITS, DIVERSITY, EXPECTED, NEUTRAL, EVOLVED, COMPETITION, HISTORY
Topic 222: CORTEX, BRAIN, SUBJECTS, TASK, AREAS, REGIONS, FUNCTIONAL, LEFT, MEMORY, TEMPORAL, IMAGING, PREFRONTAL, CEREBRAL, TASKS, FRONTAL, AREA, TOMOGRAPHY, EMISSION, POSITRON, CORTICAL
Topic 2: SPECIES, GLOBAL, CLIMATE, CO2, WATER, ENVIRONMENTAL, YEARS, MARINE, CARBON, DIVERSITY, OCEAN, EXTINCTION, TERRESTRIAL, COMMUNITY, ABUNDANCE, EARTH, ECOLOGICAL, CHANGE, TIME, ECOSYSTEM
Topic 39: THEORY, TIME, SPACE, GIVEN, PROBLEM, SHAPE, SIMPLE, DIMENSIONAL, PAPER, NUMBER, CASE, LOCAL, TERMS, SYMMETRY, RANDOM, EQUATION, CLASSICAL, COMPLEXITY, NUMERICAL, PROPERTIES
Mapping science
• Topics provide dimensionality reduction
• Some applications require visualization (and even lower dimensionality)
• Low-dimensional representation from methods for analysis of compositional data
Evaluating Predictive Power

• Perplexity
– Indicates ability to predict words in new, unseen documents
– Lower is better
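
Perplexity is commonly computed as the exponential of the negative average log-likelihood per word token. The sketch below assumes that per-document topic weights `theta` and per-topic word distributions `phi` have already been estimated (for held-out documents, `theta` would itself have to be inferred); the function name and argument layout are hypothetical.

```python
import math


def perplexity(docs, theta, phi):
    """Perplexity of docs under a topic model.

    docs:  list of documents, each a list of word ids.
    theta: theta[d][k] = P(z = k | document d).
    phi:   phi[k][w]   = P(w | z = k).
    """
    log_lik = 0.0
    n_tokens = 0
    for d, doc in enumerate(docs):
        for w in doc:
            # Marginal word probability: sum over topics of P(z) * P(w | z).
            p_w = sum(theta[d][k] * phi[k][w] for k in range(len(phi)))
            log_lik += math.log(p_w)
            n_tokens += 1
    return math.exp(-log_lik / n_tokens)


if __name__ == "__main__":
    # A model that is uniform over a 4-word vocabulary has perplexity 4.
    uniform_phi = [[0.25] * 4, [0.25] * 4]
    print(perplexity([[0, 1, 2, 3]], [[0.5, 0.5]], uniform_phi))
```

A model that predicts no better than a uniform guess over V words scores a perplexity of V, so lower values indicate better prediction.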
Author-Topic Model
[Figure: plate diagram of the author-topic model. Labels: uniform distribution of documents over authors (a_d); author (x); distribution of authors over topics; topic (z); topic distribution over words; word (w); plates N_d, A, T, D]
