3 - Unit - 1 - Find Structures of Documents
3 - Unit - 1 - Find Structures of Documents
• The aim is to generate new samples from what has already been
distributed in the training data.
• For Example
• Let's say you have input data x and you want to classify the data
into labels y.
• p(x,y) is p(y|x) is
• It's like we were trying to model the decision boundary between the
classes.
𝑃 𝑥𝑦 𝑃(𝑦)
𝑃(𝑥)
Generative and Discriminative Sequence Models
• We are just interested in the argmax, elliminate the denominator
that will be the same for every y.
• So, we get :
• In the second case, the joint probability distribution P(X, Y), since
P(X | Y) P(Y) = P(X, Y), which explicitly models the actual
distribution of each class.
Generative and Discriminative Sequence Models
• With the joint probability distribution function, given a Y, calculate
("generate") its respective X.
and
6. G models can provide rich insights about the data, when you
do not have labels.
Complexity of Approaches
• Sentence/topic segmentation approaches can be rated in terms of
complexity (time and memory) of their training and prediction
algorithms and in terms of their performance on real-world
dataset.
1. Discriminative approach
2. Generative models
4. Sequence approaches
Complexity of Approaches
1. Discriminative approach
• Since, they require multiple passes over the training data to adjust
for their feature weights.
2. Generative models
3. Discriminative classifiers
4. Sequence approaches