Graph-Based Text Representations PPT
Information Retrieval
Fragkiskos D. Malliaros, UC San Diego
Michalis Vazirgiannis, École Polytechnique
Saikat Mondal, MSc, 20419CMP020
Text Representation
Bag of Words Model
Graph Semantics
[Figure: example terms: world, the, barks, dog, paris]
Text Representation
Since text is unstructured, a document is usually converted into a common
representation such as the Bag-of-Words model.
Nowadays, the most commonly used text representation model in the area
of Information Retrieval is the Vector Space Model (VSM).
[Figure: Text Categorization pipeline; stage labels: Textual Data, Feature Extraction, Term Weighting, Model Learning, Evaluation]
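A minimal sketch of the bag-of-words idea, assuming whitespace tokenization and raw term counts as vector entries. The function name `bag_of_words` and the two toy documents are illustrative assumptions, not part of the slides.

```python
from collections import Counter


def bag_of_words(docs):
    """Map each document to a term-count vector over a shared vocabulary."""
    # Assumption: lowercase + whitespace split stands in for real tokenization.
    tokenized = [doc.lower().split() for doc in docs]
    # The vocabulary defines the dimensions of the vector space.
    vocab = sorted({term for doc in tokenized for term in doc})
    vectors = []
    for doc in tokenized:
        counts = Counter(doc)
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


# Hypothetical toy documents built from the example terms on the slide.
docs = ["the dog barks", "the world the dog"]
vocab, vectors = bag_of_words(docs)
print(vocab)
print(vectors)
```

Each document becomes a point in a space with one dimension per vocabulary term; word order is discarded, which is the limitation that graph-based representations try to address.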
Graph-based Document Representation
Graph-based text representations
Graph Semantics
Graph-of-Words (GoW) Model
[Figure: example graph-of-words; nodes (stemmed terms): predict, field, scienc, knowledg, mine, extract, structur, data, discoveri, volum, continu, known]
Window size w = 3
unweighted, undirected graph
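A minimal sketch of building an unweighted, undirected graph-of-words with a sliding window. It assumes the text is already tokenized and stemmed, and that a window of size w links each term to the w - 1 terms that follow it; other window conventions exist. The token list below is a hypothetical toy sequence, not the text behind the slide's figure.

```python
def graph_of_words(tokens, w=3):
    """Return the edge set of an unweighted, undirected graph-of-words."""
    edges = set()
    for i, term in enumerate(tokens):
        # Link the current term to the next w - 1 terms in the window.
        for other in tokens[i + 1 : i + w]:
            if other != term:  # skip self-loops
                edges.add(frozenset((term, other)))
    return edges


# Hypothetical stemmed token sequence (assumption, for illustration only).
tokens = ["data", "mine", "extract", "knowledg", "data"]
edges = graph_of_words(tokens, w=3)
for edge in sorted(tuple(sorted(e)) for e in edges):
    print(edge)
```

Using `frozenset` for edges makes the pair order irrelevant, which matches the undirected setting, and storing them in a set drops repeated co-occurrences, which matches the unweighted setting.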
Example of Weighted Undirected GoW
Mathematical aspects of computer-aided share trading.
We consider problems of statistical analysis of share
prices and propose probabilistic characteristics to
describe the price series. We discuss three methods of
mathematical modelling of price series with given
probabilistic characteristics.
[Figure: weighted undirected graph-of-words for the text above; nodes (stemmed terms): mathemat, aspect, computer-aid, share, trade, problem, statist, analysi, price, probabilist, characterist, seri, method, model; legend: edge weights 1 to 5]
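A minimal sketch of the weighted variant, assuming an edge weight is the number of times two terms co-occur within the sliding window. The window size and the short stemmed token list are illustrative assumptions; they do not reproduce the exact graph in the figure.

```python
from collections import Counter


def weighted_graph_of_words(tokens, w=3):
    """Return {(term_a, term_b): weight} for an undirected graph-of-words."""
    weights = Counter()
    for i, term in enumerate(tokens):
        for other in tokens[i + 1 : i + w]:
            if other != term:  # skip self-loops
                # Sort the pair so (a, b) and (b, a) share one undirected edge.
                weights[tuple(sorted((term, other)))] += 1
    return weights


# Hypothetical stemmed tokens (assumption, for illustration only).
tokens = ["share", "price", "seri", "share", "price"]
weights = weighted_graph_of_words(tokens, w=3)
for edge, weight in sorted(weights.items()):
    print(edge, weight)
```

Terms that repeatedly appear close together, such as "share" and "price" here, end up connected by heavier edges.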
In-degree based Term Weighting (TW)
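A minimal sketch of in-degree based term weighting, under the assumption that the graph-of-words is directed, with an edge from each term to the terms that follow it inside the window, and that a term's weight is its number of distinct incoming neighbours. The function name, window size, and token list are hypothetical.

```python
def indegree_term_weights(tokens, w=3):
    """Weight each term by its in-degree in a directed graph-of-words."""
    # One predecessor set per unique term; sets ignore repeated edges.
    predecessors = {term: set() for term in tokens}
    for i, term in enumerate(tokens):
        for following in tokens[i + 1 : i + w]:
            if following != term:  # skip self-loops
                # Directed edge: term -> following
                predecessors[following].add(term)
    return {term: len(preds) for term, preds in predecessors.items()}


# Hypothetical stemmed tokens (assumption, for illustration only).
tokens = ["mathemat", "aspect", "share", "trade", "share", "price"]
tw = indegree_term_weights(tokens, w=3)
for term, weight in tw.items():
    print(term, weight)
```

Unlike raw term frequency, this weight grows with the number of distinct contexts a term appears in, not with how often it is repeated.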
Graph-based Representation of Tweets
• Represents all the input tweets
• Nodes: unique terms
• Edges: number of co-occurrences within a tweet
Example graph
1. Good goal by Neymar
2. Goal! Neymar scores for brazil
3. Goal!! Neymar scores again
4. Watching the game tonight
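A minimal sketch of the tweet graph described above, using the four example tweets. It assumes lowercasing, a simple letters-only tokenizer, no stopword removal, and that each pair of terms is counted once per tweet in which both appear; these preprocessing choices are assumptions, not stated on the slide.

```python
import re
from collections import Counter
from itertools import combinations

tweets = [
    "Good goal by Neymar",
    "Goal! Neymar scores for brazil",
    "Goal!! Neymar scores again",
    "Watching the game tonight",
]


def tweet_graph(tweets):
    """Return {(term_a, term_b): number of tweets containing both terms}."""
    weights = Counter()
    for tweet in tweets:
        # Unique terms of this tweet, sorted so each pair has one canonical order.
        terms = sorted(set(re.findall(r"[a-z]+", tweet.lower())))
        for a, b in combinations(terms, 2):
            weights[(a, b)] += 1
    return weights


weights = tweet_graph(tweets)
nodes = {term for edge in weights for term in edge}
print(len(nodes), "nodes")
print(weights[("goal", "neymar")])
```

The three tweets about the goal reinforce the same edges, so "goal" and "neymar" form a dense, heavily weighted region, while the fourth tweet forms a separate component.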
THANK YOU