SS 2011 - Graph-Based Methods For NLP - UKP Lab - Wolfgang Stille
Organizational matters:
Graph isomorphism
Adjacency Matrix / Adjacency List
P = NP?
There are efficient (polynomial) algorithms for the exact solution of many problems on graphs, e.g.
• Graph Traversal (DFS, Shortest Paths, Max-Capacity Paths, …)
• Optimal Trees and Branchings (MST, MAX-FOREST, MAX-BRANCHING, …)
• Graph Clustering (Min-Cut, Markov Clustering, Chinese Whispers, …)
• Graph Ranking (PageRank, Random Walks, Markov Chain Theory)
• Graph Distances (local: Paths; global: Graph Edit Distance, …)
• Flows on Graphs (MAX-FLOW, MIN-COST FLOW, …)
• Matching and Assignment (Hungarian Method, Edmonds' Algorithm)
• many more
Efficient Algorithms!
There are efficient approximation algorithms and heuristics for the approximate solution of many graph problems, e.g.
• Subgraph Problems (Dense Subgraphs, Minors, …)
• Optimal Tour Problems (TSP, PCTSP, VRP, …)
• Steiner Trees
• many more
§ The equation is recursive, but it may be computed by starting with any set of ranks and iterating the computation until it converges.
§ Rank sink problem: a cycle of pages that accumulates rank within the cycle, but never distributes rank outside
§ Need damping: uniform rank distribution for all pages
[Figure: example web graph with pages A, C, D and X]
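For reference, the recursive equation in its common undamped form; the notation is an assumption, since the slide itself does not reproduce the formula:

    R(u) = \sum_{v \in B_u} \frac{R(v)}{L(v)}

where B_u is the set of pages linking to u and L(v) is the number of outgoing links of v.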
Random Surfer Model
§ When normalizing PageRank over all pages to 1, R(u) can be thought of as the probability that a random surfer looks at page u.
§ Damping corresponds to "teleportation": with some probability d, the random surfer is teleported to some other page.
[Figure: example web graph with pages A, B, C, D and X]
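With the slide's convention that d is the teleportation probability, the damped recursion can be sketched as (the uniform 1/N term reflects the "uniform rank distribution" remark above):

    R(u) = \frac{d}{N} + (1 - d) \sum_{v \in B_u} \frac{R(v)}{L(v)}

where N is the total number of pages.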
p_0 = (1/N) · 1
t = 0;
repeat:
    t = t + 1;
    p_t = M^T · p_{t-1};
    δ = ||p_t − p_{t-1}||;
until δ < ϵ;
return p_t;
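A minimal runnable sketch of this power iteration in Python/NumPy. The example graph, the damping value d (the slide's teleportation probability) and the tolerance are illustrative assumptions:

import numpy as np

def pagerank(adj, d=0.15, eps=1e-8):
    # adj[i, j] = 1 if page i links to page j
    n = adj.shape[0]
    out = adj.sum(axis=1, keepdims=True)
    out[out == 0] = 1                    # guard against dangling pages
    M = adj / out                        # row-stochastic transition matrix
    G = d / n + (1 - d) * M              # damping: teleport with probability d
    p = np.full(n, 1.0 / n)              # p_0 = (1/N) * 1
    while True:                          # repeat until delta < epsilon
        p_next = G.T @ p                 # p_t = M^T p_{t-1}
        if np.linalg.norm(p_next - p) < eps:
            return p_next
        p = p_next

# toy example: 4 pages
A = np.array([[0, 1, 1, 0],
              [0, 0, 1, 0],
              [1, 0, 0, 1],
              [0, 0, 1, 0]], dtype=float)
print(pagerank(A))                       # scores sum to 1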
LexRank: Application to Multi-Document Summarization
Multi-document summarization task:
1. identify important topics of the documents to be summarized
2. identify sentences belonging to a certain topic
3. from the sentences belonging to the same topic, select the ones that best describe the topic
4. concatenate sentences from different topics and make sure they fit together
[Figure: the sentence "This is a sentence that talks about some topic." represented as a vector of word counts over words w_1 … w_n, e.g. (3, 0, 2, 0, 0, 0, 0); edge weights such as .27 are similarities between sentence vectors]
§ Centroid
§ Idea: select an "average" sentence. Compute the average point of the sentence vectors (centroid)
§ For summarization, select the sentence that is most similar to the centroid
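A minimal sketch of centroid selection in Python/NumPy; the sentence vectors (e.g. tf-idf rows) are assumed to be precomputed:

import numpy as np

def centroid_sentence(S):
    # S: one row per sentence vector
    c = S.mean(axis=0)                   # centroid of all sentence vectors
    sims = (S @ c) / (np.linalg.norm(S, axis=1) * np.linalg.norm(c) + 1e-12)
    return int(np.argmax(sims))          # sentence closest to the centroid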
§ Degree Centrality
§ Idea: sentences that cover most of the content have a high node degree (number of edges). Since word overlap is responsible for edges, node degree measures word overlap with the overall set of sentences
§ For summarization, choose the sentence with the highest degree
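A corresponding sketch for degree centrality; the cosine-similarity threshold that decides when two sentences share an edge is an assumption:

import numpy as np

def degree_sentence(S, threshold=0.1):
    norms = np.linalg.norm(S, axis=1, keepdims=True) + 1e-12
    sim = (S / norms) @ (S / norms).T        # cosine similarity matrix
    np.fill_diagonal(sim, 0.0)               # ignore self-similarity
    degree = (sim > threshold).sum(axis=1)   # node degree in the similarity graph
    return int(np.argmax(degree))            # sentence with the highest degree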
§ LexRank Centrality
§ Idea: it does not suffice to be similar to many sentences; similarity to important sentences counts more.
§ Normalize the sentence-similarity adjacency matrix to make it a stochastic matrix
§ Run PageRank to obtain scores that are used for ranking the sentences
§ For summarization, choose the sentence with the highest score
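Putting the pieces together: a sketch of LexRank that reuses the pagerank routine from the power-iteration sketch above; threshold and damping are again illustrative assumptions:

import numpy as np

def lexrank_sentence(S, threshold=0.1, d=0.15):
    norms = np.linalg.norm(S, axis=1, keepdims=True) + 1e-12
    sim = (S / norms) @ (S / norms).T        # sentence similarity matrix
    np.fill_diagonal(sim, 0.0)
    A = (sim > threshold).astype(float)      # adjacency of the similarity graph
    scores = pagerank(A, d=d)                # normalization + PageRank happen here
    return int(np.argmax(scores))            # highest-scoring sentence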
Evaluation of graph-based multi-document summarization
§ Scores: ROUGE metric, similar to BLEU, computed between manual summaries and system summaries
§ random baseline: select any sentence from the set by chance
§ lead-based: select based on the position of the sentence within the document
⇒ LexRank is a simple method for getting high scores. It uses the whole structure of the graph, as opposed to Centroid or Degree. The technique also works well for single-document summarization.
§ Keyword extraction: find the most salient keywords for a document
§ Keyword extraction with PageRank:
§ preprocess the document: identify adjectives and nouns as targets
§ target co-occurrence graph: connect targets co-occurring within a window of 2-10 words
§ apply PageRank to get ranking scores on the nodes
§ select the highest-scoring keywords; possibly concatenate ADJ-NOUN-NOUN sequences if present in the text
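A minimal sketch of these steps in Python; a real system would use a POS tagger to find the adjective/noun targets, so the targets set is assumed to be given, and the pagerank routine from the sketch above is assumed to be in scope:

import numpy as np

def textrank_keywords(tokens, targets, window=2, top_k=5, d=0.15):
    # tokens: the document as a token list; targets: tokens kept after POS filtering
    nodes = sorted({t for t in tokens if t in targets})
    idx = {w: i for i, w in enumerate(nodes)}
    A = np.zeros((len(nodes), len(nodes)))
    for i, w in enumerate(tokens):               # connect targets co-occurring
        for v in tokens[i + 1 : i + window]:     # within the window
            if w in idx and v in idx and w != v:
                A[idx[w], idx[v]] = A[idx[v], idx[w]] = 1.0
    scores = pagerank(A, d=d)                    # ranking scores on the nodes
    return [nodes[i] for i in np.argsort(-scores)[:top_k]]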
§ Comparison: supervised system that is trained on manually assigned keywords, using frequency and contextual features
§ Note that TextRank is unsupervised: no training necessary
§ Task: find meaningful groups of nodes in a graph by cutting edges
§ Intuition: connectedness within a cluster is higher than between clusters
§ Many graph clustering algorithms find the number of clusters automatically
[Figure; sources: http://elisa.dyndns-web.com/~elisa/publications/ and http://scienceblogs.com/goodmath/2007/08/maximum_flow_and_minimum_cut_1.php]
§ Clustering based on random walks: MCL is the parallel simulation of all possible random walks up to a finite length on a graph G
§ Idea: a random walker on the graph is more likely to stay within the same cluster than to end up in a different cluster after a small number of steps
§ Algorithm: one can show convergence to a limit T
Add loops: transition matrix T = column-normalize(A_G + I)
MCL process: alternate between
    T = T^t         // expansion: raise T to the power t
    T = inflate(T)  // inflation: increase contrast within columns by raising
                    // values to the power s (s > 0) and normalizing column-wise
Interpret T as a clustering: use the strongest connection as the label
Stijn van Dongen, Graph Clustering by Flow Simulation. PhD thesis, University of Utrecht, May 2000.
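A compact runnable sketch of the MCL process in Python/NumPy, following the scheme above; the expansion power t, the inflation exponent s and the convergence test are illustrative assumptions:

import numpy as np

def mcl(A, t=2, s=2.0, iters=100, eps=1e-6):
    n = A.shape[0]
    T = A + np.eye(n)                         # add self-loops
    T = T / T.sum(axis=0, keepdims=True)      # column-normalize (stochastic)
    for _ in range(iters):
        T_prev = T
        T = np.linalg.matrix_power(T, t)      # expansion: T = T^t
        T = T ** s                            # inflation: entrywise power ...
        T = T / T.sum(axis=0, keepdims=True)  # ... then re-normalize columns
        if np.abs(T - T_prev).max() < eps:
            break
    return T.argmax(axis=0)                   # strongest connection per column = label

# toy graph: two triangles joined by a single edge
A = np.zeros((6, 6))
for i, j in [(0, 1), (1, 2), (0, 2), (2, 3), (3, 4), (4, 5), (3, 5)]:
    A[i, j] = A[j, i] = 1.0
print(mcl(A))   # nodes with the same label form a cluster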
§ (Stochastic) adjacency matrix T: probabilities to walk from the node in the column to the node in the row in a single step.
§ T²: probabilities to walk from A to B in 2 steps.
[Figure: A_G with loops added; the matrices T and T²; steps labeled ×2 (squaring) and normalize]
§ Inflate the differences within a column by taking the k-th power of each value, then normalize to restore the stochastic property. k regulates the cluster sizes
§ Clustering: the highest entry in a column vector is the cluster label
Variants:
§ Could add small random noise to break ties
§ Optimization: only keep the K largest values, or only keep values over a threshold
Chinese Whispers Graph Clustering
Semantic enrichment:
• Use the nodes on the paths / flows for enrichment, to overcome the knowledge acquisition bottleneck
§ Graphs are a natural representation of entities and their relations
§ We can use well-known (efficient) graph algorithms for the solution of specific NLP problems
§ By taking the overall structure into account, some NLP tasks can be improved (enriching semantics)
§ Graph clustering algorithms solve unsupervised NLP tasks without the need to specify the number of clusters
§ We can enrich information by walks on graphs