0% found this document useful (0 votes)

66 views6 pages

EDIUM: Improving Entity Disambiguation Via User Modeling: Abstract

The document describes EDIUM, an entity disambiguation system that uses user interest models to disambiguate entity mentions in a user's tweets. EDIUM jointly models a user's interest scores based on tweet categories and context disambiguation scores to compensate for the sparse context in tweets. It evaluates the system's entity linking capabilities on user tweets and shows improvement by combining user models and context-based models.

Uploaded by

AkulBansal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

66 views6 pages

EDIUM: Improving Entity Disambiguation Via User Modeling: Abstract

Uploaded by

AkulBansal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

EDIUM: Improving Entity Disambiguation via User

Modeling
Author
Organization

Abstract. Entity Disambiguation is the task of associating name mentions

in text to the correct referent entities in the knowledge base, with the goal
of understanding and extracting useful information from the document.
Entity disambiguation has become an important task to harness information shared by users on microblogging sites like twitter. However, noise
and lack of context in tweets makes disambiguation a difficult task. In this
paper, we describe an Entity Disambiguation system, EDIUM, which uses
User interest Models to disambiguate the mentions in the users tweets.
Our system jointly models the users interest scores and the context disambiguation scores, thus compensating the sparse context in the tweets
for a given user. We evaluated the systems entity linking capabilities on
the user tweets and showed that improvement can be achieved by combining the user models and the context based models.

Introduction

Named Entity Disambiguation (NED) is the task of identifying the correct entity reference from the knowledge bases (like DBpedia, Freebase or YAGO), for
the given mention. In microblogging sites like twitter, NED is an important task
for understanding the users intent and for topic detection & tracking, search
personalization and recommendations.
In past, many NED techniques have been proposed. Some utilize contextual information of an entity, while others use candidates popularity for disambiguation. But tweets being short and noisy, lack sufficient context for these
systems to disambiguate precisely. Due to this, the underlying user interests
are modeled to disambiguate the entities[1][2]. However, creation of user models might require some external knowledge (like Wikipedia edits[1]), which are
computationally expensive. Also, some users (e.g. news channels) tweet randomly (based on recent events or trending hashtags), while others follow their
interests too passionately. So, using same configurations (like same length sliding window[2]) for distinct users might not be effective. We tried to address
these issues by modeling the users tweeting behavior over time.
Our proposed system, EDIUM, links the entities in the tweets by simultaneously disambiguating the entities and creating the user models. The users
behavior is analyzed with respect to the model built and an appropriate weightage is assigned to the user model. The user model contributes in proportion
to the weight assigned to it while disambiguating the new tweet entities. This

approach can also be used for modeling users and disambiguating entities in
other streaming documents like emails or query logs. The next section describes
the EDIUM system in details.

The EDIUM System

EDIUM works by creating users interests as a distribution over the semantic

Wikipedia categories. EDIUM has three sub-systems: the context modeling system (Section 2.1), the user modeling system (Section 2.2) and the disambiguation system (Section 2.3). The users interests are modeled based on the tweet
categories by the user. These interests along with the local context are used by
the disambiguation system for linking entities in the new tweets. The final results are fed back to the system for improving the user model.
Every tweet has multiple mentions and each mention can be aligned to multiple entities. Table 1 represents few notations used while describing the system.
Table 1. Notations used for describing the system
Symbol
Cij
cji
P ar(C)
G(C)
Nr (C)
ICui
iciu

2.1

Description
j-th candidate entity for the i-th mention in a given tweet
Contextual similarity score for Cij
Parent of C is set of categories that are the immediate ancestor of category C
Grand Parent of C is set of all categories such that
G(C) P ar(P ar(C)) and G(C) P ar(C)
Set of categories in the r-th neighborhood of category C
Set of categories in the i-th interest cluster for user u
Score for the cluster ICui

Contextual Modeling System

Context Model (CM) disambiguate the entities based on the text around the
entities. Similarity between the text around the mention and text on Wikipedia
page of an entity is compared and an appropriate weightage for disambiguation is given to each candidate reference. The candidate referent with the maximum weightage is considered as disambiguated entity for the given mention.
We improved the referents disambiguation scores by combining the context
based scores with the users interest based scores in an appropriate manner. We
used existing entity linking systems like DBpedia Spotlight[3] and Wikipedia
Miner[4] for linking and disambiguating the entities based on the context.
The final score ScoreC (Cij ) given by the context model is the candidate score
normalized based on all the possible alignments for the given mention.
cj
ScoreC (Cij ) = Pi

cji

jCi

2.2

User Modeling System

User Model (UM) understands the users interests and behavior over time.

(1)

UM Creation: We used cluster-weighted models1 for modeling the users interests. The following assumptions were made while creating the user models.
Users only tweet on topics that interest them.
The amount of interest in a topic is proportional to the information shared
by the user on the topic.
Based on these assumptions, we modeled each user into weighted sub-clusters
of semantic Wikipedia categories. Each sub-cluster represents the users interest
over specific topic and weight represents the overall interest of user in that
topic.
UM is updated for users future tweets based on the categories in the users
current tweet. The tweet categories are extracted using the following steps.
1. Current tweets entities are discovered via disambiguation modeling (DM)
system (Section 2.3).
2. The entities with sufficiently high confidence are shortlisted to prevent UM
from learning incorrect information for the user. We considered only those
entities where the ratio of scores of the second ranked entity to the disambiguated entity is atmost 2 .
3. Tweet categories for the shortlisted entities are extracted using Wikipedia.
The score of each tweet category is equivalent to the number of tweet entities inherited by the category.
4. Considering the graph of semantic Wikipedia categories, the tweet categories are smoothed to include the parent categories. Parents are given
scores in inverse proportion to their out-degree for each child category.
Common parent gets lesser contribution from the childs score as compared
to rare parent.
The UM is created based on the tweet categories. If the category is already
present in the model, the score is updated by the sum of initial and the tweet
category score. Otherwise the category and its score is added to the model. As
the new tweet category scores are added to the UM, the model is evolved to
better represent the newly processed tweet.
To find the topic of interests for the user, each category is mapped to a single
interest cluster. We formed clusters based on the similar parent and grandparent categories for a given category. The score of the k-th interest cluster, icku , for
user u, is the sum of the weights of the categories in the cluster k.
Twitter users exhibit different interest behaviors, highly specific or too random. This can be seen from the fact that some users tweet based on the situations like trending hashtags or popular news, while others tweets only about
highly specific products or companies. Disambiguating entities depends highly
on the behavior of the users. While making use of interest models might be useful in the latter case, it might not be that effective in the former case. To handle
1
2

http://en.wikipedia.org/wiki/Cluster-weighted modeling
value depends on the usecase and performance of the underlying CM system. While
high values ensure the large learning rates, low values ensure the performance of the
UM system.

this issue, we introduce the concept of relatedness between the users learnt
model and the Disambiguation Model (DM).
Similarity between the UM and the DM is defined as cosine similarity between the tweet categories vector obtained when DM is used vs. when only
user model is used for disambiguation.
Sim(U M, DM ) = cos(Score (Ci ), ScoreU (Ci ))

(2)

Similarly, similarity between the CM and the DM is defined as cosine similarity between the tweet categories vector obtained when DM is used vs. when
only CM is used for disambiguation.
Sim(CM, DM ) = cos(Score (Ci ), ScoreC (Ci ))

(3)

Now we define relatedness, R as ratio of similarity between UM & DM and CM

& DM in inverse proportion to their contribution while disambiguation.

(1 ) Sim(U M, DM )
(1 ) Sim(U M, DM ) + Sim(CM, DM )

(4)

is the measure of consistency of users behavior towards the learnt user

model. tells how consistent is the user about his interests (and how stable the
user model is). The higher the value of , more consistent the user is.
We update after each tweet based on the contribution the user model has
in deciding the tweet categories. Since we dont want to decide the just based
on the users behavior on one tweet, we consider previous n relatedness values
for finding the new . The is the average of the last n relatedness values.
n

1X
Rt
n t=0

(5)

To deal with the changing users interests, we decrease by a factor of 0.9

each day. This lowers the dependency of DM on the UM with time. Also,
is restricted to 0.7 to resist model from learning the incorrect user models and
always making decisions irrespective of the contexts used. This also enables
model to discover new entity in highly interest focused twitter users.
The user model is committed to the database3 after each transaction and
is used whenever new tweet from the same user arrives. This helps us track
huge number of users and built the streaming disambiguation system for twitter streams.
Disambiguation : For each category Cij in a tweet, the final score given by user
model is
n
X
ScoreU (Cij ) =
Sim(Cij , ICuk ) Score(ICuk )
(6)
Ck =0
3

Mongo DB is used as a database

iciu
Score(ICui ) = P
n
icku

(7)

ck =0

Sim(Cij , ICui ) =

ICui N3 (Cij )
ICui N3 (Cij )

(8)

The ScoreU (Cij ) is normalized relative to all possible ScoreU (Ci ).

2.3

Entity Disambiguation System (DM)

DM disambiguate the entities based on the textual context as well as the users
interests. The DM systems combines both the context based models score and
the user based models score using the parameter , that relates the stability of
user to the previous tweeted topics. The final score predicted by the DM is
Score (Cij ) = ScoreU (Cij ) + (1 ) ScoreC (Cij )

(9)

The model selects the entity that maximized the Score (Cij ) for the given mention i.
Entityi = arg max Score (Cij )

(10)

Score

Results and Discussions

We evaluated the performance of EDIUM on manually annotated dataset of

100 tweets from 15 different twitter users. is initialized to 0.001 for each user
because UM has no prior information about the user. We experimented the system with n = 20 and = 0.95. As the UM is improved with users each tweet,
precision at 1 (P@1) score is calculated at interval of 20 tweets for each user.
The system is evaluated with both DBpedia Spotlight and Wikipedia Miner as
the context modeling system. Fig. 1 reports the performance of the system over
time when the proposed model is used vs. when just the CM is used or just the
UM (built using previous tweets with the proposed model) is used for disambiguation. We observed that EDIUM started to outperform the CM after 60
tweets (of each user) are processed by the system. The maximum performance
is achieved when the proposed model is used with the Wikipedia Miner as the
CM system.
EDIUM is experimented to perform better with
Wikipedia Miner (WM) than with DBpedia Spot- Table 2. Average scores
Method
Avg.
light (DS). This is because of the fact that the sysEDIUM
(WM)
0.49
tem is dependent on the underlying context modEDIUM
(DS)
0.38
els for learning the user interests. Context models
that are more precise, leads to faster and more accurate user models, thus significantly helping the
context model to disambiguate the entities.

(a) Performance with Wikipedia Miner

(b) Performance with DBpedia Spotlight

Fig. 1. P@1 score of EDIUM under different configurations

Conversely, if the underlying context models have low entity linking and disambiguation accuracies the user models usually takes much longer to learn the
user interests (with low values) and use them for entity disambiguation. It
can be seen that UM alone can also disambiguate the entities from the users
tweet and achieve significant performance.

Conclusion and Future Work

In this paper, we have modeled entity disambiguation based on the users past
interest information. The paper proposed a way to model the users interests
using the entity linking techniques and then using it later to improve the disambiguation in entity linking systems. The gain in precision is proportional to
the accuracies of the underlying entity linking system.
More analysis is required on the user modeling aspect of the system. Currently users past tweets is used for building the user model and the models
quality depends a lot on the underlying context model. We are including network and demographic information of users to improve user modeling. In future, this would help us in better disambiguating the entities by understanding
more aspects of user behavior.

References
1. Murnane, E.L., Haslhofer, B., Lagoze, C.: Reslve: leveraging user interest to improve
entity disambiguation on short text. In: Proceedings of the 22nd international conference on World Wide Web companion. WWW 13 Companion, Republic and Canton
of Geneva, Switzerland, International World Wide Web Conferences Steering Committee (2013) 8182
2. Shen, W., Wang, J., Luo, P., Wang, M.: Linking named entities in tweets with knowledge base via user interest modeling. In: Proceedings of the 19th ACM SIGKDD
international conference on Knowledge discovery and data mining. KDD 13, New
York, NY, USA, ACM (2013) 6876
3. Mendes, P.N., Jakob, M., Garca-Silva, A., Bizer, C.: Dbpedia spotlight: shedding light
on the web of documents. In: Proceedings of the 7th International Conference on
Semantic Systems. I-Semantics 11, New York, NY, USA, ACM (2011) 18
4. Milne, D., Witten, I.H.: An open-source toolkit for mining wikipedia. Artif. Intell. 194
(2013) 222239

Paradoxes
No ratings yet
Paradoxes
528 pages
اخلاق طبابت
No ratings yet
اخلاق طبابت
230 pages
2025 Specimen Paper 5 Mark Scheme
No ratings yet
2025 Specimen Paper 5 Mark Scheme
10 pages
G12 Phy Sci P2 June 2025 Marking Guidelines
No ratings yet
G12 Phy Sci P2 June 2025 Marking Guidelines
13 pages
Agri Surfactants Handbook - V14 - 280225 - ENGLISH
No ratings yet
Agri Surfactants Handbook - V14 - 280225 - ENGLISH
35 pages
I. F. Sharygin Problems in Plane Geometry Science For Everyone 1988
84% (19)
I. F. Sharygin Problems in Plane Geometry Science For Everyone 1988
412 pages
GISII
No ratings yet
GISII
76 pages
Soal Uas Bhs. Inggris Xii
No ratings yet
Soal Uas Bhs. Inggris Xii
18 pages
MTP Report
No ratings yet
MTP Report
42 pages
INDEXReport Ayush
No ratings yet
INDEXReport Ayush
38 pages
Clustering Tweets Via Tweet Embeddings Thesis
No ratings yet
Clustering Tweets Via Tweet Embeddings Thesis
48 pages
Modeling and Processing For Next Generat
No ratings yet
Modeling and Processing For Next Generat
38 pages
Introduction
No ratings yet
Introduction
61 pages
Lecture 09 Physiographic Divisions of Bangladesh
100% (1)
Lecture 09 Physiographic Divisions of Bangladesh
30 pages
UGEO - HM70A - Operation Manual (Vol1)
100% (1)
UGEO - HM70A - Operation Manual (Vol1)
232 pages
A Hashtag Recommendation System For Twitter Data Streams
No ratings yet
A Hashtag Recommendation System For Twitter Data Streams
26 pages
Mirza Kayesh Begg - 250274290 - CompleteReport
No ratings yet
Mirza Kayesh Begg - 250274290 - CompleteReport
12 pages
Hai An Agency & Logistics Co.,LTD (HAAL)
No ratings yet
Hai An Agency & Logistics Co.,LTD (HAAL)
26 pages
SNS Unit Iv
No ratings yet
SNS Unit Iv
27 pages
Tweet Segmentation and Its Application
No ratings yet
Tweet Segmentation and Its Application
5 pages
TPO 57 Listening
No ratings yet
TPO 57 Listening
11 pages
Hashtag-Based Tweet Expansion For Improved Topic Modeling
No ratings yet
Hashtag-Based Tweet Expansion For Improved Topic Modeling
19 pages
A Review of Approaches For Topic Detection in Twitter
No ratings yet
A Review of Approaches For Topic Detection in Twitter
28 pages
BDA
No ratings yet
BDA
31 pages
The Geisha Memory 2
No ratings yet
The Geisha Memory 2
25 pages
DOCS: Domain-Aware Crowdsourcing System: Yudian Zheng, Guoliang Li, Reynold Cheng
No ratings yet
DOCS: Domain-Aware Crowdsourcing System: Yudian Zheng, Guoliang Li, Reynold Cheng
12 pages
Beyond Search - Event-Driven Summarization For Web Videos
No ratings yet
Beyond Search - Event-Driven Summarization For Web Videos
23 pages
2018 HotelMarketingGuide FINAL
No ratings yet
2018 HotelMarketingGuide FINAL
12 pages
Social-Network Analysis Using Topic Models
No ratings yet
Social-Network Analysis Using Topic Models
10 pages
2020.findings Emnlp.344
No ratings yet
2020.findings Emnlp.344
11 pages
Entity Based Sentiment Classifier For Social Media Analysis
No ratings yet
Entity Based Sentiment Classifier For Social Media Analysis
66 pages
Ref 3 PPT Sun Sigir12twiner
No ratings yet
Ref 3 PPT Sun Sigir12twiner
10 pages
Akshada Tweet Report With Pages Removed
No ratings yet
Akshada Tweet Report With Pages Removed
15 pages
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
No ratings yet
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
11 pages
Analyzing and Ranking Prevalent News Over Social Media
No ratings yet
Analyzing and Ranking Prevalent News Over Social Media
12 pages
Tech Seminar
No ratings yet
Tech Seminar
9 pages
Character-Based Neural Embeddings For Tweet Clustering
No ratings yet
Character-Based Neural Embeddings For Tweet Clustering
9 pages
Thesis Paper Patrick Jaehnichen
No ratings yet
Thesis Paper Patrick Jaehnichen
88 pages
Tweet Stance
No ratings yet
Tweet Stance
8 pages
Monitoring The Public Opinion About The Vaccination Topic From Tweets Analysis
100% (1)
Monitoring The Public Opinion About The Vaccination Topic From Tweets Analysis
18 pages
Restricting Unsolicited Approaches and Counterfeit Users: Batch No: 28 Guided by Done by
No ratings yet
Restricting Unsolicited Approaches and Counterfeit Users: Batch No: 28 Guided by Done by
28 pages
Pdfs-V6-I2-P11 - Chinthala Shyamala 2016
No ratings yet
Pdfs-V6-I2-P11 - Chinthala Shyamala 2016
7 pages
Kumar 2021
No ratings yet
Kumar 2021
8 pages
CLS Aipmt-18-19 XIII Bot Study-Package-1 SET-1 Chapter-1 PDF
No ratings yet
CLS Aipmt-18-19 XIII Bot Study-Package-1 SET-1 Chapter-1 PDF
38 pages
Interactive Hashtag Recommendation System
No ratings yet
Interactive Hashtag Recommendation System
6 pages
Clustering Thesis
No ratings yet
Clustering Thesis
55 pages
Twitter BDA Presentation
No ratings yet
Twitter BDA Presentation
15 pages
A Bounded Derivative That Is Not Riemann Integrable
No ratings yet
A Bounded Derivative That Is Not Riemann Integrable
59 pages
Student Seating Plan Kirori Mal College
No ratings yet
Student Seating Plan Kirori Mal College
27 pages
IEEE Solved PROJECTS 2009
No ratings yet
IEEE Solved PROJECTS 2009
64 pages
Using Knowledge Graphs To Explain Entity Co-Occurrence in
No ratings yet
Using Knowledge Graphs To Explain Entity Co-Occurrence in
4 pages
STUDENTS TIMETABLE: 2014-2015 (Even Semester) : 1:35 PM 12:40 PM 10:50 Am 11:45 Am 3.25 PM 2.30 PM 9:55 Am 9:00 Am
No ratings yet
STUDENTS TIMETABLE: 2014-2015 (Even Semester) : 1:35 PM 12:40 PM 10:50 Am 11:45 Am 3.25 PM 2.30 PM 9:55 Am 9:00 Am
22 pages
Othello Analysis
No ratings yet
Othello Analysis
2 pages
6BT - 6BTA ReCon - Cummins Inc
No ratings yet
6BT - 6BTA ReCon - Cummins Inc
7 pages
COMP90049 2021S1 A3-Spec
No ratings yet
COMP90049 2021S1 A3-Spec
7 pages
IMD MBA Class Profiles
No ratings yet
IMD MBA Class Profiles
16 pages
Principles of Micro - Economics Sem-I (5483)
No ratings yet
Principles of Micro - Economics Sem-I (5483)
6 pages
Lake Pollution Model
0% (1)
Lake Pollution Model
2 pages
Cs533 Clustering Tweet Presentation
No ratings yet
Cs533 Clustering Tweet Presentation
19 pages
Dynamic Topic Modelling Tutorial
No ratings yet
Dynamic Topic Modelling Tutorial
13 pages
Numerical Methods and Programming Using Mathematica
No ratings yet
Numerical Methods and Programming Using Mathematica
14 pages
Integrating Semantic Concept Similarity in Model-Based Web Applications
No ratings yet
Integrating Semantic Concept Similarity in Model-Based Web Applications
8 pages
Diagnostic Procedures in Gynecology (2023)
No ratings yet
Diagnostic Procedures in Gynecology (2023)
3 pages
Project Report
No ratings yet
Project Report
10 pages
Brochure Cosec Tam
No ratings yet
Brochure Cosec Tam
8 pages
Group - 5 MIS Assignment 3
No ratings yet
Group - 5 MIS Assignment 3
6 pages
Friendbook A New Friend Recommendation Application
No ratings yet
Friendbook A New Friend Recommendation Application
4 pages
Analysis and Optimization of Data Classification Using K-Means Clustering and Affinity Propagation Technique
No ratings yet
Analysis and Optimization of Data Classification Using K-Means Clustering and Affinity Propagation Technique
9 pages
Mama Edha at Semeval-2017 Task 8: Stance Classification With CNN and Rules
No ratings yet
Mama Edha at Semeval-2017 Task 8: Stance Classification With CNN and Rules
5 pages
Becker and Kuropka - Topic-Based Vector Space Model PDF
No ratings yet
Becker and Kuropka - Topic-Based Vector Space Model PDF
6 pages
Group - 5 MIS Assignment 3
No ratings yet
Group - 5 MIS Assignment 3
7 pages
How To Ace The Psychometric Test
No ratings yet
How To Ace The Psychometric Test
7 pages
Sentiment Analysis PDF
No ratings yet
Sentiment Analysis PDF
4 pages
A Framework To Predict Social Crimes Using Twitter Tweets
No ratings yet
A Framework To Predict Social Crimes Using Twitter Tweets
5 pages
42 1478279573 - 04-11-2016 PDF
No ratings yet
42 1478279573 - 04-11-2016 PDF
6 pages
(IJCST-V4I6P20) :siddu P. Algur, Rashmi H. Patil, Prashant Bhat
No ratings yet
(IJCST-V4I6P20) :siddu P. Algur, Rashmi H. Patil, Prashant Bhat
6 pages
Culture Modern-Phase Presentation
No ratings yet
Culture Modern-Phase Presentation
6 pages
Processing and Visualizing The Data in Tweets
No ratings yet
Processing and Visualizing The Data in Tweets
9 pages
Trending Topic Analysis Using Novel Sub Topic Detection Model
No ratings yet
Trending Topic Analysis Using Novel Sub Topic Detection Model
5 pages
Temporal and Social Context Based Burst Detection From Folksonomies
No ratings yet
Temporal and Social Context Based Burst Detection From Folksonomies
6 pages
Senior Assistant Interview Schedule
No ratings yet
Senior Assistant Interview Schedule
7 pages
2 Literature Review
No ratings yet
2 Literature Review
15 pages
IPR Gandhinagar Apprentice (Diploma Degree) Recruitment 2020RIJADEJAcom
No ratings yet
IPR Gandhinagar Apprentice (Diploma Degree) Recruitment 2020RIJADEJAcom
3 pages
(IJCST-V5I2P52) :asst - Prof.J.Omana, S.Dhanalakshmi, V.M.Divyalakshmi, S.Mahalakshmi
No ratings yet
(IJCST-V5I2P52) :asst - Prof.J.Omana, S.Dhanalakshmi, V.M.Divyalakshmi, S.Mahalakshmi
4 pages
JournalNX - Traffic Time Monitoring
No ratings yet
JournalNX - Traffic Time Monitoring
3 pages
Eat Pray Love Reaction
100% (1)
Eat Pray Love Reaction
2 pages
Describe and Evaluate Vygotsky's Theory of Cognitive Development
No ratings yet
Describe and Evaluate Vygotsky's Theory of Cognitive Development
2 pages
Mediator in Social Network For User Interest Activity in Big Data
No ratings yet
Mediator in Social Network For User Interest Activity in Big Data
5 pages
Tutorial 1 For MIT Applied Probability Course
No ratings yet
Tutorial 1 For MIT Applied Probability Course
3 pages
Discovering Emerging Topics in Social Streams Via Link-Anomaly Detection
No ratings yet
Discovering Emerging Topics in Social Streams Via Link-Anomaly Detection
5 pages
Top Bar Beekeeping (Text)
No ratings yet
Top Bar Beekeeping (Text)
5 pages
PLAYBILL - Get Connected...
No ratings yet
PLAYBILL - Get Connected...
5 pages
Elementary Analysis Assignment
No ratings yet
Elementary Analysis Assignment
1 page
Discovering Emerging Topics in Social Streams Via Link-Anomaly Detection
No ratings yet
Discovering Emerging Topics in Social Streams Via Link-Anomaly Detection
5 pages
Music 2F03 Notes
No ratings yet
Music 2F03 Notes
6 pages
Manual Feedback Assembly (PFW) : Valco Instruments Co. Inc
No ratings yet
Manual Feedback Assembly (PFW) : Valco Instruments Co. Inc
2 pages
TEDAS: A Twitter-Based Event Detection and Analysis System
No ratings yet
TEDAS: A Twitter-Based Event Detection and Analysis System
4 pages
Abstract
No ratings yet
Abstract
2 pages
Taylor and Maclaurin Series of Sin Function
No ratings yet
Taylor and Maclaurin Series of Sin Function
1 page
Diy Drone and Quadcopter Projects The Editors of PDF Download
No ratings yet
Diy Drone and Quadcopter Projects The Editors of PDF Download
41 pages
Numerical Methods Guidelines, BSC (H) Maths, DU
No ratings yet
Numerical Methods Guidelines, BSC (H) Maths, DU
2 pages
CTRF
No ratings yet
CTRF
2 pages
I.S.I. Entrance Subjective Test
No ratings yet
I.S.I. Entrance Subjective Test
1 page
I.S.I. Entrance Subjective Test
No ratings yet
I.S.I. Entrance Subjective Test
1 page
Exponential Decay Model in Differential Equations
No ratings yet
Exponential Decay Model in Differential Equations
1 page

EDIUM: Improving Entity Disambiguation Via User Modeling: Abstract

Uploaded by

EDIUM: Improving Entity Disambiguation Via User Modeling: Abstract

Uploaded by

EDIUM: Improving Entity Disambiguation via User

Abstract. Entity Disambiguation is the task of associating name mentions

The EDIUM System

EDIUM works by creating users interests as a distribution over the semantic

Contextual Modeling System

User Modeling System

Now we define relatedness, R as ratio of similarity between UM & DM and CM

is the measure of consistency of users behavior towards the learnt user

To deal with the changing users interests, we decrease by a factor of 0.9

Mongo DB is used as a database

The ScoreU (Cij ) is normalized relative to all possible ScoreU (Ci ).

Entity Disambiguation System (DM)

Results and Discussions

We evaluated the performance of EDIUM on manually annotated dataset of

(a) Performance with Wikipedia Miner

(b) Performance with DBpedia Spotlight

Fig. 1. P@1 score of EDIUM under different configurations

Conclusion and Future Work

You might also like