Entity Linking and Retrieval

Edgar Meij
Yahoo! Research, Barcelona, Spain

Krisztian Balog
University of Stavanger, Stavanger, Norway

Daan Odijk
ISLA, University of Amsterdam, Amsterdam, The Netherlands


ABSTRACT
This full-day tutorial presents a comprehensive introduction to entity linking and retrieval. Part I provides a detailed overview of entity linking: identifying and disambiguating entity occurrences in unstructured text. Part II focuses on entity retrieval, by first considering scenarios where explicit representations of entities are available, and then moving to a setting where evidence needs to be collected and aggregated from multiple documents or even collections, thereby combining techniques from both entity linking and entity retrieval. Part III concludes the tutorial with an overview and hands-on comparative analysis of applications and publicly available toolkits and web services.

Categories and Subject Descriptors
H.3 [Information Storage and Retrieval]: H.3.1 Content Analysis and Indexing; H.3.3 Information Search and Retrieval; H.3.4 Systems and Software

Keywords
Entity linking, entity retrieval, semantic search

1. OVERVIEW

The explosive increase in the amount of unstructured textual data being produced in all kinds of domains calls for advanced methodologies for making sense of this data. Recent advances have enabled a precise manner of analysis, where phrases, consisting of a single term or a sequence of terms, are automatically linked to entries in a knowledge base. This process is commonly known as entity linking.

Entity linking facilitates advanced forms of searching and browsing in various domains and contexts. It can be used, for instance, to anchor textual resources in background knowledge; authors or readers of a piece of text may find entity links to supply useful pointers. Another application can be found in search engines, where linking queries to entities to improve the user experience is becoming increasingly prevalent. More and more, users want to find the actual entities that satisfy their information need, rather than merely the documents that mention them; a process known as entity retrieval.

It is common to consider entities from a general-purpose knowledge base such as Wikipedia or Freebase, since these provide sufficient coverage for most tasks and applications. Wikipedia is therefore a typical target for entity linking and also a fertile ground for entity retrieval; its rich structure (including wikilinks, categories, and infoboxes) informs entity linking algorithms and enables advances over plain document retrieval. Approaches for linking and retrieving entities are not Wikipedia-specific, however. Recent developments in the Web of Data enable the use of domain- or task-specific entities. Alternatively, legacy or corporate knowledge bases can be used to provide entities. Entity linking and retrieval is also gaining popularity in the public domain, as witnessed by Wolfram Alpha, the Google Knowledge Graph, digital personal assistants such as Siri and Google Now, and various vertical search engines focusing on particular entity types.

In this full-day tutorial we present a comprehensive introduction to entity linking and retrieval. Part I of the tutorial provides a detailed overview of entity linking. We introduce the fundamental concepts and principles, address the identification and disambiguation of entity occurrences in unstructured text, and detail state-of-the-art algorithms including unsupervised solutions, graph-based methods, and feature-based approaches in a machine learning setting. We continue with applications of entity linking for IR and conclude this part by discussing evaluation methodology.

Part II focuses on entity retrieval and begins with a study of scenarios where explicit representations of entities are available in the form of, e.g., Wikipedia articles or RDF triples. We continue in a setting with more complex queries, requiring evidence to be collected and aggregated from unstructured textual data, with the potential help of some structured data. Such queries require a combination of techniques from entity linking and entity retrieval. Throughout Part II, two main families of models are discussed: generative language models and discriminative feature-based models.

Both the entity linking and entity retrieval parts are anchored in recent evaluation efforts conducted at benchmarking campaigns such as INEX, TAC, and TREC. We introduce test collections, tasks, evaluation methodology, and experimental results from these evaluation initiatives.

Finally, a number of publicly available toolkits and web services for entity linking and entity retrieval exist. The last part of the tutorial will give an overview and comparative analysis of these, followed by a hands-on session where they will be evaluated in various settings.

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). SIGIR'13, July 28-August 1, 2013, Dublin, Ireland. ACM 978-1-4503-2034-4/13/07.
