Knowledge-Based Question Answering System
Knowledge-based question answering (KB-QA)

• KB-QA is a paradigm in question-answering systems where a natural language question is answered by querying a structured database or knowledge base (KB) instead of relying on unstructured text (a minimal sketch of the idea follows this list).

• This approach leverages the structured data in a knowledge base, which typically contains entities (e.g., people, places, organizations) and relationships (e.g., birthplace, located in, works for) in a format that the system can interpret and retrieve answers from with high accuracy.
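As an illustration only, here is a minimal Python sketch of the idea, using a tiny hypothetical in-memory store of (subject, relation) facts; real systems query large knowledge bases such as Wikidata or DBpedia.

# Minimal sketch: answering a factoid question from a structured store of
# facts rather than from unstructured text. All entries here are hypothetical
# stand-ins for a real knowledge base.
knowledge_base = {
    ("Ada Lovelace", "birth_year"): "1815",
    ("Ada Lovelace", "occupation"): "mathematician",
    ("Stanford University", "located_in"): "Stanford, California",
}

def answer(entity, relation):
    # Look up the object for a (subject, relation) pair, if it is in the KB.
    return knowledge_base.get((entity, relation), "unknown")

# "When was Ada Lovelace born?" -> the system maps it to (entity, relation)
print(answer("Ada Lovelace", "birth_year"))  # -> 1815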
Entity Linking

• Entity linking is the task of associating a mention in text with a real-world entity in an ontology, such as a structured database; it is essential for knowledge-based question answering (QA).

• Systems identify specific entities (such as people, locations, and organizations) within text and link these mentions to corresponding records in a database.

• For fact-based QA, Wikipedia is often the preferred ontology, where each unique Wikipedia page acts as an identifier for a specific entity.
Entity linking in this context has two main parts:

• Mention Detection: Identifying phrases or words (mentions) in the text that may refer to real-world entities.

• Mention Disambiguation: Deciding which specific entity each mention corresponds to, especially in cases where a mention may refer to multiple entities (e.g., “Washington” could mean the U.S. state, the capital city, or George Washington).
Linking Based on Anchor Dictionaries and Web Graph

• A classic method for entity linking uses anchor dictionaries and graph structures from Wikipedia, as implemented in the TAGME algorithm. Here’s how this approach works:

Anchor Dictionaries

• An anchor dictionary is created by analyzing hyperlinks within Wikipedia articles, since the anchor text of a hyperlink is a phrase commonly used to refer to the page it points to.

• For instance, the phrase “Stanford University” might serve as the anchor text of a hyperlink to Stanford’s Wikipedia page.

• By collecting all the phrases that hyperlink to each Wikipedia page, an anchor dictionary is built.

• This dictionary lists each possible entity (Wikipedia page) and the phrases commonly used to link to it.
Each anchor in this dictionary has:

• Frequency: The number of times it appears in Wikipedia as a link (link frequency) or in any context (total frequency).

• Link Probability: Calculated as the ratio of link frequency to total frequency. This probability indicates how likely an occurrence of the phrase is to be a link to some entity rather than plain text (a small calculation sketch follows this list).
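As a toy illustration (the counts below are invented, not real Wikipedia statistics), the link probability of a phrase can be computed as follows:

# Minimal sketch: link probability = link frequency / total frequency,
# using hypothetical counts.
anchor_counts = {
    # phrase: (times it occurs as a hyperlink anchor, times it occurs at all)
    "Ada Lovelace": (1_200, 1_300),
    "born": (40, 2_000_000),
}

def link_probability(phrase):
    link_freq, total_freq = anchor_counts[phrase]
    return link_freq / total_freq

for phrase in anchor_counts:
    print(f"{phrase}: {link_probability(phrase):.6f}")
# "Ada Lovelace" is almost always a link when it appears; "born" almost never is.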
Mention Detection

• For a new text, all potential entity mentions are found by looking up sequences of up to six tokens (words) in the anchor dictionary.

• The mention detection step often eliminates low-probability links (phrases with low link probabilities or high ambiguity); see the sketch after this list.

• For example, “Ada Lovelace” would likely have a high link probability to the Wikipedia page about the mathematician, while “born” (from the query “When was Ada Lovelace born?”) would likely be ignored because it has a very low link probability.
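A minimal sketch of this lookup, assuming the tiny hypothetical anchor dictionary below rather than the real Wikipedia-derived one:

# Minimal sketch of dictionary-based mention detection: enumerate spans of up
# to six tokens, keep those whose anchor has a sufficiently high link probability.
anchor_dict = {
    # phrase -> link probability (link frequency / total frequency), hypothetical
    "ada lovelace": 0.92,
    "born": 0.00002,
}

def detect_mentions(question, min_link_prob=0.1, max_len=6):
    # Return (start, end, phrase) spans whose phrase is a plausible anchor.
    tokens = question.lower().rstrip("?").split()
    mentions = []
    for start in range(len(tokens)):
        for end in range(start + 1, min(start + max_len, len(tokens)) + 1):
            phrase = " ".join(tokens[start:end])
            if anchor_dict.get(phrase, 0.0) >= min_link_prob:
                mentions.append((start, end, phrase))
    return mentions

print(detect_mentions("When was Ada Lovelace born?"))
# -> [(2, 4, 'ada lovelace')]; "born" is filtered out by its low link probability.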
Mention Disambiguation

• If an anchor or mention points to only one entity, linking is straightforward. However, many mentions are ambiguous, with multiple possible meanings. TAGME uses two main factors for resolving ambiguity (combined in the sketch after this list):

• Prior Probability: The likelihood of an anchor referring to a particular entity, calculated as the ratio of the anchor’s links to that entity over its total link occurrences.

• Relatedness/Coherence: The relationship between an entity candidate and other entities in the question or context. This coherence is measured by how many Wikipedia pages link to both entities, indicating a strong connection.

• For instance, the word "Yuan" could refer to the Chinese currency, a dynasty, or a person’s name. But in the context of “Chinese Dynasty,” the coherence score would prioritize linking to the "Yuan Dynasty" page.
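A toy sketch of how prior probability and coherence can be combined; the numbers and the weighting scheme below are invented for illustration, and the exact TAGME combination differs in detail.

# Minimal sketch of TAGME-style disambiguation for the anchor "Yuan".
candidates = {
    # candidate entity -> prior P(entity | anchor "Yuan"), hypothetical
    "Yuan (currency)": 0.55,
    "Yuan dynasty": 0.35,
    "Yuan (surname)": 0.10,
}

# Hypothetical coherence with the other entity in the context (a Chinese-history
# page), e.g. a normalized count of Wikipedia pages linking to both entities.
coherence = {
    "Yuan (currency)": 0.05,
    "Yuan dynasty": 0.60,
    "Yuan (surname)": 0.02,
}

def score(entity, prior_weight=0.5):
    # The equal weighting is an assumption made for this sketch.
    return prior_weight * candidates[entity] + (1 - prior_weight) * coherence[entity]

print(max(candidates, key=score))
# -> "Yuan dynasty" once coherence with the context is taken into account.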
Neural Graph-Based Linking

• Modern methods use neural networks to perform entity linking by encoding both the mention (a phrase in the text) and potential entities as embeddings (dense vectors), then computing a similarity score.

• This approach, exemplified by the ELQ linking algorithm, includes several key steps.

• The ELQ linking algorithm is given a question q and a set of candidate entities from Wikipedia with associated Wikipedia text, and outputs tuples (e, ms, me) of entity id, mention start, and mention end.
Mention Detection

• The system encodes each token in the question using a model like BERT.

• Then, it computes probabilities for each potential span (sequence of tokens) to determine whether it is an entity mention.

• Each token receives a start and end probability, which helps identify entity spans within the question (sketched after this list).
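A simplified sketch of span scoring, assuming toy random token vectors in place of real BERT outputs; the actual ELQ model computes its scores from trained BERT representations, so this only shows the general shape of the computation.

# Minimal sketch: every token gets a start score and an end score, and the
# probability that a span [i, j] is a mention is derived from them via a sigmoid.
import torch

torch.manual_seed(0)
num_tokens, hidden = 6, 8                      # toy stand-ins for BERT output sizes
token_embs = torch.randn(num_tokens, hidden)   # would come from BERT in ELQ

w_start = torch.randn(hidden)                  # learned projections (random here)
w_end = torch.randn(hidden)

start_scores = token_embs @ w_start            # one start score per token
end_scores = token_embs @ w_end                # one end score per token

def mention_prob(i, j):
    # Probability that the span from token i to token j is an entity mention.
    return torch.sigmoid(start_scores[i] + end_scores[j]).item()

# Score all spans up to a maximum length and keep the most likely ones.
spans = [(i, j, mention_prob(i, j))
         for i in range(num_tokens)
         for j in range(i, min(i + 4, num_tokens))]
print(sorted(spans, key=lambda s: -s[2])[:3])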
Entity Encoding and Linking

• Each Wikipedia entity is pre-encoded into a dense vector using its page information (e.g., title and first 128 tokens).

• This encoding is stored, allowing the model to search through these vectors when a question arises.

• For each mention span identified in the question, the system computes an embedding by averaging the BERT embeddings of each token in the span.

• It then calculates similarity scores (dot products) between the span embedding and all pre-computed entity embeddings.

• This process results in a ranked list of entities, with the highest similarity score indicating the most likely entity match (see the sketch after this list).
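A minimal sketch of the dot-product ranking step, using toy random vectors and a made-up candidate list instead of real ELQ encoders and the full Wikipedia entity set.

# Minimal sketch: pre-compute one embedding per entity, embed the detected
# mention span by averaging its token embeddings, rank entities by dot product.
import torch

torch.manual_seed(0)
hidden = 8

# Pretend these were produced by encoding each entity's title + first 128 tokens.
entity_names = ["Ada Lovelace", "Ada (programming language)", "Lovelace (film)"]
entity_embs = torch.randn(len(entity_names), hidden)

# Pretend these are the BERT token embeddings of the mention span "Ada Lovelace".
span_token_embs = torch.randn(2, hidden)
span_emb = span_token_embs.mean(dim=0)       # average over the span's tokens

scores = entity_embs @ span_emb              # one dot-product score per entity
ranked = sorted(zip(entity_names, scores.tolist()), key=lambda x: -x[1])
print(ranked)                                # highest-scoring entity is the predicted link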
Training and Loss Functions

• The ELQ algorithm is trained on datasets in which questions are annotated with entity boundaries and links, such as WebQuestionsSP.

• The loss function for training combines two parts (a toy sketch follows this list):

• Mention Detection Loss: A binary cross-entropy loss indicating whether a span is an entity mention.

• Entity Linking Loss: A loss component that penalizes incorrect matches between detected mentions and entities.
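A toy sketch of the combined objective, assuming made-up logits and labels; the exact ELQ loss formulation differs in detail, and the linking part is shown here simply as a cross-entropy over candidate entities.

# Minimal sketch: mention detection uses binary cross-entropy over candidate
# spans; entity linking penalizes low scores for the gold entity.
import torch
import torch.nn.functional as F

# Mention detection: predicted logit that each candidate span is a mention.
span_logits = torch.tensor([2.1, -1.5, 0.3])
span_labels = torch.tensor([1.0, 0.0, 0.0])      # gold: only the first span is a mention
mention_loss = F.binary_cross_entropy_with_logits(span_logits, span_labels)

# Entity linking: similarity scores between a detected mention and candidate entities.
entity_scores = torch.tensor([[4.2, 1.0, -0.5]])  # one row per gold mention
gold_entity = torch.tensor([0])                   # index of the correct entity
linking_loss = F.cross_entropy(entity_scores, gold_entity)

loss = mention_loss + linking_loss                # combined training objective
print(loss.item())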
