
UTtoKB: a Model for Semantic Relation Extraction for Unstructured Text

Mustafa Nabeel Salim 1, a) and Dr. Ban Shareef Mustafa 2, b)

1, 2 University of Mosul / College of Computer Science and Mathematics / Computer Science Dept.
a) [email protected]
b) [email protected]

Abstract. In this paper, a prototype model called UTtoKB is presented. It extracts semantic relationships from unstructured text based on an ontology. The model is a pipeline built on natural language processing (NLP) tasks and tools such as Coreference Resolution (CR), Named Entity Recognition (NER), Semantic Role Labeling (SRL), and Part-of-Speech (PoS) tagging. WordNet is used to measure similarities between extracted entities so that they can be converted into ontology concepts and properties and used to populate the ontology.
The model performs well in specific domains, while performance degrades in other domains due to the instability of WordNet in finding semantic similarities.

Keywords: Ontology-Based Information Extraction, WordNet, Triples, Natural Language Processing

1. Introduction
NLP focuses on the interaction between machines and natural languages, and on a machine's capacity to comprehend, or imitate the comprehension of, human language through a range of tasks and tools. Information Extraction (IE) is a subfield of NLP used to automatically extract structured information from semi-structured or unstructured text. Ontology-Based Information Extraction (OBIE) is an information extraction technique guided by an ontology, in which structured information is extracted from text according to the concepts and properties defined by that ontology [1]. An ontology is described as a "formal and explicit specification of a shared conceptualization" [2]. Ontologies are typically defined for specific domains.
One of the most popular ontologies at present is DBpedia, a collection of Resource Description Framework (RDF) triples extracted from information created in the Wikipedia project [3]. DBpedia is made available to users on the World Wide Web (WWW) and allows them to semantically query the relationships and properties of Wikipedia resources [3].
In this paper, the UTtoKB model is proposed. UTtoKB is built on deep learning-based NLP tasks such as SRL, CR, and NER, together with PoS tagging and several preprocessing tools. The input text is preprocessed using CR, tokenization, and sentence splitting to make it machine-readable. Information is extracted as a set of relations in triple form (RDF). The RDF triples are then refined against a predefined ontology with the help of the WordNet [4] similarity tool. Refining the set of RDF triples improves results by 20% compared to the results obtained from the first extraction.

1.1 Related Works


There are a number of proposed approaches for converting raw text into a formal representation to complete a knowledge graph [5]. Many works concentrate on extracting semantic relations from raw text and adding them to an ontology-based knowledge representation, with the relations formulated as RDF triples. Some of these systems work in an open-information setting and extract semantic relations with no specific guidance; others use the concepts and relations of a domain ontology to direct the extraction process [6]. In [7], the T2KG system is an early effort to extract semantic relations from unstructured text. T2KG uses word2vec [8] vector representations with cosine similarity and a rule-based approach to check similarities and make the extracted relations compatible with the DBpedia ontology. For extracting triples, it uses an open information extraction technique called Open Language Learning for Information Extraction (OLLIE) [9], which produces a predicate-argument structure. The system's steps are entity extraction, CR, triple extraction, triple integration into the knowledge graph, and finally mapping each predicate to its correspondent in the knowledge graph. In [10], an end-to-end system is created for extracting structured information from Wikipedia articles into the DBpedia namespace. DBpedia is an ontology-based knowledge graph of Wikipedia resources. The system extracts useful information from Wikipedia articles, especially the InfoBox structure, using a pipeline in which SRL works in parallel with Named Entity Linking and CR to extract RDF triples. LODifier [11] is a system proposed to create an RDF representation from unstructured text and link it to the DBpedia and WordNet ontologies. Its architecture is built on three elements: semantic analysis, Named Entity Recognition (NER), and Word Sense Disambiguation. It searches for all the entities mentioned in the text and replaces each one with its English Wikipedia link; the next step generates a URI for the Wikifier output, converting each Wikipedia URI to a DBpedia URI. The C&C parser and the Boxer system are used to determine the relations between entities, and the relations produced by Boxer are converted to RDF WordNet class types to obtain an RDF URI per relation. Finally, LODifier constructs RDF graphs by defining URIs for the predicates and relations obtained from Boxer. In [12], a system is proposed to extract structured information from semi-structured Twitter messages. Its architecture is built on GATE, which recognizes the entities in tweets and connects them to the DBpedia ontology. The system takes BBC and New York Times news tweets as input. The GATE component is a pipeline of five steps, including preprocessing, a gazetteer, grammar-rule creation, and disambiguation, which together extract RDF triples. The gazetteer is a dictionary that helps to extract entities related to Wikipedia and DBpedia content. In [13], a system is presented to extract information about tourism places in Indonesia from Twitter messages. It accepts tweets and the DBpedia ontology as input. To make the input machine-readable, it applies tokenization, sentence splitting, and PoS tagging as preprocessing; PoS tagging helps to extract NOUNs, which are likely potential entities. After extracting a set of entities, DBpedia Spotlight is used to annotate them based on DBpedia:Place.
Other works depend on a domain-specific ontology built by experts. In [14], a method is proposed to extend the content of an ontology with material, especially instances, that is more suitable for and related to the input text. In [15], an end-to-end system uses Stanford dependency parsing to extract the triples. In [16], a system is proposed to extract information from semi-structured text based on a predefined ontology for disaster management; it accepts a semi-structured information document and the predefined ontology as input. In [17], a new technique is designed for extracting tabular information relevant to users from papers. The system is generic, meaning it may be used on any document regardless of its domain or content; it is also robust and can handle a variety of document layouts. Table detection and ontological information extraction are its two key modules: the table detection module extracts all tables from a technical document, while the ontological information extraction module keeps only the relevant tables among those discovered. In [5], a pipeline methodology for extracting information from a huge corpus is proposed, comprising several NLP tasks. The methodology is demonstrated on a large medical dataset (CORD-19), extracting triples that are then mapped to a rich ontology of biomedical concepts. In [18], a web-based prototype system is proposed for extracting useful geospatial information from unstructured text. It uses NER to extract named entities, especially those related to locations.
In UTtoKB, the model uses SRL, NER, and PoS tagging to extract RDF triples. This paper concentrates on the techniques used to map the extracted triples to a predefined domain-specific ontology. WordNet and GloVe vector representations are used during RDF mapping to help link the triples to ontology concepts and properties.
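The core of this mapping idea can be sketched as a best-match search with a similarity threshold. The following is a minimal illustration, not the paper's implementation: the similarity function is a stand-in for WordNet or GloVe scores, and the candidate ontology terms are taken from the Country ontology example later in the paper.

```python
def map_to_ontology(term, ontology_terms, similarity, threshold=0.6):
    """Map an extracted term to the most similar ontology term,
    or return None when no candidate clears the threshold."""
    best_term, best_score = None, 0.0
    for candidate in ontology_terms:
        score = similarity(term, candidate)
        if score > best_score:
            best_term, best_score = candidate, score
    return best_term if best_score >= threshold else None

# Toy similarity table standing in for WordNet/GloVe scores (illustrative values).
TOY_SCORES = {("reside", "Live"): 0.9, ("reside", "Speak"): 0.1,
              ("speaks", "Speak"): 0.95, ("speaks", "Live"): 0.2}

def toy_similarity(a, b):
    return TOY_SCORES.get((a, b), 0.0)

print(map_to_ontology("reside", ["Live", "Speak"], toy_similarity))  # Live
```

With a real backend, `toy_similarity` would be replaced by a WordNet path-based score or the cosine similarity of GloVe vectors; the thresholding logic stays the same.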

1.2 Ontology-Based Information Extraction (OBIE)


The IE process converts an unstructured text, or a group of texts, into sets of facts [19]. The technique is used to extract certain kinds of information from text or other resources and represent them in a specific knowledge representation method such as a database or an ontology [20].
The best-known definition of an ontology is: "Ontology is a formal specification of a shared conceptualization" [2].
Ontologies are mostly used in conjunction with the Semantic Web, where they are represented by collections of URI-identified entities that carry meaning. An ontology represents the vocabulary and knowledge base of a specific domain. It is made up of various parts, including classes, datatype properties, object properties (including taxonomical relationships), instances, object property values, and restrictions [21].
An OBIE system processes unstructured or semi-structured raw text to extract new information according to an ontology; usually, the output is added to that ontology [22]. A more straightforward description is that an OBIE system uses the ontology to guide IE algorithms and methods toward the desired information. An OBIE system can be defined as "an IE process guided by the ontology to extract things such as classes, properties and instances". Thus, what distinguishes OBIE from plain IE is that the extraction method is oriented toward identifying entities for a specific ontology [6].

1.3 Architecture
UTtoKB is a system that converts a text into a set of RDF triples to be added as new assertions to a knowledge base according to a specific domain ontology. UTtoKB is a pipeline of several main components: it takes a text (document) as input and produces RDF triples according to a specific domain.
The components work as a workflow. The input text is preprocessed to provide input sentences to the next module, a generic semantic processing component that extracts semantic relations from the text based on a semantic role labeler, providing the primary set of RDF triples to the next module. RDF refinement then selects the triples that can be mapped to the domain ontology. The main components in the UTtoKB model architecture are:
1. Preprocessing module: the text is preprocessed by applying CR and tokenization, producing the output as a set of sentences.
2. RDF Extraction: the main semantic module, based on the AllenNLP semantic role labeler, converts the sentences into a semantic framing structure and produces the primary RDF triples.
3. RDF Refinement and Ontology Population: this step makes the extracted RDF triples correspond to the ontology contents and performs the final mapping of RDF triples to the domain ontology's concepts and properties.
The complete system architecture is shown in Figure 1. The main components are discussed in detail in later sections.
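The three-stage workflow above can be sketched as three chained functions. This is a toy sketch of the data flow only: each stage body is a trivial placeholder (the real modules use AllenNLP CR, SRL, and the WordNet mapping), and the sample text and predicate set are invented for illustration.

```python
def preprocess(text):
    """Stage 1 (toy stand-in): split into sentences.
    The real module also runs coreference resolution first."""
    return [s.strip() for s in text.split(".") if s.strip()]

def extract_rdf(sentences):
    """Stage 2 (toy stand-in): the real module uses AllenNLP SRL
    to produce (subject, predicate, object) triples per sentence."""
    triples = []
    for s in sentences:
        words = s.split()
        if len(words) >= 3:
            triples.append((words[0], words[1], words[-1]))
    return triples

def refine(triples, ontology_predicates):
    """Stage 3 (toy stand-in): keep only triples whose predicate
    maps into the domain ontology; the rest are discarded."""
    return [t for t in triples if t[1] in ontology_predicates]

text = "John reside London. John dislikes rain."
print(refine(extract_rdf(preprocess(text)), {"reside"}))
# [('John', 'reside', 'London')]
```

The point of the sketch is the pipeline shape: each stage consumes the previous stage's output, and refinement is a filter over the primary triples.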

Figure 1: UTtoKB Model Architecture

1.3.1 Text Preprocessing


Preprocessing is an important step in many NLP tasks. It initializes the text and cleans it of unneeded information that could negatively influence the system, especially when the input is large and unstructured. The module consists of coreference resolution, tokenization, and sentence splitting. Figure 2 shows the preprocessing module architecture.

Figure 2: Preprocessing Module Architecture
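The tokenization and sentence-splitting parts of this module can be illustrated with simple regular expressions. This is a simplified stand-in (the paper's pipeline uses NLP-library components, and the coreference-resolution step is not reproduced here):

```python
import re

def split_sentences(text):
    """Naive sentence splitter: break on whitespace that follows
    terminal punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def tokenize(sentence):
    """Naive tokenizer: words and punctuation as separate tokens."""
    return re.findall(r"\w+|[^\w\s]", sentence)

doc = "Muhammad speaks English. Both Muhammad and John reside in London city."
sents = split_sentences(doc)
print(sents[1])            # Both Muhammad and John reside in London city.
print(tokenize(sents[0]))  # ['Muhammad', 'speaks', 'English', '.']
```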

1.3.2 RDF Extraction


RDF Extraction is the main phase of the system: it creates the initial set of RDF triples. This step consists of Semantic Role Labeling, PoS tagging, Chunk Organizing, and Named Entity Recognition. Chunk Organizing merges all similar arguments extracted by SRL that belong to the same chunk. RDF Extraction has four main steps, as shown in Figure 3.
Figure 3: RDF Extraction Module Architecture
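The conversion from an SRL frame to triples can be sketched as follows. The frame shape here is a simplified dictionary (AllenNLP's SRL model actually returns per-token BIO tags), and the coordination splitting is a crude heuristic; the real module also consults NER and PoS tags:

```python
import re

def frame_to_triples(frame):
    """Expand one SRL frame {ARG0, V, ARG1} into triples, splitting
    coordinated arguments on 'and' and commas (simplified heuristic)."""
    def split(arg):
        return [a for a in re.split(r",|\band\b", arg) if a.strip()]
    return [(s.strip(), frame["V"], o.strip())
            for s in split(frame["ARG0"]) for o in split(frame["ARG1"])]

frame = {"ARG0": "Muhammad and John", "V": "reside", "ARG1": "London city"}
print(frame_to_triples(frame))
# [('Muhammad', 'reside', 'London city'), ('John', 'reside', 'London city')]
```

This mirrors the worked example in the Experiments section, where the coordinated subject "Both Muhammad and John" yields one triple per person.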

1.3.3 RDF Refinement and Ontology Population


The main purpose of this phase is to make the RDF triples extracted earlier correspond to the ontology contents. It accepts the set of initial RDF triples from the RDF extraction phase and uses WordNet [4] to find similarities. Finally, the system adds the new refined RDF triples, separately for IS-A and non-IS-A relations, to the ontology. Figure 4 shows the architecture of this phase.

Figure 4: RDF Refinements and Ontology Mapping

1.4 Experiments
In the UTtoKB prototype, a set of documents about a specific domain is processed to extract information and add it to the domain ontology as new assertions in the knowledge base. A Country ontology was constructed manually using the Protégé platform [23]. The main concepts and roles of the predefined ontology are shown in Figure 5.

Figure 5: Country Ontology

The prototype uses AllenNLP models for CR, SRL, and NER. To give a better understanding of the prototype and how it works, the following example shows how the architecture pipeline processes the input below:

1. Input example:
"Muhammad speaks English and French languages while John speaks English only. Both Mu and Johnny reside in London
city."
2. Preprocessing: The input will be split into two sentences after applying Coreference Resolution, tokenization and sentence
splitting.
● Before applying Coreference Resolution:
Muhammad speaks English and French languages while John speaks English only. Both Mu and Johnny reside in London
city.
● After applying Coreference Resolution and Unifying Entities:
Muhammad speaks English and French languages while John speaks English only. Both Muhammad and John reside in
London city.
The final output of this step is:
a) Muhammad speaks English and French languages while John speaks English only
b) Both Muhammad and John reside in London city
3. RDF Extraction:
This step extracts the initial RDF triples based on SRL, NER, and PoS tagging. In SRL, each statement is divided into segments based on where predicates occur in the sentence:
a) [Muhammad]A0 [speaks]predicate [English and French languages]A1
b) [John]A0 [speaks]predicate [English only]A1
c) [Both Muhammad and John]A0 [reside]predicate [in London city]A1
These are then converted to the initial triple form with the help of NER and PoS tagging:
(Muhammad, speaks, English)
(Muhammad, speaks, French)
(Muhammad, speaks, Languages)
(John, speaks, English)
(Muhammad, reside, London)
(John, reside, London)
(Muhammad, reside, city)
(John, reside, city)

4. RDF Refinement and Ontology Population


The RDF triples extracted above are not fully precise and must be made fully dependent on the concepts and properties of the existing ontology. This step refines them into a form suitable for the ontology; unrelated triples are discarded. The following triples are added as new assertions:

(<Country: Person: Muhammad>, <Country: Speak>, <Country: Language: English>)
(<Country: Person: Muhammad>, <Country: Speak>, <Country: Language: French>)
(<Country: Person: John>, <Country: Speak>, <Country: Language: English>)
(<Country: Person: Muhammad>, <Country: Live>, <Country: City: London>)
(<Country: Person: John>, <Country: Live>, <Country: City: London>)

The predicates "reside" and "speaks" are converted to the most similar ontology properties, "Live" and "Speak" respectively, based on semantic similarity with the ontology. This step also adds the entities (John, Muhammad) as instances of the <Country: Person> class, (English, French) as instances of <Country: Language>, and London as an instance of <Country: City>, if they do not already exist in the ontology. These represent the needed facts (is-a relations) to be added as new instances of the proper classes in the ontology. The same process applies to other examples.
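The population step above can be sketched as formatting the refined triples in the paper's <Country: Class: Instance> notation and emitting the accompanying is-a assertions. The helper names and type table are hypothetical; the notation follows the worked example:

```python
def qualify(name, cls):
    """Render an entity in the paper's <Country: Class: Instance> notation."""
    return f"<Country: {cls}: {name}>"

def populate(triples, types):
    """Produce ontology assertions: the refined relation triples plus
    is-a facts typing each entity (a simplified sketch of this step)."""
    relations = [(qualify(s, types[s]), f"<Country: {p}>", qualify(o, types[o]))
                 for s, p, o in triples]
    is_a = [(qualify(e, c), "is-a", f"<Country: {c}>") for e, c in types.items()]
    return relations, is_a

types = {"Muhammad": "Person", "John": "Person", "London": "City"}
rels, is_a = populate([("Muhammad", "Live", "London")], types)
print(rels[0])
# ('<Country: Person: Muhammad>', '<Country: Live>', '<Country: City: London>')
```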

1.5 UTtoKB Experiment Results


Evaluating an OBIE model is not a standardized task, due to the lack of gold-standard extractions for a specific ontology domain. The common approach is to manually extract the correct set of RDF triples from a test document and compare that set with the model's generated results. All possible triples in the case-study text were generated manually, and precision and recall were calculated: precision as the ratio of valid RDF triples among those extracted by the UTtoKB pipeline, and recall as the ratio of valid extracted triples to the total number of valid (manually extracted) triples. The F1-score is the harmonic mean of precision and recall. Tables 1, 2, and 3 show the precision, recall, and F1-score values for the initial and refined RDF triples in the UTtoKB model.

Table 1: Evaluation of initial RDFs


Metrics Percentage
Precision (%) 54.7
Recall (%) 48.3
F1 (%) 51.3

Table 2: Evaluation of IS-A and non-IS-A relations for refined RDFs


Metrics Percentage
Precision (%) 75
Recall (%) 70
F1 (%) 72

Table 3: Evaluation of IS-A and non-IS-A relations with different thresholds


Metrics          Class/property similarity 60%    Class/property similarity 80%
Precision (%)    75                               66
Recall (%)       70                               61
F1 (%)           72                               62.5
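The evaluation scheme described above is the standard set-based precision/recall computation over triples. A minimal sketch (the triples below are toy examples, not the paper's data):

```python
def prf1(extracted, gold):
    """Precision, recall and F1 over sets of triples: precision is the share
    of extracted triples that are valid, recall the share of valid triples
    that were extracted, F1 their harmonic mean."""
    correct = len(extracted & gold)
    precision = correct / len(extracted) if extracted else 0.0
    recall = correct / len(gold) if gold else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if precision + recall else 0.0
    return precision, recall, f1

gold = {("Muhammad", "Speak", "English"), ("John", "Speak", "English"),
        ("Muhammad", "Live", "London"), ("John", "Live", "London")}
extracted = {("Muhammad", "Speak", "English"), ("John", "Live", "London"),
             ("Muhammad", "Live", "city")}
p, r, f = prf1(extracted, gold)
print(round(p, 2), round(r, 2), round(f, 2))  # 0.67 0.5 0.57
```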

Another experiment evaluates similarity percentages using Global Vector (GloVe) [24] representations and compares them with the WordNet similarity techniques. Table 4 shows results for several example entities compared against ontology classes.
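The GloVe comparison reduces to cosine similarity between word vectors. The following sketch uses invented 3-dimensional vectors purely for illustration; real GloVe embeddings are 50-300-dimensional and loaded from the pretrained glove.*.txt files:

```python
import math

def cosine(u, v):
    """Cosine similarity between two word vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

# Toy vectors standing in for real GloVe embeddings (illustrative values).
vectors = {"reside": [0.9, 0.1, 0.3],
           "live":   [0.8, 0.2, 0.35],
           "speak":  [0.1, 0.9, 0.2]}
print(round(cosine(vectors["reside"], vectors["live"]), 3))   # 0.989
print(round(cosine(vectors["reside"], vectors["speak"]), 3))  # 0.271
```

With such scores, "reside" would be mapped to the ontology property "Live" rather than "Speak", as in the worked example.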

Conclusion and Future Works


In this work, the UTtoKB model is proposed for extracting structured information from unstructured text with the help of a domain-specific ontology knowledge base. The model is built using NLP tasks from the AllenNLP platform [27] to extract the initial triples. The extracted triples are then refined to better fit the ontology's concepts and properties using WordNet similarity techniques. The model's results depend on how well semantic similarities can be found between the extracted RDF triples and the domain ontology's concepts and properties. The results showed that WordNet is not a reliable similarity tool because of the limited information it offers in some subject areas. Other semantic-similarity tests can be implemented instead, including Global Vector (GloVe) representations with cosine similarity [25] or Euclidean distance [26]. GloVe is an unsupervised vector model introduced by Stanford University that represents each word in a corpus as a single low-dimensional vector [24]. It is trained on a co-occurrence matrix that records how frequently a word appears together with other words in a corpus. Table 4 shows comparative results for the semantic similarities between entities and classes using the WordNet similarity test and the GloVe vector test. In future work, we intend to find an efficient way to improve the model's performance in ontology mapping.

Table 4: Similarity check with different methods

REFERENCES
1. Martinez-Rodriguez, J.L., Hogan, A., Lopez-Arevalo, I.: Information extraction meets the semantic web: a survey. Semantic
Web. 11, 255–335 (2020)
2. Guarino, N., Oberle, D., Staab, S.: What is an ontology? In: Handbook on ontologies. pp. 1–17. Springer (2009)
3. Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., Van Kleef, P.,
Auer, S.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web. 6, 167-195 (2015)
4. Fellbaum, C.: WordNet. In: Theory and applications of ontology: computer applications. pp. 231–243. Springer (2010)
5. Papadopoulos, D., Papadakis, N., Litke, A.: A methodology for open information extraction and representation from large
scientific corpora: the CORD-19 data exploration use case. Applied Sciences. 10, 5630 (2020)
6. Karkaletsis, V., Fragkou, P., Petasis, G., Iosif, E.: Ontology based information extraction from text. In: Knowledge-Driven
Multimedia Information Extraction and Ontology Evolution. pp. 89–109. Springer (2011)
7. Kertkeidkachorn, N., Ichise, R.: T2KG: An end-to-end system for creating knowledge graph from unstructured text. Presented
at the Workshops at the Thirty-First AAAI Conference on Artificial Intelligence (2017)
8. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint
arXiv:1301.3781. (2013)
9. Etzioni, O., Bart, R.E., Schmitz, M.D., Soderland, S.G.: Open language learning for information extraction. (2014)
10. Exner, P., Nugues, P.: Entity extraction: From unstructured text to DBpedia RDF triples. Presented at the The Web of Linked
Entities Workshop (WoLE 2012) (2012)
11. Augenstein, I., Padó, S., Rudolph, S.: Lodifier: Generating linked data from unstructured text. Presented at the Extended
Semantic Web Conference (2012)
12. Nebhi, K.: Ontology-based information extraction from twitter. Presented at the Proceedings of the Workshop on Information
Extraction and Entity Analytics on Social Media Data (2012)
13. Rosyiq, A., Hayah, A.R., Hidayanto, A.N., Naisuty, M., Suhanto, A., Budi, N.F.A.: Information extraction from Twitter
using DBpedia ontology: Indonesia tourism places. Presented at the 2019 International Conference on Informatics, Multimedia, Cyber
and Information System (ICIMCIS) (2019)
14. Anantharangachar, R., Ramani, S., Rajagopalan, S.: Ontology guided information extraction from unstructured text. arXiv
preprint arXiv:1302.1335. (2013)
15. Batouche, B., Gardent, C., Monceaux, A., Blagnac, F.: Parsing text into RDF graphs. Presented at the Proceedings of the
XXXI Congress of the Spanish Society for the Processing of Natural Language (2014)
16. Abburu, S., Golla, S.B.: Ontology and NLP support for building disaster knowledge base. Presented at the 2017 2nd
International Conference on Communication and Electronics Systems (ICCES) (2017)
17. Rizvi, S.T.R., Mercier, D., Agne, S., Erkel, S., Dengel, A., Ahmed, S.: Ontology-based Information Extraction from
Technical Documents. Presented at the ICAART (2) (2018)
18. Papadias, E., Kokla, M., Tomai, E.: Educing knowledge from text: semantic information extraction of spatial concepts and
places. AGILE: GIScience Series. 2, 1–7 (2021)
19. Abdelmagid, M., Ahmed, A., Himmat, M.: Information Extraction methods and extraction techniques in the chemical
document’s contents: Survey. ARPN Journal of Engineering and Applied Sciences. 10, 1068–1073 (2015)
20. Grishman, R.: Information extraction. IEEE Intelligent Systems. 30, 8–15 (2015)
21. Wimalasuriya, D.C., Dou, D.: Components for information extraction: Ontology-based information extractors and generic
platforms. Presented at the Proceedings of the 19th ACM international conference on Information and knowledge management (2010)
22. Dung, T.Q., Kameyama, W.: Ontology-based information extraction and information retrieval in health care domain.
Presented at the International Conference on Data Warehousing and Knowledge Discovery (2007)
23. Sivakumar, R., Arivoli, P.: Ontology visualization PROTÉGÉ tools–a review. International Journal of Advanced Information
Technology (IJAIT) Vol. 1, (2011)
24. Pennington, J., Socher, R., Manning, C.D.: Glove: Global vectors for word representation. Presented at the Proceedings of
the 2014 conference on empirical methods in natural language processing (EMNLP) (2014)
25. Rahutomo, F., Kitasuka, T., Aritsugi, M.: Semantic cosine similarity. Presented at the The 7th International Student
Conference on Advanced Science and Technology ICAST (2012)
26. Vijaymeena, M., Kavitha, K.: A survey on similarity measures in text mining. Machine Learning and Applications: An
International Journal. 3, 19–28 (2016)
27. Gardner, M., Grus, J., Neumann, M., Tafjord, O., Dasigi, P., Liu, N., Peters, M., Schmitz, M., Zettlemoyer, L.: Allennlp: A
deep semantic natural language processing platform. arXiv preprint arXiv:1803.07640. (2018)
