Semantic Web
Semantic Web
Ian Horrocks
<[email protected]>
Information Management Group
School of Computer Science
University of Manchester
The Semantic Web
Today’s Web
• Distributed hypertext/hypermedia
• Information accessed via (keyword based) search and browse
• Browser tools render information for human consumption
What is the Semantic Web?
• Web was “invented” by Tim Berners-Lee (amongst others), a physicist
working at CERN
• His vision of the Web was much more ambitious than the reality of the
existing (syntactic) Web:
• This vision of the Web has become known as the Semantic Web
Hard Work using “Syntactic Web”
Find images of Peter Patel-Schneider, Frank van Harmelen and
Alan Rector…
Dr. Alan
<Person>Alan
Rector, Professor
Rector</Person>,
of Computer Rev. Alan
<Person>Alan
M. Gates,M.Associate
Gates</Person>,
Rector of the
<Job>Professor
Science, University
of Computer
of Manchester
Science</Job>, <Job>Associate
Church of the Holy
Rector</Job>
Spirit, Lake of
Forest,
the Church
Illinoisof
University of Manchester the Holy Spirit, Lake Forest, Illinois
What is the (Proposed) Solution?
Central Sulcus
Parietal Lobe
Frontal Lobe
Occipital
LobeTemporal Lobe
Lateral Sulcus
Applications of Ontologies
• Organising complex and semi-structured information
– UN-FAO, NASA, Ordnance Survey, General Motors,
Lockheed Martin, …
Applications of Ontologies
• Military/Government
– DARPA, NSA, NIST, SAIC, MoD, Department of Homeland
Security, …
• The Semantic Web and so-called Semantic Grid
Ontology Languages
Ontology Languages for the Web
• Semantic Web effort led to development of “resource description”
language(s)
– E.g., RDF, and later RDF Schema (RDFS)
• RDFS is recognisable as an ontology language
– Classes and properties
– Sub/super-classes (and properties)
– Range and domain (of properties)
• But RDFS too weak to describe resources in sufficient detail, e.g.:
– No existence/cardinality constraints
– No transitive, inverse or symmetrical properties
– No localised range and domain constraints
– …
• And RDF(S) has “higher order flavour” with non-standard semantics
– Difficult to provide reasoning support
From RDFS to OWL
• Two languages developed to address deficiencies & problems of RDFS:
– OIL: developed by group of (largely) European researchers
– DAML-ONT: developed by group of (largely) US researchers
• Efforts merged to produce DAML+OIL
– Development carried out by “Joint EU/US Committee on Agent Markup
Languages”
• DAML+OIL submitted to as basis for standardisation
– Web-Ontology (WebOnt) Working Group formed
– WebOnt developed OWL language based on DAML+OIL
– OWL now a W3C recommendation (i.e., a standard)
• OIL, DAML+OIL and OWL based on Description Logics
– OWL is effectively a “Web-friendly” syntax for SHOIN
What Are Description Logics?
• A family of logic based Knowledge Representation
formalisms
– Descendants of semantic networks and KL-ONE
– Describe domain in terms of concepts (classes), roles
(properties, relationships) and individuals
– Operators allow for composition of complex concepts
– Names can be given to complex concepts, e.g.:
Animal
IS-A
has-color
Cat Black
IS-A
Felix Mat
sits-on
[Quillian, 1967]
Semantics and Reasoning
• Distinguished by:
– Formal semantics (typically model theoretic)
• Decidable fragments of FOL (often contained in C2)
• Closely related to Propositional Modal & Dynamic Logics, and
to Guarded Fragment
– Provision of reasoning services
• Decision procedures for key problems
(satisfiability, subsumption, etc)
• Implemented systems (highly optimised)
Why Description Logic?
• OWL exploits results of 15+ years of DL research
– Well defined (model theoretic) semantics
Why Description Logic?
• OWL exploits results of 15+ years of DL research
– Well defined (model theoretic) semantics
– Formal properties well understood (complexity,
decidability)
I can’t find an efficient algorithm, but neither can all these famous people.
Pellet
Why Description Logic?
• Foundational research was crucial to design of OWL
– Informed Working Group decisions at every stage, e.g.:
• “Why not extend the language with feature x, which is clearly
harmless?”
<owl:Class>
<owl:intersectionOf rdf:parseType=" collection">
<owl:Class rdf:about="#Parent"/>
<owl:Restriction>
<owl:onProperty rdf:resource="#hasChild"/>
<owl:allValuesFrom>
<owl:unionOf rdf:parseType=" collection">
<owl:Class rdf:about="#Intelligent"/>
<owl:Class rdf:about="#Athletic"/>
</owl:unionOf>
</owl:allValuesFrom>
</owl:Restriction>
</owl:intersectionOf>
</owl:Class>
Why Ontology Reasoning?
• Given key role of ontologies in many applications, it is essential to
provide tools and services to help users:
– Design and maintain high quality ontologies, e.g.:
• Meaningful — all named classes can have instances
Why Ontology Reasoning?
• Given key role of ontologies in many applications, it is essential to
provide tools and services to help users:
– Design and maintain high quality ontologies, e.g.:
• Meaningful — all named classes can have instances
• Correct — captures intuitions of domain experts
Why Ontology Reasoning?
• Given key role of ontologies in many applications, it is essential to
provide tools and services to help users:
– Design and maintain high quality ontologies, e.g.:
• Meaningful — all named classes can have instances
• Correct — captures intuitions of domain experts
• Minimally redundant — no unintended synonyms
Banana split Banana sundae
Why Ontology Reasoning?
• Given key role of ontologies in many applications, it is essential to
provide tools and services to help users:
– Design and maintain high quality ontologies, e.g.:
• Meaningful — all named classes can have instances
• Correct — captures intuitions of domain experts
• Minimally redundant — no unintended synonyms
– Answer queries over ontology classes and instances, e.g.:
• Find more general/specific classes
• Retrieve individuals/tuples matching a given query
Research Challenges
Increasing Expressive Power
• Complex role inclusion axioms [Horrocks&Sattler, IJCAI-03]
– E.g., hasLocation ± partOf v hasLocation
• Concrete domains/datatypes, e.g., [Lutz, IJCAI-99; Pan et al, ISWC-03]
– E.g., value comparison (income > expenditure)
• Database style keys [Lutz et al, JAIR 2004]
– E.g., make + model + chassis-number is a key for Vehicles
• Rule language extensions
– First order extensions (e.g., SWRL) [Horrocks et al, JWS, 2005]
– Hybrid language extensions, e.g., [Eiter et al, KR-04; Motik et al, ISWC-04]
– LP/F-Logic/Common Logic [Chen et al, JLP, 1993; de Bruijn et al, WWW-05]
Improving Scalability
• Optimisation techniques
– Improve performance of DL reasoners, e.g., [Sirin et al, KR-06]
• Reduction to disjunctive Datalog [Motik et at, KR-04]
– Transform DL ontology to DatalogÇ rules
– Use LP techniques to deal with large numbers of ground facts
• Hybrid DL-DB systems [Horrocks et al, CADE-05]
– Use DB to store “Abox” (individual) axioms
– Cache inferences and use DB queries to answer/scope logical queries
• Polynomial time algorithms for sub-ALC logics [Baader et al, IJCAI-05]
– Graph based techniques for subsumption computation
Tools and Infrastructure
• Editors/environments
– Oiled, Protégé, Swoop, Construct, Ontotrack, …
Tools and Infrastructure
• Editors/environments
– Oiled, Protégé, Swoop, Construct, Ontotrack, …
• Reasoning systems
– Cerebra, FaCT++, Kaon2, Pellet, Racer, …
Pellet
Tools and Infrastructure
• Editors/environments
– Oiled, Protégé, Swoop, Construct, Ontotrack, …
• Reasoning systems
– Cerebra, FaCT++, Kaon2, Pellet, Racer, …
• Non-standard inferences
– Explanation, matching, least common subsumer, …
Tools and Infrastructure
• Editors/environments
– Oiled, Protégé, Swoop, Construct, Ontotrack, …
• Reasoning systems
– Cerebra, FaCT++, Kaon2, Pellet, Racer, …
• Non-standard inferences
– Explanation, matching, least common subsumer, …
• Design methodologies Entity
– Foundational ontologies,
modularisation, etc. Endurant Perdurant
Achievement Accomplishment
Summary
• Semantic Web aims to make web content more
accessible to automated processes
– Adds semantic annotations to web resources
– Alan Rector
– Franz Baader
– Uli Sattler
Resources
• FaCT++ system (open source)
– http://owl.man.ac.uk/factplusplus/
• Protégé
– http://protege.stanford.edu/plugins/owl/
Any questions?