Lecture 5

This lecture discusses the differences between XML and HTML, highlighting XML's advantages in structural information and machine readability. It covers the role of XML in the Semantic Web, its standard structure, and the importance of well-formed documents and namespaces. Additionally, it introduces addressing and querying XML documents using path expressions and XPath examples.

LECTURE 5

Knowledge Representation : AI 303

By
Dr. Ashraf Hendam
OUTLINE
• XML vs HTML
• Problems with Automated Interpretation of HTML Documents
• HTML vs XML: Structural Information
• HTML vs XML: Different Use of Tags
• Role of XML in the Semantic Web
• XML Standard structure
• Addressing & Querying XML Documents
• Types of Path Expressions
Semantic Web layers
XML vs HTML
• HTML
<h2>Nonmonotonic Reasoning: Context-Dependent Reasoning</h2>
<i>by <b>V. Marek</b> and
<b>M. Truszczynski</b>
</i>
<br>Springer 1993
<br> ISBN 0387976892
• XML
<book>
<title>Nonmonotonic Reasoning: Context-Dependent Reasoning</title>
<author>V. Marek</author> <author>M. Truszczynski</author>
<publisher>Springer</publisher> <year>1993</year>
<ISBN>0387976892</ISBN>
</book>
XML vs HTML
• Both use tags (e.g. <h2> and <year>)
• Tags may be nested (tags within tags)
• Human users can read and interpret
both HTML and XML representations
quite easily
• … But how about machines?
Problems with Automated Interpretation of
HTML Documents
• An intelligent agent trying to retrieve the names of
the authors of the book
• Authors’ names could appear immediately after
the title or immediately after the word by
• Are there two authors?
• Or just one, called “V. Marek and M. Truszczynski”?
HTML vs XML: Structural Information
• HTML documents do not contain structural information,
i.e., information about the pieces of the document and their relationships
• HTML describes only presentation
• XML is more easily accessible to machines because
– every piece of information is described
– relations are also defined through the nesting structure
E.g., the <author> tags appear within the <book> tags,
so they describe properties of that particular book.
HTML vs XML: Structural Information
• A machine processing the XML document can deduce from the
nesting, rather than from proximity considerations, that
the author element refers to the enclosing book element
• XML allows the definition of constraints on values
• E.g., a year must be a four-digit number
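The point about machine accessibility can be illustrated with a minimal sketch in Python, assuming the standard-library xml.etree.ElementTree parser. The helper names authors_of and year_is_four_digits are hypothetical and introduced only for this illustration; the four-digit check is hand-written here, whereas in practice such a constraint would be declared in a schema.

```python
import xml.etree.ElementTree as ET

# The book record from the XML vs HTML slide.
BOOK_XML = """
<book>
  <title>Nonmonotonic Reasoning: Context-Dependent Reasoning</title>
  <author>V. Marek</author>
  <author>M. Truszczynski</author>
  <publisher>Springer</publisher>
  <year>1993</year>
  <ISBN>0387976892</ISBN>
</book>
"""


def authors_of(book_xml):
    """Return the text of every <author> child of the root <book> element."""
    book = ET.fromstring(book_xml)
    return [author.text for author in book.findall("author")]


def year_is_four_digits(book_xml):
    """Hand-written version of the example constraint on <year>."""
    year = ET.fromstring(book_xml).findtext("year", default="")
    return len(year) == 4 and year.isdigit()


print(authors_of(BOOK_XML))
print(year_is_four_digits(BOOK_XML))
```

Because each author sits in its own element, the program does not have to guess whether "V. Marek and M. Truszczynski" is one name or two.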
HTML vs XML: Another Example
In HTML
<h2>Relationship force-mass</h2>
<i> F = M × a </i>
In XML
<equation>
<meaning>Relationship force-mass</meaning>
<leftside> F </leftside>
<rightside> M × a </rightside>
</equation>
HTML vs XML: Different Use of Tags
• In HTML documents the same fixed set of tags is used everywhere
• In XML the tags can be completely different (for different meanings)
• HTML tags define display: color, lists …
• XML tags are not fixed: tags are user-definable
• XML is a meta markup language:
a language for defining markup languages
Role of XML in the Semantic Web

• The Semantic Web involves ideas and languages
at a fairly abstract level, e.g., for defining
ontologies and publishing data using them
• We also need a practical way of encoding the
abstract languages
• Today’s Web technology is (still) largely based on
XML standards
Role of XML in the Semantic Web
• XML is a
(1) source of many key SW concepts and technology
(2) potential alternative the SW must improve on
(3) common serialization for SW data.
• XML: Meant for Serialization
• A serialization format is a way to encode information so that
when it’s passed between machines it can be parsed.
• In fact, the popularity of XML is due to its addressing the
problem of too many file formats.
XML Standard structure
• An XML document consists of
1. a prolog
2. a number of elements (the body)
Prolog of an XML Document
The prolog consists of the XML declaration and an optional
reference to a DTD (Document Type Definition). If the prolog
is present, it must come at the beginning of the document.
<?xml version="1.0" encoding="UTF-16"?>
<!DOCTYPE book SYSTEM "book.dtd">
XML Elements
Elements are the “things” the XML document talks about
– E.g., books, authors, publishers
An element consists of:
– an opening tag
– the content
– a closing tag
<lecturer> David Billington </lecturer>
Content of XML Elements
• Content is what’s between the tags
• It can be text, or other elements, or nothing
<lecturer>
<name>David Billington</name>
<phone> +61 − 7 − 3875 507 </phone>
</lecturer>
• If there is no content, then element is called empty; it can be
abbreviated as follows:
• <lecturer/> = <lecturer></lecturer>
XML Attributes
An empty element isn’t necessarily meaningless
– It may have properties expressed as attributes
An attribute is a name-value pair inside the opening tag of an
element
<order orderNo="23456"
customer="John Smith"
date="October 15, 2017">
<item itemNo="a528" quantity="1"/>
<item itemNo="c817" quantity="3"/>
</order>
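As a small illustration of attributes on empty elements, the order document can be read with Python's standard-library xml.etree.ElementTree. This is only a sketch; the variable names are chosen for the example, and the quotation marks are written as plain straight quotes.

```python
import xml.etree.ElementTree as ET

ORDER_XML = """
<order orderNo="23456" customer="John Smith" date="October 15, 2017">
  <item itemNo="a528" quantity="1"/>
  <item itemNo="c817" quantity="3"/>
</order>
"""

order = ET.fromstring(ORDER_XML)

# Attributes are name-value pairs; the parser returns every value as a string.
customer = order.get("customer")

# Map each item number to its quantity, converting the string to an integer.
quantities = {
    item.get("itemNo"): int(item.get("quantity"))
    for item in order.findall("item")
}

# The <item/> elements are empty: no text content and no child elements.
items_are_empty = all(item.text is None and len(item) == 0 for item in order)

print(customer, quantities, items_are_empty)
```

The empty item elements carry all of their information in attributes, which is why an empty element is not necessarily meaningless.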
Well-Formed XML Documents
A well-formed document satisfies these syntactic constraints:
– There is only one outermost element (the root element)
– Each element has an opening tag and a corresponding closing tag
(except self-closing tags like <foo/>)
– Tags may not overlap; the following is not well-formed:
<author><name>Lee Hong</author></name>
– Attributes within an element have unique names
– Element and attribute names must be permissible,
e.g., a name cannot begin with a digit, as in “2ndbest”
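These constraints can be checked mechanically. The sketch below assumes Python's standard-library xml.etree.ElementTree, whose parser raises ParseError on input that is not well-formed. The function name is_well_formed and the sample strings are illustrative. Note that this tests well-formedness only, not validity against a DTD or schema.

```python
import xml.etree.ElementTree as ET


def is_well_formed(text):
    """Return True if the parser accepts `text` as well-formed XML."""
    try:
        ET.fromstring(text)
    except ET.ParseError:
        return False
    return True


# One sample per constraint listed above (labels are illustrative).
samples = {
    "nested correctly": "<author><name>Lee Hong</name></author>",
    "overlapping tags": "<author><name>Lee Hong</author></name>",
    "two root elements": "<a/><b/>",
    "duplicate attribute": '<item itemNo="a528" itemNo="c817"/>',
    "name starts with a digit": "<2ndbest/>",
}

results = {label: is_well_formed(xml) for label, xml in samples.items()}
for label, ok in results.items():
    print(f"{label}: {ok}")
```

Only the first sample is accepted; each of the others violates exactly one of the listed constraints.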
The Tree Model of XML Docs
The tree representation of an XML document is an
ordered labeled tree:
– There is exactly one root
– There are no cycles
– Each non-root node has exactly one parent
– Each node has a label.
– The order of elements is important
– … but the order of attributes is not
The Tree Model of XML Docs
<email>
<head>
<from name="Michael Maher" address="[email protected]" />
<to name="Grigoris Antoniou" address="[email protected]" />
<subject>Where is your draft?</subject>
</head>
<body>
Grigoris, where is the draft of the paper you promised me last week?
</body>
</email>
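The tree properties listed earlier can be observed directly on this document. The following is a minimal sketch using Python's standard-library xml.etree.ElementTree; the function outline is a hypothetical helper, and the two address values are placeholders (sender@example.org, recipient@example.org) standing in for the addresses that are obscured in the listing above.

```python
import xml.etree.ElementTree as ET

# Email document from the slide; the address values are placeholders.
EMAIL_XML = """
<email>
  <head>
    <from name="Michael Maher" address="sender@example.org"/>
    <to name="Grigoris Antoniou" address="recipient@example.org"/>
    <subject>Where is your draft?</subject>
  </head>
  <body>Grigoris, where is the draft of the paper you promised me last week?</body>
</email>
"""


def outline(element, depth=0):
    """Yield (depth, label) pairs in document order (pre-order traversal)."""
    yield depth, element.tag
    for child in element:
        yield from outline(child, depth + 1)


root = ET.fromstring(EMAIL_XML)
for depth, label in outline(root):
    print("  " * depth + label)

# Child -> parent map: every non-root node appears once, the root never does.
parent_of = {child: parent for parent in root.iter() for child in parent}
```

Child elements come back in document order, which reflects the "ordered" part of the tree model, while attributes are exposed as an unordered name-to-value mapping.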
Structuring XML Documents
An XML document is valid if
– it is well-formed XML
– Respects the structuring information it uses
Ways to define structure of XML documents:
– DTDs (Document Type Definition) came first, was
based on SGML’s approach
– XML Schema (aka XML Schema Definition, XSD) is
more recent and expressive
– RELAX NG and DSDs are two alternatives
Namespaces
• XML namespaces provide uniquely named
elements & attributes in an XML document
• An XML document may use more than one DTD or schema
• Since each was developed independently,
name collisions can occur
• Solution: use a different prefix for each DTD
or schema
prefix:name
• Namespaces are even more important in RDF
Namespace Declarations
• Namespaces are declared within an element for use
in it and its children (elements and attributes)
• A namespace declaration has the form:
– xmlns:prefix="location"
– location is the URL of the DTD or XML
schema
• If no prefix is specified, as in xmlns="location", then
the location is used as the default namespace for
unprefixed element names
We’ll see this same idea used in RDF
Namespace Declarations
<vu:instructors xmlns:vu="http://www.vu.com/empDTD"
xmlns:gu="http://www.gu.au/empDTD"
xmlns:uky="http://www.uky.edu/empDTD">
<uky:faculty uky:title="assistant professor" uky:name="John Smith"
uky:department="Computer Science"/>
<gu:academicStaff gu:title="lecturer" gu:name="Mate Jones"
gu:school="Information Technology"/>
</vu:instructors>
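To see how a parser resolves prefixes, here is a minimal sketch with Python's standard-library xml.etree.ElementTree, which reports each qualified name in the form {namespace}localname. The namespace names are written here in their plain form (http://www.vu.com/empDTD and so on), and the dictionary NS is an assumption of this example, not part of the document.

```python
import xml.etree.ElementTree as ET

INSTRUCTORS_XML = """
<vu:instructors xmlns:vu="http://www.vu.com/empDTD"
                xmlns:gu="http://www.gu.au/empDTD"
                xmlns:uky="http://www.uky.edu/empDTD">
  <uky:faculty uky:title="assistant professor" uky:name="John Smith"
               uky:department="Computer Science"/>
  <gu:academicStaff gu:title="lecturer" gu:name="Mate Jones"
                    gu:school="Information Technology"/>
</vu:instructors>
"""

# Prefix -> namespace mapping used only for the lookups below.
NS = {
    "vu": "http://www.vu.com/empDTD",
    "gu": "http://www.gu.au/empDTD",
    "uky": "http://www.uky.edu/empDTD",
}

root = ET.fromstring(INSTRUCTORS_XML)

# The prefix is replaced by the namespace it stands for.
root_tag = root.tag

# Element lookups take the prefix mapping; attribute lookups use the expanded name.
faculty = root.find("uky:faculty", NS)
faculty_name = faculty.get("{http://www.uky.edu/empDTD}name")
staff = root.find("gu:academicStaff", NS)
staff_title = staff.get("{http://www.gu.au/empDTD}title")

print(root_tag, faculty_name, staff_title)
```

The prefixes themselves carry no meaning; two documents that bind different prefixes to the same namespace produce the same expanded names.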
Addressing & Querying XML Documents
• In relational databases, parts of a database can be selected
and retrieved using SQL
– Also very useful for XML documents
– Query languages: XQuery, XQL, XML-QL
• The central concept of XML query languages is
a path expression
– Specifies how a node or set of nodes, in the
tree representation, can be reached
• Useful for extracting data from XML
Types of Path Expressions
• Absolute (starting at the root of the tree)
– Syntactically they begin with the symbol /
– It refers to the root of the document (one
level above document’s root element)
• Relative to a context node
An XML Example
<library location="Bremen">
<author name="Henry Wise">
<book title="Artificial Intelligence"/>
<book title="Modern Web Services"/>
<book title="Theory of Computation"/>
</author>
<author name="William Smart">
<book title="Artificial Intelligence"/>
</author>
<author name="Cynthia Singleton">
<book title="The Semantic Web"/>
<book title="Browser Technology Revised"/>
</author>
</library>
Converting XML file to tree representation
To convert an XML file to a tree representation, the following rules are
used:
• Elements are represented by ovals
• Attributes are represented by ovals
• Solid lines connect elements to their child elements and
elements to their attributes
• Attribute values are represented by rectangles
• Dotted lines connect attributes to their values
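The drawing rules above can be sketched as a function that lists the edges of the tree instead of drawing it. This is an illustrative sketch in Python using the standard-library xml.etree.ElementTree; the function to_edges, the node-naming scheme (tag or @attribute followed by a position path), and the shortened one-author version of the library document are all assumptions made for the example.

```python
import xml.etree.ElementTree as ET

# Shortened version of the library example: one author with one book.
LIBRARY_XML = """<library location="Bremen">
  <author name="Henry Wise">
    <book title="Artificial Intelligence"/>
  </author>
</library>"""


def to_edges(element, path="0"):
    """Return (parent, child, line_style) triples following the drawing rules."""
    edges = []
    node = f"{element.tag}#{path}"
    for name, value in element.attrib.items():
        attr = f"@{name}#{path}"
        edges.append((node, attr, "solid"))    # element to attribute
        edges.append((attr, value, "dotted"))  # attribute to its value
    for i, child in enumerate(element):
        child_path = f"{path}.{i}"
        edges.append((node, f"{child.tag}#{child_path}", "solid"))  # element to element
        edges.extend(to_edges(child, child_path))
    return edges


edges = to_edges(ET.fromstring(LIBRARY_XML))
for parent, child, style in edges:
    print(f"{parent} --{style}--> {child}")
```

Each element contributes one solid edge per attribute and per child element, and each attribute contributes one dotted edge to its value.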
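Converting tree representation to XML file (assignment)
[Tree figure; node labels by level, from the root down]
Level 1: bookstore
Level 2: book
Level 3: category, title, author, price, edition
Level 4: cooking, lang, name, currency, amount, order, year
Level 5: en, Giada, dollar, 30, first, 2005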

Well-Formed XML Documents Example

Not well-formed (a name starting with a digit, a duplicated attribute, overlapping tags):
<bookstore>
<book>
<7title lang="en" lang="ar">Harry Potter</title>
<price>29.99</book></price>
</book>
<book>
<title lang="en">Learning XML</title>
<price>39.95</price>
</book>
</bookstore>

Well-formed:
<bookstore>
<book>
<title lang="en">Harry Potter</title>
<price>29.99</price>
</book>
<book>
<title lang="en">Learning XML</title>
<price>39.95</price>
</book>
</bookstore>
Examples of Path Expressions in XPath
Q1: /library/author
– Addresses all author elements that are children of
the library element node immediately below root
– /t1/.../tn, where each ti+1 is a child node of ti, is a
path through the tree representation
Q2: //author
– Consider all elements in document and check
whether they are of type author
– Path expression addresses all author elements
anywhere in the document
Examples of Path Expressions in XPath
Q3: /library/@location
– Addresses location attribute nodes within library
element nodes
– The symbol @ is used to denote attribute nodes
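Q4: //book/@title="Artificial Intelligence"
– Addresses all title attribute nodes within book
elements anywhere in the document that have the
value “Artificial Intelligence”
– In standard XPath 1.0 this comparison yields a Boolean; the matching
attribute nodes themselves are selected by
//book/@title[.="Artificial Intelligence"]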
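Queries Q1 to Q4 can be tried out with a short Python sketch. It assumes the standard-library xml.etree.ElementTree, which supports only a subset of XPath: paths are written relative to the root element (so /library/author becomes ./author), and attribute nodes cannot be selected by a path step, so attribute values are read with get() instead. The variable names q1 to q4 are illustrative.

```python
import xml.etree.ElementTree as ET

LIBRARY_XML = """<library location="Bremen">
  <author name="Henry Wise">
    <book title="Artificial Intelligence"/>
    <book title="Modern Web Services"/>
    <book title="Theory of Computation"/>
  </author>
  <author name="William Smart">
    <book title="Artificial Intelligence"/>
  </author>
  <author name="Cynthia Singleton">
    <book title="The Semantic Web"/>
    <book title="Browser Technology Revised"/>
  </author>
</library>"""

library = ET.fromstring(LIBRARY_XML)  # the root element, <library>

# Q1 /library/author: author elements that are children of the library element.
q1 = [a.get("name") for a in library.findall("./author")]

# Q2 //author: author elements anywhere in the document.
q2 = [a.get("name") for a in library.findall(".//author")]

# Q3 /library/@location: the location attribute of the library element.
q3 = library.get("location")

# Q4: title attributes with the value "Artificial Intelligence",
# reached here through the book elements that carry them.
q4 = [
    b.get("title")
    for b in library.findall(".//book[@title='Artificial Intelligence']")
]

print(q1, q2, q3, q4, sep="\n")
```

In this document Q1 and Q2 return the same three authors, because every author element happens to be a direct child of library.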
Examples of Path Expressions in XPath
Q6: Address the first author element node in the XML
document
//author[1]
Q7: Address the last book element within the first
author element node in the document
//author[1]/book[last()]
Q8: Address all book element nodes without a title
attribute
//book[not(@title)]
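The positional queries Q6 and Q7 can be sketched with Python's standard-library xml.etree.ElementTree, which accepts [1] and [last()] predicates after a tag name. Its XPath subset has no not() function, so Q8 (in standard XPath, //book[not(@title)]) is emulated below with an ordinary filter; that substitution is an assumption of this sketch, not an XPath feature.

```python
import xml.etree.ElementTree as ET

LIBRARY_XML = """<library location="Bremen">
  <author name="Henry Wise">
    <book title="Artificial Intelligence"/>
    <book title="Modern Web Services"/>
    <book title="Theory of Computation"/>
  </author>
  <author name="William Smart">
    <book title="Artificial Intelligence"/>
  </author>
  <author name="Cynthia Singleton">
    <book title="The Semantic Web"/>
    <book title="Browser Technology Revised"/>
  </author>
</library>"""

library = ET.fromstring(LIBRARY_XML)

# Q6 //author[1]: the first author element.
q6 = [a.get("name") for a in library.findall(".//author[1]")]

# Q7 //author[1]/book[last()]: the last book of the first author.
q7 = [b.get("title") for b in library.findall(".//author[1]/book[last()]")]

# Q8: book elements without a title attribute, emulated with a Python filter
# because this XPath subset does not provide not().
q8 = [b for b in library.iter("book") if "title" not in b.attrib]

print(q6, q7, q8, sep="\n")
```

Every book in this document has a title attribute, so the Q8 result is empty here.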
Examples of Path Expressions in XPath
<?xml version="1.0" encoding="UTF-8"?>
<bookstore>
<book>
<title lang="en">Harry Potter</title>
<price>29.99</price>
</book>
<book>
<title lang="en">Learning XML</title>
<price>39.95</price>
</book>
</bookstore>
Examples of Path Expressions in XPath
Expression        Description
nodename          Selects all nodes with the name "nodename"
/                 Selects from the root node
//                Selects nodes in the document from the current node that match the selection, no matter where they are
.                 Selects the current node
..                Selects the parent of the current node
@                 Selects attributes
/bookstore        Selects the root element bookstore
bookstore/book    Selects all book elements that are children of bookstore
//book            Selects all book elements no matter where they are in the document
bookstore//book   Selects all book elements that are descendants of the bookstore element
//@lang           Selects all attributes that are named lang
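Several of these expressions can be tried on the bookstore document with a short Python sketch. It assumes the standard-library xml.etree.ElementTree, which implements only part of XPath: paths are evaluated relative to the root element, and attribute nodes such as //@lang are not selectable by path, so they are collected with a loop instead. The variable names are illustrative.

```python
import xml.etree.ElementTree as ET

BOOKSTORE_XML = b"""<?xml version="1.0" encoding="UTF-8"?>
<bookstore>
  <book>
    <title lang="en">Harry Potter</title>
    <price>29.99</price>
  </book>
  <book>
    <title lang="en">Learning XML</title>
    <price>39.95</price>
  </book>
</bookstore>"""

bookstore = ET.fromstring(BOOKSTORE_XML)  # /bookstore: the root element

# bookstore/book: book elements that are children of bookstore.
child_titles = [b.findtext("title") for b in bookstore.findall("book")]

# //book (here equivalent to bookstore//book): book elements at any depth.
all_books = bookstore.findall(".//book")

# //@lang: every attribute named lang, gathered by iterating over all elements.
langs = [e.get("lang") for e in bookstore.iter() if "lang" in e.attrib]

# ..: the parent of each title element.
title_parents = [e.tag for e in bookstore.findall(".//title/..")]

print(bookstore.tag, child_titles, len(all_books), langs, title_parents)
```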
Examples of Path Expressions in XPath
Expression                       Description
[1]                              Selects the first element that is a child of the context element
[last()]                         Selects the last element that is a child of the context element
[position()<3]                   Selects the first two elements that are children of the context element
//element[@attribute]            Selects all elements named element that have an attribute named attribute
/bookstore/book[1]               Selects the first book element that is a child of the bookstore element
/bookstore/book[last()]          Selects the last book element that is a child of the bookstore element
/bookstore/book[position()<3]    Selects the first two book elements that are children of the bookstore element
//title[@lang]                   Selects all the title elements that have an attribute named lang
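The predicates in this table can be sketched in Python on the same bookstore document. The example assumes the standard-library xml.etree.ElementTree, which supports [1], [last()] and [@attribute] but not position()<3; the last of these is emulated with a list slice, which is a substitution made for this sketch.

```python
import xml.etree.ElementTree as ET

BOOKSTORE_XML = b"""<?xml version="1.0" encoding="UTF-8"?>
<bookstore>
  <book>
    <title lang="en">Harry Potter</title>
    <price>29.99</price>
  </book>
  <book>
    <title lang="en">Learning XML</title>
    <price>39.95</price>
  </book>
</bookstore>"""

bookstore = ET.fromstring(BOOKSTORE_XML)

# /bookstore/book[1]: the first book child of bookstore.
first_title = bookstore.find("book[1]").findtext("title")

# /bookstore/book[last()]: the last book child of bookstore.
last_price = float(bookstore.find("book[last()]").findtext("price"))

# /bookstore/book[position()<3]: the first two book children, taken by slicing.
first_two = [b.findtext("title") for b in bookstore.findall("book")[:2]]

# //title[@lang]: title elements that have a lang attribute.
titled = [t.text for t in bookstore.findall(".//title[@lang]")]

print(first_title, last_price, first_two, titled)
```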
