XPath
GANESAN CHANDRASEKARAN # 51450717
What is XPath?
• XPath is a major element in XSLT
• XPath can be used to navigate through elements and attributes in an XML document.
• XPath is a syntax for defining parts of an XML document
• XPath uses path expressions to navigate in XML documents
• XPath contains a library of standard functions
• XPath is a major element in the XSLT standard
11/16/2019 FOR INTERNAL TRAINING ONLY 2
XPath Terminology
• Nodes:
• In XPath, there are seven kinds of nodes
• Element
• Attribute
• Text
• Namespace
• processing-instruction
• Comment
• document nodes.
• Atomic values:
• Atomic values are nodes with no children or parent.
• Parent, Children, Siblings, Ancestors, Descendants
11/16/2019 FOR INTERNAL TRAINING ONLY 3
XPath Syntax
• XPath uses path expressions to select nodes or node-sets in an XML document. The node is selected by
following a path or steps
• The most useful path expressions are listed below
Expression Description
nodename Selects all nodes with the name "nodename"
/ Selects from the root node
// Selects nodes in the document from the current node that match the selection no matter where
they are
. Selects the current node
.. Selects the parent of the current node
@ Selects attributes
11/16/2019 FOR INTERNAL TRAINING ONLY 4
Example:
<?xml version="1.0" encoding="UTF-8"?>
<bookstore> Path Expression Result
bookstore Selects all nodes with the name "bookstore"
<book> /bookstore Selects the root element bookstoreNote: If the path starts with a slash ( / ) it always represents an absolute path to an element!
<title lang="en">Harry Potter</title>
bookstore/book Selects all book elements that are children of bookstore
<price>29.99</price> //book Selects all book elements no matter where they are in the document
</book> bookstore//book Selects all book elements that are descendant of the bookstore element, no matter where they are under the bookstore element
<book> //@lang Selects all attributes that are named lang
<title lang="en">Learning XML</title>
<price>39.95</price>
</book>
</bookstore>
11/16/2019 FOR INTERNAL TRAINING ONLY 5
Predicates
• Predicates are used to find a specific node or a node that contains a specific value.
• Predicates are always embedded in square brackets.
Path Expression Result
/bookstore/book[1] Selects the first book element that is the child of the bookstore element.Note: In IE 5,6,7,8,9 first node is[0], but
according to W3C, it is [1]. To solve this problem in IE, set the SelectionLanguage to XPath:
In JavaScript: xml.setProperty("SelectionLanguage","XPath");
/bookstore/book[last()] Selects the last book element that is the child of the bookstore element
/bookstore/book[last()-1] Selects the last but one book element that is the child of the bookstore element
/bookstore/book[position()<3] Selects the first two book elements that are children of the bookstore element
//title[@lang] Selects all the title elements that have an attribute named lang
//title[@lang='en'] Selects all the title elements that have a "lang" attribute with a value of "en"
/bookstore/book[price>35.00] Selects all the book elements of the bookstore element that have a price element with a value greater than 35.00
/bookstore/book[price>35.00]/title Selects all the title elements of the book elements of the bookstore element that have a price element with a value
greater than 35.00
11/16/2019 FOR INTERNAL TRAINING ONLY 6
Unknown Nodes
• XPath wildcards can be used to select unknown XML nodes.
Wildcard Description
* Matches any element node
@* Matches any attribute node
node() Matches any node of any kind
• In the table below we have listed some path expressions and the result of the expressions:
Path Expression Result
/bookstore/* Selects all the child element nodes of the bookstore element
//* Selects all elements in the document
//title[@*] Selects all title elements which have at least one attribute of any kind
• By using the | operator in an XPath expression you can select several paths.
11/16/2019 FOR INTERNAL TRAINING ONLY 7
XPath Axes
• An axis defines a node-set relative to the current node.
AxisName Result
ancestor Selects all ancestors (parent, grandparent, etc.) of the current node
ancestor-or-self Selects all ancestors (parent, grandparent, etc.) of the current node and the current node itself
attribute Selects all attributes of the current node
child Selects all children of the current node
descendant Selects all descendants (children, grandchildren, etc.) of the current node
descendant-or-self Selects all descendants (children, grandchildren, etc.) of the current node and the current node itself
following Selects everything in the document after the closing tag of the current node
following-sibling Selects all siblings after the current node
namespace Selects all namespace nodes of the current node
parent Selects the parent of the current node
preceding Selects all nodes that appear before the current node in the document, except ancestors, attribute
nodes and namespace nodes
preceding-sibling Selects all siblings before the current node
self Selects the current node
11/16/2019 FOR INTERNAL TRAINING ONLY 8
XPath Operators
• An XPath expression returns either a node-set, a string, a Boolean, or a number.
Operator Description Example
| Computes two node-sets //book | //cd
+ Addition 6+4
- Subtraction 6-4
* Multiplication 6*4
div Division 8 div 4
= Equal price=9.80
!= Not equal price!=9.80
< Less than price<9.80
<= Less than or equal to price<=9.80
> Greater than price>9.80
>= Greater than or equal to price>=9.80
or or price=9.80 or price=9.70
and and price>9.00 and price<9.90
mod Modulus (division remainder) 5 mod 2
11/16/2019 FOR INTERNAL TRAINING ONLY 9