Xquery Tutorial: Dan Goldberg & Craig Knoblock University of Southern California
Xquery Tutorial: Dan Goldberg & Craig Knoblock University of Southern California
References
XQuery 1.0: An XML Query Language
www.w3.org/TR/xquery/
XQuery Tutorial
http://www.datadirect.com/developer/xquery/xquery_tutorial/ind ex.ssp
XQuery Tutorial
http://www.w3schools.com/xquery/default.asp
Xquery Overview
Xquery is an expression language
Every statement evaluates to some result
let $x := 5 let $y := 6 return 10*$x+$y Evaluates to 56
Primitive types
Number, boolean, strings, dates, times, durations, and XML types
Path Expressions
Xquery uses path expressions from Xpath (a W3C standard) Let $b := doc("bib.xml") return <result>{$b/bib/book}</result> /book selects the child elements named book /book/author selects the author elements of the toplevel book elements
//book[author/last = Stevens]
all book elements with author = Hunter
//book[@pages]
all book elements that have a pages attribute
FLWOR Expressions
For/Let, Where, Order by, Result Expressions
<html>{ let $d := doc("bib.xml") for $b in $d/bib/book where $b[@year > 1998] order by $b/publisher return <book>{$b/title, $b/price, $b/publisher}</book> }</html>
Projection
let $d := doc(bib.xml") return <result>{$d/bib/book/author}</result>
Project (cont.)
/bib/book/author = =
The same query can also be written as a for loop
Selection
Return the titles of all books published before 1997 /bib/book[@year < "1997"]/title = <title>TCP/IP Illustrated</title> <title>Advanced Programming in the Unix environment</title>
Selection (cont.)
Return the titles of all books published before 1997 /bib/book[@year < "1997"]/title = for $bk in doc(bib.xml)/bib/book where $bk/@year < "1997" return $bk/title = <title>TCP/IP Illustrated</title> <title>Advanced Programming in the Unix environment</title>
Selection (cont.)
Return book with the title Data on the Web /bib/book[title = "Data on the Web"] =
<book year="2000"> <title>Data on the Web</title> <author><last>Abiteboul</last><first>Serge</first></author> <author><last>Buneman</last><first>Peter</first></author> <author><last>Suciu</last><first>Dan</first></author> <publisher>Morgan Kaufmann Publishers</publisher> <price> 39.95</price> </book>
Selection (cont.)
Return the price of the book Data on the Web /bib/book[title = "Data on the Web"]/price = <price> 39.95</price> How would you return the book with a price of $39.95?
Selection (cont.)
for $bk in doc(bib.xml)/bib/book where $bk/price = "39.95" return $bk = let $d:= doc("bib.xml") return $d/bib/book[price = "39.95"]
=
<book year="2000"> <title>Data on the Web</title> <author><last>Abiteboul</last><first>Serge</first></author> <author><last>Buneman</last><first>Peter</first></author> <author><last>Suciu</last><first>Dan</first></author> <publisher>Morgan Kaufmann Publishers</publisher> <price> 39.95</price> </book>
Construction
Return year and title of all books published before 1997 for $bk in doc(bib.xml)/bib/book where $bk/@year < "1997" return <book>{ $bk/@year, $bk/title }</book> = <book year="1994"> <title>TCP/IP Illustrated</title> </book> <book year="1992"> <title>Advanced Programming in the Unix environment</title> </book>
Grouping
Return titles for each author
<result>{ let $d:= doc("bib.xml") for $l in distinct-values($d//author/last) return <author name="{$l}"> { $d/bib/book[author/last = $l]/title } </author> }</result>
Grouping Cont.
= <result>
<author name="Stevens"> <title>TCP/IP Illustrated</title> <title>Advanced Programming in the Unix environment</title> </author> <author name="Abiteboul"> <title>Data on the Web</title> </author>
</result>
Join
Return the books that cost more at amazon than fatbrain Let $amazon := doc(http://www.amazon.com/books.xml), Let $fatbrain := doc(http://www.fatbrain.com/books.xml) For $am in $amazon/books/book, $fat in $fatbrain/books/book Where $am/isbn = $fat/isbn and $am/price > $fat/price Return <book>{ $am/title, $am/price, $fat/price }<book>
Functions
Define function reverse ($items) { let $count := count($items) for $i in 0 to $count return $items[$count - $i] } Reverse(1 to 5) Note: (1 to 5) = (1, 2, 3, 4, 5)
Example Query 1
<bib> { for $b in doc("bib.xml")/bib/book where $b/publisher = "Addison-Wesley" and $b[@year > 1991] return <book year="{ $b/@year }"> {$b/title} </book> } </bib> What does this do?
Result Query 1
<bib> <book year="1994"> <title>TCP/IP Illustrated</title> </book> <book year="1992"> <title>Advanced Programming in the Unix environment</title> </book> </bib>
Example Query 2
<results> { for $b in doc("bib.xml")/bib/book, $t in $b/title, $a in $b/author/last return <result> { $t } { $a } </result> } </results>
Result Query 2
<results> <result><title>TCP/IP Illustrated</title> <last>Stevens </last> </result> <result><title>Advanced Programming in the Unix environment</title> <last>Stevens</last> </result> <result><title>Data on the Web</title> <last>Abiteboul</last> </result> <result> <title>Data on the Web</title> <last>Buneman</last> </result> <result><title>Data on the Web</title> <last>Suciu</last> </result> </results>
Example Query 3
<bib> { for $b in doc("bib.xml")//book where $b/publisher = "Addison-Wesley" and $b/@year > "1991" order by $b/title return <book> { $b/@year } { $b/title } </book> } </bib>
Example Result 3
<bib> <book year="1992"> <title>Advanced Programming in the Unix environment</title> </book> <book year="1994"> <title>TCP/IP Illustrated</title> </book> </bib>