Dan Wahlin XML For ASP - NET Developers Kaleidoscope

XML for ASP.
NET Developers
Dan Wahlin
201 West 103rd St., Indianapolis, Indiana, 46290 USA

ASSOCIATE PUBLISHER
XML for ASP.NET Developers Jeff Koch
Copyright © 2002 by Sams ACQUISITIONS EDITOR

All rights reserved. No part of this book shall be reproduced, stored in a Neil Rowe
retrieval system, or transmitted by any means, electronic, mechanical, photo-
DEVELOPMENT EDITOR
copying, recording, or otherwise, without written permission from the pub-
Kevin Howard
lisher. No patent liability is assumed with respect to the use of the information
contained herein. Although every precaution has been taken in the preparation MANAGING EDITOR
of this book, the publisher and author assume no responsibility for errors or Matt Purcell
omissions. Nor is any liability assumed for damages resulting from the use of
PROJECT EDITOR
the information contained herein.
Natalie Harris
International Standard Book Number: 0-672-32039-8
COPY EDITOR
Library of Congress Catalog Card Number: 00-105164 Karen Gill
Printed in the United States of America Kim Cofer
Barbara Hacha
First Printing: August 2001
04 03 02 01 4 3 2 1
INDEXER
Cheryl Landes
Trademarks PROOFREADER
All terms mentioned in this book that are known to be trademarks or service Marcia Deboy
marks have been appropriately capitalized. Sams cannot attest to the accuracy
of this information. Use of a term in this book should not be regarded as TECHNICAL EDITOR
affecting the validity of any trademark or service mark. Lamar Kirby
Mingyong Yang
Warning and Disclaimer TEAM COORDINATOR
Every effort has been made to make this book as complete and as accurate as Vicki Harding
possible, but no warranty or fitness is implied. The information provided is on
an “as is” basis. The author and the publisher shall have neither liability nor MEDIA DEVELOPER
responsibility to any person or entity with respect to any loss or damages aris- Dan Scherf
ing from the information contained in this book.
INTERIOR DESIGNER
Dan Armstrong
COVER DESIGNER
Aren Howell
PAGE LAYOUT
Susan Geiselman
Contents at a Glance
Introduction 1
1 XML and ASP.NET Development 3
2 XML for ASP.NET Basics 13
3 XPath, XPointer, and XLink 51
4 Understanding DTDs and XML Schemas 89
5 Using the XmlTextReader and XmlTextWriter Classes in ASP.NET 133
6 Programming the Document Object Model (DOM) with ASP.NET 175
7 Transforming XML with XSLT and ASP.NET 233
8 Leveraging ADO.NET’s XML Features Using ASP.NET 293
9 SQL Server 2000, XML, and ASP.NET 349
10 Working with ASP.NET, XML, SOAP, and Web Services 399
Index 451
Table of Contents
Introduction 1
1 XML and ASP.NET Development 3

XML’s Ancestor—SGML ......................................................................4
Why Do We Need XML? ......................................................................5
X Is for eXtensible ..................................................................................7
Differences Between XML and HTML ................................................8
Presentation Versus Description ........................................................8
Closing Tags ......................................................................................9
Element Nesting ..............................................................................10
Quoted Attributes ............................................................................10
Case Sensitivity ..............................................................................10
MSXML3 Versus the .NET Platform’s System.Xml Assembly ..........10
How Can XML Be Used by an ASP.NET
Developer? ........................................................................................11
Summary ..............................................................................................12
2 XML for ASP.NET Basics 13
What’s in an XML Document? ............................................................14
Well-Formed XML Documents ............................................................16
The Root Element ............................................................................17
<br> Versus <br/> ............................................................................18
Nesting No-No’s ..............................................................................19
Other Rules ......................................................................................20
Valid XML Documents ........................................................................20
The XML Declaration ..........................................................................21
The xml Keyword ............................................................................22
The XML Document Version ..........................................................22
The Encoding Type ........................................................................22
The standalone Keyword ................................................................23
XML Elements ......................................................................................24
XML Attributes ....................................................................................25
XML Namespaces ................................................................................27
Namespace Structure ......................................................................29
Default Namespaces ........................................................................29
Qualified Namespaces ....................................................................31
XML Processing Instructions ..........................................................34
XML Comments ..............................................................................34
XML Entities ..................................................................................35
Standard Entities ..............................................................................36
Using ASP.NET Objects to Generate XML ..........................................44
Character Entities ............................................................................37
Internal, External, and Parameter Entities ......................................38
CDATA Sections ..................................................................................40
Dealing with Whitespace ......................................................................41
The Relationship of XML to XHTML ................................................42
vi
XML FOR ASP.NET DEVELOPERS
Using ASP.NET Objects to Generate XML ..........................................44

Application Specifics ......................................................................45
Application Summary ......................................................................50
Summary ..............................................................................................50
3 XPath, XPointer, and XLink 51
Meet a Few of XML’s Relatives ..........................................................52
XPath—The SQL of XML ..................................................................53
XPath Basics ....................................................................................54
XPath Functions ..............................................................................60
XPath Abbreviation Examples ........................................................66
XPointer—Accessing XML Document Fragments ..............................69
XPointer Basics ..............................................................................70
XPointer Examples ..........................................................................75
XLink—Resource Relationship Management ......................................76
XLink Basics ..................................................................................77
XLink Keyword Definitions ............................................................79
XLink Attributes ..............................................................................80
The XLink Simple Link ........................................................................82
XLink Extended Links ..........................................................................83
Linkbases and External Linksets ....................................................86
Putting XLink Together ........................................................................87
Summary ..............................................................................................88
4 Understanding DTDs and XML Schemas 89
Why Use DTDs or Schemas? ..............................................................90
Do I Really Need to Validate? ..............................................................91
DTD Basics ..........................................................................................93
The DTD DOCTYPE ......................................................................94
DTD Elements ................................................................................95
DTD Attributes ................................................................................97
DTD Entities ....................................................................................99
DTD Notations ..............................................................................101
Summing Up DTDs ......................................................................101
XML Schemas—Looks a Lot Like XML! ........................................101
A Sample XML-DR Schema and Its DTD Counterpart ..............102
Using the Schema Keyword with Namespaces in XML-
DR Schemas ................................................................................105
XML-DR Elements, Groups and Attributes ..................................106
XML Data Types ..........................................................................111
XML-DR Description Element ....................................................113
XML-DR Summary ......................................................................113
The W3C XML Schema ....................................................................114
W3C Schma Elements and Attributes ............................................116
Data Type Definitions ....................................................................116
Creating Unique Fields, Keys, and Relationships..........................124
Namespace Support in XML Schemas ........................................128
Referencing XML Schemas from Within XML Documents ........129
XML Schema Summary ................................................................130
Summary ............................................................................................130
vii
CONTENTS
5 Using the XmlTextReader and XmlTextWriter Classes in

ASP.NET 133
Introducing the System.Xml Assembly ..............................................134
In Memory Versus Forward-Only Parsing ..........................................135
Pull Versus Push Models......................................................................137
Using the XmlTextReader Class to parse XML ..................................138
Building a SAX-Style Push Model Using the XmlTextReader ..........145
Step 1: Referencing Assemblies ....................................................152
Step 2: Setting the Handlers ..........................................................153
Step 3: Declaring the XmlTextReader Class ................................153
Step 4: Instantiating the XmlTextReader Class ............................153
Step 5: Reading from the Stream ..................................................154
Step 6: Checking for Element Nodes ............................................154
Step 7: Checking for End Element Nodes ....................................156
Step 8: Reading Text Nodes ..........................................................156
Step 9: Handling Processing Instructions, Whitespace,
and Entities ..................................................................................157
Step 10: Ending the Parsing Process and Catching Errors ..........157
Step 11: Calling the SAX Parser from an ASP.NET Page ............158
Validating XML Documents Using the XmlTextReader and
XMLValidatingReader ......................................................................159
Instantiating the XmlValidatingReader ........................................160
Setting the ValidationType ............................................................160
Using the XmlSchemaCollection Class ........................................161
Attaching Event Handlers ............................................................161
Creating a Generic Validation Class ..............................................162
Passing Authentication Credentials with the XmlTextReader
Class ..................................................................................................166
Using the XMmlTextWriter Class to Create XML Documents ..........166
Converting EDI or Legacy Data to XML ......................................169
Summary ............................................................................................173
6 Programming the Document Object Model (DOM) with
ASP.NET 175
Welcome to the DOM ........................................................................176
In-Memory Versus Forward-Only Parsing ..........................................178
Working with MSXML3 via Interop ..................................................179
DOM Classes in the System.xml Namespace and Assembly ............182
The XmlNode Class ............................................................................183
The XmlDocument Class ....................................................................189
XmlDocument Object Properties and Methods ............................189
Loading an XML Document using the XmlDocument Class........194
Creating Nodes Using the XmlDocument Class ..........................196
The XmlNodeList Class ....................................................................200
The XmlNamedNodeMap Class ........................................................201
Selecting Nodes Within the DOM Using Xpath ................................204
Putting it All Together ........................................................................205
The XmlNodeReader Class ................................................................209
XMLHTTPRequest Object ................................................................211
viii
XML FOR ASP.NET DEVELOPERS
Sample Applications-Client Server-Side Hierarchical XML

Menus ................................................................................................219
Chapter 2 Sample Application Revisited ............................................225
Summary ............................................................................................232
7 Transforming XML with XSLT and ASP.NET 233
What Is XSLT? ..................................................................................234
The Transformation Process ..............................................................235
XSLT Templates ............................................................................236
Getting Your Feet Wet with XSLT ......................................................237
The XSLT Language ..........................................................................242
The XSLT Document Root Element ............................................242
XSLT Elements ............................................................................243
Transforming XML into Another Form of XML Using
XSLT Elements............................................................................258
XSLT Functions ..................................................................................267
The XPathDocument Class ..........................................................273
.NET Classes Involved in Transforming XML....................................272
The XslTransform Class ................................................................275
The XsltArgumentList Class ........................................................276
Putting it All Together ....................................................................278
Using Extension Objects with XSLT ............................................282
Creating a Reusable XSLT Class ........................................................287
The Asp:Xml Web Control ............................................................289
Summary ............................................................................................291
8 Leveraging ADO.NET’s XML Features Using ASP.NET 293
Introducing ADO.NET ......................................................................294
Comparing Classic ADO with ADO.NET ..........................................294
XML Integration ............................................................................294
What Happened to the RecordSet Object? ....................................294
Disconnected Versus Connected ....................................................295
ADO.NET Basics ..............................................................................295
ADO.NET Managed Providers ....................................................296
The Command Class ....................................................................298
The SqlDataAdapter and OleDbDataAdapter Classes ..................306
The DataSet Class ..............................................................................308
Viewing DataSets as XML ............................................................311
Loading DataSets with XML ........................................................318
Saving DataSets as XML ..............................................................320
Working with the DataSet and XmlDataDocument Classes ..............324
XmlDataDocument Properties and Methods ................................328
Shaping DataSet Columns with the MappingType
Enumeration ................................................................................338
Mapping XSD Schemas to a DataSet ..........................................341
Creating DataSet Mappings Using XML ......................................343
Using DataSets to Work with Hierarchical XML Data
and XSLT ....................................................................................345
Summary ............................................................................................348
ix
CONTENTS
9 SQL Server 2000, XML, and ASP.NET 349

XML Features in SQL Server 2000 ..................................................350
Querying SQL Server 2000 Using HTTP ..........................................351
Configuring SQL Server Virtual Directories in IIS........................351
Querying SQL Server 2000 Through HTTP Using the
FOR XML Keywords ................................................................353
Using HTTP Queries to Return Elements and Schemas ..............362
Querying SQL Server 2000 Through HTTP with XML
Templates ....................................................................................363
Querying SQL Server 2000 through HTTP Using Templates,
XPath, and XDR Schemas ................................................................368
SQL Server 2000 Schemas and Annotations ................................369
Using XPath Queries and Schemas in a URL ..............................374
Using XPath Queries, Schemas, and Templates ..........................375
Using EXPLICIT Mode Queries ........................................................377
Using Directives in EXPLICIT Mode Queries ............................381
Using OPENXML to Manipulate X ....................................................385
XML Updategrams—Update, Inserts, and Delete Database
Records with XML Using ASP.NET ................................................389
Using ADO.NET with SQL Server 2000 ..........................................394
The XmlParserContext Class ........................................................396
Summary ............................................................................................398
10 Working with ASP.NET, XML, SOAP, and Web Services 399
Understanding SOAP ..........................................................................400
Alternatives to SOAP ....................................................................400
What Is SOAP? ............................................................................402
Analyzing SOAP’s Structure ........................................................404
The SOAP Envelope ......................................................................407
The SOAP Header ........................................................................408
The SOAP Body ............................................................................410
SOAP Encoding and Data Types ..................................................414
SOAP HTTP Headers ....................................................................419
Understanding Web Services ..............................................................421
Web Service Protocols (SOAP, HTTP-GET, HTTP-POST) ........422
Web Service Architecture ..............................................................424
Introducing Web Service Attributes ............................................425
Consuming a Web Service from an ASP.NET File ......................439
Retrieving Customer Orders Through a Web Service ........................444
Web Service Description ..............................................................444
Creating the ACME Distribution, Inc. Web Service ....................445
Consuming the ACME Distribution, Inc. Web Service ................447
Summary ............................................................................................450
Index 451
About the Author
Dan Wahlin is an independent consultant for Wahlin Consulting and also founded the XML for
ASP.NET website (http://www.XMLforASP.NET) which focuses on using XML and Web
Services in Microsoft’s .NET platform. He also works as a corporate trainer for Global
Knowledge. Previously, Dan worked as an enterprise Web site manager, director of
Internet/wireless development, and as senior consultant for a global systems integrator. Dan
enjoys writing technical articles on XML-related topics for Visual Studio Magazine,
ASPToday.com, and is a regular columnist in XML Magazine. Dan co-authored Professional
Windows DNA (WROX Press) and ASP.NET Tips and Tricks (Sams). You can contact Dan at
[email protected].
Dedication
To my wife Heedy, our two boys (my best buddies) Danny and Jeffery, and my parents.
Acknowledgments
I would like to thank my wife and children for their extreme patience as I spent evening after
evening and weekend after weekend working on this book. Heedy, I know you’re happy I
finally finished this book! I’d also like to thank my mom and dad (Elaine and Danny) for
bringing me up in such a pleasant environment and pushing me to succeed in life. I couldn’t
have asked for better parents. To Paul Allsing and Lin Thatcher, thanks for giving me my first
shot at working in the technology industry. Had it not been for your confidence in me, I
wouldn’t have had the opportunity to write this book.
I’d also like to thank Chris Lovett and Mark Fussell of Microsoft for answering so many ques-
tions for me over the past year. Without their help and samples, the book would not have
turned out as well as it did. Finally, I’d like to thank Neil Rowe of Sams for being such a fun
editor to work with. His patience and understanding when code had to be rewritten from build
to build made the process much more enjoyable.
Tell Us What You Think!
As the reader of this book, you are our most important critic and commentator. We value your
opinion and want to know what we’re doing right, what we could do better, what areas you’d
like to see us publish in, and any other words of wisdom you’re willing to pass our way.
As an Associate Publisher for Sams, I welcome your comments. You can fax, e-mail, or write
me directly to let me know what you did or didn’t like about this book—as well as what we
can do to make our books stronger.
Please note that I cannot help you with technical problems related to the topic of this book,
and that because of the high volume of mail I receive, I might not be able to reply to every
message.
When you write, please be sure to include this book’s title and author as well as your name and
phone or fax number. I will carefully review your comments and share them with the author
and editors who worked on the book.
Fax: 317-581-4770
Email: [email protected]
Mail: Jeff Koch

Sams Publishing
201 West 103rd Street
Indianapolis, IN 46290 USA
Introduction
Product vendors, CEOs, journalists, and technology evangelists alike have proclaimed that
XML is the next big programming paradigm shift that will revolutionize computing. Although
some of this hype is certainly not justified, much of it has been realized by early prototype
applications developed to simplify document management, data exchange, and application
integration issues. These applications have shown that XML can play a powerful role in mak-
ing applications more flexible, maintainable, and accessible in today’s distributed world.
Microsoft has been supportive of XML for many years now and has taken advantage of its
benefits in different applications, including BizTalk Server, Internet Explorer, and SQL Server
2000 (to name a few). With Microsoft’s introduction of the .NET platform, a new revolution in
application programming has started that features XML as the focal point. From Web Services
to XML document parsing and creation, the .NET platform supports a wide range of XML fea-
tures that ease application integration and allow programmers to write more efficient, distrib-
uted, and maintainable applications.
This book focuses on many aspects of XML and shows how it can be used by ASP.NET pro-
grammers to increase application efficiency and reach. This includes discussing the rules asso-
ciated with the XML Meta-Data Language, Document Type Definitions (DTDs) and XML
schemas, the XPath language, and upcoming technologies to watch for—including XPointer
and XLink.
After the basics are covered, the book delves into the different ways that XML documents can
be created, parsed, transformed, and transmitted to other systems using ASP.NET. This
includes an in-depth look at the following topics:
• Using the XmlTextReader’s Pull Model to parse XML
• Creating XML with the XmlTextWriter
• Validating XML documents against schemas and DTDs with the XmlValidatingReader
and XmlTextReader
• Working with the Document Object Model
• Transforming XML with XSLT
• XML support in ADO.NET
• Integrating SQL Server 2000’s XML features into ASP.NET applications
• Understanding the Simple Object Access Protocol (SOAP)
• Creating Web services
2
XML for ASP.NET Developers
My goal was to make this book a useful reference for ASP.NET developers needing to know
the ins and outs of working with XML in their applications. I also wanted developers to see a
variety of ways that XML could be used in application development. Reading about a particu-
lar concept is not enough; a code sample often has more value than several pages of text. As a
result, you’ll find that each chapter contains a large number of samples to reinforce concepts
discussed throughout the chapter. Happy XMLing!
XML and ASP.NET CHAPTER
1
Development
IN THIS CHAPTER
• XML’s Ancestor—SGML 4
• Why Do We Need XML? 5
• X Is for eXtensible 7
• Differences Between XML and HTML 8
• MSXML3 Versus the .NET Platform’s

System.Xml Assembly 10
• How Can XML Be Used by an ASP.NET

Developer? 1177
4
Unless you’ve been vacationing for a few years on a secluded island, it would be hard to miss
all the hype surrounding XML. Newspapers, magazines, and Web sites throughout the world
portray XML as the solution to many of our existing technological problems. This global atten-
tion has pushed XML into the limelight and resulted in a barrage of vendor applications claim-
ing XML support. But is XML really as good as everyone seems to think it is? Although it’s
not going to bring world peace or decrease ozone levels, it is certainly a power-packed technol-
ogy that has a lot to offer after you learn how it works and when it can be applied.
You’re probably keenly aware that XML provides a platform-independent way of describing
data that can be used in many applications. We’ll get into plenty of details concerning this a
little later, but before doing that, it’s important to first answer a few questions.
To start with, where did XML come from and why is it needed in today’s technologically ori-
ented world? More importantly, how can you as an ASP.NET developer leverage its power?
This chapter discusses the answers to these questions and compares XML to another language
you’re already familiar with. The chapters that follow will provide you with example after
example of how XML can be used with ASP.NET to add useful functionality to your develop-
ment projects. Before going too much further, let’s go ahead and answer the question of where
XML came from.
XML’s Ancestor—SGML
XML is a relatively new language in the world of markup languages. You’re probably quite
familiar with XML’s cousin language named Hypertext Markup Language, or simply HTML.
Markup languages are typically used to provide metadata about data. In the case of XML and
HTML, this is done through special tags such as the <font> tag in HTML. SQL Server’s sys-
tem tables are another example of metadata. They provide information about the structure of
data within the database itself. XML serves this same purpose except that it describes text-
based data.
Whereas the World Wide Web Consortium’s (W3C) XML 1.0 recommendation has only been
around since February of 1998, XML’s parent language, Standard Generalized Markup
Language (SGML), has been around as a standard since 1986. SGML is a complex language
standard used extensively by many industries, including the technical-publication industry, to
mark up data. It uses a syntax defined by SGML known as a Document Type Definition (DTD)
to specify the structure of a given SGML document.
Although XML is much easier to work with, many of its features and rules were derived from
the SGML language. When the W3C first began formulating what is now the XML 1.0 recom-
mendation back in 1996, they borrowed many concepts from SGML, including the DTD. To
make XML easier to work with and more Web-oriented, many of SGML’s complexities were
XML and ASP.NET Development
5
CHAPTER 1
left out. If you’re interested in reading a comparison of XML and SGML, take a look at 1
http://www.w3.org/TR/NOTE-sgml-xml-971215.
DEVELOPMENT
Now that you have some background on where XML came from, let’s take a look at why we
XML AND
ASP.NET
need it.
Why Do We Need XML?

With all the different programming languages available today, you may wonder why we need
another one just to mark up data. Although many languages allow for programming different
interfaces and interacting with remote computers, few provide a platform-neutral way to share
data between disparate systems. Fewer yet allow this exchange to pass through firewalls or
other security-related mechanisms or provide a format that is easy to work with for individuals
with varying levels of technical skill.
That’s where XML comes into the picture. The W3C XML working group knew that a simple
language needed to be created that would be straightforward to use, make exchange of data
between different systems more efficient, and work well in a Web environment. Because
SGML was already successful in marking up data, many pieces of it were used to develop the
XML subset, as mentioned earlier. This resulted in a language that can be both highly flexible
and strictly structured at the same time, depending on the needs of the individual or system
working with it.
To get a good feel for one of the reasons why XML is needed, take a look at the following
comma-delimited file:
Elbow Joint,12930430,6,25,06/28/2000,1238 Van Buren,B2B
➥Supply,1111236894,Walters
Valve,39405938,3,40,06/20/2000,4568 Arizona Ave.,A+ Supply,2221236894,Tammy
PVC,234954048,6,20,06/14/2000,49032 S. 51,A+ Supply,2221236894,Walters
Although some assumptions can be made about what some of the data elements are within the
file, many of the elements provide no way for you to know what they represent. For example,
does the field 06/28/2000 shown in the first line represent a date? If so, what type of date?
How about the 234954048 field in the third line? Is it a part number, a customer number, or
something totally unrelated? Listing 1.1 shows this same file converted into XML.
Listing 1.1 Marking Up a Comma-Delimited File Using XML

1: <?xml version=”1.0”?>
2: <supplies>
3: <item supplier=”1”>
4: <description>Elbow Joint</description>
5: <partID>12930430</partID>
6
Listing 1.1 continued

6: <numberInStock>6</numberInStock>
7: <numberOnOrder>25</numberOnOrder>
8: <deliveryDate>06/28/2000</deliveryDate>
9: <supplier>
10: <street>1238 Van Buren</street>
11: <company>B2B Supply</company>
12: <phone>1111236894</phone>
13: </supplier>
14: <orderedBy>Walters</orderedBy>
15: </item>
17: <description>Valve</description>
22: <supplier>
23: <street>4568 Arizona Ave.</street>
24: <company>A+ Supply</company>
25: <phone>2221236894</phone>
26: </supplier>
27: <orderedBy>Tammy</orderedBy>
28: </item>
30: <description>PVC</description>
35: <supplier>
36: <street>49032 S. 51</street>
37: <company>A+ Supply</company>
38: <phone>2221236894</phone>
39: </supplier>
40: <orderedBy>Walters</orderedBy>
41: </item>
42: </supplies>
Each data item can easily be recognized and understood because all the data has been
described using XML markup tags. Using these tags can make it easier for humans and
computers to process data.
Although it’s true that computers normally take care of automating many data-intensive appli-
cations, programs based on files such as the flat file shown earlier have little flexibility to
7
CHAPTER 1
changes in the data’s structure. On the other hand, XML enables applications to access data 1
based on descriptive tags rather than by position. This presents a great opportunity to increase
DEVELOPMENT
an application’s flexibility toward changes in data structure.
XML AND
ASP.NET
XML also presents a great opportunity to be able to share data between applications and even
components within an application. By knowing an XML file’s structure in advance, a compo-
nent within an application can work with data contained in the XML file and perform different
tasks. In some cases, the data may represent information about a customer or business sce-
nario. In other cases, the XML file may simply represent object property values that an
ASP.NET component can use to make programming simpler and more efficient.
XML also benefits from the capability to determine in advance what structure the file must fol-
low through incorporating DTD and schema files. Defining this structure allows for companies
with differing systems to exchange data without worrying about what system the data was
originally stored in. As long as both companies know what the XML document’s structure will
be, they can exchange data using XML syntax. Applications based on XML, such as BizTalk
Server or SQL Server 2000, help to make this process even easier, and when combined with
the power of ASP.NET, very advanced applications can be created.
Finally, as previously mentioned, XML does not require advanced degrees in computer science
to use it. Looking at the XML code in Listing 1.1, you can see that no intimidating characters
or syntax are being used. In fact, the XML file could be read and understood by an individual
with no technical experience at all. After the syntax, structural, and programmatic rules are
learned, XML documents can be created and used both in simple and complex applications.
X Is for eXtensible
One important benefit associated with XML that hasn’t been mentioned yet is its capability to
be extended. XML’s extensibility allows you as a developer to choose how tags within XML
documents are named and structured. In HTML, you’re limited to using specific tags that work
only when used in a predefined way. For instance, to place an image in an HTML document,
you must use the <img> tag. Within this tag, you can place only certain attributes, such as the
src and vspace attributes. XML allows for tags and attributes to be created at will, depending
on how the application being designed needs to be structured. Therefore, if you’d rather
describe a path to an image using a tag named <imagePath>, you’re free to do so (as long as
you also include a closing </imagePath> tag!).
At first glance, this may appear to be a step toward total anarchy. After all, if everyone can
create XML documents using tags that don’t follow any specific naming convention, how can
anyone else possibly use the documents? The answer is that XML’s extensibility is held in
check through the use of DTD or schema documents that define what structure a given XML
document must follow to be valid. Therefore, if you want to create a document using custom
8
tags with names such as <myTag> or <myData> to describe data, you can. Anyone wanting to
use your XML document can do so if you provide them with a definition of how the document
is structured and what tags are used to describe the data.
The key to understanding the value of XML’s extensible framework is realizing that XML
doesn’t specify how to present data to an end user or system. XML as a markup language is
concerned only with describing data that can be manipulated and presented using other lan-
guages. To make this more clear, let’s take a look at some of the differences between XML
and HTML.
Differences Between XML and HTML

Because XML and HTML are both subsets of the SGML language, they resemble each other
in several ways. However, their overall purpose differs. Many of the concepts that follow will
be discussed more in depth in Chapter 2, “XML for ASP.NET Basics.”
Presentation Versus Description

HTML was first designed to provide a way to present text-based information to a user located
at a remote location. If you’ve worked with the Web very long, you probably remember the
days when Web sites contained the standard gray background with black text. If you’re really
“old” as far as Web years go, you may even remember using a Lynx browser. As the Web grew
in popularity and became viewed as a novel way to present information to people around the
world, alternative ways of presenting the data using graphics, tables, and other elements were
incorporated into the language. These additions allowed for backgrounds and a host of other
presentation capabilities. You probably remember the days of bright or wild-looking back-
grounds with blinking text that was virtually impossible to read.
What does all this have to with XML? Very little, actually, but there’s a reason for mentioning
it. The point is that HTML is a presentation markup language that is useful only when com-
bined with an application such as a browser. It has many predefined tags and attributes that
can change font colors, make text larger, or place those lovely pink cloud backgrounds onto a
page. However, HTML does nothing to describe the data that is presented. A page containing
the following HTML code presents no clues about the data it contains:
<font color=”#02027a” size=”3”><b>274943494</b></font>
What does 274943494 represent in the code? Is it a bank account number, a part number, a per-
sonal identification number, or none of the above? We have no way of knowing by looking at
the HTML document because HTML only displays information, it doesn’t describe it.
9
CHAPTER 1
By comparison, XML says nothing about how to display data. Its sole purpose is to describe 1
the data using tags, attributes, and other items. The data found within the preceding HTML
DEVELOPMENT
code snippet could be written the following way in an XML document:
XML AND
ASP.NET
<population>274943494</population>
Now it’s easy to see exactly what the data represents—data associated with a population. The
data can further be described using other tags:
<unitedStates>
<population>274943494</population>
</unitedStates>
Now we know that the data is a population number associated with the United States.
Closing Tags
HTML provides a lot of freedom in leaving tags open and even has some tags that never have
to be closed. For example, in HTML, some tags can be left open without causing problems in
the browser. Here’s an example of a few of them:
• <img>
• <hr>
• <p>
• <br>
• <meta>
• <li>
• <option>
Looking through these tags, you’ll notice that HTML doesn’t require a matching end tag
for them. For example, placing an image into a page by using code such as <img src=
”file.gif”></img> isn’t correct in the HTML language. Other tags that normally require
closing tags can sometimes get by without the closing the tag, too. In newer browsers, a
<body> tag listed without a closing </body> tag may not even pose a problem to a Web page.
Both Microsoft and Netscape’s browsers have moved toward being more forgiving of improper
HTML coding.
XML is not nearly as forgiving. Although we’ll talk more about this in Chapter 2, all XML
tags must have a matching closing tag (or use a short-cut syntax that you’ll see later).
Forgetting to close a tag will result in an error message being received when the document is
parsed. This rule benefits the language because XML documents that are exchanged will all
10
follow the same structure, making it easier for applications to use the data in a reliable and
consistent manner.
Element Nesting
Although it’s not a recommended practice, HTML tags can be improperly nested and still
function properly. The following HTML code illustrates this:
<font><b>Text</font></b>
XML strictly prohibits improper nesting of tags. If a given tag starts before another tag in an
XML document, the tag that is declared first must end after the child tag, as shown next:
<street><name>Oak</name></street>
Quoted Attributes
HTML is very flexible in how attributes located within tags can be written. A browser will
treat the following two code segments the same:
<table border=”0” bgcolor=”#000000”>
<table border=0 bgcolor=#000000>
XML requires that all attributes be quoted:

<street name=”Oak” number=”7864” type=”Avenue”/>
Failure to follow this rule will result in an error generated by the XML parser.
Case Sensitivity
XML also differs from HTML in how strictly case sensitivity is enforced. Although the
following code is acceptable in HTML:
<table></Table>
it is not acceptable in XML because the first tag name is declared with a small t and the
second with a capital T. XML requires that a tag’s starting case match its ending tag’s case.
MSXML3 Versus the .NET Platform’s System.Xml

Assembly
The XML parser is responsible for going through an XML document and making sure it is
well formed. When the XML document has an associated DTD or schema, the parser also
makes sure that the document is valid. Both concepts are covered in the next chapter.
11
CHAPTER 1
Microsoft’s MSXML3 parser performs all the preceding checks and much more. However, it 1
was designed to work with “classic” ASP rather than ASP.NET. Although you can make calls
DEVELOPMENT
to MSXML3 from C# or VB.NET via Interop (discussed in more detail in Chapter 6,
XML AND
ASP.NET
“Programming the Document Object Model (DOM) with ASP.NET”), it is not recommended
that you use MSXML3 if you want to fully leverage all the XML features found in the .NET
platform.
For ASP.NET, Microsoft has built a new parser from the ground up that handles just about any
XML task imaginable. It is found within a .NET assembly named System.Xml. The chapters
that follow cover topics related to the System.Xml assembly and others involving well-formed
documents, valid documents, accessing documents using the Document Object Model (DOM)
or an XmlReader, and transforming documents using Extensible Stylesheet Language
Transformations (XSLT). Many examples and details about using the System.Xml assembly
are presented throughout the book.
How Can XML Be Used by an ASP.NET

Developer?
The architects of .NET knew that XML integration was crucial to its success. As a result, XML
is integrated directly into the platform. This integration is comprehensive and includes every-
thing from working with data found in different data stores to serializing objects for use on
remote systems.
ASP.NET and its related .NET technologies offer a great opportunity to separate an applica-
tion’s form from its function. No longer are you constrained to intermingling code snippets
used to call objects among basic HTML code tags. When combined with XML, ASP.NET
allows for the creation of applications that are more flexible, more efficient, and more manage-
able because the presentation code and business logic/events of applications can be separated.
This flexibility also applies to working with data received from any number of data sources.
Although you’ve always been able to work with the DOM through using the MSXML3 parser,
converting data returned from a database into XML hasn’t always been intuitive or easy—
especially if a custom XML structure is desired. As you’ll see in Chapter 8, “Leveraging
ADO.NET’s XML Features Using ASP.NET,” XML is the basis for a new type of data object
called the DataSet and is used ubiquitously throughout the .NET platform. For an ASP.NET
developer, this provides an unprecedented level of power to exchange data not only between
applications, but also between different objects that may be separated by firewalls and large
physical distances.
As browser and wireless device support for XML continues to increase, Web-based application
development using XML and ASP.NET will provide easier and less code-intensive ways of
12
presenting data to end users. Transformations between XML and HTML via XSLT will be
moved from the server to the client. Pages targeted for different devices can be served more
effectively using an XML-based repository that can be transformed to fit the appropriate device
using ASP.NET and XSLT. Doing this will provide the client with more speed because sorts,
filters, and other functions can be performed without the need for a round-trip back to the
server.
The rise of Web services will allow XML-based data published by companies around the world
to be consumed and placed within your ASP.NET applications. Credit cards can be processed
and user IDs validated by passing XML fragments to and from remote servers. As an ASP.NET
developer, you can leverage XML in many ways to provide needed services. Many of these are
discussed in the chapters that follow.
Summary
The XML language provides an extensible and powerful standard for describing data that can
be used within and shared between applications. Although it looks a lot like HTML, the two
languages have many differences. HTML serves the purpose of presenting data through an
application such as a browser, whereas XML describes data using markup tags, but does not
say how it should be presented.
The next chapter takes a more detailed look at XML’s rules and structure. Understanding the
basics of XML is necessary to create efficient ASP.NET/XML applications.
XML for ASP.NET Basics CHAPTER
2
IN THIS CHAPTER
• What’s in an XML Document? 14
• Well-Formed XML Documents 16
• Valid XML Documents 20
• The XML Declaration 21
• XML Elements 24
• XML Attributes 25
• XML Namespaces 27
• XML Processing Instructions 34
• XML Comments 34
• XML Entities 35
• CDATA Sections 40
• Dealing with Whitespace 41
• The Relationship of XML to XHTML 42
• Using ASP.NET Objects to Generate XML 44
14
Now that you’re familiar with how XML came into existence and some of the ways it can be
used, it’s time to learn how XML documents are created. You’ll be happy to know that many of
the concepts employed in HTML and ASP.NET programming are applicable to XML docu-
ment creation as well. Much like HTML, XML documents use various elements and attributes
that provide the document with structure. However, elements and attributes aren’t used in
marking up the data for presentation in a browser. They’re used to describe the data and per-
form other functions, many of which are detailed in later chapters.
It’s important to note that XML by itself doesn’t provide much functionality other than the
capability to describe data. XML is only one language in a family of other languages and tech-
nologies. It’s only when XML is combined with other languages and technologies (such as the
.NET platform) that its power is realized in building applications.
Before you learn about the other supporting technologies that work together to empower XML,
you should take a good look at how XML is created and what rules are involved in this cre-
ative process. If you feel that you’re already an expert in creating XML documents, feel free to
skip ahead to the next chapter.
The list of topics shown at the beginning of the chapter provides an overview of what you can
expect to learn. Although this list may seem somewhat lengthy, rest assured that you will be
provided with plenty of samples and analogies to help you gain an in-depth understanding of
each topic. By the end of the chapter, you should be able to create XML documents by hand or
dynamically using intrinsic ASP.NET objects. Having said that, let’s get started!
What’s in an XML Document?

“XML” has become one of the biggest buzzwords used in the technology industry today.
Everywhere you look, vendors both large and small are integrating XML into their applica-
tions, with .NET being no exception. Database products are making it easier to retrieve and
input XML data, and many e-commerce packages come with XML capabilities already built
in. Employers are scrambling to find developers with XML experience to work with the variety
of product offerings on the market.
Because of the international scope of the XML “buzz,” you might expect XML to be a com-
plex language requiring years and years of programming experience to work with and program.
Fortunately, this is not the case, as you saw in Chapter 1, “XML and ASP.NET Development.”
The creators of the XML specification knew that to have a platform-independent standard that
was useful to individuals and businesses throughout the world, they would have to throw out
many of the complexities associated with other languages.
To see just how simple an XML document can be, take a look at Listing 2.1. You’ll notice right
away that the code is understandable by normal human beings like you and me because it is
XML for ASP.NET Basics
15
CHAPTER 2
simply a combination of different XML elements and data. Although some of the elements
contained in the document may be confusing to you (until you read this chapter, anyway), it’s
easy to see that the code contains no cryptic characters (although it could), adheres to a partic-
ular layout standard, is regular ASCII text, and looks a lot like the HTML markup language
that you’ve likely used before.
LISTING 2.1 A Simple XML Document

1: <?xml version=”1.0” encoding=”UTF-8” standalone=”yes”?>
2: <root>
3: 
2
ASP.NET BASICS
4: <menuItem itemNumber=”1”>
5: <hyperLink>/default/default.aspx</hyperLink>
XML FOR
6: <name>Home</name>
7: </menuItem>
9: <hyperLink/>
10: <name>About Us</name>
11: <menuItem itemNumber=”2a”>
12: <hyperLink>/calendar/default.aspx</hyperLink>
13: <name>Events Calendar</name>
14: </menuItem>
15: <menuItem itemNumber=”2b”>
16: <hyperLink>/news/default.aspx</hyperLink>
17: <name>Press Releases</name>
18: </menuItem>
19: <menuItem itemNumber=”2c”>
20: <hyperLink>/contact/default.aspx</hyperLink>
21: <name>Contact Numbers</name>
22: </menuItem>
23: </menuItem>
24: 
25: </root>
The XML document in Listing 2.1 is a very simple representation of a hierarchical menu struc-
ture that could exist on a company Web site. This simple menu (similar to the Start menu in
Windows) contains two parent menu items named Home and About Us. The About Us menu
item contains three children menu items (Events Calendar, Press Releases, and Contact
Numbers). You’ll see how this XML can be used in an ASP.NET application in Chapter 6,
“Programming the Document Object Model (DOM) with ASP.NET.”
Although the actual data contained in the document, such as Contact Numbers (line 21) or
Events Calendar (line 13) may not mean much on their own, by looking at the document you
16
can see that the Contact Numbers data represents the name of a menu item. You’ll also notice
that data entries such as /contact/default.aspx are easily identified as being hyperlinks,
even to people not familiar with what a hyperlink may look like. Code such as
<hyperLink>/contact/default.aspx</hyperlink>
is a dead giveaway that the data within the tags must be related to a hyperlink in some manner.
Although the <hyperLink> tag could have been replaced with an <a> tag, as seen in HTML,
the data is described much better by using the word “hyperLink” don’t you think? The capabil-
ity to describe data within an XML document, which was discussed in Chapter 1, is referred to
as being “self-describing.”
The XML menu structure shown in Listing 2.1 also gives us insight into what an XML docu-
ment actually contains. Other than the data, an XML document contains a range of other items
including elements, attributes, processing instructions, comments, and text to name a few.
These items and others are discussed in the sections that follow. Before jumping into the differ-
ent items found in an XML document, however, let’s take a look at some basic XML rules.
Well-Formed XML Documents

As HTML has evolved over the years, browsers in general have become more and more lenient
in how they handle incorrect coding. The following code isn’t well-formed HTML because the
closing </html> tag is missing:
<html>
<body>
<p>Hello World</p>
</body>
Viewing this code in Internet Explorer 4 or 5 is not a problem, because simple coding mistakes
such as this non–well-formed document are forgiven.
As XML was developed, it was realized that well-formed documents were essential for XML
to be used consistently between applications. Imagine sending a document that was not well
formed to a friend who had an XML parser that blew up when documents were not well
formed! Your friend wouldn’t be able to open the document and use it in the manner that you
had intended.
Wait, you say. . .this type of thing happens all the time with HTML or Dynamic HTML
(DHTML) in different browsers today, so what’s the big deal? Because XML documents must
work consistently between parsers and applications, the creators of XML required strict con-
formance to a few basic XML principles. These principles include the following:
• An XML document may contain only one root element.
• All XML elements must have a closing tag.
17
CHAPTER 2
• All attribute values must be quoted.

• Although XML elements may contain child elements, all elements must be nested
correctly.
• XML elements are case sensitive.
• An XML document with an accompanying DTD or XML schema must contain items
specified within the DTD or schema to be considered valid.
If you’ve ever spent several hours (or more) attempting to make a particular DHTML applica-
tion work correctly in Netscape and Internet Explorer, you know that having a strictly con-
forming language is a very good thing. Let’s cover some of the ground rules in more detail.
2
ASP.NET BASICS
XML FOR
The Root Element
The XML document shown in Listing 2.1 makes it easy to visually identify the root element of
the document because it’s actually named “root.” It could have just as easily been named
“startingElement” if desired, because the actual name doesn’t matter. For XML parsers to have
a reference point, an XML document must contain only one root element per document.
Although the following code is acceptable:
<?xml version=”1.0”?>
<root>
<p>Hello World</p>
</root>
the next section of code is not acceptable, because it doesn’t have a single root element:
<root>
<p>Hello World</p>
</root>
<root>
<p>Hello World2</p>
</root>
The incorrect code could be remedied by adding in a root element as shown:

<mainRoot>
<root>
<p>Hello World</p>
</root>
<root>
<p>Hello World2</p>
</root>
</mainRoot>
18
If you think of XML documents in terms of a tree, the root element is the root of the tree, and
all other child elements are the branches and leaves of the tree.
<br> Versus <br/>

As discussed in Chapter 1, XML looks a lot like HTML at first glance. However, to adhere to
the rules of creating XML documents, a few things that you’re used to doing in HTML won’t
fly when you’re creating XML documents.
In HTML we are all used to using tags such as <br>, <img>, <hr>, and <p> without ever worry-
ing about closing them. In HTML it is perfectly acceptable from the browser’s point of view to
code the following:
<p>Hello World
To show the differences between HTML and XML, try running the following XML document
in Internet Explorer 5+ and see what happens. To run it, enter the following code into Notepad
and save the file as test.xml. Then open the file in Internet Explorer 5+ using the File, Open
menu:
<root>
<p>Hello World
</root>
After the document opens, you’ll receive an error stating something like End tag ‘root’
does not match the start tag ‘p’. Line 4, Position 3. This error occurs because the
XML parser can’t find the ending </p> tag. To prove this, run the following code (which closes
the <p> tag) in Internet Explorer 5+ and you’ll see that the document loads fine:
<root>
<p>Hello World</p>
</root>
NOTE
It’s important to note that the <p> tag as shown in the preceding XML document
does not serve the same function as if it were used in an HTML document. Its use in
the XML document is simply to describe the data Hello World. When you open the
text.xml document in Internet Explorer 5, you’ll see that the <p> tag is actually visible
and doesn’t add any type of formatting to Hello World, as would be expected in
HTML. Remember that XML doesn’t specify how to format or display data (without
using other technologies such as XSLT). It only organizes and describes the data. The
<p> tag was used to show how you must close tags in XML, not because it has any
special function or purpose, as it does in HTML.
19
CHAPTER 2
In cases where an XML tag contains no data or has only attributes (which you’ll learn more
about later in this chapter), XML provides us with a handy shortcut:
<menuItem itemNumber=”1”/>
This code is a less verbose way of handling the following code:

<menuItem itemNumber=”1”></menuItem>
Although the shortcut method may seem a little strange at first, after you’re comfortable using
it, you’ll find that it’s great for saving time and space when you need to include an element
that has only attributes (such as itemNumber in the preceding example) or that contains no 2
data.
ASP.NET BASICS
XML FOR
At this point, you may be wondering why this section is titled “<br> Versus <br/>.” Although
the <br> tag doesn’t actually mean “insert a line break” if used as an element within XML,
you’ll see in a later chapter that it can be used with related XML technologies, such as XSLT
or XHTML, to insert a line break. When used in this context, tags such as <br> and <img>
must be closed to avoid errors being raised. Therefore, <br> becomes <br/> and <img
src=”someImage.gif”> becomes <img src=”someImage.gif”/>. This may be a little confus-
ing because you haven’t learned anything about XSLT yet. If you take away nothing more
from this section than a knowledge that all tags must be closed in XML and its related tech-
nologies, then you know all you need to know for now. Future chapters build on this.
Nesting No-No’s
By now you certainly have no doubt that XML tags must be closed for an XML document to
be parsed correctly. Another “gotcha” must be followed as well. XML tags cannot be nested
like HTML tags can.
In HTML it’s possible, although not recommended, to code the following:
<font><b>Hello World</font></b>
This type of inappropriate nesting within an XML document is strictly prohibited and will
cause the parser to generate an error. To make this work with XML, you would need to fix the
nesting:
<font><b>Hello World</b></font>
To get away from HTML tags you already know (because <font> does nothing except describe
data in XML), here’s another improper nesting example that follows the same pattern as shown
previously:
<menuItem><name>About Us</menuItem></name>
20
The following is the proper way to code this section, assuming you want to list more than one
element on a line:
<menuItem><name>About Us</name></menuItem>
Because you have yet to see an XML document with more than one element on a line, you
may be wondering if this is okay. The answer is a resounding “Yes!” The documents you’ve
seen thus far have been indented and spaced simply to make them easier to read. They do not
have to be created this way to work, but good luck trying to read them in the future without
spacing and indentation!
Other Rules
By now you’re probably sick of learning the rules and want to move on to something new.
Don’t worry—we’re almost done. Two more important points need to be mentioned concerning
case sensitivity and attribute quoting.
Unlike HTML, in which <font> is the same as <FONT> is the same as <Font>, XML is case
sensitive. All beginning and ending tags must be the same case. To continue using our menu
metaphor, <name>About Us</Name> will cause the parser to generate an error saying, End tag
‘Name’ does not match the start tag ‘name’. Determining what case to use is completely
up to you. Because the parser normally tells you what the problem is, this is an easy problem
to resolve, assuming that you pick a naming convention and stick to it.
One final rule exists that may or may not be more difficult to remember, depending on how
you code your HTML files. All XML attributes must be quoted for the parser to be able to
parse the document. This means that the following HTML code would not be acceptable if it
were written in an XML document:
<body bgcolor=#ffffff>
This is easily fixed by adding quotes around the bgcolor attribute:

<body bgcolor=”#ffffff”>
Because of the recent XHTML recommendation (covered later in this chapter) and because
you will be working with XML more in the near future, it is highly recommended that you get
into the habit of quoting all your HTML attributes. It will make the transition to XML and its
related technologies much easier.
Valid XML Documents

Now that you’ve learned all the rules associated with creating well-formed documents, let’s
move on to discussing valid XML documents. Until now, you’ve seen examples of documents
21
CHAPTER 2
that allow for the inclusion of basically any element. Going back to Listing 2.1, if you wanted
to include an <image> element as a child of all <menuItem> tags, you could do so without caus-
ing any problems in the XML document.
The freedom to add any desired element to an XML document probably sounds like a pretty
good feature, and it is in certain circumstances. However, imagine creating an application that
manipulates an XML document received from a particular supplier. This application is pro-
grammed to work with a specific XML document that is assumed to contain a consistent set of
elements and attributes. What happens if the supplier starts adding in elements that your appli-
cation isn’t expecting? The supplier’s “freedom” to add in new elements may cause your appli- 2
cation to malfunction, crash, or simply report incorrect data.
ASP.NET BASICS
To prevent this exact scenario from occurring, XML documents can be checked or validated
XML FOR
against another document to make sure that the structure of the XML document is valid. This
means that you can have an XML document that follows all the XML rules (and is thus well
formed) but is invalid. How can this be possible? Enter the world of Document Type
Definitions (DTDs) and XML schemas.
To allow for validation of XML documents, a type of “metadata” document can be created to
define what elements, attributes, and other items can be contained within the XML document.
By defining the structure of the XML document, the problems described in Chapter 1 with the
imaginary supplier can be prevented because the application can validate the document before
performing any calculations or transformations.
Because DTDs and XML schemas cannot be summarized in just a few paragraphs, you’ll learn
about them in greater detail in Chapter 4, “Understanding DTDs and XML Schemas.” Now it’s
time to cover the specifics on items used to construct an XML document.
The XML Declaration

ASP.NET provides developers with directives that can be declared at the top of each ASP.NET
page. Directives allow a developer to specify what language the ASP.NET page will use,
whether sessions will be used on a given page, whether the page supports transactions, and
many other items. See the ASP.NET help section in the .NET SDK for information on other
valid directives.
The following are a few examples of some ASP.NET directives. Because it’s probable that you
are already familiar with these, we won’t discuss their purpose:
<%@ Page Language=”VB” ContentType=”text/xml” %>
<%@ Page Language=”C#” EnableSessionState=”False” %>
<%@ Import Namespace=”System.Xml” %>

22
XML declarations function in a manner similar to ASP.NET directives. You already saw your
first XML declaration back in Listing 2.1 even though you didn’t realize it at the time. The
XML declaration shown there was
<?xml version=”1.0” encoding=”UTF-8” standalone=”yes”?>
An XML document can be loaded using Microsoft’s System.Xml assembly (discussed in

Chapter 6), even if the declaration is omitted from the document. However, each XML docu-
ment should have a declaration so that information about the document can be viewed at any
time—both by humans and parsers. This will especially be true in the near future as different
versions of the XML specification are released.
Several key points need to be made about the XML declaration, including
• The xml keyword
• The version of the XML document
• The document encoding type
• The standalone keyword
The xml Keyword

The most important point to make about using the xml keyword is that it’s reserved in the
XML language and must be lowercase. You cannot use it as a name of an element or anything
else you create in an XML document without an error being thrown. Also, no whitespace can
exist between the first question mark and the xml keyword. If whitespace does exists, an error
will be raised by the parser.
Incorrect: <? xml version=”1.0” encoding=”UTF-8” standalone=”yes”?>
Correct: <?xml version=”1.0” encoding=”UTF-8” standalone=”yes”?>
The XML Document Version

Aside from the xml keyword, the previous declaration also shows the version as “1.0”. As the
World Wide Web Consortium (W3C—http://www.w3.org) XML recommendations continue to
be issued, the version will be changed so that parsers can detect an XML document’s version.
For now, however, the version is 1.0. Having access to the version of an XML document means
that conditional actions can be performed on the document, depending on the version number.
The Encoding Type

The XML declaration we’ve been working with also lists the encoding type of the document.
The simple document lists an encoding type of “UTF-8”, which is the default encoding type if
none is listed in the declaration.
23
CHAPTER 2
Before expounding on the different encoding types, it’s worth mentioning how computers store
text. A computer doesn’t actually store text as text, which may sound a little confusing. Rather,
it converts the text into a number or group of numbers that the computer recognizes as repre-
senting the text. Therefore, one particular computer may store a numerical representation of
the text differently than the same text on another computer.
To avoid problems that could potentially arise from different numeric representations of text on
computers, XML documents can specify how the document text is encoded. By doing this, a
computer receiving an XML document knows what encoding type to work with and can make
the appropriate transformation to read the document. 2
ASP.NET BASICS
This transformation process works because of character encodings developed by the Unicode
Consortium (http://www.unicode.org). To account for some older systems not being able to
XML FOR
read Unicode, the Unicode Transformation Formats (UTF) were developed. XML documents
default to the UTF-8 character encoding that uses single-byte encoding and Character Entities.
UTF-16 is a two-byte encoding standard that is needed to accommodate a large number of
characters such as those found in the Japanese or Chinese languages. Several other encoding
standards, including ISO-8859-1 and Shift_JIS, map to different encoding types.
Entire books have been written about Unicode and bit-shifting techniques, so this subject will
be covered only from a high-level perspective. The important point to take away from this
discussion is that many encoding types are available. But, the XML 1.0 Recommendation
specifies that all XML parsers must support the UTF-8 and UTF-16 encoding types. Sticking
to one of these types should help prevent problems that arise from exchanging XML docu-
ments between differing computer systems. An excellent article that gives an in-depth
look into the subject of XML encoding was written by Microsoft. You can find it at
http://msdn.microsoft.com/xml/articles/xmlencodings.asp.
The standalone Keyword

We briefly touched on the subject of DTDs earlier in the chapter. As mentioned previously,
you’ll learn more about these documents and XML schemas in Chapter 4. The optional
standalone keyword simply tells the XML parser whether the current XML document can
survive on its own or if it is dependent on another document (such as a DTD). Although the
standalone type was declared in the XML document shown in Listing 2.1, the document would
have defaulted to “yes” anyway had this not been specified. Although it is optional, if
you had created a DTD (a document describing what your XML document can contain) that
went along with this XML document, you could have used the following XML declaration:
<?xml version=”1.0” encoding=”UTF-8” standalone=”no”?>
24
XML Elements
The <a>, <hr>, <body>, and <div> tags found in the HTML language are all examples of
HTML elements. Each has a special meaning and function that different browsers understand.
Your familiarity with HTML makes understanding XML elements very straightforward and
easy, especially when you consider that an XML element is simply a container, like many of
the HTML elements you’re used to working with.
Listing 2.2 shows a few XML elements that you’ve seen before.
LISTING 2.2 XML Elements

5: </menuItem>
The <menuItem> element shown in Listing 2.2 is a parent to the <hyperLink> and <name>
elements. As described earlier in the chapter, each element tag must be closed for an XML
document to be considered well formed.
An XML element isn’t defined as only the tag itself. It includes all other subelements or con-
tent between the element’s starting and ending tags. This content is referred to as the “Content
Model” of the element. This means that the <menuItem> element shown in Listing 2.2 is
defined as containing the starting and ending <menuItem> tags as well as the <hyperlink> and
<name> tags.
The W3C XML 1.0 Recommendation allows document authors to name XML elements as they
would like. However, the elements must start with a character from A–Z or a–z or may start
with the ”_” character. Therefore, the following are all legal elements as defined by the W3C:
<bike>
<color>red</color>
<Type>adult bike</Type>
<STYLE1>Woman’s</STYLE1>
<_price>100.99</_price>
</bike>
Whereas the following are not legal elements:

<5bike>
<2color>red</2color>
<3Type>adult bike</3Type>
<@STYLE>Woman’s</@STYLE>
<*price>100.99</*price>
</5bike>
25
CHAPTER 2
Failure to follow the naming rules will result in an error similar to A name was started with
an invalid character.
XML Attributes
To continue with our HTML/XML comparisons, an attribute in XML is very similar to an
HTML attribute: the attribute helps describe an element. In HTML you can change the color
attribute to the <font> element:
<font color=”#02027a”>Some Text</font>
2
In XML you can follow the same guidelines:
ASP.NET BASICS
XML FOR
<menuItem itemNumber=”1”>Some Text</menuItem>
In this XML example, itemNumber is an attribute of the <menuItem> element. You’ll notice that
itemNumber’s value is quoted. Whereas in HTML you can get by without quoting attribute val-
ues, the XML rules require that all attribute values are quoted. Failure to follow this rule will
result in an error being raised by the parser.
The strict naming conventions applied to naming elements also apply to naming attributes in
XML. Attribute names must start with a character from A–Z or a–z, with the exception of the
“_” character. Failure to follow the naming rules will result in an error similar to A name was
started with an invalid character.
When to use attributes in an XML document is completely up to you. Some XML program-
mers use attributes rather than elements because they can take up less space and are quicker to
add to an XML document. For example, Listing 2.3 shows a recordset obtained from a data-
base converted to XML using only XML elements, whereas Listing 2.4 shows the same con-
version but with attributes used where possible.
LISTING 2.3 XML Recordset with Elements Only

2: <recordset>
3: <row>
4: <customerID>1</customerID>
5: <customerName>Dan Wahlin</customerName>
6: <customerAddress>1234 Any Street</customerAddress>
7: </row>
8: <row>
10: <customerName>Aaron Horne</customerName>
11: <customerAddress>3564 Any Street</customerAddress>
12: </row>
13: </recordset>
26
LISTING 2.4 XML Recordset with Attributes and Elements

2: <recordset>
3: <row customerID=”1” customerName=”Dan Wahlin”
4: customerAddress=”1234 Any Street”/>
5: <row customerID=”2” customerName=”Lamar Kirby”
6: customerAddress=”3564 Any Street”/>
7: </recordset>
As you can see from the code in Listing 2.4, using attributes to describe data can result in
smaller file sizes and may be considered by some to be easier to read. Whether attributes are
used is completely up to you. A good general rule to follow is that if data describes an element
in XML, it may be a good candidate for an attribute (although it doesn’t have to be). For exam-
ple, if you have an element named house and want to describe its color, you may want to
make color an attribute. Thinking in terms of object-oriented programming, an attribute is
similar to an object property.
If you realize that you’ve defined an attribute in your XML document that may actually need
to contain child elements of its own, the attribute should be changed to an element because
attributes cannot contain child elements. For example, the following XML document contains
an attribute named fname:
<root>
<name fname=”Dan” lname=”Wahlin” title=”Mr.”/>
</root>
What if the fname attribute actually needs to describe the birth name and nickname of an indi-
vidual? You could always delimit the two values with a pipe character, but this would defeat
the purpose of XML. Instead of doing this, you would want to break fname out as a child
element of the name element:
<root>
<name lname=”Wahlin” title=”Mr.”>
<fname>
<birthName>Daniel</birthName>
<nickName>Dan</nickName>
</fname>
</name>
</root>
You could make birthName and nickName attributes of fname as well:

27
CHAPTER 2
<root>
<name lname=”Wahlin” title=”Mr.”>
<fname birthName=”Daniel” nickName=”Dan”/>
</name>
</root>
XML Namespaces
Namespaces are an important part of the XML language, especially because developers can cre-
ate their own element tag names in an XML document. A namespace allows tags with the same 2
name to be distinguished based on who created the tags and what the tags are meant to describe.
ASP.NET BASICS
XML FOR
One of the easiest ways to see why namespaces are needed in XML is to look at a more
abstract example. If I asked you where the phone number 669-5835 called to, what would be
your response? More than likely, you would say something like, “I have no idea because you
didn’t even supply an area code!” Now, what if I asked you where the phone number (480)
669-5835 called to? Although you might not know the individual residence or business this
number called, with a little research you would be able to learn that this calls a number located
somewhere in the Phoenix, Arizona metropolitan area. The area code helped determine who, or
in this case where, the number was associated.
Now that you’ve seen a more abstract example, let’s take a look at another that ties into XML
more directly. Part of XML’s inherent power lies in its capability to describe data through
creating custom tags. This can present problems because of name collisions, as the following
scenario illustrates.
Company A sends Company B an XML document containing data about A’s customers.
Company B combines its own XML document with Company A’s document and updates a
database with the combined information. Both documents contain an element named
customerID. Company A’s customerID element contains a unique customer identifier gener-
ated by its computer systems. This number can be stored “as is” in the database with no
manipulation. Company B’s customerID element contains the customer’s Social Security num-
ber, including dashes. Before inserting this value into the database, all dashes must be stripped
out. Listing 2.5 shows Company A’s XML document and Listing 2.6 shows Company B’s
XML document.
LISTING 2.5 Company A’s XML Document—Unique Identifier

2: <recordset>
3: <row>
28
LISTING 2.5 continued

5: <name>Wahlin Consulting</name>
6: <street>1234 Any Street</street>
7: </row>
8: <row>
10: <name>Warehouse Central</name>
11: <street>3564 Any Street”</street>
12: </row>
13: </recordset>
LISTING 2.6 Company B’s XML Document—Social Security Number

2: <recordset>
3: <row>
4: <customerID>528-11-1234</customerID>
5: <name>Dan Wahlin</name>
7: </row>
8: <row>
9: <customerID>123-22-1234</customerID>
10: <name>Lamar Kirby</name>
12: </row>
13:</recordset>
Reading through this sample scenario, you can see how combining Company A’s and Company
B’s XML documents may create a problem when the database needs to be updated. Because
both companies use the customerID element in a different way for describing their data, and
because Company B has a business rule stating that customer Social Security numbers must be
stored without dashes in the database, it will be very difficult to distinguish from which com-
pany the customerID element came. This will make it difficult to know when to apply
Company B’s business rule.
To complicate this more, what if Company A’s name element was actually describing where a
particular part was manufactured, whereas Company B’s name element was the actual customer
name? The resulting ambiguity could make document exchange between the companies diffi-
cult and could complicate application development.
To simplify this process and allow elements of an XML document to be uniquely identified
and associated with the appropriate creator, the W3C recommended XML namespaces. As you
will see, namespaces help to eliminate the guesswork that occurs when different companies, or
29
CHAPTER 2
even different departments within the same company, use element or attribute names that are
the same but have different meanings.
Namespace Structure
Namespaces are easy to implement in an XML document after their structure is known and
understood. Before discussing namespaces in more detail, it’s important to know that all items
within an XML document are by default part of a namespace. This is because the reserved
keyword xml is associated with its own native namespace.
So that you gain a good understanding of namespace structure and types, let’s take a look at 2
code samples involving the following namespaces, which comply with the W3C’s January
ASP.NET BASICS
1999 Namespace Recommendation (http://www.w3.org/TR/1999/REC-xml-names-
XML FOR
19990114):
• Default Namespaces
• Qualified Namespaces
Default Namespaces
The following code shows a default namespace being applied to the <root> element:
<root xmlns=”http://www.anyURL.com”>
<menuItem itemNumber=”1”>
<hyperLink>/news/default.aspx</hyperLink>
<name>Press Releases</name>
</menuItem>
</root>
The preceding default namespace starts by using the keyword xmlns, which is shorthand for
“XML Namespace.” Because it is declared in the root of the XML document, it applies to all
content contained between the beginning and ending <root> tags. We can say that its scope is
“global” to the XML document because it applies to everything from the root down (or the
root up, if you think in terms of a tree structure).
The xmlns keyword is set equal to a Universal Resource Identifier, or URI, which in this
case is “http://www.anyURL.com”. The URI can be any unique identifier and does not
have to be a hyperlink. I could have just as easily specified xmlns=”myBusiness” if I had
wanted to. However, this wouldn’t be very efficient or unique because someone else could
use “myBusiness” as the URI for a namespace in their XML document. The best bet when
deciding on URI identifiers is to use your Web site URL followed by a particular path
(http://www.anyURL.com/myNamespace); it is unique and therefore unlikely that someone
else will use that same URL, especially because they have no ownership of it.
30
The URL does not actually have to point to a real file. This is a difficult concept to grasp at
first. After all, why list a URL for the namespace URI if it doesn’t actually point to a file? To
answer this, realize that the URI is simply a unique identifier and that URLs work well in this
situation because they are unique to start with.
In the previous example, the root element contains the following definitions:
URI = “http://www.anyURL.com”
Local name = “root”
The <menuItem>, <hyperLink>, and <name> elements are associated with the following defini-
tions, respectively:
Local name = “menuItem”
Local name = “hyperLink”
Local name = “name”
You can see that the default namespace’s ”scope” is everything within the XML document in
this example because all elements inherited the URI of “http://www.anyURL.com”. From the
preceding definitions, you can also see that each element’s name is referred to as its Local
name.
To better understand the scope of default namespaces, let’s run through a simple example
involving a variable in ASP.NET. Let’s assume that you have an ASP.NET page with a variable
named menuItem declared at the top of the page. This variable is assigned a value of “Contact
Us”. The variable is not declared within a function, so it has a global scope as far as the page is
concerned. Because of the global nature of the variable, we can access its value from anywhere
in the page.
Now let’s apply this same ASP.NET variable example to the default namespace example shown
next:
<root xmlns=”http://www.anyURL.com”>
</menuItem>
</root>
Because the default namespace declaration is “declared” at the root, all content within the
XML document will inherit this default namespace. This falls in line with the global menuItem
variable described in the ASP.NET example. Let’s see what happens if a default namespace is
declared in the menuItem tag, as shown next:
31
CHAPTER 2
<root>
<menuItem itemNumber=”1” xmlns=”http://www.anyURL.com”>
</menuItem>
</root>
The default namespace will apply to everything contained within the <menuItem> beginning
and ending tags. This says that the default namespace for the <hyperlink> and <name> tags is
now associated with the declared URI “http://www.anyURL.com” The root tag will not be a
part of this namespace because it isn’t within the <menuItem> beginning and ending tags. 2
ASP.NET BASICS
This local scope (local to the <menuItem> tag) is similar to moving our menuItem variable dec-
XML FOR
laration in the previous ASP.NET example into a function. Moving it within the function
makes the variable’s scope local to the function. Anything outside of the function cannot see or
access the variable unless a call is made to the function and the variable’s value is returned. A
similar process is occurring in the previous XML code. If we treat the <menuItem> tag like our
ASP.NET function, the same type of scope applies with respect to the default namespace:
xmlns=”http://www.anyURL.com”.
Qualified Namespaces
Although default namespaces are certainly useful when an element and all contents within it
share the same namespace, situations may arise when some of the contents within an element
come from a different namespace. Listing 2.7 shows an example.
LISTING 2.7 Working with Different Default Namespaces

2: <root>
3: <menuItem itemNumber=”1” xmlns=”http://www.myCompany.com”>
4: <hyperLink xmlns=”http://www.anotherCompany.com”>
5: /news/default.aspx
6: </hyperLink>
7: <name xmlns=”http://www.anotherCompany.com”>Press Releases</name>
8: <image>picture.jpg</image>
9: </menuItem>
10: </root>
From the preceding example, you can see that the <menuItem> and <image> elements are asso-
ciated with the “http://www.myCompany.com” default namespace. As you learned earlier, the
scope of this namespace normally includes everything that is contained within the <menuItem>
beginning and ending tags (remember the ASP.NET function comparison). In this example,
however, the <hyperlink> and <name> subelements belong to the “http://www.another
Company.com” default namespace. Because each of these subelements has its own default
32
namespace defined, the namespace declared for the <menuItem> element that normally trickles
down is overridden.
Although the code shown in Listing 2.7 is one potential way of declaring the two namespaces,
what would happen if 10 (or 1000, for that matter) other elements were associated with the
“http://www.anotherCompany.com” namespace? If a change is made to the namespace URI,
implementing the change requires that each of the 10 elements are updated. As an ASP.NET
developer, this should raise some red flags.
What if a specific namespace needs to be applied to the itemNumber attribute only? Default
namespaces do not allow for this because they apply to all content within the element in which
they are declared.
To combat these problems, the qualified namespace was developed. A qualified namespace
allows for namespaces to be specifically applied to individual elements (and other XML items
such as attributes) rather than having the namespace trickle down as you’ve seen with default
namespaces. To make this more clear, Listing 2.8 shows an example that uses a qualified
namespace.
LISTING 2.8 A Qualified Namespace Example

2: <root xmlns:another=”http://www.anotherCompany.com”>
3: <another:menuItem itemNumber=”1”>Some Text</another:menuItem>
4: </root>
In Listing 2.8, the namespace URI (“http://www.anotherCompany.com”) is associated with

a prefix named another and a local name of menuItem. Used in this manner, the prefix
(another) serves as a qualifier for the namespace. This means that the namespace applies only
to elements that have their name prefixed with another. Thus, the <root> element does not
belong to the qualified namespace even though the namespace was declared within the <root>
element. The <menuItem> element is a part of the namespace because its name is prefixed by
the word “another.”
What about the itemNumber attribute? Is it part of this qualified namespace? No, it is not pre-
fixed with the another namespace prefix, so it is considered to be in “no namespace” or
“locally scoped.” However, because attributes are used to describe elements, “locally scoped”
when used in this context simply means that the attribute is identified by the element that it is
describing, which of course does have a namespace prefix. The following code segment
reworks Listing 2.7 using a combination of default and qualified namespaces:
<?xml version=”1.0” ?>
<root xmlns=”http://www.myCompany.com”
33
CHAPTER 2
xmlns:another=”http://www.anotherCompany.com”>
<another:hyperLink>/news/default.aspx</another:hyperLink>
<another:name>Press Releases</another:name>
<image>picture.jpg</image>
</menuItem>
</root>
The default namespace (“http://www.myCompany.com”) applies to the <root>, <menuItem>,

and <image> elements. The qualified namespace (“http://www.anotherCompany.com”) applies
only to the <hyperLink> and <name> elements, which override the default namespace. If you 2
had wanted to include the itemNumber attribute in the qualified namespace, you could have
ASP.NET BASICS
done so with the following code segment:
XML FOR
<menuItem another:itemNumber=”1”>
Keep in mind that because attributes are always associated with specific elements, it’s not nor-
mally necessary to associate them with a namespace, because the namespace association can
be inferred from the element that they are describing.
TIP
As you read through the previous namespace examples, you may have wondered
how to clear a namespace after it was declared. For example, if a <root> element
declared a default namespace, how can that namespace be cleared if you don’t want
it to apply to a particular subelement? Going back to our ASP.NET comparisons,
namespaces can be cleared in the same manner as string variables declared in an
ASP.NET page—by setting the variable equal to empty quotes: variableName = “”. To
apply this same principle to clearing a default namespace, simply set the namespace
URI equal to empty quotes: <menuItem xmlns=””>.
Referring back to the problem that arose when XML documents from Company A and
Company B were merged, the customerID elements from each company could easily be identi-
fied correctly by prefixing them with a qualified namespace or by defining a default namespace
for each XML document. The following example shows how Company B’s document could be
rewritten using a qualified namespace so that naming conflicts associated with the customerID
element are eliminated:
<recordset xmlns:companyB=”http://www.companyb.com/customers/namespace”>
<row>
<companyB:customerID>528-11-1234</companyB:customerID>
<companyB:name>Dan Wahlin</companyB:name>
34
<companyB:street>1234 Any Street</companyB:street>

</row>
<row>
<companyB:customerID >123-22-1234</companyB:customerID>
<companyB:name>Lamar Kirby</companyB:name>
<companyB:street>3564 Any Street</companyB:street>
</row>
</recordset>
Namespaces are not always needed in an XML document, but in situations like the preceding
one shown with Company A and Company B, many problems can be alleviated by using them.
XML Processing Instructions

Remember our earlier discussion on the XML declaration? You didn’t know it then, but that
declaration is actually one form of an XML processing instruction (PI). PIs are useful in letting
parsers know how a particular document should be processed. The following PI lets the parser
know to process the document as XML version 1.0 with an encoding type of UTF-16:
<?xml version=”1.0” encoding=”UTF-16”?>
PIs can also be used when you want a particular action to be taken by one parser but ignored
by another. For example, assume that a parser looks for a PI that tells it to copy the current
XML file to a network folder and rename it. Assuming a parser is capable of performing this
functionality, the PI to accomplish this task may look like the following:
<?copyFile path=”\\npath\folderName”?>
Although the exact instruction to perform this task would of course be dependent on the parser,
you can see that all PIs start with the <? characters and end with the ?> characters. PIs can be
very useful when multiple parsers will parse a given document. Processing instructions allow
each parser (RTF, CDF, XML, and so on) to perform specific functions without causing prob-
lems in other parsers.
XML Comments
XML comments are the same as HTML comments. Although I personally prefer commenting
single lines of code using the single apostrophe (‘) as in Visual Basic or the // notation found
in C#, XML comments are just as easy to use when commenting out large sections of code.
Back in Listing 2.1, you saw the use of XML comments:

Although this was a very simple example of adding helpful comments to an XML document,
comments can also be used to comment entire sections out of an XML document, as shown in
Listing 2.9.
35
CHAPTER 2
LISTING 2.9 Adding Comments to XML

2: <root>
3: <menuItem>Home</menuItem>
4: 
22: </root>
Line 4 of Listing 2.9 starts the comment off by using the 
characters signal to the parser that the comment has ended. Any XML content found between
the comment tags will be ignored by the parser.
WARNING
If any data within a commented section of code contains the -- characters, the parser
will throw an error saying, Incorrect syntax was used in a comment. An entity
should be used to represent these characters if they exist within the data found in an
XML document. Entities are discussed in the next section.
XML Entities
Back in the old days (you know. . .1994 or so) when I was first learning HTML, I had a hard
time remembering what characters needed to be put into an HTML document when I wanted
several spaces to show up in a browser. Having had many years of practice, my brain has
finally remembered (hopefully permanently) that the HTML character entity represents
36
a space to the browser. As you are aware, HTML allows for a variety of characters to be
inserted into a Web page. For example, the HTML entity © represents the © character,
whereas the HTML entity ® represents the ® character. XML leverages entities in a simi-
lar manner as HTML.
XML uses the < and > characters to mark up an element, which can cause problems when these
characters exist in the data. To see this problem in action, try loading the following XML docu-
ment in Internet Explorer 5 or later:
<root>
<calculation>1 < 2</calculation>
</root>
You’ll see that the parser generates an error saying Whitespace is not allowed at this
location. At first glance, this error doesn’t seem to relate to the insertion of the < character
into the data. However, if you remember the discussion on XML element naming, you will
recall that element names cannot start with a space. The error is being generated because the
parser thinks that a new element is being started and the element name starts with a space.
Situations such as these involve Standard Entities.
Standard Entities
How can the preceding problem be solved? The answer is to take advantage of Standard
Entities defined in XML. These entities have a similar structure to the ones found in HTML.
Take a look at Table 2.1 to see the characters and their entity references.
TABLE 2.1 XML Standard Entities

Character Entity Reference
< <
> >
“ "
‘ '
& &
Using the Standard Entity reference for the < character, we can fix the error that occurred when
the previous XML document was loaded:
<root>
<calculation>1 < 2</calculation>
</root>
37
CHAPTER 2
Although < shows in the XML document source code, when the document is parsed, the <
character appears in the output. Why does this happen? Just as the browser converts to
a space, the XML parser will convert any entities that it recognizes into their appropriate char-
acter representations.
Character Entities
Character Entities look a lot like Standard Entities but allow for the display of characters typi-
cally not found in the 7-bit ASCII character set or on the majority of computer keyboards in
the world. A little earlier you took a look at how a space could be inserted into a Web page 2
using the entity. What happens if you want to insert a space into your XML data? You
ASP.NET BASICS
can do so easily by using a Character Entity:
XML FOR
<root>
<menuItem> </menuItem>
</root>
An interesting point about this example is that if you leave the preceding code exactly as is but
substitute a physical space for the entity, Internet Explorer 5 or greater will show the
following when the XML file is opened:
<root>
<menuItem/>
</root>
What happened to the space? Right-click the document, choose View Source, and you’ll see
that the space is actually there. However, because it was not specified that the space within the
<menuItem> element needed to be “preserved,” the parser treated the element as if it were
empty. We’ll talk more about handling whitespace a little later in this chapter.
Getting back to Character Entities—if it is necessary to insert the ® character into XML data,
the ® general entity can be used. The following is an example:
<root>
<menuItem>®</menuItem>
</root>
If you’re wondering how you can know what Character Entities exist, you can easily see all the
values by running the ASP.NET script shown in Listing 2.10 (increase the upper bound of i as
necessary):
38
LISTING 2.10 Character Entity ASP.NET Script

1: <script language=”C#” runat=”server”>
2: private void Page_Load(Object sender, EventArgs e) {
3: Response.Write(“<table>”);
4: Response.Write(“<tr>”);
5: for (int i=1;i<1001;i++) {
6: if (i % 4 == 1) {
7: Response.Write(“</td></tr><tr><td>”);
8: } else {
9: Response.Write(“<td>”);
10: }
11: Response.Write(Server.HtmlEncode(“&#” + i + “;”) + “ = “ +
12: “&#” + i + “;</td>”);
13: }
14: Response.Write(“</table>”);
15: }
16: </script>
Internal, External, and Parameter Entities

If you’ve spent much time working with include files in ASP.NET, you’re well on your way
toward understanding Internal, External, and Parameter Entities. With ASP.NET, the include
file declaration is replaced with the actual contents of the include file when processed on the
Web server. The various entities we’ll touch on in this section behave in a similar fashion as
they are processed by an XML parser.
Although these types of entities are actually associated with DTDs, their syntax and applica-
tion are very simple if you understand how includes work in ASP.NET. Therefore, we’ll cover
them here and leave the more detailed information about DTDs until Chapter 4.
Internal Entities
Thus far you’ve seen that entity references always start with the & character and end with the ;
character. This same structure holds for all types of entities (excluding Parameter Entities, as
you’ll see).
An Internal Entity allows a frequently used piece of information to be defined once within a
DTD and reused as many times as necessary in an XML document. This is exactly how an
include (or a user control) works with ASP.NET. A reusable piece of code (such as a header) is
put into an include file, which is placed on the Web server. This include file can then be used
as often as necessary in other ASP.NET pages. Any changes made to the include file are made
in only one place, which is, of course, very efficient from a maintenance point of view.
The following is an example of how to declare an entity in a DTD:
39
CHAPTER 2
<!ENTITY companyName “Tomorrow’s Learning”>
To use the entity within an XML document, simply wrap the name with the normal entity start
and end characters (& and ;). This is similar to wrapping include files with the 
syntax. Because we haven’t discussed DTDs, you’ll get your first taste of them by seeing an
“internal” DTD that is used to define the companyName entity:
<!DOCTYPE root [
]>
<root>
2
ASP.NET BASICS
<menuItem>
<hyperLink>http://www.TomorrowsLearning.com</hyperLink>
XML FOR
<name>&companyName;</name>
<menuItem>
<hyperLink>/location/default.aspx</hyperLink>
<name>Location of &companyName;</name>
</menuItem>
</menuItem>
</root>
Running this document in Internet Explorer 5 will result in the &companyName; entity being
replaced with the text Tomorrow’s Learning.
External Entities
External Entities function in the same manner as Internal Entities from the XML document’s
perspective. The entity name (wrapped with & and ;) is placed in the XML document and is
replaced with whatever text was specified by the External Entity declaration when the docu-
ment is parsed. The difference between the Internal Entity and External Entity is where the
entity definition occurs.
With an External Entity declaration, the entity declaration is made outside of the DTD in a
separate file. Here’s an example of an External Entity declaration. The actual definition would
occur within the DTD:
<!ENTITY companyName SYSTEM “http://www.TomorrowsLearning.com/companyName.txt”>
This declaration names the entity just like the Internal Entity did but then defines the declara-
tion as a SYSTEM entity. This information is followed by the path to the declaration file contain-
ing the entity’s information.
External Entities can even point to other XML files. This can be very useful in situations
where multiple departments all make small contributions to a “master” XML document. Each
department maintains its own XML document and these “mini” documents are pulled into one
main XML document through the use of External Entities.
40
Parameter Entities
Updating information that is located in one place makes maintenance of XML documents eas-
ier and more efficient. What if a DTD contains repeated sections, though? Wouldn’t it be nice
to able to declare these sections once and then reuse them throughout the DTD?
Parameter Entities accomplish this exact task through a syntax that is very similar to Internal
and External Entity declarations. The declaration syntax differs from the previously discussed
entities only in the use of a % character followed by a space:
<!ENTITY % children “(location,address,phone+)”>
This Parameter Entity declaration can then be used within the DTD by preceding the entity
name with % and ending it with ;:
%companyName;
Although both character and parameter entities are defined in a DTD, the parameter entity can
be used only within the confines of the DTD itself. It cannot be used in an XML document like
character entities can. Chapter 4 contains a more in-depth discussion of DTDs.
CDATA Sections
There will invariably be times when the data you want to include within an XML document
contains characters that may cause the parser to fail. Earlier you saw that when the < character
was placed into the data, the parser was tricked into thinking that a new element was starting.
As a result, it generated an error.
Aside from using entities to prevent the parser from failing, as shown earlier, the XML lan-
guage also includes the character data (CDATA) section to allow data such as the < or & char-
acters to be included without causing problems when parsed. CDATA sections are unparsed by
the XML parser and treated as text or “character data.” Their content is simply passed through
the parser untouched.
To gain a better understanding of how CDATA sections are handled by the parser, let’s first
take a look at an ASP.NET page with embedded HTML:
<%
for (int i=0;i<10;i++) {
%>
<font color=”#02027a”><b>The current number is:</b></font> <%=i%>
<br>
<%
}
%>
41
CHAPTER 2
In this simple example, the HTML (<font color=”#02027a”><b>The current number

is:</b></font>) in the ASP.NET page is not parsed on the Web server by the ASP.NET
processor. It is simply passed to the browser for display. Only code located between the <% and
%> tags is processed. CDATA sections are treated in a similar manner by the XML parser. They
are part of the XML document, but the parser leaves them alone during the parsing process.
CDATA sections are quite useful when you’d like to include HTML or XML tags within your
XML document but don’t want the parser to treat them as if they are actually part of the XML
document’s elements. In other words, you want the parser to simply pass through all the ele-
ments untouched. The following example embeds HTML data into a <slide> element: 2
ASP.NET BASICS
<presentation>
XML FOR
<slide number=”1”>
<![CDATA[
<table border=”0”>
<tr>
<td>Hello World</td>
</tr>
</table>
]]>
</slide>
</presentation>
Opening this document in Internet Explorer 5 or later results in the CDATA section being
shown exactly as it was entered into the XML document. CDATA sections even leave spacing
alone. Try tabbing or spacing over the <td>Hello World</td> portion of the CDATA section
to the left or right, and you’ll see that it will display exactly as you type it in the Internet
Explorer 5 or later. Why doesn’t it strip out spacing as we saw in a previous example?
Remember that the parser leaves the text within the CDATA section alone and simply passes it
through untouched.
Dealing with Whitespace

Earlier in the chapter, you saw how whitespace (spaces, tabs, returns, and so on) is stripped out
of an XML document during the parsing process. In many ways this default behavior is very
similar to how browsers handle whitespace in HTML. To demonstrate this, the extra space
added after the period in the following example will not show when the page is displayed in
the browser (two spaces were added after the period):
<font face=”arial”>Hello World. HTML is easy!</font>
Although whitespace in the form of spaces can be added to HTML by adding the entity
or to XML by adding the entity, another method exists that allows whitespace to
42
remain in the data. Through using this alternative, you can avoid resorting to CDATA sections
or entities.
What is the alternative? The xml:space attribute. This special attribute can be set equal to
“default” or to “preserve”, as shown next:
<name xml:space=”preserve”> Hello World </name>
By setting the xml:space attribute to “preserve”, all whitespace within the element’s data will
be left untouched by the parser. Setting the attribute to “default” allows the parser to strip the
data of any excess whitespace. Use of this attribute is normally reserved for data that contains
whitespace that needs to be left alone, such as with a poem.
NOTE
Internet Explorer 5 or later does not honor the xml:space attribute by default. When
developing client-side code using MSXML (or MSXML3), you can turn this feature on
by setting the preserveWhiteSpace property to true.
The Relationship of XML to XHTML

On January 26, 2000 the W3C released a recommendation for ”Reformulation of HTML 4
in XML 1.0,” referred to as Extensible Hypertext Markup Language or XHTML (http://
www.w3.org/TR/xhtml1/). You’ll be happy to know that the rules associated with version 1.0
of the XML specification also apply to XHTML, so you do not have to learn a whole new
markup language to use XHTML. Although neither Netscape nor Microsoft specifically sup-
ports XHTML in their browsers, you can start using it today.
The XHTML specification makes the following points about the benefits of migrating from
HTML to XHTML:
• XHTML documents are XML conforming. As such, they are readily viewed, edited, and
validated with standard XML tools.
• XHTML documents can be written to operate as well or better than they did before in
existing HTML 4-conforming user agents as well as in new, XHTML 1.0-conforming
user agents.
• XHTML documents can utilize applications (for example, scripts and applets) that rely
on either the HTML DOM or the XML DOM.
• As the XHTML family evolves, documents conforming to XHTML 1.0 will be more
likely to interoperate within and among various XHTML environments.
43
CHAPTER 2
• Document developers and user agent designers are constantly discovering new ways to
express their ideas through new markup. In XML, it is relatively easy to introduce new
elements or additional element attributes. The XHTML family is designed to accommo-
date these extensions through XHTML modules and techniques for developing new
XHTML-conforming modules (described in the forthcoming XHTML Modularization
specification). These modules will permit the combination of existing and new feature
sets when you’re developing content and designing new user agents.
• Alternative ways of accessing the Internet are constantly being introduced. Some esti-
mates indicate that by the year 2002, 75% of Internet document viewing will be carried
out on these alternative platforms. The XHTML family is designed with general user
2
ASP.NET BASICS
agent interoperability in mind. Through a new user agent and document-profiling mecha-
XML FOR
nism, servers, proxies, and user agents will be able to perform best-effort content trans-
formation. Ultimately, it will be possible to develop XHTML-conforming content that is
usable by any XHTML-conforming user agent.
Now that you’ve seen the benefits of XHTML, you may be wondering how it differs from
HTML. Although the tags are basically the same, all XHTML tags must be closed, nested, and
cased properly for the XHTML document to be well formed. Attributes must be quoted, must
have an associated value, and cannot be defined more than once on the same element. For
example, the selected attribute shown next is incorrect if XHTML rules are being followed:
<option value=”1” selected>ASP.NET</option>
To make this XHTML compliant, the selected attribute must have a value that is quoted:
<option value=”1” selected=”selected”>ASP.NET</option>
A complete discussion of these rules and many others was given earlier in this chapter.
As its name implies, XHTML is extensible. With HTML you cannot introduce a new tag
unless the device that renders it understands the tag. With XHTML, modules can be created
that tell the device how to work with a custom tag. This allows devices such as Internet-
enabled phones to work with a “down-sized” XHTML module that is geared toward lower
memory consumption, and it allows more powerful applications to work with “extended”
XHTML modules that may include custom tags. In general, XHTML is much more flexible,
efficient, and extensible than HTML because of modularization. For more information on
XHTML modules, visit: http://www.w3.org/TR/2001/REC-xhtml11-20010531/.
So what does an XHTML document look like? Actually, it’s very similar to the HTML docu-
ments you’re used to working with except that it follows the XML rules. A simple XHTML
document is shown next:
44

<!DOCTYPE html
PUBLIC “-//W3C//DTD XHTML 1.0 Strict//EN”
“DTD/xhtml1-strict.dtd”>
<html xmlns=”http://www.w3.org/1999/xhtml” xml:lang=”en” lang=”en”>
<head>
<title>My First XHTML Document</title>
</head>
<body>
<p>Hello World!</p>
</body>
</html>
This document starts with the XML declaration, which is followed by one of three possible
DTDs. These include a traditional (most of the general public will use this), a strict (allows for
“cleaner” markup without including layout tags), and a frameset DTD (used with frames), as
shown next:
<!DOCTYPE html
PUBLIC “-//W3C//DTD XHTML 1.0 Strict//EN”
“DTD/xhtml1-strict.dtd”>
<!DOCTYPE html
PUBLIC “-//W3C//DTD XHTML 1.0 Transitional//EN”
“DTD/xhtml1-transitional.dtd”>
<!DOCTYPE html
PUBLIC “-//W3C//DTD XHTML 1.0 Frameset//EN”
“DTD/xhtml1-frameset.dtd”>
Aside from the DTD, you’ll notice that a global namespace is also included on the HTML
element along with two language attributes.
Because XHTML documents follow the XML rules, parsers can treat them as regular XML
documents. This provides a lot of power and flexibility when different XML languages such as
XLink and XPointer are used (discussed in Chapter 3, “XPath, XPointer, and XLink”) to
access specific portions of an XHTML document. Because XHTML is available for implemen-
tation today, you should begin creating all HTML documents in a manner that makes them
XHTML compliant.
Using ASP.NET Objects to Generate XML

At this point, you’re probably anxious to put some of the XML basics that you have learned to
work. Keep in mind that although the content up to this point hasn’t been what you might con-
sider exciting, the concepts you learned are very important in building XML applications that
are both functionally and programmatically correct. Having a thorough understanding of XML
will help prevent many hours of debugging down the road.
45
CHAPTER 2
Because we haven’t discussed the XML parser built in to the .NET platform yet, the sample
application that follows uses what you already know to construct an XML file on-the-fly,
based on user input. By the end of the application, you’ll understand why a parser would make
things a lot easier. Before getting too far into the application code, let’s discuss it in more
detail.
Application Specifics
The application’s purpose is to allow department heads to submit job bid requests for informa-
tion technology contractors. Upon submitting a request, the real-world version of the applica- 2
tion would start a workflow and provide a lot of other functionality that won’t be covered in
ASP.NET BASICS
this sample.
XML FOR
Let’s assume that the business requirements specify that there will be a maximum of 25 bid
requests a week and that after the workflow is started, bid requests need to be stored in an
XML file, which will be used throughout the workflow process. At the end of each month, this
XML file will be imported into a proprietary database using a batch process already in place.
Creating the Application

This application can easily be constructed using standard IO classes built into the .NET plat-
form. Because we haven’t covered creating XML documents using an XML parser yet, these
objects provide a viable solution. However, in future chapters you’ll see how to accomplish the
same task more efficiently using an XML parser.
To start, let’s create an ASP.NET page named default.aspx and place it within a virtual direc-
tory (any virtual directory having “write” permissions will do). In case you’re not familiar with
virtual directories, they are a name alias (you can name them anything you’d like) that repre-
sents a path to a physical directory on your hard drive. To set up a virtual directory in Internet
Information Server, consult the IIS documentation.
The default.aspx page contains the following pieces of functionality:
• Data Entry
• XML Document Creation/Appending
• XML Document Viewing
The data entry section is basic HTML, so the specifics involved in creating it won’t be cov-
ered. We’ll assume that the various form element tags are named properly according to the
application’s business rules. Listing 2.11 shows the code that processes the form after it has
been submitted by the user. The entire file can be found in this chapter’s code as Listing
2.11.aspx.
46
LISTING 2.11 Code Used to Create or Append Information to an XML Document Using
Standard ASP.NET Classes
1: <%@ Import Namespace=”System” %>
2: <%@ Import Namespace=”System.IO” %>
4: string rootTag = “root”;
5: public class CreateXmlFile {
6: HttpRequest Request = null;
7: public CreateXmlFile(HttpRequest ServerRequest) {
8: //Overload Constructor when request object is needed
9: Request = ServerRequest;
10: }
11: public CreateXmlFile() {}
12: public string writeXML(string root,string node,string recordID,
13: string version,string fileName,bool writeEndRoot,bool writeAtts){
14: bool blnExists = false;
15: StreamWriter writer = null;
16: string pathFile = getPath(fileName);
17: //See if the XML file already exists
18: blnExists = File.Exists(pathFile);
19: if (!blnExists) { //Create the file since it doesn’t exist
20: writer = new StreamWriter(File.Open(pathFile,
21: FileMode.Create, FileAccess.Write));
22: writer.WriteLine(“<?xml version=\”” + version + “\”?>”);
23: writer.WriteLine(“<” + root + “>”);
24: } else {
25: writer = new StreamWriter(File.Open(pathFile,
26: FileMode.Append, FileAccess.Write));
27: }
28: writer.WriteLine(“<” + node + “ id=\”” + recordID);
29: if (!writeAtts) {
30: writer.Write(“\”>”);
31: } else {
32: writer.Write(“\” “);
33: }
34: foreach (string x in Request.Form) {
35: if (x.ToUpper()!=”SUBMITBUTTON” && x.IndexOf(“_”) != 0) {
37: writer.WriteLine(“<” + x + “>” + Request.Form[x] +
38: “</” + x + “>”);
39: } else { //Write out as attributes
40: writer.WriteLine(x + “=\””+Request.Form[x]+”\” “);
41: }
42: }
43: }
47
CHAPTER 2

45: writer.WriteLine(“</” + node + “>”);
46: } else {
47: writer.WriteLine(“/>”);
48: }
49: if (writeEndRoot) {
50: writer.WriteLine(“</” + root + “>”);
51: }
52: writer.WriteLine();
53: writer.Flush(); 2
ASP.NET BASICS
54: writer.Close();
55: return “”;
XML FOR
56: }
57:
58: public string ReadFile(string file,bool bGetPath) {
59: //Create the File System Object and txtfile
60: string filePath = “”;
61: string fileOut = “”;
62: StreamReader reader = null;
63: filePath = bGetPath?getPath(file):file;
64: try {
65: reader = new StreamReader(File.OpenRead(filePath));
66: fileOut = reader.ReadToEnd(); //Can be expensive
67: }
68: catch (Exception e) {
69: fileOut = “There was an error: “ + e.ToString();
70: }
71: finally {
72: if (reader != null) reader.Close();
73: }
74: return fileOut;
75: }
76:
77: private string getPath(string file) {
78: string path = Request.ServerVariables[“PATH_TRANSLATED”];
79: path = path.Substring(0,path.LastIndexOf(“\\”)+1) + file;
80: return path;
81: }
82: } //End Class
83:
84: public void submitButton_Click(Object Sender, EventArgs E) {
85: string output = “”;
86: string ID = Request.Form[“dateTime”];
87: ID = ID.Replace(‘/’,’-’);
48

88: ID = ID.Replace(‘:’,’-’);
89: ID = ID.Replace(‘ ‘,’-’);
90: CreateXmlFile file = new CreateXmlFile(Request);
91: string sStatus = file.writeXML(rootTag,”request”,ID,”1.0”,
92: “request.xml”,false,false);
93: if (sStatus.Length != 0) {
94: output += “Error: “ + sStatus;
95: content2.InnerHtml = output;
96: } else {
97: content.Visible = false;
98: content2.Visible = true;
99: }
100: }
101:
102: public void viewXML_Click(Object Sender, EventArgs E) {
103: CreateXmlFile file = new CreateXmlFile(Request);
104: Response.ContentType = “text/xml”;
105: Response.Write(file.ReadFile(“request.xml”,true) +
106: “</” + rootTag + “>”);
107: Response.End();
108: }
109:
110: public void Page_Load(Object sender, EventArgs E) {
111: content.Visible = false;
112: content2.Visible = false;
113: if (!Page.IsPostBack) {
114: content.Visible = true;
115: }
116: }
117: </script>
Looking at the preceding code, you can see that a class is specified named CreateXmlFile.
This class accepts the Request object in its constructor. When the page first loads, the main
HTML form is shown to the user. After entering the necessary information, the user clicks a
button named submitButton that triggers the submitButton_Click event handler on the server.
This handler instantiates the CreateXmlFile class and calls its writeXml() method. This
method accepts the parameters listed in Table 2.2
TABLE 2.2 writeXML() Method Parameters
Parameter Name Purpose

root Root node name
node Record node name
49
CHAPTER 2
TABLE 2.2 continued

Parameter Name Purpose
recordID ID for a given record
version XML document version
fileName Name of XML document
writeEndRoot Write out end root tag
writeAtts Write data as attributes rather than elements
2
The writeXML() method creates a File object and then opens a text file. If the text file doesn’t
ASP.NET BASICS
already exist, one is created (lines 20–23) and the XML declaration and root tag items are
XML FOR
written to the file. The function then proceeds to loop through each form element, writing each
element’s name and value to the document (lines 34–43). If the writeXML() method’s
writeAtts parameter is true, this process will add the form element name/value pairs to the
document as attributes. Otherwise, it will add them as child elements of the node parameter. If
the writeEnd
Root parameter is true, the closing root element tag is written to the text file. This would be
used only if one request were being captured per XML document. If the writeEndRoot para-
meter is false, no closing root tag is written to the document.
Wait a minute! Didn’t we just spend an entire chapter discussing how important it is to follow
the XML rules and close all tags so that XML documents are well formed? Well. . .yes we did.
But because we’re not working with an XML parser, this is a necessary evil for the application
to handle more than one bid request update to the XML file. Because the File object doesn’t
have a method allowing us to delete one specific line easily, we can’t insert the final root tag
until all bid requests for the month have been entered. After we discuss manipulating XML
documents using the DOM in Chapter 6, you’ll see how this problem can be rectified.
A potential resource-intensive alternative would be to use the File object’s ReadToEnd()
method to read the entire contents of an existing XML document into a variable, replace the
existing end root tag with empty strings, add a new bid request, and then write out the end root
tag to the end of the document. This process would occur with each bid request addition and
could become a problem if the client were to start entering more than 25 requests per week as
specified. A few other workarounds to the problem exist as well, although we won’t focus on
them here.
Continuing with the current (non–parser-based) solution, after a bid request has been added to
the XML document, you are presented with an opportunity to see the XML document. The
viewXml_Click event handler (lines 102–108) contains the code that sets the Response.
ContentType to “text/xml”, loads the non–well-formed XML document, dynamically adds
the closing root tag, and then passes the now well-formed document to the browser (Internet
Explorer 5or later required).
50
Application Summary
It’s obvious that this solution isn’t an optimal one. If the client ever anticipates having several
thousand bid requests a month, you can imagine how big the file could get and what type of
concurrency problems would be created. However, as long as the end root tag is added to the
XML document (making it well formed) before the batch process runs to update the client’s
database, it is one potential solution that will work. As mentioned previously, a much more
efficient method of adding to or updating an XML document will be presented in Chapter 6.
Summary
We’ve covered a lot of rules and concepts in this chapter. Although everything discussed plays
an important role in creating XML documents, it’s extremely important that you take away
from this chapter a good knowledge of XML’s rules. Documents that do not follow the rules
may look good at first, but as you saw in the “Well-Formed XML Documents” section, the
parser has no patience for rule bending and will let you know with an error message.
In the next chapter, we’ll begin looking at a few of XML’s relatives that allow information
located both in local and remote XML documents to be queried and/or manipulated. These
include XPath, XPointer, and XLink.
XPath, XPointer, and XLink CHAPTER
3
IN THIS CHAPTER
• Meet a Few of XML’s Relatives 52
• XPath—The SQL of XML 53
• XPointer—Accessing XML Document

Fragments 69
• XLink—Resource Relationship
Management 76
• The XLink Simple Link 82
• XLink Extended Links 83
• Putting XLink Together 87

52
Meet a Few of XML’s Relatives

In Chapter 2, “XML for ASP.NET Basics,” you saw that the XML language provides a struc-
ture for marking up data in a uniform, extensible, and descriptive manner that is readable both
by humans and computers. Although this capability is certainly a welcome addition to the pro-
gramming world, XML’s true power and flexibility can be realized only as it is combined with
other languages and technologies. An XML document is quite useless on its own without a
means for manipulating the data it is describing. To understand this further, imagine a SQL
Server database that allowed for the creation of databases and tables but had no mechanism for
querying the data. Although the data itself would be efficiently organized into relational tables,
this structure wouldn’t be very useful if you couldn’t use Transact SQL (T-SQL) to run queries
against the data. Assuming that an XML document is your SQL Server database, you obvi-
ously need some type of mechanism that allows you to access the data so that the XML docu-
ment can be useful in application development.
Several languages have been developed to allow XML documents to be queried and even
linked in some interesting ways. These languages include XPath, XPointer, and and XLink.
All three can be used in concert to access data located not only within a particular XML docu-
ment but also within external documents located anywhere in the world. Much of the function-
ality contained within the XLink and XPointer languages is still being finalized by the W3C.
As a result, Microsoft’s .NET XML parser implementation (along with many other popular
parsers) does not yet implement XLink or XPointer capabilities. However, this won’t stop us
from discussing some of the exciting capabilities that XLink and XPointer will offer in the
near future.
You may be wondering why a new manner of linking documents needs to be developed? After
all, doesn’t the HTML language provide us with sufficient linking capabilities? The answer is
both “yes” and “no.” Although most HTML documents employ the <a> tag to link different
documents, this functionality is extremely limited.
For example, individuals clicking a link may not want to view the entire document that they
are linking to on the other end. If they are clicking a link to obtain weather information, they
may want only the current temperature for the city they live in. Current HTML linking capabil-
ities require that the entire document be loaded rather than simply the section the individual
wants to view. XPath, XPointer, and XLink exist to remedy this problem and improve on
HTML’s current linking capabilities.
The information in this chapter will be presented in the following order:
• XPath—The SQL of XML?
• XPointer—Accessing XML Document Fragments
• XLink—Resource Relationship Management
XPath, XPointer, and XLink
53
CHAPTER 3
XPath—The SQL of XML

When the W3C committees first began working on an XML query language (http://www.
w3.org/TR/xpath), they realized that the XPointer and XSLT groups were attempting to
accomplish a similar task: access specific sections of an XML document. As a result, the
XPointer, XSLT, and XPath groups worked together to specify a query language to be used
with XML. The current XPath specification (version 1.0) became a W3C recommendation in
November of 1999 and is a result of this joint effort.
If you’ve ever used T-SQL to query a SQL Server database, you know how useful it is for
obtaining recordsets. Depending on the type of ASP.NET development you do, you are more
than likely used to seeing T-SQL (or ANSI SQL) syntax similar to the following example
embedded within ASP.NET pages or stored procedures:
SELECT * FROM AUTHORS WHERE au_id = ‘172-32-1176’
Although syntactically different, XPath provides the same access to XML document data as
T-SQL does to SQL Server data. The XPath language is used by XSLT (covered in Chapter 7,
“Transforming XML with XSLT and ASP.NET”) and XPointer to access data found within
specific XML document elements, attributes, and so on. 3
XPOINTER, AND
So what does XPath syntax look like? You may be surprised to learn that its syntax does not
resemble XML’s syntax at all. In some ways it resembles a syntax that you are more than
XPATH,
XLINK
likely used to working with: the Windows file system.
The Windows file system emulates a hierarchical structure with the drive letter representing the
root of all other files and folders on that drive. An XML document is structured hierarchically
with the root element (also called the root node) containing different child elements (or
nodes). These child elements may themselves contain other child elements.
To access a folder on one of your drives ( C: for example), your computer follows a path simi-
lar to the following:
c:\temp\subFolder1\subFolder2\xmlFile.xml
This path can easily be converted into an XML document:

<C>
<temp>
<subFolder1>
<subFolder2>xmlFile.xml</subFolder2>
</subFolder1>
</temp>
</C>
54
TIP
As a point of review, you cannot simply add C: as the root element name. Remember
in Chapter 2 that we discussed the concept of qualified namespaces. Putting the
colon in the root name makes the parser think that C was declared as a qualified
namespace somewhere in the document. Because it can’t find the declaration (or the
element name after the colon), the parser will raise an error.
If you wanted to access subFolder2 in the preceding XML document using XPath syntax, you
would simply type the following:
C/temp/subFolder1/subFolder2
You could also type a shorthand version. This version of the query searches for all subFolder2
nodes located anywhere within the XML document:
//subFolder2
You’re now an XPath expert! Well, the word “expert” may be a little strong at this point, but
your experience in working with the Windows file system should help you in understanding
how XPath works. XPath is, of course, much more powerful than this simple example. Aside
from its basic syntax, the XPath language also includes its own set of functions to provide
additional utilities such as formatting strings and numbers or performing mathematical
calculations.
Before going into more detail about XPath, it’s important that you realize that it is not
designed as a standalone language. Going back to the Windows file system comparison, with-
out having access to a DOS command prompt, File Manager, or Windows Explorer, typing the
path to a file wouldn’t get you very far and you definitely wouldn’t find the file or folder you
were looking for. The syntax you saw earlier to access a file is functional only when used in
conjunction with another language or program (such as a command prompt). XPath was
designed in the same manner. It is useful only when used with another language such as
XPointer or XSLT.
XPath Basics
The XPath specification defines a term named ”Location Steps” that sums up how the XPath
language actually works. To get to a particular node’s location, you have to list the steps to get
there using an XPath expression or pattern. Going back to the first XPath sample, we defined
the starting point as C and listed each step that should be followed to walk through the element
nodes to get to the destination data. Each step of the process was separated by a / character.
55
CHAPTER 3
Let’s clarify this a little more through giving a simple real-life example. Getting to a node
within a document is very similar to explaining your family lineage. If a friend asked you
about a portion of your family line going back to your great grandfather, you may give the
following information using XPath syntax:
Great_Grandfather/Grandmother/Mother/Self
You can see from this sample that each step in your family lineage is separated by the /
character.
Another important term used in the XPath specification is that of the context node. The context
node is the node being evaluated at each step of an XPath expression. In your family lineage
example, the context node is first your Great_Grandfather. The context changes to the
Grandmother node and then moves to the Mother node. Finally, the context node is Self. This
same logic applies to nodes being evaluated at each step of an XPath expression.
The context node becomes important as evaluations are being done on it. If someone asks you
how many children your ______ had, it would be difficult to answer the question unless the
blank was filled in (unless you can read minds of course). You need to know where to start (the
context) in order to reply. If, however, someone asks you how many children your great grand- 3
father had, you can now provide an answer because you have a context from which to start.
XPOINTER, AND
The path to a particular node within an XML document is constructed from three pieces of
XPATH,
XLINK
information:
• An Axis Type
• A Node Test
• Predicates (filtering)
The following sections explain each of these items in turn. After that discussion, we’ll discuss
XPath functions and get into some examples so that you can see more XPath expressions at
work.
Axis Types
The family lineage XPath statement we looked at earlier took advantage of some XPath abbre-
viations to make things look a little nicer. Without using abbreviations, the same statement
would look like this:
Great_Grandfather/child::Grandmother/child::Mother/child::Self
Don’t worry about the :: characters. We’re not going to define any C++ interfaces or anything
like that. The child:: syntax is one type of axis defined by the W3C that can be used when
you’re trying to locate a node within a document. Each axis is followed by two colons (::)
when used within an XPath expression. Axes are somewhat intimidating at first glance but are
56
very useful when you know a few basics about how they work. Table 3.1 contains all the avail-
able axes as defined in the XPath specification.
Table 3.1 XPath Axes

Axis Axis Description
child Includes all children of the context node.
descendant Includes all descendants of the context node. This includes
children, grandchildren, and so on.
parent Includes the parent of the context node.
ancestor Includes all ancestors of the context node.
following-sibling Includes all siblings that follow the context node in document
order.
preceding-sibling Includes all siblings that precede the context node in docu-
ment order.
following Includes all nodes that follow the context node in document
order. This does not include ancestors.
preceding Includes all nodes that precede the context node in document
order. This does not include ancestors.
attribute Includes all attributes of the context node.
namespace Includes all namespaces within the scope of the context node.
If no namespaces are declared, it is empty.
self Includes the context node only.
descendent-or-self Includes all descendants of the context node as well as the
context node itself.
ancestor-or-self Includes all ancestors of the context node as well as the con-
text node itself.
Although many of the axes, such as self, child, and parent are self-explanatory, a few are
not as obvious at first. Let’s take a look at a few axes examples based on Listing 3.1.
Listing 3.1 Working with Axes in XPath

2: <familyLine xmlns:me=”http://www.someURL.com”>
3: <greatGrandfather fname=”Jonathon” lname=”Doe”>
4: <grandMother fname=”Betty” lname=”Doe”>
5: <father fname=”James” lname=”Doe”>
57
CHAPTER 3

6: <me:self fname=”John” lname=”Doe”/>
7: <sister fname=”Mary” lname=”Doe”>
8: <child fname=”Ted” lname=”Doe”/>
9: </sister>
10: <brother fname=”Billy” lname=”Doe”/>
11: </father>
12: </grandMother>
13: </greatGrandfather>
14: </familyLine >
Assuming that the context node is <father>, the descendant axis to this node would, as the
axis description says, select all descendants, which in this case would be the <self>,
<sister>, <brother>, and <child> nodes. Which nodes would be included in the descendant-
or-self axis if the <father> node is still the context node? This axis would contain the
<father>, <self>, <sister>, <brother>, and <child> nodes. Keep in mind that although this
example uses the element named self, this has nothing to do with the word self as contained
in the descendant-or-self axis. self as used in the axis name simply refers to the context
node, which is <father> in this example. 3
XPOINTER, AND
The following are a few more examples that simply list the axis name and nodes contained
within it. Assume that the <self> node is now the context node:
XPATH,
XLINK
Axis Included Nodes
following-sibling <sister>, <brother>
preceding-sibling none
ancestor <father>, <grandMother>, <greatGrandfather>,
<familyLine>
ancestor-or-self <self>, <father>, <grandMother>, <greatGrandfather>,

<familyLine>
attribute fname and lname

namespace me qualified namespace
following <sister>, <brother>, <child>
preceding none
Node Tests
Node tests let you specify what type of node to locate relative to the context node. Each of the
previously discussed axes have their own principal node type. For those axes that can contain
58
elements, the principle node type is element. Because the attribute axes can contain only attrib-
utes, its principle node type is. . . you guessed it, attribute.
The previous examples have performed node tests to check for element nodes only. However,
you could have done other types of node tests if you had wanted to get to a different location
in the document other than an element. The other node tests you could have used include
• The * wildcard character, which includes all items in the current axis. The wildcard
works in a manner similar to the SQL * wildcard character used in SELECT statements.
• text(), which includes all text nodes in the current axis.
• node(), which includes all nodes in the current axis.
• comment(), which includes all comments in the current axis.
• processing-instruction(), which includes all processing-instructions in the current axis.
Although you’ll have the opportunity to see plenty of examples of node tests in action, the fol-
lowing are a few simple examples with explanations to get you started:
axis::node test Explanation

attribute::* All attributes of the context node are selected.
child::text() Matches the context node children’s text nodes.
descendant-or- Matches all sibling elements that are self::sibling
descendants of the context node. The context node itself is
also included if it is a sibling element.
Predicates
Using axes and node tests can help in reaching a specific location within an XML document.
But what if the context node contains four children that are elements and you want to select
only the third child element? You can use the child axis and element node test to get to the four
children, but neither the axes nor node tests help you to select the third element only.
Predicates, otherwise known as WHERE clauses, help solve this dilemma for you. A predicate
helps to filter out unwanted nodes by taking advantage of built-in functions. This process
is very similar to using the RowFilter property of the ADO.NET DataView class. Using
ADO.NET, filtering out records that don’t have a specific ID (as defined in intID below) is
as easy as typing the following code:
dataView.RowFilter = “ID = “ + intID;
Using XPath, filtering out nodes that don’t meet specific requirements is as easy as typing
child::sibling[position()=3]
This XPath statement uses a predicate to select the third sibling child of the context node.
59
CHAPTER 3
You can see that a predicate is started and ended by using the [ and ] character tokens.
XPath has several built-in functions, which you’ll learn more about in a moment, that can be
used within the predicate statement. The preceding example used the position() function.
Using one or more of these functions in combination with comparative operators, the processor
performs the necessary checks on the current node-set to see which nodes should remain and
should be excluded. If you’re wondering what a node-set is, think of it as another form of an
ADO.NET DataSet. The difference is that the node-set contains a collection of XML nodes,
whereas the DataSet contains records returned from a data source.
The predicate statement as a whole produces a Boolean value after it is evaluated. Either the
predicate statement is true and the node being evaluated is included in the node-set, or it is
false and the node is excluded from the node-set. This Boolean value is derived by comparing
the result of a function call to a value using comparative operators, including
• =, !=
• <, <=, >, >=
• -, +, *
• and, or
3
• div, mod
XPOINTER, AND
• | (union operator)
XPATH,
XLINK
Your first look at predicates showed how to get at the third sibling child node of the context
node. If your goal were to get the third and fourth child sibling nodes, you could use one of the
following statements:
child::sibling[position() >= 3]
child::sibling[position() > 2]
child::*[position() = 3 or position() = 4]
Later in the chapter, you’ll be presented with more predicate statements.
The Location Step Framework

Putting the axis, node test, and predicate information types together results in the following
location step syntax framework:
<axis>::<node-test> [<predicate expression>]
All location steps in an XPath expression will follow this syntax. We can break the syntax into
pieces: the XPath statement must first specify the type of axis to traverse. The child axis is the
default if no axis is specified. After it is on the proper axis, the node test is performed to check
whether a particular node exists. In cases where a specific node must be accessed, a predicate
can be added to the XPath expression, which acts as a filter.
60
Samples of using this syntax are shown later in the chapter, along with abbreviated forms that
can be used to shorten the location step framework. Before these are shown, however, it’s
important that you understand what native functions exist in the XPath recommendation.
Understanding these functions will allow you to fully leverage predicates.
XPath Functions
Many functions are available other than the position() function shown earlier. The XPath rec-
ommendation categorizes each of the predicate functions based on its purpose and functional-
ity. These categories include node-set functions, string functions, number functions, and
Boolean functions. We’ll first discuss the node-set functions.
XPath Node-Set Functions

The position() function that you have been working with thus far is an example of a node-set
function. Node-set functions have the obvious function of returning information about one or
more nodes. Table 3.2 shows the node-set functions listed in the W3C XPath recommendation.
Table 3.2 Node-Set Functions

Function Name Purpose
last() Returns the index number assigned to the last node in the current
context. The example selects the last customer child element.
Example: child::customer[last()]
position() Evaluates the position of the current node being evaluated within a
given context and returns a number. Numbering starts with 1.
Example: child::customer[position() = 2]
count(node-set) This function takes a node-set as a parameter and returns the num-
ber of nodes within the node-set. It is similar to Count(*) in SQL.
The example selects a father node containing four sibling nodes.
Example: descendant::father[count(sibling) = 4]
id(object) Selects nodes based on a unique ID attribute and returns a node-set.
It expects an object and if the object is a node-set, the node-set is
converted to a string value. Any IDs that match this string value will
be returned. This function is similar to the key() function (covered
in Chapter 7), but is more powerful because it can accept a node-set
rather than a single string value. It is designed to work with a valid
document in which attributes have been declared as type ID. The
example selects all customer child nodes with a unique ID of 1234.
The selection will also look for IDREF and IDREFS attribute node
types that have a value of 1234 (ID, IDREF, and IDREFS are dis-
cussed in Chapter 4, “Understanding DTDs and XML Schemas”).
Example: child::customer[id(“1234”)]
61
CHAPTER 3
Table 3.2 continued

local-name(node) Returns the node name located after the colon if a qualified name-
space is associated with the node (applying it to family:father
would return father) or simply the node’s name if no namespace
is associated. The function can accept a node (that is, local-
name(node)) or can simply evaluate the current node being evalu-
ated (local-name()). The example selects all child nodes of the
context node with a local-name of father.
Example: child::*[local-name() = “father”]
name(node) Works like the local-name() function, except that it returns the
actual node name as it appears in the original XML document,
including the namespace prefix if any exists.
Example: child::*[name() = “family:father”]
namespace-uri(node) This function returns the URI of any namespace associated with
the node being evaluated. If no namespace URI exists, an empty
string is returned. The example selects all child nodes of the con-
text node that have a namespace URI equal to 3
http://www.someURL.com.
XPOINTER, AND
Example: child::*[namespace-uri() = “http://
XPATH,
XLINK
www.someURL.com”]
You’ll notice that there is no specific node-set function to extract a namespace prefix (the me in
me:local-name for instance) should one exist on a node. The local-name() function allows
you to get the node’s name, the name() function gets the prefix and local-name together, and
the namespace-uri() function returns only the namespace URI. This certainly appears to pre-
sent a problem because knowing the prefix associated with a particular node may be a big part
of your applications, depending on the number of contributors to an XML document. Fortu-
nately, this problem can be solved by taking advantage of node-set functions used in conjunc-
tion with XPath string functions.
XPath String Functions

String functions in XPath provide some of the basic functionality that you are used to working
with in ASP.NET documents using VB.NET or C#. Functions are available to convert values to
a string, concatenate strings, return substrings, check string lengths, and more. Table 3.3 con-
tains a complete listing of all string functions in the XPath recommendation.
62
Table 3.3 String Functions

string(value) This function converts a value to a string. If a node-set is
passed in, the string value of the first node in the set is con-
verted. This function is not normally needed because string
conversions are handled on-the-fly when a string value is
required. The example converts the number 101 to “101”.
Example: string(101)
concat(string1, string2, This function concatenates (combines) all strings supplied
string3, ...) as arguments. The example returns “Phoenix, Arizona.”
Example: concat(“Phoenix”,”, “,”Arizona”)
starts-with(string1, This function returns true if string1 starts with string2,
string2) and returns false otherwise. The comparison is case sensi-
tive. The example will return true. If string2’s value had
been “a”, false would be returned.
Example: starts-with(“Arizona”,”A”)
contains(string1, This function is similar to the InStr() function in
string2) VB.NET or IndexOf() function C#, although it doesn’t
return a positional value. It returns true if string1 con-
tains string2 in it. The comparison is case sensitive. The
example returns true.
Example: contains(“Sibling”,”Sib”)
substring-before This function is similar to the Left() function in VB.NET,
(string1,string2) although it doesn’t accept a length argument. It returns the
portion of string1 that precedes string2. If string2
doesn’t exist in string1, empty strings are returned. The
example returns the “child” string.
Example: substring-before(“child,sibling”,”,”)
This function can also be used in conjunction with the
name() function to get a node’s namespace prefix, which
solves the problem mentioned in the previous section.
Assuming the name() function returns me:myself, this
example will return me:
Example: substring-before(name(myself),”:”)
63
CHAPTER 3
Table 3.3 continued

substring-after This function is similar to the Right() function in
(string1,string2) VB.NET, although it doesn’t accept a length argument. It
acts like the previous function except it returns the portion
of string1 that occurs after string2. The example returns
the “sibling” string.
Example: substring-after(“child,sibling”,”,”)
substring(string, This function is similar to the Mid() function in VB.NET
number,number) or the Substring() function in C#. It returns the portion of
the input string starting at the character index specified in
the second argument with a length equal to the third argu-
ment. If no length is specified, the returned string will con-
tain the input string starting from the second argument to
the end of the string. The first character starts with 1 rather
than 0, like C#. The example returns the “ling” string.
Example: substring(“sibling”,4)
string-length(string) This function returns the number of characters in the sup- 3
plied string argument. Passing in an empty string results in
XPOINTER, AND
a value of 0 being returned. The example returns a value
XPATH,
XLINK
of 7.
Example: string-length(“Sibling”)
normalize-space(string) This function is similar to the Trim() function in VB,
although it also replaces internal sections of whitespace
with single spaces. The example returns “How are you?”
Example: normalize-space(“ How are you? “)
translate(string, This function performs character substitutions. Characters
from,to) that exist in the first two arguments are converted to the
corresponding characters found in the third argument. This
function is best understood by seeing an example. The
example shown below will return “HOW ARE YOU?”
Example: translate(“how are you?”,”aehouwy”,
”AEHOUWY”)
The next example will return “I AM fiNE”
Example: translate(“i am fine”,”amne”,”AMNE”)
64
Table 3.3 continued

translate(string, Characters can be replaced (just like VB.NET’s Replace()
from, to) (continued) function) by supplying the second argument with the char-
acter to replace and the third argument with the character
that it should be replaced with. For example, the following
code will return “how are you?”
Example: translate(“how-are-you?”,” -”,” “)
XPath Number Functions

Table 3.4 lists all number functions contained within the XPath recommendation. In situations
where a function argument cannot be converted to a number, NaN (“not a number”) will be
returned.
Table 3.4 Number Functions

number(value) This function converts the supplied argument to a number. If no
argument is supplied, the function defaults to a node-set that con-
tains only the context node. This function is not normally needed
because number conversions are handled on-the-fly when a num-
ber value is required. If a node-set is supplied, it is first con-
verted to a string that is then applied to the number() function.
The following example returns 0.
Example: number(‘0’)
This next example returns a 1.
Example: number(boolean(‘true’))
You’ll learn more about the boolean() function in the next
section.
sum(node-set) This function returns a number containing the sum of all numeric
values of nodes contained within the node-set argument.
Assuming that the numeric values of the child nodes found in the
following XPath statement include (1,2,3,4), the example would
return a calculated value of 10.
Example: sum(/root/parcelNumbers/*)
floor(number) This function returns the supplied argument rounded down to the
nearest integer. For example, if 10.4 were passed in, the return
value would be 10.
Example: floor(10.4)
65
CHAPTER 3
Table 3.4 continued

ceiling(number) This function is similar to the floor() function but rounds the
supplied argument up to the nearest integer. For example, if 10.4
were passed in, the return value would be 11.
Example: ceiling(10.4)
round(number) This function returns a number rounded to the nearest integer.
For example, 10.4 rounds to 10, whereas 10.5 rounds to 11.
Example: round(10.4)
XPath Boolean Functions

The XPath specification contains a total of five Boolean functions. Although all these are listed
in Table 3.5, you will more than likely only use two of the five functions on a constant basis
when working with XPath expressions.
Table 3.5 Boolean Functions

3
XPOINTER, AND
boolean(value) This function returns a Boolean value (true or false) depending
XPATH,
XLINK
on the value passed into the function. The conversion of a value
to Boolean normally occurs automatically as needed. However,
the boolean() function is useful to explicitly convert a value to a
Boolean when necessary. For example, if you have a node named
customerExists that contains the text “true,” you can convert this
value from a string to a Boolean by using the following syntax:
Example: boolean(/root/customerName/customerExists)
To convert a 0 (false) or 1 (true) to a Boolean, use the fol-
lowing syntax:
Example: boolean(1)
not(boolean) This function returns the opposite of what is passed into it. Thus,
if a true is passed in, a false will be returned and vice versa. The
example returns a true.
Example: not(false)
true() This function returns true. This can be used when a constant
Boolean value is needed in an XPath expression.
66
Table 3.5 Boolean Functions

false() This function returns false. This can be used when a constant
Boolean value is needed in an XPath expression.
lang(string) This function returns true or false based on whether the con-
text node’s language (as determined by the value of the xml:lang
attribute) equates with the supplied string argument. Assuming
that an xml:lang attribute is equal to “en” on a particular node,
the following example returns true if evaluated with that node as
the context node.
Example: lang(“en”)
XPath Abbreviation Examples

Having covered all the various functions available in the XPath specification, you’re probably
ready to move from the learning phase to the “let me see this in action” phase. Before jumping
into some detailed XPath examples, you’ll be interested to know that many of the location step
types mentioned earlier in the chapter have an abbreviated form that can save you time and
make your XPath expressions much easier to read. A few of the key abbreviations need to be
touched on briefly to prepare you for the samples. The following are some important XPath
expression abbreviations:
• The @ symbol is an abbreviation for the attribute:: axis: attribute::name can be
written as @name.
• The context node can be represented by a period character: self::* can be written as .
• A few of the previously mentioned functions, such as position(), can be left out of the
predicate expression: street[position()=3] can also be written as street[3].
• The root node can be accessed by typing /, and access to all children of the root node
can be gained by typing //* or descendant::*.
• An XPath axis such as child:: can normally be left out of an expression.
child::street can be written as simply street. Understand that leaving out the axis
syntax doesn’t mean that it isn’t there any longer. Every location step moves along a
defined axis. When no axis is specified, the expression is assumed to follow the child
axis.
• The abbreviated code to access a parent node is similar to accessing a parent folder on
the Windows file system. A parent node can be accessed by writing parent::* or ..
67
CHAPTER 3
The following are a few examples shown first with the non-abbreviated syntax and then fol-
lowed by the abbreviated syntax. Some of the examples include more than two XPath expres-
sions to help you see other ways of accomplishing the same task. The examples are based on
the following XML document:
<golfers>
<golfer skill=”excellent” handicap=”4” clubs=”Taylor Made” id=”1111”>
<name>
<firstName>Heedy</firstName>
<lastName>Wahlin</lastName>
</name>
<favoriteCourses>
<course city=”Pinetop” state=”AZ” name=”Pinetop Lakes CC”/>
<course city=”Phoenix” state=”AZ” name=”Ocotillo”/>
<course city=”Snowflake” state=”AZ” name=”Silver Creek”/>
</favoriteCourses>
</golfer>
<golfer skill=”moderate” handicap=”8” clubs=”Taylor Made” id=”2222”>
<name>
<firstName>Dan</firstName> 3
<lastName>Wahlin</lastName>
XPOINTER, AND
</name>
<favoriteCourses>
XPATH,
XLINK
<course city=”Pinetop” state=”AZ” name=”Pinetop Lakes CC”/>
<course city=”Pinetop” state=”AZ” name=”White Mountain CC”/>
<course city=”Springville” state=”UT” name=”Hobble Creek”/>
</favoriteCourses>
</golfer>
</golfers>
XPath Examples
Select the id attribute of the first golfer element:
/golfers/descendant::golfer[position()=1]/attribute::id
/golfers/golfer[1]/@id
Select all descendants of the golfer element. This includes the name and favoriteCourses
elements:
/golfers/descendant::golfer/*
//golfer/*
/golfers/golfer/*
68
Select all attribute descendants of the golfers root element:

//attribute::*
//@*
Select the name attribute of the third course element associated with the second golfer element:
/golfers/child::golfer[position()=2]/child::favoriteCourses/
➥child::course[position()=3]/attribute::name
/golfers/golfer[2]/favoriteCourses/course[3]/@name
//golfer[2]/favoriteCourses/course[3]/@name
Select the first golfer element and all children. This selects the name and favoriteCourses
elements:
/golfers/child::golfer[position()=1]/child::*
/golfers/golfer[1]/*
Select the second golfer element’s course element having a name attribute equal to the first
golfer element’s first course element name attribute (a little confusing unless you break it
down into steps):
/golfers/child::golfer[position()=2]/child::favoriteCourses/child::course/
➥attribute::name[. =
/golfers/child::golfer[position()=1]/child::favoriteCourses/
➥child::course[position()=1]/attribute::name]
/golfers/golfer[2]/favoriteCourses/course/@name[. = /golfers/golfer[1]/
➥favoriteCourses/course[1]/@name]
Select the last course element’s name attribute that is located under the second golfer
element:
/golfers/child::golfer[position()=2]/favoriteCourses/course[last()]/@name
//golfer[2]//course[last()]/@name
/golfers/golfer[2]/favoriteCourses/course[last()]/@name
Select the city attribute of all course elements that have a name attribute starting with the
letter “P”:
/golfers/child::golfer/child::favoriteCourses/child::course[starts-with
➥(@name,’P’)]/attribute::city
/golfers/golfer/favoriteCourses/course[starts-with(@name,’P’)]/@city
Select the first golfer element’s skill attribute and capitalize it:
translate(/golfers/child::golfer[position()=1]/attribute::skill,
➥‘abcdefghijklmnopqrstuvwxyz’,’ABCDEFGHIJKLMNOPQRSTUVWXYZ’)
translate(/golfers/golfer[1]/@skill,’abcdefghijklmnopqrstuvwxyz’,
➥‘ABCDEFGHIJKLMNOPQRSTUVWXYZ’)
69
CHAPTER 3
Count the number of golfer elements:

count(/golfers/child::golfer)
count(//golfer)
After seeing the previous XPath statements, you may be wondering how you can test them to
see if they actually work. You may also want to practice creating some of your own XPath
statements. Fortunately, an excellent browser-based tool for testing XPath statements was
created by Aaron Skonnard of DevelopMentor. The tool can be downloaded from http://
staff.develop.com/aarons/xmllinks.htm. Figure 3.1 shows an example of the tool in
action.
XPOINTER, AND
XPATH,
XLINK
FIGURE 3.1
Using the XPath Interactive Expression Builder.
XPointer—Accessing XML Document Fragments

Earlier in the chapter it was mentioned that the XPath recommendation was jointly developed
by several W3C working groups. One of these contributing groups is also responsible for the
XPointer language (http://www.w3.org/TR/xptr). Their participation was beneficial not only
to the XPath language, but also to their own because XPointer leverages XPath’s capabilities.
XPointer stands for XML Pointer. When combined with XLink and XPath, the XPointer lan-
guage opens up many exciting possibilities for information retrieval from dispersed XML doc-
uments. The W3C XPointer Candidate Recommendation makes the following statement about
XPointer’s purpose:
70
XPointer supports addressing into the internal structures of XML documents. It allows for
examination of a document’s hierarchical structure and choice of its internal parts based on
various properties, such as element types, attribute values, character content, and relative posi-
tion. In particular, it provides for specific reference to elements, character strings, and other
parts of XML documents, whether or not they bear an explicit ID attribute.
Many of the concepts mentioned in this statement strongly resemble functionality already
available in the XPath language, which may lead you to believe that much is duplicated
between the two. However, this is not the case. XPointer simply extends XPath’s functional
utility, as you’ll see in the following sections. Your current XPath knowledge will be applicable
to many XPointer concepts.
XPointer Basics
At this point you may be wondering what the difference is between XPath and XPointer.
Because both seem to allow access to specific items found within an XML document, can
XPointer be used for something different? Obviously the answer to this question is “yes,” given
all the work that has gone into creating the language. XPointer differs from XPath because it
provides a mechanism for accessing specific content, referred to as document fragments,
located in local or remote documents. This content isn’t limited just to nodes. XPointer even
allows individual characters in a text string to be selected.
Imagine having an XML document containing research data that is composed of three sections,
including an introduction, body, and conclusion. The body and conclusion portions of the doc-
ument do not exist within this XML document. They are located externally within two XML
documents that contain information other than simply body or summary text.
Using XPath alone, there is no mechanism for gaining access to the body and summary text in
these remote documents. XPath is designed to locate specific nodes within a local XML docu-
ment. This is where XPointer comes into play. Using portions of XPath syntax, XPointer
allows the body and summary text sections to be gathered from the remote documents and
placed into the current XML document. All this can be done without having to load the entire
remote document (or documents in this case). Comparing this functionality to HTML, combin-
ing these languages allows you to link to a specific area of a document and “grab” its content
without having to load the entire page and then scroll to the content as is done with HTML
anchors.
The capability to access data down to the individual character level opens up a whole new
world for data sharing and exchange. Before getting too much further into the details, let’s
cover a few key XPointer definitions:
71
CHAPTER 3
Term Definition
sub-resource The portion of the XML document that XPointer identifies. Although
the entire XML document would be considered the resource, the
portion being accessed by XPointer would be the subresource.
point The W3C defines a point simply as “A position within XML infor-
mation.” A point may represent a node within an XML document or
simply a specific location within a node’s XML character data. The
combination of a container node and index value define a point’s
location. A point may be a position before any character or may pre-
cede or follow a node. A point can never exist on a character,
though, only before or after it.
As an example, the “T” character in the following document frag-
ment has a container of type element and has an index of 0. The “e”
would have an index of 1 and the “d” character an index of 2:
<golfer>
<firstName>Ted</firstName>
</golfer>
character point A point with a container node that has no child nodes. Examples 3
include a point within text, comments, processing instructions, and
XPOINTER, AND
so on.
XPATH,
XLINK
node point A point with a container node that has child nodes.
range A selection of XML content located between two end points. When
you highlight a section of text in a program (such as Word) using
the cursor, the selected area could be considered a range because it
has starting and ending points.
collapsed range A range that has equal start and end points. For example, if we start
a point at character three and end a point at character three, the
ranged is considered to be collapsed.
location Similar to a node in XPath. However, because XPointers can begin
in one paragraph and end in another and do not have to specify
nodes in particular, a location can be a node, point, or range.
location-set This is similar to a node-set created by an XPath expression.
However, aside from including nodes, a location-set can also
include points and/or ranges.
singleton This term is reserved for situations where one range, string range, or
single node is located by an XPointer. Multiple areas of XML con-
tent identified by an XPointer are not considered to be singletons.
fragment identifier An XPointer identifier (expression) that has an escaped URI.
Escaped refers to replacing characters such as < with their URI
escape value, which would be < in this case.
72
Points and ranges are used when working with XPointer expressions. Points determine where
to start and end a search for XML content within a document, and a range contains the XML
content located between two points. The syntax to create XPointer expressions using two
points is as follows:
http://www.someURL.com/someDOC.xml#xpointer(<point1> to <point2>)
This syntax would link to the range specified between point1 and point2 and load the entire
document. You’ll notice that this is similar to how the HTML anchor currently works, although
XPointer allows for much more flexibility, as you’ll see in the next few sections. After parsers
support this feature, you’ll be able to pick out a section of XML within a remote document
with pinpoint accuracy.
How XPointer Extends XPath

To avoid duplicating functionality already available in the XPath language, XPointer leverages
XPath where possible. However, it does extend XPath’s functionality in the following ways:
• XPointer adds the point and range location types to the existing XPath node-set. As a
result, an XPointer node test can include a test for a point or range in a location-set as
well as the other node-set tests found in XPath.
• XPointer builds on the concept of an XPath node-set by adding the locator-set concept,
which includes points and ranges in addition to the items included in an XPath node-set.
• The context in which XPath expressions are evaluated are governed by XPointer rules.
• XPointer adds several new functions, including string-range(), range-to(), range(),
range-inside(), here(), origin(), start-point(), end-point().
XPointer Functions
The additional functions available in the XPointer language along with their corresponding
descriptions are listed in Table 3.6. One interesting point about these functions is that they can
be called after a / character in a location step (that is, node/function()). This is different
from the XPath language where calling a function after the / character would normally result
in the parser raising an error (exceptions to this exist, such as the count() and sum() func-
tions to name just two).
Table 3.6 XPointer Functions

range-to(expression) This function returns a range for each location in the con-
text. The example returns the range from an element with
an id of 1111 to another element with an id of 2222.
73
CHAPTER 3
Table 3.6 continued

Example: xpointer(id(“1111”)/range-
to(id(“2222”)))
This next example returns a range from the first golfer
element to the third golfer element:
Example: xpointer(golfer[1]/range-to(golfer[3]))
string-range(location-set, This function searches the string-value of a location for a
string,number,number) substring that matches the string argument and returns
range locations for any occurrences. The first argument
accepts a location-set (//golfer), and the second argu-
ment accepts the string to search for. The third and fourth
number arguments are optional but function much like
the number arguments found in a C# Substring() func-
tion. The third argument tells where to start the returned
range, and the fourth argument specifies the number of
characters in the range. The example would return a
range starting with the fourth occurrence of every 3
“XML” string found in the title element. The range
XPOINTER, AND
will start just after the “L” character (four characters in)
XPATH,
and contain the five characters that come after the fourth
XLINK
(represented by [4]) “XML” string:
Example: string-range(//title,”XML”,4,5)[4]
range(location-set) This function returns ranges that match up with the loca-
tions specified in the location-set argument. The example
would return ranges for all folder elements that have an
attribute named path starting with the letter “D.” The
range starts just before the folder node’s beginning tag
and ends just after its ending tag:
Example: range(//folder[starts-with(@path,”D”)])
range-inside(location-set) This function returns ranges that match up with the con-
tent specified in the location-set argument. This is similar
to the innerText property found in DHTML, which also
returns the text between a start and end tag. It differs
from the range() function in that it includes only a
node’s content, not the node and its content. The example
would return ranges for all folder elements that have an
attribute named path starting with the letter “D.” The
range starts just after the folder node’s beginning tag and
ends just before its ending tag:
74
Table 3.6 continued

range-inside(location-set) Example: range-inside(//folder[starts-
(continued) with(@path,”D”)])
start-point(location-set) This function returns a location-set containing the start-
ing point for each range within the location-set argument.
Here’s how the start-point() function handles different
argument types:
If start point is passed in, that point is returned
unchanged.
When a range is passed in, the starting point of the range
is returned.
When the argument is an element, some text, a comment,
or a processing instructions, the argument is converted to
a range and the starting point of that range is returned.
If the argument is an attribute or namespace, a syntax
error is signaled by the function.
Example: start-point(//folder)
end-point(location-set) This function returns a location-set containing the ending
point for each range within the location-set argument.
The details on this function are the same as the start-
point function, except that an ending point is always
returned rather than a starting point.
Example: end-point(//folder)
here() This function returns a location-set containing the loca-
tion of the XPointer that is being evaluated. In simpler
terms, the here() function returns the element containing
the XPointer itself, either as a text element or an
attribute. The example allows the XPointer expression to
point to locations relative to the location of the XPointer
expression itself.
Example: #xpointer(here()/books)
origin() This function is useful only when used within the context
of XLink (discussed next) to link XML documents. It
returns a location-set containing the location of the origi-
nating link.
75
CHAPTER 3
XPointer Errors
Errors in XPointer can be handled differently than errors in XPath. In situations where an
XPath expression finds no matching locations within the XML document, an empty node-set is
returned and no error is raised. However, XPointer must be stricter with error handling because
it allows access to remote documents. XPointer is required to raise an error if a fragment iden-
tifier (expression) returns no results. The following is a summary of the three errors that can be
associated with XPointer and their associated description:
Error Description
Syntax Error This error occurs when an XPointer expression (also
called a fragment identifier) contains incorrect syntax.
Resource Error This error occurs when a resource document does not
contain well-formed XML. The fragment identifier
(XPointer expression) may be correct but be unable to
create a point or range because of resource document
problems.
Subresource Error This error occurs when an XPointer expression is syntac-
tically correct, the resource contains well-formed XML, 3
but no locations are returned. As mentioned, this type of
XPOINTER, AND
error does not occur in XPath, because empty node-sets
XPATH,
are acceptable there.
XLINK
XPointer Examples
As the XPointer language is moved to the W3C recommended status, more and more examples
of using it with supporting parsers will become available. Until then, here are a few examples of
using XPointer. The examples are prefixed by a remote resource name of www.someURL.com:
Identifying Fragments Using Bare Names

Select a node with an id of chapter1 that is found in the document named
remoteDocument.xml. This example shows how to use “bare names,” which is a shorthand
method that saves you from having to use the id() function. You’ll notice that this is very sim-
ilar to using an HTML anchor:
http://www.someURL.com/remoteDocument.xml#chapter1
This can also be written as:

http://www.someURL.com/remoteDocument.xml#xpointer(id(“chapter1”))
76
Identifying Fragments Using Child Sequences

XPointer allows nodes to be accessed by providing child sequence numbers as shown next:
http://www.someURL.com/remoteDocument.xml#/1/3/4
The #/1/3/4 syntax tells XPointer to go to remoteDocument.xml and access the first element
found there. After finding this element, find its third child element. Now choose this element’s
fourth child element.
Identifying Fragments Using XPointer Expressions

If every element of every XML document contained an id type attribute (as defined in the
DTD), accessing XML content would be much easier. However, because many XML docu-
ments don’t have id attributes, a more detailed way of accessing XML content must be used.
This is where XPointer expressions are useful. When bare name and child sequence methods of
accessing XML content are not specific enough, an XPointer expression can be created. This
expression will select the range between the first and fourth golfer nodes:
http://www.someURL.com/remoteDocument.xml#xpointer(/golfers/golfer[1]/
➥range-to(/golfers/golfer[4]))
The following example will pull the content between the first and third sibling nodes:
http://www.someURL.com/remoteDocument.xml#xpointer(//father/sibling[1] to
➥//father/sibling[3])
The next example will return all ranges within the /golfers/golfer/name context that contain
a substring equal to “Joe”:
http://www.someURL.com/remoteDocument.xml#xpointer(string-range(/golfers/
➥golfer/name,”Joe”))
Finally, the following document will first try to find a golfer element’s fourth name node. If
that location cannot be found, it will try to find the last name node:
http://www.someURL.com/remoteDocument.xml#xpointer(//golfer/name[4])
➥xpointer(//golfer/name[last()])
XLink—Resource Relationship Management

The XLink specification (http://www.w3.org/TR/xlink/) adds several additional pieces of
functionality that make it easier to access external resources and maintain complex linking
relationships. At press time, the XLink specification was in “Proposed Recommendation” sta-
tus, so the majority of the concepts covered in this section are likely to remain unchanged.
After the specification reaches “recommended” status at the W3C, you should start to see
parsers or applications on the market that support XLink.
77
CHAPTER 3
You may recall a short discussion we had concerning HTML linking limitations at the begin-
ning of this chapter. Although HTML links work quite well in a browser, some of the limita-
tions found in the HTML link include the following:
• HTML allows links only between two resources.
• HTML links have to be embedded in the originating link’s file. This can cause mainte-
nance headaches.
• HTML anchors can help locate specific information within a document but require that
the entire page be loaded. This wastes bandwidth.
• HTML links are unidirectional. They can point only in one direction.
• The HTML link has few attributes for customizing linking capabilities based on different
linking needs.
• Updating a target document with anchors is not possible without having “write” access
to that document.
• HTML links require specific element tags to work (<a> for example). This makes it diffi-
cult to add linking capabilities to other elements.
A few of these limitations are addressed by the XPointer language because it allows specific 3
content fragments to be accessed in remote XML documents. Although this is a great step for-
XPOINTER, AND
ward, the old HTML link can be improved on in many other ways. XLink strives to make these
XPATH,
XLINK
improvements a reality by adding new linking capabilities to documents that comply with the
XML version 1.0 recommendation.
XLink Basics
The XLink specification doesn’t actually mention linking different documents. Rather, it
defines a structure that consists of links between “resources.” These resources could be XML
documents, media files (images, movies, or music), database information, or any other type of
resource that needs to be referenced through a link.
Adding more functionality to any type of established item, such as the HTML link, is often-
times associated with an overall increase in complexity. This is not necessarily the case with
XLink. After you understand its keywords, elements, attributes, and functionality, you’ll see
that it looks just like regular XML.
Although the XLink language allows you to define links the same way as you do in HTML, it
does bring with it more advanced attribute and element definitions that help in linking two,
three, or more resources in a uniform manner. Understanding the various attributes and defini-
tions that come with the XLink language is important to leverage its linking power.
78
Before learning about the attributes and elements that exist in XPath, let’s take a look at some
examples of what XPath syntax looks like. First let’s write out the syntax for what is coined a
“simple link.” A simple link functions the same as a link found within an HTML page:
<golfer xmlns:xlink=”http://www.w3.org/1999/xlink/”>
<name xlink:type=”simple” xlink:href=”http://www.someURL.com/row.aspx”>
John Doe
</name>
</golfer>
The following is an example of an extended link containing four different resources. This type
of linking is not possible using HTML’s anchor tag:
<xlink:extended
xmlns:xlink=”http://www.w3.org/1999/xlink/”
role=”foursome”
title=”Golfing foursome”
>
<xlink:locator href=”dad.xml”
role=”golfer1”
title=”First golfer in foursome”
/>
<xlink:locator href=”aaronhorne.xml”
role=”golfer2”
title=”Second golfer in foursome”
/>
<xlink:locator href=”toddwahlin.xml”
role=”golfer3”
title=”Third golfer in foursome”
/>
<xlink:locator href=”michellehorne.xml”
role=”golfer4”
title=”Fourth golfer in foursome”
/>
<xlink:arc from=”golfer1”
to=”golfer2”
actuate=”onRequest”
show=”embed”
/>
<xlink:arc from=”golfer3”
to=”golfer4”
actuate=”onRequest”
show=”embed”
/>
</xlink:extended>
79
CHAPTER 3
I’ll defer an in-depth discussion of the different elements and attributes being used here until
later in the chapter. You can see, however, that after you understand a few basic concepts con-
cerning new elements and attributes defined in the XLink specification, implementing the
XLink language isn’t much different than working with normal XML documents.
XLink Keyword Definitions

Table 3.7 contains many XLink keywords that will become useful as you learn more about cre-
ating links.
Table 3.7 Keywords in XLink

Term Description
Arc An arc specifies how links are traversed. In the case where multi-
ple resources are being linked, the arc can specify which direc-
tion to follow between links with the to and from attributes. In
the case of a simple link, the arc is not necessary because it’s
obvious which direction the link flows.
Ending-resource In cases where two resources are specified to have a linking rela- 3
tionship, this is the resource identified in the to attribute.
XPOINTER, AND
Example: <xlink:arc from=”resource1” to=”resource2”/>
XPATH,
XLINK
Extended link An extended link associates many local and/or remote resources
together. The link is considered inline if any of the resources are
local and considered out-of-line if none are local.
Inline link Defined by the W3C as “A link where some content in the link-
ing element serves, by virtue of its presence inside the linking
element, as a participating resource.” Use of the <a> tag in
HTML is considered an inline link.
Locator Data used to identify a resource in a link. Identification occurs
by specifying a URI or other type of identity.
Linkbase A file external of any resource that contains an extended link.
This type of file is similar to an include file in ASP.NET because
it helps maintain specific information referenced by many
resources in one location.
Multidirectional link A multidirectional link can be started by any of the resources
participating in the link. As such, the HTML link would be
excluded from this definition because it can initiate only a one-
way link (hitting the Back button doesn’t count).
80
Table 3.7 continued

Term Description
Out-of-line link When a link starts from a resource other than the document con-
taining the link, the link is considered out-of-line. This type of
link includes only resources that are remote. An out-of-line link
exists when a document not under your control (you do not have
write permissions to it) is included in a link. This type of link is
necessary for remote links to initiate other links.
Simple link A simple link is very similar to a normal HTML link in that it
specifies a one-way link between two resources. Because one of
these two documents is always local, simple links are always
considered to be inline links.
Traversal A traversal occurs when a resource identified in a link is
accessed. Traversal can be started by a user or from an
application.
Now that you’re familiar with a few of XLink’s definitions and keywords, let’s take a look at
some attributes that are used in building simple and extended links.
XLink Attributes
The XLink language specifies several attributes that add various linking capabilities. A listing
of these attributes with their corresponding description can be found in Table 3.8.
Table 3.8 XLink Attributes

Attribute Description
xlink:arcrole The arcrole attribute is a semantic attribute used to describe the
meaning of a resource (or a property of a resource) based on the
context the link is viewed in. It must contain a URI reference.
The W3C explains the arcrole attribute in the following manner:
A resource might generically represent a “person,” but in the
context of a particular arc it might have the role of “mother”
and in the context of a different arc it might have the role of
“daughter.”
xlink:href Specifies the location to a remote resource. The XLink specifica-
tion defines this as containing a URI reference.
81
CHAPTER 3
Table 3.8 continued

xlink:label This traversal attribute is used to provide a way for arc elements
to identify a resource or locator element based on its label. It
can be used only on the resource or locator elements. This is
similar to using the id=”. . . ” attribute on HTML elements
when working with DHTML.
xlink:type This determines the link type to be used. Possible values include
simple, extended, locator, arc, resource, title, none. You’ll
learn more about what the extended, locator, arc, and title
values accomplish. As far as the resource value, it can be used
as an attribute value to define link participants that are available
locally (rather than remotely). Content enclosed within the
resource type element would be considered “local.”
xlink:role This attribute must contain a URI reference and is used to indi-
cate a property that a resource has.
xlink:title This attribute contains content that is intended to be read by a
human. The title attribute helps give information about the 3
link.
XPOINTER, AND
xlink:show This determines how content identified by a link will be shown.
XPATH,
XLINK
The possible values are replace, embed, new, other, and none.
The replace value functions in the same manner as linking
between two pages in HTML, whereas the embed value would be
similar to the function the <img> tag element performs.
xlink:actuate The actuate attribute specifies how the link should be traversed
from a timing perspective. Rather than always being triggered by
human interaction, the link can be set to be triggered by an appli-
cation. Possible values include onRequest, onLoad, other, and
none.
xlink:to This attribute is used only with an arc element in an extended
link to specify the end point of a link. Its value is equal to the
label attribute on a resource or locator type element.
xlink:from This attribute is used only with an arc element in an extended
link to specify the start point of a link. Its value is equal to the
label attribute on a resource or locator type element.
82
The XLink Simple Link

The HTML link that you’re already familiar with is actually an example of a simple link in the
XLink language. It provides a way for a single document to link to another document. This
type of link is unidirectional because the initiating page knows about the destination page, but
the destination page is not actually participating in the link. The destination page simply dis-
plays its content in the browser. It’s true that you could hit the browser’s Back button, but the
destination document still isn’t actually participating in the link. Because the simple link
includes a local resource (one in which you created the link), it is always considered to be an
inline link. Though limited in flexibility, the simple link has helped place the Web into the
limelight and has made it an essential resource for businesses and individuals alike.
Adding a simple link to an XML document that is part of an XLink-aware application is as
easy as adding a few attributes to an existing XML element. Note that the XLink qualified
namespace must be added to the XML document:
<root xmlns:xlink=”http://www.w3.org/1999/xlink>
<book>
<chapter xlink:type=”simple” xlink:href=”http://www.thisBook.com”>
Chapter 1
</chapter>
</book>
</root>
The xlink:type attribute is specified as “simple” and the xlink:href lists the link URI. This
method of declaring a link is more efficient than the current HTML linking implementation. To
understand why, imagine trying to add linking capability to a <p> tag in HTML without wrap-
ping it with the <a> and </a> tags (or using scripting). Using attributes allows a link to be
added to any element quite easily.
Although you learned a little about XLink attributes earlier, let’s take a look at two interesting
attributes that should prove to be very useful in applications that implement XLink in the
future. The actuate attribute can take on values of onLoad, onRequest, other, and none. The
onRequest value means that the link is traversed when the user requests it. The onLoad value is
used to traverse a link immediately after a document completes loading. When the show
attribute is equal to embed and actuate is equal to onLoad, a useful way of embedding content
from outside resources can be accomplished. When combined with XPointer, these technolo-
gies could be used to create the research XML document discussed earlier that linked to intro-
duction, body, and summary sections located in remote documents.
83
CHAPTER 3
XLink Extended Links

Extended links perform the obvious task of “extending” the capabilities of a simple link by
allowing multiple resource links to be managed rather than just one. Like the simple link,
extended links can be created by using XLink attributes placed on existing XML elements. The
W3C gives the following definition:
An extended link is a link that associates an arbitrary number of resources. The participating
resources may be any combination of remote and local.
The XLink specification also mentions that if all the resources are remote (not editable), the
link is considered to be out-of-line. If any of the identified resources are local (editable), then
the link is considered to be inline.
Figure 3.2 shows a simple representation of an extended link.
Resource 3 Resource 4
XPOINTER, AND
XPATH,
XLINK
FIGURE 3.2
Extended link relationships.
At first glance, this figure may look very similar to simple links between resources, as in
HTML. However, the extended link includes all these resources in a relationship and then spec-
ifies how the links should be traversed. Listing 3.2 shows how these links could be written
using extended link XPath syntax. It’s important to know that the extendLink and loc element
names were chosen to make the sample easier to understand. These element names are not part
of the XLink specification. In fact, any element name could be used.
84
Listing 3.2 An Extended Link

1: <extendedLink xlink:type=”extended”
2: xmlns:xlink=”http://www.w3.org/1999/xlink”
3: role=”http://www.someURL.com/resources/roles”
4: title=”Resource Manager”>
5: <loc xlink:type=”locator” href=”resource1.xml” label=”resource1”
6: title=”resource1 contents”
7: />
10: />
13: />
16: />
17:</extendedLink>
The extended element type shown in Listing 3.2 acts as a container for four different resource
links. It contains some of the same attributes you saw in the simple element type shown ear-
lier, and it also declares the XLink qualified namespace. Inside this container are four locator
elements that specify the URI to each resource. Going back to Figure 3.2, the small boxes
located on each resource would represent that resource’s locator.
Although the code shown in Listing 3.2 specifies the locators for each resource and groups
each resource in an extended container, it does not say how links between the resources should
be traversed. Should Resource 1 link to Resource 3? Should Resource 4 link to Resource 3?
The listing doesn’t mention anything about the way the resources should be linked.
To accommodate the need for specifying how links should be traversed, the XLink language
introduces the arc concept. An arc can be thought of as a traversal path that specifies how to
link from one resource and to another. Listing 3.3 adds on to Listing 3.2’s code by adding
information about link traversals through the use of arc element types.
Listing 3.3 An Extended Link with Arcs

1: <extendedLink xlink:type=”extended”
2: xmlns:xlink=”http://www.w3.org/1999/xlink”
3: role=”http://www.someURL.com/resources/roles”
4: title=”Resource Manager”>
7: />
85
CHAPTER 3

10: />
13: />
16: />
17: <go xlink:type=”arc” from=”resource1” to=”resource2” show=”replace”
18: actuate=”onRequest”
19: />
22: />
25: />
26:</extendedLink>
3
XPOINTER, AND
By specifying the arc path, you now know the locations of four resources as well as how each
link between the resources should be traversed. The arc elements listed previously tell the
XPATH,
XLINK
extended link that resource1 links to resource2, which links to resource3, which links to
resource4. If you had wanted resource4 to link back to resource1, you could have applied
another XLink type attribute to an element and specified the additional arc path using the from
and to attributes.
Any variety of arc paths can be specified as long as all paths are unique. All this is done with-
out having to actually add a physical link inside of resource3 that goes to resource4, as is
the case when you link to a resource that you have no control over. Exactly how will
resource3 link to resource4 if the link isn’t physically embedded in resource3? To answer
this question completely, we will have to wait for the first XLink-capable applications to
appear on the scene and analyze how this feature is implemented.
One interesting point made in the XLink specification is that if a from or to attribute is miss-
ing in an arc type element, an arc will be created in the appropriate direction to all locator type
elements with label attributes located within the extended link grouping. To clarify this less
intuitive concept, the W3C provides the following example:
<extendedlink xlink:type=”extended” xmlns:xlink=”http://www.w3.org/1999/xlink”>
<loc xlink:type=”locator” xlink:href=”...” xlink:label=”parent”
➥xlink:title=”p1”/>
86
<loc xlink:type=”locator” xlink:href=”...” xlink:label=”parent”

➥xlink:title=”p2”/>
<loc xlink:type=”locator” xlink:href=”...” xlink:label=”child”
➥xlink:title=”c1”/>
<go xlink:type=”arc” xlink:to=”child” />
</extendedlink>
Based on each locator type’s title attribute, the following arcs would be assumed because there
are missing from and to attributes in the arc type element:
p1-c1, p1-c2, p1-c3, p2-c1, p2-c2, p2-c3, c1-c1, c1-c2, c1-c3, c2-c1, c2-c2, c2-c3,
c3-c1, c3-c2, and c3-c3
As you can see, this rule causes a type of cross-join to occur that involves each locator type
element. Although all possible arcs are pursued, the uniqueness of each arc is maintained.
The extended link automatically allows links between resources to be organized and grouped.
For this extended link to be useful, it would need to be embedded in one of the resources. This
is called an inline extended link because the resource that contains the embedded link informa-
tion would be editable, in contrast to a third-party read-only resource. After the extended link
is inserted, the relationships between the different resources specified in the locator type ele-
ments can be managed.
Linkbases and External Linksets

In a perfect world, links within an HTML document have the capability to be changed without
having to edit the actual document itself. HTML alone does not allow for this, although you
could, of course, maintain all links for an ASP.NET page in a database or XML document if
you desired. The XLink authors knew that maintaining links in HTML is difficult and they
built in the capability for resource links to be separated from the actual resources. A file
referred to in the XLink specification as a “Linkbase” can be in a location that is separate from
the actual resources. The contents of this file would contain one or more extended links.
Because the extended link isn’t actually found within a resource, it is called an “external
linkset.” Figure 3.3 depicts this.
As Figure 3.3 shows, each resource is tied together by the external linkset. This linkset con-
tains locator information about each resource and arc statements that specify how the links
should be traversed, just as you saw earlier. Separating the actual link information from the
resources is useful because it allows you to manage a list of links externally making overall
maintenance much easier. To add a new link, you would simply add the resource’s locator to
the external linkset document and provide an arc. No other documents need updating!
87
CHAPTER 3
External
Linkset
FIGURE 3.3
An out-of-line extended link.
For an external linkset document to be used, a resource needs to know where the external 3
linkset document is located. This is accomplished by placing code similar to the following into
XPOINTER, AND
an inline resource:
XPATH,
XLINK
<basesloaded xlink:type=”extended” xmlns:xlink=”http://www.w3.org/1999/xlink”>
<startrsrc xlink:type=”locator” xlink:label=”spec” xlink:href=”spec.xml”/>
<linkbase xlink:type=”locator” xlink:label=”linkbase” xlink:href=
➥”linkbase.xml”/>
<load xlink:type=”arc” xlink:from=”spec” xlink:to=”linkbase”
xlink:actuate=”onLoad”
xlink:arcrole=”http://www.w3.org/1999/xlink/properties/linkbase”
/>
</basesloaded>
Maintenance of links suddenly becomes much easier through the use of linkbases and external
linksets.
Putting XLink Together

Determining when to use the different XLink attributes can be somewhat confusing. Fortunately,
the W3C created the following information table that specifies when attributes are required (R)
and when they are optional (O). To see which attribute can be used with a given element type,
look down each column. If a cell is empty, you can’t use the attribute with the element type.
Refer to the table when you can’t remember much because it’s late at night and you’re fever-
ishly working to finish up an XLink project!
88
simple extended locator arc resource title

type R R R R R R
href O R
role O O O O O
arcrole O O
title O O O O O
show O O
actuate O O
from O
to O
label O O
Summary
In this chapter we covered a lot of technologies related to XML. Although the XPointer and
XLink languages have yet to be implemented fully, they are promising technologies that will
make developing with XML even more powerful. In Chapters 6 and 7, you will see how useful
the XPath language can be to access data contained within an XML document using the DOM
or XSLT. However, before jumping into those chapters, Chapter 4 takes a look at XML DTDs
and schemas.
Understanding DTDs and CHAPTER
4
XML Schemas
IN THIS CHAPTER
• Why Use DTDs or Schemas? 90
• Do I Really Need To Validate? 91
• DTD Basics 93
• XML Schemas—Looks a Lot Like XML! 101
• Using the Schema Keyword with Namespaces

in XML-DR Schemas 105
• The W3C XML Schema 114

90
Why Use DTDs or Schemas?

By this point in the book you should have a good feel for how XML documents are created
and how flexible and extensible they can be. XML is so flexible that many other languages,
including XPointer and XLink (to name just two), have been modeled after it. Although flexi-
bility and extensibility are certainly beneficial, too much of either can be detrimental if left
uncontrolled.
Imagine writing an ASP.NET page that uses whatever keywords you want. Instead of using if
or switch statements to control program flow, you decide to make up your own version of the
if statement and call it doThisIf. Although your keywords and logic may make perfect sense
to you, no one else can use the code unless you explain to them the rules of how it works. In
addition, to make your keywords work on your (or any other) Web server, you would need to
write your own ASP.NET compiler that recognized the keyword additions you dreamed up.
Although the preceding example is certainly far fetched, this same type of logic applies to cre-
ating XML documents if you plan on sharing them with anyone other than yourself. Unless
other users know and can depend on a specific syntactical document structure, they will have a
hard time using your document even if it is well formed. For example, putting a person’s
address in a tag named address that has child tags named street, city, and state may make
perfect sense to you. But the other party may expect that the address information you provide
them is contained in a tag named deliveryAddress with child tags named address1,
address2, city, and state. Unless these issues are cleared up between you and the other
party, data exchange may prove difficult and frustrating.
To solve this problem, the XML 1.0 specification includes the Document Type Definition
(DTD). The DTD was born out of the strict rules that accompanied XML’s parent language,
SGML. A DTD acts as a syntax guide to an XML document by declaring rules that must be
followed for the document to be valid. Remember that all XML documents must be well
formed, but they do not have to be valid to be parsed, assuming no DTD is present. Going back
to the previous example involving addresses, a DTD would help the party receiving your XML
document to understand the document’s structure and be assured that the structure follows the
rules as defined in the DTD. If you decide to add a tag that isn’t included in the DTD, a vali-
dating parser will raise an error. Having this validation will allow any parties receiving your
document to write an application based on your document’s defined set of rules.
Although DTDs have excellent parser support and provide a rules-based way of validating
XML documents, they do have their weaknesses. Some of these weaknesses include the
following:
• DTD syntax is difficult to read and can be somewhat difficult to write. This stems from
its SGML lineage. Like any language, after you learn the basics of creating DTDs, they
aren’t that difficult to work with.
Understanding DTDs and XML Schemas
91
CHAPTER 4
• Determining when and when not to use quotes and other attributes can be confusing.
• DTD syntax does not follow or conform to the well-formed XML rules established in the
XML 1.0 specification.
• DTDs have very limited support for data typing.
• A DTD is very restrictive when defining the order of elements under a parent element.
• Because a given XML document can have only one external reference to a DTD, name-
space problems can arise.
• Programmatically accessing information contained within a DTD is too complex.
• DTDs have no built-in support for namespaces.
Even with their weaknesses, DTDs have wide industry support and are still are very important
in the world of XML. DTDs also have excellent support for entities. However, because of
some of the inherent weaknesses previously mentioned, an alternative to DTDs has been devel-
oped by the W3C. This alternative is termed “XML Schemas.”
Microsoft’s MSXML parser first began support for XML Data - Reduced schemas (XML-DR).
These schemas offer XML-like syntax, stronger data typing, and more flexibility in describing
an XML document’s structure and rules. Although Microsoft’s schema language differs from
the W3C’s XML schema, they both look and act a lot like XML, as you’ll soon see. Before
entering the world of DTDs and schemas, let’s take a look at when validation should and
should not be used when working with XML documents.
Do I Really Need To Validate?

In Chapters 2, “XML for ASP.NET Basics,” and 3, “XPath, XPointer, and XLink,” you worked
with XML documents that were well formed but not valid. Opening any of these documents in 4
Internet Explorer 5 or later proves to be no problem. Because none of the documents are truly
DTDS AND XML

UNDERSTANDING
valid, why doesn’t the XML parser return an error when the document is opened? The answer
SCHEMAS
is that they are well formed, but because they do not contain any type of a DTD or schema ref-
erence, they are not validated. In addition to this, the XML parser built into Internet Explorer 5
or later does not validate XML documents by default, even if they do have a DTD associated
with them. Validation must be turned on programmatically.
With that in mind, it’s important to know that all XML documents do not have to be valid to
be considered useful documents. Many situations may occur where an XML document is
developed that will not be exchanged with internal or external parties and therefore doesn’t
need validation. As long as the creator of the document knows what it contains, any applica-
tions he or she writes will function.
In fact, validating a document can slow down the parsing process because the parser must
ensure that everything is kosher with the document’s structure. Validation also makes the
92
document less flexible for future changes unless the DTD or schema is updated to reflect new
needs. As an example, assume that you create an application that gathers data from an XML
document. Rather than using a mechanism such as XPath to query elements by name, your
application recursively loops through the document and writes out the data it contains to a Web
page. Any new elements that are added cause no problems because they are included in the
looping process. Although this type of application admittedly may not have a lot of utility, it
definitely doesn’t require a DTD or schema to work correctly. Chapter 6, “Programming the
Document Object Model (DOM) with ASP.NET,” takes a look at a sample application that uses
XML to create a menu system very quickly on the server side. Because the document is itself
quite simple and needs to be accessed quickly, a DTD or schema will not be used—simply
because it isn’t necessary.
XML documents that do have an association with a DTD or schema do not always have to be
validated. Many parsers allow you to determine if validation should occur by telling the
parser to turn validation on or off. We’ll talk about this more in Chapter 5 (“Using the
XmlTextReader and XmlTextWriter Classes in ASP.NET”). All in all, the choice to validate is
yours alone because there’s nothing that dictates exactly when validation should be used. In
complex applications that expect particular elements and attributes, validation is more than
likely a good idea. In other applications, validation may be overkill. Use your best judgment
when deciding whether your document must be validated.
NOTE
As you go through the next few sections, remember that raw XML documents viewed
in Internet Explorer 5 or later are not validated. To check the validity of your docu-
ments or the samples that follow, point your browser to the following URL and click
the Demo link:
http://msdn.microsoft.com/Downloads/samples/
Internet/xml/xml_validator/sample.asp
If you’d rather work with a standalone product that validates, take a look at the fol-
lowing URLs:
http://www.xmlspy.com
http://www.xmetal.com
http://www.extensibility.com
An excellent source of XML-related products can also be found at
http://www.xmlsoftware.com.
93
CHAPTER 4
DTD Basics
As mentioned earlier, DTDs still have wide industry support even though they have several
weaknesses associated with them. Because of this support, a basic overview of DTDs is given
in this chapter so that you can understand and use them as appropriate in your XML docu-
ments. Because several books are available that concentrate specifically on DTDs, I won’t
spend a lot of time covering them here. However, you’ll learn enough to effectively use them
to validate XML documents. Because you are already familiar with what DTDs do, let’s jump
right into the specifics by first looking at a sample DTD. This sample is based on the XML
document shown in Listing 4.1.
Listing 4.1 A Simple XML Document

2: <root>
4: <menuItem itemNumber=”a”>
7: </menuItem>
8: <menuItem itemNumber=”b”>
9: <hyperLink/>
11: <menuItem itemNumber=”b1”>
14: </menuItem>
17: <name>Press Releases</name> 4
DTDS AND XML

18: </menuItem>
UNDERSTANDING
SCHEMAS
22: </menuItem>
23: </menuItem>
25: </root>
The following is an example of how this document’s DTD might look:

94
<!DOCTYPE root [
<!ELEMENT root (menuItem+)>
<!ELEMENT menuItem (hyperLink+, name+, menuItem*)>
<!ATTLIST menuItem itemNumber ID #REQUIRED>
<!ELEMENT hyperLink ANY>
<!ELEMENT name (#PCDATA)>
]>
The DTD DOCTYPE

Back in Chapter 2’s discussion on entities, you had your first look at a DOCTYPE declaration.
This declaration is similar to the XML declaration in that it lets the parser know when content
is starting. A DOCTYPE declaration must follow the XML declaration and come before any ele-
ments. Comments or processing instructions may be between the XML declaration and DOC-
TYPE declaration. Adding the DTD directly into Listing 4.1 would result in the XML/DTD
combination shown in Listing 4.2.
Listing 4.2 XML Document with DTD

2: <!DOCTYPE root [
3: <!ELEMENT root (menuItem+)>
4: <!ELEMENT menuItem (hyperLink+, name+, menuItem*)>
5: <!ATTLIST menuItem itemNumber ID #REQUIRED>
6: <!ELEMENT hyperLink ANY>
7: <!ELEMENT name (#PCDATA)>
8: ]>
9: <root>
11: <menuItem itemNumber=”a”>
14: </menuItem>
15: <menuItem itemNumber=”b”>
16: <hyperLink/>
21: </menuItem>
25: </menuItem>
95
CHAPTER 4

29: </menuItem>
30: </menuItem>
32: </root>
The DTD shown in Listing 4.2 is called an internal DTD. Internal DTDs actually “live” in an
XML document, rather than being found in an external document. Here’s the general syntax
for an internal DTD:
<!DOCTYPE rootElementName [ element, attribute, and entity declarations go here ]>
rootElementName would be substituted for the real name of the document’s root element. If
the root element name does not follow the DOCTYPE keyword, the parser will raise an error. As
shown in Listing 4.2, the root element was actually named root and therefore placed after the
DOCTYPE keyword.
An XML document referencing an external DTD (one that is in a separate file) could use one
of the following examples:
<!DOCTYPE rootElementName SYSTEM “http://www.someURL.com/pathToDTD/menu.dtd”>
This example is called an external SYSTEM DTD. It functions in a similar manner as includes do
in ASP.NET files or as .js or .css files do in HTML. After the parser sees the SYSTEM keyword,
it knows to look for an external DTD at the URI (Universal Resource Identifier) specified.
An external PUBLIC DTD would have the following syntax:
<!DOCTYPE rootElementName PUBLIC “-//SAMS//TEXT menus//EN” 4
“http://www.someURL.com/pathToDTD/menu.dtd”
DTDS AND XML

UNDERSTANDING
>
An application coming across this definition may have a built-in way of knowing where to SCHEMAS
look for the public DTD based on the organization name (SAMS) and document name (menus).
If one cannot be found, the second URI (which is a SYSTEM URI) would be used.
DTD Elements
The DTD used in Listing 4.2 contains several element type declarations. Element declarations
are actually quite obvious in a DTD because they begin with the keyword ELEMENT. The struc-
ture of an element declaration is as follows:
<!ELEMENT elementName (content model)>
96
Listing 4.2 declared the root, menuItem, hyperLink, and name elements (shown next):
<!ELEMENT root (menuItem+)>
<!ELEMENT menuItem (hyperLink+, name+, menuItem*)>
<!ELEMENT hyperLink ANY>
<!ELEMENT name (#PCDATA)>
Element declarations are fairly straightforward to implement. They start with <!ELEMENT and
are then followed by the element name. If the element contains any child elements, they are
listed in parentheses, as shown earlier with the menuItem element declaration. This listing of
the element’s content is called the content model. As for the menuItem element, its declaration
specifies that it can contain a hyperLink, name, and menuItem tag. Table 4.1 shows the mean-
ing of the other characters such as + and ? in the declaration.
Table 4.1 Suffix Characters

Character Description
+ The element may exist one or more times in the document.
* The element may exist zero or more times in the document.
? The element is optional. It may appear zero or one time in the document.
(none) The element may exist only once.
Without these characters, elements such as hyperLink would be able to exist only once in the
document. By placing the + character after it, we are telling the parser that the hyperLink char-
acter may exist one or more times in the document. Using these characters is crucial to the
menuItem element. Because it can be a child of itself, you need the capability to have it option-
ally listed in the DTD declaration. When it isn’t found as a child of itself, as with the first
menuItem tag in Listing 4.2 (line 11), the DTD doesn’t cause an error, because the * character
follows it in the declaration and tells the parser that the menuItem tag may exist zero or more
times. After all this information is included, the ELEMENT declaration is ended with the >
character.
The content model for the hyperLink element declaration is ANY because it can contain text
content:
<hyperLink>/default.aspx</hyperLink>
or can be empty:
<hyperLink/>
If the hyperLink element were always empty, you could have declared it as follows:
<!ELEMENT hyperLink EMPTY>
97
CHAPTER 4
If the hyperLink element does not contain child elements (as in our example) and will contain
only parsed character data (PCDATA), its declaration can be coded as follows:
<!ELEMENT hyperLink (#PCDATA)>
Element declarations can also declare optional children. The following example specifies that
the menuItem element must contain either a hyperLink element or zero or one name elements:
<!ELEMENT menuItem (hyperLink|name?)>
In cases where an element may contain parsed character data or child elements, the following
declaration would be used:
<!ELEMENT menuItem (#PCDATA|hyperLink|name)*>
This type of declaration is called a mixed content model because the menuItem element can
contain text or elements. As a matter of definition, parsed character data is any data the parser
actually parses. Any data within a CDATA section (covered in Chapter 2) would be considered as
unparsed character data because the parser simply passes it through untouched.
DTD Attributes
The structure of a DTD attribute declaration follows this pattern:
<!ATTLIST elementName attributeName type defaultDeclaration>
Listing 4.2 contained one attribute declaration. This particular attribute was named itemNumber
and was an attribute of the menuItem element:
<!ATTLIST menuItem itemNumber ID #REQUIRED>
Attribute declarations start with <!ATTLIST followed by the element name that the attribute
applies to in the XML document. The element name is then followed by the actual name of the 4
attribute. In the case of the itemNumber attribute, it was declared having a type of ID, which
DTDS AND XML

UNDERSTANDING
means its value must be unique in the document. Several attribute types can be used within an
attribute declaration, as shown in Table 4.2. SCHEMAS
Table 4.2 Attribute Types

Attribute Type Description
CDATA Attributes typed as CDATA (described in Chapter 2) can contain any
character information.
Example: <!ATTLIST menuItem itemNumber CDATA #REQUIRED>
98
Table 4.2 continued

Enumeration When an attribute can contain a variety of specific values, an enumer-
ation can be used. This concept is very similar to the select element
in HTML. It contains several option elements that force the user to
choose a specific value. In XML, if you declared a club attribute, an
enumeration could be used to specify what types of clubs are allowed
as values within the attribute. In this example, TaylorMade is defined
as the default value. Notice that no quotes are placed around the enu-
merated types.
Example: <!ATTLIST golfer clubs
(TaylorMade|Ping|Callaway) “TaylorMade”>
ID An attribute with an ID type must contain a value that is unique
throughout the document.
Example: <!ATTLIST menuItem itemNumber ID #REQUIRED>
IDREF/IDREFS The IDREF/IDREFS type allows for relationships to exist between dif-
ferent elements in a similar fashion to primary/foreign key relation-
ships in a relational database. For example, the following code relates
package2 and package3 to an invoice with an invoiceID equal to
“ar7”.
Example:
<!DOCTYPE order [
<!ELEMENT order (invoice,packages)>
<!ELEMENT invoice EMPTY>
<!ELEMENT packages (package*)>
<!ELEMENT package EMPTY>
<!ATTLIST invoice
invoiceID ID #REQUIRED
packagesInc IDREFS #REQUIRED>
<!ATTLIST package packageID ID #REQUIRED>
]>
<order>
<invoice invoiceID=”ar7” packagesInc=”pak1
pak2 pak3”/>
<packages>
<package packageID=”pak1”/>
</packages>
</order>
99
CHAPTER 4
Table 4.2 continued

NMTOKEN/NMTOKENS A NMTOKEN represents a string value that is restricted to containing
certain characters. For example, all NMTOKEN strings must be a single
word (no whitespace allowed) and should not contain a colon because
of namespace conflicts. The NMTOKEN type conforms to the element-
naming rules discussed in Chapter 2. The NMTOKENS type can contain
more than one NMTOKEN separated by a whitespace. The example
declares an itemNumber attribute that is of type NMTOKEN.
Example: <!ATTLIST menuItem itemNumber NMTOKEN #IMPLIED>
NOTATION A notation type represents binary data such as a gif or jpg.
Attribute declarations can also have a default value. Up to this point, some of the examples
you’ve seen contained defaults such as #IMPLIED and #REQUIRED. Table 4.3 is a listing of the
default values that can appear within an attribute declaration.
Table 4.3 Attribute Declaration Default Declarations

Default Description
#IMPLIED The attribute’s value is optional and has no default value associated
with it.
Example: <!ATTLIST menuItem itemNumber NMTOKEN #IMPLIED>
#REQUIRED The attribute’s value is required anywhere in the document that the
attribute exists.
Example: <!ATTLIST package packageID ID #REQUIRED>
#FIXED “value” Assigns a fixed default value to the attribute.
4
DTDS AND XML

UNDERSTANDING
Example: <!ATTLIST package invoiceID CDATA #FIXED
SCHEMAS
“startValue”>
“value” Although the attribute is not required to have a value, the value assigned
in the attribute declaration will be the default. This is good for ensuring
that attributes contain values that you want to start with, as in the case of
enumerations.
Example: <!ATTLIST package invoiceID (yes|no) “yes”>
DTD Entities
Chapter 2 gave you your first taste of internal and external entities. These types of entities
function in a manner similar to an include file in ASP.NET. For example, the following general
100
entity declaration defines companyName as an entity name. Whenever this entity is referenced in
an XML document, the companyName entity will be replaced with Tomorrow’s Learning:
The discussion of parameter entities in Chapter 2 didn’t provide you with many specifics simply
because you had not learned about DTDs yet. So, we’ll cover these in more detail here.
A parameter entity functions in a similar way to the internal and external entities, although it is
referenced only in the DTD (in contrast to in the XML document) and uses slightly different
syntax. The following is an example of a parameter entity and how it could be used within a
DTD:
<!ENTITY % clubTypes “(TaylorMade|Cobra|TommyArmour|Titliest)”>
<!ELEMENT clubsUsed %clubTypes;>
<!ELEMENT clubPreference %clubTypes;>
This example declares a parameter entity named clubTypes and assigns it a value of
(TaylorMade|Cobra|Tommy Armour|Titliest). This value is then referenced in different ele-
ment declarations, including the clubsUsed and clubPreference elements. Doing this causes
the value (TaylorMade|Cobra|Tommy Armour|Titliest) to be dynamically placed into these
element declarations and makes maintenance of the declarations easier. If the clubsUsed and
clubPreference element declarations change to include different child elements, these can be
changed in one place (at the clubTypes entity declaration) rather than at each element declara-
tion. Although the preceding example is very simple, imagine if 20 different elements all
allowed similar types of child elements. Changing the element declarations in one place is
certainly much more efficient than editing each and every one of the element declarations
directly!
Parameter entities can also be quite useful when a base-set of attributes collectively describe
more than one element. For example, if attributes named caption, name, id, width, and
height are used on several elements, the following parameter entity definition could be used:
<!ENTITY % commonAttributes
“caption CDATA #IMPLIED
name CDATA #IMPLIED
id CDATA #REQUIRED
width CDATA #IMPLIED
height CDATA #IMPLIED”
>
Applying this entity to an element definition can be accomplished by doing the following:
<!ELEMENT object EMPTY>
<!ATTLIST object %commonAttributes;>
101
CHAPTER 4
A simple external DTD named golfer.dtd along with its XML document (golfer.xml) are
included with the book’s supplemental code. The external DTD uses a parameter entity named
clubTypes, as shown earlier.
DTD Notations
Notations are used within a DTD to specify external content that a parser cannot parse (binary
data, for example). Up until this point, everything we’ve worked with has been text based.
Notations allow for content that is non-ASCII to be included in an XML document. Because
parsers are not required to support binary data according to the XML 1.0 recommendation, a
notation allows binary data within a document to be associated with a helper application. In the
following example, a png notation has been declared and associated with Adobe Photoshop.
References to png will result in Photoshop being used to show the binary image data:
<!NOTATION png SYSTEM “photoshop.exe “>
<!ATTLIST picture format NOTATION (png) #IMPLIED>
Summing Up DTDs
DTDs provide structure for an XML document allowing it to be validated by a parser.
Although the DTD syntax is not pretty to look at and can be somewhat difficult to work with,
it will be some time before DTDs go away entirely because of their support throughout the
industry. You’ll see an example of using the System.Xml assembly to validate against a DTD
in Chapter 5. Before you learn how to do this type of validation, however, it’s important that
you see some new ways of providing structure and validation for XML documents. We’ll talk
about these “schemas” in the next section.
XML Schemas—Looks a Lot Like XML! 4

After reading through the previous section covering DTDs, you may have wondered why an
DTDS AND XML

UNDERSTANDING
XML document’s structure isn’t validated by a document that actually looks like XML itself.
After all, if a DTD is simply a repository of metadata about an XML document, shouldn’t it SCHEMAS
look like an XML document?
Many other people have questioned the DTD standard and have developed their own alterna-
tive versions of XML document validation. One of these alternative versions, XML-DR, has
been developed by Microsoft and submitted to the World Wide Web Consortium(W3C). The
W3C has looked at Microsoft’s proposal and several others and finished up work on an XML
schema standard designed to allow for stronger data typing, XML-like syntax, and more over-
all power and flexibility in describing XML documents.
102
At the time this chapter was written, Microsoft’s System.Xml assembly supported the entire
XML schema specification, except for a few keywords that will be discussed later in this chap-
ter. In addition to its support for W3C XML schemas, the System.Xml assembly also provides
full support for Microsoft’s version of the XML schema (XML-DR) and DTDs.
The next few sections cover XML-DR as supported by .NET’s System.XML assembly. We will
also take a look at the W3C’s XML schema language toward the end of the chapter. To avoid
confusion between the different schema versions, the next few sections focus exclusively on
the XML-DR schema. Later on you’ll see that much of what you learn about Microsoft’s
XML-DR schema language will be applicable to the W3C’s version.
A Sample XML-DR Schema and Its DTD Counterpart

Let’s start our discussion on XML-DR schemas by walking through a simple example. Listing
4.3 contains information about different golfers. The metadata describing this XML document
is shown in the XML-DR schema contained within Listing 4.4.
Listing 4.3 Golfers XML Document

1: <?xml version=”1.0” encoding=”UTF-8”?>
2: <golfers xmlns=”x-schema:Listing4.4.xdr”>
3: <golfer skill=”excellent” handicap=”12” clubs=”Taylor Made” id=”1111”>
4: <name>
5: <firstName>Paul</firstName>
6: <lastName>Allsing</lastName>
7: </name>
8: <favoriteCourses>
9: <course city=”Pinetop” state=”AZ” name=”Pinetop Lakes CC”/>
10: <course city=”Phoenix” state=”AZ” name=”Ocotillo”/>
11: <course city=”Snowflake” state=”AZ” name=”Silver Creek”/>
12: </favoriteCourses>
13: </golfer>
14: <golfer skill=”moderate” handicap=”8” clubs=”Taylor Made” id=”2222”>
15: <name>
16: <firstName>Dan</firstName>
17: <lastName>Wahlin</lastName>
18: </name>
21: <course city=”Pinetop” state=”AZ” name=”White Mountain CC”/>
22: <course city=”Springville” state=”UT” name=”Hobble Creek”/>
24: </golfer>
25: </golfers>
103
CHAPTER 4
Listing 4.4 Golfers XML Document’s XML Schema

2: <Schema name=”golfers” xmlns=”urn:schemas-microsoft-com:xml-data”
3: xmlns:dt=”urn:schemas-microsoft-com:datatypes”
4: >
5: <description>
6: Golfers schema used to validate Listing4.3.xml
7: </description>
8: <ElementType name=”course” model=”closed” content=”empty”>
9: <AttributeType name=”city” dt:type=”string” required=”yes”/>
10: <AttributeType name=”state” dt:type=”string” required=”yes”/>
11: <AttributeType name=”name” dt:type=”string” required=”yes”/>
12 <attribute type=”city”/>
13: <attribute type=”state”/>
14: <attribute type=”name”/>
15: </ElementType>
16: <ElementType name=”favoriteCourses” model=”closed” content=”eltOnly”>
17: <element type=”course” minOccurs=”1” maxOccurs=”*”/>
18: </ElementType>
19: <ElementType name=”golfer” model=”closed”
20: content=”eltOnly” order=”seq”>
21: <AttributeType name=”skill” dt:type=”string” required=”yes”/>
22: <AttributeType name=”handicap” dt:type=”i1” required=”yes”/>
23: <AttributeType name=”clubs” dt:type=”string” required=”yes”/>
24: <AttributeType name=”id” dt:type=”i2” required=”yes”/>
25: <attribute type=”skill”/>
26: <attribute type=”handicap”/>
27: <attribute type=”clubs”/>
28: <attribute type=”id”/>
29: <element type=”name” minOccurs=”1” maxOccurs=”1”/>
30: <element type=”favoriteCourses” minOccurs=”1” maxOccurs=”1”/>
4
DTDS AND XML

31: </ElementType>
UNDERSTANDING
32: <ElementType name=”golfers” model=”closed” content=”eltOnly”>
SCHEMAS
33: <element type=”golfer” minOccurs=”1” maxOccurs=”*”/>
34: </ElementType>
35: <ElementType name=”lastName” model=”closed”
36: content=”textOnly” dt:type=”string”/>
37: <ElementType name=”firstName” model=”closed”
38: content=”textOnly” dt:type=”string”/>
39: <ElementType name=”name” model=”closed” content=”eltOnly” order=”seq”>
40: <element type=”firstName” minOccurs=”1” maxOccurs=”1”/>
41: <element type=”lastName” minOccurs=”1” maxOccurs=”1”/>
42: </ElementType>
43: </Schema>
104
The XML schema shown in Listing 4.4 contains all the metadata about the golfers XML doc-
ument in Listing 4.3 and describes its structure using well-formed XML. Although the XML-DR
schema is certainly much more verbose than an equivalent DTD (shown in Listing 4.5), hope-
fully you’ll agree that it is much easier to read and understand even if you have never seen a
schema document before. After you learn the various element, attribute, and data type declara-
tions, you’ll find that creating schemas is a snap. In case you wanted to see how much code the
DTD would take for the same XML document, here it is:
Listing 4.5 Golfers DTD Document

2: <!ELEMENT course EMPTY>
3: <!ATTLIST course
4: city CDATA #REQUIRED
5: state CDATA #REQUIRED
6: name CDATA #REQUIRED
7: >
8: <!ELEMENT favoriteCourses (course+)>
9: <!ELEMENT firstName (#PCDATA)>
10: <!ELEMENT lastName (#PCDATA)>
11: <!ELEMENT name (firstName, lastName)>
12: <!ELEMENT golfer (name, favoriteCourses)>
13: <!ATTLIST golfer
14: skill CDATA #REQUIRED
15: handicap CDATA #REQUIRED
16: clubs CDATA #REQUIRED
17: id CDATA #REQUIRED
18: >
19: <!ELEMENT golfers (golfer+)>
Although the XML document in Listing 4.3 references an external schema (Listing 4.4.xdr),
the schema could have been included internally within the XML. To do this, you would simply
take out the schema namespace reference in the golfers root element and place the schema
directly under the golfer element:
<golfers>
<Schema name=”golfers” xmlns=”urn:schemas-microsoft-com:xml-data”
xmlns:dt=”urn:schemas-microsoft-com:datatypes”>
<description>Golfers schema used to validate Listing4.3.xml</description>
<ElementType name=”course” model=”closed” content=”empty”>
<AttributeType name=”city” dt:type=”string” required=”yes”/>
<AttributeType name=”state” dt:type=”string” required=”yes”/>
105
CHAPTER 4
<AttributeType name=”name” dt:type=”string” required=”yes”/>

<attribute type=”city”/>
<attribute type=”state”/>
<attribute type=”name”/>
</ElementType>
......... (remainder of schema would go here)
</Schema>
<golfer xmlns=”x-schema:#golfers”>
......... (xml content would go here)
</golfer>
</golfers>
The golfer element calls the schema through using xmlns=”x-schema:#golfers”, where
#golfers is used to identify the schema name.
Now that you’ve had a chance to look at an XML schema, the next few sections will break it
apart and detail each piece used to create it.
Using the Schema Keyword with Namespaces in

XML-DR Schemas
An XML-DR schema begins with the Schema keyword, a name attribute, and two namespace
declarations. The schema in Listing 4.4 started with the following Schema tag:
<Schema name=”golfers” xmlns=”urn:schemas-microsoft-com:xml-data”
xmlns:dt=”urn:schemas-microsoft-com:datatypes”
>
. . . . . . .
</Schema>
Notice the inclusion of the urn:schemas-microsoft-com:xml-date and dt=”urn:schemas-

4
microsoft-com:datatypes” namespaces. These are specific to Microsoft’s XML-DR schema
DTDS AND XML

UNDERSTANDING
version and must appear as shown.
WARNING SCHEMAS
Even slight changes to the namespaces will result in an error if validated by XML-
DR–aware parsers.
We’ll use the dt qualified namespace later in the schema to assign specified data types to dif-
ferent elements and attributes. The name attribute is optional, although its inclusion is defi-
nitely recommended for schema recognition purposes.
106
Because XML-DR schemas look, act, and obey the XML rules just like regular XML docu-
ments, the beginning <Schema> tag must be ended appropriately using </Schema>. The XML-DR
Schema keyword is case sensitive and must appear with a capital S as shown.
XML-DR Elements, Groups, and Attributes

Listing 4.4 contains a variety of elements and attributes that are used together to make the
schema. The following sections describe these in more detail.
XML-DR Elements and Groups

Declaring elements in an XML-DR schema is an extremely easy process because it’s a lot like
using an element within an XML document. In fact, element declarations are very similar to
using the Dim statement in Visual Basic to declare a variable in an ASP.NET page for later use.
Going back to Listing 4.4, the lastName element was declared as follows:
<ElementType name=”lastName” model=”closed” content=”textOnly”
dt:type=”string”
/>
The ElementType tag is case sensitive and must be used exactly as shown. The valid attributes
that can be used with the ElementType tag are listed in Table 4.4 along with their description.
Table 4.4 ElementType Tag Attributes
name The name attribute specifies the name of the element being declared. Names
used in the schema must follow XML element-naming rules.
Example: <ElementType name=”someName”....>
model The model attribute specifies how strictly or loosely content within an element
is managed. Values can include open or closed. A closed model restricts content
within the declared element to what the schema declares. Any attempts to add
content to the element will fail. An open model allows other content to be added
later.
Example: <ElementType model=”open”...>
content The content attribute is similar to the content model used within DTDs. It
determines what type of content can be included within the element. Valid val-
ues include empty, eltOnly, textOnly, and mixed. An element containing only
attributes would be considered empty (because it has no child elements or text),
whereas an element containing text and child elements would be considered
mixed. Elements that contain no text but that do contain child elements would
have a content value equal to eltOnly.
Example: <ElementType content=”textOnly”...>
107
CHAPTER 4
Table 4.4 continued
dt:type You saw earlier that DTDs restrict element and attribute data types to string val-
ues (PCDATA and CDATA) for the most part, although there are a few other types
for attributes, such as ID and IDREF. XML schemas (both the XML-DR and
W3C XML schema versions) support a variety of data types. The dt prefix cor-
responds to the qualified namespace declared in the <Schema> tag. This is pre-
fixed to set this type attribute apart from another one that can be used with the
<element> tag (shown later in the chapter).
Example: <!ElementType dt:type=”int”...>
order One knock on DTDs is that elements specified in the content model for an ele-
ment declaration must appear in the order they appear. For example, the follow-
ing DTD declaration specifies that the golfer element must contain the name
and course elements in the order listed.
Example: <!ELEMENT golfer (name,course)>
If the golfer element contains a course element that comes before a name ele-
ment, an error will occur. XML-DR schemas avoid this by using the order
attribute. Valid values include one, many, or seq. Specifying one will allow any
element from a list to be included. Using many will allow elements in a list to
appear (or not appear) in any order, and using seq is like the preceding DTD
example—all elements must appear in a specific order.
Example:
<ElementType name=”golfer” model=”closed” content=”eltOnly”
➥order=”seq”>
<AttributeType name=”skill” dt:type=”string” required=”yes”/>
<AttributeType name=”handicap” dt:type=”i1” required=”yes”/> 4
<AttributeType name=”clubs” dt:type=”string” required=”yes”/>
DTDS AND XML

UNDERSTANDING
<AttributeType name=”id” dt:type=”i2” required=”yes”/>
SCHEMAS
<attribute type=”skill”/>
<attribute type=”handicap”/>
<attribute type=”clubs”/>
<attribute type=”id”/>
<element type=”name” minOccurs=”1” maxOccurs=”1”/>
<element type=”favoriteCourses” minOccurs=”1” maxOccurs=”1”/>
</ElementType>
The following golfer element declaration has a closed model (no additional items can be
added), which can contain elements but not text (eltOnly), and requires that the elements
108
appear in the sequence listed (name followed by favoriteCourses). It also contains several
attribute declarations, which are covered shortly:
<ElementType name=”golfer” model=”closed” content=”eltOnly” order=”seq”>
<AttributeType name=”handicap” dt:type=”i1” required=”yes”/>
</ElementType>
Organizing the golfer element declaration in this manner allows for content to be declared
once and then reused in other places rather than duplicating the information in multiple places.
As an example, the golfer element (and associated content) can be included as a child element
of the golfers root element declaration by doing the following:
<ElementType name=”golfers” model=”closed” content=”eltOnly”>
<element type=”golfer” minOccurs=”1” maxOccurs=”*”/>
</ElementType>
As you can see, after an element has been declared, it can be used within other elements. This
is accomplished by using the <element> tag. Going back to our earlier comparison with
ASP.NET, a declaration using the ElementType tag is similar to declaring a variable in an
ASP.NET page and then giving it a value. Using the element tag would be similar to assigning
the variable (or element) to another variable or object, which is the golfers element in this
case. The element tag has several attributes that can be used with it. These attributes, as well
as their designated purpose, are listed in Table 4.5.
Table 4.5 Element Tag Attributes

Attribute Purpose
type The type attribute references the name attribute declared using the
<ElementType name=”someName”> tag.
Example: <element type=”referencedName”...>
minOccurs This specifies the minimum number of times the element may appear in the
current context. Valid values include 0 when the element may nor may not
exist and 1 where it must exist at least once. This attribute defaults to a value
of 1 if not specified.
Example: <element minOccurs=”1”...>
109
CHAPTER 4
Table 4.5 continued

Attribute Purpose
maxOccurs This specifies the maximum number of times the element may appear in the
current context. Valid values include 1 when the element may occur only
once and * where it may occur an unlimited number of times. This attribute
defaults to a value of 1.
Example: <element maxOccurs=”*”...>
Using the attributes with the element tag is as simple as including them within the tag. The
following code sample shows the course element being assigned to the favoriteCourses ele-
ment using the element tag:
<ElementType name=”favoriteCourses” model=”closed” content=”eltOnly”>
<element type=”course” minOccurs=”1” maxOccurs=”*”/>
</ElementType>
In this example, the course element must occur at least once (minOccurs=”1”). It can also
occur as many times as needed (maxOccurs=”*”).
If the favoriteCourses element needed to contain a more complex combination of elements,
how would you specify this in the schema? For example, assume that the favoriteCourses
element can contain two elements named courseDetails and course or one of three elements
named courseName, courseLocation, and courseLength. With a DTD, this is easily accom-
plished by using the pipe (|) and suffix characters, but nothing you’ve learned up to this point
allows you to do the same thing in a schema.
Implementing this variable combination of elements requires the group element as shown next:
<ElementType name=”favoriteCourses” model=”closed” content=”eltOnly” 4
order=”one”>
DTDS AND XML

UNDERSTANDING
<group order=”seq” minOccurs=”0” maxOccurs=”*”>
SCHEMAS
<element type=”courseDetails” minOccurs=”1” maxOccurs=”1”/>
<element type=”course” minOccurs=”1” maxOccurs=”1”/>
</group>
<group order=”one” minOccurs=”0” maxOccurs=”*”>
<element type=”courseName” minOccurs=”0” maxOccurs=”1”/>
<element type=”courseLocation” minOccurs=”0” maxOccurs=”1”/>
<element type=”courseLength” minOccurs=”0” maxOccurs=”1”/>
</group>
</ElementType>
In addition to the order attribute defined in Table 4.4, the group tag also accepts the
minOccurs and maxOccurs attributes referenced in Table 4.5.
110
XML-DR Attributes
When Microsoft designed the XML-DR schema, they applied many of the same principles to
element and attribute declarations. This means that many of the features associated with ele-
ments also apply to the declaration and assignment of attributes:
<ElementType name=”golfer” model=”closed” content=”eltOnly” order=”seq”>
<AttributeType name=”handicap” dt:type=”i1” required=”yes”/>
</ElementType>
This example declares four attributes named skill, handicap, clubs, and id, using the
AttributeType tag. It then assigns these to the golfer element using the attribute tag. It’s
important to note that the AttributeType elements declared have a scope limited to the golfer
element. To make them have a global scope, AttributeType elements would need to be moved
out on their own, becoming a child of the Schema element.
Attributes that can be associated with the AttributeType tag are shown in Table 4.6.
Table 4.6 AttributeType Tag Attributes
Attribute Purpose
name The name attribute specifies the name of the attribute being declared.
Although it may seem somewhat odd that the attribute you are declaring has
a name that is actually based on an attribute value, after you use the syntax a
few times, it will become comfortable to work with. Names used in the
schema must follow XML element-naming rules.
Example: <AttributeType name=”someName”....>
default default specifies a default value for the attribute.
Example: <AttributeType default=”1234”...>
required required determines whether a value is required for the attribute being
declared that will be used in an XML document. It accepts values of yes
and no.
Example: <AttributeType required=”yes”...>
111
CHAPTER 4
Table 4.6 continued
Attribute Purpose
dt:type This determines the data type for the attribute and functions the same as the
dt:type attribute used with the ElementType tag.
dt:values Although elements in a schema cannot be an enumeration data type, attrib-
utes can be. When dt:type=”enumeration” is specified on an attribute, the
enumerated values are specified using the dt:values attribute.
Example:
<AttributeType name=”clubs” dt:type=”enumeration”
dt:values=”TaylorMade Callaway Cobra”
>
After an attribute has been declared using the AttributeType element tag, it can be referenced
by an attribute element anywhere within an ElementType tag. Any attribute tags not con-
tained within an ElementType declaration will cause an error. The attribute element can
contain the attributes listed in Table 4.7.
Table 4.7 Attribute Tag Attributes

Attribute Purpose
type The type attribute references the name attribute declared using the
<AttributeType name=”someName”> tag.
Example: <attribute type=”referencedName”..>
default default specifies a default value for the attribute. If a default value was
specified in the AttributeType element, it will be overridden.
4
Example: <attribute default=”1234”...>
DTDS AND XML

UNDERSTANDING
required As with the default attribute, this attribute can override a required attribute
SCHEMAS
declared on the AttributeType element. Possible values include yes and no.
Example: <attribute required=”no”...>
XML Data Types

DTDs are extremely limited in the validation they offer for data contained within an XML doc-
ument. In fact, DTD data types have no support for basic data types such as integer and date.
Fortunately, XML-DR schemas contain excellent data type support. Any text contained within
an element or attribute can be assigned a specific data type. If text contained within an element
or attribute in an XML document does not match the schema, an error will be raised. Table 4.8
112
lists all the Microsoft XML-DR date types. These types are in addition to the primitive data
types already available within DTDs, such as PCDATA, ID, and so on.
Table 4.8 XML-DR Data Types

Data Type Description
bin.base64 MIME-style Base64 encoded binary BLOB.
bin.hex Hexadecimal digits representing octets.
boolean 0 = “false” and 1 = “true”.
char One character string.
date Date in a subset ISO 8601 format. Time data is not included. A bogus date
such as 2/31/2000 will pass validation because the date itself is not actually
validated.
dateTime Date in a subset of ISO 8601 format. Time data can optionally be included,
although time zone information cannot.
dateTime.tz Same as dateTime but with an optional time zone added in.
fixed.14.4 Same as “number” but no more than 14 digits to the left of the decimal point
and no more than 4 to the right.
float Real number with no limit on digits. A float can possibly have a leading sign,
fractional digits, and optionally, an exponent.
int Numeric type but with an optional sign, no fractions, and no exponent.
number Number with no limit on digits. It can optionally have a leading sign, frac-
tional digits, and an exponent.
time Time in a subset ISO 8601 format. No date or time zone information is
included.
time.tz Time in a subset ISO 8601 format. No date or information is included. Time
zone is optional.
i1 1-byte signed integer.
r4 Real number with seven-digit precision (4 bytes)
r8 Same as a float data type.
ui1 Single-byte unsigned integer.
ui2 2-byte unsigned integer.
113
CHAPTER 4
Table 4.8 continued

Data Type Description
uri Universal Resource Identifier (URI). String.
uuid Hexadecimal digits representing octets. A uuid can contain embedded
hyphens, although they are ignored. This is similar to a COM UUID. For
example: “333C7BC4-460F-11D0-BC04-0080C7055A83”.
Data types can be applied to elements or attributes in one of two ways. The first is to use the
dt:type attribute on the appropriate element or attribute declaration:
<AttributeType name=”customerID” dt:type=”int”/>
The second is to encapsulate the data type in a datatype element:

<AttributeType name=”customerID”>
<datatype dt:type=”int”/>
</AttributeType>
Because it is easier and involves less coding, the previous method of declaring the data type
within the AttributeType or ElementType elements is more common.
XML-DR Description Element

If you have ever walked into a failing project coded by someone else, you know how valuable
comments can be. Although coding schemas can be less complex than writing a data-driven
ASP.NET page, comments still have their place. Going back to Listing 4.4, the schema con-
tained one description element (or comment) that looked like the following:
4
DTDS AND XML

UNDERSTANDING
<description>Golfers schema used to validate Listing4.3.xml</description>
The description element can be used within an AttributeType, ElementType, Schema, group, SCHEMAS
element, or attribute tag to provide additional information about why things are being done
a certain way and can be useful for tracking revisions.
XML-DR Summary
XML-DR schemas provide a way to describe XML documents in a more flexible and human-
friendly manner than DTDs. Although these schemas generally require more information and
tags, reading XML-DR schemas is much easier than reading an equivalent DTD. Because an
XML-DR schema must be written according to standard XML rules, making changes to a
114
schema via an XML parser is a very feasible proposition. Altering a DTD through the use of
an XML parser can prove to be very difficult, if not impossible, depending on the parser being
used.
With all the positive things XML-DR schemas have going for them, why did the W3C create a
new schema standard? To answer this completely, you’ll need to read through the next few sec-
tions. In short, the W3C XML schema specification provides additional functionality not avail-
able in the XML-DR schema.
The W3C XML Schema

This section will present you with the main concepts available in the XML schema specifica-
tion so that you can use XML schemas effectively. To view the full specification, take a look at
the “Specifications and Development” section found at the following URL:
http://www.w3.org/XML/Schema.
To get you up to speed quickly with XML schemas, Listing 4.6 contains the XML-DR schema
you first saw in Listing 4.4, only it has now been converted to the W3C XML schema
specification.
Listing 4.6 W3C XML Schema

2: <xsd:schema xmlns:xsd=”http://www.w3.org/2001/XMLSchema”
3: elementFormDefault=”qualified”
4: >
5: <xsd:element name=”course”>
6: <xsd:complexType>
7: <xsd:attribute name=”city” type=”xsd:string” use=”required”/>
8: <xsd:attribute name=”state” type=”xsd:string” use=”required”/>
9: <xsd:attribute name=”name” type=”xsd:string” use=”required”/>
10: </xsd:complexType>
11: </xsd:element>
12: <xsd:element name=”favoriteCourses”>
14: <xsd:sequence>
15: <xsd:element ref=”course” maxOccurs=”unbounded”/>
16: </xsd:sequence>
18: </xsd:element>
19: <xsd:element name=”firstName” type=”xsd:string”/>
20: <xsd:element name=”golfer”>
22: <xsd:sequence>
115
CHAPTER 4

23: <xsd:element ref=”name”/>
24: <xsd:element ref=”favoriteCourses”/>
25: </xsd:sequence>
26: <xsd:attribute name=”skill” type=”xsd:string” use=”required”/>
27: <xsd:attribute name=”handicap” type=”xsd:int” use=”required”/>
28: <xsd:attribute name=”clubs” type=”xsd:string” use=”required”/>
29: <xsd:attribute name=”id” type=”xsd:short” use=”required”/>
31: </xsd:element>
32: <xsd:element name=”golfers”>
34: <xsd:sequence>
35: <xsd:element ref=”golfer” maxOccurs=”unbounded”/>
36: </xsd:sequence>
38: </xsd:element>
39: <xsd:element name=”lastName” type=”xsd:string”/>
40: <xsd:element name=”name”>
42: <xsd:sequence>
43: <xsd:element ref=”firstName”/>
44: <xsd:element ref=”lastName”/>
45: </xsd:sequence>
47: </xsd:element>
48: </xsd:schema>
At first glance, you may not see much resemblance to Microsoft’s XML-DR schemas.
However, many features are found in Microsoft’s version that are very similar in the W3C’s 4
version. The following sections explain a few differences and explain some new concepts
DTDS AND XML

UNDERSTANDING
defined by the W3C.
SCHEMAS
NOTE
The preceding example declares a qualified namespace named xsd and associates it
with a URI of http://www.w3.org/2001/XMLSchema. Had a default namespace been
declared instead (remember that a default namespace applies to everything within
the document if it is declared at the root), each of the xsd namespace prefixes shown
previously would not be necessary for this XML schema, although a targetNamespace
attribute would need to be added as described later in this section. Many of the
examples that follow will not list the xsd prefix for simplicity.
116
W3C Schema Elements and Attributes

Aside from a few changes in case (Schema went to schema) and namespace declarations, one
major change exists in Listing 4.6’s sample schema as compared to the XML-DR schema
shown in Listing 4.4. The new schema specification no longer relies on the ElementType and
AttributeType elements to declare an element or attribute. Instead, these two elements have
been merged with the element and attribute elements, respectively.
The W3C XML schema specification allows for element and attribute declarations using code
similar to the following:
<attribute name=”city” type=”string” use=”required”/>
or
<element name=”course” type=”string” minOccurs=”1” maxOccurs=”unbounded”/>
An element or attribute that is applied to another element can then refer to this declaration
through using a ref attribute:
<attribute ref=”city”/>
<element ref=”course”/>
Eliminating the ElementType and AttributeType elements by combining them into the ele-
ment and attribute elements makes a lot of sense and should make working with schemas
less confusing.
Data Type Definitions

Earlier in this chapter, you saw how DTDs provide very little flexibility in declaring element
and attribute data types. If you’re designing a DTD for an XML document and want to require
that a boolean value be used on a particular attribute, you’re out of luck because DTDs handle
only string types. XML-DR schemas improve on this deficiency by adding the capability to
include specific data types. The W3C XML schema has taken the concept of data types to an
even higher level with the inclusion of the Simple and Complex type elements. As you’ll see in
the next few sections, these types allow for added flexibility in how element and attribute types
are declared and used within a schema.
You had the opportunity to see many data types associated with schemas back in the XML-DR
discussion. Although similar types are found in the W3C XML schema specification, the fol-
lowing table contains a complete listing divided into primitive and derived types. This table,
along with additional data type information, is available from the following W3C Web site:
http://www.w3.org/TR/xmlschema-2/#section-Datatypes-and-Facets.
117
CHAPTER 4
Data Type Numeric

primitive string false
boolean false
float true
double true
decimal true
duration false
dateTime false
time false
date false
gYearMonth false
gYear false
gMonthDay false
gDay false
gMonth false
hexBinary false
base64Binary false
anyURI false
QName false
NOTATION false
derived normalizedString false
token false
language false 4
IDREFS false
DTDS AND XML

UNDERSTANDING
ENTITIES false
NMTOKEN false SCHEMAS
NMTOKENS false
Name false
NCName false
ID false
IDREF false
ENTITY false
integer true
118
Data Type Numeric

nonPositiveInteger true
negativeInteger true
long true
int true
short true
byte true
nonNegativeInteger true
unsignedLong true
unsignedInt true
unsignedShort true
unsignedByte true
positiveInteger true
Figure 4.1 represents a diagram of the different data types in the XML schema specification.
The SimpleType Element

The simpleType element allows for the creation of customized data types. Simple types
become quite useful whenever you want to implement functionality available in another exist-
ing base type but want to restrict how the base data type can be used. For example, assume that
you want to use a simpleType element to create a custom data type that is a positive integer
between 55 and 100 inclusive:
<simpleType name=”roundScore”>
<restriction base=”positiveInteger”>
<minInclusive value=”55”/>
maxInclusive value=”100”/>
</restriction>
</simpleType>
This example restricts the custom simpleType to be a positiveInteger data type but also
extends it through the use of the minInclusive and maxInclusive XML schema elements.
After it is specified, the simpleType can be used to define the data type of an element or
attribute:
<element name=”score” type=”myNamespace:roundScore” minOccurs=”1”
➥maxOccurs=”unbounded”/>
<attribute name=”score” type=”myNamespace:roundScore” use=”required”/>

119
CHAPTER 4
anytype
all Complex types anySimpleType
duration dateTime time date gYearMonth gYear gMonthDay gDay gMonth
boolean base64Binary hexBinary float double anyURI Qname NOTATION
string decimal
normalizedString integer
token nonPositiveInteger long nonNegativeInteger
language Name NMTOKEN negativeInteger int unsignedLong positiveInteger
NCName NNTOKEN short unsignedInt
ID IDREF ENTITY byte unsignedShort
IDREFS ENTITIES unsignedByte
FIGURE 4.1
Diagram of XML schema data types.
4
DTDS AND XML

UNDERSTANDING
SCHEMAS
NOTE
Any time a custom type (such as roundScore) is referenced and the xsd namespace
prefix is not used throughout the schema because of declaring a default namespace,
a namespace specific to the custom type must be defined and used within the schema
as shown previously. Otherwise, the custom type will fall within the schema name-
space and an error will be triggered when it is not found to be part of the XML
schema specification.
120
In cases where more complex restrictions need to be added to a base data type, the pattern
element can be used. This element has a value attribute that can accept regular expressions.
The following example defines a simpleType named us-zipcode:
<simpleType name=’us-zipcode’>
<restriction base=’string’>
<pattern value=’[0-9]{5}(-[0-9]{4})?’/>
</restriction>
</simpleType>
The simpleType can also be constructed and associated directly within an element or attribute.
However, declaring a simpleType in this manner restricts the scope of the type to be local to
the element or attribute it is declared within. This is compared to the global scope associated
with a simpleType declared on its own, as a child element of the schema element. An example
of a local declaration follows. As mentioned, this declaration is very restricting because it can
be used only within the element named score (it cannot be used anywhere else in the docu-
ment). However, if the simpleType applies only to this one element, it may make sense to
declare it this way:
<element name=”score”>
<simpleType>
<restriction base=”positiveInteger”>
<minInclusive value=”55”/>
<maxInclusive value=”100”/>
</restriction>
</simpleType>
</element>
Declaring Complex Types

The capability to create reusable templates is a powerful and necessary feature if schemas are
to be flexible and support complex XML documents. For example, assume that you have an
element that contains four attributes and two elements listed in sequence. By creating a tem-
plate for this content, you can reuse its structure anywhere in the schema, and other types can
even extend its structure.
The content model found within a complex type is composed of a variety of items called parti-
cles. Each particle is either a local or global declaration, meaning that complex types can refer-
ence other XML schema types, elements, compositors (such as sequence, choice, or all), or
groups. Alternatively, local particles can be created internally within the complex type. This
concept will be solidified more in the examples that follow.
Because simple types and complex types can both exist in the same XML schema, naming col-
lisions can occur when both have the same name. As a result, you must name the different
types uniquely so that problems caused by naming conflicts do not impede the schema.
121
CHAPTER 4
XML-DR schemas attempt to emulate some of the simple and complex typing through their
use of the ElementType and AttributeType elements. However, the W3C XML schema offers
a more powerful solution. The following is an example of using a complexType element to
encapsulate different elements and attributes:
<complexType name=”golferType” mixed=”false”>
<sequence>
<element name=”name” type=”myNamespace:nameType”/>
<element name=”favoriteCourses”
type=”myNamespace:favoriteCoursesType”/>
</sequence>
<attribute name=”skill” type=”string” use=”required”/>
<attribute name=”handicap” type=”int” use=”required”/>
<attribute name=”clubs” type=”string” use=”required”/>
<attribute name=”id” type=”int” use=”required”/>
</complexType>
The sequence element type lets the parser know that the name and favoriteCourses elements
must appear in the order shown in the schema: name first, followed by favoriteCourses.
If you had wanted to allow one of three elements to potentially appear within an XML docu-
ment, the sequence element could be replaced by the choice element:
<choice>
<element name=”name” type=”nameType”/>
<element name=”favoriteCourses” type=”myNamespace:
➥favoriteCoursesType”/>
<element name=”courseDifficulty” type=”myNamespace:
➥courseDifficultyType”/>
</choice>
<attribute name=”skill” type=”string” use=”required”/> 4
DTDS AND XML

UNDERSTANDING
SCHEMAS
</complexType>
Use of the choice element will result in one of the nested elements being listed. In this exam-
ple, the name, favoriteCourses, or courseDifficulty elements can be included within an
XML document, but only one can exist. If more than one element is used, an error will occur
when the XML document is validated against the schema.
At times, you may want three different elements to appear in a complexType but don’t care if
they appear in an unordered manner. For instance, the code shown next requires that the name,
favoritecourse, and courseDifficulty elements all appear within the XML document, but it
does not restrict their sequence as the sequence element did earlier:
122

<all>
<element name=”name” type=”nameType” maxOccurs=”1”/>
<element name=”favoriteCourses” type=”myNamespace:favoriteCoursesType”
maxOccurs=”1”/>
<element name=”courseDifficulty”
type=”myNamespace:courseDifficultyType”
maxOccurs=”1”/>
</all>
<attribute name=”skill” type=”string” use=”required”/>
</complexType>
It’s important to note that the all element can appear only directly after a complex type decla-
ration. This means that it cannot be nested within another particle such as the sequence ele-
ment. This is contrary to how the sequence and choice elements can be combined to add
additional logic:
<choice>
<element name=”name” type=”nameType”/>
<sequence>
<element name=”favoriteCourses”
➥type=”myNamespace:favoriteCoursesType”/>
<element name=”courseDifficulty”
➥type=”myNamespace:courseDifficultyType”/>
</sequence>
</choice>
</complexType>
The preceding example states that either the name element can be appear within an XML docu-
ment, or a sequence consisting of the favoriteCourses and courseDifficulty elements. By
combining choice and sequence elements, some interesting options can be created. Also note
that the group element found in the XML-DR schemas is also available to use in the W3C
XML schema version in a similar fashion.
After the specifics associated with a complex type have been defined, it can then be referenced
by name (golferType was used in the previous examples) within another complex type or
within an element by using code similar to the following:
<element name=”golfer” type=”myNamespace:golferType” minOccurs=”1”
➥maxOccurs=”unbounded”/>
123
CHAPTER 4
After complex types have been declared and defined, they can be inherited and extended by
other complex types. This is a very powerful feature that allows for reusability resulting in less
coding where used appropriately. Taking a break from golfing for a minute and using an
animal complexType as an example, as shown in Listing 4.7, a feline and canine type can
extend animal using the complexContent and extension elements:
Listing 4.7 Extending Complex Types

2: <xsd:schema xmlns:xsd=”http://www. w3.org/2001/XMLSchema”
4: >
5: <xsd:complexType name=”animal”>
6: <xsd:sequence>
7: <xsd:element name=”furColor” type=”xsd:string”
8: maxOccurs=”unbounded”
9: />
10: <xsd:element name=”legCount” type=”xsd:int”/>
11: <xsd:element name=”weight” type=”xsd:decimal”/>
12: </xsd:sequence>
14: <xsd:complexType name=”feline”>
15: <xsd:complexContent>
16: <xsd:extension base=”animal”>
17: <xsd:sequence>
18: <xsd:element name=”meows” type=”xsd:boolean”/>
19: </xsd:sequence>
20: </xsd:extension>
21: </xsd:complexContent>
22:
23:
</xsd:complexType>
<xsd:complexType name=”canine”>
4
DTDS AND XML

UNDERSTANDING
24: <xsd:complexContent>
25: <xsd:extension base=”animal”>
26: <xsd:sequence>
SCHEMAS
27: <xsd:element name=”barks” type=”xsd:boolean”/>
28: </xsd:sequence>
29: </xsd:extension>
30: </xsd:complexContent>
32: <xsd:element name=”cat” type=”feline”/>
33: </xsd:schema>
124
Listing 4.8 shows the XML document that the previous XML schema describes. Notice that
the cat element lists child elements, including the furColor, legCount, weight, and meows
elements. This is because the cat element must include all the elements found within the
feline complex type, which is an extension of the animal complex type.
Listing 4.8 The XML Document Described by Listing 4.7

2: <cat xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance”
3: xsi:noNamespaceSchemaLocation=”Listing4.7.xsd”
4: >
5: <furColor>Black</furColor>
6: <legCount>4</legCount>
7: <weight>3</weight>
8: <meows>true</meows>
9: </cat>
Creating Unique Fields, Keys, and Relationships

Although DTDs do contain the capability to associate different elements with each other
through the use of ID and IDREF attribute types, this association is often difficult to follow and
use effectively. Because the capability to provide clear and concise associations is very impor-
tant as XML continues to rise in popularity among applications and vendors, the XML schema
specification includes the capability to associate different elements or attributes in an XML
document in much the same way that relational databases provide the capability to join tables
through primary and foreign key constraints.
Creating a relationship between different elements (or attributes) can be accomplished by using
the unique, key, keyref, selector, and field elements along with their supporting attributes.
To fully understand how this is accomplished, an example is in order.
You are more than likely familiar with the sample Northwind database that ships with
Microsoft’s Access and SQL Server products. This database provides information about differ-
ent customers and orders through a set of relational tables. A given customer can be linked to
his or her orders through a CustomerID field found in both the Customers and Orders tables.
Figure 4.2 shows the primary/foreign key relationships between the two tables.
Listing 4.9 shows how these tables and their relationships could be converted into an XML
schema.
125
CHAPTER 4
Customers ∞ Orders
CustomerID OrderID
CompanyName CustomerID
ContactName EmployeeID
ContactTitle OrderDate
Address RequiredDate
City ShippedDate
Region ShipVia
PostalCode Freight
Country ShipName
Phone ShipAddress
Fax ShipCity
ShipRegion
ShipPostalCode
ShipCountry
FIGURE 4.2
Relating the Customers and Orders Tables in the Northwind Database.
Listing 4.9. An XML Schema Representing the Customers and Orders Tables in the
Northwind Database
1: <?xml version=”1.0” encoding=”UTF-8” ?>
2: <xsd:schema id=”northwind” targetNamespace=”http://www.northwind.com”
3: xmlns=”http://www.northwind.com”
4: xmlns:xsd=”http://www.w3.org/2001/XMLSchema”>
5: <xsd:annotation>
6: <xsd:documentation xml:lang=”en”>
7: This Schema validates against northwind.xml
8: </xsd:documentation> 4
9: </xsd:annotation>
DTDS AND XML

UNDERSTANDING
10: <xsd:element name=”root”>
SCHEMAS
12: <xsd:sequence>
13: <xsd:element ref=”CustomersTable” maxOccurs=”unbounded” />
14: <xsd:element ref=”OrdersTable” maxOccurs=”unbounded” />
15: </xsd:sequence>
17: </xsd:element>
18: <xsd:element name=”CustomersTable”>
20: <xsd:sequence>
21: <xsd:element name=”CustomerID” type=”xsd:string” />
126
Listing 4.9. continued

22: <xsd:element name=”CompanyName” type=”xsd:string”
23: minOccurs=”0” />
24: <xsd:element name=”ContactName” type=”xsd:string”
26: <xsd:element name=”ContactTitle” type=”xsd:string”
28: <xsd:element name=”Address” type=”xsd:string”
30: <xsd:element name=”City” type=”xsd:string”
32: <xsd:element name=”Region” type=”xsd:string”
34: <xsd:element name=”PostalCode” type=”xsd:string”
36: <xsd:element name=”Country” type=”xsd:string”
38: <xsd:element name=”Phone” type=”xsd:string”
40: <xsd:element name=”Fax” type=”xsd:string”
42: </xsd:sequence>
44: <xsd:unique name=”CustomerID_PK”>
45: <xsd:selector xpath=”.” />
46: <xsd:field xpath=”CustomerID” />
47: </xsd:unique>
48: </xsd:element>
49: <xsd:element name=”OrdersTable”>
51: <xsd:sequence>
52: <xsd:element name=”OrderID” type=”xsd:int” />
53: <xsd:element name=”CustomerID” type=”xsd:string” />
54: <xsd:element name=”EmployeeID” type=”xsd:int”
56: <xsd:element name=”OrderDate” type=”xsd:time”
58: <xsd:element name=”RequiredDate” type=”xsd:time”
60: <xsd:element name=”ShippedDate” type=”xsd:time”
62: <xsd:element name=”ShipVia” type=”xsd:int”
64: <xsd:element name=”Freight” type=”xsd:float”
66: <xsd:element name=”ShipName” type=”xsd:string”
127
CHAPTER 4

68: <xsd:element name=”ShipAddress” type=”xsd:string”
70: <xsd:element name=”ShipCity” type=”xsd:string”
72: <xsd:element name=”ShipRegion” type=”xsd:string”
74: <xsd:element name=”ShipPostalCode” type=”xsd:string”
76: <xsd:element name=”ShipCountry” type=”xsd:string”
78: </xsd:sequence>
80: <xsd:unique name=”OrderID_PK”>
82: <xsd:field xpath=”OrderID” />
83: </xsd:unique>
84: <xsd:keyref name=”Customers_Orders_Constraint”
85: refer=”CustomerID_PK”>
88: </xsd:keyref>
89: </xsd:element>
90: </xsd:schema>
This particular XML schema leverages the unique and keyref elements (shown in bold) to
establish relationships. Lines 44–47 establish the CustomerID element (a child element of the
CustomersTable element) as being unique and names this constraint CustomerID_PK. This is
done by assigning the selector element xpath attribute to a period (.). Thinking back to
4
Chapter 3, you’ll remember that the period (.) represents the context node. In this case,
DTDS AND XML

UNDERSTANDING
CustomersTable is the current element being worked with, so it is being assigned to the
SCHEMAS
selector element. The field element’s xpath attribute is then assigned to the CustomerID
element. In cases where more than one element or attribute makes up a unique constraint, mul-
tiple field elements can be listed as children of the unique element, the same as multiple
fields can be combined to create a primary key in a relational database. Had this example con-
tained an attribute that must hold a unique value, the xpath attribute of the field element
could have listed the path to the attribute instead (for example: @attributeName).
After the unique constraint named CustomerID_PK is established, the OrdersTable element can
then be related to the CustomersTable element in a primary/foreign key type relationship.
Lines 84–88 show how the keyref element can be used to refer to an existing constraint,
CustomerID_PK in this case. After the constraint is selected, the selector and field elements
can be determined by using xpath statements as detailed in the previous paragraph. In cases
128
where an element or attribute doesn’t have to be unique but a key needs to be established, the
key element can be used in place of the unique element.
Namespace Support in XML Schemas

Thinking back to our earlier discussion on DTDs, you’ll recall that one of the weak points of
DTDs was that they are incapable of supporting namespaces. As the number of XML docu-
ments circulating throughout the world continues to increase, the usage and need for name-
spaces will increase as well. This, of course, does not bode well for DTDs.
To remedy the inability of DTDs to support namespaces, the W3C XML schema working
group went to a lot of trouble to ensure that namespace support was built in to the XML
schema specification from the ground-up. By doing this, XML schemas will be able to handle
more complex XML documents and describe them more effectively. So how are namespaces
handled? Let’s take a look.
Every XML schema must contain a specific namespace and associated URI that is either
default or local, depending on the desires of the schema author. Currently, the namespace that
must be used is http://www.w3.org/2001/XMLSchema. So why, you may ask, is this URI
needed? In sum, it allows different elements, attributes, and data types declared within an
XML schema to be identified as belonging to the XML schema specification. As far as built-in
schema data types (discussed earlier), this allows them to be positively identified and allows
XML schema authors to create custom data types as long as they are associated with a differ-
ent namespace. Custom data types come into play when simple types are created, as discussed
earlier in the chapter.
Although including the URI associated with the XML schema specification is certainly benefi-
cial, XML documents may contain other namespace declarations and associated URIs.
Fortunately, these can be handled easily within XML schemas by adding the targetNamespace
attribute to the schema root element:
<?xml version=”1.0” encoding=”UTF-8” ?>
<xsd:schema xmlns:xsd=”http://www.w3.org/2001/XMLSchema”
targetNamespace=”http://www.someURI.com”
>

</xsd:schema>
This will associate all top-level components (those that are immediate children of the
xsd:schema element) with the namespace URI of http://www.someURI.com. This does not
associate any other elements or attributes that are not immediate children of the xsd:schema
129
CHAPTER 4
element with the namespace! This is very important to understand. In cases where other ele-
ments or attributes within the schema need to be associated with the namespace as well, the
elementFormDefault and attributeFormDefault attributes can be applied to the schema root
element. The following example shows how all elements not associated with the XML schema
namespace can be associated with the namespace URI contained within the targetNamespace
attribute:
targetNamespace=”http://www.someURI.com” elementFormDefault=”qualified”
>

</xsd:schema>
In cases where some element declarations should not be associated with the target namespace,
they can remain unqualified by adding the form attribute to the element declaration:
targetNamespace=”http://www.someURI.com” elementFormDefault=”qualified”
>
<xsd:element name=”CustomerID” type=”xsd:string” form=”unqualified”/>


</xsd:schema>
Referencing XML Schemas from Within XML Documents

4
After you have created an XML schema document, how do you reference it within your XML
DTDS AND XML

UNDERSTANDING
document? The answer is surprising simple and involves only using a few attributes and/or
SCHEMAS
namespace declarations that are applied to the root element of the XML document. You saw
a simple example of referencing an XML schema that did not have a targetNamespace
attribute included within its schema root element back in Listing 4.8. This example used the
noNamespaceSchemaLocation attribute along with the schema instance namespace (xsi) to
reference the schema named Listing4.7.xsd:
<cat xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance”
xsi:noNamespaceSchemaLocation=”Listing4.7.xsd”
>
<furColor>Black</furColor>
<legCount>4</legCount>
130
<weight>3</weight>
<meows>true</meows>
</cat>
Had the schema file contained a targetNamespace attribute, the noNamespaceSchemaLocation

attribute would be replaced with the schemaLocation attribute. This attribute must contain the
URI referenced within the targetNamespace attribute of the schema along with the path to
the schema file. The two pieces of information must be separated by a space or carriage return.
The root element of the XML document must also declare the schema instance namespace (xsi):
<cat xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance”
xsi:schemaLocation=”http://www.animal.com
animalSchema.xsd”
xmlns=”http://www.animal.com”
>
<furColor>Black</furColor>
<legCount>4</legCount>
<weight>3</weight>
<meows>true</meows>
</cat>
Both examples assume that the XML schema document is located within the same directory as
the XML document. A file path or URL can be used in cases where the two documents are in
separate locations.
XML Schema Summary

The current W3C XML schema specification promises to bring an abundance of power to the
creation of XML schemas. Although we haven’t covered all the elements and attributes con-
tained within the W3C XML schema specification, we have covered many of the basic building
blocks.
Summary
This chapter provided you with an in-depth look at DTDs, XML Data–Reduced schemas
(XML-DR), and W3C XML schemas. Although DTDs have their pitfalls, they are still very
prominent and will be until an implementation of the XML schema recommendation is built in
to more parsers and supported throughout different industries.
Because ASP.NET relies on the System.Xml assembly for XML parsing, you can avoid many
of the shortcomings associated with DTDs by validating your XML documents using XML-DR
schemas or the new W3C XML schemas because of the superior support for data types, XML-
like structure, and overall power that schemas have to offer. Keep in mind that although you
131
CHAPTER 4
can certainly use XML-DR schemas for validation, it is recommended that you switch to the
W3C XML schema specification since Microsoft provides full support for the schema stan-
dards. If you already have a lot of time invested in XML-DR schemas, a few XML software
packages (XMLSpy, for instance) are available that already convert between the XML-DR and
W3C XML schemas, making it relatively easy to make the switch in the future.
The next chapter explores the XmlTextReader, XmlValidatingReader, and XmlTextWriter
classes and shows how the information you’ve learned up to this point can be used in real
applications to validate XML documents. Stay along for the ride because things from here on
out promise to be a little more hands-on and exciting!
DTDS AND XML

UNDERSTANDING
SCHEMAS
Using the XmlTextReader and CHAPTER
5
XmlTextWriter Classes in
ASP.NET
IN THIS CHAPTER
• Introducing the System.Xml Assembly 134
• In-Memory Versus Forward-Only Parsing 135
• Pull Versus Push Models 137
• Using the XmlTextReader Class to Parse

XML 138
• Building a SAX-Style Push Model Using the
XmlTextReader 145
• Validating XML Documents Using

the XmlTextReader and
XmlValidatingReader 159
• Creating a Generic Validation Class 162
• Passing Authentication Credentials with the

XmlTextReader Class 166
• Using the XmlTextWriter Class to Create XML

Documents 166
134
Introducing the System.Xml Assembly

Up to this point in the book, you’ve learned some of the more mundane languages and con-
cepts associated with eXtensible Markup Language(XML). Although XPath, XPointer, XLink,
Document Type Definitions(DTDs), and schemas aren’t necessarily going to keep you from
sleeping at night because of excitement, they are essential in making XML a useful language.
Without being able to access specific areas of an XML document with XPath or ensure that a
document is valid with XML schemas, XML would be of little use in building applications.
Having said that, let’s jump right into learning about the System.Xml Assembly and see how to
leverage some of the classes it contains in your ASP.NET applications.
Working with XML in ASP.NET applications requires that you reference a specific namespace.
A namespace in the .NET platform is used as an organizational system for program compo-
nents and is important in resolving naming conflicts (much like XML namespaces). To access
XML-specific features of the .NET platform, you must reference the System.Xml namespace
in your projects. This namespace is associated with the System.Xml assembly that is installed
with the .NET framework.
If you’re not familiar with assemblies, the .NET SDK defines them in the following manner:
An assembly is a collection of types and resources that are built to work together and form a
logical unit of functionality, a “logical” dll.
In more basic terms, an assembly takes several physical files, such as interfaces, classes,
resource files, and so on, and creates metadata referred to as a manifest about how the files
work together. The assembly can also contain information about versioning and security. One
of the many benefits of assemblies is that they can be used in ASP.NET applications without
adding a CLSID to the registry using regsvr32.exe. If you’ve dealt with “DLL Hell” in the
past, you should be very pleased to hear this. Eliminating the regsvr32.exe and registry aspect
means that you can build custom assemblies and then simply copy them to the bin directory of
your ASP.NET application. No other work is required!
The System.Xml assembly contains many of the classes that provide XML functionality in the
.NET platform. The following list contains some of the main classes that you’ll find yourself
using in XML-related ASP.NET applications.
• XmlAttribute
• XmlDocument
• XmlElement
• XmlNode
• XmlNamedNodeMap
• XmlNodeList
Using the XmlTextReader and XmlTextWriter Classes in ASP.NET
135
CHAPTER 5
• XmlNodeReader
• XmlTextReader
• XmlTextWriter
• XmlValidatingReader
• XPathDocument
• XslTransform
You’ll see how to use these classes and many others in upcoming chapters. In this chapter we
will focus specifically on the XmlTextReader, XmlTextWriter, and XmlValidatingReader
classes along with several of their supporting classes. Figure 5.1 gives an overview of how
these classes fit together.
XmlReader and XmlWriter Architecture
XML 1.0
Abstract XmlReader XmlWriter Abstract
Namespaces
XmlTextReader XmlNodeReader XmlTextWriter
Support Classes
XmlConvert
XmlResolver
XmlValidatingReader XmlSchemaCollection XmlNameTable

DTD, XSD and
XDR Schema XmlNamespaceManager
Validation
FIGURE 5.1
XmlReader and XmlWriter architecture.
In-Memory Versus Forward-Only Parsing

Before you see how to work with the classes shown in Figure 5.1, it’s important that you know
5
XMLTEXTREADER
XMLTEXTWRITER
when and why the classes should be used. In Chapter 6, “Programming the Document Object
Model (DOM) with ASP.NET,” you’ll learn how to use the Document Object Model(DOM) (a
AND
tree-based Application Program Interface(API)) to access specific nodes within an XML docu-
ment in a variety of ways, ranging from programmatic access to using XPath statements. To
136
see when and why you should use the XmlTextReader and XmlTextWriter classes discussed in
this chapter, it’s necessary to provide you with a basic overview of how the DOM works.
The concept of the DOM is actually very simple. If you think of a genealogical chart contain-
ing your family lineage, you’ll recognize that it contains a tree structure with hierarchical rela-
tionships in it. For example, if you start with your grandfather in the chart, your mother may be
listed under him. You would, of course, be listed under your mother.
The DOM works in a manner similar to the genealogical chart just described. Each node found
within an XML document is loaded into memory so that a tree hierarchy is constructed with
the root node being the start of the hierarchy. By having this tree in memory, the capability to
update, delete, and insert nodes is made available through a specific API. Although the pro-
gramming API exposed by the DOM is very powerful, it can present problems with memory
consumption, especially when large XML documents must be parsed. For example, if you have
an XML document that is several megabytes in size and each node is loaded into the DOM tree
structure and placed into memory, you can imagine how this could tax a computer in cases
where multiple XML documents of this size are being processed simultaneously.
To overcome some of the potential memory problems associated with using the DOM (the
DOM is very useful in some situations, as you’ll see in Chapter 6) and to provide an alternative
in cases where the DOM isn’t necessary, the .NET platform includesthe XmlTextReader class.
This class “lives” within the System.Xml assembly. Rather than loading the entire document
into memory as the DOM does, the reader processes one node at a time in a forward-only man-
ner. This is possible because the XmlTextReader is a stream that parses the XML document
into a sequence of XML tokens. Information about these tokens can be obtained through call-
ing different properties and methods built in to the class.
This forward-only stream creation means that when an XML document’s root element is
reached, only one node is in memory at any given time. When a child node is reached, its
ancestors have been parsed, but they are not in memory and going back to them is not an
option (remember, the process is forward only). This type of functionality allows one XML
token at a time to be checked and processed rather than loading everything into memory as
with the DOM. It provides a very low memory alternative that is capable of working with large
XML documents efficiently.
The XmlTextReader functions by starting with an 8KB buffer that is capable of growing to the
size it needs to accommodate the largest token in the stream. For example, a 32KB comment
has to be returned as a single, contiguous token, which causes the parser buffer to grow to
32KB in this case. If that is the biggest token that is added to the stream, the buffer will stay
that size for the rest of the document.
137
CHAPTER 5
All attributes are buffered as well because of namespace support. A start tag with 100 attributes
will typically require a larger buffer as a result. Even though the buffer size grows to fit the
largest token, the XmlTextReader still offers a very memory-conservative approach to dealing
with XML documents[md]especially when compared to the DOM. So is the DOM bad?
Certainly not! It is very useful in many situations, as you’ll see in Chapter 6.
Pull Versus Push Models

There is more than one way to parse an XML document in a forward-only manner. If you’ve
worked with XML much in the past, you’re probably familiar with the Simple API for XML
(SAX) that relies on an event-based push model. SAX has become popular for parsing XML
documents in a fast and efficient manner. Microsoft’s MSXML3 parser contains SAX support
that can be used in Visual Basic or C++ applications.
Although SAX is not a World Wide Web Consortium(W3C) recommendation and is not even
maintained by the W3C, it has become a defacto standard throughout the world when XML
documents must be read in a forward-only manner. SAX is actually maintained and developed
by David Megginson and several others who belong to an XML-DEV mailing list. SAX is an
event-based model that works by pushing out information about different XML nodes it finds
in a document to a content handler. The content handler can then process the data it receives,
as appropriate. If you’re interested in learning more about SAX and how it works, you can find
more information at http://www.megginson.com/SAX/index.html.
With the widespread use of SAX, you’re probably thinking that .NET must surely include
SAX support, right? Actually, SAX is not “officially” supported in .NET, although you’ll see
later in the chapter that a SAX-style model can be created. Instead, the .NET platform provides
an alternative pull model for parsing an XML document rather than the push model found in
SAX. Why, you ask? There are actually several reasons[md]most of which focus on perfor-
mance gains. Table 5.1 lists many of these reasons.
Table 5.1 Benefits of the Pull Model found in .NET

State Management Push model content handlers must build complex state
machines that a pull model client can simplify by managing
the state with natural top-down procedural refinement.
Multiple Input Streams A pull model allows a client to splice together multiple input
streams. Doing this with a push model can be difficult. 5
Layering Test You can easily build a push model on top of a pull model,
XMLTEXTREADER
XMLTEXTWRITER
whereas the reverse isn’t true. You’ll see an example of this

later in the chapter.
AND
138
Table 5.1 continued

Hints from Client You can design a pull model API to allow the client to give
hints to the parser about what the client expects next. For
example, in data type support, when a client knows the next
item to process is supposed to be an integer, the parser can
parse the integer right out of the parser buffer instead of
returning a string subsequently thrown away.
Avoids Extra Copy A pull model allows the client to give the parser the buffer to
write the strings into. This technique avoids the extra copy
from the parser buffer to the string object, which is then
pushed to the client buffer.
Skipping Items The push model must push everything, including the attrib-
utes, comments, text, whitespace, and so on. With a pull
model, the client pulls only what it’s interested in. If, for
example, the client doesn’t read the attributes, those attribute
values don’t need to be entity expanded, values “stringized,”
names atomized, and so on. This model allows for more effi-
cient messaging-level applications of XML.
Using the XmlTextReader Class to Parse XML

Now that you’ve been introduced to the DOM and had the opportunity to see why the .NET
platform includes a pull model rather than a push model, as in SAX, it’s time to see how to
actually use this pull model to parse XML. Forward-only, noncached XML document parsing
is handled by the XmlTextReader class. This class inherits properties and methods from the
abstract XmlReader class and is quite simple to instantiate and use. The pull model it provides
allows for a lot of flexibility in reading an XML document. To instantiate and use the class, the
following code can be used:
XmlTextReader xmlReader = new XmlTextReader(xmlDocument);
This version of the constructor allows the file path (a string) to the XML document to be
passed as an argument. Several other constructors can be used while instantiating the
XmlTextReader as well. For example, a TextReader or Stream can also be passed in as argu-
ments. See the .NET SDK for more details on the different XmlTextReader constructors.
After the XmlTextReader object is created, parsing can then begin on the document in a for-
ward-only manner. This is accomplished by using the different properties and methods built in
to the class. Table 5.2 lists what properties are available and Table 5.3 lists the available
methods.
139
CHAPTER 5
Table 5.2 XmlTextReader Properties
Property Description
AttributeCount Returns the number of attributes on the current node. This can be
used in looping through the attributes associated with a particular
element node. The MoveToNextAttribute() method (see Table
5.3) provides a more efficient manner of doing this, however.
BaseURI Returns the base URI of the current node.
CanResolveEntity Returns a Boolean value based on whether an entity is resolved.
Depth Returns the depth of the current node in the XML element stack.
Encoding Returns the encoding attribute for the XML document being
parsed.
EOF Returns a Boolean value telling whether the XmlTextReader is
positioned at the end of the stream.
HasAttributes Returns a Boolean value indicating whether the current node has
any attributes.
HasValue Returns a Boolean value indicating whether the node can have a
value. Although many nodes can have values, such as attributes
and CDATA sections, others cannot, such as end element tags.
IsDefault Returns a Boolean value that specifies whether the current node
is an attribute that was created from the default value defined in
the DTD or schema.
IsEmptyElement Returns a Boolean value indicating whether the current node is
an empty element, such as (<FavoriteCourses/> for example).
Item Returns the value of the attribute when looping through a collec-
tion. This is the indexer for the XmlTextReader in C#.
LineNumber Returns the current line number of the position within the XML
document. This is similar to the getLineNumber() method asso-
ciated with the Locator class in SAX.
LinePosition Returns the current line position. This is similar to the
getColumnNumber() method in SAX.
LocalName Returns the name of the current node without the namespace pre-
fix attached to it.
Name Returns the name of the current node along with any associated 5
namespace prefix.
XMLTEXTREADER
XMLTEXTWRITER
Namespaces Gets or sets a value indicating whether to do namespace support.

Namespace support is the default (true).
AND
140
Table 5.2 continued
NamespaceURI Returns the namespace URI of the node the reader is positioned
on.
NameTable Gets the XmlNameTable associated with this implementation.
Using the NameTable can result in more efficient object compar-
isons. An example of using it is shown later in the chapter.
NodeType You’ll likely use this property very heavily in your ASP.NET
applications. It returns the type of the current node based on the
XmlNodeType enumeration.
Normalization Gets or sets a value indicating whether to do whitespace normal-
ization as specified in the WC3 XML recommendation version
1.0 (see http://www.w3.org/TR/1998/REC-xml-19980210).
Prefix Returns the namespace prefix (if any) of the current node.
QuoteChar Returns the quotation mark character used to enclose the value of
an attribute node. For example, ‘ or “.
ReadState Returns the read state of the stream based upon the ReadState
enumeration. Possible values can be ReadState.Closed,
ReadState.EndOfFile, ReadState.Error, ReadState.Initial,
or ReadState.Interactive.
Value Returns the text value of the current node.
WhitespaceHandling Gets or sets a value that specifies how whitespace is handled.
This property ties into the WhitespaceHandling enumeration.
XmlLang Returns the current xml:lang scope where the current node
resides.
XmlResolver Gets or sets the XmlResolver used for resolving URLs and/or
PUBLIC IDs used in external DTDs. The XmlUrlResolver class
inherits from the XmlResolver abstract class and is used to
resolve DTD and schema locations. If the directory where the
DTD or schema resides requires authentication, the credentials to
pass for authentication can be set through this property.
XmlSpace Returns the current xml:space scope where the current node
resides. This property is associated with the XmlSpace enumera-
tion that contains values of XmlSpace.None, XmlSpace.Default,
XmlSpace.Preserve.
141
CHAPTER 5
Table 5.3 XmlTextReader Methods
Method Description
Close() Changes the ReadState to Closed and sets all the properties
back to zero. Use in conjunction with the ReadState property.
GetAttribute() Returns the value of an attribute.
GetRemainder() Returns the remainder of the buffered XML in cases where a
different XmlTextReader may finish reading a document. After
this method is called, the EOF property is set to true.
IsStartElement() Returns true if the current content node is a start element tag.
LookupNamespace() Resolves a namespace prefix that exists within the current ele-
ment’s scope. Assuming the namespace resolves, the namespace
URI will be return in a string value.
MoveToAttribute() Move to a specified attribute based on indexed position or
name.
After calling this method, you can access the Name,
NamespaceURI, and Prefix properties associated with that
attribute.
MoveToContent() Checks whether the current node is a content (nonwhitespace
text, CDATA, Element, EndElement, EntityReference, or
EndEntity) node. If the node is not a content node, the method
skips ahead to the next content node or end of file. Nodes that
are of type ProcessingInstruction, DocumentType, Comment,
Whitespace, or SignificantWhitespace will be skipped over.
MoveToElement() After navigating through attributes of an element, you can
move back to the element by using this method.
MoveToFirstAttribute() Moves to the first attribute of an element node.
MoveToNextAttribute() Moves to the next attribute of an element node. This can be
used effectively in a while loop to enumerate through any ele-
ment attributes.
Read() Reads the next XML node from the stream. You will normally
use this method within a while loop to parse through an XML
document.
ReadAttributeValue() Reads and parses the attribute value into Text and/or
EntityReference node types. 5
XMLTEXTREADER
XMLTEXTWRITER
ReadBase64 Decodes Base64 and returns the decoded binary bytes.

ReadBinHex Decodes BinHex and returns the decoded binary bytes.
AND
142
Table 5.3 XmlTextReader Methods
Method Description
ReadChars Reads the text contents of an element into a character buffer.
The size of the buffer will normally be set initially. Characters
are then added to the buffer:
int len;
while ((len = reader.ReadChars(buffer, 0, buflen)) > 0) {
//Call Buffer Function or create String
}
ReadElementString() Used to read text-only elements.

ReadEndElement() Verifies that the current node is an element end tag and places
the position within the stream to the next node.
ReadInnerXml() Reads all the content (including markup) of the current node as
a string. If the current node is empty, an empty string will be
returned.
ReadOuterXml() Reads all the content (including markup) of the current node as
a string. This includes the node’s start and end element tags. If
the current node is empty, an empty string will be returned.
ReadStartElement() Verifies that the current node is an element and positions the
reader on the next node.
ReadString() Reads the contents of an element type node as a string.
ResolveEntity() Resolves the entity reference for EntityReference nodes.
Note: The XmlTextReader cannot resolve general entities.
Skip() Skips the current element node and moves it to the next node if
one exists.
Listing 5.1 provides an example of working with some of the XmlTextReader properties and
methods to walk through an XML document and write out the XML tokens it finds to the
browser. An explanation of the code follows.
Listing 5.1. Using the XmlTextReader Class

1: <%@ Import Namespace=”System.Xml” %>
2: <script language=”C#” runat=server>
3: public class ReadXmlFile {
5: public string ReadDoc(String doc) {
6: string m_Document = doc;
7: XmlTextReader xmlReader = null;
143
CHAPTER 5

8: try {
9: xmlReader = new XmlTextReader(m_Document);
10: WriteXml(xmlReader);
11: }
13: output += “Error Occurred While Reading “ + m_Document +
14: “ “ + e.ToString();
15: }
16: finally {
17: if (xmlReader != null)
18: xmlReader.Close();
19: }
20: return output;
21: }
22: private void WriteXml(XmlTextReader xmlReader) {
23: while (xmlReader.Read()) {
24: if (xmlReader.NodeType == XmlNodeType.Element) {
25: output += indent(xmlReader.Depth*4);
26: output += “<b><” + xmlReader.Name + “</b>”;
27: while (xmlReader.MoveToNextAttribute()) {
28: output += “&nbsp<i>” + xmlReader.Name + “=\”” +
29: xmlReader.Value + “\”</i>&nbsp”;
30: }
31: if (xmlReader.IsEmptyElement) {
32: output += “<b>/></b><br>”;
33: } else {
34: output += “<b>></b><br>”;
35: }
36: } else if (xmlReader.NodeType == XmlNodeType.EndElement) {
38: output += “<b></” + xmlReader.Name + “></b><br>”;
39: } else if (xmlReader.NodeType == XmlNodeType.Text) {
40: if (xmlReader.Value.Length != 0) {
42: output += “<font color=#ff0000>” + xmlReader.Value
+
43: “<br></font>”;
44: }
45: }
46: }
5
XMLTEXTREADER
XMLTEXTWRITER
47: }
48: private string indent(int number) {
49: string spaces = “”;
AND
50: for (int i=0; i < number; i++) {

144

51: spaces += “ ”;
52: }
53: return spaces;
54: }
55: } //End Class
56:
57: private void Page_Load(Object sender, EventArgs e) {
58: ReadXmlFile readXmlFileSample = new ReadXmlFile();
59: output.InnerHtml =
60: readXmlFileSample.ReadDoc(Server.MapPath(“golfers.xml”));
61: }
62: </script>
63: <html>
64: <head>
65: </head>
66: <body>
67: <font size=”5” color=”#02027a”>
68: Working with the XmlTextReader Class
69: </font>
70: <p />
71: <div id=”output” runat=”server” />
72: </body>
73: </html>
This example starts by referencing the System.Xml assembly in line 1. Doing this allows
access to all the classes found within the assembly, including the XmlTextReader class needed
for this application. After the page is hit, the Page_Load() event takes care of instantiating the
ReadXmlFile class (line 58). After the class is instantiated, the ReadDoc() method is called and
the path to the XML document is passed in as an argument (line 60). The ReadDoc() method
takes care of instantiating the reader by passing the file path to its constructor. It then passes
the newly created XmlTextReader object to the WriteXml() (line 22) method. Each node in the
XML document is processed as it is reached. This processing begins with a call to the
XmlTextReader’s Read() method (line 23). As each token in the stream is read, the node type
is checked. If the node type is equal to XmlNodeType.Element, the MoveToNextAttribute()
method is called within a while loop (lines 27–30). Any existing attributes are iterated through
and displayed, as appropriate.
If the node is not an element but is instead an element end tag (XmlNodeType.EndElement), the
appropriate closing tag is included (line 38). If the node is actually a text node
(XmlNodeType.Text), the text is included, but the font color is set to red so that the data stands
out more (lines 41–44). The text is read by calling the Value property.
145
CHAPTER 5
Each node is indented a specified amount by checking the XmlTextReader’s Depth property.
This property automatically tracks how deep you are into an XML document’s hierarchical
structure, making it very easy to display the document in a properly formatted manner or sim-
ply track parent/child relationships. After the entire document has been parsed, it is written out
to the browser.
Note that the attributes are not automatically passed to you, as with SAX. If you didn’t want to
work with any attributes, they could simply be skipped over. The same logic follows for text
nodes, processing instructions, comments, and so on. Until you tell the code to pull these dif-
ferent node types from the stream, they are simply ignored, which keeps buffer sizes minimal
and results in very fast and efficient XML document parsing.
The XmlTextReader can also be associated with the XmlNameTable class to increase efficiency
even more in certain scenarios. The XmlNameTable class is capable of holding a table of atom-
ized string values and is useful when the same element, namespace, or attribute name is com-
pared multiple times while reading the XML document stream. Instead of doing costly string
comparisons, the XmlNameTable can do object pointer comparisons. For example, if a condi-
tional check is done on a particular namespace URI many times while parsing, the URI can be
added to the XmlNameTable as shown next to increase efficiency:
XmlTextReader reader = new XmlTextReader(url);
object myUri = reader.NameTable.Add(“http://www.SomeServer.com/namespaceURI”);
As the stream of XML tokens is read, a direct object pointer comparison can then be made
against the XmlNameTable object:
if (reader.NamespaceURI.Equals(myUri)) {
//Do special processing if the two URI values are equal
}
This method of testing for a URI value is in contrast to the following less-efficient way of
doing the same conditional test:
if (reader.NamespaceURI == “http://www.SomeServer.com/namespaceURI”) {
//Do special processing if the two URI values are equal
}
Building a SAX-Style Push Model Using

the XmlTextReader 5
Earlier in the chapter, it was mentioned that a SAX-style push model could be built from a pull
XMLTEXTREADER
XMLTEXTWRITER
model but that the reverse was not true. This is possible because with a pull model you deter-
AND
mine what should be pulled and when it should be pulled. With a push model you have no con-
trol over what you get because everything found in the document is pushed to you via events.
146
Before jumping into the code used to create the SAX parser, you may wonder why you would
want to create this in the first place, especially because the pull model exposed by the
XmlTextReader is so efficient already.
The first answer to this question is that SAX is very popular throughout the world. Because of
its popularity, many applications being ported to .NET may have SAX ContentHandlers with
a lot of code in them that functions well. Converting all this code to work directly with the
XmlTextReader could be costly from a management perspective. Rather than rewriting these
handlers, the code that will be shown could be used temporarily while other parts of the appli-
cation are ported to .NET.
If you don’t buy the previous reason for creating the SAX parser class in .NET (hey, I gave it
my best shot!), suffice it to say that creating a SAX parser provides an excellent stage for dis-
playing just how powerful the XmlTextReader can be in .NET applications. Through seeing the
process of creating the SAX parser, you’ll become more familiar with different ways that the
XmlTextReader can be used and see a few other things you can do with .NET classes.
If you’ve used SAX in the past, you’re probably intimately familiar with the different parts of
the SAX parser ContentHandler interface. Listing 5.2 shows this SAX interface converted to a
.NET interface.
Listing 5.2 The IContentHandler Interface (IContentHandler.cs)

1: using XmlParsers.Sax.Helpers;
2: interface IContentHandler {
3:
4: void setDocumentLocator(Locator locator);
5:
6: void startDocument();
7:
8: void endDocument();
9:
10: void processingInstruction(string target, string data);
11:
12: void startPrefixMapping(string prefix, string uri);
13:
14: void endPrefixMapping(string prefix);
15:
16: void startElement(string namespaceURI, string localName,
17: string rawName, Attributes atts);
18:
19: void endElement(string namespaceURI, string localName,string rawName);
20:
147
CHAPTER 5

21: void characters(char[] ch, int start, int end);
22:
23: void ignorableWhitespace(char[] ch, int start, int end);
24:
25: void skippedEntity(string name);
If you’re not familiar with this interface, it’s actually quite simple to understand. The different
interface members are called by the SAX parser as it parses an XML document. For example,
when the document parsing starts, startDocument() is called. When an element type node is
found within the document, startElement() is found and any attributes associated with the
element are passed as arguments (whether you want them or not). This push process continues
until the end of the document is reached (endDocument()).
A few classes are required by SAX that are not built in to the .NET platform. Fortunately,
creating custom classes to emulate the classes that the SAX ContentHandler expects is very
straightforward. Listing 5.3 shows some helper classes and a struct that were created to
enable SAX-like functionality in the IContentHandler interface (and in the IErrorHandler
interface, which is not shown here). More specifically, the code takes care of creating a
SAXParseException, Attributes, and Locator class. The Attributes class contains a collec-
tion of attributes that will be passed as an argument to startElement() in the ContentHandler.
The Locator class allows the line position and column position of the SAX parser to be
tracked as it is parsing the XML document.
Listing 5.3 Creating SAX Helpers (SAXHelpers.cs)

1: namespace XmlParsers.Sax.Helpers {
2: using System;
3: using System.Collections;
4: public class SAXParseException {
5: string lineNumber = “”;
6: string systemID = “”;
7: string message = “”;
8: public string getLineNumber() {
9: return lineNumber;
10: }
11: public string getSystemID() {
12: return systemID; 5
13: }
XMLTEXTREADER
XMLTEXTWRITER
14: public string getMessage() {

15: return message;
AND
16: }
148

17: public string LineNumber {
18: set {
19: lineNumber = value;
20: }
21: }
22: public string SystemID {
23: set {
24: systemID = value;
25: }
26: }
27: public string Message {
28: set {
29: message = value;
30: }
31: }
32: }
33:
34: public class Locator {
35: int lineNumber;
36: int columnNumber;
37: public Locator() {
38: lineNumber = 0;
39: columnNumber = 0;
40: }
41: public int LineNumber {
42: set {
43: lineNumber = value;
44: }
45: }
46: public int ColumnNumber {
47: set {
48: columnNumber = value;
49: }
50: }
51: public int getLineNumber() {
52: return lineNumber;
53: }
54: public int getColumnNumber() {
55: return columnNumber;
56: }
57: }
58:
59: public struct SaxAttribute {
60: public string Name;
149
CHAPTER 5

61: public string NamespaceURI;
62: public string Value;
63: }
64:
65: public class Attributes {
66: public ArrayList attArray = new ArrayList();
67: public int getLength() {
68: return attArray.Count;
69: }
70: public string getQName(int index) {
71: SaxAttribute saxAtt = (SaxAttribute)attArray[index];
72: return saxAtt.Name;
73: }
74: public string getValue(int index) {
75: SaxAttribute saxAtt = (SaxAttribute)attArray[index];
76: return saxAtt.Value;
77: }
78: public Attributes TrimArray() {
79: attArray.TrimToSize();
80: return this;
81: }
82: }
83: }
After all the classes expected by the SAX ContentHandler and ErrorHandler interfaces are
created and ready to be used, the actual implementation of the SAX parser can be constructed
through using the XmlTextReader class, along with a few others. Listing 5.4 shows this imple-
mentation and is followed by step-by-step details on what the code is doing.
Listing 5.4 Constructing a SAX Parser Using the XmlTextReader Class (SAXParser.cs)
1: namespace XmlParsers.Sax {
2: using System;
3: using System.Xml;
5: using XmlParsers.Sax.Handlers;
7:
8:
/// <summary>
/// The SaxParser class build a SAX push model
5
XMLTEXTREADER
XMLTEXTWRITER
9: /// from the pull model found in the XmlTextReader.

10: /// </summary>
AND
11: public class SaxParser {

12: private ContentHandler Handler = null;
150

13: private ErrorHandler errorHandler = null;
14: public void setErrorHandler(ErrorHandler errorHandler) {
15: this.errorHandler = errorHandler;
16: }
17: public void setContentHandler(ContentHandler handler) {
18: this.Handler = handler;
19: }
20: public void parse(string url) {
21: int buflen = 500;
22: char[] buffer = new char[buflen];
23: Stack nsstack = new Stack();
24: Locator locator = new Locator();
25: SAXParseException saxException = new SAXParseException();
26: Attributes atts;
27: XmlTextReader reader = null;
28: try {
29: reader = new XmlTextReader(url);
30: object nsuri = reader.NameTable.Add(
31: “http://www.w3.org/2000/xmlns/”);
32: Handler.startDocument();
33: while (reader.Read()) {
34: int len;
35: string prefix;
36: locator.LineNumber = reader.LineNumber;
37: locator.ColumnNumber = reader.LinePosition;
38: Handler.setDocumentLocator(locator);
39: switch (reader.NodeType) {
40: case XmlNodeType.Element:
41: nsstack.Push(null);//marker
42: atts = new Attributes();
43: while (reader.MoveToNextAttribute()) {
44: if (reader.NamespaceURI.Equals(nsuri)) {
45: prefix = “”;
46: if (reader.Prefix == “xmlns”) {
47: prefix = reader.LocalName;
48: }
49: nsstack.Push(prefix);
50: Handler.startPrefixMapping(prefix,
51: reader.Value);
52: } else {
53: SaxAttribute newAtt =
54: new SaxAttribute();
55: newAtt.Name = reader.Name;
56: newAtt.NamespaceURI =
57: reader.NamespaceURI;
151
CHAPTER 5

58: newAtt.Value = reader.Value;
59: atts.attArray.Add(newAtt);
60: }
61: }
62: reader.MoveToElement();
63: Handler.startElement(reader.NamespaceURI,
64: reader.LocalName, reader.Name,
65: atts.TrimArray());
66: if (reader.IsEmptyElement) {
67: Handler.endElement(reader.NamespaceURI,
68: reader.LocalName, reader.Name);
69: }
70: break;
71: case XmlNodeType.EndElement:
74: while (prefix != null) {
75: Handler.endPrefixMapping(prefix);
76: prefix = (string)nsstack.Pop();
77: }
78: break;
79: case XmlNodeType.Text:
80: while ((len =
81: reader.ReadChars(buffer, 0, buflen))>0) {
82: Handler.characters(buffer, 0, len);
83: }
84: //After read you are automatically put
85: //on the next tag so you have to
86: //call the proper case from here.
87: if (reader.NodeType == XmlNodeType.Element) {
88: goto case XmlNodeType.Element;
89: }
90: if (reader.NodeType ==
91: XmlNodeType.EndElement) {
92: goto case XmlNodeType.EndElement;
93: }
94: break;
95: case XmlNodeType.ProcessingInstruction:
96: Handler.processingInstruction(reader.Name,
97: reader.Value);
5
XMLTEXTREADER
XMLTEXTWRITER
98: break;
99: case XmlNodeType.Whitespace:
AND
152

100: char[] whiteSpace =
101: reader.Value.ToCharArray();
102: Handler.ignorableWhitespace(whiteSpace,0,1);
103: break;
104: case XmlNodeType.Entity:
105: Handler.skippedEntity(reader.Name);
106: break;
107: }
108: } //While
109: Handler.endDocument();
110: } //try
111: catch (Exception exception) {
112: saxException.LineNumber = reader.LineNumber.ToString();
113: saxException.SystemID = “”;
114: saxException.Message =
115: exception.GetBaseException().ToString();
116: errorHandler.error(saxException);
117: }
118: finally {
119: if (reader.ReadState != ReadState.Closed) {
120: reader.Close();
121: }
122: }
123: } //parse()
124: } //SAXParser
125: } //namespace
Let’s take a step-by-step look at what is happening in the code shown in Listing 5.4.
Step 1: Referencing Assemblies

The namespace XmlParsers.Sax is declared and several assemblies are made available to the
SAXParser class, including the System.Xml assembly shown in line 3. The helper classes
shown in Listing 5.3 are also referenced (XmlParser.Sax.Helpers) along with the necessary
content handlers (XmlParser.Sax.Handlers).
1: namespace XmlParsers.Sax {
2: using System;
5: using XmlParsers.Sax.Handlers;
153
CHAPTER 5
Step 2: Setting the Handlers

The setErrorHandler() and setContentHandler() methods take care of letting the
SAXParser class know which objects to pass information to as the stream of XML is read or
when an error occurs.
12: private ContentHandler Handler = null;
13: private ErrorHandler errorHandler = null;
14: public void setErrorHandler(ErrorHandler errorHandler) {
15: this.errorHandler = errorHandler;
16: }
17: public void setContentHandler(ContentHandler handler) {
18: this.Handler = handler;
19: }
Step 3: Declaring the XmlTextReader Class

After the ContentHandler and ErrorHandler classes are set, the parse() method of the
SAXParser class can be called with the path to the XML document being passed in as a para-
meter. Within the parse() method, several objects are created, including those necessary to
handle buffering (lines 21–22), namespace prefixes (line 23), the ContentHandler Locator and
Attributes objects (line 24, line 26), and the ErrorHandler SAXParseException object (line
25). Finally, line 27 shows an XmlTextReader object named reader being set to null.
20: public void parse(string url) {
21: int buflen = 500;
22: char[] buffer = new char[buflen];
23: Stack nsstack = new Stack();
24: Locator locator = new Locator();
25: SAXParseException saxException = new SAXParseException();
26: Attributes atts;
27: XmlTextReader reader = null;
Step 4: Instantiating the XmlTextReader Class

The next section of code begins the try portion of a try/catch/finally block and then takes
care of instantiating the XmlTextReader class. Notice that the path to the XML document that
will be parsed is passed into the constructor. This path was originally passed into the parse()
method as a string (shown in Step 3). Line 31 shows the creation of an XmlNameTable object as
well. In cases where you will do numerous comparisons between a particular string value and 5
an XML token returned by the XmlTextReader during parsing, you’ll want to consider using an
XMLTEXTREADER
XMLTEXTWRITER
XmlNameTable. This is because object comparisons can be done during the parsing process that
offer performance benefits over regular string comparisons. In Step 6, you’ll learn more about
AND
how the XmlNameTable object named nsuri (line 30) is used.

154
28: try {
29: reader = new XmlTextReader(url);
30: object nsuri = reader.NameTable.Add(
31: “http://www.w3.org/2000/xmlns/”);
Step 5: Reading from the Stream

Assuming the XML document identified by the file path passed into the constructor of the
XmlTextReader is found, the stream of XML tokens are now ready to be read. If the file is not
found, the catch block will be hit, which will cause the ErrorHandler to be called (shown
later). Before the stream reading begins, however, the ContentHandler’s startDocument()
method is called to let it know that parsing is about to begin (line 32). After the
ContentHandler has been notified, the process of reading the stream is started by calling the
XmlTextReader’s Read() method from within a while statement, as shown in line 33. This
method will return true as long as the end of the stream has not been reached.
Lines 36–38 show how information about the reader’s position within the stream can be passed
to the ContentHandler. After this information has been passed, the bulk of the parsing work
begins as the reader’s NodeType property is checked within a switch statement (line 39).
32: Handler.startDocument();
34: int len;
35: string prefix;
36: locator.LineNumber = reader.LineNumber;
37: locator.ColumnNumber = reader.LinePosition;
38: Handler.setDocumentLocator(locator);
39: switch (reader.NodeType) {
Step 6: Checking for Element Nodes

Now that we’re within the switch statement, each XML token read from the stream can be
checked against the XmlNodeType enumeration. The case statement shown in line 40 will be hit
if an element type node is found in the stream. Because element nodes are the main structural
component of an XML document, the bulk of the parsing work will be done within this partic-
ular case.
Line 41 starts things off by creating a null entry into a stack object named nsstack. This
object will be used to track namespace prefixes that may be found on a given element so that
the ContentHandler’s startPrefixMapping() method can pass this information (shown in
lines 45–51). Line 42 then creates a SAX helper class named Attributes that is used to hold
any attributes found on a given element. To enumerate through any existing attributes, the
XmlTextReader’s MoveToNextAttribute() method can be called as shown in line 43. This
method returns a value of true each time an attribute is found. Lines 53–59 take care of
155
CHAPTER 5
adding each attribute’s Name, NamespaceURI, and Value to the Attributes object collection.
Notice that as the attributes are being enumerated through, the NamespaceURI property of the
reader object is being compared to the XmlNameTable object named nsuri (line 44). As men-
tioned earlier, this object-to-object comparison results in performance gains over simply com-
paring strings, as shown next:
if (reader.NamespaceURI == “http://www.w3.org/2000/xmlns/”) {
........
}
After all attributes has been enumerated through (assuming some existed in the first place), the
XmlTextReader’s MoveToElement() method is called to get back on the original element. Now
that the necessary information about the element has been gathered, the ContentHandler’s
startElement() method is called and the objects detailed earlier along with information about
the element itself are passed in as arguments (lines 63–65). After this is completed, a check is
made to see whether the element’s content model is empty (line 66). If it is, the endElement()
method is called to let the ContentHandler know that the parser will be moving on to other
nodes within the XML document.
40: case XmlNodeType.Element:
41: nsstack.Push(null);//marker
42: atts = new Attributes();
44: if (reader.NamespaceURI.Equals(nsuri)) {
45: prefix = “”;
46: if (reader.Prefix == “xmlns”) {
47: prefix = reader.LocalName;
48: }
49: nsstack.Push(prefix);
50: Handler.startPrefixMapping(prefix,
51: reader.Value);
52: } else {
53: SaxAttribute newAtt = [sr]
54: new SaxAttribute();
55: newAtt.Name = reader.Name;
56: newAtt.NamespaceURI = [sr]
57: reader.NamespaceURI;
58: newAtt.Value = reader.Value;
59: atts.attArray.Add(newAtt);
60: } 5
61: }
XMLTEXTREADER
XMLTEXTWRITER
62: reader.MoveToElement();
63: Handler.startElement(reader.NamespaceURI,
AND
64: reader.LocalName, reader.Name,

65: atts.TrimArray());
156
66: if (reader.IsEmptyElement) {
69: }
70: break;
Step 7: Checking for End Element Nodes

The next case statement simply checks whether the current XML Token being read from the
stream is of type XmlNodeType.EndElement. As with empty elements, an end element that is
found in the stream will cause the ContentHandler’s endElement() method to be called so
that it knows the current element has been completely processed. This section of code also
takes care of reading off of the stack object nsstack and calling endPrefixMapping()(lines
74–77).
71: case XmlNodeType.EndElement:
74: while (prefix != null) {
75: Handler.endPrefixMapping(prefix);
76: prefix = (string)nsstack.Pop();
77: }
78: break;
Step 8: Reading Text Nodes

Although the capability to handle element nodes (both start and end elements) is certainly use-
ful, getting to the text contained within these elements is a necessity if the data is to be of any
use. Fortunately, the XmlTextReader makes it easy to handle text nodes by using the
ReadChars() method. Lines 80–83 show how this method is used. First, notice that the
ReadChars() method accepts several arguments, including the buffer to write characters to
(appropriately named buffer in this case), the position within the buffer to start writing to, and
the number of characters to write into the buffer. If you remember back to Step 3 (lines 21 and
22) the size of the buffer along with the actual buffer itself were defined. In this example, the
size of buflen is set to 500. In cases where a text node is greater than 500 characters, the
while loop will take care of filling the buffer repeatedly with 500 characters at a time until all
characters within the text node have been handled.
After reading the characters, the XmlTextReader automatically places itself on the next XML
token within the stream. This means that you do not explicitly have to tell it to move, because
it does the moving for you. As a result of this behavior, lines 87–93 take care of checking what
the new node type is that the reader is positioned on and calls the proper case statement
depending on the node type. If these statements were omitted, several start or end elements
could be skipped and left unprocessed.
157
CHAPTER 5
79: case XmlNodeType.Text:

80: while ((len =
81: reader.ReadChars(buffer, 0, buflen))>0) {
82: Handler.characters(buffer, 0, len);
83: }
84: //After read you are automatically put
85: //on the next tag so you have to
86: //call the proper case from here.
87: if (reader.NodeType == XmlNodeType.Element) {
88: goto case XmlNodeType.Element;
89: }
90: if (reader.NodeType ==
92: goto case XmlNodeType.EndElement;
93: }
94: break;
Step 9: Handling Processing Instructions, Whitespace,

and Entities
As processing instruction, whitespace, and entity node types are encountered during the pars-
ing process, each is passed to the appropriate method within the ContentHandler, as shown in
the following code. As you look through the code, you’ll see that this process is fairly straight-
forward.
95: case XmlNodeType.ProcessingInstruction:
96: Handler.processingInstruction(reader.Name,
97: reader.Value);
98: break;
99: case XmlNodeType.Whitespace:
100: char[] whiteSpace =
101: reader.Value.ToCharArray();
102: Handler.ignorableWhitespace(whiteSpace,0,1);
103: break;
104: case XmlNodeType.Entity:
105: Handler.skippedEntity(reader.Name);
106: break;
Step 10: Ending the Parsing Process and Catching Errors

If the XML document is parsed completely by the XmlTextReader, the endDocument() method 5
XMLTEXTREADER
XMLTEXTWRITER
is called on the ContentHandler (line 109). If an error arises during the parsing process, such
as the XML document not being found or not being well-formed, the error will be caught by
AND
158
the catch block shown in lines 111–117. This block takes care of calling the ErrorHandler
object so that the error can be reported to the SAX application.
After the parsing process has completed, the finally block will be called. This block takes
care of ensuring that the XmlTextReader is not already closed through checking the ReadState
property.
109: Handler.endDocument();
110: } //try
111: catch (Exception exception) {
112: saxException.LineNumber = reader.LineNumber.ToString();
113: saxException.SystemID = “”;
114: saxException.Message =
115: exception.GetBaseException().ToString();
116: errorHandler.error(saxException);
117: }
118: finally {
119: if (reader.ReadState != ReadState.Closed) {
120: reader.Close();
121: }
122: }
Step 11: Calling the SAX Parser from an ASP.NET Page

After reading through all the code explanations in the previous 10 steps, you should have a
good idea of what features the XmlTextReader class has to offer. The only thing left to show is
how an ASP.NET page can instantiate and use the SAXParser class and its associated handler
classes to parse an XML document. Listing 5.5 shows the segment of code from a file named
SAXTest.aspx that instantiates the SAXParser class.
Listing 5.5 Using the SAXParser Class from an ASP.NET Page

1: public void StartSAX() {
2: Response.Write(“<b>Starting SAX Parsing....</b><p />”);
3: SaxParser parser = new SaxParser();
4: ContentHandler handler = new ContentHandler(Request,Response);
5: ErrorHandler errorHandler = new ErrorHandler(Request,Response);
6: parser.setContentHandler(handler);
7: parser.setErrorHandler(errorHandler);
8: try {
9: parser.parse(Server.MapPath(“SAXTest.xml”));
10: }
11: catch (Exception exp) {
12: Response.Write(exp.ToString());
13: }
14: }
159
CHAPTER 5
The ContentHandler class in this example receives data about different nodes in the XML
document and then writes the results out to the browser. However, the same code (with minor
modifications) could be used to update a database, text file, or other data store. In concluding
the discussion on creating a .NET version of the SAX parser by using the XmlTextReader
class, it’s important to keep in mind that it is always recommended that you use the
XmlTextReader directly in your ASP.NET applications. It is engineered to provide fast and
efficient support of XML document parsing.
Validating XML Documents Using the

XmlTextReader and XmlValidatingReader
XML provides a very useful and extensible framework for marking up data that can be
exchanged between applications in a platform-neutral manner. However, the capability to cre-
ate your own XML element and attribute names does come with a price. This price comes into
play when you want to exchange XML documents with other people or applications not famil-
iar with your document structure.
Thinking back to Chapter 4, “Understanding DTDs and XML Schemas,” you’ll recall that
three types of documents have been created to handle this problem and make exchanging XML
documents a feasible and efficient proposition. As a refresher, this chapter presented you with
information on DTDs, XML Data-Reduced Schemas (XML-DR), and W3C XML schemas
(XSD). In this section you’ll see how to apply the knowledge you gleaned from Chapter 4 to
your ASP.NET applications.
If you’ve worked with MSXML3, you’re probably familiar with using the validateOnParse
property or validate() method to validate XML documents against DTDs or XDR schemas.
With the validateOnParse property, a document can be validated as it is parsed into the DOM
structure while the validate() method allows for runtime validation of a document that has
already been parsed and loaded. Although validating XML documents in .NET applications is
quite a bit different from the MSXML3 mechanism, the manner in which documents are vali-
dated is much more efficient and flexible now.
The validation process involves the XmlTextReader because it provides fast, forward-only,
noncached access to XML documents. However, the XmlTextReader doesn’t have any proper-
ties or methods that can handle the validation process. Instead, the XmlTextReader is used in
conjunction with another class named the XmlValidatingReader. This class inherits from the
5
abstract XmlReader class just as the XmlTextReader does. In fact, almost all the properties and
XMLTEXTREADER
XMLTEXTWRITER
methods found in the XmlTextReader are also found in the XmlValidatingReader. However,
the XmlValidatingReader is designed specifically to validate XML documents or read XML
AND
fragments (shown in Chapter 9, “SQL Server 2000, XML, and ASP.NET”).

160
Instantiating the XmlValidatingReader

The XmlValidatingReader has three constructors that can be used to initialize it. The first one
accepts an XmlTextReader as an argument, whereas the remaining two accept several argu-
ments and handle XML fragments such as those returned from SQL Server 2000. You’ll be
exposed to the other constructors in Chapter 9. To validate XML documents, you’ll want to use
the first constructor as shown next:
XmlValidatingReader vReader = new XmlValidatingReader(reader);
The XML document that is parsed by the reader object (an XmlTextReader object) is used by
the XmlValidatingReader as it does comparisons of the XML document against DTDs or
schemas.
Setting the ValidationType

After the XmlValidatingReader class is instantiated, its ValidationType property can be set
to the proper ValidationType enumeration member:
vReader.ValidationType = ValidationType.Auto;
Table 5.4 shows the different members of the ValidationType enumeration.
Table 5.4 ValidationType Enumeration
Member Name Description

Auto If the XmlValidatingReader’s ValidationType property is not set, this
member will be used as the default. The .NET SDK lists the following
rules about the Auto member:
If there is no DTD or schema, it will parse the XML without validation.
If there is a DTD defined in a <!DOCTYPE ...> declaration, it will load
the DTD and process the DTD declarations such that default attributes and
general entities will be made available. General entities are loaded and
parsed only if they are used (expanded).
If there is no <!DOCTYPE ...> declaration but there is an XSD
schemaLocation attribute, it will load and process those XSD schemas,
and it will return any default attributes defined in those schemas.
If there is no <!DOCTYPE ...> declaration and no XSD or XDR schema
information, the parser is a non-validating parser (that is,
ValidationType=ValidationType.None)
If there is no <!DOCTYPE ...> declaration and no XSD schemaLocation
attribute but there are some namespaces using the MSXML x-schema:
URN prefix, it will load and process those schemas, and it will return any
default attributes defined in those schemas.
161
CHAPTER 5
TABLE 5.4 continued

Member Name Description
If there is no <!DOCTYPE ...> declaration but there is a schema declara-
tion (<schema>), it will validate using the in-line schema.
DTD Allows for validation against a DTD only.
None Creates the equivalent of a nonvalidating parser.
No validation errors will be thrown when this is used.
Schema Validate against XSD schemas.
XDR Validate against XDR schemas.
Using the XmlSchemaCollection Class

If you will be validating against XDR or XSD schemas, the XmlSchemaCollection class can
be used to cache your schemas, thus providing performance benefits. Using the XmlSchema
Collection class is as simple as instantiating it and then calling the Add() method:
XmlSchemaCollection schemaCol = new XmlSchemaCollection();

schemaCol.Add(“http://www.someURI.com”,”schema.xsd”);
The Add() method has several overloaded forms that can be used. The preceding version
accepts the namespace associated with the schema (normally the targetNamespace URI for
XSD schemas) and the name of the schema document. The Add() method can be called multi-
ple times in cases where several schemas are used in an application for validation. The main
benefit of using this class to hold different schemas is that the schemas do not have to be
reloaded each time validation needs to take place because they can be pulled from the schema
cache.
After adding the appropriate schemas to the collection, the XmlValidatingReader (named
vReader in this example) needs to be associated with the collection to take advantage of its
caching features:
vReader.Schemas.Add(schemaCol);
Attaching Event Handlers

After the XmlValidatingReader has been instantiated and its properties have been set, the
5
reader must be hooked up to an event handler that will be called if an error occurs during the
XMLTEXTREADER
XMLTEXTWRITER
validation process. In .NET, event handlers can be attached to an object by using += and sub-
tracted by using the -= syntax. The XmlValidatingReader has one event named Validation
AND
EventHandler that can be attached to catch errors that may arise. Attaching to the event
162
involves using the += syntax along with specifying the callback handler that should be called if
a validation error occurs. After the event handler is attached, the Read() method is called on
the XmlValidatingReader object to start validating the XML document.
vReader.ValidationEventHandler +=
new ValidationEventHandler(this.ValidationCallBack);
// Parse through XML
while (vReader.Read()){}
The callback handler (named ValidationCallBack()) is shown in the following code segment.
Note that an external class with specific error-handling capabilities could be referenced as well.
private void ValidationCallBack(object sender, ValidationEventArgs args) {
//Deal with any errors here
}
Creating a Generic Validation Class

Now that you’ve seen what is involved with using the XmlValidatingReader, let’s put all these
concepts together to build a generic validating class that could be called from any ASP.NET
page when validation against a DTD, XDR, or XSD schema is needed. Listing 5.6 shows this
class named Validator.
Listing 5.6 A Generic Validation Class (validator.cs)

1: namespace XmlParsers.Validation {
2: using System;
4: using System.Xml.Schema;
5: using System.IO;
6: using System.Text;
7: using System.Net;
8: /// <summary>
9: /// The Validator class encapsulates XML validation functionality
10: /// </summary>
11: public class Validator {
12: bool _valid;
13: bool _logError;
14: string _logFile;
15: string _xmlFilePath;
17: XmlValidatingReader vReader = null;
18:
19: public bool Validate(string xmlFilePath,
20: XmlSchemaCollection schemaCol,
21: bool logError,string logFile) {
163
CHAPTER 5

22: _logError = logError;
23: _logFile = logFile;
24: _xmlFilePath = xmlFilePath;
25: _valid = true;
26: try {
27: xmlReader = new XmlTextReader(_xmlFilePath);
28: vReader = new XmlValidatingReader(xmlReader);
29: if (schemaCol != null) {
30: vReader.Schemas.Add(schemaCol);
31: }
32: vReader.ValidationType = ValidationType.Auto;
33:
34: /* Provide your own resolver if implementing a custom
35: * caching mechnism
36: * XmlUrlResolver resolver = new XmlUrlResolver();
37: * vReader.XmlResolver = resolver;
38: */
39: vReader.ValidationEventHandler +=
40: new ValidationEventHandler(this.ValidationCallBack);
41: // Parse through XML
42: while (vReader.Read()){}
43: }
44: catch {
45: valid = false;
46: }
47: finally { //Close our readers
48: if (xmlReader.ReadState != ReadState.Closed) {
50: }
51: if (vReader.ReadState != ReadState.Closed) {
52: vReader.Close();
53: }
54: }
55: return _valid;
56: }
57:
58: private void ValidationCallBack(object sender,
59: ValidationEventArgs args) {
60: valid = false; //hit callback so document has a problem
61: DateTime today = DateTime.Now;
5
XMLTEXTREADER
XMLTEXTWRITER
62: StreamWriter writer = null;

63: try {
64: if (_logError) {
AND
65: writer =
66: new StreamWriter(_logFile,true,Encoding.ASCII);
164

67: writer.WriteLine(“Validation error in: “ +
68: xmlFilePath);
70: writer.WriteLine(args.Message + “ “ +
71: today.ToString());
73: if (xmlReader.LineNumber > 0) {
74: writer.WriteLine(“Line: “ +
75: xmlReader.LineNumber +
76: “ Position: “ +
77: xmlReader.LinePosition);
78: }
80: }
81: writer.Flush();
82: }
83: catch {}
84: finally {
85: if (writer != null) {
86: writer.Close();
87: }
88: }
89: }
90: } //Validator
91: } //namespace
The Validator class simply implements the functionality discussed in the last few sections.
The bulk of the work is done in the Validate() method, which accepts the following
parameters:
Parameter Name Description

xmlFilePath Path to the XML document that will be validated.
schemaCol Represents an XmlSchemaCollection created by the client of the
Validator class. This argument can be null when necessary (DTD
validation, for instance).
logError Boolean determining whether logging should be used when errors in
validation occur.
logFile Path to the log file that will be written to if logError is true and an
error occurs during validation.
165
CHAPTER 5
The Validate() method first sets the values of a few global variables and then instantiates the
XmlTextReader using the XML document located at the path specified in the xmlFilePath
parameter (line 27). Notice that the _valid variable is set to true to start, because it will be
assumed that the document is valid.
In line 28, the reader is passed to the constructor of the XmlValidatingReader named
vReader. From here, the method takes care of setting properties on vReader and then attaches
the ValidationEventHandler event handler to a callback function named Validation
CallBack. After these preparatory steps have taken place, vReader then reads through and
validates the XML document against the appropriate DTD or schema (line 42).
If any errors occur during validation, ValidationCallBack() will be called and the global
variable named _valid will be set to false. If the _logError variable is true, information
about the error will be written to a log file that can be reviewed later. After processing within
the ValidationCallBack() handler has finished, control will return to the Validate() method
and the value of the _valid variable will be returned to the calling ASP.NET page.
Calling the Validator class from within an ASP.NET page is as simple as instantiating the
class, creating the XmlSchemaCollection (optional), and then calling the Validate() method
while passing in the proper parameters.
Listing 5.7 Calling the Validator Class from an ASP.NET Page (validateSchema.aspx)
1: void Page_Load(object sender, System.EventArgs e) {
2: string xmlFilePath = Server.MapPath(“golfersNotValid(XSD).xml”);
3: string logFile = Server.MapPath(“validationErrors.log”);
4:
5: XmlSchemaCollection schemaCol = new XmlSchemaCollection();
6: schemaCol.Add(“http://www.golfExample.com”,
7: Server.MapPath(“golfers.xsd”));
8: Validator validator = new Validator();
9: bool status = validator.Validate(xmlFilePath,schemaCol,true,logFile);
10: if (status) {
11: Response.Write(“Validation of golfersNotValid(XSD).xml “ +
12: “ was SUCCESSFUL!”);
13: //Call method to process XML document here
14: } else {
15: Response.Write(“Validation of golfersNotValid(XSD).xml “ +
16: “ failed! Check the “ +
17: “log file for information on the failure.”);
5
XMLTEXTREADER
XMLTEXTWRITER
18: }
19: }
AND
166
Passing Authentication Credentials with the

XmlTextReader Class
There may be times when the XML document used by the XmlTextReader is secured and
therefore inaccessible without passing the appropriate user credentials. Fortunately, the
XmlTextReader class allows credentials to be passed so that the document can be accessed.
This is accomplished by using the CredentialCache class located in the System.Net name-
space. When no special credentials are specified (when anonymous access is allowed, for
instance), the CredentialCache class’s DefaultCredentials property is used behind the
scenes. In many situations you won’t have to worry about setting the credentials, so the line
shown in bold is not necessary:
XmlTextReader xmlReader = new XmlTextReader(“golfers.xml”);
xmlReader.XmlResolver.Credentials = CredentialCache.DefaultCredentials;
while (xmlReader.Read()) {
//....process nodes
}
In cases where specific user IDs and passwords must be specified to access XML documents
located at different URLs, the NetworkCredential class can be used along with the
CredentialCache class, as shown next. Using these classes presents an excellent mechanism
for accessing secured XML documents.
XmlTextReader xmlReader =
new XmlTextReader(“http://www.golfExample.com/xml/golfers.xml”);
NetworkCredential creds = new NetworkCredential(“golfNut”,”1putt”,
“lottaWater”);
CredentialCache credsCache = new CredentialCache();
credsCache.Add(new Uri(“www.golfExample.com”), “Basic”, creds);
credsCache.Add(new Uri(“xml.golfExample.com”), “Basic”, creds);
xmlReader.XmlResolver.Credentials = credsCache;
while (xmlReader.Read()) {
//....process nodes
}
Using the XmlTextWriter Class to Create

XML Documents
Up to this point in the chapter, you’ve seen how the XmlTextReader class can be used along
with several supporting classes to parse XML in a forward-only, noncached manner. In this
section you’ll see how this same concept can be used to dynamically create XML documents
without incurring any overhead from creating a DOM structure. In Chapter 6 you’ll see how to
do this same type of thing using DOM-specific classes.
167
CHAPTER 5
The XmlTextWriter class performs the task of writing to an XML document in a forward-only,
cursor-style manner. Although all the properties and methods associated with the class won’t
be listed here, they are extremely simple to use after you’ve seen an example. To introduce you
to how the XmlTextWriter class works, Listing 5.8 shows how to use several of its properties
and methods.
Listing 5.8 Generating XML with the XmlTextWriter Class

3: public void Page_Load(Object Src, EventArgs E) {
4: string xmlDoc = Server.MapPath(“xmltextwriter.xml”);
5: XmlTextWriter writer = null;
6: try {
7: writer = new XmlTextWriter(xmlDoc,Encoding.UTF8);
8: writer.Formatting = Formatting.Indented;
9: writer.WriteStartDocument(true);
10: writer.WriteComment(“XML Nodes added using the XmlTextWriter”);
11: writer.WriteStartElement(“golfers”);
12: writer.WriteAttributeString(“xmlns”,”x-schema:golfers.xdr”);
13: writer.WriteStartElement(“golfer”, null);
14: writer.WriteAttributeString(“skill”,”moderate”);
15: writer.WriteAttributeString(“handicap”,”12”);
16: writer.WriteAttributeString(“clubs”,”Taylor Made”);
17: writer.WriteAttributeString(“id”,”1111”);
18: writer.WriteElementString(“firstName”,null,”Paul”);
19: writer.WriteElementString(“lastName”,null,”Allsing”);
20: writer.WriteStartElement(“favoriteCourses”, null);
21: writer.WriteStartElement(“course”,null);
22: writer.WriteAttributeString(“city”,”Phoenix”);
23: writer.WriteAttributeString(“state”,”Arizona”);
24: writer.WriteAttributeString(“name”,”Ocotillo”);
25: writer.WriteEndElement();
27: writer.WriteAttributeString(“city”,”Tempe”);
29: writer.WriteAttributeString(“name”,”Ken McDonald”);
32: writer.WriteAttributeString(“city”,”Phoenix”); 5
XMLTEXTREADER
XMLTEXTWRITER
34: writer.WriteAttributeString(“name”,”Ahwatukee CC”);

AND
36: writer.WriteEndElement(); //favoriteCourses

37: writer.WriteEndElement(); //golfer
168

38: writer.WriteEndElement(); //golfers
39: writer.Flush();
40: writer.Close();
41: XmlDocument doc = new XmlDocument();
42: doc.Load(xmlDoc);
44: doc.Save(Response.Output);
45: }
47: Response.Write(e.ToString());
48: }
49: finally {
51: writer.Close();
52: }
53: }
54: }
55: </script>
The XmlTextWriter constructor accepts several different arguments. This example passes in a
document name (xmltextwriter.xml) to which all output can be written. It also passes in the
desired encoding type for the document by using the Encoding enumeration. After the class is
instantiated, several methods are called that allow for customization over what is included in
the XML document.
To start, the Formatting property is set to Formatting.Indented so that the resulting XML
document has a hierarchical indentation (line 8). Next, the XML declaration is added by pass-
ing in a Boolean value of true to the WriteStartDocument() method (line 9). Several nodes
are then added to the document through calling the appropriate methods. The specific methods
used in Listing 5.8 are shown next along with their description:
Method Description
WriteXmlDecl(boolean) Determines whether an XML declaration
should be added with a standalone attribute.
If true, the following will be added:
<?xml version=”1.0”
standalone=”yes” ?>
false will yield:
<?xml version=”1.0”
standalone=”no ?>
169
CHAPTER 5
Method Description
If you do not want any XML declaration,
simply omit the call to the WriteXmlDecl()
method.
WriteComment(comment) Adds an XmlComment node to the document.
WriteStartElement(elemName) Creates an XmlElement start tag using the
supplied argument as the tag name. This is
used when an element can contain more than
just attributes.
WriteEndElement() Creates an ending element tag. This method
must appear after the corresponding
WriteStartElement() method.
WriteAttributeString(attName,value) Creates an attribute on the appropriate ele-
ment using the supplied name and value
arguments.
WriteElementString This method is used when an element con-
(elemName, nameSpace, value) tains only a text node. It accepts the element
name, namespace URI, and text node value.
Converting EDI or Legacy Data to XML

The capability to write out XML in a forward-only manner lends itself nicely to document
conversions. For example, if a flat file containing thousands of rows needs to be converted to
XML, you could parse it and manually wrap the different element tags around the data.
Although this process would work, the number of string conversions needed to take care of
creating the XML could get rather expensive. On top of this, the code could also become diffi-
cult to maintain because of the number of string concatenations or StringBuilder calls. The
XmlTextWriter provides an excellent way to avoid some of these problems by converting the
flat-file data to XML in a fast, efficient, and code-friendly manner.
Another example of how the XmlTextWriter class could be quite valuable is in Electronic Data
Interchange(EDI) to XML conversions. EDI is a standard that is used by a large number of
companies to exchange data with trading partners. If your company doesn’t support EDI (you
support only XML in your applications, for example), what can you do to allow your company
to exchange documents with these companies? One potential solution would be to use the
XmlTextWriter class to dynamically create XML based on data within an EDI document. Let’s 5
take a closer look at how this could be accomplished.
XMLTEXTREADER
XMLTEXTWRITER
If you haven’t seen an EDI document before, you’ll get your first look in Listing 5.9. This sim-
AND
plified document represents a purchase order. If you have EDI experience, you’ll recognize
that this is not necessarily complete, but it will suffice for our purposes.
170
Listing 5.9 A Purchase Order EDI Document (ediModule.edi)

1: BEG*1234*01/24/2001~
2: DTM*110*01/25/2001~
3: PO1*1*232324~
4: NTE*Line Number 1~
5: PO1*2*99999~
6: NTE*Line Number 2~
7: CTT*2*10~
This document contains a header section, two detail sections, and a summary section. Marked
up in XML, the document could look like the one shown in Listing 5.10.
Listing 5.10 A Purchase Order EDI Document Converted to XML

<?xml version=”1.0” encoding=”utf-8”?>
<root>
<header>
<poNumber>1234</poNumber>
<poDate>01/24/2001</poDate>
<shipDate>01/25/2001</shipDate>
</header>
<detail>
<lineNum>1</lineNum>
<partNum>232324</partNum>
<desc>Line Number 1</desc>
</detail>
<detail>
<lineNum>2</lineNum>
<partNum>99999</partNum>
<desc>Line Number 2</desc>
</detail>
<summary>
<lineCount>2</lineCount>
<totalQuantity>10</totalQuantity>
</summary>
</root>
Converting the EDI document shown in Listing 5.9 to the XML document shown in Listing
5.10 is surprisingly simple when the XmlTextWriter class is used. Listing 5.11 shows the code
to accomplish this task.
171
CHAPTER 5
Listing 5.11 Converting EDI to XML (ediModule.cs)

1: namespace EDI.Converter {
2: using System;
4: using System.IO;
6:
7: /// <summary>
8: /// EDI to XML Converter
9: /// </summary>
10:
11: public class EdiToXml {
12: string _ediPath;
13: XmlTextWriter writer;
14: public EdiToXml(string ediPath,string xmlPath) {
15: ediPath = ediPath;
16: writer = new XmlTextWriter(xmlPath,Encoding.UTF8);
17: }
18: public bool Convert() {
19: FileStream fs = null;
21: string ediLine;
22: try {
23: writer.WriteStartDocument();
24: writer.WriteStartElement(“root”);
25: fs = new FileStream(_ediPath,
26: FileMode.Open,FileAccess.Read);
27: reader = new StreamReader(fs);
28: while ((ediLine = reader.ReadLine()) != null) {
29: string[] tokens = ediLine.Split(new char[]{‘*’});
30: GenerateXml(tokens);
31: }
32: writer.WriteEndElement(); //Close root element
33: return true;
34: }
35: catch {
36: return false;
37: }
38: finally {
39: if (fs != null) {
40: fs.Close();
5
XMLTEXTREADER
XMLTEXTWRITER
41: }
AND
172

42: if (reader != null) {
43: reader.Close();
44: }
46: writer.Close();
47: }
48: }
49:
50: }
51: private void GenerateXml(string[] tokens) {
52: if (tokens[0] != null) {
53: switch (tokens[0].ToString().ToUpper()) {
54: case “BEG”:
55: writer.WriteStartElement(“header”);
56: writer.WriteStartElement(“poNumber”);
57: writer.WriteString(Clean(tokens[1]));
59: writer.WriteStartElement(“poDate”);
62: break;
63: case “DTM”:
64: writer.WriteStartElement(“shipDate”);
67: writer.WriteEndElement(); //header
68: break;
69: case “PO1”:
70: writer.WriteStartElement(“detail”);
71: writer.WriteStartElement(“lineNum”);
74: writer.WriteStartElement(“partNum”);
77: break;
78: case “NTE”:
79: writer.WriteStartElement(“desc”);
82: writer.WriteEndElement(); //detail
83: break;
84: case “CTT”:
85: writer.WriteStartElement(“summary”);
173
CHAPTER 5

86: writer.WriteStartElement(“lineCount”);
89: writer.WriteStartElement(“totalQuantity”);
92: writer.WriteEndElement(); //summary
93: break;
94: }
95: }
96: }
97: private string Clean(string token) {
98: string newToken = token.Replace(“~”,””);
99: return newToken;
100: }
101: } //EdiToXml
102: } //namespace
After the EdiToXml class is instantiated and passed the path to the EDI and XML documents,
the code in lines 25–31 reads through the EDI document line by line using the StreamReader
class (part of the System.IO namespace). Each line is split into different tokens and is placed
into a string array based on the * separator. The string array (named tokens) is then passed to
the GenerateXml() method, which writes out the data found within the string array to an XML
document using the XmlTextWriter class. When you use this class, the code is nicely orga-
nized, easy to follow, and very efficient. To see the EdiToXml class in action, run the file
named EdiToXmlClient.aspx.
Summary
You should now have a good feel for how XML can be read and written using forward-only,
noncached mechanisms. The XmlTextReader and XmlTextWriter classes (along with their sup-
porting classes) pack a lot of power that can provide your ASP.NET/XML applications with
efficiency and speed. There’s still a lot more to learn, however, so don’t relax just yet!
In the next chapter, you’ll learn more about the DOM and see how you can use it to navigate
XML documents and to insert, update, delete, and move XML nodes.
5
XMLTEXTREADER
XMLTEXTWRITER
AND
Programming the Document CHAPTER
6
Object Model (DOM) with
ASP.NET
IN THIS CHAPTER:
• Welcome to the DOM 176
• In-Memory Versus Forward-Only Parsing 178
• Working with MSXML3 via Interop 179
• DOM Classes in the System.Xml Namespace
and Assembly 182
• The XmlNode Class 183
• The XmlDocument Class 189
• The XmlNodeList Class 200
• The XmlNamedNodeMap Class 201
• Selecting Nodes Within the DOM Using
Xpath 204
• Putting It All Together 205
• The XmlNodeReader Class 209
• XMLHTTPRequest Object 211
• Sample Application—Client/Server-Side
Hierarchical XML Menus 219
• Chapter 2 Sample Application Revisited 225
XML Basics
176
Welcome to the DOM

In the previous chapter, you learned how to work with XML in a forward-only manner using
the XmlTextReader and XmlTextWriter classes. You also saw how these classes can be used to
provide fast and efficient XML parsing/writing. It’s now time to examine another way to work
with XML that provides the capability to update, insert, delete, and move nodes within an
XML document. Welcome to the DOM!
You may not realize it, but you’re probably already familiar with the concept of the DOM. If
you’ve done any type of Dynamic HTML (DHTML) development in Internet Explorer 4 or
later, you have used a DOM that was created by the browser. You may have used it to change a
heading color or to dynamically show or hide specific content. When a browser loads a page in
Internet Explorer 4 or later, the various parts that make up the page are loaded into a structure
and placed in memory. Accessing this structure using DHTML is as easy as knowing what part
of the document you want to gain access to. By using the window.document.all collection,
you have access to the entire HTML page’s objects and content through client-side code writ-
ten in JavaScript (officially called ECMAScript: http://www.ecma.ch/ecma1/stand/ecma-
262.htm) or VBScript. For example, changing the text color of a particular section located
within a <div> tag having an id attribute equal to “mainContent” could be accomplished with
the following code in Internet Explorer 4 or later:
<html>
<head>
<Script Language=”JavaScript”>
function changeColor(color,div) {
window.document.all(div).style.color=color;
}
</Script>
</head>
<body bgcolor=”#FFFFFF”>
<div id=”mainContent”
onMouseOver=”changeColor(‘#ff0000’,’mainContent’)”
onMouseOut=”changeColor(‘#000000’,’mainContent’)”
>
Testing this out
</div>
</body>
</html>
If you have done much DHTML programming, you’re probably painfully aware that the DOM
used in Internet Explorer 4 or later is very different from the one used in Netscape Navigator 4
(the Netscape 6 DOM is more like Internet Explorer’s DOM). Unfortunately, the two browsers
Programming the Document Object Model (DOM) with ASP.NET
177
CHAPTER 6
are so different that a page that works perfectly fine in one is very unlikely to work in the 6
other. The W3C recognized the problems caused by inconsistencies between DOMs and issued
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
a DOM Level 1 recommendation back in 1998. More information about this can be found at
http://www.w3.org/TR/REC-DOM-Level-1/introduction.html. Since releasing the DOM
Level 1 recommendation, the W3C also released DOM Level 2 and is currently starting work
on DOM level 3. The W3C states
The DOM Level 2 is made of a set of core interfaces to create and manipulate the structure
and contents of a document and a set of optional modules. These modules contain specialized
interfaces dedicated to XML, HTML, an abstract view, generic stylesheets, Cascading Style
Sheets, Events, traversing the document structure, and a Range object.
Many of the concepts mentioned in the W3C statement are new additions to the DOM specifi-
cation. Some of these include a style sheet object model, namespace support, an event model,
and support for text ranges. More information about the DOM Level 2 specification can be
found at http://www.w3.org/TR/DOM-Level-2/.
“What does all this have to with XML?” you ask. Actually—everything. Although the DOM
created by the browser isn’t the same because of form collections, image collections, and so
on, many of the same concepts apply to an XML document’s DOM. When an XML document
is first loaded and parsed, an internal representation (similar to a tree structure) of the docu-
ment is placed in memory. This structure is based on the concept of nodes with the root node
being followed by other children nodes. Like the DOM the browser creates, the DOM created
by a DOM-compatible XML parser is programmatically accessible using classes that help you
access specific nodes within the DOM structure. After these classes and their associated prop-
erties and methods are understood, virtually any node within an XML document can be
accessed, updated, inserted, or deleted.
To gain a better understanding of how the DOM works, Figure 6.1 shows a visual representa-
tion of the following XML document’s DOM:
<golfer>
<name>Dan Wahlin</name>
<courses>
<course>Pinetop Lakes CC</course>
<course>Ocotillo</course>
</courses>
</golfer>
As shown in Figure 6.1, the DOM is composed of the various items found within the XML
document. Each of these items is a node in the DOM structure. This means that the text under
the two course nodes is actually represented in the DOM by a text node. The example shown
XML Basics
178
is very basic and doesn’t include many of the nodes that could exist in a given XML DOM
(attribute nodes, comment nodes, and so on), but it does give you a visual overview of how the
DOM is structured.
golfer
name courses
Dan Wahlin course course
Pinetop Lakes CC Ocotillo
FIGURE 6.1
Representation of the DOM.
The next sections will explain how you can gain access to an XML document’s DOM, both on
the server side with ASP.NET and on the client side with JavaScript. Before covering those
topics, however, let’s first revisit the differences between in-memory and forward-only parsing.
In-Memory Versus Forward–Only Parsing

The DOM model is very efficient at allowing access to specific XML document nodes. It
allows complete flexibility in updating, inserting, deleting, or moving nodes. However, as with
most things, this power does come with a price. One of the problems associated with the DOM
concerns memory consumption. Because an entire XML document’s structure is loaded into
memory after being parsed, a large document could potentially consume a lot of memory. A
smarter implementation of an XML DOM could behave more like a database system and page
nodes in and out from disk to conserve memory. In fact, some object database vendors have
put DOM wrappers on their systems, thereby achieving this sort of hybrid XML database.
179
CHAPTER 6
In Chapter 5, “Using the XmlTextReader and XmlTextWriter Classes in ASP.NET,” you saw 6
how the DOM memory limitations can be overcome by using the XmlTextReader class (and
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
helper classes). Rather than loading the entire document into memory as with the DOM, the
reader processes one node at a time by creating a stream that XML tokens can be read from.
This forward-only stream means that when the document’s root element is reached, very little
data is buffered in memory at any given time. When a child node is reached, its ancestors have
been parsed, but they are not in memory and going back to them is not an option. This type of
functionality allows one item at a time to be checked and processed rather than loading every-
thing into memory. It provides a very low memory alternative that is capable of working with
very large XML documents.
When should you use the DOM instead of the forward-only parsing model exhibited by the
XmlTextReader class? Unfortunately, the answer to this question isn’t a simple one because
each application has unique requirements. In general, XML documents that simply need to be
read and not updated should leverage the speed and efficiency provided by the XmlTextReader
class. Doing this helps ensure that the application remains scalable. On the other hand, XML
documents that need modification either in the form of updates, inserts, or deletes will benefit
from the flexibility of having the document’s structure in memory and should therefore use
classes associated with the DOM. Although this is a less scalable solution, it is certainly
needed in many situations. You’ll be introduced to the classes that can be used to manipulate
the DOM in a moment. Let’s first take a quick look at how MSXML3 can be used in ASP.NET
applications.
Working with MSXML3 via Interop

The .NET platform provides two routes to take for accessing the DOM. The optimal route
is discussed in the next section on the System.Xml namespace and assembly. The alternative
route is to use the .NET Interop service to access a “classic” COM XML parser such as
MSXML3.
Why would you ever want to take the alternative route when you can simply use the parser
designed specifically for .NET applications? The answer depends on the amount of work you
already have invested in your current applications. It may be that you’re ready to move to the
enhanced scalability built in to .NET applications and that you’re ready to leverage the power
of C# or VB.NET used in conjunction with ASP.NET. However, you may have hundreds or
thousands of lines of existing code that manipulates the DOM using the MSXML3 parser. Is it
worth it to completely rewrite all this code to use the new parser? That is a question that you
must answer yourself. If converting all your code to the .NET framework is not an option
because of time constraints, Interop may be the best temporary route to take.
XML Basics
180
If you take the Interop route, you can port your XML application to the .NET platform without
rewriting all your existing code. To do this, you must follow a few simple steps:
1. Convert MSXML3.dll to managed code using the Tlbimp.exe utility file.
2. Import this newly converted file’s namespace into your ASP.NET page.
3. Begin the conversion process from ASP to ASP.NET.
To get started, the first step in the preceding list must be completed. The Tlbimp.exe utility
converts the type definitions found in a COM type library to a .NET assembly that contains
metadata about the COM type definitions. In sum, the utility takes the type library information
found within MSXML3 and converts it into a form that can be used within an ASP.NET page.
To make the conversion, go to a command prompt and type the following:
tlbimp msxml3.dll /out:c:\inetpub\wwwroot\bin\msxml3Managed.dll
You may need to adjust the output file path as necessary. The output path should point to the
appropriate bin directory where you’ll be running the application that uses MSXML3 in your
ASP.NET page. If the Tlbimp.exe utility ran successfully, a new file will show up at the loca-
tion specified by the out parameter. To view the metadata contained within this file, locate the
IL Disassembler tool (Start, Programs, Microsoft .NET Framework SDK, Tools) or go to the
command prompt and type the following from within the appropriate bin directory:
ildasm msxml3Managed.dll
When you open the newly created metadata file, you should see a screen similar to the one
shown in Figure 6.2.
FIGURE 6.2
msxml3Managed.dll metadata.
181
CHAPTER 6
The IL Disassembler parses the metadata created by the Tlbimp.exe utility and organizes it by 6
interface. You’ll notice that the DOMDocument30 interface is shown in Figure 6.2, along with a
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
lot of other interfaces. By expanding the DOMDocument30 interface, you will be able to see
all the properties and methods available for use. You’ll notice that they are the same ones you
have worked with in your regular ASP pages, except that now they can be used with VB.Net or
C# to create an ASP.NET page. Listing 6.1 shows a simple example that uses the new
mxsml3Managed.dll assembly (mxsml3Managed namespace) to parse an XML document.
LISTING 6.1 Using MSXML3 in the .NET Platform

1: <%@ Import Namespace=”msxml3Managed” %>
2: <html>
3: <body>
4: <script language=”C#” runat=”Server”>
5: void Page_Load(Object Src, EventArgs E) {
6: DOMDocument30 doc = new DOMDocument30();
7: doc.load(Server.MapPath(“golfers.xml”));
8: IXMLDOMNode node = doc.documentElement;
9: Response.Write(“Document Element Name: <b>” + node.nodeName +
10: “</b><br>”);
11: for (int i=0;i<node.childNodes.length;i++) {
12: Response.Write(“<” + node.childNodes[i].nodeName +
13: “  ”);
14: if (node.childNodes[i].attributes.length > 0) {
15: for (int j=0;
16: j<node.childNodes[i].attributes.length;j++) {
17: IXMLDOMAttribute att = (IXMLDOMAttribute)
18: node.childNodes[i].attributes[j];
19: Response.Write(att.nodeName + “ = \””);
20: Response.Write(att.nodeValue + “\”  ”);
21: }
22: }
23: Response.Write(“/><br>”);
24: }
25: } // End Page_Load
26: </script>
27: </body>
28: </html>
As you can see, all the normal MSXML3 objects and their associated properties and methods
are being used in the preceding example. One difference, however, is that with VBScript (and
JScript) everything was a variant or object and late bound. In converting code from an older
ASP page, for the new code to compile successfully, you will need to add in the appropriate
type information for each object that is used.
XML Basics
182
DOM Classes in the System.Xml Namespace and

Assembly
Now that you’ve seen both an alternative to using the DOM and a way to work with the
MSXML3 parser, let’s take a look at using classes found within .NET’s System.Xml name-
space to work with the DOM. This namespace is found within the System.Xml assembly and
comes with several classes that can be used to gain access to the XML DOM. Many of the
classes in this namespace function in a manner similar to those found in the MSXML3 parser.
The DOM support found in .NET is based on the W3C DOM Level 2 Core XML specification
(http://www.w3.org/TR/DOM-Level-2/core.html). Understanding the classes that provide
this support as well as how they work together is very straightforward after you learn a few
basics. To start things off, examine the classes listed in Table 6.1:
TABLE 6.1 DOM Classes

Class Name Description
XmlNode The XmlNode class is the principle object used in the DOM. It is an
abstract class, meaning that it is not instantiated directly but is imple-
mented and extended by many other classes in the System.Xml name-
space. It is used to manipulate a particular node type located within
an XML document or to access a node’s attributes.
XmlElement The XmlElement class is associated with element type nodes in an
XML document.
XmlAttribute The XmlAttribute class is associated with attribute type nodes within
an XML document.
XmlDocument The XmlDocument class is used to access XML documents and iden-
tify the root element. It acts as the gateway to an XML document’s
DOM. Instantiating an instance of the XmlDocument class in ASP.NET
is accomplished by the following code:
XmlDocument xmlDoc = new XmlDocument();
XmlNodeList The XmlNodeList class provides access to a collection of nodes that

can be iterated through. Calls to the XmlDocument’s ChildNodes
property or GetElementsByTagName() method return an XmlNodeList
object.
XmlNamedNodeMap The XmlNamedNodeMap class provides access to a collection of nodes
(attributes, for instance) that can be called by name or iterated
through on a given XmlNode object.
183
CHAPTER 6
TABLE 6.1 Continued 6

Class Name Description
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
XmlNodeReader The XmlNodeReader class can be used to turn a DOM structure into a
stream.
XMLHTTPRequest Although not part of the System.Xml namespace, the XMLHttpRequest
object (installed with Internet Explorer 5 or later) can be used for sending and
receiving XML content over HTTP from the client browser.
Through instantiating the classes shown in Table 6.1, the DOM can be accessed and manipu-
lated as desired. To better understand how some of these classes can be used, the following
section provides a step-by-step look at the process of creating the DOM structure:
1. An XML document is loaded using the XmlDocument class:
XmlDocument doc = new XmlDocument();
doc.Load(Server.MapPath(“golfers.xml”));
2. The root node is found and assigned to an XmlNode object:

XmlNode node = doc.DocumentElement;
3. The child nodes of the root node are accessed and assigned to a XmlNodeList collection
class:
XmlNodeList nodeList = node.ChildNodes;
4. While working with a particular node, the node’s attributes are assigned to an
XmlNamedNodeMap collection class:
XmlNameNodeMap namedNodeMap = node.Attributes;
The next few sections will discuss how to use these classes in ASP.NET. First up, the XmlNode
class.
The XmlNode Class

The XmlNode class is the blueprint for many of the classes listed in Table 6.1. Access to ele-
ments, attributes, and other items require the assistance of properties and methods exposed by
XmlNode. Every item within an XML document can be considered a node and can be accessed
through the DOM, as mentioned earlier in the chapter. Text is even considered to be a specific
node type called a “text node.”
The different types of nodes that can exist in an XML document are held within an enumera-
tion named XmlNodeType. A text node will be identified within the enumeration as
XmlNodeType.Text. Table 6.2 shows the XmlNodeType enumeration members:
XML Basics
184
TABLE 6.2 XmlNodeType Enumeration Members
Attribute Entity
CDATA EntityReference
Comment None
Document Notation
DocumentFragment ProcessingInstruction
DocumentType SignificantWhitespace
Element Text
EndElement Whitespace
EndEntity XmlDeclaration
The XmlNode class has many properties and methods that can be used to work with different
type of nodes within an XML document. Tables 6.3 and 6.4 list these properties and methods,
respectively.
TABLE 6.3 XmlNode Properties
Attributes Returns an XmlNamedNodeMap collection object containing the list of
attributes for a given node. This applies only to nodes of type
XmlNodeType.Element.
BaseURI Retrieves the base URI of the current node.
ChildNodes Retrieves all children of this node.
FirstChild Returns the first child node of the current node.
HasChildNodes Returns a Boolean indicating whether this node has any child nodes.
InnerText (*) Gets or sets the concatenated values of the node and all its children.
InnerXml (*) Gets or sets the XML markup that represents the children of this
node.
IsReadOnly (*) Returns a Boolean indicating whether the node is read-only.
Item Returns the first child element with the specified name or
NamespaceURI.
LastChild Returns the last child of the current node.
LocalName Returns the name of the current node without the namespace prefix.
Name Returns the name of the current node.
NamespaceURI Returns the namespace URI of the current node.
NextSibling Returns the sibling node directly following this node.
185
CHAPTER 6

THE DOCUMENT
OBJECT MODEL
PROGRAMMING
NodeType Returns the current node’s type.
OuterXml (*) Returns the XML markup representing the current node and all its
children. This differs from the InnerXml property because the current
node’s markup is included.
OwnerDocument Returns the XmlDocument that contains this node.
ParentNode Returns the parent of the current node. Attribute nodes as well as sev-
eral other node types cannot have parents and therefore will return
null.
Prefix Gets or sets the namespace prefix for the current node.
PreviousSibling Returns the sibling node that directly precedes the node.
Value Gets or sets the value of the node.
(*) Indicates an extension to the W3C DOM
TABLE 6.4 XmlNode Methods
Method Description
AppendChild(newNode) Appends a new child node to the current node
at the end of the list of children.
Clone() (*) Creates a duplicate of the current node.
CloneNode(deep) Similar to the Clone() method except it
accepts a Boolean to determine whether all
children of the current node should be cloned
as well.
CreateNavigator() Creates an XPathNavigator Object used to
navigate the current DOM structure.
GetNamespaceOfPrefix(prefix) (*) Returns the namespace URI in the closest
xmlns declaration containing the supplied
prefix.
GetPrefixOfNamespace(URI) (*) Returns the namespace prefix in the closest
xmlns declaration containing the supplied URI.
InsertAfter(newNode,refNode) Inserts a new node after the specified reference
node.
InsertBefore(newNode,refNode) Inserts a new node before the specified refer-
ence node.
XML Basics
186
TABLE 6.4 Continued

Method Description
Normalize() Specific operations such as XPointer depend on a
particular document tree structure to work prop-
erly. The Normalize() method helps to ensure
that the DOM view is the same if it was saved
and then reloaded.
PrependNode(newNode) (*) This is a quick method for adding the new node
as the first child of the current node.
RemoveAll() (*) Removes all children and/or attributes of the
current node.
RemoveChild(oldNode) Removes the specified child node associated with
the node.
ReplaceChild(newNode,oldNode) Removes the old node from the node’s children
nodes and replaces it with a new node.
SelectNodes(xpath) (*) Accepts an XPath statement and returns an
XmlNodeList collection if nodes are matched.
SelectSingleNode(xpath) (*) Accepts an XPath statement and returns an
XmlNode object if one is matched.
Supports(feature,version) This method allows for a specific feature, such as
XML, to be tested for a version. Thus, it could be
used to test whether XML version 1.0 is sup-
ported. A Boolean is returned.
WriteContentTo(xmlWriter) (*) Saves the child nodes of the node to the specified
XmlWriter object.
WriteTo(xmlWriter) (*) Saves the current node to the specified XmlWriter
object.
Using the properties and methods of the XmlNode class is no different from working with
properties and methods of other classes in ASP.NET.
It’s important to note that the XmlNode class is never instantiated like many classes you’re used
to working with in ASP.NET. For example, you don’t do the following:
XmlNode node = new XmlNode();
This is because the XmlNode class is an abstract base class that can be implemented by other
classes (as you’ll see later). An object can, however, be typed as XmlNode. To make this
clearer, you need to think a little bit differently, especially if you haven’t worked much in an
187
CHAPTER 6
object-oriented language. For example, the following code creates and loads an XmlDocument 6
and then gets the root node. This node is then assigned to an object named oNode that is of
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
type XmlNode:
<script language=”c#” runat=”Server”>
void Page_Load(object sender, EventArgs e) {
XmlDocument oDocument = new XmlDocument();
oDocument.Load(Server.MapPath(“golfers.xml”));
XmlNode oNode = oDocument.DocumentElement;
Response.Write(oNode.Name);
}
</script>
After the XmlNode object is created, a call is made to the object’s Name property. The value
returned by the property is written out to the Web page using a call to the Response object’s
Write() method. The sample uses the XmlDocument class, which we haven’t covered yet.
Don’t worry about its functionality at this point, because the next section will explain it in
greater detail.
Some of the most common properties you’ll use with the XmlNode class include Name,
NodeType, Value, and InnerText. Other common properties that can be used to access
specific nodes include the FirstChild, NextSibling, and ChildNodes properties. Listing 6.2
shows an example of using some of the properties mentioned previously:
LISTING 6.2 Using XmlNode Properties

2: <script language=”c#” runat=”Server”>
3: void Page_Load(object sender, EventArgs e) {
4: XmlDocument oDocument = new XmlDocument();
5: oDocument.Load(Server.MapPath(“golfers.xml”));
6: XmlNode oNode =
7: oDocument.DocumentElement.ChildNodes[0].ChildNodes[0].ChildNodes[1];
8: Response.Write(“<b>nodeName:</b> “ + oNode.Name + “<br>”);
9: if (oNode.NodeType == XmlNodeType.Element) { // Element type node
10: Response.Write(“<b>Element’s text node’s nodeValue:</b> “);
11: Response.Write(oNode.ChildNodes[0].Value + “<br>”);
12: Response.Write(“<b>Element’s InnerText:</b> “ + oNode.InnerText +
13: “<br>”);
14: }
15: }
16: </script>
XML Basics
188
NOTE
It is much more efficient to use the FirstChild and NextSibling properties rather
than the ChildNodes property. The ChildNodes property exists because it is in the
W3C spec and was included here simply to demonstrate how it handles indexing.
Lines 6–12 of Listing 6.2 use properties of the XmlNode class to get to other child node objects
and then write out the node name and value. Line 6 first gets to the root node by using the
DocumentElement property of the XmlDocument object (discussed next). The code then pro-
ceeds to get the first child node of the root (golfer), the first child node of the golfer node
(name), and the second child node of the name node (lastName). The remaining code simply
writes out the lastName node’s Name, Value, and Innertext. An if statement was added sim-
ply to check whether the node being analyzed is of type element through comparing it to the
XmlNodeType enumeration (line 9).
Breaking the XmlNode Class Down into Pieces

Up to this point, we’ve assumed that the most specific class that can be used to manipulate an
XML document is the XmlNode because it works with individual nodes. However, the XmlNode
class can be broken down into distinct classes, depending on what type of node is being
accessed. For example, at times you will want to work with certain properties and methods cre-
ated especially for CDATA sections. At other times, you may need to manipulate a text node.
Table 6.5 lists the different classes along with a basic description of their purpose. All these
classes inherit (indirectly or directly) from the XmlNode class.
TABLE 6.5 Other DOM Classes That Extend the XmlNode Class
Object Name NodeType Description
XmlAttribute Attribute Represents an attribute object.
XmlCDataSection CDATA Prevents blocks of text from
being parsed by an XML
parser.
XmlComment Comment Represents an XML comment.
XmlDocumentFragment DocumentFragment Can be used to create tree sec-
tions for insertion into a docu-
ment. Useful for creating
sibling nodes to be added to a
parent.
XmlDocumentType DocumentType Contains information associated
with the document type declara-
tion.
189
CHAPTER 6

Object Name NodeType Description
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
XmlElement Element Represents an XML document
element object.
XmlEntity Entity Represents an entity found
within an XML document.
XmlEntity Reference EntityReference Represents an entity reference
node.
XmlNotation Notation Can contain a notation declared
in a DTD.
XmlProcessingInstruction ProcessingInstruction Represents a processing
instruction.
XmlText Text Represents the text content of an
element or attribute.
Because these classes inherit from the base XmlNode class, they all expose a base set of
properties and methods. For a full listing of these, consult the .NET SDK documentation.
The XmlDocument Class

For an XML document to be parsed and loaded into a DOM structure, it must first be loaded
using an XmlDocument class. This class takes care of providing the functionality to load (or
create) XML documents and find the root node of the documents. Without it, the other classes
mentioned previously would have no access to an XML document’s DOM.
The XmlDocument class actually inherits from the XmlNode class (the document is itself a node)
and extends it by adding methods such as Load() and LoadXml(). Thus, in reality the XmlNode
object provides most of the base functionality to work with and manipulate nodes.
XmlDocument Object Properties and Methods

Before jumping into some more examples of using the XmlDocument class, let’s take a look at
the properties and methods that it exposes. Table 6.6 contains the properties, and Table 6.7
contains the methods.
TABLE 6.6 XmlDocument Class Properties
Attributes Returns an XmlAttributeCollection object containing the
list of attributes for a given node. This applies only to nodes
that are of type XmlNodeType.Element.
XML Basics
190
TABLE 6.6 Continued

BaseURI Returns the base URI of the current node.
ChildNodes Retrieves all children of the current node.
DocumentElement Returns the root XmlElement node (outermost element) of
the XML document.
DocumentType Returns the document type node that specifies the DTD
(if one exists) for the XML document.
FirstChild Returns the first child node of the current node.
HasChildNodes Returns a Boolean indicating whether this node has any
child nodes.
Implementation Returns the XmlImplementation object for the XML
document.
InnerText (*) Gets or sets the concatenated values of the node and all its
children.
InnerXml (*) Gets or sets the XML markup that represents the children of
this node.
IsReadOnly (*) Returns a Boolean indicating whether the current node is
read-only.
Item Returns the first child element with the specified name or
NamespaceURI.
LastChild Returns the last child of the current node.
LocalName Returns the name of the current node without the namespace
prefix.
Name Returns the name of the current node.
NamespaceURI Returns the namespace URI of the current node.
NameTable (*) Gets the XmlNameTable associated with this implementation.
This allows for more efficient node name comparisons to
occur because the XmlNameTable uses object pointer compar-
isons rather than string comparisons. For example, when a
node name such as “Order” is returned many times, all com-
parisons will return the same string object each time because
a pointer to that object is being used. An example of using
this was shown in Chapter 5.
NextSibling Returns the sibling node directly following this node.
NodeType Returns the current node’s type.
191
CHAPTER 6

THE DOCUMENT
OBJECT MODEL
PROGRAMMING
OuterXml (*) Returns the XML markup representing the current node and
all its children. This differs from the InnerXml property
because the current node’s markup is included.
OwnerDocument Returns the XmlDocument that contains this node.
ParentNode Returns the parent of the current node. Attribute nodes as
well as several other node types cannot have parents and
therefore will return null.
Prefix Gets or sets the namespace prefix for the current node.
PreserveWhiteSpace (*) Determines whether whitespace will be preserved in the
XML document.
PreviousSibling Returns the sibling node that directly precedes the current
node.
Value Gets or sets the value of the current node.
TABLE 6.7 XmlDocument Class Methods
Method Description
AppendChild(newNode) Appends a new child node to the current node.
Clone() (*) Creates a duplicate of the current node.
CloneNode(deep) Similar to the Clone() method except it accepts
a Boolean to determine whether all children of
the current node should be cloned as well.
CreateAttribute(name) Creates an XmlAttribute node with the argu-
ment name. Other override methods exist.
CreateCDataSection(text) Creates a XmlCDataSection node that contains
the argument text.
CreateComment(text) Creates a XmlComment node that contains the
argument text.
CreateDocumentFragment() Creates an empty XmlDocumentFragment
object. This can then be used to create indepen-
dent sections of a document. No namespace can
be used with this method.
XML Basics
192
TABLE 6.7 Continued

Method Description
CreateDocumentType(name, publicID, Creates a new XmlDocumentType node. The
systemID,internalSubset) (*) internalSubset argument allows entity defini-
tions to be passed in that will be part of the
DTD.
CreateElement(name) Creates an empty XmlElement node using the
argument name. Other override methods exist.
CreateEntityReference(name) Creates a new EntityReference object with
the argument name.
CreateNavigator() Creates an XPathNavigator Object used to
navigate the current DOM structure.
CreateNode(type,name, Creates a node using the specified type, name,
namespace_uri) (*) and namespace uri. Other override methods
exist.
CreateProcessingInstruction) Creates a processing instruction node that con
(target,data) tains the specified target and data. Target would
refer to what processing instruction you are try-
ing to add to in the XML document. For exam-
ple, you may want to add an encoding type to
the xml processing instruction.
CreateSignificantWhiteSpace Creates a significant whitespace node. The text
(text) (*) value passed in must contain one of the follow-
ing characters: ,
, , or .
CreateTextNode(text) (*) Creates a text node that contains the argument
text. This node can then be added to an
XmlElement node.
CreateWhitespace(text) (*) Creates an XmlWhiteSpace type node. The text
value passed in must contain one of the follow-
ing characters: ,
, , or .
CreateXmlDeclaration(version, Creates an XML declaration with the associated
encoding,standalone) (*) values.
GetElementByID(id) Returns the element with the specified id
attribute.
GetElementsByTagName(name) Returns a collection of elements that have the
specified name. Other override methods exist.
GetNamespaceOfPrefix(prefix) (*) Returns the namespace URI in the closest
xmlns declaration containing the supplied
prefix.
193
CHAPTER 6

Method Description
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
GetPrefixOfNamespace(URI) (*) Returns the namespace prefix in the closest
xmlns declaration containing the supplied URI.
ImportNode(node,deep) Imports a node (and possibly all descendants
depending on the Boolean value of deep) from
another document into the current document.
InsertAfter(newNode,refNode) Inserts a new node after the specified reference
node.
InsertBefore(newNode,refNode) Inserts a new node before the specified refer-
ence node.
Load(filePath or URL) (*) Loads an XML document from the location
specified in the argument. This can be a file
path or URL. Alternatively, other override
methods allow a document to be loaded from a
TextReader object, a stream, or an XmlReader
object.
LoadXML(string) (*) Loads an XML document using the argument
string. For example, you can create an XML
document on the fly by using:
doc.LoadXml(“<root><child>test</child>
</root>”)
Normalize() Specific operations such as XPointer depend on

a particular document tree structure to work
properly. The Normalize() method helps to
ensure that the DOM view is the same if it was
saved and then reloaded.
PrependChild(newNode) (*) This is a quick method for adding the new node
as the first child of the current node.
ReadNode(XmlReader) (*) Creates an XmlNode object based on the infor-
mation in the XmlReader. The reader must be
positioned on a node or attribute. This provides
a fast way to create XmlNodes based on XML
tokens found in the XmlReader stream.
RemoveAll() (*) Removes all children and/or attributes of the
current node.
RemoveChild(oldNode) Removes the specified child node associated
with the current node.
XML Basics
194
TABLE 6.7 Continued

Method Description
ReplaceChild(newNode,oldNode) Removes the old node from the current node’s
children nodes and replaces it with a new node.
Save(string) (*) Saves the XML document to the specified loca-
tion. The document can be saved to a file or
text-writer object. Other override methods exist.
SelectNodes(xpath) (*) Accepts an XPath statement and returns an
XmlNodeList collection if nodes are matched.
SelectSingleNode(xpath) (*) Accepts an XPath statement and returns an
XmlNode object if one is matched.
Supports(feature,version) This method allows for a specific feature, such
as XML, to be tested for a version. Thus, it
could be used to test whether XML version 1.0
is supported. A Boolean is returned.
WriteContentTo(xmlWriter) (*) Saves the child nodes of the current node to the
specified XmlWriter object.
WriteTo(xmlWriter) (*) Saves the current node to the specified
XmlWriter object.
Now that you’ve seen all that the XmlDocument class has to offer, let’s see how to use a few of
its more important properties and methods. If you’re using Visual Studio.NET, you’ll be happy
to know that the built-in IntelliSense feature will provide you with access to all the properties
and methods associated with a particular XML object.
If you’re not using Visual Studio, don’t be too concerned. There will be plenty of examples in
the next few sections that will show you just how easy the XmlDocument class and its related
classes are to work with.
Loading an XML Document using the XmlDocument Class

Accessing an XML document located on a file server or out on the Internet involves using only
a few properties and one method of the XmlDocument object. In the following example you’ll
see how to instantiate an XmlDocument object and load an XML document located in the same
virtual directory as the ASP.NET page. The document will be loaded by calling the object’s
Load() method.

<script language=”C#” runat=”Server”>
195
CHAPTER 6

try {
6
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
}
catch (Exception exc) {
Response.Write(exc.ToString());
}
oDocument.Save(Response.Output);
}
</script>
This code will load the golfers.xml XML document and write it out to the Response object
for display in the browser. Had the golfers.xml file path not been found, an error message
stating, System.IO.FileNotFoundException: Couldn’t find file. would appear in the
browser because a try....catch block was implemented.
The Save() method can also save to a file assuming write access is provided to the IUSR
account in IIS (see the IIS administration manual for more information on the IUSR account).
However, this method does not allow for saving to a URL. Although this is a limitation, it
makes sense especially because the remote server may not be under your control. If you do
have control over the remote server, getting around this limitation can be accomplished by
posting the XML document to an ASP.NET (or other) page located on the remote server or by
using classes found within the System.Net namespace to push the XML using FTP, HTTP
PUT, and so on. This page can then parse the document and perform the save operation.
Saving updates made to an XML document can be accomplished by calling the
XmlDocument’s Save() method:
String sXML = “<?xml version=\”1.0\”?><root><test>”;
sXML += “<testChild>Testing!</testChild></test></root>”;
String savePath = Server.MapPath(“testNodes.xml”);
oDocument.LoadXml(sXML);
Response.Write(“Document saved to: “ + savePath + “<p>”);
Response.Write(“Click <a href=\”testNodes.xml\”>here</a> to view the
➥file”);
oDocument.Save(savePath);
}
</script>
Although the preceding example saves to a file, the Save() method has several override meth-
ods that allow a document to be saved to a stream object, such as an XmlTextWriter class
(discussed in Chapter 5 and again later in this chapter).
XML Basics
196
There may be times when you have an XML document in the form of a string and would like
to parse and load it into the DOM. The XmlDocument class provides this capability through its
LoadXML() method (shown in the previous example). The following code segment contains a
string named SXML that contains a short XML document. This string can be parsed and then
loaded into the DOM as shown next:
<%@ Import Namespace=”System.NewXml” %>
String sXML = “<?xml version=\”1.0\”?><root><test>”;
SXML += “<testChild>Testing!</testChild></test></root>”;
oDocument.LoadXml(sXML);
Response.ContentType = “text/xml”;
oDocument.Save(Response.Output);
}
</script>
NOTE
This input string must be a valid Unicode string. If you have encoded XML bytes in
memory, use the Load() method that takes a Stream object.
Creating Nodes Using the XmlDocument Class

As you read through the methods associated with the XmlDocument class, you may have
noticed several starting with the word “Create.” Because the XmlDocument class is responsible
for the overall document (rather than to a single node), it has the capability of creating any
item within the context of the XML document that was loaded. This comes in handy when you
need to create another child element or attribute for a particular node. The XmlDocument class
can create a variety of items. These include the following:
• Elements
• Attributes
• Comments
• Processing instructions
• CDATA sections
• Nodes
• Text nodes
• Empty document fragments
197
CHAPTER 6
• Entity references 6
• DTD declarations
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
• XML declarations
• Whitespace
• Significant whitespace
Creating different items using the XmlDocument class involves instantiating the class, loading
the XML document (or XML string), calling the appropriate create method, and then calling
one of the append/prepend methods to add the new elements to an existing node in the docu-
ment. Listing 6.3 loads an XML string into the DOM and then creates two empty element
nodes named testB and testC. These nodes are then appended to the root node of the
document.
LISTING 6.3 Adding Nodes with the XmlDocument Class

4: String sXML = “<?xml version=\”1.0\”?><root><testA>”;
5: sXML += “<testChild>Testing!</testChild></testA></root>”;
7: oDocument.LoadXml(sXML);
8: XmlNode oRoot = oDocument.DocumentElement;
9: try {
10: XmlNode oElement1 = oDocument.CreateElement(“testB”);
11: XmlNode oElement2 = oDocument.CreateElement(“testC”);
12: oRoot.AppendChild(oElement1);
13: oRoot.AppendChild(oElement2);
14: }
15: catch(Exception exc) {
16: Response.Write(exc.ToString());
17: }
18: Response.ContentType =”text/xml”;
19: oDocument.Save(Response.Output);
20: }
21: </script>
After creating the two new nodes using the CreateElement() method, the XML document root
is found by calling the XmlDocument’s DocumentElement property. Next, the AppendChild()
method is called (lines 12 and 13), which adds the newly created nodes directly under the root
node. The resulting XML document now includes the newly added testB and testC elements:
XML Basics
198
<root>
<testA>
<testChild>Testing!</testChild>
</testA>
<testB/>
<testC/>
</root>
Before moving on to the next topic, let’s consider an alternative method for writing out
XML nodes to a document. In Chapter 5 you were provided with an in-depth look at the
XmlTextReader class. This class is a forward-only stream that can be used to read large XML
documents quickly without using much memory. You’ll recall that the XmlTextReader class is
complemented by an XmlTextWriter class. This class performs the task of writing to an XML
document in a forward-only, cursor-style manner.
The next example shown in Listing 6.4 demonstrates how the XmlTextWriter class can be used
to create an XML fragment that is then inserted into a master XML document. The alternative
would be to create the nodes using the XmlDocument or a related class and then place each
node within the master document. The pros and cons of this approach follow.
LISTING 6.4 Using the XmlTextWriter and XmlDocument Classes Together

3:
6: String masterDoc = Server.MapPath(“golfers.xml”);
7: XmlTextWriter writer = null;
8: StringBuilder sb = null;
9: StringWriter sw = null;
10: try {
11: sb = new StringBuilder();
12: sw = new StringWriter(sb);
13: writer = new XmlTextWriter(sw);
15: writer.WriteStartElement(“golfer”, null);
16: writer.WriteAttributeString(“skill”,”moderate”);
17: writer.WriteAttributeString(“handicap”,”12”);
18: writer.WriteAttributeString(“clubs”,”Taylor Made”);
19: writer.WriteAttributeString(“id”,”1111”);
20: writer.WriteElementString(“firstName”,null,”Paul”);
21: writer.WriteElementString(“lastName”,null,”Allsing”);
22: writer.WriteStartElement(“favoriteCourses”, null);
199
CHAPTER 6
LISTING 6.4 Continued 6

THE DOCUMENT
OBJECT MODEL
PROGRAMMING
26: writer.WriteAttributeString(“name”,”Ocotillo”);
29: writer.WriteAttributeString(“city”,”Tempe”);
31: writer.WriteAttributeString(“name”,”Ken McDonald”);
36: writer.WriteAttributeString(“name”,”Ahwatukee CC”);
38: writer.WriteEndElement(); //favoriteCourses
39: writer.WriteEndElement(); //golfer
40: writer.Flush();
41:
44: doc.LoadXml(sb.ToString());
45: XmlDocumentFragment frag = doc.CreateDocumentFragment();
46: foreach (XmlNode node in doc.ChildNodes) {
47: frag.AppendChild(node);
48: }
49: doc.Load(masterDoc);
50: doc.DocumentElement.PrependChild(frag);
51: doc.Save(Response.Output);
52: writer.Close();
53: sw.Close();
54: }
56: Response.Write(e.ToString());
57: }
58: finally {
60: writer.Close();
61: }
62: if (sw != null) {
63: sw.Close();
64: }
65: }
66: }
67: </script>
XML Basics
200
By using the XmlTextWriter to create the XML fragment, you eliminate having to rely on the
XmlDocument to create objects one at a time and then insert them into the DOM tree. Although
this can simplify the process of adding nodes, some performance implications are present that
you may not realize. First, the XML fragment that is created must still be loaded into a DOM
structure. Second, the master XML document must also be loaded into the DOM so that the
fragment can be inserted into it. Creating these DOM structures takes time and resources.
Adding nodes by using the XmlDocument class alone can minimize these performance costs
because the DOM structure is loaded and parsed only once. The moral of the story is that if
you need to randomly add one or more nodes into an existing XML document and optimal per-
formance and memory management is an absolute necessity, use the XmlDocument class to cre-
ate the new nodes. If you’re creating an XML document from scratch and simply saving it to a
location, use the XmlTextWriter class.
The XmlNodeList Class

At times, accessing a specific node won’t be all you need to do in an XML project. You may
need to access a group of nodes and enumerate them to look for a certain value, delete a node,
add a node, or perform an update. The XmlNodeList class allows for these types of operations.
Let’s take a look at the XmlNodeList’s properties and methods in tables 6.8 and 6.9.
TABLE 6.8 XmlNodeList Class Properties
Count Returns the number of nodes in the XmlNodeList collection starting with a base of 1.
TABLE 6.9 XmlNodeList Class Methods
Method Description
GetEnumerator() Allows the collection of nodes to be enumerated using a simple
foreach iteration.
Item(index) Returns the node in the collection at a position equal to the index passed in.
The first node will have an index equal to 0.
The next example shows how to use the XmlNodeList class:

<script language=”C#” runat=”server”>
public void Page_Load(Object Src, EventArgs E) {
XmlNodeList oNodeList =
201
CHAPTER 6
oDocument.DocumentElement.GetElementsByTagName(“golfer”);
Response.Write(“<b>Length of NodeList:</b> “ + oNodeList.Count +
6
THE DOCUMENT
OBJECT MODEL
“<p>”);
PROGRAMMING
Response.Write(“<b>For/Next Loop</b><br>”);
for (int i=0;i<oNodeList.Count;i++) {
Response.Write(oNodeList[i].Name + “<br>”);
}
Response.Write(“<p><b>For/Each Loop</b><br>”);
foreach (XmlNode node in oNodeList) {
Response.Write(node.Name + “<br>”);
}
}
</script>
The previous code shows two ways of walking through an XmlNodeList collection. The first
uses a for..next loop and the second uses a foreach loop. This sample also shows how to
use the Count property as well as the Item() method. You’ll notice, however, that the Item
method is abbreviated by using oNodeList[i] rather than oNodeList.Item[i].
NOTE
Using the foreach mechanism is more efficient then doing a for..next loop. This is
because oNodeList.Count has to check to see what the actual count is every time you
use it in the loop. To make the for..next version more efficient than shown, it would
be better to store the value returned by oNodeList.Count in a local variable and ref-
erence this during the looping process.
The XmlNamedNodeMap Class

The XmlNodeList object allows for the retrieval of a collection of nodes, but what if a collec-
tion of attributes are within a node that you need access to? The XmlNamedNodeMap object
solves this problem by allowing a group of attributes to be put into a collection and then
iterated through using looping structures.
NOTE
The XmlNamedNodeMap class can also be used to hold other node collections, such as
notations or entities. If you’re interested in working with a collection of attributes,
the XmlNamedNodeMap class can be used; however, the XmlAttributeCollection class
extends this class to provide additional functionality specific to attributes.
XML Basics
202
Let’s take a look at the properties and methods of this object in tables 6.10 and 6.11.
TABLE 6.10 XmlNamedNodeMap Class Properties
Count Returns the number of nodes in the XmlNamedNodeMap collection.
TABLE 6.11 XmlNamedNodeMap Class Methods
Method Description
GetEnumerator() Allows the collection of nodes to be enumerated using a
simple foreach iteration.
Item(index) Returns the node in the collection at a position equal to the
index passed in. The first node will have an index equal to 0.
GetNamedItem(name) Returns the appropriate XmlNode from the XmlNamedNodeMap
collection.
RemoveNamedItem(name) Removes the appropriate XmlNode (an attribute) from the
XmlNamedNodeMap collection.
SetNamedItem(node) Adds an XmlNode to the collection using the node’s Name property.
The process of using XmlNamedNodeMap properties and methods is demonstrated in Listing 6.5.
LISTING 6.5 Working with the XmlNameNodeMap Class

4: Response.Write(“<font size=\”5\” color=\”#02027a\”>”);
5: Response.Write(“Working with the NamedNodeMap and “ +
6: “AttributeCollection Objects”);
7: Response.Write(“</font><p>”);
9: oDocument.Load(Server.MapPath(“golfers.xml”));
10: XmlNode oNode = oDocument.DocumentElement.FirstChild;
11: XmlNamedNodeMap oNamedNodeMap = oNode.Attributes;
12: Response.Write(“<p><b>XmlNamedNodeMap For/Each Loop</b><br>”);
203
CHAPTER 6

13: foreach (XmlAttribute att in oNamedNodeMap) {
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
14: Response.Write(att.Name + “ = “ + att.Value + “<br>”);
15: }
16: Response.Write(“<p><b>XmlAttributeCollection For/Next “ +
17: “Loop</b><br>”);
18: XmlAttributeCollection oAttCol = oNode.Attributes;
19: for (int i=0;i<oAttCol.Count;i++) {
20: Response.Write(oAttCol[i].Name + “ = “ + oAttCol[i].Value +
21: “<br>”);
22: }
23:
24: Response.Write(“<p><b>XmlAttributeCollection <i>SetNamedItem()” +
25: “</i>”);
26: Response.Write(“ using XmlDocument: test</b><br>”);
27: XmlAttribute oNewAtt = oDocument.CreateAttribute(“test”);
28: oNewAtt.Value = “Hi!”;
29: oNode.Attributes.SetNamedItem(oNewAtt);
30: XmlAttributeCollection oAttCol2 = oNode.Attributes;
31: for (int i=0;i<oAttCol2.Count;i++) {
32: Response.Write(oAttCol2[i].Name + “ = “ + oAttCol2[i].Value +
33: “<br>”);
34: }
35: Response.Write(“<p><b>XmlAttributeCollection <i>” +
36: “RemoveNamedItem()</i>:”);
37: Response.Write(“ skill</b><br>”);
38: oAttCol2.RemoveNamedItem(“skill”);
39: for(int i=0;i<oAttCol2.Count;i++) {
40: Response.Write(oAttCol2[i].Name + “ = “ + oAttCol2[i].Value +
41: “<br>”);
42: }
43: Response.Write(“<p><b>XmlAttributeCollection <i>GetNamedItem()” +
44: “</i>: skill</b><br>”);
45: try {
46: Response.Write(oAttCol2.GetNamedItem(“skill”).Value);
47: }
49: Response.Write(“<font color=\”#ff0000\”>” +
50: e.GetBaseException() + “</font>”);
51: Response.Write(“<br>The above error occurred because” +
52: “ the skill attribute no”);
53: Response.Write(“ longer exists.”);
54: }
55: }
56: </script>
XML Basics
204
The previous code shows different ways of walking through an XmlNamedNodeMap collection.
The first uses a foreach loop (lines 13–15) and the second uses a for loop (lines 19–21). Line
30 introduces a class that was briefly mentioned at the beginning of this section. This class is
referred to as the XmlAttributeCollection and it can do more than simply allow attributes
to be enumerated through. For example, it has several useful methods in addition to those
found on the XmlNamedNodeMap class that can be used for inserting or removing attributes
from a collection. These additional methods include Append(), CopyTo(), InsertAfter(),
InsertBefore(), Prepend(), Remove(), RemoveAll(), and RemoveAt(). Listing 6.5 shows how
the SetNamedItem(), RemoveNamedItem(), and GetNamedItem() methods (discussed in Table
6.11) can be used. Both methods accept the node that should be set or removed in the
collection.
Selecting Nodes Within the DOM Using Xpath

Frequently in your ASP.NET/XML applications, you’ll need to access a specific node or nodes
located within the DOM. Although this can be done by iterating over all the nodes within the
DOM structure using looping techniques or by using the GetElementsByTagName() method, a
more targeted approach exists. If you have much experience with MSXML3, these techniques
will be very familiar to you.
Earlier in the chapter, you saw the different methods associated with the XmlNode class. Two
of those methods, (SelectNodes() and SelectSingleNode()), accept XPath statements
as arguments and can be used to select specific nodes from within a DOM structure. The
SelectNodes() method can be used when several nodes (an XmlNodeList) need to be returned
and processed. The nodes that are returned can be iterated over to access child nodes, attrib-
utes, and so on. The process of using the SelectNodes() method is shown next:
public void Page_Load(Object sender, EventArgs E) {
string filePath = Server.MapPath(“requests.xml”);
doc.Load(filePath);
XmlNodeList contactNodes = doc.SelectNodes(“//contactName[.=’Dave’]”);
int count = contactNodes.Count;
for (int i=0;i<count;i++) {
Response.Write(contactNodes.Item(i).Name + “ = “);
Response.Write(contactNodes.Item(i).InnerText + “<br />”);
}
}
</script>
205
CHAPTER 6
This example will search through the requests.xml file and return all element nodes (no mat- 6
ter how deeply nested they are) that have a name equal to contactName and a text node value of
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
“Dave.” If the statement does not match any nodes, no error will be raised. Rather, the looping
process will not occur because the Count property of the XmlNodeList will be 0.
The SelectSingleNode() method works in a similar manner. However, instead of returning an
XmlNodeList collection, it will return only a single node. In cases where an XPath statement
matches more than one node, the first node will be returned. An example of using the
SelectSingleNode() method is shown next:

string filePath = Server.MapPath(“requests.xml”);
doc.Load(filePath);
string xpath = “//request[@id=’462001|32633|PM’]/contactName”;
XmlNode contactNode = doc.SelectSingleNode(xpath);
if (contactNode != null) {
Response.Write(contactNode.Name + “ = “ + contactNode.InnerText);
}
}
</script>
Putting It All Together

Before continuing on to the next section, let’s take a brief pause and examine how a few of the
classes discussed in previous sections can be used in conjunction with each other. Listing 6.6
will use the XmlTextReader, XmlDocument, XmlNode, XmlNodeList, and XmlNamedNodeMap
classes to construct an XML document’s structure in the browser. The custom Validator class
(found in the XmlParsers.Validation namespace) shown in Chapter 5 will be used as well to
validate the XML document against an XSD schema before processing it.
LISTING 6.6 Using the System.Xml Classes

1: <%@ Import Namespace=”System.Text” %>
3: <%@ Import Namespace=”System.Xml.Schema” %>
4: <%@ Import Namespace=”XmlParsers.Validation” %>
5:
7: public class XmlDocumentTest {
8: StringBuilder output = new StringBuilder();
9: int indent = 1;
XML Basics
206
LISTING 6.6 Continued

10:
11: public string ParseDoc(string xmlFilePath, string valFilePath,
12: string logFile) {
14:
15: try {
16: XmlSchemaCollection schemaCol= new XmlSchemaCollection();
17: schemaCol.Add(“http://www.golfExample.com”,valFilePath);
18: Validator validator = new Validator();
19: bool status = validator.Validate(xmlFilePath,
20: schemaCol,true,logFile);
21: if (status) {
22: xmlReader = new XmlTextReader(xmlFilePath);
23: XmlDocument xmlDoc = new XmlDocument();
24: xmlDoc.Load(xmlReader);
25: XmlNode oNode = xmlDoc.DocumentElement;
26: WriteNodeName(oNode,0);
27: XmlNodeList oNodeList = oNode.ChildNodes;
28: foreach (XmlNode node in oNodeList) {
29: XmlNode oCurrentNode = node;
30: if (oCurrentNode.HasChildNodes) {
31: WriteNodeName(oCurrentNode,indent);
32: WalkTheTree(oCurrentNode);
33: indent—;
34: } else {
36: }
37: }
38: } else {
39: output.Append(“Validation of golfers.xml failed!” +
40: “ Check the log file for information” +
41: “ on the failure.”);
42: }
43:
44: }
46: output.Append(“Exception: {0}”, e.ToString());
47: }
48: finally {
49: if (xmlReader != null)
51: }
52:
207
CHAPTER 6

53: return output.ToString();
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
54: }
55:
56: private void WalkTheTree(XmlNode oNodeToWalk) {
57: indent++;
58: XmlNodeList oNodeList = oNodeToWalk.ChildNodes;
59: for (int j=0;j<oNodeList.Count;j++) {
60: XmlNode oCurrentNode = oNodeList[j];
63: WalkTheTree(oCurrentNode);
64: indent—;
65: } else {
67: }
68: }
69: }
70:
71: private void WriteNodeName(XmlNode node,int iIndent) {
72: int h = 0;
73: for (int k=0;k<(iIndent * 10);k++) {
74: output.Append(“ ”);
75: }
76: if (node.NodeType == XmlNodeType.Text) { // Text node
77: output.Append(“<font color=’#ff0000’>” + node.Value +
78: “</font><br>”);
79: } else {
80: if (node.Attributes.Count > 0) {
81: XmlNamedNodeMap oNamedNodeMap = node.Attributes;
82: output.Append(“<b>” + node.Name + “</b> (“);
84: if (h!=0) output.Append(“  ”);
85: h++;
86: output.Append(“<i>” + att.Name +
87: “</i>=\”” +
88: att.Value + “\””);
89: }
90: output.Append(“)<br>\n\n”);
91: } else {
92: output.Append(“<b>” + node.Name +
93: “</b><br>\n\n”);
94: }
95: } // end if
XML Basics
208

96: } // WriteNodeName
97: } // XmlDocumentTest
98:
99: void Page_Load(Object sender, EventArgs E) {
100: string xmlFilePath = Server.MapPath(“golfers.xml”);
101: string valFilePath = Server.MapPath(“golfers.xsd”);
102: string logFile = Server.MapPath(“validationErrors.log”);
103: XmlDocumentTest xmlDocument = new XmlDocumentTest();
104: xml.InnerHtml = xmlDocument.ParseDoc(xmlFilePath,
105: valFilePath,logFile);
106: }
107: </script>
108: <html>
109: <body>
110: <font size=”5” color=”#02027a”>Working with XML Objects</font>
111: <p />
112: <div id=”xml” runat=”server” />
113: </body>
114: </html>
The following list provides a step-by-step look at what is being accomplished in the previous
code:
1. The XmlDocumentTest class is instantiated (line 103).
2. The XmlDocumentTest class’s ParseDoc() method is called and passed the path of the
XML document to be parsed along with the paths of the file to validate against and the
log file to write errors to (lines 104 and 105).
3. The ParseDoc() method creates an XmlSchemaCollection object and adds in the
name of the file to validate against (lines 16 and 17). The Validate() method of the
Validator class (shown in Chapter 5) is then called, which returns a Boolean value and
assigns it to a variable named status (line 19). If status is true, processing of the
XML document continues. Otherwise, a message is returned to the end user stating that
the file was not valid (lines 39–41).
4. An XmlTextReader object is instantiated and passed into the Load() method of the
XmlDocument class (Lines 22–24).
5. The XML document’s root node is found and assigned to the XmlNode object named
oNode (line 25).
6. A call is made to the WriteNodeName() method, which accepts an XmlNode as an

argument as well as an index value (line 26).
209
CHAPTER 6
7. The WriteNodeName() method sets how far the node should be indented and checks to 6
see what type of node it was passed. Text nodes are written out in red and element nodes
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
are bold. If a node is an element type and has attributes, the attributes are enumerated
using a foreach command. Each attribute and its associated value are italicized (lines
71–96).
8. On leaving the WriteNodeName() method, an XmlNodeList is created containing all chil-
dren of the root node. Each of these children is looped through one by one. In doing so,
the WriteNodeName() method is called to write out each node’s information. If the node
being analyzed contains its own child nodes, the WalkTheTree() method is called and
the node containing the child nodes is passed as an argument (Lines 56–69).
9. The WalkTheTree() method performs the same steps as shown in step 8. It creates an
XmlNodeList and checks for child nodes located under each node in the list. If child
nodes exists, it recursively calls itself and the process starts over. This occurs until all
child nodes have been analyzed.
10. After all the nodes and their associated child nodes have been analyzed and added to
the output variable, output is returned and written out to the ASP.NET page (lines 104
and 105).
The XmlNodeReader Class

In Chapter 5 you saw how XML documents could be parsed in a forward-only manner using
the XmlTextReader class. This stream-based mechanism is fast and efficient but is read-only.
In cases where the capability to modify an XML document and then stream a portion of the
document to another application is needed, the XmlNodeReader class can be used. This class
inherits all the properties and methods associated with the XmlReader abstract class like the
XmlTextReader does. In fact, aside from a few minor differences in properties and methods,
the only real difference between the XmlTextReader and XmlNodeReader classes is that the
XmlNodeReader accepts an XmlNode object in its constructor. This allows a stream to be started
from any node within a DOM structure. Listing 6.7 shows how this class can be used. You’ll
notice that after the class is instantiated, it looks and acts just like the XmlTextReader class.
LISTING 6.7 Using the XmlNodeReader to Stream a Portion of a DOM Structure

6: doc.Load(Server.MapPath(“requests.xml”));
7: string xpath = “/root/request[2]/contactName”;
XML Basics
210

8: XmlNode contactNode = doc.SelectSingleNode(xpath);
9: if (contactNode != null) {
10: contactNode.InnerText = “Bill Shaw”;
11: XmlNode requestNode = contactNode.ParentNode;
12: XmlNodeReader nodeReader = new XmlNodeReader(requestNode);
13: while (nodeReader.Read()) {
14: if (nodeReader.NodeType == XmlNodeType.Element) {
15: output += indent(nodeReader.Depth*4);
16: output += “<b><” + nodeReader.Name + “</b>”;
17: while (nodeReader.MoveToNextAttribute()) {
18: output += “&nbsp<i>” + nodeReader.Name + “=\”” +
19: nodeReader.Value + “\”</i>&nbsp”;
20: }
21: if (nodeReader.IsEmptyElement) {
22: output += “<b>/></b><br>”;
23: } else {
24: output += “<b>></b><br>”;
25: }
26: } else if (nodeReader.NodeType ==
29: output += “<b></” + nodeReader.Name +
30: “></b><br>”;
31: } else if (nodeReader.NodeType == XmlNodeType.Text) {
32: if (nodeReader.Value.Length != 0) {
34: output += “<font color=#ff0000>” +
35: nodeReader.Value +
36: “<br></font>”;
37: }
38: }
39: }
40: }
41: Response.Write(output);
42: }
43: private string indent(int number) {
44: string spaces = “”;
45: for (int i=0; i < number; i++) {
46: spaces += “ ”;
47: }
48: return spaces;
49: }
50: </script>
211
CHAPTER 6
XMLHTTPRequest Object 6
THE DOCUMENT
OBJECT MODEL
The XMLHTTPRequest object is not actually part of the .NET platform. In fact, the object is
PROGRAMMING
intended to be used on the client side for retrieving data from remote HTTP servers. Although
Internet Explorer 5 or later is the only browser that supports the object at the current time (it
can also be used from within client/server applications), it presents a nice way to call pages
(such as ASP.NET pages) that live on remote servers, especially in an intranet environment.
The data returned from the ASP.NET page can be bound to controls or elements within the
HTML page. The great part about the object is that the call to the remote server can be made
without having to reload the client’s HTML page each time. This provides the end user with a
richer experience.
Using the XMLHTTPRequest object on the client to call remote XML files or ASP.NET pages
can be quite efficient, depending on your application’s needs. The example you will see shortly
shows how to use the object to view different IT request form submissions added to an XML
document. You first saw a very primitive form of this application in Chapter 2, “XML for
ASP.NET Basics,” and the revised version of this application will be covered at the end of this
chapter.
After a bid request has been added to an XML document (named requests.xml), you can
use the XMLHTTPRequest object to view an individual user’s entry. Rather than using the
XMLHTTPRequest object, you could have the user submit a request to an ASP.NET page that
retrieves the data and returns HTML to the user. Doing that would, of course, require the
client’s page to be reloaded each time a new request is submitted to the server. Using the
XMLHTTPRequest object, the user can submit a request to the server and view the desired
request without having to reload the page each time. Before going into the example, take a
moment to familiarize yourself with the XMLHTTPRequest object’s properties and methods in
tables 6.12 and 6.13.
TABLE 6.12 XMLHTTPRequest Object Properties

onreadystatechange This property is used to specify the event handler that should be
called when the readyState property changes. This is a read
write property.
Example:
oXMLHttp.onreadystatechange = XMLState();
XML Basics
212
TABLE 6.12 Continued

readyState Represents the state of the request made to a remote server.
Possible values are
1 = Loading
2 = Loaded
3 = Interactive
4 = Completed
This is a read-only property.
alert(oXMLHttp.readyState);
responseBody Returns the response as an array of unsigned bytes. This is a
read-only property.
Example:
sValue = oXMLHttp.responseStream;
responseStream Returns the response as an IStream
object. This is a read-only property.
Example:
sValue = oXMLHttp.responseStream
responseText Returns the response as a string. This is a read-only property.
Example:
alert(oXMLhttp.responseText);
responseXML Returns the response as a normal XML document. Validation is
turned off for security reasons. This is a read-only property.
Example:
alert(oXMLhttp.responseXML);
status Returns the HTTP status code returned by the server. This is a
read-only property.
Example:
alert(oXMLhttp.status);
statusText Returns the HTTP response status text back from the server as a
string. This is a read-only property.
Example:
alert(oXMLhttp.statusText);
213
CHAPTER 6
TABLE 6.13 XMLHTTPRequest Object Methods

6
Method Description
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
abort() Cancels the current HTTP request.
Example:
oXMLHttp.abort();
getAllResponseHeaders() Retrieves the values of all the HTTP headers delimited by
a carriage-return line feed.
Example:
sHeader = oXMLHttp.getAllResponseHeaders();
getResponseHeader Returns the value of an HTTP header from the response
(header_name) body, depending on what header is passed into the
function.
Example:
sHeader = oXMLHttp.getResponseHeader(“Content-
Type”);
open(method,url,async, Initializes an Msxml2.XMLHTTP request. Accepted argu-

userid,password) ments are the method, URL, and authentication informa-
tion (userid and password) for the request. The userid
and password arguments are optional.
oXMLHttp.open(“POST”,
”http://www.someServer.com”,false);
send() Sends an HTTP request to the server and then waits for a
response. This method can be asynchronous or synchro-
nous, depending on the async argument originally passed
into the open method.
Example:
oXMLHttp.send();
setRequestHeader Determines the name of an HTTP header to send.

(header_name,value) Example:
oXMLHttp.setRequestHeader(“User-Agent”,
“appName”);
As with all the XML objects you’ve worked with to this point, the XMLHTTPRequest object is
very simple to work with after you have a handle on the properties and methods it exposes.
The example that follows shows how some of the properties and methods can be used.
XML Basics
214
As mentioned earlier, the example will create an HTML page that allows a user to view
previously submitted requests to an XML file. The user will be able to select an ID from a
drop-down box and then view the response without having to leave the HTML page.
To start the process, the ASP.NET page (Listing6.8.aspx) fills a drop-down box with infor-
mation about each request in the request.xml file. This is accomplished by using the XmlNode
class’s SelectNodes() method along with several other classes covered earlier in the chapter.
After the user selects a request from the drop-down box, the drop-down’s onChange event fires
and a client-side function named getRequest() is called.
LISTING 6.8 Loading XML Element Data into a Drop-Down Box

4: string jobCode = “”;
5: string contactName = “”;
6: string dateTime = “”;
7: string options = “”;
8: string table = “”;
9: string attID = “”;
10: int intCount = 0;
11:
12: Response.Expires = -1;
13: string ID = Request.QueryString[“id”];
15: try {
17: }
18: catch (Exception c) {
19: Response.Write(“There are currently no requests to view.”);
20: Response.End();
21: }
22: XmlNodeList requestNodeList = doc.SelectNodes(“//request”);
23: if (requestNodeList != null) {
24: int nodeCount = requestNodeList.Count;
25: for (int i=0;i<nodeCount;i++) {
26: XmlNode requestNode = requestNodeList.Item(i);
27: XmlElement requestElement = (XmlElement)requestNode;
28: //Get the ID attribute since we’re on the request node
29: attID = requestElement.GetAttribute(“id”);
30: if (requestNode.HasChildNodes) {
31: XmlNode childNode = requestNode.FirstChild;
32: if (childNode.Name.ToUpper() == “DATETIME”)
33: dateTime = childNode.InnerText;
215
CHAPTER 6

34: XmlNodeList requestChildrenNodeList =
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
35: requestNode.ChildNodes;
36: foreach (XmlNode requestChildNode
37: in requestChildrenNodeList) {
38: if (intCount < 1) {
39: table += “<tr><td>” +
40: requestChildNode.Name.ToUpper() +
41: “:</td>”;
42: table += “<td><div id=’” +
43: requestChildNode.Name +
44: “‘></div></td></tr>”;
45: }
46:
47: switch (requestChildNode.Name.ToUpper()) {
48: case “JOBCODE”:
49: jobCode = requestChildNode.InnerText;
50: break;
51: case “CONTACTNAME”:
52: contactName = requestChildNode.InnerText;
53: break;
54: case “DATETIME”:
55: dateTime = requestChildNode.InnerText;
56: break;
57: }
58: }
59:
60: options += “<option value=\”” + attID + “\”>”;
61: options += jobCode + “ - Entered By: “ +
62: contactName + “ (“ + dateTime + “)”;
63: options += “</option>”;
64: }
65: intCount++;
66: } //foreach
67: } //if
68: optionsDiv.InnerHtml = options;
69: tableRowsDiv.InnerHtml = table;
70: }
71: </script>
72: <html>
73: <head>
74: <meta http-equiv=”Content-Type” content=”text/html;
75: charset=windows-1252”>
76: <SCRIPT LANGUAGE=”javascript”>
77: <!—
XML Basics
216

78: function getRequest() {
79: var form = document.forms[0]
80: if (form.requestID.selectedIndex == 0) return;
81: var index = form.requestID.selectedIndex;
82: var id = form.requestID.options[index].value;
83: var sURL = “http://localhost/testbed/chapter6/” +
84: “Listing6.9.aspx?status=getXML&id=” + id;
85: var oXMLHttp = new ActiveXObject(“MSXML2.XMLHTTP”)
86: oXMLHttp.open(‘GET’, sURL, false);
87: oXMLHttp.send();
88: document.all(“requestData”).XMLDocument =
89: oXMLHttp.responseXML;
90: updateBindings();
91: }
92: function updateBindings() {
93: var oRoot = requestData.XMLDocument.documentElement;
94: for (i=0;i<oRoot.childNodes.length;i++) {
95: var node = oRoot.childNodes(i);
96: if (node.nodeType == 1) {
97: try {
98: document.all(node.nodeName).innerText = node.text;
99: }
100: catch (e) {
101: alert(“Sorry, but the selected Request” +
102: “ ID was not found”);
103: }
104: }
105: }
106: }
107: //—>
108: </SCRIPT>
109: </head>
110: <body bgcolor=”#FFFFFF”>
111: <!— Create our XML Data Island —>
112: <XML id=”requestData”></XML>
113: <form method=”post” action”” name=”requestform”>
114: <table border=”0” cellspacing=”0” cellpadding=”5”
115: width=”85%” bgcolor=”e8e8e8”>
116: <tr bgcolor=”#02027a”>
117: <td colspan=”6” align=”center”>
118: <font face=”arial” color=”#ffffff” size=”4”>
119: INFORMATION TECHNOLOGY SPECIALIST
120: REQUEST REVIEW FORM
121: </font>
217
CHAPTER 6

122: </td>
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
123: </tr>
124: <tr>
125: <td align=”left” valign=”middle” width=”34%”>
126: <font face=”Arial, Helvetica, sans-serif” size=”2”>
127: <b>Select A Request:</b>
128: </font>
129: </td>
130: <td align=”left” valign=”top” colspan=”2”>
131: <select name=”requestID” onChange=”getRequest()”>
132: <option>Select One:</option>
133: <div id=”optionsDiv” runat=”server” />
134: </select>
135: </td>
136: <td valign=”top” align=”left”
137: colspan=”3” width=”27%”>
138: </td>
139: </tr>
140: </table>
141: <table ID=”dataTable” cellpadding=”5”>
142: <div style=”font-family:arial”
143: id=”tableRowsDiv” runat=”Server” />
144: </table>
145: </form>
146: </body>
147: </html>
Figure 6.3 shows a screenshot of the page generated by the code shown in Listing 6.8.
After a user selects a specific IT request from the drop-down box, the getRequest() method
locates the request id the user selected and assigns it to the id variable. Next, a QueryString
is dynamically created that includes the request id the user desires to see. Notice that the URL
points to an ASP.NET page (lines 81–83).
The XMLHTTPRequest object is instantiated and a call is placed to its open() method (line 86).
After this method has been successfully called, the information is sent to the remote server
using the send() method shown in line 87. After the remote server has responded, the resulting
XML document text is placed within an XML data island (named requestData) that is embed-
ded in the Web page. An XML data island is an Internet Explorer 5 or later feature that allows
XML documents to be embedded in a Web page. Using the updateBindings() method (lines
92–106), the data island is used to populate div tags located on the page programmatically. An
alternative way of accomplishing the same task would be to bind the fields to the data island
using the dataSrc and dataFld element attributes.
XML Basics
218
FIGURE 6.3
The IT request form.
On the server side, an ASP.NET page named Listing6.8.aspx checks to see whether a
QueryString variable named status contains a value of getXML. If it does, it takes the id
value that was passed on the QueryString, instantiates an XmlDocument, and calls the
SelectSingleNode() method passing in an XPath statement that includes the request id. The
resulting XML fragment is then returned to the client-side XMLHTTPRequest object.
The code shown in Listing 6.9 shows this process and further illustrates how to use the
XmlNode’s SelectSingleNode() method to select a specific node within the requests.xml
file.
LISTING 6.9 Selecting Nodes with the SelectSingleNode() Method. (This page is called
from listing6.8.aspx.)
5: Response.Expires = -1;
6:
7: string ID = Request.QueryString[“id”];
8: string status = Request.QueryString[“status”];
9:
10: switch (status.ToUpper()) {
11: case “GETXML”:
12: GetRequestXML(ID);
219
CHAPTER 6

13: break;
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
14: default:
15: break;
16: }
17: }
18:
19: private void GetRequestXML(string requestID) {
21: try {
23: }
24: catch (Exception c) {
25: Response.Write(“<root>There are currently no” +
26: “ requests to view.<root>”);
27: }
28: //Perform XPath query using SelectSingleNode() method
29: string xpath = “//request[@id=’” + requestID + “‘]”;
30: XmlNode requestNode = doc.SelectSingleNode(xpath);
31: if (requestNode != null) {
32: Response.Write(requestNode.OuterXml);
33: } else {
34: Response.Write(“<root><request>NO ID</request></root>”);
35: }
36:
37: }
38: </script>
The end result of this process is that the user can select different request IDs and view their
associated data without having to leave the Web page. This solution is server friendly because
only a few calls are made to the server to obtain data; that allows the end user to be more effi-
cient and happy, which hopefully results in fewer calls to the help desk.
Sample Application—Client/Server-Side
Hierarchical XML Menus
Now that you’ve seen what you can do with the main XML classes, let’s leverage that knowl-
edge to construct a menu similar to the Start menu in Windows. XML was chosen for this
application simply because of its capability to support hierarchical structures and because of
the speed with which it can be accessed from a Web server. The application will use an XML
document named menuItems.xml and an ASP.NET file named menus.aspx on the server side.
On the client side, a cascading style-sheet document and a JavaScript document will be used
XML Basics
220
for the Dynamic HTML (DHTML). All these files working together produce the menu applica-
tion shown in Figure 6.4.
FIGURE 6.4
Screenshot of the XML-based menu application.
The foundation of the application is the XML document because it contains the data used for
the menus. A portion of the document is shown next:
<menuItem>
<hyperLink/>
<name>About Us</name>
<menuItem>
<hyperLink>/calendar/default.aspx</hyperLink>
<name>Calendars</name>
</menuItem>
<menuItem>
<hyperLink>/govt/default.aspx</hyperLink>
<name>Government</name>
</menuItem>
<menuItem>
<hyperLink>/pdf/orgchart.pdf</hyperLink>
<name>Organization Charts</name>
</menuItem>
</menuItem>
As you can see, the XML document relies on one of XML’s core strengths: the capability to
describe hierarchical relationships. Looking at the preceding document, you can see that
221
CHAPTER 6
several <menuItem> tags are nested within each other, creating a parent/child relationship. This 6
relationship is used to create menus that can contain submenus as seen in the Windows Start
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
menu.
On the server side, a custom class named XmlMenu has a recursive function used to handle
navigating through the XML document. It uses a few of the classes you’ve seen before, such
as the XmlDocument and XmlNode classes, and a few you may not know about yet, called
ArrayList and StreamWriter. These different classes are used to access node values for dis-
play or to store node information. As the menuItem nodes in the XML document are navigated,
the different menu structures are placed into ArrayLists. After all the arrays are created, a call
to the Response object sends the formatted menus up to the client where the DHTML
JavaScript code takes over.
Now that you have a feel for the application, let’s take a look at the code that starts things off
on the server side (menus.aspx):
<%@ Import Namespace=”XmlHierMenu” %>
<%
// Create paths to xml files
string filepath = Server.MapPath(“menuItems.xml”);
string filepath2 = Server.MapPath(“menuItems2.xml”);
string filepath3 = Server.MapPath(“menuItems3.xml”);
// Following is constructor used to create a static menu file in cases

// where the menu does not change often.
// string saveToFile = Server.MapPath(“test.aspx”);
// XmlMenu menuSave = new XmlMenu(true,saveToFile);
// menuSave.CreateMenu(“menu1”,filepath);
// Instantiate XmlMenu object and call CreateMenu method

XmlMenu menu = new XmlMenu();
Response.Write(menu.CreateMenu(“menu1”,filepath));
Response.Write(menu.CreateMenu(“menu2”,filepath2));
Response.Write(menu.CreateMenu(“menu3”,filepath3));
%>
As shown, the code starts by instantiating the XmlMenu class, which is part of the XmlHierMenu
namespace in the XmlMenu class. After the class is instantiated, a call is made to its
CreateMenu() method. This method accepts the name of the menu and the path to the menu’s
XML document. In this example, three menus named menu1, menu2, and menu3 will be
created for use in an ASP.NET page.
The CreateMenu() method is shown in Listing 6.10:
XML Basics
222
LISTING 6.10 CreateMenu() Method (xmlMenus.cs)
1: public string CreateMenu(string startMenu,string file) {

2: StringBuilder strOutput = new StringBuilder;
3: int i = 0;
4: ArrayList startArray = new ArrayList();
5: string strVariable = “”;
6: string strTemp = “”;
7:
8: XmlDocument XMLDoc = new XmlDocument();
9: try {
10: XMLDoc.Load(file);
11: }
13: strOutput.Append(e.GetBaseException().ToString());
14: return strOutput.ToString();
15: }
16:
17: XmlNodeList nodeList = XMLDoc.DocumentElement.ChildNodes;
18:
19: foreach (XmlNode node in nodeList) {
20: XmlNode currentNode = node;
21: if (currentNode.HasChildNodes == true &&
22: currentNode.ChildNodes.Count>1) {
23: strCurrentMenu = startMenu + “_” + (i+1);
24: string thisMenu = startMenu;
25: if (currentNode.ChildNodes.Count>2) {
26: strVariable = “<span id=\”” + thisMenu + “_span” +
27: (i+1) + “\” class=’cellOff’ “ +
28: “onMouseOver=\”stateChange(‘“ + _
29: strCurrentMenu + “‘,this,” + _
30: intLevel + “)\” onMouseOut=\”” +
31: “stateChange(‘’,this,’’)\”>” + _strImage +
32: currentNode.ChildNodes[1].InnerText +
33: “</span><br>\n”;
34: startArray.Add(strVariable);
35: WalkTree(currentNode);
36: } else {
38: (i+1) + “\” class=’cellOff’” +
39: “onMouseOver=\”stateChange(‘’,this,’’);” +
40: “hideDiv(“ + _intLevel + “)\” onMouseOut=\”” +
41: “stateChange(‘’,this,’’)\” onClick=\”” +
42: “location.href=’” +
43: currentNode.ChildNodes[0].InnerText +
44: “‘\”>” + currentNode.ChildNodes[1].InnerText +
223
CHAPTER 6

45: “</span><br>\n”;
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
46: startArray.Add(strVariable);
47: }
48: }
49: i++;
50: }
51:
52: startArray.TrimToSize();
53: arrayNamesArray.Add(startMenu);
54: for (i=0;i<startArray.Count;i++) {
55: strTemp += startArray[i];
56: }
57: arrayHolderArray.Add(strTemp);
58:
59: //Reverse Array order so we don’t have to worry about the ZIndex
60: arrayHolderArray.Reverse();
61: arrayNamesArray.Reverse();
62:
63: //Loop through arrays and write out divs and their individual content
64: for (i=0;i<_arrayNamesArray.Count;i++) {
65: strOutput.Append(“<div id=’” + _arrayNamesArray[i].ToString() +
“‘ class=’clsMenu’>”);
66: strOutput.Append(_arrayHolderArray[i].ToString());
67: strOutput.Append(“</div>\n”);
68: }
69: arrayHolderArray.Clear();
70: arrayNamesArray.Clear();
71:
72: if (_blnStaticMenus) {
73: StreamWriter writer = new StreamWriter(File.Open(_strSaveToFile,
74: FileMode.OpenOrCreate, FileAccess.Write));
75: writer.Write(strOutput.ToString());
76: writer.Flush();
77: if (writer !=null) writer.Close();
78: }
79: return strOutput.ToString();
80: } //CreateMenus
This method begins by loading the XML document (line 10). It then finds the root node of the
document so that its child nodes can be looped through recursively. As the children are looped
through, a call is made to the HasChildNodes property of the XmlNode class and a count is
taken on how many child nodes the current node has (lines 25–47). If it only has one child
node, then it’s a text node; if it has more than one, the children must be processed recursively.
XML Basics
224
As each node is evaluated, its information is put into a string that will be used on the client
side to manipulate its color and position. This string is then added to an ArrayList named
startArray (lines 34 and 46).
To walk through the child nodes of a menuItem node, a call is made to the WalkTree() method,
which accepts the node to walk through and evaluate. The code for this method is shown in
Listing 6.11:
LISTING 6.11 WalkTree() Method (xmlMenus.cs)
1: private void WalkTree(XmlNode node) {

2: intLevel += 1;
3: string strVariable = “”;
4: string strTemp = “”;
5: ArrayList tempArray = new ArrayList();
6:
7: for (int j=2;j<node.ChildNodes.Count;j++) {
8: XmlNode newNode = node.ChildNodes[j];
9: if (newNode.HasChildNodes == true &&
10: newNode.ChildNodes.Count>2) {
11: // Each node should have a 0=hyperlink and 1=text node
12: // so don’t call the function again if there are just these
13: // children
14: strCurrentMenu += “_” + (j-1);
15: string thisMenu = _strCurrentMenu.Substring(0,
16: strCurrentMenu.Length-2);
18: (j-1) + “\” class=’cellOff’ onMouseOver=\”” +
19: “stateChange(‘“ + _strCurrentMenu + “‘,this,” +
20: _intLevel + “)\” onMouseOut=\”” +
21: “stateChange(‘’,this,’’)\”>” +
22: _strImage + newNode.ChildNodes[1].InnerText +
23: “</span><br>\n”;
24: tempArray.Add(strVariable);
25: WalkTree(newNode);
26: } else {
27: strVariable = “<span id=\”” + _strCurrentMenu + “_span” +
28: (j-1) + “\” class=’cellOff’ onMouseOver=\”” +
29: “stateChange(‘’,this,’’);hideDiv(“ + _intLevel +
30: “)\” onMouseOut=\”stateChange(‘’,this,’’)\”” +
31: “ onClick=\”location.href=’” +
32: newNode.ChildNodes[0].InnerText +
33: “‘\”>” + newNode.ChildNodes[1].InnerText +
34: “</span><br>\n”;
35: tempArray.Add(strVariable);
225
CHAPTER 6

36: }
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
37: }
38:
39: tempArray.TrimToSize();
40: arrayNamesArray.Add(_strCurrentMenu);
41: for (int i=0;i<tempArray.Count;i++) {
42: strTemp += tempArray[i];
43: }
44: arrayHolderArray.Add(strTemp);
45: strCurrentMenu = _strCurrentMenu.Substring(0,_
46: strCurrentMenu.Length-2);
47: //Exiting function so go back to previous menu version
48: intLevel -= 1;
49: tempArray.Clear();
50: } // WalkTree
When the walkTree() method is called, the node currently being analyzed is passed in and
more checks are run to see if it also has child nodes that need to be walked through (lines 9
and 10). If it does, WalkTree() is called again and the node to walk through is passed into the
method. This process of calling the same function to go deeper into a tree structure is called
recursion. It’s a very powerful programming technique that requires little code. You saw an
example of recursion a little earlier in the “Putting It All Together” section.
During this entire process, the menuItem nodes that are found are parsed and their appropriate
hyperLink and name node values are placed into ArrayLists. When the entire XML docu-
ment has been parsed, the ASP.NET page writes the contents of the array lists out using the
Response object. From this point on, client-side code controls the DHTML menus
(XMLMenuScript.js).
This application provides a good look at how recursion can be used to get at XML nodes and
traverse hierarchical parent/child relationships. Although the client-side code that goes with
this application is designed to work only in Internet Explorer 4 or later, you can download a
slightly different version of the application that is cross-browser from
http://www.TomorrowsLearning.com.
Chapter 2 Sample Application Revisited

If you remember back to Chapter 2, a simple application was developed that added values sub-
mitted from a form to an XML document using .NET classes that were not necessarily associ-
ated with XML. In that sample it was mentioned that a better way for adding information to an
XML document existed. Now that you’re familiar with the XmlNode and XmlDocument classes
XML Basics
226
and have seen some of their properties and methods, this simple application can be converted
to a true XML application using the System.Xml classes rather than the file system objects
used in the more primitive example. Using the XmlDocument and XmlNode classes, you’ll be
able to capture form data and then add nodes to an XML document dynamically. The following
is a fragment of the XML document that will be created when a user submits a request:
<request id=”462001|32633|PM”>
<dateTime>4/6/2001 3:26:33 PM</dateTime>
<startDate>05/06/2001</startDate>
<endDate>05/13/2001</endDate>
<dueDate>04/15/2001</dueDate>
<projectTime>40</projectTime>
<projectLength>hours</projectLength>
<cost>10000</cost>
<jobCode>ITV1000</jobCode>
<subject>IT Contractor Request</subject>
<jobTitle>Manager/Director</jobTitle>
<contactName>Dan Wahlin</contactName>
<contactDepartment>Tomorrow’s Learning</contactDepartment>
<contactStreet>1234 Anywhere St.</contactStreet>
<contactCity>Phoenix</contactCity>
<contactState>AZ</contactState>
<contactZip>85244</contactZip>
<contactPhone>123-1234</contactPhone>
<contactFax>123-4321</contactFax>
<contactEmail>[email protected]</contactEmail>
<jobStreet>1234 Anywhere St.</jobStreet>
<jobCity>Phoenix</jobCity>
<jobState>AZ</jobState>
<jobZip>85244</jobZip>
</request>
The form used to gather data from the end user is similar to the one used in Chapter 2. Upon
submitting the form, however, the code used to capture the data has been given a major over-
haul. Listing 6.12 shows the part of the ASP.NET page called when data is submitted by an
end user:
LISTING 6.12 Capturing Data Using XML and ASP.NET

1: public void submitButton_Click(Object Sender, EventArgs E) {
3: string sRootName = “root”;
4: string sID = “”;
5: string ID = Request.Form[“dateTime”];
6: string sStatus = “”;
227
CHAPTER 6

7: string newID = XmlFunctions.PrepareID(ID);
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
8: bool blnStatus = true;
9:
11: try {
12: doc.Load(sXMLFilePath);
13: }
15: //File doesn’t exist yet so make one
16: doc.LoadXml(“<?xml version=\”1.0\”?><” + sRootName +
17: “></” + sRootName + “>”);
18: }
19:
20: //******************************************************************
21: //*********************** Prevent duplicate ids *****************
22: //******************************************************************
23: string sXPath = “//request[contains(@id,’” + newID + “‘)]”;
24: XmlNodeList requestNodeList = doc.SelectNodes(sXPath);
25: int lastNodeIndex = requestNodeList.Count;
26: if (lastNodeIndex == 0) {
27: sID = newID;
28: } else {
29: // Get the last node in the NodeList that has this id.
30: XmlElement lastRequestElement =
31: (XmlElement)requestNodeList.Item(lastNodeIndex-1);
32: sID = lastRequestElement.GetAttribute(“id”);
33:
34: //Ensure user didn’t enter request twice
35: XmlElement contactElement =
36: (XmlElement)doc.SelectSingleNode(“//request[@id=’” +
37: sID + “‘]/contactName”);
38: if (contactElement != null) {
39: //Ensure element isn’t empty
40: if (!contactElement.IsEmpty) {
41: if (contactElement.InnerText.ToUpper().Trim() ==
42: Request.Form[“contactName”].ToUpper().Trim()) {
43: Response.Write(“It appears you are attempting” +
44: “ to enter two “ +
45: “entries for the same request.” +
46: “ Processing cannot continue.”);
47: blnStatus = false;
48: }
49: }
50: }
XML Basics
228

51: //Create new ID that is unique since one already exists
52: //If a different person is entering at the same time, create
53: //a unique ID for them by adding one after the “-” character
54: // if the ID already exists due to someone else
55: //entering a value a millisecond earlier.
56:
57: int iPosition = sID.IndexOf(“-”);
58: if (iPosition > 0) {
59: int start = iPosition+1;
60: int sNewIDAddOn =
61: (Convert.ToInt32(sID.Substring(start))+1);
62: sID = newID + “-” + sNewIDAddOn.ToString();
63: } else { //Add a - plus a 1
64: sID = newID + “-1”;
65: }
66: }
67: //******************************************************************
68: //******************** End Prevent duplicate ids ****************
69: //******************************************************************
70:
71: //Create XmlNode object that child nodes will be added to
72: XmlNode oRootNode = doc.DocumentElement;
73: //Create request element node
74: XmlNode oContainer = doc.CreateElement(“request”);
75: //Create id attribute
76: XmlNode oAttribute = doc.CreateAttribute(“id”);
77: //Create text for id
78: oAttribute.Value = sID;
79: oContainer.Attributes.SetNamedItem(oAttribute);
80: //Add request node to root
81: oRootNode.AppendChild(oContainer);
82: if (blnStatus) {
83: foreach (string x in Request.Form) {
84: if (x.ToUpper() != “SUBMITBUTTON” && x.IndexOf(“_”) != 0) {
85: sStatus = XmlFunctions.AddNode(x,Request.Form[x],
86: oContainer,doc);
87: if (sStatus.Length != 0) {
88: Response.Write(sStatus);
89: blnStatus = false;
90: break;
91: }
92: }
93: }
94: }
229
CHAPTER 6

95: if (blnStatus) {
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
96: doc.Save(sXMLFilePath);
97: output += “Your information was successfully stored!<p>”;
98: output += “<a href=\”requests.xml\”>View XML File Contents</a>”;
99: content2.InnerHtml = output;
100: content2.Visible = true;
101: } else {
102: Response.Write(sStatus);
103: }
104: }
The code starts off in Line 10 by instantiating the XmlDocument class. Line 12 attempts to load
an existing XML document. If an error occurs because of the document not being found, a call
is made to the XmlDocument’s LoadXml() method and a simple XML document shell is created
(lines 16 and 17).
After we have an XML document (either an existing one or one created from scratch), the next
few lines of code help ensure that a unique id attribute is being added to the XML document.
Because the ID is based on a combination of date and time, it is certainly possible that more
than one user could submit a form request at the same time (down to the second) as another
user. If a request element in the XML document already has an id attribute equal to the one
the current user is submitting, the code will dynamically append an identity number on the end
of the id and then use that id to represent the new user’s request submission.
To see if another id already exists and matches up with the id the user is submitting, a call
to the XmlDocument class’s SelectNodes() method is made (line 24). An XPath statement is
passed into this method that uses the XPath contains function (covered in Chapter 3, “XPath,
XPointer, and XLink,” to check whether the id exists in any other id attributes already in
the XML document. If the call to the SelectNodes() method does not return one or more
nodes, the id does not already exist and can be used to uniquely identify the user’s request
submission.
If the id is found within another id already in the document, Lines 25–66 make the user’s sub-
mitted id unique to the XML document through performing a few alterations to it. First, the
last node that contains the same id that the user is submitting is found by accessing the index
of the last child node returned from the earlier call to the SelectNodes() method (lines 30 and
31). This id is then assigned to the sID variable. sID is used to create another XPath statement
that is passed into the XmlDocument class’s SelectSingleNode() method. The node that
matches will be a contactName node because of the Xpath statement (lines 36 and 37). This
XPath query is done to ensure that the id in question isn’t being compared to an id that was
XML Basics
230
just placed into the XML document by the same user because of accidentally submitting the
form twice (hey, users can do some weird things!). The value of the contactName node cur-
rently selected is then compared to the value found in Request.Form[“contactName”] (lines
41–48). If the two values are not equal, we know that the id being submitted is being com-
pared to an id in the XML document that was submitted by a different user at the exact same
time.
At this point, we can go ahead and make the necessary alternations to the id submitted by the
user to ensure that it is unique. We first use the latest id that contains the current user’s ID in it
(already placed in the sID variable earlier) and check the end of it for a “-” character. If one
exists, multiple entries have been submitted at the same time. From here, the code grabs the
numbers after the “-” character, converts them to an integer, and then adds one to the number.
The resulting number is then appended onto the current user’s submitted id to make it unique.
To help make this more clear, here’s how three id entries all submitted at the same time by
different users would look in the XML document:
1st Submission id: 462001|32633|PM
2nd Submission id: 462001|32633|PM-1
3rd Submission id: 462001|32633|PM-2
Having read through all that, you should have a new appreciation for databases and the way
they use identity fields to keep rows unique! Before we go further into the code, it’s important
to realize that this method of generating IDs is simply for example, so that you can see some
of the different XML classes in action. If you truly want a unique ID to be generated, you
would be better off instantiating a class that is designed to return a GUID or leverage features
found in ADO.NET (covered in Chapter 8, “Leveraging ADO.NET’s XML Features Using
ASP.NET”).
After the ID issue has been resolved, sID is added as the value for the id attribute (line 78) and
the attribute is added to the request node created earlier. Line 81 uses the XmlNode class’s
appendChild method to add the request node to the root node of the XML document. Now
that a container exists for all the user data in the form of the request node, the code loops
through the Request object’s form collection to gather the user data.
Lines 83–93 contain the looping logic. You’ll notice that as each item in the collection is
looped through, a call is made to a method named AddNode(). This is a static method that
exists in a class named XmlFunctions. The AddNode() method’s sole purpose is to add the data
entered by the user into the request node. The code used to accomplish this is shown next:
public static string AddNode(string sNodeName,string sNodeText,
XmlNode oNodeToAppendTo,XmlDocument oDoc) {
if (sNodeText == “”) sNodeText = “null”;
try {
231
CHAPTER 6
XmlNode oNewNode = oDoc.CreateElement(sNodeName);

XmlNode oNewNodeText = oDoc.CreateTextNode(sNodeText);
6
oNewNode.AppendChild(oNewNodeText);
THE DOCUMENT
OBJECT MODEL
PROGRAMMING
oNodeToAppendTo.AppendChild(oNewNode);
}
return exc.ToString();
}
return “”;
}
The AddNode() method accepts four different arguments. These are shown next:
Argument Description
sNodeName Name of the new node to add to the container node.
sNodeText Value that the new node being created will contain.
Example:
<sNodeName>sNodeText</sNodeName>
oNodeToAppendTo Node object that the new node should be appended to.
oDoc XmlDocument object that will be used to create new element nodes
and their associated text nodes.
As described previously, the function accepts the new node’s name (sNodeName), the data the
node will describe (sNodeText), the XmlNode object to which the newly created node will be
appended to (oNodeToAppendTo), and the XmlDocument object that will be used to create new
nodes (oDoc). After these arguments are passed, the function simply creates a new node,
appends a text node to it, and then appends the newly created node to the request node in the
XML document being worked with. This process occurs for each element in the form that the
user submits.
Every user’s data will be added to the XML document in the same fashion as shown in Listing
6.12. Comparing this version of the application to the one used in Chapter 2, you can see how
the XML document stays well formed throughout the entire process. This is in contrast to hav-
ing to manually add the ending root tag immediately before viewing the XML document, as
was done in Chapter 2’s example. Was this solution a little more difficult to develop? Probably
so, assuming you’re already familiar with the classes used in Chapter 2’s application. However,
it provides much more power and flexibility in manipulating the XML document. If users want
the capability to view past submissions, a query mechanism similar to the one shown earlier in
the “XMLHTTPRequest Object” section could be built to allow for this. Try doing that using only
the objects used in Chapter 2’s example! It could be done, but parsing string data manually can
tend to get boring and tedious.
XML Basics
232
Summary
In this chapter you’ve been presented with a comprehensive look at many of the DOM classes
available in the System.Xml namespace. These classes include the XmlNode, XmlDocument,
XmlNodeList, XmlNamedNodeMap, and XmlNodeReader. You also saw how to use the
XMLHTTPRequest object on the client side. By using these classes and their various properties
and methods, XML documents loaded into the DOM can be accessed and manipulated
programmatically.
In the next chapter you’ll learn about Extensible Stylesheet Language Transformations (XSLT)
and see how XML documents can be transformed into other structures.
Transforming XML with XSLT CHAPTER
7
and ASP.NET
IN THIS CHAPTER
What Is XSLT? 234
The Transformation Process 235
Getting Your Feet Wet with XSLT 237
The XSLT Language 242
XSLT Functions 267
.NET Classes Involved in Transforming XML 272
Creating a Reusable XSLT Class 287

234
What Is XSLT?
During the development of the XML specification, the W3C working group realized that for
XML to reach its full potential, a method of transforming XML documents into different for-
mats needed to exist. At some time or another, an application that has the capability to work
with XML documents will need to display or structure the data in a different format than speci-
fied in the document. If the only method for accomplishing this task necessitates programmati-
cally transforming the XML document into the appropriate format by using an XML parser
paired with a programming language, the power of having a cross-platform and language-
independent XML language would be lost. Some method of transforming XML documents
into different formats such as HTML, flat files, Wireless Markup Language (WML), and even
other forms of XML needed to be devised so that it could be used on any platform and with
any language.
To accommodate this transformation process, Extensible Stylesheet Language Transformations
(XSLT) was created. Version 1.0 of the XSLT specification reached recommended status at the
W3C in November of 1999 (http://www.w3.org/TR/1999/REC-xslt-19991116) and many
XML parsers now provide full XSLT support. The .NET framework provides 100% compli-
ance with the XSLT version 1.0 specification.
What exactly is XSLT useful for and why would you, as an ASP.NET developer, want to learn
about it? The answer boils down to the capability of XSLT to transform XML documents into
different formats that can be consumed by a variety of devices, including browsers, Personal
Digital Assistants (PDAs), Web-enabled phones, and other devices that will appear in the near
future.
Transformations can also be useful in situations where an XML document’s structure does not
match up well with an application that will accept the data within the document. An XML doc-
ument may contain the appropriate data to be imported into a database, for example, but may
not be structured in a way that the application performing the import expects. For example, the
application may be better prepared to handle element-based XML documents rather than ones
with a lot of attributes, as shown in the following document:
<root>
<row id=”1” fname=”Dan” lname=”Wahlin”/>
<row id=”2” fname=”Heedy” lname=”Wahlin”/>
<row id=”3” fname=”Danny” lname=”Wahlin”/>
<row id=”4” fname=”Jeffery” lname=”Wahlin”/>
</root>
Using XSLT, this document can be transformed into a structure that the application is better
suited to work with:
Transforming XML with XSLT and ASP.NET
235
CHAPTER 7
<root>
<row>
<id>1</id>
<fname>Dan</fname>
<lname>Wahlin</lname>
</row>
<row>
<id>2</id>
<fname>Heedy</fname>
</row>
7
XML WITH XSLT

<row>
TRANSFORMING
AND ASP.NET
<id>3</id>
<fname>Danny</fname>
</row>
<row>
<id>4</id>
<fname>Jeffery</fname>
</row>
</root>
This chapter teaches you how to perform this type of transformation—as well as many
others—by covering the following topics:
• The transformation process
• The XSLT language
• .NET classes involved in transforming XML
The Transformation Process

The process of transforming an XML document into another format, such as HTML or WML,
relies on two types of processing engines. First, a parser capable of loading an XML document
must be present to load the source document into a DOM tree structure (for more information
about the DOM, refer to Chapter 6, “Programming the Document Object Model (DOM) with
ASP.NET.” Next, the XSLT document must be loaded and a tree structure will be created for it,
as well. This tree structure will normally be optimized to accommodate XSLT processing and
is specific to the processor being used. An XSLT processor is then needed to take the XML
document structure, match up nodes within the document against “templates” found in the
XSLT document, and then output the resulting document. The third tree structure (the resulting
document) is dynamically created based on information contained in the XSLT document. A
simple diagram of this transformation process is shown in Figure 7.1.
236
XML
XML
HTML
PDF
WML
ETC.
XSLT Processor
XSLT
FIGURE 7.1
The XSLT transformation process.
XSLT Templates
Before looking more closely at how to build XSLT documents, it’s important that you under-
stand what the building blocks of these documents are. Although XSLT stands for Extensible
Stylesheet Language Transformations, an alternative name for it could potentially be
Extensible Template Language Transformations. Why? The answer is because of its reliance on
templates to process and create a particular output structure. The W3C provides the following
statement about templates:
A stylesheet contains a set of template rules. A template rule has two parts: a pattern which is
matched against nodes in the source tree and a template which can be instantiated to form part
of the result tree. This allows a stylesheet to be applicable to a wide class of documents that
have similar source tree structures.
If you have ever used templates in Excel, Word, or PowerPoint, you know that they provide a
basic structure that can be reused for specific purposes. For example, every time you submit an
expense report, you may be accustomed to filling out a template in Word that is designed for
this purpose. The template likely has specific form fields built in so that every expense report
being submitted looks the same. There may be other templates that are used for purchase
orders. The point is that the templates are geared to match up with a specific task, such as cre-
ating an expense report, a purchase order, or some other activity.
Templates in XSLT function in much the same way, except that they are geared to match up
with nodes in an XML document. XSLT templates provide a way to process and structure data
contained within elements and attributes in the source XML document. Their basic purpose is
to provide a template structure that can be processed when a particular node in the source
XML document is discovered.
So how do templates work? The XSLT processor described earlier is provided with two tree
structures to walk through. The first is the structure for the source XML document and the
237
CHAPTER 7
second is the XSLT document itself. After these two structures are provided, the XSLT proces-
sor attempts to match element or attribute names found in the XML document with templates
contained in the XSLT tree structure. This matching process uses XPath expressions that are
embedded within the XSLT document. When a node found within the XML document matches
a template in the XSLT document, that template is processed.
Processing of templates found within an XSLT document normally starts with a template that
matches the root node of the XML document and proceeds down to its children. When a tem-
plate is processed, the output is added to the third tree structure mentioned earlier that is used
in building the output document. 7
XML WITH XSLT

Templates offer an efficient way to process a variety of XML document structures and are very
TRANSFORMING
AND ASP.NET
efficient in cases where an XML document contains repetitive items. Each time an element,
attribute, text node, and so on is found, it is matched up with the appropriate template via
XPath expressions. If a given node does not have a matching template, no processing will
occur on it, and the next section of the XML document is processed. In cases where a match-
ing node is found, the template takes care of generating the proper output structure based on
data/nodes contained within the node.
So that you can see templates in action, the next section introduces you to a simple XSLT doc-
ument. The sections that follow describe in greater detail how to use templates and other parts
of the XSLT language.
Getting Your Feet Wet with XSLT

In this section we’ll examine a simple XSLT document that transforms XML into HTML for
display in a browser. The sections that follow show how XSLT can transform XML into many
formats other than HTML. This example represents a common task that you will likely use
when developing ASP.NET applications that require the presentation of XML data within a
browser. Listing 7.1 shows an XML document that contains information about different
golfers.
LISTING 7.1 Golfers XML Document

1: <?xml version=”1.0” ?>
2: <golfers>
3: <golfer skill=”excellent” handicap=”10” clubs=”Taylor Made” id=”1111”>
4: <name>
5: <firstName>Heedy</firstName>
7: </name>
238

10: <course city=”Phoenix” state=”AZ” name=”Ocotillo”/>
11: <course city=”Snowflake” state=”AZ” name=”Silver Creek”/>
13: </golfer>
14: <golfer skill=”moderate” handicap=”12” clubs=”Taylor Made” id=”2222”>
15: <name>
16: <firstName>Dan</firstName>
18: </name>
21: <course city=”Pinetop” state=”AZ” name=”White Mountain CC”/>
22: <course city=”Springville” state=”UT” name=”Hobble Creek”/>
24: </golfer>
25: </golfers>
Listing 7.2 presents an XSLT document that can be used to transform the XML just shown into
HTML. The different elements used in this document are discussed later.
LISTING 7.2 Golfers XSLT Document

2: <xsl:stylesheet xmlns:xsl=”http://www.w3.org/1999/XSL/Transform”
3: version=”1.0”>
4: <xsl:output indent=”yes” method=”html”/>
5: <xsl:template match=”/”>
6: <html>
7: <head>
8: <style type=”text/css”>
9: .blackText {font-family:arial;color:#000000;}
10: .largeYellowText {font-family:arial;font-size:18pt;
11: color:#ffff00;}
12: .largeBlackText {font-family:arial;font-size:14pt;
13: color:#000000;}
14: .borders {border-left:1px solid #000000;
15: border-right:1px solid #000000;
16: border-top:1px solid #000000;
17: border-bottom:1px solid #000000;}
18: </style>
19: </head>
20: <body bgcolor=”#ffffff”>
21: <span class=”largeBlackText”><b>Golfers: </b></span>
239
CHAPTER 7

22: <p/>
23: <xsl:apply-templates/>
24: </body>
25: </html>
26: </xsl:template>
27: <xsl:template match=”golfers”>
29: </xsl:template>
30: <xsl:template match=”golfer”>
31: <table class=”borders” border=”0” width=”640” cellpadding=”4” 7
XML WITH XSLT

32: cellspacing=”0” bgcolor=”#efefef”>
TRANSFORMING
AND ASP.NET
33: <xsl:apply-templates select=”name”/>
34: <tr class=”blackText”>
35: <td width=”12%” align=”left”><b>Skill: </b></td>
36: <td width=”12%” align=”left”>
37: <xsl:value-of select=”@skill”/>
38: </td>
39: <td width=”12%” align=”left”><b>Handicap: </b></td>
41: <xsl:value-of select=”@handicap”/>
42: </td>
43: <td width=”12%” align=”left”><b>Clubs: </b></td>
45: <xsl:value-of select=”@clubs”/>
46: </td>
47: </tr>
48: <tr>
49: <td colspan=”6”> </td>
50: </tr>
52: <td colspan=”6” class=”largeBlackText”>
53: Favorite Courses
54: </td>
55: </tr>
56: <tr>
57: <td colspan=”2”><b>City: </b></td>
58: <td colspan=”2”><b>State: </b></td>
59: <td colspan=”2”><b>Course: </b></td>
60: </tr>
61: <xsl:apply-templates select=”favoriteCourses”/>
62: </table>
63: <p/>
64: </xsl:template>
65: <xsl:template match=”name”>
240

66: <tr>
67: <td colspan=”6” class=”largeYellowText” bgcolor=”#02027a”>
68: <xsl:value-of select=”firstName”/>
69: <xsl:value-of select=”lastName”/>
70: </td>
71: </tr>
72: </xsl:template>
73: <xsl:template match=”favoriteCourses”>
75: </xsl:template>
76: <xsl:template match=”course”>
78: <td colspan=”2” align=”left”>
79: <xsl:value-of select=”@city”/>
80: </td>
82: <xsl:value-of select=”@state”/>
83: </td>
85: <xsl:value-of select=”@name”/>
86: </td>
87: </tr>
88: </xsl:template>
89: </xsl:stylesheet>
To transform the XML document shown in Listing 7.1 using the XSLT document shown in
Listing 7.2, the code shown in Listing 7.3 can be used:
LISTING 7.3 Using the XslTransform Class

2: <%@ Import Namespace=”System.Xml.Xsl” %>
3: <%@ Import Namespace=”System.Xml.XPath” %>
6: string xmlPath = Server.MapPath(“listing7.1.xml”);
7: string xslPath = Server.MapPath(“listing7.2.xsl”);
8:
9: //Instantiate the XPathDocument Class
10: XPathDocument doc = new XPathDocument(xmlPath);
11:
12: //Instantiate the XslTransform Class
13: XslTransform transform = new XslTransform();
14: transform.Load(xslPath);
241
CHAPTER 7

15:
16: //Custom format the indenting of the output document
17: //by using an XmlTextWriter
18: XmlTextWriter writer = new XmlTextWriter(Response.Output);
20: writer.Indentation=4;
21: transform.Transform(doc, null, writer);
22: }
23: </script>
7
XML WITH XSLT

TRANSFORMING
On executing the code in Listing 7.3, the XML document will magically be transformed into
AND ASP.NET
HTML that can be rendered in a browser. The result of this transformation is shown in
Figure 7.2.
FIGURE 7.2
Transforming an XML document into HTML.
The code shown in Listings 7.2 and 7.3 may look quite foreign to you at this point. Don’t let
that worry you, though, because each portion of the code will be broken down to show how the
different pieces work together. In addition to covering the XSLT language, the examples that
follow also demonstrate XPath expressions as a point of review. For a detailed explanation of
XPath, refer to Chapter 3, “XPath, XPointer, and XLink.” Before examining the .NET classes
involved in transforming XML to other structures, let’s first examine what pieces are involved
in constructing an XSLT document.
242
The XSLT Language

Now that you’ve seen the transformation process and have been introduced to what an XSLT
document looks like, let’s break the different parts used in the document into individual pieces.
First up: the XSLT document root element.
The XSLT Document Root Element

Looking back at Listing 7.2, you’ll notice that the document follows all the rules specified in
the XML specification described in Chapter 2, “XML for ASP.NET Basics.” The case of each
opening tag matches the case of the closing tag, all attributes are quoted, all tags are closed,
and so on. XSLT documents are, in fact, well-formed XML documents. As a result, the first
line of each document should contain the XML declaration. Although this line is optional, it’s
essential that you get into the practice of using it, especially because new versions of the XML
specification will certainly be coming in the future.
Following the XML declaration, one of two elements specific to the XSLT language can be
used for the document’s root node. These elements are the following:
• <xsl:stylesheet>
• <xsl:transform>
Although you can use either element as the root of an XSLT document, the samples that follow
throughout this chapter use the xsl:stylesheet element. You can certainly substitute the
xsl:transform element instead if you feel more comfortable using it.
Two different items must also be included for an XSLT document to follow the guidelines
found in the XSLT specification. These are a local namespace declaration as well as an
attribute named version. The inclusion of the xsl:stylesheet element, the namespace decla-
ration, and the version attribute are shown next:
<xsl:stylesheet xmlns:xsl=”http://www.w3.org/1999/XSL/Transform”
version=”1.0”
>

</xsl:stylesheet>
The namespace URI (http://www.w3.org/1999/XSL/Transform) must be listed exactly as

shown, and the version attribute must have a value of 1.0 for the document to be conformant
with the November 1999 XSLT specification. As different versions of the specification are
released, this version number can be changed, depending on what features the XSLT document
243
CHAPTER 7
uses. Failure to list these parts correctly will result in an error being returned by the XSLT
processor.
NOTE
XSLT version 1.1 was in Working Draft at the time this section was written. XSLT style
sheets that use new features found in this version will need to let the XSLT processor
know by changing the version attribute to 1.1.
7
XSLT Elements
XML WITH XSLT

TRANSFORMING
AND ASP.NET
If you’ve had the opportunity to work with HTML in the past, you’re already aware of how
elements are used to perform specific tasks. For example, the <table> element can be used
along with the <tr> and <td> elements to construct a table for display in a browser. The <img>
element can be used when an image needs to be displayed, and the <form> element can be
used as a container for different form elements such as text boxes and radio buttons. Each of
these elements have a specific purpose and when appropriate, can contain supporting child
elements.
The XSLT version 1.0 specification lists several elements that can be used to transform XML
documents. These elements can be used in a variety of ways, including determining the output
format, performing if/then type logic, looping, and writing out data within a node contained in
the XML document to the result tree structure. An XSLT element is distinguished from other
elements that may be within an XSLT document by its association with a namespace that
defines a URI of http://www.w3.org/1999/XSL/Transform. Declaring this namespace on the
XSLT root element (xsl:stylesheet or xsl:transform) was shown in the previous section.
Table 7.1 contains a listing of all potential elements in version 1.0 of the XSLT specification.
Notice that each element is prefixed by the xsl namespace.
TABLE 7.1 XSLT Elements

XSLT Element Description
xsl:apply-imports Used in conjunction with imported style sheets to over-
ride templates within the source style sheet. Calls to
xsl:apply-imports cause an imported template with
lower precedence to be invoked instead of the source
style sheet template with higher precedence.
xsl:apply-templates When xsl:apply-templates is used, the XSLT proces-
sor finds the appropriate template to apply, based on the
type and context of each selected node.
244
TABLE 7.1 continued

xsl:attribute Creates an attribute node that is attached to an element
that appears in the output structure.
xsl:attribute-set Used when a commonly defined set of attributes will be
applied to different elements in the style sheet. This is
similar to named styles in CSS.
xsl:call-template Used when processing is directed to a specific template.
The template is identified by name.
xsl:choose Used along with the xsl:otherwise and xsl:when ele-
ments to provide conditional testing. Similar to using a
switch statement in C# or Select Case statement in
VB.NET.
xsl:comment Writes a comment to the output structure.
xsl:copy Copies the current node from the source document to
the result tree. The current node’s children are not
copied.
xsl:copy-of Used to copy a result-tree fragment or node-set into the
result tree. This performs a “deep copy,” meaning that
all descendants of the current node are copied to the
result tree.
xsl:decimal-format Declares a decimal-format that is used when converting
numbers into strings with the format-number() function.
xsl:element Creates an element with the specified name in the output
structure.
xsl:fallback Provides an alternative (or fallback) template when spe-
cific functionality is not supported by the XSLT proces-
sor being used for the transformation. This element
provides greater flexibility during transformations as
new XSLT versions come out in the future.
xsl:for-each Iterates over nodes in a selected node-set and applies a
template repeatedly.
xsl:if Used to wrap a template body that will be used only
when the if statement test returns a true value.
xsl:import Allows an external XSLT style sheet to be imported into
the current style sheet. The XSLT processor will give a
lower precedence to imported templates as compared to
templates in the original XSLT style sheet.
245
CHAPTER 7
TABLE 7.1 continued

xsl:include Allows for the inclusion of another XSLT style sheet
into the current style sheet. The XSLT processor gives
the same precedence to the included templates as tem-
plates in the original XSLT style sheet.
xsl:key Declares a named key and is used in conjunction with
the key() function in XPath expressions.
xsl:message Used to output a text message and optionally terminate 7
style sheet execution.
XML WITH XSLT

TRANSFORMING
AND ASP.NET
xsl:namespace-alias Used to map a prefix associated with a given namespace
to another prefix. This can be useful when a style sheet
generates another style sheet.
xsl:number Used to format a number before adding it to the result
tree or to provide a sequential number to the current
node.
xsl:otherwise Used with the xsl:choose and xsl:when elements to
perform conditional testing. Similar to using default in a
switch statement.
xsl:output Specifies options for use in serializing the result tree.
xsl:param Used to declare a parameter with a local or global
scope. Local parameters are scoped to the template in
which they are declared.
xsl:preserve-space Preserves whitespace in a document. Works in conjunc-
tion with the xsl:strip-space element.
xsl:processing-instruction Writes a processing instruction to the result tree.
xsl:sort Used with xsl:for-each or xsl:apply-templates to
specify sort criteria for selected node lists.
xsl:strip-space Causes whitespace to be stripped from a document.
Works in conjunction with the xsl:preserve-space
element.
xsl:stylesheet This element must be the outermost element in an XSLT
document and must contain a namespace associated
with the XSLT specification and a version attribute.
xsl:template Defines a reusable template for producing output for
nodes that match a particular pattern.
xsl:text Writes out the specified text to the result tree.
246
TABLE 7.1 continued

xsl:transform Used in the same manner as the xsl:stylesheet
element.
xsl:value-of Writes out the value of the selected node to the result
tree.
xsl:variable Used to declare and assign variable values that can be
either local or global in scope.
xsl:when Used as a child element of xsl:choose to perform mul-
tiple conditional testing. Similar to using case in a
switch or Select statement.
xsl:with-param Used in passing a parameter to a template that is called
via xsl:call-template.
Although not every element listed in Table 7.1 is discussed in this section, you will be exposed
to the more common elements and see how they can be used to transform XML into formats
such as HTML, WML, and even EDI.
NOTE
The HTML generated by an XSLT document must conform to the rules outlined in the
XML specification. All elements must be closed, including <img>, <input>, and all the
other elements that normally do not need to be closed in HTML. Attributes used on
elements must be quoted, a beginning and ending element tag’s case must match,
and so on. Keep in mind that the XSLT processor knows how to work only with well-
formed XML and knows nothing about the tags used in HTML, WML, and so on. As a
result, everything within the XSLT document must follow the XML rules.
Transforming to HTML Using XSLT Elements

One of the best ways to learn about the different XSLT elements is to see them in action.
Listing 7.4 repeats the XSLT style sheet shown earlier in Listing 7.2 but adds additional func-
tionality. After looking through the code, you’ll see a step-by-step explanation of what the code
is doing.
LISTING 7.4 Golfers XSLT Document

247
CHAPTER 7

4: <xsl:output method=”html” indent=”yes”/>
6: <html>
7: <head>
10: .largeYellowText {font-family:arial;
11: font-size:18pt;color:#ffff00;}
12: .largeBlackText {font-family:arial; 7
XML WITH XSLT

13: font-size:14pt;color:#000000;}
TRANSFORMING
AND ASP.NET
18: </style>
19: </head>
21: <span class=”largeBlackText”>
22: <b>List of</b>
23: <xsl:if test=”count(//golfer) > 0”>
24:
25: <xsl:value-of select=”count(//golfer)”/>
26: </xsl:if>
27: <b>Golfers</b>
28: </span>
29: <p/>
31: </body>
32: </html>
33: </xsl:template>
35: <xsl:apply-templates select=”golfer”/>
36: </xsl:template>
38: <table class=”borders” border=”0” width=”640”
39: cellpadding=”4” cellspacing=”0” bgcolor=”#efefef”>
43: <b>Skill: </b>
44: </td>
46: <xsl:attribute name=”style”>
248

47: <xsl:choose>
48: <xsl:when test=”@skill=’excellent’”>
49: color:#ff0000;font-weight:bold;
50: </xsl:when>
51: <xsl:when test=”@skill=’moderate’”>
52: color:#005300;
53: </xsl:when>
54: <xsl:when test=”@skill=’poor’”>
55: color:#000000;
56: </xsl:when>
57: <xsl:otherwise>
58: color:#ffffff;
59: </xsl:otherwise>
60: </xsl:choose>
61: </xsl:attribute>
63: </td>
65: <b>Handicap: </b>
66: </td>
69: </td>
71: <b>Clubs: </b>
72: </td>
75: </td>
76: </tr>
77: <tr>
79: </tr>
83: </td>
84: </tr>
85: <tr>
86: <td colspan=”2”>
87: <b>City: </b>
88: </td>
90: <b>State: </b>
91: </td>
249
CHAPTER 7

93: <b>Course: </b>
94: </td>
95: </tr>
97: </table>
98: <p/>
99: </xsl:template>
101: <tr> 7
XML WITH XSLT

TRANSFORMING
AND ASP.NET
105: </td>
106: </tr>
107: </xsl:template>
112: <xsl:call-template name=”writeComment”/>
116: </td>
119: </td>
122: </td>
123: </tr>
125: <xsl:template name=”writeComment”>
126: <xsl:comment>List Course Information</xsl:comment>
The following explanation walks through each element used in Listing 7.4.
Line 1:
The XML declaration is used because this is a valid XML document.

250
Lines 2–3:
This line contains the first XSLT element used in the document: xsl:stylesheet. As shown
earlier, this element has an associated namespace declaration and version attribute. The
xsl:transform element could also be used here. One of these two elements will always be the
root node of the XSLT document.
Line 4:
The xsl:output element is used to specify what type of format will be created in the result
tree. The method attribute can contain values of xml, html, or text. This element is a top-level
element, meaning that it must be a child of the xsl:stylesheet element to be used properly.
The different attributes that can be used to describe this element are the following: method,
version, encoding, omit-xml-declaration, standalone, doctype-public, doctype-system,
cdata-section-elements, indent, media-type. This element includes the indent attribute,
which will indent the result tree to show its hierarchical structure. XSLT processors do not
have to honor the indentation request. If the processor does support indentation, the manner in
which the indentation is implemented is up to the processor.
Lines 5–33:
6: <html>
7: <head>
10: .largeYellowText {font-family:arial;
11: font-size:18pt;color:#ffff00;}
12: .largeBlackText {font-family:arial;
13: font-size:14pt;color:#000000;}
18: </style>
19: </head>
21: <span class=”largeBlackText”>
22: <b>List of</b>
24:
251
CHAPTER 7
26: </xsl:if>
27: <b>Golfers</b>
28: </span>
29: <p/>
31: </body>
32: </html>
33: </xsl:template>
Here’s an example of the first template definition specified in the XSLT document. Templates
contain structured information that will be processed and output to the result tree. They are
7
processed when the XPath pattern found in the match attribute matches a node found in the
XML WITH XSLT

TRANSFORMING
AND ASP.NET
source XML document.
This template has a match attribute with a value of /. The value (/) represents a pattern that
matches the XML document (think of it as the position directly above the root XML element).
The purpose of a pattern is to identify which nodes a template applies to. You can think of pat-
terns as valid XPath statements, although the XSLT definition does define them separately.
If the pattern specified in the match attribute matches a node in the source XML document, the
information located within the template will be processed and written out to the result tree. In
this case, when the XML document is matched, the basic elements used to start an HTML doc-
ument are added to the result tree.
NOTE
It’s worth repeating that that the <html> and <body> elements contained within the
template are simply standard XML elements to the XSLT processor. It knows nothing
about HTML elements and simply cares that the elements follow the XML rules. Only
when processing has completed and the result tree is rendered in an application that
understands HTML tags, will the <html> and <body> elements have any presentation
purpose.
The xsl:template element can have the attributes shown in Table 7.2.
TABLE 7.2 xsl:template Attributes
Attribute Name Description

match The match attribute’s value is a pattern defined by an XPath statement
that states which nodes in the source XML document should be
processed by the template it is associated with. Although optional, if this
attribute is not included, the name attribute shown next must be included.
252
TABLE 7.2 continued

Attribute Name Description
name Applies a name to a template. This attribute is used by xsl:call-
template. Although optional, if this attribute is not included, the match
attribute shown next must be included.
priority The value must be a number that says the priority of the template. The
priority attribute’s value will be taken into consideration if several tem-
plates match the same node.
mode Applies a mode to a template. The mode is simply a name that can be
used by the xsl:apply-templates element to hit a template that does a
specific form of transformation. For example, if you need a node named
chapter within an XML document to be processed differently, depend-
ing on its position within the document, you can create different tem-
plates that all match the chapter node. To differentiate the templates
from each other (because they have the same match attribute value) a
mode can be applied to each one. A call can then be made to the specific
template that needs to be processed by using the xsl:apply-templates
element and then specifying the template that has the desired mode.
Lines 23–26:
24:
26: </xsl:if>
This portion of the XSLT document shows how the xsl:if element can be used to test a par-
ticular condition. In this case, an XSLT function named count() is used. This function takes
an XPath statement as input and returns the number of nodes as output. You’ll learn more
about XSLT functions in the next section.
The xsl:if element must have an attribute named test, which converts a valid XPath state-
ment to a Boolean. In this case, the test checks to see whether the number of golfer nodes
exceeds the value of 0. If the test returns true, the content (referred to as the template body)
between the xsl:if start and end tags is processed.
Line 25 contains the xsl:value-of element, which is used frequently in XSLT documents
to write out the value of a particular node to the result tree. This element must contain an
attribute named select. The value of the attribute must contain a valid XPath expression. The
node (or nodes) returned by the expression is converted to a string. In this case, the number of
golfer nodes within the XML document is returned and placed in the result tree structure.
253
CHAPTER 7
The xsl:value-of element may optionally include an attribute named disable-output-

escaping that is useful when characters such as < or > need to be output without being escaped
by using < or >. The attribute accepts a value of yes or no. Using it is especially useful
when an XML element contains HTML tags that need to be written to the result tree structure
without escaping the brackets.
Line 30:
The xsl:apply-templates element provides a way to call templates that may match with
7
other items contained in the source XML document. Before explaining what this element does
XML WITH XSLT

TRANSFORMING
AND ASP.NET
in more detail, it’s appropriate to introduce a term called the context node. The context node is
defined as the source document node currently being processed by the XSLT processor as
specified by a template. Because we are processing the document node in the current template,
that node is considered the context node. Obviously, many other elements are below this node,
including golfers and golfer, that may also have templates associated with them. By using
xsl:apply-templates the XSLT code is saying (in more human terms), “Find all templates
that match with child elements of the current node (the context node) and go out and process
them.” The first child node that will be found is the root node of the source document
(golfers). The template that matches up with it is described next.
Lines 34–36:
36: </xsl:template>
The golfers node in the source XML document matches up with the template defined in these
lines of the XSLT document. As the XSLT processor is attempting to match templates with
nodes in the XML document, any node with a name of golfers will automatically be
processed by this template. Within the template, you’ll notice that the content is very sparse,
except for the inclusion of an xsl:apply-templates element. With the context node now
being the golfers node, calling xsl:apply-templates will send the processor looking for
templates that match up with child nodes of the golfers node.
You’ll notice that line 30 includes an attribute named select that applies to the xsl:apply-
templates element. This attribute accepts a valid XPath expression as a value. In this case, it
selects a template that matches up with a node named golfer. Because the golfers node con-
tains golfer nodes only as children, including this attribute is unnecessary and is shown only
to exemplify its use. If, however, the golfers node contained child nodes other than the
golfer node and we wanted the golfer node template to be processed first, the inclusion of
the select attribute would be more appropriate. This will become more clear as the next tem-
plate is discussed.
254
Lines 37–99:
38: <table class=”borders” border=”0” width=”640”
39: cellpadding=”4” cellspacing=”0” bgcolor=”#efefef”>
43: <b>Skill: </b>
44: </td>
47: <xsl:choose>
50: </xsl:when>
51: <xsl:when test=”@skill=’moderate’”>
52: color:#005300;
53: </xsl:when>
55: color:#000000;
56: </xsl:when>
57: <xsl:otherwise>
58: color:#000000;
60: </xsl:choose>
63: </td>
66: </td>
69: </td>
71: <b>Clubs: </b>
72: </td>
75: </td>
76: </tr>
77: <tr>
79: </tr>
255
CHAPTER 7

83: </td>
84: </tr>
85: <tr>
87: <b>City: </b>
88: </td>
90: <b>State: </b>
91:
92:
</td>
<td colspan=”2”>
7
XML WITH XSLT

93: <b>Course: </b>
TRANSFORMING
AND ASP.NET
94: </td>
95: </tr>
97: </table>
98: <p/>
99: </xsl:template>
This template does the bulk of the work in Listing 7.4 by matching up with all golfer nodes in
the source XML document. When the template is processed, the shell structure for an HTML
table is written out. This table will be used to present all the information about a specific golfer.
Line 40 uses the xsl:apply-templates and provides a pattern for the template that should be
called by using the select attribute. By providing a pattern equal to name, only a template that
matches up with the pattern will be processed. Why didn’t we simply call xsl:apply-templates
and not worry about which of the context node’s child node templates were called? The answer
is that we want to ensure that the template with a pattern matching the name child node is
processed before any other children of the context node (golfer, in this case).
After the template matching the name node is called, processing will be done on that template
and then return to the golfers template. Specifically, the XSLT processor will jump back to
the next statement in the golfer template that immediately follows the call to <xsl:apply-
templates select=”name”/>.
Lines 46–74 exhibit several of the XSLT elements shown earlier in Table 7.1. To start things
off, line 46 contains an xsl:attribute element named style. This XSLT element adds a style
attribute to the <td> tag in line 45. The value of the attribute is dynamically assigned based on
a series of conditional tests. To accomplish the tests, the xsl:choose, xsl:when, and xsl:
otherwise elements are used. These elements function in a manner similar to the switch,
case, and default keywords used when coding a switch statement in C#.
The conditional test starts with the xsl:choose element. It can be followed by as many
xsl:when elements, as needed. The xsl:when element must contain a mandatory attribute
256
named test that contains the expression to test. If the test returns true, the content between
the xsl:when starting and ending tags will be assigned to the value of the style attribute. The
test performed in line 48 checks to see whether an attribute of the context node (remember that
the context node is golfer at this point) named skill has a value equal to excellent. If it
does, the style attribute will have a value of color:#ff0000;font-weight:bold;. Assuming
the skill attribute does have a value of excellent, the actual structure added on completion of
the xsl:choose block testing will be the following:
<td width=”12%” align=”left” style=”color:#ff0000;font-weight:bold;”>
If the first xsl:when returns false, testing will continue down the line. If no xsl:when tests
return true, the xsl:otherwise block will be hit and the style attribute will be assigned a
value of color:#000000; (lines 57–59).
After the style attribute has been added to the <td> tag, processing continues with line 62,
which adds the value of the skill attribute to the table column by using the xsl:value-of ele-
ment discussed earlier. Lines 64–94 continue to add additional columns to the table and write
out the value of attributes found on the context node (the golfer node).
When processing in the golfers template completes, the xsl:apply-templates element is
again used along with a select attribute that points to a template pattern of favoriteCourses
(line 96). This template will be discussed later.
Lines 100–107:
101: <tr>
105: </td>
106: </tr>
The template declaration shown in line 100 matches up with all name nodes found within the
XML document. This template is called from within the golfer template discussed earlier (see
line 40). Processing of the template is limited to writing out a new row in the table (line 101)
followed by a column containing the values of the firstName and lastName elements. These
values are written to the result tree by using the xsl:value-of element(lines 103 and 104).
Lines 108–127:
257
CHAPTER 7

116: </td>
119: </td>
122:
123:
</td>
</tr>
7
XML WITH XSLT

TRANSFORMING
AND ASP.NET
The template matching favoriteCourses contains no functionality other than to call xsl:
apply-templates. This is because it contains no attributes and acts only as a parent container
element. Because the favoriteCourses node contains only child nodes named course, calling
xsl:apply-templates will result in only one template match.
Processing within the template that matches the course nodes is limited to adding a new row
(line 113) with columns containing the values for attributes named city, state, and name.
Each course node found within the source XML document will be matched with this template
and the appropriate attribute values will be written out.
Line 112 introduces a new XSLT element that hasn’t been covered to this point. The
xsl:call-template element can be used to call a template in a similar manner as calling a
function within C# or VB.NET. Calling a template in XSLT is accomplished by identifying the
name of the template to call via a name attribute. The template that is called must, in turn, have
a matching name attribute as shown in line 125.
Calling templates can be useful when a template doesn’t match up with a given node in an
XML document but needs to be accessible to process commonly used features or perform cal-
culations. For example, if your XSLT code needs to walk through a list of pipe-delimited
strings, a template can be called recursively until each piece of data within the string has been
processed. You’ll see a concrete example of using the xsl:call-template element in conjunc-
tion with the xsl:with-param and xsl:param elements toward the end of this chapter.
Line 128:
258
The XSLT language follows all the rules outlined in the XML specification. As such, the
xsl:stylesheet element must be closed.
This example has shown how you can use some of the main elements found in the XSLT spec-
ification. For more information on some of the other elements not covered in the previous
example, refer to the XSLT version 1.0 specification (http://www.w3.org/TR/1999/REC-
xslt-19991116) or pick up a copy of a book titled XSLT Programmer’s Reference
(ISBN 1-861003-12-9).
Transforming XML into Another Form of XML Using XSLT Elements

HTML isn’t the only structure that can be created by an XSLT document. Many cases may
occur in which other formats, such as a comma-delimited, WML, or even another form of
XML, need to be generated to integrate with an application. In this section we’ll take a look at
a simple example of transforming from XML to XML and show how a few of the XSLT ele-
ments can make this process easier. XML-to-XML transformations are useful when an XML
document’s structure needs to be changed before sending it off to a vendor or to another appli-
cation that expects a different format.
Listing 7.5 shows the XML document that needs to be transformed, Listing 7.6 shows the
result of the transformation, and Listing 7.7 shows the XSLT document used in processing the
transformation.
LISTING 7.5 The Source XML Document

2: <root>
3: <row>
4: <id>1</id>
5: <name>
6: <fname>Dan</fname>
7: <lname>Wahlin</lname>
8: </name>
9: <address type=”home”>
10: <street>1234 Anywhere St.</street>
11: <city>AnyTown</city>
12: <zip>85789</zip>
13: </address>
14: <address type=”business”>
15: <street>1234 LottaWork Ave.</street>
17: <zip>85786</zip>
18: </address>
19: </row>
20: <row>
259
CHAPTER 7

21: <id>2</id>
22: <name>
23: <fname>Elaine</fname>
24: <lname>Wahlin</lname>
25: </name>
29: <zip>85789</zip>
30: </address> 7
XML WITH XSLT

TRANSFORMING
AND ASP.NET
32: <street>1233 Books Way</street>
34: <zip>85784</zip>
35: </address>
36: </row>
37: </root>
LISTING 7.6 The Result of Transforming the XML Document in Listing 7.5
2: <root>
3: <row id=”1” fname=”Dan” lname=”Wahlin”>
7: <zip>85789</zip>
8: </address>
10: <street>1234 LottaWork Ave.</street>
12: <zip>85786</zip>
14: </address>
15: </row>
16: <row id=”2” fname=”Elaine” lname=”Wahlin”>
20: <zip>85789</zip>
21: </address>
23: <street>1233 Books Way</street>
260

25: <zip>85784</zip>
26: </address>
27: </row>
28: </root>
LISTING 7.7 The XSLT Document

4: <xsl:output method=”xml” indent=”yes” encoding=”utf-8”
5: omit-xml-declaration=”no”/>
7: <root>
9: </root>
10: </xsl:template>
11: <xsl:template match=”row”>
12: <row>
13: <xsl:attribute name=”id”>
14: <xsl:value-of select=”id”/>
16: <xsl:attribute name=”fname”>
17: <xsl:value-of select=”name/fname”/>
19: <xsl:attribute name=”lname”>
20: <xsl:value-of select=”name/lname”/>
22: <xsl:for-each select=”address”>
23: <xsl:copy-of select=”.”/>
24: </xsl:for-each>
25: </row>
26: </xsl:template>
Breaking the XSLT document down into individual pieces reveals a few new things not seen in
previous examples. First, Line 4 uses the xsl:output element to specify an output format of
xml. It also specifies that the XML declaration should be included. This is done by setting the
omit-xml-declaration attribute to no. Because this is the attribute’s default value, it could
have been left out altogether.
Lines 6–10 take care of setting the starting template (the one that matches the document node)
needed in the XSLT document. This template simply adds a node named root to the result tree
261
CHAPTER 7
and then triggers the process of looking for other templates that match up with nodes in the
source XML document.
The template matching the row element node writes a row element to the result tree. The bulk
of the transformation process occurs in lines 13–24. To start, three different attributes are
added to the row element by using the xsl:attribute element. The value of these attributes is
obtained by using the xsl:value-of element to access the appropriate elements in the source
XML document.
After the attributes are added, the xsl:for-each element is used to loop through all address
elements. This element simply takes the name of the node-set to loop through as the value of 7
XML WITH XSLT

the select attribute. Because the address elements (and their children) remain unchanged
TRANSFORMING
AND ASP.NET
from the source to the result tree, the xsl:copy-of element is used to simply copy over the
address element (and all its children) to the result tree. Had we wanted only to copy the
address element itself and not the children, we could have used the xsl:copy element instead.
However, utilizing the xsl:copy-of element prevents us from having to create each element
(address, street, city, zip) dynamically by using the xsl:element or xsl:copy elements.
Now that you’ve had an opportunity to see some of the most common XSLT elements in
action, let’s take a look at a few more that can help make your XSLT documents more dynamic
and flexible.
Using Variables and Parameters: The xsl:variable and

xsl:param Elements
As programmers, we all take for granted the capability to use variables and pass parameters to
functions. In fact, it’s hard to imagine programming without variables and parameters. Most
programmers would be hard pressed to eliminate them from their applications. Fortunately,
there’s no need to worry about variables or parameters being eliminated from C#, VB.NET, or
even from languages such as XSLT. The XSLT specification includes the capability to use a
variable or pass a parameter to a template. In this section, you are provided with a general
overview of how variables and parameters can be used to make your XSLT documents more
flexible. Let’s first take a look at how variables can be used.
Variables in XSLT
XSLT variables are used to avoid calculating the same result multiple times. Although very
similar to variables used in C# or any other programming language, XSLT variables can be set
once but cannot be updated after they are set. The value assigned to a variable is retained until
the variable goes out of scope. What, you say! XSLT variables can be set only once? Doesn’t
this make them the equivalent of a constant?
There’s a method to the madness that makes perfect sense when analyzed. Because XSLT
relies on templates that can be called randomly, depending on the structure of the source XML
262
document, the capability to update a particular global variable could introduce problems. These
types of problems are referred to as “side effects,” and the XSLT specification eliminates the
problem by allowing for no side effects. If you’ve ever stepped into a project that you didn’t
originally code that had bugs cropping up because of overuse of global variables, you’ll appre-
ciate this concept.
To illustrate the concept of side effects more, let’s take a look at a simple example. If a tem-
plate named template1 relies on another template named template2 to update the value of a
global variable, what happens if template2 doesn’t ever get processed? The rather obvious
answer is that the global variable will never be updated and therefore can cause potential prob-
lems to template1 when it is processed. By not allowing variables in XSLT style sheets to be
updated, this problem is avoided.
NOTE
Some XSLT processors do allow for variables to be updated by using extension func-
tions. These functions are specific to the XSLT processor and are not part of the XSLT
specification.
It’s important to realize that XSLT documents can be written without the use of variables.
However, variables can aid in cleaning up the code and can also result in more efficient XSLT
style sheets. The following example uses a portion of the code shown earlier in Listing 7.4 to
demonstrate how a variable declared globally (as a child element of the xsl:stylesheet ele-
ment) can be used in XSLT:
<xsl:variable name=”count” select=”count(//@handicap[. < 11])”/>
<xsl:template match=”golfer”>

<td width=”12%” align=”left”>

<xsl:value-of select=”@handicap”/>
<br/>
<xsl:if test=”@handicap > 11”>
<xsl:value-of select=”$count”/> golfers have lower handicaps
</xsl:if>
</td>

</xsl:template>
263
CHAPTER 7
The xsl:variable element can have a name and a select attribute. The name attribute is
required and serves the obvious purpose of assigning a name that can be used to reference
the variable. The select attribute is optional but when listed, must contain a valid XPath
expression.
So what have we gained by using a variable in this template? The code has actually been made
much more efficient as compared to writing the same code without using the variable. Instead
of having to calculate the count of all the golfers in the XML document with handicaps less
than 11 each time a golfer node template is matched, the variable obtains this value and stores
it when the XSLT style sheet is first loaded. The variable can then be referenced in several 7
places by adding the $ character to the beginning of the variable name ($count in this case).
XML WITH XSLT

TRANSFORMING
AND ASP.NET
Doing this cuts out unnecessary processing during the transformation process.
At times, the value of an attribute may need to be dynamically generated and used in several
places. In many cases, using a variable can make this process easier:
<xsl:variable name=”color”>
<xsl:choose>
<xsl:when test=”@handicap = 10”>
#ff0000
</xsl:when>
<xsl:when test=”@handicap = 20”>
#ffff00
</xsl:when>
<xsl:otherwise>
#ffffff
</xsl:otherwise>
</xsl:choose>
</xsl:variable>
<font color=”{$color}”><xsl:value-of select=”@handicap”/></font>

<p> </p>
<font color=”{$color}” size=”4”><xsl:value-of select=”lastName”/></font>
Although the value of the color attribute found on the font tag could be added by using the
xsl:attribute element multiple times, by defining the value once in the variable, the code is
kept cleaner and the processing is more efficient.
TIP
The previous example shows a shortcut that can be used to embed data directly into
attributes that will be written out to the result tree. By wrapping a variable (or other
item) with the { and } brackets, it will dynamically be added to the result tree. This is
continues
264
in contrast to adding the attribute name and value to a tag by using the
xsl:attribute element. As another example, if you had an attribute named width in
an XML document, you could write it out to a table tag by doing the following:
<table width=”{@width}”>. This is similar to doing something such as <table
width=”<%=myWidth%>”> in ASP.NET.
Variables can also be useful for storing values returned by calling a template. This process will
be shown a little later after you’re introduced to the xsl:param element.
Parameters in XSLT
Parameters are useful in a variety of programming languages, with XSLT being no exception.
Parameters can be used in XSLT documents in two basic ways. First, parameter values can be
passed in from an ASP.NET application. This allows data not found within the XML document
or XSLT style sheet to be part of the transformation process. Second, parameter values can be
passed between XSLT templates in much the same way that parameters can be passed between
functions in C# or VB.NET. You’ll see how parameters can be used in both ways later in the
chapter.
Declaring a parameter is similar to declaring a variable. Simply name the parameter and add an
optional select attribute:
<xsl:param name=”myParam” select=”’My Parameter Value’”/>
As with variables, parameters can be children of the xsl:stylesheet or xsl:transform ele-

ments and can also be children of the xsl:template element. So when would you want to use
a parameter? Let’s assume that the golfers XSLT document shown in Listing 7.4 needs to show
a specific golfer based on user input in a Web form. To accomplish this task, the ASP.NET
application can pass in the value entered by the user to a parameter within the XSLT docu-
ment. This value can then be used to display the proper golfer’s information. A simplified
document that uses a parameter named golferNum is shown next:
<xsl:stylesheet xmlns:xsl=”http://www.w3.org/1999/XSL/Transform”
version=”1.0”>
<xsl:param name=”golferName” select=”’Dan’”/>
<xsl:template match=”golfers”>
<xsl:apply-templates select=”//golfer[name/firstName=$golferName]”/>
</xsl:template>
<xsl:template match=”golfer”>
<xsl:apply-templates/>
</xsl:template>
<xsl:template match=”name”>
<xsl:value-of select=”firstName”/>
265
CHAPTER 7

<xsl:value-of select=”lastName”/>
</xsl:template>
</xsl:stylesheet>
Having this parameter in the XSLT style sheet will cause a specific golfer’s information to be
transformed. Any other golfers in the XML document will simply be ignored. How is this
accomplished? A small change was made to the xsl:apply-templates element in the golfers
template. Instead of processing all golfer nodes in the XML document, the XPath expression
in the select attribute specifies the specific golfer node to process:
7
<xsl:apply-templates select=”//golfer[name/firstName=$golferName]”/>
XML WITH XSLT

TRANSFORMING
AND ASP.NET
Although this example hard-codes a value for the golferName parameter, an ASP.NET page
would normally pass the value in using specific classes in the System.Xml assembly. You’ll be
introduced to these classes later in the chapter. If the value passed into the golferName para-
meter from the ASP.NET page does not match up with an existing golfer node in the XML
document, no error will be raised.
Parameters can also be used in conjunction with the xsl:call-template element. Fortunately,
from working with other programming languages, you already have a good understanding of
how this works. Imagine calling a method named GetOrders() that accepts a single parameter
as an argument. The method call would look something like the following:
GetOrders(“ALFKI”);
Now imagine that GetOrders is the name of an XSLT template used to transform an XML
document containing customers and orders. Calling the template and passing the parameter can
be accomplished by doing the following:
<xsl:template match=”Customer”>
<xsl:call-template name=”GetOrders”>
<xsl:with-param name=”CustomerID” select=”@CustomerID”/>
</xsl:call-template>
</xsl:template>
<xsl:template name=”GetOrders”>
<xsl:param name=”CustomerID” select=”’ALFKI’”/>
<xsl:value-of select=”//Order[@CustomerID = $CustomerID]”/>
</xsl:template>
This example shows the use of the xsl:call-template and xsl:with-param elements to initi-
ate the template call. The xsl:call-template element has a single attribute that provides the
name of the template to call. The xsl:with-param element has two attributes. One is used to
name the parameter that data will be passed to and the other provides the value that is to be
passed. The select attribute can contain any valid XPath expression. The xsl:with-param
element can only be a child of the xsl:call-template or xsl:apply-templates element.
266
TIP
The xsl:param element named CustomerID (shown earlier) has a parameter value of
ALFKI with single quotes around it because it is a string value rather than an XPath
expression. Had the single quotes been omitted, the XSLT processor would try to find
a node named ALFKI (which doesn’t exist, of course). Although this seems fairly obvi-
ous, it’s an easy mistake to make.
The xsl:param element in the template being called is updated by using the xsl:with-param
element shown previously. It has two potential attributes, including name and select, as
described earlier. In the previous example, the parameter named CustomerID is assigned a
default value of ALFKI. This value will be overridden when the GetOrders template is called
and a parameter value is passed to it using the xsl:with-param element.
Because variables cannot be updated in XSLT, parameters play a large role in allowing values
to be passed between templates. A global parameter (declared as a child of the xsl:stylesheet
or xsl:transform elements) can receive input from an ASP.NET page, but it cannot be
updated more than once after processing of the XSLT document begins. However, the capabil-
ity to place parameters within the scope of a specific template body offers the capability to call
templates recursively. This is possible because a single template can call itself and pass a para-
meter value (or more than one value, in the case of multiple parameters within the template)
that can then be processed as appropriate.
Although the inability to update variables and parameters may seem somewhat restrictive, the
authors of the XSLT specification knew that it was necessary because XML documents can
contain many different structures. You can’t depend on one template being processed before
another, especially in the case where one XSLT document is used to transform a variety of
XML documents—all with different structures.
Accessing “Return” Values of XSLT Templates

You may have noticed that although a template can act somewhat like a method, it has no way
to return a response…or does it? By wrapping the xsl:variable element around a template
call made using the xsl:call-template element, the output normally written to the result tree
can instead be captured by the variable. This process is shown next:
<xsl:template match=”Customer”>
<xsl:variable name=”CID” select=”@CustomerID”/>
<xsl:variable name=”Orders”>
<xsl:call-template name=”GetNames”>
<xsl:with-param name=”CustomerID” select=”$CID”/>
</xsl:call-template>
267
CHAPTER 7
</xsl:variable>
<b>Customer</b><br/>
ID: <xsl:value-of select=”$CID”/><br/>
<xsl:for-each select=”$Orders//Order”/>
<xsl:value-of select=”@OrderID”/>
</xsl:for-each>
</xsl:template>
<xsl:template name=”GetNames”>
<xsl:param name=”CustomerID” select=”’ALFKI’”/>
<xsl:value-of select=”//Orders[@CustomerID = $CustomerID]”/>
</xsl:template>
7
XML WITH XSLT

TRANSFORMING
AND ASP.NET
The variable named Orders will be filled with a node-set generated by a call to the GetNames
template. Doing this offers a powerful means for building more dynamic and efficient XSLT
documents.
Now that you’re familiar with several of the main elements used in creating XSLT style sheets,
it’s time to examine XSLT functions.
XSLT Functions
In Chapter 3, “XPath, XPointer, and XLink,” you were introduced to several functions built in
to the XPath language. Because XSLT relies on XPath for creating expressions that locate
nodes in an XML document, these functions are available for use in your XSLT documents. In
addition to these functions, the XSLT language adds a few more. Table 7.3 shows these func-
tions and provides a description and example of using them in XSLT documents.
TABLE 7.3 XSLT Functions

Function Description
current() Returns a node set that contains the current node as its only
member. This function exists to help identify the current
node when it is different from the context node. In previous
examples you have seen that the context node can be repre-
sented by the . character. For example, to write out the
value of the context node being processed by a template,
you can do the following:
<xsl:value-of select=”.”/>
This code will provide the same result:

<xsl:value-of select=”current()”/>
268
TABLE 7.3 continued

current() (continued) At this point in the template, the current node is the same
as the context node. When used within the square brackets
of a predicate ([ and ]), however, the current node is often
different from the context node. An example of this is
shown next:
<xsl:template match=”/”>
<xsl:variable name=”golferVar”
➥select=”//golfer”/>
<xsl:text>Other Comparable Golfers: </xsl:text>
<xsl:for-each select=”$golferVar”>
<xsl:if test=”$golferVar[./@skill=
➥current()/@skill]”>
<xsl:value-of select=”$golferVar/name/
➥lastName”/>
</xsl:if>
</xsl:for-each>
</xsl:template>
The predicate statement ([./@skill = current()/

@skill]) will compare the value of the context node’s
skill attribute (in this case, the golfer node being looped
through) to the golfer node being handled by the template
(the current node). This is an example of how the context
node changes during a loop. However, the current node is
associated with the node being handled by the template and
stays the same during the looping process.
document(object,node-set?) Although XSLT makes it easy to transform a single XML
document into other structures, what happens if the result
tree needs to be created from more than just one XML
document? Using the document() function, other XML
documents can be pulled into an XSLT document for pro-
cessing. This can be useful in many situations, including
when one XML document contains a presentation structure
and another contains the data to be plugged into that struc-
ture. Using the document() function, these types of activi-
ties can be accomplished relatively easily. An example of
using the document() function is shown next:
<xsl:variable name=”xmlDoc”
➥select=”document(‘data.xml’)”/>
269
CHAPTER 7
TABLE 7.3 continued

This will load data.xml into the variable named xmlDoc.
Referencing elements within the external document can be
accomplished in the following manner:
<xsl:value-of select=”$xmlDoc//root/data/@id”/>
Although the preceding document() function example tar-

gets an external document, this function can also be used to
target a node-set within the main XML document. This can 7
be useful when you want to work with a lookup table
XML WITH XSLT

TRANSFORMING
AND ASP.NET
structure embedded in the XML document.
Aside from providing a string URI value, a node-set can be
passed to access the remote document, as shown next:
For the XML document:
<golfers>
<golfer
href=”http://www.ilikegolf.com/golfers/golfers.xm
➥l”/>
<handicaps href=”handicapsWest.xml”/>
<handicaps href=”handicapsEast.xml”/>
</golfer>
</golfers>
The following XSLT document creates a result tree con-

taining the handicap nodes from the referenced documents:
<xsl:stylesheet
xmlns:xsl=”http://www.w3.org/1999/XSL/Transform”
version=”1.0”>
<xsl:template match=”/”>
<golfers>
<xsl:apply-templates
➥select=”//handicaps”/>
</golfers>
</xsl:template>
<xsl:template match=”handicaps”>
<xsl:copy-of
➥select=”document(@href)//handicap”/>
</xsl:template>
</xsl:stylesheet>
270
TABLE 7.3 continued

element-available(string) This function is useful if you are writing an XSLT docu-
ment that may be processed using different XSLT proces-
sors. Before using elements that you know may not be
supported, a check can be made to see if the processor does
indeed support the element in question. This is helpful
when testing for XSLT elements found in a later version of
XSLT or in checking for vendor specific elements. The
function will return a Boolean value:
<xsl:if test=”element-
➥available(‘xsl:someNewElement’)”>
<xsl:someNewElement>HI!</xsl:someNewElement>
</xsl:if>
format-number(number, This function converts numbers to a string using a format

string,string?) pattern supplied in the second parameter. Here are some
examples of using this function:
The following function call returns 5,351:
format-number(5351,”#,###”)
The following function call returns 5351.00:

format-number(5351, “#.00”)

format-number(53.51, “#.0000”)

format-number(53.51, “0000.0000”)

format-number(53.51, “0000.####”)

format-number(53.56, “0.0”)
function-available(string) Similar to element-available, although this function checks

whether specific functions are supported by the XSLT
processor.
generate-id(node-set?) This function generates a string that is guaranteed to
uniquely identify a node. The same string will always be
returned for the same node. The string that is generated
will vary from processor to processor and will start with an
alphabetic character.
271
CHAPTER 7
TABLE 7.3 continued

<xsl:attribute name=”id”>
<xsl:value-of select=”generate-id(.)”/>
</xsl:attribute>
key(string,object) This function is used in conjunction with the xsl:key ele-

ment to return a node-set that has a specific name and
value defined by the xsl:key statement. For example, the
following code shows how the xsl:key element can define 7
a key:
XML WITH XSLT

TRANSFORMING
AND ASP.NET
<xsl:key name=”course-city” match=”course”
➥use=”@city”/>
This key can then be accessed by using the key() function:

<xsl:for-each select=”key(‘course-city’,
’Pinetop’)”>
...
</xsl:for-each>
system-property(string) This function returns the value of the system property iden-
tified by the name passed as the argument. Three different
system properties must be supported by a compliant XSLT
processor, including xsl:version, xsl:vendor, and
xsl:vendor-url. An example of using this function
follows:
<xsl:value-of select=”system-property
➥(‘xsl:version’)”/>
unparsed-entity-uri() This function returns declarations of unparsed entities in

the DTD of the source XML document.
Given the entity declaration:
<!ENTITY clubs SYSTEM
➥“http://www.lottaclubs.com/clubs.txt”>
The following code:

<xsl:value-of select=”unparsed-entity-
➥uri(‘clubs’)”/>
Would return a value of

http://www.lottaclubs.com/clubs.txt.
272
.NET Classes Involved in Transforming XML

Now that you’ve seen the different XSLT elements and functions that are at your disposal, it’s
time to learn about what classes in the .NET framework can be used in your ASP.NET applica-
tions when XSL transformations are necessary. After all, XSLT is simply a text-based language
that is of little utility without an XSLT processor.
Several classes built in to the System.Xml assembly can be used when transforming XML into
other structures via XSLT. Back in Listing 7.3, a preview of a few of these classes interacting
with each other was given that demonstrated how to transform an XML document into HTML.
In this section you’ll learn more about these classes and a few others so that you are fully
armed with everything you need to know to use XSLT in your ASP.NET applications. Figure
7.3 presents an overview of the main classes used in XSL transformations.
XML Stores
DOM DOM <-> DataSet Fast XSLT

XmlDocument XmlDataDocument XPathDocument
Implement XPathNavigator and IXPathNavigable
XSL/T XPath
XslTransform
XmlReader XmlWriter
System.IO.Stream System.IO.Textwriter
FIGURE 7.3
.NET Classes involved in XSL transformations.
Table 7.4 provides a description of each of these classes.
TABLE 7.4 .NET Classes Used in XSL Transformations

Class Description
XmlDocument The XmlDocument class implements the IXPathNavigable interface
and extends the XmlNode class, which provides the capability to create
nodes within a DOM structure. This class was discussed in Chapter 6.
Because the XmlDocument class provides node-creation capabilities, it
will not provide the fastest throughput in XSL transformations.
However, in cases where a DOM structure must be edited first before
being transformed, this class can be used.
273
CHAPTER 7
TABLE 7.4 continued

Class Description
XmlDataDocument The XmlDataDocument class extends the XmlDocument class. The
XmlDataDocument class can be used when working with DataSets in
ADO.NET. Chapter 8, “Leveraging ADO.NET’s XML Features
Using ASP.NET,” covers this class in more depth.
XPathDocument The XPathDocument class implements the IXPathNavigable interface
like the XmlDocument class does. However, the XPathDocument class
does not extend the XmlNode class (as the XmlDocument class does) 7
and therefore provides the fastest option for transforming XML via
XML WITH XSLT

TRANSFORMING
XSLT. You’ll see this class used in the examples that follow.
AND ASP.NET
Because the XPathDocument class implements the IXPathNavigable
interface, it is able to leverage features built in to the abstract
XPathNavigator class (which, in turn, uses the XPathNodeIterator
abstract class for iteration over node-sets) to provide cursor-style
access to XML data, resulting in fast and efficient XSL transforma-
tions.
XslTransform The XslTransform class is used to transform XML data into other
structures. Using the XslTransform class involves instantiating it,
loading the proper style sheet with the Load() method, and then pass-
ing specific parameters to its Transform() method. This process will
be detailed in the next few sections.
XsltArgumentList The XsltArgumentList class is used to provide parameter values to
xsl:param elements defined in an XSLT style sheet. It can be passed
as an argument to the XslTransform class’s Transform() method.
The XPathDocument Class

Before looking at the XslTransform class, you need to familiarize yourself with the
XPathDocument class. To use this class you must reference the System.Xml.XPath namespace
in your ASP.NET applications. As mentioned in Table 7.4, this class provides the most efficient
way to transform an XML document using XSLT because it provides a read-only representa-
tion of a DOM structure. The XPathDocument class is very simple to use because it has only
one XML-related method named CreateNavigator() that can be used to create an instance of
the XPathNavigator class. However, it does have several constructors that are worth mention-
ing. Table 7.5 shows the different constructors.
274
TABLE 7.5 XPathDocument Constructors
Constructor Description
Public XPathDocument(XmlReader, Accepts an XmlReader as well as an
XmlSpace) XmlSpace enumeration.
Public XPathDocument(XmlReader) Accepts an XmlReader.
Public XPathDocument(TextReader) Accepts a TextReader.
Public XPathDocument(Stream) Accepts a Stream.
Public XPathDocument(string,XmlSpace) Accepts the string value of the path to
an XML document and an XmlSpace
enumeration.
Public XPathDocument(string) Accepts the string value of the path to an
XML document.
Listing 7.3 used the last constructor shown in Table 7.5 that accepts the path to the XML docu-
ment to transform. You could also load the XPathDocument with XML data contained in a
Stream (a FileStream for instance), an XmlReader, or a TextReader. Having these different
constructors offers you complete control over how transformations will be carried out in your
ASP.NET applications. Which one you use will depend on how you choose to access your
application’s XML documents. Listing 7.8 instantiates an XPathDocument class by passing in
an XmlTextReader object.
LISTING 7.8 Instantiating an XPathDocument Class

<%@ Import Namespace=”System.Xml.XPath” %>
<%@ Import Namespace=”System.IO” %>
<%@ Import Namespace=”System.Text” %>
string xmlPath = Server.MapPath(“listing7.1.xml”);
string xslPath = Server.MapPath(“listing7.2.xsl”);
FileStream fs = new FileStream(xmlPath,FileMode.Open,

FileAccess.Read);
StreamReader reader = new StreamReader(fs,Encoding.UTF8);
XmlTextReader xmlReader = new XmlTextReader(reader);
//Instantiate the XPathDocument Class

XPathDocument doc = new XPathDocument(xmlReader);
Response.Write(“XPathDocument successfully created!”);
275
CHAPTER 7

//Close Readers
reader.Close();
xmlReader.Close();
}
</script>
Running the code shown in Listing 7.5 will write out “XPathDocument successfully created!”
to the browser. You’ll certainly agree that because it has simply readied the XML document for
transformation, this code doesn’t buy you much. To actually transform the XML document 7
XML WITH XSLT

using XSLT, you’ll need to use another class named XslTranform.
TRANSFORMING
AND ASP.NET
The XslTransform Class
The XslTransform class is found in the System.Xml.Xsl namespace. Using it is as easy as
instantiating it, loading the XSLT document, and then calling its Transform() method. Tables
7.6 and 7.7 show the different properties and methods found in the XslTransform class.
TABLE 7.6 XslTransform Class Properties
XmlResolver The XmlResolver property can be used to specify a resolver class used
to resolve external resources. For example, it can be used to resolve
resources identified in xsl:include elements. If this property is not
set, the XslTransform class will use the relative path of the supplied
XSLT style sheet to resolve any included style sheets.
Chapter 5 showed an example of using the XmlUrlResolver class to
access authenticated documents.
TABLE 7.7 XslTransform Class Methods
Method Description
Load() Loads an XSLT document. This method can accept an XmlReader, a
document URL, or a variety of other objects.
Transform() The Transform() method is overloaded and can therefore accept a
variety of parameters. The most common form of the method that
you’ll likely use in your ASP.NET applications is shown next (check
the .NET SDK for the other overloaded versions of the method):
xsl.Transform(XpathDocument,XsltArgumentList,Stream)
276
Listing 7.9 builds on Listing 7.8 by adding in the XslTransform class.
LISTING 7.9 Using the XslTransform Class

5: <%@ Import Namespace=”System.Text” %>
8: string xmlPath = Server.MapPath(“listing7.1.xml”);
9: string xslPath = Server.MapPath(“listing7.2.xsl”);
10:
11: FileStream fs = new FileStream(xmlPath,FileMode.Open,
12: FileAccess.Read);
13: StreamReader reader = new StreamReader(fs,Encoding.UTF8);
14: XmlTextReader xmlReader = new XmlTextReader(reader);
15:
17: XPathDocument doc = new XPathDocument(xmlReader);
18:
19: //Instantiate the XslTransform Class
20: XslTransform xslDoc = new XslTransform();
21: xslDoc.Load(xslPath);
22: xslDoc.Transform(doc,null,Response.Output);
23:
24: //Close Readers
25: reader.Close();
27: }
28: </script>
In the next section, you’ll see how you can pass in parameter values to XSLT style sheets using
the XsltArgumentList class.
The XsltArgumentList Class

Earlier in the chapter, you saw how parameters could be used in XSLT style sheets through the
xsl:param element. As a quick refresher, this XSLT element must have a name attribute and
optional select attribute:
<xsl:param name=”customerID” select=”’ALFKI’”/>
XSLT parameters allow your ASP.NET applications to pass in values needed by the style sheet
to properly process the source XML document. In this section you’ll see how to create an
277
CHAPTER 7
XsltArgumentList class and add parameter name/value pairs to it. It can also be used with
extension objects. Table 7.8 shows the different methods available on the XsltArgumentList
class (it has no properties).
TABLE 7.8 XsltArgumentList Methods
Method Description
AddExtensionObject(namespaceURI, Allows an extension object to be added to the
object) collection of extension objects. The namespace
URI can be used to remove or retrieve an 7
object from the collection using either
XML WITH XSLT

TRANSFORMING
AND ASP.NET
GetExtensionObject() or RemoveExtension
Object(). Similar to the addObject() method
found on the IXslProcessor interface in
MSXML3.
AddParam(name,namespaceURI,value) Allows a parameter name/value pair to be added
to the collection of parameters. If you do not want
to assign a namespaceURI, the URI can be empty
strings. It is similar to the addParameter()
method found on the IXslProcessor interface in
MSXML3.
GetExtensionObject(namespaceURI) Allows an extension object to be retrieved from
the collection of extension objects based on the
namespaceURI assigned to the object in the
AddExtensionObject() method.
GetParam(name,namespaceURI) Allows a parameter name/value pair to be
retrieved from the collection of parameters based
on a name and namespaceURI combination. If a
parameter name has no assigned namespaceURI,
the URI can be empty strings.
RemoveExtensionObject Allows an extension object to be removed from
(namespaceURI) the collection of extension objects based on the
namespaceURI assigned to the object in the
AddExtensionObject() method.
RemoveParam(name,namespaceURI) Allows a parameter name/value pair to be
removed from the collection of parameters based
on a name and namespaceURI combination. If a
parameter name has no assigned namespaceURI,
the URI can be empty strings.
278
The method that you’ll use most frequently among those listed in Table 7.8 is the AddParam()
method. This method accepts the name of the parameter, a namespace URI (optional), and the
value of the parameter. The following example shows how to add a parameter named
golferName to the XsltArgumentList collection:
XSLT Code:
<xsl:param name=”golferName”/>
ASP.NET Code:
XsltArgumentList args = new XsltArgumentList();
args.AddParam(“golferName”,””,”Dan”);
This code allows the XSLT parameter named golferName to be assigned a value of Dan.
Although the value of Dan was hard-coded into the AddParam() method, it could just as easily
be dynamically pulled from a text box or drop-down box, as you’ll see in the next example.
Because the XsltArgumentList class relies on the HashTable class behind the scenes, multiple
parameter name/value pairs can be added and stored.
After an XsltArgumentList class has been instantiated and filled with the proper name/value
pairs, how do the parameters in the XSLT style sheet get updated with the proper values? The
answer is to pass the XsltArgumentList into the XslTransform class’s Transform() method,
as shown next:
//Create the XPathDocument object
XPathDocument doc = new XPathDocument(Server.MapPath(“Listing7.1.xml”));
//Create the XslTransform object

XslTransform xslDoc = new XslTransform();
xslDoc.Load(Server.MapPath(“Listing 7.4.xsl”));
//Create the XsltArgumentList object

XsltArgumentList args = new XsltArgumentList();
args.AddParam(“golferName”,””,”Dan”);
//Perform the transformation - pass in the parameters in the XsltArgumentList

xslDoc.Transform(doc,args,Response.Output);
In the next section you’ll be presented with an ASP.NET application that does this task.
Putting It All Together

You’ve now seen the main XSLT classes built in to the .NET framework. In this section you’ll
see how these can be used to build a simple ASP.NET application that allows a user to select a
specific golfer’s information from an XML document. After the golfer is chosen, XSLT will be
279
CHAPTER 7
used along with the XPathDocument, XslTransform, and XsltArgumentList classes to display
the golfer’s information. Figures 7.4 and 7.5 show screen shots of the two pages involved in
the sample XSLT application.
XML WITH XSLT

TRANSFORMING
AND ASP.NET
FIGURE 7.4
The golfer selection form.
FIGURE 7.5
The XSLT-generated results of the golfer selection.
To build this application, code-behind techniques were used in the ASP.NET page. If you’re
not familiar with this mechanism in ASP.NET coding, it allows the actual program code to be
stored separately from the visual portion (the HTML) found in the ASP.NET page. The tech-
nique of placing all the code (programming code and HTML) into one ASP.NET page shown
in many places throughout the book was used simply to make listings easier to read and follow.
280
In practice, however, it’s highly recommended that you leverage code-behind techniques to
keep your ASP.NET code more maintainable.
For this example, a file named xsltGolfer.aspx.cs contains all the programming code for the
listings that follow, and xsltGolfer.aspx contains the HTML. The XSLT style sheet used for
the application is named xsltGolfer.xsl. Let’s start by examining what code is executed
when the ASP.NET page first loads (the Page_Load event). As you’ll see in Listing 7.10, this
code takes care of loading all the firstName element values found in the XML document into
a drop-down box.
LISTING 7.10 The Page_Load Event and FillDropDown() Method (xsltGolfer.aspx.cs)

1: private void Page_Load(object sender, System.EventArgs e) {
2: if (!Page.IsPostBack) {
3: FillDropDown(“firstName”);
4: }
5: }
6:
7: private void FillDropDown(string element) {
8: string name = “”;
9: this.ddGolferName.Items.Clear();
10: XmlTextReader reader = new XmlTextReader(xmlPath);
11: object firstNameObj = reader.NameTable.Add(“firstName”);
13: if (reader.Name.Equals(firstNameObj)) {
14: name = reader.ReadString();
15: ListItem item = new ListItem(name,name);
16: this.ddGolferName.Items.Add(item);
17: }
18: }
19: reader.Close();
20: }
You can see that the XmlTextReader and XmlNameTable classes are used to efficiently parse the
XML data and add it to the drop-down box (ddGolferName). Both classes were discussed in
Chapter 5, “Using the XmlTextReader and XmlTextWriter Classes in ASP.NET.”
After the user selects a specific golfer from the drop-down box and clicks the button, the
btnSubmit_Click event is fired. The code within this event takes care of getting the selected
golfer name value from the drop-down box and passes it into the XSLT style sheet by using the
XsltArgumentList class. The style sheet then takes care of transforming the selected golfer’s
XML data into HTML, as shown earlier in Figure 7.5. Listing 7.11 shows the code involved in
this process.
281
CHAPTER 7
LISTING 7.11 Transforming XML to HTML Using XSLT (xsltGolfer.aspx.cs)

1: protected void btnSubmit_Click(object sender, System.EventArgs e) {
2: string xslPath = Server.MapPath(“xsltGolfer.xsl”);
4: StringBuilder sb = new StringBuilder();
5: StringWriter sw = new StringWriter(sb);
6:
7: try {
8: xmlReader = new XmlTextReader(xmlPath);
10: XPathDocument doc = new XPathDocument(xmlReader); 7
XML WITH XSLT

11:
TRANSFORMING
AND ASP.NET
12: //Instantiate the XslTransform Classes
15:
16: //Add Parameters
17: XsltArgumentList args = new XsltArgumentList();
18: args.AddParam(“golferName”,””,
19: this.ddGolferName.SelectedItem.Value);
20:
21: //Call Transform() method
22: transform.Transform(doc, args, sw);
23:
24: //Hide-Show ASP.NET Panels in xsltGolfer.aspx
25: this.pnlSelectGolfer.Visible = false;
26: this.pnlTransformation.Visible = true;
27: this.divTransformation.InnerHtml = sb.ToString();
28: }
29: catch (Exception excp) {
30: Response.Write(excp.ToString());
31: }
32: finally {
34: sw.Close();
35: }
36: }
Although this doesn’t show much in the way of new classes, it does show how the different
classes discussed in earlier sections can be tied together to create an ASP.NET application that
leverages XML and XSLT.
282
Using Extension Objects with XSLT

While looking through the methods exposed by the XsltArgumentList class back in Table 7.8,
you may have wondered how the extension object methods could be used to enhance XSLT/
ASP.NET applications. Using these methods is surprisingly easy and can provide your XSLT
style sheets with even more power and flexibility. Keep in mind that by using extension objects
in XSLT, you may render your XSLT unusable on other platforms or by other languages sim-
ply because extensions are not a part of the XSLT 1.0 specification (the XSLT 1.1 working
draft does include extension elements and functions, however). If your application will be the
only one that uses a particular XSLT style sheet and you need additional functionality not in
the XSLT 1.0 specification, extension objects may be the answer. Some other benefits of using
extension objects include:
• Methods on classes within other namespaces (other than System namespaces) can be
called.
• Extension functions allow better encapsulation and reuse of classes.
• Style sheets can be kept smaller and more maintainable.
What exactly is an extension object? Think of it as an external class that can be referenced and
used within an XSLT style sheet. By using extension objects, you can get the current date and
time, query a database to do a lookup based on a value found in the XML source document, hit
a Web service, or trigger another application to begin running. All of this and much more can
be done from within an XSLT style sheet.
To see how this works in practice, the next code sample shown in Listing 7.12 builds on the
previous one shown in Listing 7.11 to add the capability to write out the current date/time of
the server from a specific location within the style sheet. Let’s first look at the class that will
be instantiated and used as an extension object.
LISTING 7.12 The Date/Time Extension Class (xsltDateObject.cs)

1: namespace XsltTransformation.ExternalObjects {
2: using System;
3:
4: public class XsltDateTime {
5: DateTime _date;
6: public XsltDateTime() {
7: _date = DateTime.Now;
8: }
9: public DateTime GetDateTime() {
10: return _date;
11: }
12: }
13: }
283
CHAPTER 7
This class (named XsltDateTime) does nothing more than return the current system date and
time. It must be instantiated within an ASP.NET page and then added to the external object
collection of the XsltArgumentList class.
You may be wondering if it would be easier to pass the date and time into the style sheet using
a regular XSLT parameter. The answer is “yes”; it would be easier because no external objects
would be needed from within the style sheet. However, by calling the extension object from
within the XSLT style sheet, a more up-to-date date/time value will be returned (assuming that
accuracy matters in the application). And let’s face it, the demo code wouldn’t be as cool if a
regular XSLT parameter was used, especially because you know all about those at this point! 7
XML WITH XSLT

Listing 7.13 demonstrates how to pass an extension object into an XSLT style sheet using the
TRANSFORMING
AND ASP.NET
XsltArgumentList class. The lines of code relating to the extension object use are shown in
bold.
LISTING 7.13 Adding an External Object to the XsltArgumentList Class

(xsltExtension.aspx.cs)
1: protected void btnSubmit_Click(object sender, System.EventArgs e) {
2:
3: string xslPath = Server.MapPath(“xsltExtension.xsl”);
4: XsltDateTime xsltExtObj = new XsltDateTime(); //The Extension Object
8:
9: try {
10: xmlReader = new XmlTextReader(xmlPath);
11:
13: XPathDocument doc = new XPathDocument(xmlReader);
14:
15: //Instantiate the XslTransform Classes
18:
19: //Add Parameters and Extension Object to the Collection
21: args.AddParam(“golferName”,””,
22: this.ddGolferName.SelectedItem.Value);
23: //Add the namespaceURI and object to the object collection
24: args.AddExtensionObject(“urn:xsltExtension-DateTime”,
25: xsltExtObj);
26:
27: //Call Transform() method
284

28: transform.Transform(doc, args, sw);
29:
30: //Hide ASP.NET Panels
31: this.pnlSelectGolfer.Visible = false;
32: this.pnlTransformation.Visible = true;
33: this.divTransformation.InnerHtml = sb.ToString();
34: }
35: catch (Exception excp) {
36: Response.Write(excp.ToString());
37: }
38: finally {
40: sw.Close();
41: }
42: }
The XSLT style sheet obviously needs to be able to reference the XsltDateTime object that is
passed in and call its GetDateTime() method. This is accomplished by adding the proper
namespace prefix and URI into the style sheet. For this example, a namespace URI of
urn:xsltExtension-DateTime is used along with a namespace prefix of dateTimeObj. Any
namespace URI can be used as long as it is consistent between the ASP.NET page and the
XSLT style sheet. Listing 7.14 shows the complete style sheet and highlights where the exter-
nal object is referenced and used.
LISTING 7.14 Calling External Objects Within an XSLT Style Sheet (xsltExtension.xsl)
3: xmlns:dateTimeObj=”urn:xsltExtension-DateTime” version=”1.0”>
5: <xsl:param name=”golferName” select=”’Dan’”/>
7: <xsl:apply-templates
8: select=”//golfer[name/firstName=$golferName]”/>
9: </xsl:template>
12: </xsl:template>
14: <table class=”borders” border=”0” width=”640” cellpadding=”4”
15: cellspacing=”0” bgcolor=”#efefef”>
285
CHAPTER 7

19: <b>Skill: </b>
20: </td>
23: <xsl:choose>
26: </xsl:when>
27: <xsl:when test=”@skill=’moderate’”> 7
XML WITH XSLT

28: color:#005300;
TRANSFORMING
AND ASP.NET
29: </xsl:when>
31: color:#000000;
32: </xsl:when>
33: <xsl:otherwise>
34: color:#000000;
36: </xsl:choose>
39: </td>
42: </td>
45: </td>
47: <b>Clubs: </b>
48: </td>
51: </td>
52: </tr>
53: <tr>
55: </tr>
59: </td>
60: </tr>
61: <tr>
286

63: <b>City: </b>
64: </td>
66: <b>State: </b>
67: </td>
69: <b>Course: </b>
70: </td>
71: </tr>
73: </table>
74: <p/>
75: <xsl:value-of select=”dateTimeObj:GetDateTime()”/>
76: </xsl:template>
78: <tr>
81:
83: </td>
84: </tr>
85: </xsl:template>
88: </xsl:template>
94: </td>
97: </td>
100: </td>
101: </tr>
287
CHAPTER 7
How is the GetDateTime() method called on the XsltDateTime object that is passed into
the style sheet? This is accomplished by referencing the namespace prefix (dateTimeObj)
associated with the namespace URI assigned to the object in the ASP.NET page (urn:xslt
Extension-DateTime). The prefix is declared in line 3 and then used to call the GetDate
Time() method in line 75. Although this is a fairly straightforward example of using external
objects within XSLT style sheets, much more complicated functionality, such as querying data-
bases or calling Web services, can be accomplished with XSLT external objects.
Before this chapter ends, the next section will show how to create a reusable XSLT class that
can be used within ASP.NET applications and demonstrate the asp:Xml Web control. 7
XML WITH XSLT

TRANSFORMING
AND ASP.NET
Creating a Reusable XSLT Class
By now you should feel comfortable working with the different classes within the System.Xml
assembly that can be used to perform XSL transformations. However, wouldn’t it be nice if
you could write a generic class that doesn’t require any knowledge of the XPathDocument or
XslTransform classes (and their supporting classes) to be used? Not only would such a class
provide more productive programming, but it would also allow your ASP.NET applications to
leverage all the benefits offered by following object-oriented programming techniques.
The code in Listing 7.15 shows a generic class that allows an ASP.NET programmer to trans-
form an XML document using XSLT without any knowledge of XSLT-specific classes. This
code leverages well-known classes such as HashTable to accomplish the transformation.
LISTING 7.15 A Reusable XSLT Transformation Class (xsltTransform.cs)

1: namespace XsltTransformation {
2:
3: using System;
5: using System.Xml.XPath;
6: using System.Xml.Xsl;
8: using System.IO;
10:
11: /// <summary>
12: /// A generic XSL Transformation Class for use in ASP.NET pages
13: /// </summary>
14: public class XsltTransform {
15:
16: public static string TransformXml(string xmlPath,string xsltPath,
17: Hashtable xsltParams,Hashtable xsltObjects) {
288
18:

21: try {
22: XPathDocument doc = new XPathDocument(xmlPath);
25: xslDoc.Load(xsltPath);
26:
27: //Fill XsltArgumentList if necessary
28: if (xsltParams != null) {
29: IDictionaryEnumerator pEnumerator =
30: xsltParams.GetEnumerator();
31: while (pEnumerator.MoveNext()) {
32: args.AddParam(pEnumerator.Key.ToString(),””,
33: pEnumerator.Value);
34: }
35: }
36: if (xsltObjects != null) {
37: IDictionaryEnumerator pEnumerator =
38: xsltObjects.GetEnumerator();
39: while (pEnumerator.MoveNext()) {
40: args.AddExtensionObject(pEnumerator.Key.ToString(),
41: pEnumerator.Value);
42: }
43: }
44: xslDoc.Transform(doc,args,sw);
45: return sb.ToString();
46: }
48: return exp.ToString();
49: }
50: finally {
51: sw.Close();
52: }
53: }
54: } //XsltTransform
55: } // XsltTransformation namespace
Having a reusable class for XSL transformations results in much cleaner ASP.NET code, as
Listing 7.16 shows.
289
CHAPTER 7
LISTING 7.16 Using the XsltTransform Class in ASP.NET (xsltTransform.aspx)

public void Page_Load(object sender, System.EventArgs e) {
string xmlPath = Server.MapPath(“Listing7.1.xml”);
string xslPath = Server.MapPath(“xsltExtension.xsl”);
//Create the External Object

XsltDateTime xsltExtObj = new XsltDateTime();
//Create Hashtables to hold params and external objects

//If none are needed, pass null into the TransformXml() method instead
Hashtable xsltParams = new Hashtable(); 7
XML WITH XSLT

Hashtable xsltObjects = new Hashtable();
TRANSFORMING
AND ASP.NET
//Fill the Hashtables with the params and external objects
xsltParams.Add(“golferName”,”Heedy”);
xsltObjects.Add(“urn:xsltExtension-DateTime”,xsltExtObj);
//Call the custom XsltTransform class’s TransformXml method

string xsl = XsltTransform.TransformXml(xmlPath,xslPath,
xsltParams,xsltObjects);
Response.Write(xsl);
}
As this book was going to press, I wrote a new article demonstrating how the techniques
shown in Listing 7.15 can be used to create an ASP.NET server control geared at targeting
multiple devices using XML and XSLT. This article will appear in the premier issue
(September 2001) of Visual Studio Magazine (http://www.devx.com), and the code can be
downloaded from the DevX site or from the CodeBank section of
http://www.TomorrowsLearning.com.
The Asp:Xml Web Control

The .NET framework also comes with a prebuilt Web control that can be used for doing simple
XSL transformations in ASP.NET pages. This control allows the XML document source and
XSLT style sheet source to be set using attributes. Listing 7.17 shows how the control is used.
LISTING 7.17 Using the asp:Xml Web Control

1: <html>
2: <body>
3: <asp:Xml ID=”xslTransform” Runat=”server”
290

4: DocumentSource=”Listing7.1.xml”
5: TransformSource=”Listing7.2.xsl”>
6: </asp:Xml>
7: </body>
8: </html>
In situations where the values of the DocumentSource and/or TransformSource attributes are
not known until runtime, they can be assigned dynamically, as Listing 7.18 shows.
LISTING 7.18 Dynamically Assigning Source Documents to the asp:Xml Web Control
3: xslTransform.DocumentSource = “Listing7.1.xml”;
4: xslTransform.TransformSource = “Listing7.2.xsl”;
5: }
6: </script>
7: <html>
8: <body>
9: <asp:Xml ID=”xslTransform” Runat=”server”></asp:Xml>
10: </body>
11: </html>
The asp:Xml server control also allows DOM structures to be passed into it programmatically
using the Document and Transform properties:
1: <%@Import Namespace=”System.Xml”%>
2: <%@Import Namespace=”System.Xml.Xsl”%>
6: doc.Load(Server.MapPath(“Listing7.1.xml”));
7: XslTransform trans = new XslTransform();
8: trans.Load(Server.MapPath(“Listing7.2.xsl”));
9: xslTransform.Document = doc;
10: xslTransform.Transform = trans;
11: }
12: </script>
13: <html>
14: <body>
15: <asp:Xml ID=”xslTransform” Runat=”server”>
16: </asp:Xml>
17: </body>
18: </html>
291
CHAPTER 7
Summary
Although XSLT is a very big topic that can’t possibly be covered in a single chapter, you have
been exposed to some of the more important aspects of the language that will get you started
transforming XML documents in ASP.NET applications. XSLT provides a cross-platform,
language-independent solution that can used to transform XML documents into a variety of
structures.
The .NET platform provides several classes developed specifically for doing XSLT transforma-
tions, including the XslTransform and XPathDocument classes. By using these and other
7
classes, you have the ability to leverage the complete XSLT language in your ASP.NET
XML WITH XSLT

TRANSFORMING
AND ASP.NET
applications.
In the next chapter, you’ll be provided with an in-depth look at the different XML features
found in ADO.NET.
Leveraging ADO.NET’s XML CHAPTER
8
Features Using ASP.NET
IN THIS CHAPTER
• Introducing ADO.NET 294
• Comparing Classic ADO with ADO.NET 294
• ADO.NET Basics 295
• The DataSet Class 308
• Working with the DataSet and

XmlDataDocument Classes 324
294
Introducing ADO.NET
ADO.NET represents the next evolutionary step in working with data located in local or dis-
tributed data stores. Although its name is very similar to the ActiveX Data Objects (ADO) of
old, ADO.NET brings an abundance of new features to the table. In fact, ADO.NET has been
rewritten from the ground up to make working with all types of data (including data in the
form of XML) easier and more manageable. This chapter discusses many of ADO.NET’s new
features and shows you how straightforward it is to work with data structured as XML. If
you’re already familiar with “classic” ADO, you’re likely interested in knowing how it differs
from ADO.NET. Let’s start things off by comparing the two.
Comparing Classic ADO with ADO.NET

If you’ve had the opportunity to work with ADO much, you know it represents an excellent
way of retrieving and manipulating data from a variety of data stores. Whether you’re hitting
SQL Server, Oracle, DB2, or even a nonrelational store, ADO provides many interfaces that
allow you to effectively integrate data into your applications.
ADO even allows for the automatic generation of XML, although this feature didn’t appear
until ADO 2.5. Because of XML’s more recent emergence as a power player among the
world’s programming languages, however, it was an afterthought in ADO and not truly inte-
grated from the ground up. This doesn’t infer that the architects of ADO didn’t plan properly
for the future. It simply exemplifies how fast technology changes!
XML Integration
ADO.NET flips the scale as far as XML integration goes. No longer are you restricted to con-
verting a Recordset to XML using adPersistXml or some other manual means. In ADO.NET,
XML is a first-class citizen, and its integration is apparent from the start. What does this mean
for your ASP.NET projects? Quite simply, you no longer have to worry about which way you’d
like to work with data. If you would rather work with a standard relational view, you can. If
you need to work with XML natively, you can do that as well. Changing between these differ-
ent views can be accomplished with a call to a single property. We’ll talk more about this in a
moment.
What Happened to the RecordSet Object?

XML’s tight integration with ADO.NET isn’t the only big change that has been made. You may
be surprised to know that the RecordSet object no longer exists in ADO.NET! Wait, you say,
what happened to the RecordSet object? Wasn’t it one of the main objects in ADO? Although
the RecordSet is very useful, it has been replaced with a more functional and robust class
Leveraging ADO.NET’s XML Features Using ASP.NET
295
CHAPTER 8
called the DataSet. Although we’ll discuss the DataSet in more detail later in this chapter,
some of its new features include support for hierarchical relationships between tables, the
capability to add constraints, and much more. This means that you can now relate a database’s
Customers table to the Orders table all within the same object. This hierarchical association is
done without resorting to shape commands.
The RecordSet object also has another inherent problem that could haunt you, depending on the
context it was being used in. For example, if you attempt to pass a RecordSet between objects
living on opposite sides of a firewall, a potential problem could arise if the RecordSet was
blocked from passing through. With ADO.NET, this problem is alleviated because the DataSet
class is based in XML. Because XML can ride on top of protocols such as HTTP, exchanging
data records between objects living virtually anywhere in the world is now possible.
Disconnected Versus Connected

The DataSet class shifts ADO.NET away from the connection-based model exposed by ADO
and toward a more message-like model. This change offers many improvements, including the
capability to send data through firewalls, the elimination of type conversions in COM, and a
truly disconnected way of working with data. ADO.NET’s basis in XML means that virtually 8
any data type can now be supported and passed. This is in contrast to the limited set of data
XML FEATURES
types exposed by COM marshaling. The elimination of type conversions results in an increase
ADO.NET’S
LEVERAGING
in scalability and efficiency.
Another important point that can be made about ADO.NET is that DataSets are always dis-
connected, which results in connections being released earlier. Looping through a DataSet can
be done without any connection being open to the data source. Although this can be done
using the disconnected RecordSet in ADO, disconnected Recordsets are still subject to the
inefficiencies found in COM marshaling, as mentioned earlier.
Although many other benefits are associated with ADO.NET, we’ll limit the scope of our dis-
cussions to topics relating specifically to ADO.NET, XML, and ASP.NET in this chapter.
However, if you have no experience working with ADO.NET, there’s no need to worry. The
next few sections cover enough of the basics to get you up to speed quickly. After the basics
are covered, you’ll see how XML can be used in conjunction with ADO.NET.
ADO.NET Basics
If you have experience working with “classic” ADO, you’ll be happy to know that much of
what you already know can be applied to ADO.NET. Although the RecordSet object no longer
exists, you’ll find that ADO.NET’s Connection, Command, and DataSet classes are still based
on many principles you’re already familiar with. One big change that you’ll see in the exam-
ples that follow is that ADO.NET is much more object oriented than ADO was. Even declaring
296
a data type is accomplished through the use of objects. To prepare you for working with the
new ADO.NET classes, the next section covers the different managed providers that can be
used to connect to a data source.
ADO.NET Managed Providers

ADO used providers such as Jet and OLE DB for accessing different data stores. Although
both still can play a role in ADO.NET, the choice of which managed provider to use when
working with the .NET platform is broken into two basic choices. These are
• SQL Managed Provider—This provider is used only if connecting to a SQL Server 7.0 or
later database. Direct communication with SQL Server can be achieved through the use
of the Tabular Data Stream (TDS) protocol. This presents an efficient means of access-
ing SQL Server because OLE DB is bypassed.
• OLE DB Managed Provider—This provider allows for access to all other data sources,
including Access, Sybase, Oracle, DB2, and so on. It can also be used to access SQL
Server 7.0 or later, although using the SQL managed provider is more efficient in this
scenario.
If you’re wondering why the word “managed” is being used to describe a provider, you need to
take into consideration how the .NET platform works. With the introduction of languages such
as C# and the Common Language Runtime, data or memory used within the .NET platform is
allocated and released automatically using a garbage collection system. The .NET framework
provides a runtime environment that manages code execution and many other services for you.
If you’re coming from a C++ background, you know that the process of having memory and
objects cleaned up for you isn’t normally found within the C++ language environment. Memory
must be allocated and managed by the programmer rather than by a system or platform.
The managed providers in ADO.NET simply leverage .NET’s managed environment so that
many coding details are taken care of for you in the background. This doesn’t preclude you
from implementing sound programming practices, but it does eliminate many complexities.
Different ADO.NET classes are used, depending on which managed provider you choose. The
code written to use either provider is remarkably similar in many cases, however. The next two
sections will introduce how you can establish a connection to a data source using either the
SqlConnection and OleDbConnection classes.
SQL Managed Provider Connection Class

When using the SQL Managed Provider, the System.Data.SqlClient namespace must be ref-
erenced in your ASP.NET page. Although not necessary for basic ADO.NET functionality, it’s
also wise to import the System.Data namespace because it will come into play when using
type information for stored procedure parameters, and so on. After these namespaces have been
297
CHAPTER 8
imported, initiating a connection to a SQL Server database is as easy as creating an instance of

the SqlConnection class, as shown next:
<%@ Import Name=”System.Data” %>
<%@ Import Name=”System.Data.SqlClient” %>
void Page_Load(Object Src, EventArgs E) {
string connStr = “server=localhost;uid=sa;pwd=;database=northwind”;
SqlConnection conn = new SqlConnection(connStr);
Response.Write(“Connection established!”);
}
</script>
Alternatively, the connection string can also be set using the ConnectionString property:
<%@ Import Name=”System.Data.SqlClient” %>
string connStr = “server=localhost;uid=sa;pwd=;database=northwind”;
SqlConnection conn = new SqlConnection();
conn.ConnectionString = connStr; 8
XML FEATURES
}
ADO.NET’S
LEVERAGING
</script>
As shown, the main class used in the preceding samples is the SQLConnection class.
OLE DB Managed Provider Connection Class

Now that you’ve seen how to use the SQL managed provider, learning how to use the OLE DB
managed provider will be a snap. Very few differences exist between the two providers as far
as instantiating connections to data sources. One change that must be made is importing the
System.Data.OleDb namespace into your ASP.NET page rather than the System.Data.
SqlClient namespace. The following code shows how to use the OleDbConnection class to
hit the Northwind database:
<%@ Import Name=”System.Data.OleDb” %>
string connStr = @”Provider=SQLOLEDB.1;Data Source=(local);
uid=sa;pwd=;Initial Catalog=northwind”;
OleDbConnection conn = new OleDbConnection();
conn.ConnectionString = connStr;
}
</script>
298
Comparing the SQL to the OLE DB managed provider, you can see that each imports a differ-
ent namespace and uses the appropriate connection class to attach to a data source. Apart from
these differences, the code is virtually the same.
The Command Class

After you have connected to a data source, you’ll more than likely want to either select data
or perform some other operation such as an update, insert, or delete. ADO.NET handles this type
of functionality through the Command class. The use of the word “Command” is generic because
the actual type of command that you use will depend on which ADO.NET managed provider
you use to connect to the data store. For example, if you’ll be using the SQL managed provider,
you’ll want to use the SqlCommand class. If you’ll be using the ADO managed provider, you’ll
want to use the OleDbCommand class. To see how this class is instantiated and used, take a look at
the samples that follow. Listing 8.1 and Listing 8.2 both instantiate the appropriate Command class
on Line 9, respectively.
LISTING 8.1 Using the SQL Managed Provider Command Class

1: public class SqlConnect {
2: private SqlConnection dataConn = null;
3: private SqlDataReader reader = null;
4:
5: public string openConnection(HttpResponse Response,
6: string dbConnectString,string cmdString) {
7: try {
8: dataConn = new SqlConnection(dbConnectString);
9: SqlCommand sqlCmd = new SqlCommand(cmdString,dataConn);
10: dataConn.Open();
11: reader = sqlCmd.ExecuteReader();
12: Response.Write(“<table><tr><td><b>ID</b></td>” +
13: “<td><b>Name</b></td></tr>”);
16: Response.Write(“<td>” + reader[“CategoryID”].ToString() +
17: “</td>”);
18: Response.Write(“<td>” + reader[“CategoryName”].ToString() +
19: “</td>”);
20: Response.Write(“</tr>”);
21: }
23: return “<p>Sql Server Data Connection Opened”;
24: }
26: return(e.ToString());
299
CHAPTER 8

27: }
28: finally {
30: reader.Close();
31: }
32: if (dataConn != null) {
33: dataConn.Close();
34: }
35: }
36: }
37: }
LISTING 8.2 Using the OLE DB Managed Provider Command Class

1: public class OleDbConnect {
2: private OleDbConnection dataConn = null;
3: private OleDbDataReader reader = null;
4:
5: public string openConnection(HttpResponse Response, 8
XML FEATURES
ADO.NET’S
LEVERAGING
7: try {
8: dataConn = new OleDbConnection(dbConnectString);
9: OleDbCommand oleDbCmd = new OleDbCommand(cmdString,dataConn);
11: reader = oleDbCmd.ExecuteReader();
12: Response.Write(“<table><tr><td><b>ID</b></td>” +
13: “<td><b>Name</b></td></tr>”);
16: Response.Write(“<td>” + reader[“CategoryID”].ToString() +
17: “</td>”);
18: Response.Write(“<td>” + reader[“CategoryName”].ToString() +
19: “</td>”);
21: }
23: return “<p>OleDb Server Data Connection Opened”;
24: }
27: }
28: finally {
300

30: reader.Close();
31: }
34: }
35: }
36: }
37: }
Both examples exemplify passing the command string and a Connection object as arguments
to the appropriate Command class’s constructor. Several other constructors can be used to instan-
tiate a Command, but we’ll forgo that discussion because the .NET documentation details each
constructor well.
After the Command object has been created, the connection’s Open() method is called and the
appropriate Command is executed. Assuming that the following SQL string is passed to the
openConnection() method found in both listings, all fields existing in the Categories table
will be returned:
“SELECT * FROM Categories”
Having all the fields in the Categories table is certainly a step forward (you have to start
somewhere, right?), but what can we do with them? Both listings access the different fields by
using the DataReader class (SqlDataReader or OleDbDataReader). As each field is accessed,
information about it is written out to a Web page using the Response object.
Although little focus will be placed on the DataReader in this chapter, it’s important to under-
stand that because of the read-only, forward-only architecture it provides, the DataReader class
offers the best performance for your ASP.NET applications in cases where data simply need to
be displayed. The DataReader works in a manner similar to the XmlTextReader discussed in
Chapter 5, “Using the XmlTextReader and XmlTextWriter Classes in ASP.NET,” and is very
efficient when dealing with multiple records because it does not buffer a large amount of data
at any given time. Several other classes and ASP.NET objects can be used to work with data
returned from a data source, which you’ll learn about later in this chapter.
Accessing Stored Procedures with the Command Class

Building scalable and robust ASP.NET applications involves taking advantage of any possible
gain in speed or efficiency. Accessing data using the SqlCommand or OleDbCommand classes is
no exception. Although getting data can be done with regular SQL statements, stored proce-
dures should be used whenever possible because they are compiled and they create a map spec-
ifying the quickest route to the data.
301
CHAPTER 8
The code used with the ADO.NET Command class to access a stored procedure is much like the
code used to do the same using the Command object in ADO. A few differences exist in how
data types are declared, but for the most part the parameters passed into or out of a stored pro-
cedure are created much like they used to be in ADO. Listing 8.3 shows an example of access-
ing a stored procedure using the SQL managed provider.
LISTING 8.3 Accessing Stored Procedures Using the SQL Provider

1: <%@ Import Namespace=”System.Data”%>
2: <%@ Import Namespace=”System.Data.SqlClient”%>
5: private SqlConnection dataConn = null;
6: private SqlDataReader reader = null;
7:
8: public string OpenConnection(HttpResponse Response,
9: string dbConnectString,string cmdString, string paramName,
10: string paramValue) {
11: try {
13: SqlCommand sqlCmd = new SqlCommand(cmdString,dataConn);
8
14: sqlCmd.CommandType = CommandType.StoredProcedure;
XML FEATURES
ADO.NET’S
LEVERAGING
15:
16: SqlParameter param = sqlCmd.Parameters.Add(new
17: SqlParameter(paramName,SqlDbType.Char, 5));
18:
19: param.Direction = ParameterDirection.Input;
20: sqlCmd.Parameters[paramName].Value = paramValue;
22: reader = sqlCmd.ExecuteReader();
23: Response.Write(“<table><tr><td><b>Product Name</b></td>”);
24: Response.Write(“<td><b>Total</b></td></tr>”);
27: Response.Write(“<td>”+reader[“ProductName”].ToString());
28: Response.Write(“</td>”);
29: Response.Write(“<td>”+reader[“Total”].ToString());
30: Response.Write(“</td></tr>”);
31: }
33: return “<p>Stored Procedure called successfully!”;
34: }
37: }
302

38: finally {
40: reader.Close();
41: }
44: }
45: }
46: }
47: } // SqlConnect
48:
50: string connStr = “server=localhost;uid=sa;pwd=;database=Northwind”;
51: string sql = “CustOrderHist”;
52: //Open Sql Connection
53: SqlConnect sqlconn = new SqlConnect();
54: sqlconn.OpenConnection(Response, connStr,sql,”@customerID”,”ALFKI”);
55: }
56: </script>
This example hits the Northwind sample database’s CustOrderHist stored procedure, which
accepts the customer ID as an input parameter. Although very limited in functionality (espe-
cially because it allows for only one parameter), the SqlConnect class takes care of executing
the CustOrderHist procedure by first setting the newly instantiated Command object’s
CommandType property to CommandType.StoredProcedure (line 14). The following types exist
in the CommandType enumeration:
• StoredProcedure
• TableDirect
• Text
After the CommandType is specified, lines 16–20 take care of adding the appropriate parameter
to be used with the stored procedure and specify its direction. You’ll notice that the data type
of the parameter uses another enumeration named SqlDbType rather than simply listing adChar
as in ADO. The SqlDbType enumeration contains the following data types:
BigInt NVarChar
Binary Real
Bit SmallDateTime
Char SmallInt
DateTime SmallMoney
303
CHAPTER 8
Decimal Text
Float TimeStamp
Image TinyInt
Int UniqueIdentifier
Money VarBinary
NChar VarChar
NText Variant
After the name, data type, and size of the parameter are created, the procedure’s value is set in
line 20 and the connection is opened. The remainder of the class uses the DataReader men-
tioned earlier in the chapter to write data out to the ASP.NET page. As previously mentioned,
the code to accomplish this task looks much like the code used to do the same in ADO, even
though ADO.NET introduces a few new classes and enumerations.
If you’re using the ADO managed provider, the type of the parameter can be set using the
OleDbType enumeration. This enumeration contains the following data types:
BigInt LongVarBinary
8
Binary LongVarChar
XML FEATURES
ADO.NET’S
LEVERAGING
Boolean LongVarWChar
BSTR Numeric
Char PropVariant
Currency Single
Date SmallInt
DBDate TinyInt
DBTime UnsignedBigInt
DBTimeStamp UnsignedInt
Decimal UnsignedSmallInt
Double UnsignedTinyInt
Empty VarBinary
Error VarChar
Filetime Variant
Guid VarNumeric
IDispatch VarWChar
Integer WChar
IUnknown
304
The connection class shown in Listing 8.3 would look like Listing 8.4 when using the OLE DB
managed provider.
LISTING 8.4 Accessing Stored Procedures Using the OLE DB Provider

2: <%@ Import Namespace=”System.Data.OleDb”%>
5: private OleDbConnection dataConn = null;
6: private OleDbDataReader reader = null;
8: string dbConnectString,string cmdString,
9: string paramName,string paramValue) {
10: try {
11: dataConn = new OleDbConnection(dbConnectString);
12: OleDbCommand oleDbCmd = new OleDbCommand(cmdString,dataConn);
13: oleDbCmd.CommandType = CommandType.StoredProcedure;
14: OleDbParameter param = null;
15: param = oleDbCmd.Parameters.Add(paramName, OleDbType.Char, 5);
16: param.Direction = ParameterDirection.Input;
17: oleDbCmd.Parameters[paramName].Value = paramValue;
19: reader = oleDbCmd.ExecuteReader();
20: Response.Write(“<table><tr><td><b>Product Name</b></td>” +
21: “<td><b>Total</b></td></tr>”);
24: Response.Write(“<td>” + reader[“ProductName”].ToString() +
25: “</td>”);
26: Response.Write(“<td>” + reader[“Total”].ToString() +
27: “</td>”);
29: }
32: }
35: }
36: finally {
38: reader.Close();
39: }
305
CHAPTER 8

42: }
43: }
44: } //OpenConnection
45: } //OleDbConnect
46:
48: string connStr = @”Provider=SQLOLEDB.1;Data Source=(local);
49: uid=sa;pwd=;Initial Catalog=Northwind”;
50: string sql = “CustOrderHist”;
52: OleDbConnect oleDbconn = new OleDbConnect();
53: oleDbconn.OpenConnection(Response, connStr,sql,”@customerID”,”ALFKI”);
54: }
55: </script>
If you don’t feel like typing all the parameter information, you can always call the procedure
directly passing the appropriate parameters. This would be similar to calling the stored proce-
dure using the SQL Server Query Analyzer:
8
string cmdString = “exec CustOrderHist @customerID = ‘ALFKI’”;
XML FEATURES
SQLCommand sqlCmd = new SQLCommand(cmdString,dataConn);
ADO.NET’S
LEVERAGING
Working with Output and Return Values
If a stored procedure you’re working with passes back output or return values, the following
lines can be added as appropriate to the code shown in Listings 8.3 and 8.4 to handle these
values.
SQL Managed Provider Output Value:
param = sqlCmd.Parameters.Add(new SqlParameter(“@output”, SqlDbType.Int));
param.Direction = ParameterDirection.Output;
// ......Additional code here
Response.Write(sqlCmd.Parameters[“@output”].Value);
OLE DB Managed Provider Output Value:

param = oleDbCmd.Parameters.Add(“@output”,OleDbType.Integer);
param.Direction = ParameterDirection.Output;
// . . . . . . . . . . . .
Response.Write(oleDbCmd.Parameters[“@output”].Value);
If you need to access a return value from a stored procedure, use code similar to the following
code segments:
SQL Managed Provider Return Value:
306
param = sqlCmd.Parameters.Add(new SqlParameter(“RT_VAL”, SqlDbType.Int));

param.Direction = ParameterDirection.ReturnValue;
. . . . . . . . . . . .
Response.Write(sqlCmd.Parameters[“RT_VAL”].Value);
OLE DB Managed Provider Return Value:

param = oleDbCmd.Parameters.Add(“RETURN_VALUE”, OleDbType.Integer);
param.Direction = ParameterDirection.ReturnValue;
. . . . . . . . . . . .
Response.Write(oleDbCmd.Parameters[“RETURN_VALUE”].Value);
The SqlDataAdapter and OleDbDataAdapter Classes

The Command class discussed in the previous section is very useful for performing direct inserts,
updates, and deletes to a data source, and it can be used in combination with the DataReader
class to select and view data. However, nothing we’ve seen thus far allows us to work natively
with XML unless we use the DataReader to manually create the XML. Because ADO.NET
supposedly makes it easy to work with XML, what can we do? Enter the data adapter classes.
Each managed provider has its own data adapter class that can be used to work with data in a
disconnected and more message-based manner. The SQL managed provider uses an adapter
named SqlDataAdapter, whereas the OLE DB managed provider uses the OleDbDataAdapter.
For the sake of clarity, the next sections focus mainly on the SqlDataAdapter. However, both
adapters are intended to accomplish the same end result, so what you learn about one will be
applicable to the other in many cases.
On its own, the SqlDataAdapter class can’t work with XML, but it acts as a gateway to
accessing the DataSet class, which makes it easy to work with XML. The SqlDataAdapter
class also makes it possible to completely disconnect DataSets from the data source because it
takes care of all the interaction with the appropriate managed provider.
The SqlDataAdapter inherits from a base data adapter class named DataAdapter that provides
the bridge between a data source and a DataSet object. Here’s an example of using it to select
data using a SQL managed provider:
SqlDataAdapter sqlAdapter = new SqlDataAdapter(cmdString,dataConn);
The same thing can be accomplished with the OLE DB managed provider:
OleDbDataAdapter oleDbAdapter = new OleDbDataAdapter(cmdString,dataConn);
This code looks very similar to the code used to work with the command classes in Listings
8.1 and 8.2, doesn’t it? The different data adapter classes have constructors that accept the SQL
string to execute and the connection object. We won’t discuss them here, but several other con-
structors can be used, as well.
307
CHAPTER 8
The similarities between the command and data adapter classes stop when you take the
retrieved data and place it into a DataSet. Although the command classes have no methods for
filling a DataSet, the adapter classes expose the Fill() method. You’ll see this method in
action after the DataSet class has been defined more in the next section. The DataSet section
contains a discussion on how to work with data returned from a data source using XML.
The data adapter classes also allow you to define what stored procedures (or SQL string) to
use for doing inserts, updates, and deletes. This process occurs by assigning values to different
properties. After the properties have been set, you can call the Update() method on the appro-
priate data adapter class to execute the stored procedures or SQL strings. Any changes made to
the DataSet will then be updated in the data source, depending on what stored procedures you
specified. To see this in action, take a look at the following code. An example shown later in
the chapter shows how to use these in the context of a real application.
SqlDataAdapter sqlAdapter = new SqlDataAdapter(sql,conn);
// Insert Command
sqlAdapter.InsertCommand = new SqlCommand(“sp_InsertOrder”,conn);
sqlAdapter.InsertCommand.CommandType = CommandType.StoredProcedure;
// Update Command
sqlAdapter.UpdateCommand = new SqlCommand(“sp_UpdateOrder”,conn); 8
sqlAdapter.UpdateCommand.CommandType = CommandType.StoredProcedure;
XML FEATURES
// Delete Command
ADO.NET’S
LEVERAGING
sqlAdapter.DeleteCommand = new SqlCommand(“sp_DeleteOrder”,conn);
sqlAdapter.DeleteCommand.CommandType = CommandType.StoredProcedure;
After the stored procedures to use are established, the data adapter’s Update() method can
then be called. This will cause any records deleted from a DataSet to be deleted from the data
source. Likewise, any updates or inserts to the DataSet will also be updated or inserted, as
appropriate, into the data source.
Some of the stored procedures previously shown (sp_UpdateOrder, for instance) may require
that you pass in certain input parameters. For example, you may desire to update specific fields
in the data source rather than the entire row. Adding parameters to the appropriate insert,
update, or delete command can be accomplished by adding the parameter name, type and
value. The following code will set the @warehouse input parameter’s value:
sqlAdapter.UpdateCommand = new SqlCommand(“sp_UpdateOrder”,conn);
sqlAdapter.UpdateCommand.CommandType = CommandType.StoredProcedure;
sqlAdapter.UpdateCommand.Parameters.Add(new SqlParameter(“@warehouse”,
SqlDbType.NChar, 5));
sqlAdapter.UpdateCommand.Parameters[0].Value = “main”;
sqlAdapter.Update(DataSetObjectName);
Now that you’ve had a chance to get your feet wet with ADO.NET, let’s begin our discussion
on using ADO.NET and the DataSet class to work with XML data.
308
The DataSet Class

ADO.NET introduces a brand-new way of working with data in ASP.NET applications. Rather
than being connection based, a new messagelike mechanism has been developed that leverages
a class called the DataSet. Before learning more about the benefits that the DataSet class can
bring your ASP.NET applications, let’s look at a few limitations of the “classic” ADO
Recordset.
The RecordSet is one of the main objects used to work with data in ADO. Using a RecordSet,
data can be selected from a data source and placed into an ASP page or manipulated using fil-
ters or sorts. Although the RecordSet object is certainly very useful, it does have a few limita-
tions. One of these limitations (discussed earlier in the chapter) is linked to the difficulty of
passing RecordSet objects through firewalls. Although poking a hole in a firewall may be an
option within your organization (if you have an extremely lenient LAN manager), exchanging
Recordsets with other companies becomes next to impossible with the presence of many
potential firewalls.
Another limitation is that RecordSets (even disconnected RecordSets) are subject to COM
marshaling, which creates inefficiencies as data types in the RecordSet are converted to COM
data types. In this world of distributed systems, the capability to support a wide range of data
types is paramount to interoperate with other systems and applications. The RecordSet object
also relies on a connection-based model by default and lacks integrated support for working
with XML and XML schemas.
The DataSet class strives to remedy some of the limitations found in the Recordset and add
additional functionality to enable distributed applications to communicate more effectively. The
following list details some of the improvements found in the DataSet class:
• The DataSet is completely disconnected and has no knowledge of the data source. All
communication with the data source is done through the managed provider.
• The DataSet can easily be viewed as an XML document and queried using XPath. This
XML basis means that it can pass through firewalls by riding on top of the HTTP proto-
col or be integrated into Web services.
• The DataSet allows multiple tables to be added, along with relationships and constraints
between the tables.
• Because DataSets are XML aware, more robust data types can be described, and ineffi-
ciencies associated with COM marshaling are eliminated.
• DataSets can be mapped to XML schemas that can be used to create an initial structure.
To better understand how DataSets can be used in ASP.NET applications, let’s take a look at a
sample shown in Listing 8.5 that uses the data adapter classes to fill a DataSet with data from
SQL Server’s Northwind database.
309
CHAPTER 8
LISTING 8.5 Filling the DataSet Class with Data

3: <%@ Import Namespace=”System.Data.OleDb”%>
4:
7:
10: try {
11: DataSet dsOrders = new DataSet();
12: SqlConnection dataConn =
13: new SqlConnection(dbConnectString);
14: SqlDataAdapter sqlAdapter =
15: new SqlDataAdapter(cmdString,dataConn);
16: sqlAdapter.Fill(dsOrders,”Orders”);
17: // Loop through DataSet “Orders” table
18: Response.Write(“<table><tr><td><b>Order ID</b></td></tr>”);
19: foreach (DataRow Order in dsOrders.Tables[“Orders”].Rows) {
20: Response.Write(“<tr>”); 8
21: Response.Write(“<td>” + Order[“OrderId”].ToString() +
XML FEATURES
ADO.NET’S
LEVERAGING
22: “</td>”);
24: }
26: return “<p>Sql Server Data Connection Opened”;
27: }
30: }
31: }
32: }
33:
35:
38: try {
39: DataSet dsOrders = new DataSet();
40: OleDbConnection dataConn =
41: new OleDbConnection(dbConnectString);
42: OleDbDataAdapter oleDbAdapter =
43: new OleDbDataAdapter(cmdString,dataConn);
44: oleDbAdapter.Fill(dsOrders,”Orders”);
310

45: // Loop through DataSet “Orders” table
46: Response.Write(“<table><tr><td><b>Order ID</b></td></tr>”);
47: foreach (DataRow Order in dsOrders.Tables[“Orders”].Rows) {
49: Response.Write(“<td>” + Order[“OrderId”].ToString() +
50: “</td>”);
52: }
55: }
58: }
59: }
60: }
61:
63: string SqlconnStr = “server=localhost;uid=sa;pwd=;database=Northwind”;
64: string OleDbconnStr = @”Provider=SQLOLEDB.1;Data Source=(local);
65: uid=sa;pwd=;Initial Catalog=Northwind”;
66: string sql = “SELECT OrderID FROM Orders”;
67: Response.Write(“<b>SQL Managed Provider DataSet:</b>”);
68: SqlConnect sqlConn = new SqlConnect();
69: sqlConn.OpenConnection(Response,SqlconnStr,sql);
70:
71: Response.Write(“<p /><b>OleDb Managed Provider DataSet:</b>”);
72: OleDbConnect oleDbConn = new OleDbConnect();
73: oleDbConn.OpenConnection(Response,OleDbconnStr,sql);
74:
75: }
76: </script>
Listing 8.5 shows how both managed providers can be used to fill a DataSet object with data.
The discussion that follows will focus on the SQL managed provider code found in the
SqlConnect class.
When the ASP.NET page is first loaded, Page_Load is called, which takes care of instantiating
the SqlConnect class and calling its OpenConnection() (lines 68 and 69). This method accepts
the Response object, the connection string, and the SQL statement to execute against the data
source. Looking at line 11, you’ll notice that a DataSet class is instantiated and then filled
with the results of the SQL query. This is accomplished by calling the SqlDataAdapter’s
Fill() method (line 16).
311
CHAPTER 8
The Fill() method has several overrides. The one shown here passes the DataSet to fill and
the name of the table that is being filled in the DataSet (Orders in this example) as arguments.
Because only one table is being added to the DataSet, the second argument is optional.
Another version of the method allows the DataSet and table name to be passed as arguments
along with the record number to start with and the maximum number of records to include.
This allows you to say, “Fill the DataSet but start with row 25 and include the next 10 rows
that follow” by using this code:
sqlAdapter.Fill(dsOrders,25,10,”Orders”);
Viewing DataSets as XML

DataSets make it very easy to work with relational data and perform all types of manipula-
tions. However, many ASP.NET applications require data found within a DataSet to be con-
verted to XML so that it can be passed to an XML-aware application, transformed using
XSLT, or have some other action performed on it. Listing 8.6 shows one way of viewing
DataSet data as XML, but to make things more interesting, it fills a DataSet with two differ-
ent tables from the Northwind database. It then relates the two tables to each other based on
the CustomerID field. An explanation of the code follows. 8
XML FEATURES
LISTING 8.6 Filling a DataSet with Multiple Tables
ADO.NET’S
LEVERAGING
3: <%@ Import Namespace=”System.Xml”%>
4: <%@ Import Namespace=”System.Text”%>
5: <html>
8: public DataSet ReturnDataSet(string dbConnectString,
9: string table1,string table2) {
10: DataSet dsTables = new DataSet();
11: dsTables.DataSetName = “CustomersData”;
12: SqlConnection dataConn = new SqlConnection(dbConnectString);
13: SqlDataAdapter customersAdapter =
14: new SqlDataAdapter(table1,dataConn);
15: SqlDataAdapter ordersAdapter =
17: customersAdapter.Fill(dsTables,”Customers”);
18: ordersAdapter.Fill(dsTables,”Orders”);
19: dsTables.Relations.Add(“CustomerOrders”,
20: dsTables.Tables[“Customers”].Columns[“CustomerId”],
21: dsTables.Tables[“Orders”].Columns[“CustomerId”]);
312

22: return(dsTables);
23: }
24: }
26: string SqlconnStr = “server=localhost;uid=sa;pwd=;” +
27: “database=Northwind”;
28: string table1 = @”SELECT * FROM Customers WHERE CustomerID LIKE ‘h%’
29: ORDER BY ContactName”;
30: string table2 = “SELECT * FROM Orders WHERE CustomerID LIKE ‘h%’”;
31: StringBuilder output = new StringBuilder();
32: DataRow[] rows;
34: SqlConnect Sqlconn = new SqlConnect();
35: DataSet TablesDataSet = Sqlconn.ReturnDataSet(SqlconnStr,
36: table1,table2);
37:
38: foreach (DataRow Customer in TablesDataSet.Tables[“Customers”].Rows){
39: output.Append(“<div onClick=\”showDetails(‘“ +
40: Customer[“CustomerID”].ToString() + “‘)\”>”);
41: output.Append(“<br><span style=\”background: #02027a;” +
42: “width: 500px;”);
43: output.Append(“cursor: hand;\”>”);
44: output.Append(“<table width=\”100%\”><tr><td width=\”90%\”” +
45: “ style=\”color:#ffffff;font-weight:bold\”>” +
46: “Customer Name: “);
47: output.Append(Customer[“ContactName”].ToString() + “</td><td>”);
48: output.Append(“<img src=\”yellow_arrow_down2.gif\”></td>” +
49: “</tr></table></span>”);
50: output.Append(“</div>”);
51: output.Append(“<div style=\”width: 500px;display:none;” +
52: “background:#efefef\” id=’”);
53: output.Append(Customer[“CustomerID”].ToString() + “‘>”);
54: // Iterate over orders data
55: rows = Customer.GetChildRows(
56: TablesDataSet.Relations[“CustomerOrders”]);
57: if (rows.Length > 0) {
58: foreach (DataRow Order in rows) {
59: output.Append(“<br><b>Order:</b> #” +
60: Order[“OrderId”].ToString());
61: output.Append(“  <b>Shipping Address:</b>” +
62: Order[“ShipAddress”].ToString());
63: }
64: } else {
313
CHAPTER 8

65: output.Append(“<b>No Records</b>”);
66: }
67: output.Append(“</div>”);
68: }
69: content.InnerHtml = output.ToString();
70: XmlDataDocument dataDoc = new XmlDataDocument(TablesDataSet);
71: xml.Text = dataDoc.OuterXml;
72: }
73: </script>
74: <head>
75: <script language=”JavaScript”>
76: function showDetails(id) {
77: var loc = document.all(id);
78: if (loc.style.display == ‘none’) {
79: loc.style.display = ‘block’;
80: } else {
81: loc.style.display = ‘none’;
82: }
83:
84: } 8
85: </script>
XML FEATURES
ADO.NET’S
LEVERAGING
86: </head>
88: <h2>
89: <b>Walking Multiple DataSet Tables</b>
90: </h2>
91: <div id=”content” runat=”server” />
92: <p>
93:
94: </p>
95: <b>XML Data:</b>
96: <br />
97: <asp:TextBox ID=”xml” Runat=”server” Columns=”75” Rows=”15”
98: TextMode=”MultiLine” />
99: </body>
100: </html>
This example shows how you can walk relationships between tables. It also gives a quick
glimpse into how easy it is to view data held within a DataSet as XML.
When the page first loads, a class named SqlConnect is instantiated and its ReturnDataSet()
method is called (lines 34–36). The ReturnDataSet() method accepts a connection string and
314
two SQL queries as arguments. The method works by first creating a DataSet object named
dsTables (line 10). SqlConnection and SqlDataAdapter objects are then created and the
SqlDataAdapter objects are used to query the database and hold the results of data from two
SQL queries passed into the method (lines 13–16).
Two tables within the DataSet (dsTables) are then filled with data from the two data adapters
by calling each data adapter’s Fill() method (lines 17 and 18). This method has several over-
rides that can be used. When the two tables within the DataSet are filled, a relationship is
established between the tables, using the Relations property (lines 19–21). This associates
both tables based on their CustomerID fields and makes it possible to create a parent/child rela-
tionship within the DataSet. After all this has been done, the DataSet is returned to the caller
of the ReturnDataSet method.
When the filled DataSet is ready to use, each row of the Customers table (within the DataSet)
is looped through using a foreach loop (lines 38–68). Before moving to the next row, the cur-
rent CustomerID is used to find all the orders for the appropriate customer in the Orders table
of the DataSet. These rows are then looped through with an additional foreach loop (lines
58–63). The entire process adds data to a StringBuilder class named output that is written
out to the ASP.NET page on completion of the looping process.
Line 70 shows one potential way of viewing the data held within the DataSet as XML. It cre-
ates an XmlDataDocument class and passes in the DataSet (TablesDataSet in this example) to
its constructor. The XmlDataDocument class extends the XmlDocument class detailed in Chapter
6, “Programming the Document Object Model (DOM) with ASP.NET,” and provides a way to
work with data using the Document Object Model (DOM). This is a very powerful feature
because it means you can perform XPath queries or do XSLT transformations directly on data
received from a data source. You’ll see how to use this class in greater detail later in the
chapter.
The end result of the code in Listing 8.6 is a page that allows the end user to see what orders
each customer has placed without having to leave the page. A screen shot of this is shown in
Figure 8.1.
The next sample (Listing 8.7) follows along the lines of the previous example, except that it
uses one of the DataSet’s methods to access the DataSet as XML rather than creating an
XmlDataDocument object. The resulting XML document is placed into an XML Data Island in
the browser and queried on the client side using XPath.
315
CHAPTER 8
FIGURE 8.1.
Relating the Customers and Orders tables to create an ASP.NET page. 8
XML FEATURES
ADO.NET’S
LEVERAGING
LISTING 8.7 Viewing a DataSet as XML
6: public DataSet ReturnDataSet(string dbConnectString,string table1,
7: string table2,string key1, string key2) {
10: SqlDataAdapter dsCmdCustomers =
12: SqlDataAdapter dsCmdOrders =
14: dsCmdCustomers.Fill(dsTables,”Table1”);
15: dsCmdOrders.Fill(dsTables,”Table2”);
16: dsTables.Relations.Add(“Relation1”,
17: dsTables.Tables[“table1”].Columns[key1],
18: dsTables.Tables[“table2”].Columns[key2]);
316

20: }
21: }
22:
24: string xmlString;
25: string connStr = “server=localhost;uid=sa;pwd=;database=Northwind”;
26: string sCustomers = “SELECT * FROM Customers WHERE CustomerID” +
27: “ LIKE ‘f%’ ORDER BY ContactName”;
28: string sOrders = “SELECT * FROM Orders WHERE CustomerID LIKE ‘f%’”;
29:
31: DataSet TablesDataSet = Sqlconn.ReturnDataSet(connStr,sCustomers,
32: sOrders,”CustomerID”,”CustomerID”);
33: xmlString = TablesDataSet.GetXml();
34: XmlContents.InnerHtml = xmlString;
35: stats.InnerHtml = “<font color=\”#02027a\”><b>XML Data” +
36: “ Island Filled with DataSet Records</b></font>”;
37: xml.Text = xmlString;
38: }
39: </script>
40: <html>
41: <head>
42: <script language=”JavaScript”>
43: function showXml() {
44: var doc = document.all(“XmlContents”).XMLDocument;
45: output = “<p>Starting to gather information on client-side..”;
46: var node1 = doc.documentElement.selectNodes(“//Table1”);
47: var node2 = doc.documentElement.selectNodes(“//Table2”);
48: var stats = document.all(“stats”);
49: output += “<br>Information gathered! Totals follow:<br><hr>”;
50: output += “<br><b>Number of Customers: “ + node1.length;
51: output += “<br>Number of Orders: “ + node2.length; + “</b>”;
52: stats.innerHTML = stats.innerHTML + “<br>” + output;
53: }
54: </script>
55: </head>
56: <body onLoad=”showXml()”>
57: <XML id=”XmlContents” runat=”server” />
58: <div id=”stats” style=”width: 400px;” runat=”server” />
59: <p>
60:
61: </p>
62: <b>XML Data:</b>
63: <br />
64: <asp:TextBox ID=”xml” Runat=”server” Columns=”75” Rows=”15”
317
CHAPTER 8

65: TextMode=”MultiLine” NAME=”xml” />
66: </body>
67: </html>
This example shows another way that data within a DataSet can be viewed as XML. The
SqlConnect class functions much the same as the SqlConnect class shown in Listing 8.6. After
calling this class and getting a DataSet back, the DataSet’s GetXml() method is called and the
resulting XML data is placed within an XML Data Island in the ASP.NET page. Calling the
DataSet’s GetXml() method yields a string containing the XML schema and XML data
retrieved from the Northwind database tables. Alternatively, if you want only the XML
schema, you can call the GetXmlSchema() method.
After this data has been added to the ASP.NET page, client-side script takes care of executing
a few XPath statements to determine the total number of customers and orders contained
within the XML Data Island (lines 46 and 47). Doing this allows some of the processing bur-
den to be removed from the server and placed on the client. Although the example is very sim-
ple, the client-side code could bind tables to the XML Data Island to allow records to be
updated, deleted, or inserted. The data found within the XML Data Island could then be posted
8
XML FEATURES
back to the server and the original data store (or another) could be updated. You’ll see how
ADO.NET’S
LEVERAGING
XML data can be read into a DataSet later in the chapter. Figure 8.2 shows the results that the
code in Listing 8.7 returns.
FIGURE 8.2.
Passing data within a DataSet to the browser as XML.
318
Loading DataSets with XML

Aside from allowing relational or XML data views, the DataSet class can also be loaded with
XML data. As an ASP.NET programmer, you should certainly appreciate this feature because it
makes programming much simpler. For example, a DataSet can be loaded with data from an
XML document and then bound to an ASP.NET DataGrid WebControl. Doing this simplifies
development because the looping and manipulation of the data is taken care of for you by the
WebControl. The capability to load DataSets with data marked up as XML also allows data
coming in from a variety of applications or devices to be updated more easily into a data store.
Loading XML data from an XML document or stream into a DataSet can be accomplished by
using the DataSet’s ReadXml() method. This method has several overrides that allow a variety
of arguments, including a string or a stream. Listing 8.8 instantiates a StreamReader class and
passes it to the ReadXml() method.
After the DataSet is filled with the data from the XML file, a DataView object is created from
the DataSet and bound to a DataGrid WebControl. As an added feature, the DataGrid is set up
to allow the end user to sort based on a particular field.
LISTING 8.8 Binding a DataSet to a DataGrid

2: <%@ Import Namespace=”System.IO”%>
4: public class DSReadXml {
5: public static DataSet ReturnDataSet(string path) {
7: FileStream fs = null;
9: try {
10: fs = new FileStream(path,FileMode.Open,FileAccess.Read);
11: reader = new StreamReader(fs);
12: dsTables.ReadXml(reader);
13: return dsTables;
14: }
15: finally {
16: fs.Close();
17: reader.Close();
18: }
19: }
20: }
22: BindIt(“CustomerID”);
23: }
319
CHAPTER 8

24: public void OrdersDataGrid_Sort(Object Src,
25: DataGridSortCommandEventArgs sort) {
26: BindIt(sort.SortExpression);
27: }
28: public void BindIt(string field) {
29: DataSet ds = DSReadXml.ReturnDataSet(
30: Server.MapPath(“customerList.xml”));
31: DataView view = ds.Tables[0].DefaultView;
32: view.Sort = field;
33: OrdersDataGrid.DataSource = view;
34: OrdersDataGrid.DataBind();
35: }
36: </script>
37: <html>
39: <form runat=”server”>
40: <ASP:DataGrid id=”OrdersDataGrid” runat=”server”
41: Width=”700”
42: BackColor=”#E6E6CC”
43: BorderColor=”#000000” 8
44: ShowFooter=”false”
XML FEATURES
ADO.NET’S
LEVERAGING
45: CellPadding=”5”
46: CellSpacing=”0”
47: Font-Name=”Arial”
48: Font-Size=”8pt”
49: HeaderStyle-BackColor=”#6C0A00”
50: HeaderStyle-ForeColor=”#ffffff”
51: EnableViewState=”false”
52: OnSortCommand=”OrdersDataGrid_Sort”
53: AllowSorting=”true”
54: />
55: </form>
56: </body>
57: </html>
The ReadXml() method reads in an XML document’s schema and data. Line 12 shows how the
method can be used to read from a stream. To read in only the schema associated with an
XML document, the ReadXmlSchema() method can be called when working with the DataSet
class. The capability to dynamically create the structure of a DataSet class (including tables,
relationships, and so on) based on an XML schema opens up a new set of possibilities for
working with data in a disconnected manner.
320
Saving DataSets as XML

DataSets can also write the data they contain along with the XML schema that describes their
structure to a file for later use. This is accomplished by making a call to the WriteXml() or
WriteXmlSchema() methods. Listing 8.9 shows how these methods can be used to write a
DataSet’s XML data and schema information to a file.
LISTING 8.9 Writing a DataSet’s XML Data and Schema to a File

9: dsTables.DataSetName = “CustomersData”;
11: SqlDataAdapter customersAdapter =
13: SqlDataAdapter ordersAdapter =
15: customersAdapter.Fill(dsTables,”Customers”);
16: ordersAdapter.Fill(dsTables,”Orders”);
17: dsTables.Relations.Add(“CustomerOrders”,
21: }
22: }
24: string sqlConnStr = “server=localhost;uid=sa;pwd=;” +
26: string sqlCustomers = “SELECT * FROM Customers”;
27: string sqlOrders = “SELECT * FROM Orders”;
28: SqlConnect sqlConnect = new SqlConnect();
29: DataSet ds = sqlConnect.ReturnDataSet(sqlConnStr,
30: sqlCustomers,sqlOrders);
31:
32: //Save data and schema information to a file
33: ds.WriteXml(Server.MapPath(“Listing8.10.xml”),
34: XmlWriteMode.IgnoreSchema);
35: ds.WriteXmlSchema(Server.MapPath(“Listing8.11.xsd”));
36: Response.Write(“DataSet XML data and schema information saved.”);
37: }
38: </script>
321
CHAPTER 8
Listing 8.10 shows the results of calling the WriteXml() method on the DataSet just created.
LISTING 8.10 XML Data Generated by the DataSet’s WriteXml() Method

1: <?xml version=”1.0” standalone=”yes”?>
2: <CustomersData>
3: <Customers>
4: <CustomerID>ALFKI</CustomerID>
5: <CompanyName>Alfreds Futterkiste</CompanyName>
6: <ContactName>Maria Anders</ContactName>
7: <ContactTitle>Sales Representative</ContactTitle>
8: <Address>Obere Str. 57</Address>
9: <City>Berlin</City>
10: <PostalCode>12209</PostalCode>
11: <Country>Germany</Country>
12: <Phone>030-0074321</Phone>
13: <Fax>030-0076545</Fax>
14: </Customers>
15: <Customers>
16: <CustomerID>ANATR</CustomerID>
17: <CompanyName>Ana Trujillo Emparedados y helados</CompanyName> 8
18: <ContactName>Ana Trujillo</ContactName>
XML FEATURES
19: <ContactTitle>Owner</ContactTitle>
ADO.NET’S
LEVERAGING
20: <Address>Avda. de la Constitución 2222</Address>
21: <City>México D.F.</City>
22: <PostalCode>05021</PostalCode>
23: <Country>Mexico</Country>
24: <Phone>(5) 555-4729</Phone>
25: <Fax>(5) 555-3745</Fax>
26: </Customers>
27:
28: 
29:
30: <Orders>
31: <OrderID>11076</OrderID>
32: <CustomerID>BONAP</CustomerID>
33: <EmployeeID>4</EmployeeID>
34: <OrderDate>1998-05-06T07:00:00</OrderDate>
35: <RequiredDate>1998-06-03T07:00:00</RequiredDate>
36: <ShipVia>2</ShipVia>
37: <Freight>38.28</Freight>
38: <ShipName>Bon app’</ShipName>
39: <ShipAddress>12, rue des Bouchers</ShipAddress>
40: <ShipCity>Marseille</ShipCity>
41: <ShipPostalCode>13008</ShipPostalCode>
42: <ShipCountry>France</ShipCountry>
43: </Orders>
322

44: <Orders>
45: <OrderID>11077</OrderID>
46: <CustomerID>RATTC</CustomerID>
47: <EmployeeID>1</EmployeeID>
48: <OrderDate>1998-05-06T07:00:00</OrderDate>
49: <RequiredDate>1998-06-03T07:00:00</RequiredDate>
50: <ShipVia>2</ShipVia>
51: <Freight>8.53</Freight>
52: <ShipName>Rattlesnake Canyon Grocery</ShipName>
53: <ShipAddress>2817 Milton Dr.</ShipAddress>
54: <ShipCity>Albuquerque</ShipCity>
55: <ShipRegion>NM</ShipRegion>
56: <ShipPostalCode>87110</ShipPostalCode>
57: <ShipCountry>USA</ShipCountry>
58: </Orders>
59:
60: 
61:
62: </CustomersData>
Listing 8.11 shows what a schema generated by calling the DataSet’s WriteXmlSchema()
method looks like. This schema follows the standards found in the W3C schema specification
discussed in Chapter 4, “Understanding DTDs and XML Schemas”.
LISTING 8.11 XML Schema Generated by the DataSet’s WriteXmlSchema() Method

1: <?xml version=”1.0” standalone=”yes”?>
2: <xsd:schema id=”CustomersData”
3: targetNamespace=””
4: xmlns=””
5: xmlns:xsd=”http://www.w3.org/2001/XMLSchema”
6: xmlns:msdata=”urn:schemas-microsoft-com:xml-msdata”>
7: <xsd:element name=”CustomersData” msdata:IsDataSet=”true”>
9: <xsd:choice maxOccurs=”unbounded”>
10: <xsd:element name=”Customers”>
12: <xsd:sequence>
13: <xsd:element name=”CustomerID” type=”xsd:string”
15: <xsd:element name=”CompanyName” type=”xsd:string”
17: <xsd:element name=”ContactName” type=”xsd:string”
323
CHAPTER 8

19: <xsd:element name=”ContactTitle” type=”xsd:string”
21: <xsd:element name=”Address” type=”xsd:string”
23: <xsd:element name=”City” type=”xsd:string”
25: <xsd:element name=”Region” type=”xsd:string”
27: <xsd:element name=”PostalCode” type=”xsd:string”
29: <xsd:element name=”Country” type=”xsd:string”
31: <xsd:element name=”Phone” type=”xsd:string”
33: <xsd:element name=”Fax” type=”xsd:string”
35: </xsd:sequence>
37: </xsd:element> 8
38: <xsd:element name=”Orders”>
XML FEATURES
ADO.NET’S
LEVERAGING
40: <xsd:sequence>
41: <xsd:element name=”OrderID” type=”xsd:int”
43: <xsd:element name=”CustomerID” type=”xsd:string”
45: <xsd:element name=”EmployeeID” type=”xsd:int”
47: <xsd:element name=”OrderDate” type=”xsd:dateTime”
49: <xsd:element name=”RequiredDate” type=”xsd:dateTime”
51: <xsd:element name=”ShippedDate” type=”xsd:dateTime”
53: <xsd:element name=”ShipVia” type=”xsd:int”
55: <xsd:element name=”Freight” type=”xsd:decimal”
57: <xsd:element name=”ShipName” type=”xsd:string”
59: <xsd:element name=”ShipAddress” type=”xsd:string”
61: <xsd:element name=”ShipCity” type=”xsd:string”
324

63: <xsd:element name=”ShipRegion” type=”xsd:string”
65: <xsd:element name=”ShipPostalCode” type=”xsd:string”
67: <xsd:element name=”ShipCountry” type=”xsd:string”
69: </xsd:sequence>
71: </xsd:element>
72: </xsd:choice>
74: <xsd:unique name=”Constraint1”>
75: <xsd:selector xpath=”.//Customers” />
77: </xsd:unique>
78: <xsd:keyref name=”CustomerOrders” refer=”Constraint1”>
79: <xsd:selector xpath=”.//Orders” />
81: </xsd:keyref>
82: </xsd:element>
83: </xsd:schema>
Although the DataSet has many more features that haven’t been covered, you’ve been exposed
to a few of the different ways it can be used to work with relational and XML data formats.
The next section shows how the DataSet class can further be extended to work with XML data
using DOM properties and methods.
Working with the DataSet and

XmlDataDocument Classes
Up to this point in the chapter, you’ve been exposed to how data contained within a DataSet
can be accessed as XML data. You’ve also seen how XML data can be read into a DataSet and
written out to a file. Although the DataSet methods that allow for this type of functionality are
very useful, they don’t allow you to create or add new nodes into data using DOM properties
and methods.
To emulate this type of functionality, you could always load the XML from a DataSet into an
XmlDocument, perform the necessary manipulations, and then read the data back into the
DataSet using its ReadXml() method. Although this would certainly work, there’s a more effi-
cient way to manipulate data within a DataSet using DOM properties and methods normally
associated with the XmlDocument. In addition, data found in a DataSet can be queried using
325
CHAPTER 8
XPath statements and can be transformed using XSLT. How is this accomplished? The answer
lies with the XmlDataDocument class that you were briefly introduced to earlier. This class is
designed to work hand-in-hand with the DataSet class to allow you to manipulate data using
regular XML-style programming. It can be thought of as a DataSet-aware XmlDocument.
Figure 8.3 shows the relationship between the DataSet and XmlDataDocument classes in the
.NET framework.
XSL/T, X-Path Controls

Validation, etc Designers
Code-gen, etc
XmlDataDocument DataSet
Sync
8
XmlReader Managed Provider XmlReader
XML FEATURES
ADO.NET’S
XmlTextReader XmlNodeReader XmlTextReader XmlNodeReader
LEVERAGING
FIGURE 8.3.
Passing data within a DataSet to the browser as XML.
The most important concept shown in Figure 8.3 is that changes made to a DataSet are auto-
matically made in the corresponding XmlDataDocument as well. This automatic syncing
between the classes gives you the choice of working with data as XML or as a standard rela-
tional view! However, syncing between the XmlDataDocument and DataSet classes may not be
possible in cases where a node is added to the DOM tree that does not fit into a row found in
the DataSet’s relational structure. For complete syncing to occur, nodes added into the DOM
tree must match up with a row in the DataSet.
To see the XmlDataDocument in action, Listing 8.12 contains an example that utilizes code
shown earlier in Chapter 6. The code simply walks through the XML and displays it in the
browser.
LISTING 8.12 Using the XmlDataDocument Class

326

6: int indent = 1;
8:
9: public DataSet ReturnDataSet(string dbConnectString,string table1,
10: string table2,string key1,string key2) {
12: SqlConnection dataConn =
13: new SqlConnection(dbConnectString);
16: SqlDataAdapter dsCmdOrders = new SqlDataAdapter(table2,dataConn);
17: dsCmdCustomers.Fill(dsTables,”Table1”);
18: dsCmdOrders.Fill(dsTables,”Table2”);
20: dsTables.Tables[“Table1”].Columns[key1],
21: dsTables.Tables[“table2”].Columns[key2]);
22: return dsTables;
23: }
24: }
25: public class WriteXmlText {
26: private string m_Document = “”;
28: int indent = 1;
29:
30: public string ParseDoc(XmlDataDocument document) {
31: XmlNode oNode = document.DocumentElement;
32: writeNodeName(oNode,0);
33: XmlNodeList oNodeList = oNode.ChildNodes;
34: for (int i=0;i<oNodeList.Count;i++) {
35: XmlNode oCurrentNode = oNodeList[i];
37: writeNodeName(oCurrentNode,indent);
38: walkTheTree(oCurrentNode);
39: indent--;
40: } else {
42: }
43: }
44:
45: return output;
46: }
47:
48: private void walkTheTree(XmlNode oNodeToWalk) {
49: indent++;
327
CHAPTER 8

50: XmlNodeList oNodeList = oNodeToWalk.ChildNodes;
51: for (int j=0;j<oNodeList.Count;j++) {
52: XmlNode oCurrentNode = oNodeList[j];
53:
56: walkTheTree(oCurrentNode);
57: indent--;
58: } else {
60: }
61: }
62: }
63:
64: private void writeNodeName(XmlNode node,int iIndent) {
65: int h = 0;
66: for (int k=0;k<(iIndent * 10);k++) {
67: output += “ ”;
68: }
69: if (node.NodeType == XmlNodeType.Text) { // Text node 8
70: output += “<font color=’#ff0000’>” + node.Value +
XML FEATURES
ADO.NET’S
LEVERAGING
71: “</font><br>”;
72: } else {
73: if (node.Attributes.Count > 0) {
74: XmlNamedNodeMap oNamedNodeMap = node.Attributes;
75: output += “<b>” + node.Name + “</b> (“;
77: if (h!=0) output += “  ”;
78: h++;
79: output += “<i>” + att.Name + “</i>=\”” +
80: att.Value + “\””;
81: }
82: output += “)<br>\n\n”;
83: } else {
84: output += “<b>” + node.Name + “</b><br>\n\n”;
85: }
86: } // end if
87: } // writeNodeName
88: } //WriteXmlText
89:
91: String connStr = “server=localhost;uid=sa;pwd=;database=Northwind”;
92: String table1 = @”SELECT * FROM Customers WHERE CustomerID LIKE ‘d%’
93: ORDER BY ContactName”;
328

94: String table2 = “SELECT * FROM Orders WHERE CustomerID LIKE ‘d%’”;
95: String output = “”;
98: DataSet ds = Sqlconn.ReturnDataSet(connStr,table1,table2,
99: “CustomerID”,”CustomerID”);
100: ds.EnforceConstraints = false;
101: XmlDataDocument xmlDataDoc = new XmlDataDocument(ds);
102: xmlDataDoc.Normalize();
103: WriteXmlText writer = new WriteXmlText();
104: Response.Write(writer.ParseDoc(xmlDataDoc));
105: }
106: </script>
Much of what you’ve seen up to this point in the book takes place in the preceding sample.
A DataSet is filled with two tables using the SQL managed provider, and a relationship
between the tables is established. After this process occurs, an XmlDataDocument class is
instantiated and is passed the newly created DataSet named ds (line 101). This allows the data
within the DataSet to be treated like data found within an XmlDocument. To show this, the
XmlDataDocument (xmlDataDoc) is first normalized to organize the elements properly, and it
is then passed into the WriteXmlText class’s ParseDoc() method. This class loops through
all the child nodes and creates an HTML representation of the data contained within the
XmlDataDocument object. For more information on the code contained in the ParseDoc()
method, refer back to Chapter 6.
XmlDataDocument Properties and Methods

The properties and methods of the XmlDataDocument class are very similar to those found in
the XmlDocument class. This is because the XmlDataDocument extends the XmlDocument class.
Because the XmlDataDocument class has the additional requirement to be able to work with
data in a DataSet, it adds a few additional properties and methods not found in the
XmlDocument class. These additional properties and methods are listed in Table 8.1 and
Table 8.2, respectively.
TABLE 8.1 Additional Properties of the XmlDataDocument Class

DataSet Returns a DataSet object that can be used to access the data in a
relational manner with rows and columns. This property allows you
to easily transfer XML data into a DataSet.
329
CHAPTER 8
Table 8.2 Additional Methods of the XmlDataDocument Class

Method Description
GetElementFromRow Returns an XmlElement based on a given row in the XmlData
Document. Each child element of the returned element can then be
accessed using DOM properties and methods or XPath statements.
GetRowFromElement Returns a DataRow object based on a given XML element in the
XmlDataDocument.
Listing 8.13 provides a lot of code aimed at showing how the DataSet and XmlDataDocument
classes can be used interchangeably. The goal of the sample is to show you how both are
synced and to show several things you can do with each class. Although it does not show the
most efficient way to work with data (its goal is to show many facets of ADO.NET), it does
provide a good glimpse of the flexibility ADO.NET provides in moving XML data to and from
a data source.
LISTING 8.13 Working with XmlDataDocument and DataSet Classes

1: <%@ Import Namespace=”System.Data”%> 8
XML FEATURES
ADO.NET’S
LEVERAGING
4:
8: string table1,string table2,
9: string key1,string key2) {
14: SqlDataAdapter dsCmdOrders =
17: dsCmdCustomers.Fill(dsTables,”Customers”);
18: dsCmdOrders.Fill(dsTables,”Orders”);
20: dsTables.Tables[“Customers”].Columns[key1],
21: dsTables.Tables[“Orders”].Columns[key2]);
23: }
24: public int InsertDataSet(string dbConnectString,DataSet ds) {
330

25: string sql = @”INSERT INTO Customers (CustomerID,
26: CompanyName, ContactName,ContactTitle)
27: VALUES (@CustomerID,@CompanyName,
28: @ContactName,@ContactTitle)”;
29: SqlConnection dataConn = null;
30: SqlDataAdapter dsAdapter = null;
31: try {
33: dsAdapter = new SqlDataAdapter();
34: dsAdapter.InsertCommand = new SqlCommand(sql,dataConn);
35:
36: dsAdapter.InsertCommand.Parameters.Add(
37: new SqlParameter(“@CustomerID”,SqlDbType.NChar, 5));
38: dsAdapter.InsertCommand.Parameters[0].SourceColumn =
39: “CustomerID”;
40:
42: new SqlParameter(“@CompanyName”,SqlDbType.NVarChar,40));
44: “CompanyName”;
45:
47: new SqlParameter(“@ContactName”,SqlDbType.NVarChar,30));
49: “ContactName”;
50:
52: new SqlParameter(“@ContactTitle”,SqlDbType.NVarChar,30));
54: “ContactTitle”;
55:
57: dsAdapter.Update(ds,”Customers”);
58: }
60: return 1;
61: }
62: return 0;
63: }
64:
65: public int DeleteDataSet(string dbConnectString,DataSet ds,
66: string key) {
67: string sql = “DELETE FROM Customers WHERE CustomerID = ‘“ +
68: key + “‘“;
331
CHAPTER 8

69: SqlConnection dataConn = null;
70: SqlDataAdapter dsAdapter = null;
71: try {
73: dsAdapter = new SqlDataAdapter();
74: dsAdapter.DeleteCommand = new SqlCommand(sql,dataConn);
75: dsAdapter.Update(ds,”Customers”);
76: }
78: return 1;
79: }
80: return 0;
81: }
82: }
84: String connStr = “server=localhost;uid=sa;pwd=;” +
86: String customersSql = @”SELECT * FROM Customers
87: WHERE CustomerID LIKE ‘d%’
88: ORDER BY ContactName”; 8
89: String ordersSql = “SELECT * FROM Orders WHERE” +
XML FEATURES
ADO.NET’S
LEVERAGING
90: “ CustomerID LIKE ‘d%’”;
91: String output = “”;
92: //****** Save Schema to a File/Save XML data to a File
94: DataSet ds = Sqlconn.ReturnDataSet(connStr,customersSql,
95: ordersSql,
97: ds.WriteXmlSchema(Server.MapPath(“schema.xsd”));
98: ds.WriteXml(Server.MapPath(“customers.xml”),
99: XmlWriteMode.IgnoreSchema);
100:
101: //****** Load the schema.xsd file into an empty DataSet object
102: DataSet newDS = new DataSet();
103: newDS.ReadXmlSchema(Server.MapPath(“schema.xsd”));
104: newDS.ReadXml(Server.MapPath(“customers.xml”));
105: newDS.AcceptChanges();
106: XmlDataDocument xmlDataDoc = new XmlDataDocument(newDS);
107:
108: //****** Add a new row and fill 4 columns with data
109: DataRow row = newDS.Tables[“Customers”].NewRow();
110: row[“CustomerID”] = “DLWID”;
111: row[“CompanyName”] = “Tomorrows Learning”;
112: row[“ContactName”] = “Dan Wahlin”;
332

113: row[“ContactTitle”] = “Programmer/Author/Speaker”;
114:
115: //****** Let’s see how well the schema validates
116: try {
117: row[“test”] = “Testing Schema”;
118: }
119: catch (Exception exc) {
120: //column[“test”] doesn’t really exist
121: }
122: //****** Add new row to DataSet
123: newDS.Tables[“Customers”].Rows.Add(row);
124:
125: //****** Get changes to DataSet
126: DataSet changesDS = newDS.GetChanges();
127: //divChanges.InnerHtml = “<xmp>” + changesDS.GetXml() + “</xmp>”;
128:
129: //****** Show that the DataSet and XmlDataDocument are synced
130: XmlNode node = xmlDataDoc.SelectSingleNode
➥(“//CustomerID[.=’DLWID’]”);
131: if (node != null) {
132: Response.Write(“<b>XmlDataDocument Synced with” +
133: “ DataSet!</b><br />”);
134: Response.Write(“Newly added CustomerID in XmlDataDocument: “ +
135: node.InnerText);
136: }
137:
138: //****** Use the XmlDataDocument’s GetElementFromRow() Method
139: XmlElement customer = xmlDataDoc.GetElementFromRow(
140: newDS.Tables[“Customers”].Rows[2]);
141: Response.Write(“<br />CompanyName: “ +
142: customer.SelectSingleNode
➥(“CompanyName”).InnerText);
143:
144: BeforeDataGrid.DataSource =
145: newDS.Tables[“Customers”].DefaultView;
146: BeforeDataGrid.DataBind();
147:
148: //****** Insert the Record in the Database
149: Sqlconn.InsertDataSet(connStr,newDS);
150:
151: ds = Sqlconn.ReturnDataSet(connStr,customersSql,ordersSql,
153: AfterDataGrid.DataSource = ds.Tables[“Customers”].DefaultView;
154: AfterDataGrid.DataBind();
155:
333
CHAPTER 8

156: //****** Delete newly added ID from database
157: //****** Load DataSet into XmlDataDocument and use it to
158: //****** pass the DataSet to the DeleteDataSet() method
159:
161: xmlDataDoc = new XmlDataDocument(ds);
162: XmlNode root = xmlDataDoc.DocumentElement;
163: string xpath = “//Customers[CustomerID=’DLWID’]”;
164: XmlNode newNode = xmlDataDoc.SelectSingleNode(xpath);
165: root.RemoveChild(newNode);
166: Sqlconn.DeleteDataSet(connStr,xmlDataDoc.DataSet,”DLWID”);
167:
168: ds = Sqlconn.ReturnDataSet(connStr,customersSql,
169: ordersSql,”CustomerID”,”CustomerID”);
170: AfterDeleteDataGrid.DataSource =
171: ds.Tables[“Customers”].DefaultView;
172: AfterDeleteDataGrid.DataBind();
173:
174:
175: } 8
176: </script>
XML FEATURES
ADO.NET’S
LEVERAGING
177: <html>
179: <div id=”divChanges” runat=”server” />
180: <h2>
181: Data Before Submitting Insert to DataBase
182: </h2>
183: <ASP:DataGrid id=”BeforeDataGrid” runat=”server”
184: Width=”700” BackColor=”#E6E6CC” BorderColor=”#000000”
185: ShowFooter=”false” CellPadding=5 CellSpacing=”0”
186: Font-Name=”Arial” Font-Size=”8pt”
188: HeaderStyle-ForeColor=”#ffffff” EnableViewState=”false”
189: />
190: <br>
191: <h2>
192: Data After Being Submitted to DataBase
193: </h2>
194: <ASP:DataGrid id=”AfterDataGrid” runat=”server”
199: HeaderStyle-ForeColor=”#ffffff” EnableViewState=”false”
200: />
334

201: <h2>
202: Data After Deleting Single Row From DataBase
203: </h2>
204: <ASP:DataGrid id=”AfterDeleteDataGrid” runat=”server”
209: HeaderStyle-ForeColor=”#ffffff” EnableViewState=”false” />
210: </body>
211: </html>
Execution of this code results in three different DataGrid WebControls being displayed in the
browser, as shown in Figures 8.4, 8.5, and 8.6.
FIGURE 8.4.
State of the data after adding a single row but before being inserted into the database.
The example uses one main class (SqlConnect) to select, insert, and delete data from the
Northwind database. The code starts by calling this class’s ReturnDataSet() method, which
fills a DataSet with data. After filling the DataSet, the XML schema and XML data are writ-
ten out to two files named schema.xsd and customers.xml. From here, a new DataSet (named
newDS) is created and the data that was saved previously to the two files is read back in (lines
103 and 104). In line 105, the DataSet’s AcceptChanges() method is called so that the rows
read in from the customers.xml file aren’t treated as new rows. This is necessary to prevent
the rows from being reinserted into the database later in the code.
335
CHAPTER 8
FIGURE 8.5.
State of the data after inserting a single row into the database.
XML FEATURES
ADO.NET’S
LEVERAGING
FIGURE 8.6.
State of the data after deleting a single row from the database.
Line 106 shows the creation of an XmlDataDocument object named xmlDataDoc. This object is
associated with newDS (the DataSet created earlier). Later in the code, you’ll see that
xmlDataDoc will automatically be synced with newDS.
Lines 108–126 take care of adding a new row to newDS. While the row is being added, a simple
check is done to ensure that the DataSet’s schema is indeed validating individual column
entries (lines 116–121). The code attempts to enter data into a dummy column named “test”
as shown next:
116: try {
336
118: }
121: }
If you remove the try…catch block, you’ll see that the DataSet’s schema does ensure that
the column exists before allowing any entries. It will not allow bogus columns or data to be
entered into the DataSet. This validation offers the advantage of being able to check that data
is correct before any attempt to interact with the data source is made.
After this new row is added, the changes to the DataSet can be viewed by calling the
GetChanges() method. This method returns a DataSet containing the changed records. If the
changes need to be viewed as XML, the DataSet’s GetXml() method can be called.
At this point, the new row now exists in the DataSet. Before the record is updated in the data-
base, the XPathDocument (named xmlDataDoc) created earlier runs a few checks on the data it
contains using XPath statements. The code for this is shown next:
129: //****** Show that the DataSet and XmlDataDocument are synced
130: XmlNode node = xmlDataDoc.SelectSingleNode
➥(“//CustomerID[.=’DLWID’]”);
131: if (node != null) {
132: Response.Write(“<b>XmlDataDocument Synced with” +
133: “ DataSet!</b><br />”);
134: Response.Write(“Newly added CustomerID in XmlDataDocument: “ +
135: node.InnerText);
136: }
The newly added row (a node in the XmlDataDocument) is selected by using the
SelectSingleNode() method (line 130). The XPath statement instructs this method to find the
CustomerID node that is equal to “DLWID.” By running the code, you’ll see that this node is
found and its value is written out to the page. This shows that that the DataSet and
XmlDataDocument classes are synced for you automatically without any intervention on your
part!
After it has been verified that the DataSet and XmlDataDocument have been synced, a call is
made to the XmlDataDocument’s GetElementFromRow() method (line 138).
138: //****** Use the XmlDataDocument’s GetElementFromRow() Method
139: XmlElement customer = xmlDataDoc.GetElementFromRow(
140: newDS.Tables[“Customers”].Rows[2]);
141: Response.Write(“<br />CompanyName: “ +
142:
customer.SelectSingleNode(“CompanyName”).InnerText);
This method will find a specific node in the DOM structure based on a row in the DataSet.
Row numbering in DataSets starts with 0, and because only two rows initially existed in
337
CHAPTER 8
newDS, the newly added row’s position will be 2. Instead of hard-coding this value, the last row
in the DataSet could be found programmatically, as well.
Now that the new row has been added to newDS and synchronization between it and
xmlDataDoc has been verified, the DataSet can be bound to a DataGrid in the ASP.NET page
and then updated in the database:
144: BeforeDataGrid.DataSource =
145: newDS.Tables[“Customers”].DefaultView;
147:
148: //****** Insert the Record in the Database
149: Sqlconn.InsertDataSet(connStr,newDS);
Because the DataSet knows nothing about where the data came from, the code calls the
SqlConnect class’s InsertDataSet() method (line 149). This method creates a
SqlDataAdapter object and specifies what Insert statement to use to move the newly added
row in the DataSet to the Northwind database (lines 29–62). The wonderful thing about speci-
fying what command to use (insert, delete, or update), is that the SqlDataAdapter takes care of
executing the command automatically. Had the code inserted 10 rows, all these changes would
have been moved to the Northwind database automatically. Talk about your time-savers! 8
XML FEATURES
Continuing with the example in Listing 8.13, after the insert has been completed, a call is
ADO.NET’S
LEVERAGING
again made to the SqlConnect class’s ReturnDataSet() method, and the resulting DataSet is
bound to another DataGrid (lines 151–154). This is done to show that the new row was indeed
inserted into the Northwind database.
151: ds = Sqlconn.ReturnDataSet(connStr,customersSql,ordersSql,
153: AfterDataGrid.DataSource = ds.Tables[“Customers”].DefaultView;
154: AfterDataGrid.DataBind();
The next section of code (lines 156–172) loads the DataSet (named ds) into a new
XmlDataDocument. The code uses the XmlDataDocument’s SelectSingleNode() and
RemoveChild() methods to find and remove the node having a value of “DLWID.” After the
node has been removed, the SqlConnect’s DeleteDataSet() method is called, which accepts
the DataSet containing the row(s) to be deleted as its second argument. To accommodate this,
the original DataSet (ds) could have been passed in because the XmlDataDocument node dele-
tion is synced. However, line 166 shows how a DataSet can be passed directly using the
XmlDataDocument’s DataSet property:
156: //****** Delete newly added ID from database

157: //****** Load DataSet into XmlDataDocument and use it to
158: //****** pass the DataSet to the DeleteDataSet() method
159:
338

161: xmlDataDoc = new XmlDataDocument(ds);
162: XmlNode root = xmlDataDoc.DocumentElement;
163: string xpath = “//Customers[CustomerID=’DLWID’]”;
164: XmlNode newNode = xmlDataDoc.SelectSingleNode(xpath);
165: root.RemoveChild(newNode);
166: Sqlconn.DeleteDataSet(connStr,xmlDataDoc.DataSet,”DLWID”);
167:
168: ds = Sqlconn.ReturnDataSet(connStr,customersSql,
169: ordersSql,”CustomerID”,”CustomerID”);
170: AfterDeleteDataGrid.DataSource =
171: ds.Tables[“Customers”].DefaultView;
172: AfterDeleteDataGrid.DataBind();
The code in Listing 8.13 has shown how the DataSet and XmlDataDocument classes can make
it very easy to view data both in a relational fashion and as XML. Data retrieved from a data-
base can be queried via XPath statements, and data from an XML document can be treated as
relational data. The XmlDataDocument and DataSet classes also allow XML documents coming
from any source or location to be added to a different data source without requiring the XML
document to be manually parsed. As long as the document follows the rules outlined in the
DataSet schema, these two classes can use it to update a database or any other source.
It’s important to reemphasize that this code does not show the most efficient way to work with
data in your ASP.NET applications. It’s unlikely that you would use both the DataSet and
XmlDataDocument classes in every data-centric ASP.NET application. The example was
designed to make you aware of the key features offered by ADO.NET so that you can use the
features appropriately.
Shaping DataSet Columns with the

MappingType Enumeration
DataSet columns do not always have to be treated as XML elements when they are written out
to a file or manipulated in an XmlDataDocument. After all, what if you need some columns
within a DataSet to be treated as attributes rather than as elements for a specific ASP.NET
application?
The type of XML node that a given DataColumn found within a DataSet is associated with can
be programmatically assigned by using the DataSet’s ColumnMapping property along with the
MappingType enumeration. By using this property, a more custom XML structure can be gener-
ated when the GetXml() or WriteXml() methods are called or when the DataSet is used to
construct an XmlDataDocument object.
339
CHAPTER 8
The ColumnMapping property can be set equal to one of four MappingType enumeration values,
depending on what XML structure you desire. The different enumerations are listed in Table 8.3.
TABLE 8.3 MappingType Enumeration Values

Value Description
Attribute The field is mapped to an XML attribute.
Element The field is mapped to an XML element. This is the default value.
Hidden The field is mapped to an internal structure. The field name and
value will not be added in the XML.
SimpleContent The field is mapped to an XML text node.
Listing 8.14 shows how a table can be loaded with a few rows of data. The code allows most
columns to default to XML elements but programmatically sets one column (named
CustomerID) to be an attribute.
LISTING 8.14 Mapping a Field to an XML Attribute

8
XML FEATURES
ADO.NET’S
LEVERAGING
7: DataSet ds = new DataSet();
8: DataTable tblCustomers = new DataTable(“Customers”);
9:
10: DataColumn customerID = new DataColumn(“CustomerID”);
11: customerID.DataType = System.Type.GetType(“System.String”);
12: customerID.ColumnMapping = MappingType.Attribute;
13: customerID.Unique = true;
14: tblCustomers.Columns.Add(customerID);
15:
16: DataColumn companyName = new DataColumn(“CompanyName”);
17: companyName.DataType = System.Type.GetType(“System.String”);
18: companyName.Unique = false;
19: tblCustomers.Columns.Add(companyName);
20:
21: DataColumn contactName = new DataColumn(“ContactName”);
22: contactName.DataType = System.Type.GetType(“System.String”);
23: contactName.Unique = false;
24: tblCustomers.Columns.Add(contactName);
25:
340

26: DataColumn contactTitle = new DataColumn(“ContactTitle”);
27: contactTitle.DataType = System.Type.GetType(“System.String”);
28: contactTitle.Unique = false;
29: tblCustomers.Columns.Add(contactTitle);
30:
31: ds.Tables.Add(tblCustomers);
32:
33: DataRow row = ds.Tables[“Customers”].NewRow();
35: row[“CompanyName”] = “Wahlin Consulting”;
37: row[“ContactTitle”] = “Programmer/Author”;
38:
40: try {
42: }
45: }
46: ds.Tables[“Customers”].Rows.Add(row);
47: // Code to actually insert the row goes here.....
48:
49: BeforeDataGrid.DataSource = ds.Tables[“Customers”].DefaultView;
51: xmlText.Value = ds.GetXml();
52: }
53: </script>
54: <html>
56: <h2>Data Loaded into a DataSet and Validated by a Schema</h2>
58: Width=”700”
60: BorderColor=”#000000”
62: CellPadding=5
69: />
341
CHAPTER 8

70: <p>
71: <h2>XML Generated by DataSet</h2>
72: <TextArea rows=”25” cols=”80” id=”xmlText” runat=”server”>
➥</TextArea>
73: </body>
74: </html>
On executing the preceding code, the DataSet emits the following XML. As shown next, the
CustomerID field’s data is contained within an attribute:
<NewDataSet>
<Customers CustomerID=”DLWID”>
<CompanyName>Wahlin Consulting</CompanyName>
<ContactName>Dan Wahlin</ContactName>
<ContactTitle>Programmer/Author</ContactTitle>
</Customers>
</NewDataSet>
Mapping XSD Schemas to a DataSet 8

Listing 8.13 showed how the DataSet class’s ReadXmlSchema() can be used to load XML
XML FEATURES
ADO.NET’S
LEVERAGING
schemas. Why would you want to load an XML schema in the first place? One potential
answer is that it can save you a lot of work!
When a DataSet is first created, it is completely empty. To be able to add rows of data to it,
tables with columns must be programmatically added first, as was shown earlier in Listing
8.14. By loading a schema from a file, tables and columns are loaded for you along with their
appropriate data types. Also, any relationships specified between tables are also loaded for you.
This means that you can easily add new records and then persist them to a data source at a
later time. It also means that a schema can be used in different capacities. It can be used to val-
idate an XML document or to create the shell structure for a DataSet table (or tables). Listing
8.15 shows how an XML schema can be used to provide the shell structure for a DataSet
using the DataSet’s ReadXmlSchema(). After this structure is created, rows of data can be
added to the table.
LISTING 8.15 Mapping an XSD Schema to a DataSet

5: <html>
342

9: //**** Load the Schema File
10: ds.ReadXmlSchema(new StreamReader(Server.MapPath(“schema.xsd”)));
11: //**** Add Row to the Empty DataSet
17:
19: try {
21: }
23: //Don’t do the insert because column[“test”]
24: //doesn’t really exist
25: }
27: // Code to actually insert the row into a data source
28: // would go here.....
29:
32: }
33: </script>
34: </head>
37: <h2>Data Loaded into a DataSet and Validated by a Schema</h2>
39: Width=”700”
43: CellPadding=5
50: />
343
CHAPTER 8

51: </form>
52: </body>
53: </html>
Line 10 takes care of creating the structure that the DataSet should follow for you. After the
structure has been established using the schema, rows can be added and saved to a remote data
source. In cases where an XML document containing the new records needs to be returned to
the end user’s application, the DataSet can return an XML document as shown earlier. This is
a very powerful feature that can be leveraged when working with Web services. We’ll discuss
these more in Chapter 10, “Working with ASP.NET, XML, SOAP, and Web Services.”
At times, you may need to use XML data contained within attributes rather than elements. For
example, you may want to use the XML schema shown next to create the correct DataSet
structure for a given ASP.NET application.
<xsd:schema xmlns:msdata=”urn:schemas-microsoft-com:xml-msdata”
xmlns:xsd=”http://www.w3.org/2001/XMLSchema” id=”Xml”>
<xsd:element name=”Customers”>
<xsd:complexType>
8
<xsd:attribute name=”CustomerID” type=”xsd:string” />
XML FEATURES
ADO.NET’S
LEVERAGING
<xsd:attribute name=”CompanyName” type=”xsd:string” />
<xsd:attribute name=”ContactName” type=”xsd:string” />
<xsd:attribute name=”ContactTitle” type=”xsd:string” />
<xsd:attribute name=”Address” type=”xsd:string” />
<xsd:attribute name=”City” type=”xsd:string” />
<xsd:attribute name=”PostalCode” type=”xsd:string” />
<xsd:attribute name=”Country” type=”xsd:string” />
<xsd:attribute name=”Phone” type=”xsd:string” />
<xsd:attribute name=”Fax” type=”xsd:string” />
</xsd:complexType>
</xsd:element>
</xsd:schema>
Fortunately, ADO.NET DataSets allow this schema to be loaded using the ReadXmlSchema()
method shown earlier. After the schema is loaded, new rows can be added like normal. Calls
made to the DataSet’s GetXml() or WriteXml() methods will output an XML document with
the newly added data being contained within attributes rather than elements.
Creating DataSet Mappings Using XML

Some situations may occur in which a schema is not available to use in creating a DataSet’s
structure. In these situations, a schema mapping between an existing XML file and the
344
DataSet can be used to “infer” what the table structure of the DataSet should look like. The
process allows the XML document that contains the proper structure to be read through by the
DataSet. The DataSet then creates a representative XML schema dynamically. Listing 8.16
shows an example of this. Line 10 contains the code used to load the XML document and then
infer its schema for use within the DataSet.
LISTING 8.16 Inferring a DataSet Schema from an XML Document

8:
9: //**** Infer the Schema from the customers.xml file
10: ds.InferXmlSchema(Server.MapPath(“customers.xml”),null);
11:
12: //**** Add a Table and Row to the Empty DataSet
13: ds.Clear(); //Make sure all rows are empty from inference above
19:
21: try {
23: }
26: }
28: // Code to actually insert the row into a data source
29:
32: }
33: </script>
34: <html>
37: <h2>
345
CHAPTER 8
LISTING 8.16 Inferring a DataSet Schema from an XML Document

38: Data Loaded into a DataSet and Validated by a Schema
39: </h2>
45: HeaderStyle-ForeColor=”#ffffff” EnableViewState=”false” />
46: </form>
47: </body>
48: </html>
The InferXmlSchema() method has several overrides that can accept a file path, Stream,
TextWriter, or XmlReader. Each of these overrides accepts an array of namespace URI strings
that should be excluded when the DataSet is inferring the XML schema from an XML docu-
ment. This allows virtually any XML document to be used as a “model” for the DataSet’s
schema, even if it contains undesirable namespaces.
8
Using DataSets to Work with Hierarchical XML
XML FEATURES
ADO.NET’S
LEVERAGING
Data and XSLT
Up to this point, all data returned from a database and viewed as XML has been kept in sepa-
rate XML structures. For example, when the Customers and Orders tables were queried in the
Northwind database, all the XML data was kept in two element node-sets as shown next:
<Customers>
<CustomerID>DUMON</CustomerID>
<CompanyName>Du monde entier</CompanyName>
<ContactName>Janine Labrune</ContactName>
<ContactTitle>Owner</ContactTitle>
<Address>67, rue des Cinquante Otages</Address>
<City>Nantes</City>
<PostalCode>44000</PostalCode>
<Country>France</Country>
<Phone>40.67.88.88</Phone>
<Fax>40.67.89.89</Fax>
</Customers>
<Orders>
<OrderID>11036</OrderID>
<CustomerID>DRACD</CustomerID>
<EmployeeID>8</EmployeeID>
<OrderDate>1998-04-20T07:00:00</OrderDate>
<RequiredDate>1998-05-18T07:00:00</RequiredDate>
346
<ShippedDate>1998-04-22T07:00:00</ShippedDate>
<ShipVia>3</ShipVia>
<Freight>149.47</Freight>
<ShipName>Drachenblut Delikatessen</ShipName>
<ShipAddress>Walserweg 21</ShipAddress>
<ShipCity>Aachen</ShipCity>
<ShipPostalCode>52066</ShipPostalCode>
<ShipCountry>Germany</ShipCountry>
</Orders>
Although you’ve seen how to create relationships between the Customers and Orders tables
through XML schemas, no examples have shown how to nest the orders under the appropriate
customer that placed them. The capability to represent hierarchical relationships can be very
useful, depending on the application. Listing 8.17 revisits the example shown in Listing 8.6 but
uses hierarchical relationships along with XSLT to output the different orders placed by a par-
ticular customer.
LISTING 8.17 Creating Hierarchical Relationships in a DataSet

4: <%@ Import Namespace=”System.Xml.Xsl”%>
6: <html>
15: SqlDataAdapter dsCmdOrders = new SqlDataAdapter(table2,dataConn);
16: dsCmdCustomers.Fill(dsTables,”Customers”);
17: dsCmdOrders.Fill(dsTables,”Orders”);
18: DataRelation rel = new DataRelation(“CustomerOrders”,
21: rel.Nested = true;
22: dsTables.Relations.Add(rel);
24: }
25: }
26:
347
CHAPTER 8

28: string SqlconnStr = “server=localhost;uid=sa;” +
29: “pwd=;database=Northwind”;
30: string customerSql = “SELECT * FROM Customers WHERE CustomerID” +
31: “ LIKE ‘A%’ ORDER BY ContactName”;
32: string ordersSql = “SELECT * FROM Orders WHERE CustomerID LIKE ‘A%’”;
35: XmlTextWriter writer = new XmlTextWriter(sw);
38: DataSet ds = Sqlconn.ReturnDataSet(SqlconnStr,customerSql,ordersSql);
39: XslTransform xsl = new XslTransform();
40: xsl.Load(Server.MapPath(“customerOrders.xsl”));
41: xsl.Transform(new XmlDataDocument(ds),null,writer);
42: content.InnerHtml = sb.ToString();
43: }
44: </script>
45: <head>
46: <script language=”JavaScript”> 8
47: function showDetails(id) {
XML FEATURES
ADO.NET’S
LEVERAGING
48: var loc = document.all(id);
49: if (loc.style.display == ‘none’) {
50: loc.style.display = ‘block’;
51: } else {
52: loc.style.display = ‘none’;
53: }
54:
55: }
56: </script>
57: </head>
59: <h2>
60: <b>Walking Hierarchical XML in DataSets with XSLT</b>
61: </h2>
62: <div id=”content” runat=”server” />
63: </body>
64: </html>
Creating hierarchical relationships is as simple as using the Nested property of the Relation
class. This property can be set to a Boolean value with false being the default. Using this
property is shown in line 21. By setting the property to true, all parent/child relationships will
be nested as appropriate when the DataSet is viewed as XML.
348
In this example, all orders associated with a particular customer are automatically shown as
child elements in the XML document. This makes it easy to use XSLT to match each order and
write out the necessary data. Use of the Nested property is not restricted to just one relation-
ship. Each Relation object that is instantiated can have its Nested property set to true if nested
elements are desired. This means that orders can be placed under the appropriate customer that
placed the order, and the different products associated with each order can be placed under the
order.
This example also shows how a DataSet can be transformed into another structure such as
HTML by using the XmlDataDocument class and XSLT. Performing the transformation is as
simple as passing the DataSet into the XmlDataDocument’s constructor. The resulting object is
then passed into the XslTransform class’s Transform() method (line 41). This powerful fea-
ture allows you to transform data retrieved from virtually any data store into a specific struc-
ture via XSLT in your ASP.NET applications.
Summary
ADO.NET offers a very robust set of classes for working with data located in a variety of data
stores. Using the different ADO.NET managed providers, data found in an Oracle, Sybase,
Microsoft, or other databases can be accessed and even integrated into a single DataSet.
ADO.NET also allows much of the work involved with inserting, updating, or deleting to be
done while being completely disconnected from the data source. This is a big improvement
over “classic” ADO, in which disconnected RecordSets had to be explicitly created to be used
and were subject to COM marshaling.
Through using the DataSet and XmlDataDocument classes, data can be viewed in a relational
manner or as XML, and data contained within a DataSet can be moved to an
XmlDataDocument and queried using XPath statements. This opens up a whole new world of
opportunities for the creation of dynamic ASP.NET applications, especially if the applications
leverage XML. Many other features are associated with ADO.NET that can’t possibly be cov-
ered in this chapter alone. So many, in fact, that an entire book could be written on the topic.
Now that you’re familiar with how to access data using ADO.NET, the next chapter shows you
how to utilize SQL Server 2000’s native XML support to retrieve and update data. Combining
ASP.NET with SQL Server 2000 offers many exciting opportunities!
SQL Server 2000, XML, CHAPTER
9
and ASP.NET
IN THIS CHAPTER
• XML Features in SQL Server 2000 350
• Querying SQL Server 2000 Using HTTP 351
• Querying SQL Server 2000 Through HTTP
Using Templates, XPath, and XDR
Schemas 368
• Using EXPLICIT Mode Queries 377

• Using OPENXML to Manipulate XML 385
• XML Updategrams—Update, Insert, and Delete
Database Records with XML Using
ASP.NET 389
• Using ADO.NET with SQL Server 2000 394

350
XML Features in SQL Server 2000

In the previous chapter on ADO.NET, you saw how easy it is to view data both in a relational
manner and as XML using the DataSet and XmlDataDocument classes. Because DataSets can
contain multiple tables, establishing relationships between tables can be accomplished with just
a few lines of code. Creating custom XML structures based on these tables can be accom-
plished by using properties such as the Nested property found on the DataRelation class or by
using XML schemas. Although these features are very powerful, this chapter examines other
ways you can work with relational data and XML in ASP.NET applications using SQL Server
2000 on its own and in conjunction with ADO.NET.
SQL Server 2000 comes with a host of new XML-related features that allow for a variety of
tasks to be performed on relational and XML data. It allows relational data to be described and
structured as XML in a customized fashion. This means that you can position each order
placed by a customer under the appropriate customer’s data. Aside from the capability to gen-
erate custom XML structures, SQL Server 2000 offers many more XML features that can sim-
plify working with even the most complex XML documents. The features that will be covered
in this chapter include the following:
• XML data generated by SQL Server 2000 can be retrieved through HTTP by using
queries passed in the QueryString. XML data can also be returned through calling XML
templates that are stored on the Web server.
• XML data can be retrieved using a SELECT command and the FOR XML keywords
from an ASP.NET page, by calling a stored procedure, or by using XPath queries.
• SQL Server 2000 contains full support for XML Data-Reduced (XDR) schemas (at the
time of publication, beta support for XSD schemas was emerging as well). This schema
support includes the capability to map various XML elements and attributes to tables and
fields.
• XML documents can be “shredded” to perform inserts by using the OPENXML rowset
provider.
• Updates, inserts, and deletes can be accomplished by sending the database specially for-
matted XML documents referred to as UpdateGrams.
The sections that follow discuss these features and show how you can leverage them to build
powerful and robust ASP.NET applications. Although samples of integrating XML data
retrieved from SQL Server 2000 into ASP.NET pages will be shown, the focus of the chapter
will be on the different techniques that can be used to access relational data as XML.
SQL Server 2000, XML, and ASP.NET
351
CHAPTER 9
Querying SQL Server 2000 Using http

Although ADO.NET allows for accessing XML data quite easily, wouldn’t it be nice if you
could load an XmlDocument class with XML data retrieved directly from the data source? By
using HTTP queries along with SQL Server 2000, you can do just that. In this section you’ll
see how this is accomplished and what steps need to be taken to allow for this type of func-
tionality. Let’s first take a look at configuring Internet Information Services (IIS) to support
SQL Server 2000 HTTP queries.
Configuring SQL Server Virtual Directories in IIS

SQL Server 2000 allows virtual directories created for IIS to have direct access to data within
a SQL database. On installing SQL Server 2000 (either the full installation or the client tools)
on a machine with IIS, the process of creating these specialized virtuals is as simple as access-
ing the correct tool.
To begin the configuration process, you must go to Start, Programs, Microsoft SQL Server
program group and select the Configure SQL XML Support in IIS option. After selecting this
option, you should see a screen similar to the one you’re used to seeing when accessing the
Internet Services Manager. Open up the default Web server, right-click it, select New, and then
click the Virtual Directory option. This process is shown in Figure 9.1.
2000, XML, AND

SQL SERVER
ASP.NET
FIGURE 9.1
Creating a new SQL Server Virtual Directory.
The next screen prompts you for the virtual directory’s alias and physical location. Supply the
same information that you would for creating a normal IIS virtual. For the samples in this
352
chapter, a virtual alias of Northwind will be used with a physical path pointing to a Northwind
directory located in the wwwwroot folder. Feel free to change the virtual’s alias and physical
path if desired.
After the preceding information has been completed, click the Security tab (located at the top
of the screen). Enter the correct user ID and password information to allow users to access
SQL Server 2000 through the virtual directory. This section allows you to specify a Windows
or SQL Server account type. It also allows you to use Windows Integrated Security, if desired.
Security is always a crucial piece of any application and the SQL Server 2000 documentation
provides detailed information about this important topic. Although database security is beyond
the scope of this book, properly securing a database is extremely important and should not be
taken lightly when building your ASP.NET applications.
After entering the correct security information, click the Data Source tab. The information
required by this section is similar to what you would provide when creating a DSN or
Universal Data Link (UDL). Simply supply the name of the SQL Server 2000 machine
along with the appropriate database within the server. For the examples in this chapter, the
Northwind database will be used as shown in Figure 9.2.
FIGURE 9.2
Establishing the proper SQL Server and Database to use with the Virtual Directory.
After completing the preceding steps, click the Settings tab. This tab contains several options
that can be selected to lock down the virtual directory’s access to the database. Figure 9.3
shows the different options.
353
CHAPTER 9
FIGURE 9.3
The Settings tab.
For the purposes of this chapter, check the boxes next to each option and click the Apply but-
ton. When building real applications, you’ll want to select the options that allow adequate
access to the SQL Server machine but also provide an adequate level of security. This topic as
well as a discussion of each option will be covered in the next few sections.
The Virtual Names tab allows templates, schemas, and dbobjects to be associated with the
newly created virtual directory. This screen allows you to tell the virtual directory where to go
when the calling application needs access to a particular template, schema, or dbobject. 9
Because these topics haven’t been covered to this point, I’ll defer a more in-depth discussion
2000, XML, AND

SQL SERVER
about them until later in the chapter. For now, click the OK button so that the SQL Server 2000
ASP.NET
virtual is created with all the different properties you have entered.
After it is completed, click the default Web server located in the left pane; the newly created
virtual should now appear in the right pane. You can make changes to any of the virtual proper-
ties by right-clicking it and selecting Properties. Now that you’ve created a SQL Server 2000
virtual directory, let’s take a look at the many ways it can be used in ASP.NET pages.
Querying SQL Server 2000 Through HTTP Using the

FOR XML Keywords
Using the virtual directory just created, queries can be executed against the Northwind data-
base by placing the SQL query into a URL. To see this in action, open your browser and type
354
the following URL into the location box. If you used a virtual alias other than Northwind,
you’ll need to substitute the appropriate name instead:
http://localhost/Northwind?sql=SELECT+*+FROM+Customers+WHERE+CustomerID=
‘ALFKI’+FOR+XML+AUTO&root=root
The following XML document will appear in the browser:

<?xml version=”1.0” encoding=”utf-8” ?>
<root>
<Customers CustomerID=”ALFKI” CompanyName=”New Name” ContactName=”Maria
Anders” ContactTitle=”Sales Representative” Address=”Obere Str. 57”
City=”Berlin” PostalCode=”12209” Country=”Germany” Phone=”030-0074321”
Fax=”030-0076545”
/>
</root>
Before preceding much further, let’s break the preceding URL shown into pieces. To start, the
URL path to the virtual named Northwind is followed by the actual SQL query that needs to be
executed against the Northwind database. In this case, the query is defined as
SELECT * FROM Customers WHERE CustomerID=’ALFKI’
To allow the database to recognize the query properly, it is set equal to a QueryString parame-
ter named sql. You’ll also notice that the query is URL encoded so that it can be passed prop-
erly to the database. All spaces have been replaced with the + character and other characters
that invalidate the URL are URL encoded as well.
At the end of the query, two new keywords have been added. These are the FOR XML keywords
that serve the obvious purpose of returning data from the database in the form of XML. These
keywords are followed by the AUTO keyword that tells the database to return an XML document
that allows for elements to be nested. If this SQL query returned customers and orders, using
the AUTO keyword will automatically nest all order elements under the appropriate customer
that placed the order. All the fields specified in the query will be returned as attributes, as
shown in the previous XML document.
After the FOR XML AUTO keywords, another QueryString parameter is included named root.
This parameter is used to name the root element of the XML document that will be returned.
You can run the same query and substitute the value of northwind for the root QueryString
parameter value. The resulting XML document will look the same as the one shown earlier,
except that the root element will now be named northwind, as shown next:
<northwind>
<Customers CustomerID=”ALFKI” CompanyName=”New Name” ContactName=”Maria
Anders” ContactTitle=”Sales Representative” Address=”Obere Str. 57”
355
CHAPTER 9
City=”Berlin” PostalCode=”12209” Country=”Germany” Phone=”030-0074321”

Fax=”030-0076545”
/>
</northwind>
Although this sample is quite simple, more complex queries that join different tables can be
executed using HTTP as well. The following query builds on the previous one by joining the
Customers and Orders tables found in the Northwind database:
http://localhost/northwind?sql=SELECT+Customer.CustomerID%2c
Customer.ContactName%2c%5bOrder%5d.OrderID+FROM+Customers+Customer+
INNER+JOIN+Orders+%5bOrder%5d+ON+Customer.CustomerID%3d%5bOrder%5d.CustomerID+
FOR+XML+AUTO&root=Northwind
The output from this query is shown next:

<Northwind>
<Customer CustomerID=”ALFKI” ContactName=”Maria Anders”>
<Order OrderID=”10643”/><Order OrderID=”10692”/>
<Order OrderID=”10702”/>
</Customer>
<Customer CustomerID=”ANATR” ContactName=”Ana Trujillo”>
</Customer>
 9
2000, XML, AND

</Northwind>
SQL SERVER
ASP.NET
In cases where you do not want any nesting to occur between the customers and orders, SQL
Server 2000 offers another keyword that can used instead of AUTO. This alternative keyword is
named RAW. Using the RAW keyword results in all fields being grouped into one XML element
“row” structure, as shown next:
<Northwind>
<row CustomerID=”ALFKI” ContactName=”Maria Anders” OrderID=”10643”/>
<row CustomerID=”ALFKI” ContactName=”Maria Anders” OrderID=”10692”/>

</Northwind>
You’ll see in the next few examples that the FOR XML keywords can be followed by several other
keywords in addition to AUTO and RAW that allow for complete customization of XML data.
356
You’ll notice that as HTTP queries become more complex, URL encoding them can be diffi-
cult to do by hand. Listing 9.1 shows how to make this process easier by using the ASP.NET
Server object’s UrlEncode() method. It allows a regular SQL query to be entered and outputs
either a URL-encoded string version of the query or executes the query and returns the result-
ing XML document. Using the same type of logic shown in this listing, you can automatically
URL encode SQL queries in your ASP.NET pages.
LISTING 9.1 SQL Query URL-Encoding Utility

1: <html>
2: <head>
3: <script language=”c#” runat=”server”>
4: void SubmitButton_Click(Object Src, EventArgs E) {
5: String sql = sqlBox.Text;
6: String root = txtRoot.Text;
7: if (root.Length == 0) root = “root”;
8: if (sql.Length > 0) {
9: String encodedSql = Server.UrlEncode(sql);
10: if (rdoChoice.SelectedIndex == 0) {
11: lblErrors.Text = encodedSql;
12: } else {
13: lblErrors.Text = “”;
14: Response.Redirect(@”http://localhost/northwind?sql=”
15: + encodedSql + “&root=” + root);
16: }
17: } else {
18: lblErrors.Text = “You did not enter a SQL query.”;
19: }
20: }
21: </script>
22: </head>
25: <asp:Label id=”lblErrors”
26: style=”font-family: arial;font-weight: bold;color: red”
27: runat=”server”/>
28: <p/>
29: <span style=”font-family:arial;font-weight:bold;”>
30: Root Name:
31: </span>
32: <asp:TextBox id=”txtRoot” runat=”server”/>
33: <p/>
34: <span style=”font-family:arial;font-weight:bold;”>
35: SQL Query:
36: </span>
357
CHAPTER 9

37: <br/>
38: <asp:TextBox id=”sqlBox” Text=”” Height=”300px” Width=”600px”
39: TextMode=”MultiLine” Wrap=”true” runat=”server”/>
40: <p/>
41: <asp:RadioButtonList style=”font-family:arial;” id=”rdoChoice”
42: RepeatDirection=”Vertical” runat=”server”>
43: <asp:ListItem Selected=”true”>
44: Return Encoded SQL
45: </asp:ListItem>
46: <asp:ListItem>Execute SQL Statement</asp:ListItem>
47: </asp:RadioButtonList>
48: <p/>
49: <asp:Button id=”submitbutton” text=”Run Query”
50: OnClick=”SubmitButton_Click” runat=”server”/>
51: </form>
52: </body>
53: </html>
Virtually any query can be executed using HTTP. For example, assume a particular application
has a stored procedure named sp_GetXml, as shown next:
CREATE PROCEDURE sp_GetXml
(
@CustomerID varchar(5)
)
AS
BEGIN
SELECT CustomerID, CompanyName, ContactName 9
FROM Customers
2000, XML, AND

WHERE CustomerID LIKE @CustomerID + ‘%’
SQL SERVER
ASP.NET
FOR XML AUTO
END
To execute this procedure and pass in the appropriate argument, the following HTTP query can
be used:
http://localhost/northwind?sql=exec+sp_GetXml+’A’&root=root
This same logic applies to more advanced stored procedures and allows for dynamic substitu-
tion of the argument values (“A” in this case), depending on the results that the end user wants
to see.
At this point you may be wondering why you would ever use an HTTP query to obtain XML
data from a database located on SQL Server 2000? Because it returns XML data directly to the
358
browser, what good is it? Thinking back to Chapter 7, “Transforming XML with XSLT and
ASP.NET,” you’ll remember that the XPathDocument class has several constructors that can be
used to load an XML document. One of those constructors accepts a string specifying the path
to the XML document. This presents an excellent opportunity to obtain an XML document
directly from a SQL Server database.
The next listing shows how data from SQL Server 2000 can be loaded directly into an
XPathDocument class and then transformed into HTML using the XslTransform class. Listing
9.2 shows this process.
Listing 9.2 Loading XML Data Directly from SQL Server 2000 into the XPathDocument Class
<%@ Import Namespace=”System.Xml.XPath” %>
<%@ Import Namespace=”System.Xml.Xsl” %>
string sql = Server.UrlEncode(@”SELECT Customers.CustomerID,
Customers.ContactName, Orders.OrderID,
Orders.CustomerID FROM Customers
INNER JOIN Orders
ON Customers.CustomerID = Orders.CustomerID
FOR XML AUTO”);
string url = “http://localhost/northwind?sql=” + sql +
➥“&root=Northwind”;
try {
XPathDocument xmlDoc = new XPathDocument(url);
XslTransform xslDoc = new XslTransform();
xslDoc.Load(Server.MapPath(“customers.xsl”));
xslDoc.Transform(xmlDoc,null,Response.Output);
}
Response.Write(exc.ToString());
}
}
</script>
Listing 9.3 follows along the lines of the previous listing but shows how XML data from SQL
Server 2000 can be loaded directly into a DataSet class using its ReadXml() method.
LISTING 9.3 Loading XML Data Directly from SQL Server 2000 into the DataSet Class
359
CHAPTER 9

5: string sql = Server.UrlEncode(@”SELECT Customers.CustomerID,
6: Customers.ContactName, Orders.OrderID,
7: Orders.CustomerID FROM Customers
8: INNER JOIN Orders
9: ON Customers.CustomerID = Orders.CustomerID
10: WHERE Customers.CustomerID = ‘ALFKI’
11: FOR XML AUTO”);
12: string schemaFile = Server.MapPath(“httpSelect.xsd”);
14:
15: //**** Load the Schema File and XML
16: ds.ReadXmlSchema(schemaFile);
17: ds.ReadXml(“http://localhost/northwind?sql=” + sql +
18: “&root=Northwind”);
19: //**** Add Two Rows to the DataSet
20:
21: DataRow row = ds.Tables[0].NewRow();
24:
25: DataRow row2 = ds.Tables[1].NewRow();
26: row2[“OrderID”] = “11000”;
27: row2[“CustomerID”] = “DLWID”;
28:
29: ds.Tables[0].Rows.Add(row);
30: ds.Tables[1].Rows.Add(row2);
31: // Code to actually insert row goes here.....
32: DataView view = new DataView(ds.Tables[0]);
33: view.Sort = “CustomerID ASC”;
9
2000, XML, AND

34: Customers.DataSource = view;
SQL SERVER
35: Customers.DataBind();
ASP.NET
36:
37: view.Table = ds.Tables[1];
38: Orders.DataSource = view;
39: Orders.DataBind();
40: //Let’s look at the schema
41: schemaDiv.InnerHtml = ds.GetXmlSchema();
42:
43: }
44: </script>
45: <html>
360

48: <h2>XML Data Loaded Directly from SQL Server 2000 into a DataSet</h2>
49: <ASP:DataGrid id=”Customers” runat=”server”
50: Width=”700”
54: CellPadding=5
60: MaintainState=”false”
61: />
62: <p> </p>
63: <ASP:DataGrid id=”Orders” runat=”server”
64: Width=”700”
68: CellPadding=5
74: MaintainState=”false”
75: />
76: <p> </p>
77: <textarea rows=”20” cols=”80” id=”schemaDiv” runat=”server”></textarea>
78: </form>
79: </body>
80: </html>
Lines 5–11 start things off by URL encoding a SQL statement that will be executed using an
HTTP query. After this string is created, it is used to hit the database and generate XML data
that is then loaded into the DataSet (lines 17 and 18). You’ll notice that an XSD schema is also
loaded into the DataSet so that the data read from the database is structured correctly (line 16).
Lines 21–30 take care of adding two new rows. One row is added to the Customers XML
information (table[0]) and another row is added to the Orders XML information (table[1]).
After this operation is complete, two DataViews are created based on the two tables and then
bound to an ASP.NET DataGrid. Figure 9.4 shows what the code generates in the browser.
361
CHAPTER 9
FIGURE 9.4
HTML generated by Listing 9.3.
The XML generated by this process is shown next:

<NewDataSet>
<Customers CustomerID=”ALFKI” ContactName=”Maria Anders”>
<Orders OrderID=”10643” CustomerID=”ALFKI” />
<Orders OrderID=”11011” CustomerID=”ALFKI” /> 9
2000, XML, AND

</Customers>
SQL SERVER
<Customers CustomerID=”DLWID” ContactName=”Dan Wahlin” />
ASP.NET
<Orders OrderID=”11000” CustomerID=”DLWID” />
</NewDataSet>
The first thing that stands out is that the newly added Order row isn’t placed into the XML
document as a child of the Customers element that it relates to (CustomerID=”DLWID”), as is
the case with the other Orders elements. This is because the new Orders row was added to the
Orders table (table[1]) and therefore has no relationship with the newly added Customers
row. Although a relationship between the CustomerID attribute in the Customers element and
the CustomerID attribute in the Orders element can be created using the DataRelation class
along with the DataSet’s Relations property, there is another way to accomplish this task
with the DataSet class.
Looking back at Figure 9.4, you’ll see an interesting feature of the DataSet. Each bound table
shows a Customers_id field that was not a part of the original SQL statement executed using
362
the HTTP query. Where did this mysterious field come from? Because the XML information
for Orders is subordinate to the Customers XML information, the DataSet automatically
added an identity field to help track which Orders are associated with which Customers.
Looking at the second DataGrid on the page, you can see that several of the orders have the
same Customers_id number. These different Orders elements show up under the same
Customers element. However, the newly added Orders row doesn’t have any value in its
Customers_id field. This is because the DataSet doesn’t know that it is related to a specific
Customers element.
To put the newly created Order row under the appropriate Customer element in the DataSet,
the following line of code can be added after line 30 in Listing 9.3:
ds.Tables[1].Rows[ds.Tables[1].Rows.Count-1][“Customers_id”] =
ds.Tables[0].Rows[ds.Tables[0].Rows.Count-1][“Customers_id”];
This line of code makes the new Orders row a child element of the proper Customers element
by relating their Customers_id fields. Adding this one line of code results in the following
XML fragment:
<Customers CustomerID=”DLWID” ContactName=”Dan Wahlin”>
<Orders OrderID=”11000” CustomerID=”DLWID”/>
</Customers>
You can see that the newly added Order is now a child element of the new Customer element
because the DataSet now knows that the two elements are related. A more correct way to
relate the newly added order to the customer is to associate the rows using the SetParentRow()
method:
ds.Tables[1].Rows[ds.Tables[1].Rows.Count-1].SetParentRow(
ds.Tables[0].Rows[ds.Tables[0].Rows.Count-1])
Keep in mind that if nesting and relationships between tables wasn’t specified in advance, the
SetParentRow() method would not do anything useful. Fortunately, the nesting was specified
in the XML schema and in the structure of the XML data returned from SQL Server 2000 that
was initially loaded into the DataSet. Because of this, the SetParentRow() method can be
called when nesting among newly added rows needs to occur.
Using HTTP Queries to Return Elements and Schemas

Thus far, all the examples involving HTTP queries have resulted in XML data being returned
within attributes. What if you need some fields returned as elements instead? This can easily be
accomplished by using a new keyword in SQL Server 2000 named ELEMENTS. To understand this
keyword better, let’s take a look at a simple example that can be run directly in the browser:
http://localhost/northwind?sql=SELECT+*+FROM+Customers+FOR+XML+AUTO%2c+
ELEMENTS&root=Northwind
363
CHAPTER 9
This query results in the following XML fragment:

<Northwind>
<Customers>
<CustomerID>ALFKI</CustomerID>
<CompanyName>New Name</CompanyName>
<ContactName>Maria Anders</ContactName>
<ContactTitle>Sales Representative</ContactTitle>
<Address>Obere Str. 57</Address>
<City>Berlin</City>
<PostalCode>12209</PostalCode>
<Country>Germany</Country>
<Phone>030-0074321</Phone>
<Fax>030-0076545</Fax>
</Customers>

</Northwind>
Use of the ELEMENTS keyword is restricted to cases where the FOR XML AUTO keywords are also
used. Whenever data needs to be contained within elements rather than attributes, simply add
FOR XML AUTO, ELEMENTS to the end of the SQL SELECT statement or stored procedure.
The FOR XML AUTO keywords can also be followed by the XMLDATA and BINARY BASE64 key-
words. The XMLDATA keyword instructs the database to return an inline XDR schema along
with the XML data. The BINARY BASE64 keywords are used to BASE64 encode binary
columns that would otherwise contain illegal characters. More details about working with
binary data can be found in the SQL Server 2000 documentation.
Querying SQL Server 2000 Through HTTP with 9

XML Templates
2000, XML, AND

SQL SERVER
The capability to query SQL Server 2000 directly through HTTP by embedding SQL directly
into the query string is certainly a powerful and useful feature. However, it can represent a ASP.NET
potential security risk even if the URL being used to query the database is embedded within a
compiled ASP.NET page. After one end user learns about the capability to query the database
directly using the browser, data within the database could be compromised.
To allow for more secure HTTP-based queries, SQL Server 2000 introduces the concept of
XML templates. These templates can be used in a manner similar to the HTTP queries shown
earlier. In fact, templates can be embedded into the query string if desired, as shown next:
http://localhost/northwind?template=<ROOT+
xmlns:sql=”urn:schemas-microsoft-com:xml-sql”>
<sql:query>SELECT+*+FROM+Customers+
364
FOR+XML+AUTO</sql:query></ROOT>
Querying the database in this manner still poses a potential security risk because an end user
with knowledge of templates could use them on the query string to manipulate data in the data-
base. And it’s obviously not the easiest way to write out a query.
To combat the potential problem with security, SQL Server virtual directories (discussed ear-
lier) can be configured to block HTTP URL queries. Instead, queries can be directed toward
XML template files that contain the desired SQL query. Before discussing template specifics,
let’s revisit the SQL Server virtual directory administration console discussed earlier. As a
refresher, you can reach this tool by going to the Start, Programs, Microsoft SQL Server pro-
gram group and selecting the Configure SQL XML Support in IIS option. Open up the com-
puter hierarchy to reveal the default Web site. Click this, and in the right pane you’ll see the
Northwind virtual directory created earlier in the chapter. Right-click this virtual, choose
Properties, and then go to the Settings tab.
The first check box, Allow URL Queries, should be checked. To prevent users from accessing
the database through a URL query, uncheck the box. This will only allow SQL HTTP queries
to be executed against the database that are XML templates, XPath queries, or form template
posts. HTTP URL queries will be blocked completely.
To allow XML templates to perform SQL queries, go to the Virtual Names tab and click the
New button. Although the virtual name can be anything you like, the examples that follow will
use a name of Templates. Enter this name, and in the drop-down box named Type, select
Template. Next, either type in a path to where your XML templates will be stored or click the
Browse button. A path of d:\inetpub\wwwroot\testbed\chapter9 was used for the examples
that follow, although the path can point to anywhere you would like. After you have supplied
all the necessary information, click the Save button.
Now that a virtual has been mapped to a folder designated to hold XML query templates, let’s
take a look at how to create a valid template that can be used to execute SQL queries. Listing
9.4 shows a sample template. It uses a namespace prefix of sql with a URI of urn:schemas-
microsoft-com:xml-sql. One of the elements is named query. It serves the rather obvious
purpose of marking up SQL query statements located within the template file.
LISTING 9.4 A Sample XML Template File

1: <Northwind xmlns:sql=”urn:schemas-microsoft-com:xml-sql”>
2: <sql:query>
3: SELECT Customers.CustomerID, Customers.ContactName,
4: Orders.OrderID, Orders.CustomerID
365
CHAPTER 9

5: FROM Customers
8: FOR XML AUTO
9: </sql:query>
10: </Northwind>
After the template is saved to the proper directory, you can test it by going to the following URL:
http://localhost/northwind/templates/listing9.4.xml
Breaking the URL down into individual pieces, you can see that the Northwind virtual root is
first specified (case does not matter here). Within this virtual, the templates virtual name is
then used. It maps to the physical templates directory that was configured earlier. Finally, the
name of the template is specified (listing9.4.xml). Executing this template using a browser
results in an XML document containing different Orders elements nested under Customers
elements, as shown earlier in the chapter.
Several advantages exist to using templates as compared to URL queries. First of all, an end
user now has no control over changing the SQL statements. By removing access to SQL Server
through URL queries, only an individual with write access to the template file can make
changes. This prevents inserts, updates, or deletes from being performed in an unauthorized
manner. Second, XML templates support the dynamic inclusion of parameters. This allows you
to change the value of a SQL WHERE clause without having to make adjustments to the actual
template file.
Working with parameters is as simple as including an XML header element, as shown in
Listing 9.5: 9
2000, XML, AND

SQL SERVER
LISTING 9.5 Specifying Parameters in an XML Template
1: <Northwind xmlns:sql=”urn:schemas-microsoft-com:xml-sql”> ASP.NET
2: <sql:header>
3: <sql:param name=’CustomerID’>A</sql:param>
4: </sql:header>
5: <sql:query>
6: SELECT Customers.CustomerID, Customers.ContactName,
7: Orders.OrderID, Orders.CustomerID
8: FROM Customers
11: WHERE Customers.CustomerID LIKE @CustomerID + ‘%’
12: FOR XML AUTO
366

13: </sql:query>
14: </Northwind>
Within the header element, a param element has been defined with a name attribute equal to
CustomerID. This parameter is given a default value of A. You can use the parameter in the
template just like you would in a stored procedure. Simply append the “@” character to the
front of the parameter and then place it in within the SQL statement or stored procedure. For
this example, the CustomerID parameter is used in the WHERE clause. If the parameter is set to
“B”, the SQL statement will return all rows from the Customers and Orders tables with a
CustomerID that starts with “B.”
Calling this template and passing the correct CustomerID parameter value is as simple as
adding the parameter name and value on the end of the query string, as shown next:
http://localhost/northwind/templates/listing9.5.xml?CustomerID=B
NOTE
Parameters used in XML templates are case sensitive just as XML is. If customerID was
used as the parameter name on the query string instead of CustomerID, the default
value of “A” would be used no matter what value was passed in.
Templates can also be used to execute stored procedures. Listing 9.6 shows an example of call-
ing the sp_GetXml procedure shown earlier in the chapter:
LISTING 9.6 Using an XML Template to Call a Stored Procedure

2: <sql:header>
3: <sql:param name=’CustomerID’/>
4: </sql:header>
5: <sql:query>
6: EXEC sp_GetXml @CustomerID
7: </sql:query>
8: </Northwind>
Using templates to load an XPathDocument or DataSet class within an ASP.NET page is as

simple as supplying the URL to the template file in the XmlPathDocument’s constructor or the
DataSet’s ReadXml() method. Listing 9.7 shows the process of loading each of these classes
with XML data obtained from SQL Server 2000 templates.
367
CHAPTER 9
LISTING 9.7 Using XML Templates in ASP.NET

1: <%@ Import Namespace=”System.Data” %>
8: if (!IsPostBack) {
9: BindDropDown();
10: }
11: }
12: void SubmitButton_Click(object sender, EventArgs e) {
13: string CustID = customers.SelectedItem.Value;
14: string url = “http://localhost/northwind/templates/” +
15: “listing9.5.xml?CustomerID=” + CustID;
18: XmlTextWriter writer = new XmlTextWriter(sw);
19: try {
20: XPathDocument xmlDoc = new XPathDocument(url);
22: xslDoc.Load(Server.MapPath(“customers.xsl”));
23: xslDoc.Transform(xmlDoc,null,writer);
24: transform.InnerHtml = sb.ToString();
25: }
27: Response.Write(exc.ToString());
28: }
29: }
30: void BindDropDown() {
9
2000, XML, AND

31: string url = “http://localhost/northwind/” +
SQL SERVER
32: “templates/getCustomerID.xml”;
ASP.NET
34: ds.ReadXml(url);
35: customers.DataTextField = “CustomerID”;
36: customers.DataSource = ds.Tables[0].DefaultView;
37: customers.DataBind();
38: }
39: </script>
40: <html>
42: <form runat=”server” ID=”Form1”>
43: <b>Select a CustomerID to View:</b>
44: <asp:DropDownList id=”customers” runat=”server” />
368

45: <asp:button id=”SubmitButton” text=”Submit”
46: OnClick=”SubmitButton_Click” runat=”server” />
47: <p />
48: <div id=”transform” runat=”server”></div>
49: </form>
50: </body>
51: </html>
This simple example doesn’t introduce anything new in the way of classes or methods. In fact,
everything just shown was covered in Chapters 6–8. However, it does show different ways you
can interact with data in SQL Server 2000 using ASP.NET.
Querying SQL Server 2000 Through HTTP Using

Templates, XPath, and XDR Schemas
Aside from containing SQL statements and parameters, SQL Server 2000 template files can
also contain XPath queries. Why would you ever want to query the database using XPath when
T-SQL can be used instead? The capability to use XPath can be beneficial in situations where
individuals desiring to get XML data from the database have little or no T-SQL experience.
Assuming they know how to use XPath queries, they can query the database and treat it like
they would any other XML document. Using XPath also allows database tables and their asso-
ciated fields to be treated as an XML document rather than as relational entities.
Although many types of queries can be accomplished using XPath, SQL Server 2000 currently
supports a subset of the W3C XPath specification. As such, the following limitations apply:
• Root queries (/) are not supported.
• Because document order is not determined when performing an XPath query in SQL
Server 2000, axes using document order and predicates involving numeric values are not
supported. The following axes are not currently supported: ancestor, ancestor-or-self,
descendant, descendant-or-self (//), following, following-sibling, namespace,
preceding, preceding-sibling.
• The following string functions are not currently supported: string(), concat(),
starts-with(), contains(), substring-before(), substring-after(), substring(),
string-length(), normalize(), translate().
• The lang() Boolean function is not supported.

• The following numeric functions are currently not supported: sum(), floor(),
ceiling(), round().
369
CHAPTER 9
Keep in mind that as different service packs and “Web Releases” are released, this information
may change. To stay up to date with the different SQL Server 2000 XML features, visit
http://msdn.microsoft.com/sqlserver.
SQL Server 2000 Schemas and Annotations

Having seen the current limitations, let’s take a look at how you can use XPath queries to
return XML information from a SQL Server 2000 database. First, an XPath statement must be
used along with an XDR schema (XSD schema support was being released as this book went
to print). Chapter 4, “Understanding DTDs and XML Schemas,” covers the specifics of XDR
schemas. For an XDR schema to be used with XPath query in SQL Server 2000, it must con-
tain specific schema annotations. These annotations allow tables and fields within a database to
be mapped to attributes or elements. They also allow for additional pieces of functionality,
including the establishment of relationships between tables. Table 9.1 shows the available SQL
Server 2000 XDR schema annotations.
TABLE 9.1 Annotations to XDR Schemas

Annotation Description
sql:field Used to map elements or attributes to specific fields within a
table.
sql:relation Used in mapping a particular element or attribute to a data-
base table or view. This annotation is very useful when a table
name such as Order Details needs to be mapped to an XML
element. Because Order Details is not valid in XML
because of the space, a different name can be assigned to the
XML element that relates to the Order Details table. When
specified on an element, the scope of the annotation applies to 9
2000, XML, AND

all attributes and child elements.
SQL SERVER
Used on elements that do not map to a specific table or field.
ASP.NET
sql:is-constant
The annotation value can be a Boolean. Uses of this annota-
tion include specifying top-level nodes in an XML document
or container nodes. XML items with this annotation will show
in the output.
sql:key-fields Specifies that the element or attribute is the primary key that
uniquely identifies a row in a table.
sql:id-prefix Used to create a valid XML ID, IDREF, or IDREFS attribute
type. Because these types must start with an alphabetic char-
acter or underscore character, a valid prefix is specified that
will be used to start the ID, IDREF, or IDREFS type attribute.
370
TABLE 9.1 continued

Annotation Description
sql:target-namespace Places elements and attributes normally in the default name-
space into a different namespace for query results.
sql:relationship Used to establish relationships between different XML ele-
ments. The key, key-relation, foreign-key and foreign-
relation attributes are used to establish the relationship.
sql:use-cdata Specifies that an element is a CDATA type.
sql:url-encode Used to specify that an element or attribute’s value should be
URL encoded. This is normally used with BLOB fields.
sql:overflow-field Specifies the database field that contains overflow data.
sql:limit-field and sql: Allows fields to be filtered based on specific values.
limit-value
Without the schema and associated SQL Server annotations, it would be very difficult to match
up the tables and fields within a database to different elements and attributes within an XML
document. How could an XPath statement possibly return results from the database if it had no
idea what fields to look for when an attribute or element is used in an XPath query? The obvi-
ous answer is that XPath queries cannot access the database on their own. They must be used
with schemas to identify the structure of the data within the database.
The best way to learn about using the annotations within a schema is to see an example of
them in action. Listing 9.8 contains a modified version of the XDR schema used in the SQL
Server 2000 documentation. It can be used to query the Northwind database using XPath.
LISTING 9.8 Northwind Mapping Schema

2: <Schema xmlns=”urn:schemas-microsoft-com:xml-data”
3: xmlns:dt=”urn:schemas-microsoft-com:datatypes”
4: xmlns:sql=”urn:schemas-microsoft-com:xml-sql”>
5:
6: <ElementType name=”Customer” sql:relation=”Customers”>
7: <AttributeType name=”CustomerID” dt:type=”id” />
8: <AttributeType name=”CompanyName” />
9: <AttributeType name=”ContactName” />
10: <AttributeType name=”City” />
11: <AttributeType name=”Fax” />
12: <AttributeType name=”Orders” dt:type=”idrefs” sql:id-prefix=”Ord-” />
13:
371
CHAPTER 9

14: <attribute type=”CustomerID” />
15: <attribute type=”CompanyName” />
16: <attribute type=”ContactName” />
17: <attribute type=”City” />
18: <attribute type=”Fax” />
19: <attribute type=”Orders” sql:relation=”Orders” sql:field=”OrderID”>
20: <sql:relationship
21: key-relation=”Customers”
22: key=”CustomerID”
23: foreign-relation=”Orders”
24: foreign-key=”CustomerID” />
25: </attribute>
26:
27: <element type=”Order”>
29: key-relation=”Customers”
30: key=”CustomerID”
31: foreign-relation=”Orders”
32: foreign-key=”CustomerID” />
33: </element>
34: </ElementType>
35:
36: <ElementType name=”Order” sql:relation=”Orders”>
37: <AttributeType name=”OrderID” dt:type=”id” sql:id-prefix=”Ord-” />
38: <AttributeType name=”EmployeeID” />
39: <AttributeType name=”OrderDate” />
40: <AttributeType name=”RequiredDate” />
41: <AttributeType name=”ShippedDate” />
42:
43: <attribute type=”OrderID” />
9
2000, XML, AND

44: <attribute type=”EmployeeID” />
SQL SERVER
45: <attribute type=”OrderDate” />
ASP.NET
46: <attribute type=”RequiredDate” />
47: <attribute type=”ShippedDate” />
48:
50: <element type=”OrderDetail”>
52: key-relation=”Orders”
53: key=”OrderID”
54: foreign-relation=”[Order Details]”
55: foreign-key=”OrderID” />
56: </element>
57: <element type=”Employee”>
372

59: key-relation=”Orders”
60: key=”EmployeeID”
61: foreign-relation=”Employees”
62: foreign-key=”EmployeeID” />
63: </element>
64: </ElementType>
65:
66: <ElementType name=”OrderDetail” sql:relation=”[Order Details]”
67: sql:key-fields=”OrderID ProductID”>
68: <AttributeType name=”ProductID” dt:type=”idref”
69: sql:id-prefix=”Prod-” />
70: <AttributeType name=”UnitPrice”/>
71: <AttributeType name=”Quantity” />
72:
73: <attribute type=”ProductID” />
74: <attribute type=”UnitPrice”/>
75: <attribute type=”Quantity” />
76:
77: <element type=”Discount” sql:field=”Discount”/>
78: </ElementType>
79:
80: <ElementType name=”Discount” dt:type=”string”
81: sql:relation=”[Order Details]”/>
82:
83: <ElementType name=”Employee” sql:relation=”Employees”>
84: <AttributeType name=”EmployeeID” dt:type=”idref”
85: sql:id-prefix=”Emp-” />
86: <AttributeType name=”LastName” />
87: <AttributeType name=”FirstName” />
88: <AttributeType name=”Title” />
88: <attribute type=”EmployeeID”/>
89:
90: <attribute type=”LastName” />
91: <attribute type=”FirstName” />
92: <attribute type=”Title” />
93: </ElementType>
94: </Schema>
The first thing you’ll notice when looking at Listing 9.8 is that it follows the XDR specifica-
tion by declaring element and attribute types. These declarations are used to define the struc-
ture of data within the database. The schema annotations are integrated directly into the
schema as attributes (except in the case of the relationship element, which is discussed
later). Each annotation is associated with the sql namespace prefix. The URI for this
373
CHAPTER 9
namespace references urn:schemas-microsoft-com:xml-sql. This URI must be used exactly

as shown for the annotations to work properly with SQL Server 2000.
Line 6 contains the first annotation (sql:relation), which is used to specify that the Customer
element and its attributes reference the Customers table. Alternatively, each attribute could
have used the sql:field annotation to specify what field they are associated with. However,
because all the attribute names correspond to a field in the Customers table, this is unneces-
sary. In cases where element or attribute names differ from the actual field name, the
sql:field annotation can be used as follows:
<attribute type=”My Customer ID” sql:field=”CustomerID”/>
Line 12 contains an attribute declaration named Orders that has an IDREFS data type. To
ensure that the data returned from the database follows the IDREFS naming restrictions, the
sql:id-prefix annotation is used. This dynamically adds the value “Ord-” to each referenced
IDREF value.
Lines 20–24 contain the only element annotation that SQL Server 2000 uses. This annotation,
named relationship, is used to specify which IDREF values should be added as values for
the Orders attribute. It automatically associates each order a customer places with the appro-
priate OrderID by using the key-relation, key, foreign-relation, and foreign-key attrib-
utes. The key-relation attribute specifies the primary relation table; Customers, in this
example. The key attribute then locates the primary key within this table (CustomerID). These
two attributes are followed by the foreign-relation attribute, which references the foreign
relation table (Orders). The foreign-key attribute then identifies the primary key within this
table (CustomerID). The end result of using these various annotations on the Orders attribute is
a list of IDREF values each prefixed with “Ord-.”
Jumping to line 58 in Listing 9.8, another annotation named key-fields is used to identify 9
what UnitPrice and Quantity fields should be pulled from the Order Details table. The
2000, XML, AND

key-fields annotation specifies that the OrderID and ProductID fields are used to uniquely
SQL SERVER
ASP.NET
identify a given row in the table.
Although Listing 9.8 does not show how to use each annotation listed in Table 9.1, it gives you
a good feel for how annotations can be used to map database table fields to XML items and
create relationships among different tables. For more information on the annotations that were
not covered, refer to the SQL Server 2000 documentation.
After a schema is created and annotations are added, it can be saved in two ways. First, it can
be saved in the same folder as the XML template files were saved. Doing this allows templates
to reference the schema based on its relative path. This process will be discussed in a moment.
Second, the schema can be saved in a folder specially designed to hold schemas. Referring
back to our discussion on setting up the Northwind SQL Server virtual directory earlier in the
374
chapter, you’ll remember that when XML templates needed to be used, a special folder was
configured for this purpose. This was done by going to the Virtual Settings tab of the virtual
directory’s properties sheet. Following the same steps outlined earlier, a schema folder can be
created by clicking the Add button on the Virtual Settings tab, giving the folder to hold
schemas a virtual name (“Schemas” is always a good name), selecting schema from the drop-
down box, and then specifying the physical path to the schemas folder. Note that a schema vir-
tual name can also point to a particular schema file. Pointing to a file can hide the fact that the
returned data is actually coming from a database.
Schemas present an excellent opportunity to define custom XML structures that can then be
returned from the database using XPath queries. Although an alternative to creating complex
XML document structures exists in the form of EXPLICIT mode queries (discussed later),
schemas present a straightforward way of creating XML structures that fit a specific mold used
by ASP.NET applications. To see how this works, let’s take a look at how XPath queries can
use schemas to return XML documents.
Using XPath Queries and Schemas in a URL

After a schema directory has been successfully configured, the following URL can be executed
in the browser. It leverages the schema shown in Listing 9.8 along with an XPath statement to
return XML data:
http://localhost/northwind/schemas/listing9.8.xdr/Customer
[@CustomerID=’ALFKI’]/Order?root=Northwind
This XPath query results in the following XML document being returned:
<Northwind>
<Order OrderID=”Ord-10643” EmployeeID=”6” OrderDate=”1997-08-25T00:00:00”
RequiredDate=”1997-09-22T00:00:00” ShippedDate=”1997-09-02T00:00:00”>
<Employee EmployeeID=”Emp-6” LastName=”Suyama” FirstName=”Michael”
Title=”Sales Representative”/>
<OrderDetail ProductID=”Prod-28” UnitPrice=”45.6” Quantity=”15”>
<Discount>0.25</Discount>
</OrderDetail>
<OrderDetail ProductID=”Prod-39” UnitPrice=”18” Quantity=”21”>
</OrderDetail>
<OrderDetail ProductID=”Prod-46” UnitPrice=”12” Quantity=”2”>
</OrderDetail>
</Order>

</Northwind>
375
CHAPTER 9
The first thing you’ll notice is that the XPath query immediately follows the schema filename
in the URL (listing9.8.xdr). The XPath query finds all Order elements that have a parent
element named Customer with a CustomerID attribute equal to ALFKI. It should be noted that
the query is case sensitive. Typing /customer[@customerID=’ALFKI’]/Order will result in an
error being returned because the case of the element and attribute being named does not match
the case specified in the schema. It’s also important to note that a root element name must be
supplied after the XPath query, as shown, for a well-formed XML document to be returned.
Turning off URL querying capability (through the virtual directory’s Settings tab) will not
affect XPath queries, because they are treated separately. In cases where querying the database
through the use of XPath queries is undesirable, an administrator can disable them by uncheck-
ing the Allow XPath option on the Settings tab of the appropriate virtual directory’s property
sheet. Any attempts to use them after they have been disabled will result in the following error
condition:
ERROR: 400.100 Bad Request
HResult: 0x80004005
Source: Microsoft SQL isapi extension
Description: XPath queries are not allowed.
Although useful, querying via XPath queries embedded in a URL can be problematic because
of URL-encoding issues. The next section demonstrates how to embed XPath queries directly
into XML template files.
Using XPath Queries, Schemas, and Templates

Like SQL queries, XPath queries can also be embedded within an XML template file. Listing
9.9 exhibits a simple XML template containing an XPath query. This query uses the schema
shown in Listing 9.8 to return all orders placed by a customer with a CustomerID equal to 9
2000, XML, AND

ALFKI.
SQL SERVER
LISTING 9.9 Embedding an XPath Query into an XML Template ASP.NET
2: <sql:xpath-query mapping-schema=”listing9.8.xdr”>
3: /Customer[@CustomerID=’ALFKI’]/Order
4: </sql:xpath-query>
5: </Northwind>
Like the SQL queries embedded in XML template files earlier in the chapter, XPath queries
use the urn:schemas-microsoft-com:xml-sql URI along with the sql prefix to identify
custom elements and attributes used in the template. For XPath queries, an element named
xpath-query is used to mark up the query syntax. This element also has an attribute named
376
mapping-schema that is used to identify the path to the appropriate schema file
(listing9.8.xdr in this case) used to map tables and fields to specific XML items.
Listing 9.10 shows another template file that uses a more complex XPath query.
LISTING 9.10 Embedding an XPath Query into an XML Template

3: /Customer[@CustomerID=’ALFKI’]/Order/Employee[@LastName=’Suyama’]
5: </Northwind>
When executed, the XPath query returns which employee was involved with handling a spe-
cific order placed by a customer, as shown next:
<Northwind xmlns:sql=”urn:schemas-microsoft-com:xml-sql”>
<Employee EmployeeID=”Emp-6” LastName=”Suyama” FirstName=”Michael”
Title=”Sales Representative”/>
</Northwind>
XPath queries used within template files can also accept parameters. This process functions in
a manner similar to parameters used in XSLT style sheets. Like XSLT, the $ character is used
to designate a variable. Listing 9.11 shows how to incorporate variables into templates contain-
ing XPath queries.
LISTING 9.11 Using Parameters with XPath Query Templates

2: <sql:header>
3: <sql:param name=”ID”/>
4: </sql:header>
6: /Customer/Order[@OrderID=$ID]
8: </Northwind>
Passing a parameter to the template can be accomplished by passing the parameter name along
with its associated value in the URL as shown next:
http://localhost/northwind/templates/listing9.11.xml?ID=ORD-10643
As was shown earlier in the chapter, passing the preceding URL to the XPathDocument class
constructor (or the Load() method of an XmlDocument class) or to the ReadXml() method of the
DataSet class results in ASP.NET pages that access XML data directly. For an example of
doing this, please refer back to Listing 9.7.
377
CHAPTER 9
Using EXPLICIT Mode Queries

Schemas present an excellent way to customize and mold the structure of XML data returned
from SQL Server 2000. By mapping specific database tables and fields to different XML ele-
ments and attributes, virtually any structure can be created. However, using schemas requires
that the schema annotations be included for the mapping to work correctly. For individuals
who do not feel comfortable writing XDR schemas or using the SQL Server 2000 schema
annotations, there is another alternative to creating a custom XML structure. This alternative
comes in the form of EXPLICIT mode queries. Although they can be more complex than writ-
ing XDR schemas, EXPLICIT mode queries provide a lot of power for shaping XML docu-
ments in a custom fashion. In general, EXPLICIT mode queries also offer better performance as
compared to XPath queries, especially because they can be included within stored procedures.
EXPLICIT mode queries are best understood by thinking in terms of hierarchical relationships
among XML elements. Figure 9.5 shows how EXPLICIT mode queries would classify the fol-
lowing XML document’s structure:
<Northwind>
<Customer>
<Order>
<OrderDetails/>
</Order>
</Customer>
</Northwind>
Northwind
Parent = Null, Tag = 1
Customer
9
2000, XML, AND

Parent = 1, Tag = 2
SQL SERVER
ASP.NET
Order
Parent = 2, Tag = 3
OrderDetails
Parent = 3, Tag = 4
FIGURE 9.5
Understanding EXPLICIT mode query hierarchies.
As you can see, EXPLICIT mode queries assign a tag number and a parent value to every XML
element within a document. This allows relationships between elements to be established and
378
XML structures to be created based on these relationships. For example, the OrderDetails ele-
ment in Figure 9.5 has a tag number of 4 because it’s the fourth tag listed in the document.
When doing a query against the database, this element can be nested as a child of the Order
element by identifying that its parent tag number is 3. Had the application required the
OrderDetails tag to be a child element of the Customer element, a parent tag number of 2
could have been identified.
Now that you have a feel for how EXPLICIT mode queries use tag numbers to create hierarchi-
cal XML document structures, Listing 9.12 demonstrates a query that returns the following
XML document:
<MyCustomer CustID=”ALFKI”>
<MyCustomerOrder OrdID =”10643”/>
</MyCustomer >
LISTING 9.12 EXPLICIT Mode Query Example
1: SELECT 1 as Tag,
2: NULL as Parent,
3: Customers.CustomerID as [MyCustomer!1!CustID],
4: NULL as [MyCustomerOrder!2!OrdID]
5: FROM Customers
6: WHERE Customers.CustomerID=’ALFKI’
7:
8: UNION ALL
9:
10: SELECT 2,
11: 1,
12: Customers.CustomerID,
13: Orders.OrderID
14: FROM Customers, Orders
15: WHERE Customers.CustomerID = Orders.CustomerID
16: AND Customers.CustomerID = ‘ALFKI’
17: ORDER BY [MyCustomer!1!CustID], [MyCustomerOrder!2!OrdID]
18: FOR XML EXPLICIT
At first glance, this SQL query may look rather strange. After all, it uses ! characters and
strangely named field aliases. However, when you break the query down into small pieces, the
purpose and reason for the oddities become more clear and understandable.
379
CHAPTER 9
Let’s first examine lines 1 through 6, which are used to build the MyCustomer element. If
you’ve worked with SQL much, you’ll recognize this as a fairly simple SELECT query, except
that it has a few odd characters mixed in it. This first SELECT query serves the purpose of
defining all elements and attributes that will be returned in the XML document. Lines 1 and 2
create two metadata fields named Tag and Parent. These fields are used to track node position
and hierarchical relationships, as shown earlier in Figure 9.5. Because the MyCustomer element
appears first in the XML document, it has a parent value of null. This is because it is the root
of the tree and therefore has no parent. The tag number for the MyCustomer element is given a
value of 1 because it is the first element tag in the XML document. Line 3 then selects the
CustomerID field from the Customers table and assigns it a special alias that contains three
pieces of information, all separated by the ! character. This alias is based on a special format
recognized by SQL Server 2000:
ElementName!TagNumber!AttributeName!Directive
The first part of the alias is the name of the element that the field will be associated with. In
this case, it gets associated with the MyCustomer element. The second piece of information is
the tag number of the element it will be associated with, which is 1. The third piece of infor-
mation is the name of the attribute that will show in the XML document. In this example,
CustID is specified as the attribute name. It’s important to realize that all field aliases are
designed to be attributes by default. To change them to elements, the element directive must
be specified in the Directive section of the alias. We’ll discuss directives a little later in the
chapter.
Line 4 goes through the same process as outlined previously. It selects NULL and gives it a spe-
cial alias. First it assigns the field to the MyCustomerOrders element, which has a tag value of
2 because it is the second element tag in the XML document. Next, it gives the attribute a
name of OrdID. Why is this field being created in the resultset when it is assigned a NULL 9
value? A little earlier it was mentioned that this first SELECT statement contains all the elements
2000, XML, AND

and attributes that will be returned in the XML document. Because the OrderID field’s value
SQL SERVER
ASP.NET
will be returned, a place for it must be reserved initially for a union operation to occur (dis-
cussed later). This is accomplished by defining that there will be an XML attribute named
OrdID but that the attribute doesn’t have a value yet (it is NULL). This will become more clear
as you look at the second SELECT statement in Listing 9.12. After the four fields are defined,
the final part of the SELECT statement is shown in line 6. This SQL code simply ensures that
only the customer associated with the unique CustomerID field will be returned.
Line 8 tells the query to perform a Union operation by using the UNION ALL SQL keywords.
Union operators serve the purpose of combining all fields from two or more SQL SELECT state-
ments into one. For this operation to work correctly, each of the SELECT statements involved in
the union must contain the same number of fields. These fields must also be named the same
380
and must appear in the same order in every SELECT statement. To better understand how a
union operation works, Figure 9.6 provides a graphical representation.
SELECT 1 SELECT 2
ID Name ID Name
123 NULL 234 Dan
UNION
ID Name
123 NULL
234 Dan
FIGURE 9.6.
Graphical representation of a Union operation involving two SELECT statements.
As shown, the Union operation simply combines the two SELECT statements. This is possible
because the fields are named the same and appear in the same order. Getting back to Listing
9.12, the UNION ALL statement combines the first SELECT statement with the one that follows.
The first SELECT statement begins the process of building the MyCustomers element (tag 1).
Because of the aliases assigned to each field in this first statement, you can see that the CustID
attribute is the only one that should be associated with the MyCustomers element. Although the
OrdID attribute is also listed, remember that it isn’t associated with tag 1. Its association is with
tag 2 as shown in its alias.
The second SELECT statement begins the process of building the MyCustomerOrders element
(tag 2). This element contains the OrdID attribute. We know from the first SELECT statement
that OrdID is associated with tag 2. Although the CustomerID field is listed in this second
select statement (line 12), it is not included as an attribute of the MyCustomerOrders element
because its alias points to tag 1 (the MyCustomer element). It is listed to allow the union opera-
tion to combine the two SELECT statements as mentioned earlier. Any value can actually be put
in for the CustomerID value in this second statement as long as the value can be converted to
the proper data type defined for the CustomerID field (nvarchar in this case).
Line 17 finishes off the SQL statement by determining how to order the results, and line 18
uses the magical FOR XML EXPLICIT keywords. These keywords are necessary for XML to be
381
CHAPTER 9
returned from the database in the structure specified by the two SELECT statements. By using
the EXPLICIT keyword, you are telling the database to evaluate the values of the Tag and
Parent metadata fields to organize the resulting XML document’s structure.
Table 9.2 shows all the fields and their respective values after the two SELECT statements have
been executed and combined using the UNION All syntax.
TABLE 9.2 Result of the Union Operation

Tag Parent MyCustomer!1!CustID MyCustomerOrder!2!OrdID
1 NULL ALFKI NULL
2 1 ALFKI 10643
2 1 ALFKI 10692
2 1 ALFKI 10702
2 1 ALFKI 10835
2 1 ALFKI 10952
2 1 ALFKI 11011
Using Directives in EXPLICIT Mode Queries

Now that the code shown in Listing 9.12 has been described in detail, let’s take a look at some
more advanced capabilities of EXPLICIT mode queries. Directives are the last item that can be
specified when defining a field’s alias (ElementName!TagNumber!AttributeName!Directive).
Table 9.3 contains a list of the possible directive statements.
TABLE 9.3 SQL Server 2000 EXPLICT Mode Query Directives 9
2000, XML, AND

Directive Description
SQL SERVER
ASP.NET
ID Used to represent an attribute ID type. This can be used with IDREF and
IDREFS to build relationships. This directive is useful only when FOR XML
EXPLICIT, XMLDATA is specified in the query.
IDREF Used to represent an attribute IDREF type. This can be used with IDREF and
IDREFS Used to represent an attribute IDREFS type. This can be used with IDREF and
hide This attribute is not displayed when the hide directive is specified. This is
useful when you need to order the results by an attribute that you do not
want to appear in the XML document.
382
TABLE 9.3 SQL Server 2000 EXPLICT Mode Query Directives

Directive Description
element Use of this directive results in a contained element as opposed to an attribute.
Any entity characters found within the data (such as < or >) will automati-
cally be encoded.
xml This directive functions the same as an element directive, except that charac-
ters that are normally encoded (such as < or >) will be left intact. This cannot
be used with any other directive except for hide.
xmltext This directive is used to store overflow XML data stored in a field by
OPENXML (discussed later).
cdata This directive wraps data in a CDATA section. It can be used only with the
hide directive and the AttributeName must not be specified.
The directives listed previously can be used to gain more control over what type of content is
returned by SQL Server 2000. Listing 9.13 builds on the previous query by using ID, IDREF,
element, and cdata directives to build a more detailed XML document structure. The entire
SQL statement is embedded within an XML template file.
LISTING 9.13 Using Directives in EXPLICIT Mode Queries in XML Templates

2: <sql:query>
3: SELECT 1 as Tag,
4: NULL as Parent,
5: C.CustomerID as [Customer!1!cid],
6: C.ContactName as [Customer!1!name],
7: NULL as [Order!2!id],
8: NULL as [Order!2!date],
9: NULL as [Order!2!RepID],
10: NULL as [OrderDetails!3!id!id],
11: NULL as [OrderDetails!3!pid!idref],
12: NULL as [OrderDetails!3!total!element]
13: FROM Customers C
14: WHERE C.CustomerID = ‘ALFKI’
15: UNION ALL
16: SELECT 2 as Tag,
17: 1 as Parent,
18: C.CustomerID,
19: NULL,
20: O.OrderID,
21: O.OrderDate,
22: O.EmployeeID,
383
CHAPTER 9

23: NULL,
24: NULL,
25: NULL
27: INNER JOIN Orders O ON O.CustomerID = C.CustomerID
28: AND C.CustomerID = ‘ALFKI’
29: UNION ALL
30: SELECT 3 as Tag,
31: 2 as Parent,
32: C.CustomerID,
33: NULL,
34: O.OrderID,
35: NULL,
36: NULL,
37: OD.OrderID,
38: OD.ProductID,
39: (‘$’ + CAST(((OD.UnitPrice - OD.Discount)
40: * OD.Quantity) as varchar(10)))
42: INNER JOIN Orders O ON O.CustomerID = C.CustomerID
43: INNER JOIN [Order Details] OD ON OD.OrderID = O.OrderID
44: AND C.CustomerID = ‘ALFKI’
45:
46: ORDER BY [Customer!1!cid], [Order!2!id]
47: FOR XML EXPLICIT, XMLDATA
48: </sql:query>
49: </Northwind>
The concepts used to build this query are the same as those discussed in Listing 9.12. 9
However, this query uses directives in lines 10 and 11 to specify that the attributes are ID and
2000, XML, AND

SQL SERVER
IDREF types, respectively. To ensure that these tags are used properly in the XML document,
ASP.NET
the FOR XML EXPLICIT keywords are followed by another keyword named XMLDATA. This tells
SQL Server to place an inline schema in the resulting XML document. Line 12 uses the ele-
ment directive to place the result of (UnitPrice-Discount)*Quantity into an element named
total. This query also adds one more hierarchical level of tags by going out to tag number 3.
Executing the query in Listing 9.13 results in the following XML document (only part of the
document is shown). It’s interesting to note that the inline schema doesn’t fully define the
structure of the XML document. Looking at the schema contents, you can see that the
ElementType named Customer does not include the Order and OrderDetails ElementTypes.
It’s clear that the schema is based on the three distinct SELECT statements included in
EXPLICIT mode query rather than the XML that is generated.
384
<Northwind xmlns:sql=”urn:schemas-microsoft-com:xml-sql”>
<Schema name=”Schema1” xmlns=”urn:schemas-microsoft-com:xml-data”
xmlns:dt=”urn:schemas-microsoft-com:datatypes”>
<ElementType name=”Customer” content=”mixed” model=”open”>
<AttributeType name=”cid” dt:type=”string”/>
<AttributeType name=”name” dt:type=”string”/>
<attribute type=”cid”/>
<attribute type=”name”/>
</ElementType>
<ElementType name=”Order” content=”mixed” model=”open”>
<AttributeType name=”id” dt:type=”i4”/>
<AttributeType name=”date” dt:type=”dateTime”/>
<AttributeType name=”RepID” dt:type=”i4”/>
<attribute type=”date”/>
<attribute type=”RepID”/>
</ElementType>
<ElementType name=”OrderDetails” content=”mixed” model=”open”>
<AttributeType name=”id” dt:type=”id”/>
<AttributeType name=”pid” dt:type=”idref”/>
<attribute type=”pid”/>
<element type=”total”/>
</ElementType>
<ElementType name=”total” content=”textOnly” model=”closed”
dt:type=”string”/>
</Schema>
<Customer xmlns=”x-schema:#Schema1” cid=”ALFKI” name=”Maria Anders”>
<Order id=”10643” date=”1997-08-25T00:00:00” RepID=”6”>
<OrderDetails id=”10643” pid=”28”>
<total>$680.25</total>
</OrderDetails>
</OrderDetails>
</OrderDetails>
</Order>

</Customer>
</Northwind>
You can obtain these XML results by navigating to the following URL:
http://localhost/northwind/templates/listing9.13.xml
385
CHAPTER 9
Using OPENXML to Manipulate XML

SQL Server 2000’s OPENXML is a rowset provider that allows XML documents placed in mem-
ory to be treated as rowset data. This feature can be very useful when you would like to send
an XML document in as a parameter to a SQL statement or stored procedure and then insert
the data found in the document into the appropriate relational tables. The process of taking
information from the XML document and placing it into relational database tables is referred
to as “shredding XML into tables.”
To provide rowset functionality, the XML document must first be parsed using a specific sys-
tem stored procedure named sp_xml_preparedocument. This procedure takes a single input
parameter (the source XML document) and returns a single output parameter representing a
handle to the parsed XML document. After parsing has completed, queries can be executed
against the XML document using normal SQL and XPath statements. This allows data to be
selected from the document and then inserted into a database table. After processing has com-
pleted on the XML document, another system stored procedure named sp_xml_removedocu-
ment must be called to remove the document from memory. This procedure accepts a single
input parameter, which is the handle of the XML document originally generated by the
sp_xml_preparedocument procedure.
Listing 9.14 shows an example of using the OPENXML rowset functionality to select values from
an existing XML document. As outlined previously, the document is loaded into memory and
parsed. On completing the parsing operation, data is extracted and formatted into a rowset.
Finally, the memory allocated to hold the XML document is released.
LISTING 9.14 Using OPENXML to Shred an XML Document

1: DECLARE @docID int
2: DECLARE @XmlDoc varchar(2000)
9
2000, XML, AND

3: SET @XmlDoc =’
SQL SERVER
ASP.NET
4: <ROOT>
5: <Customer CustomerID=”DLWID” ContactName=”Dan Wahlin”>
6: <Order OrderID=”10248” CustomerID=”DLWID”
7: OrderDate=”2000-09-09T00:00:00”>
8: <OrderDetail ProductID=”11” Quantity=”12”/>
10: </Order>
11: </Customer>
12: <Customer CustomerID=”JJDID” ContactName=”John Doe”>
13: <Order OrderID=”10283” CustomerID=”JJDID”
14: OrderDate=”2000-09-04T00:00:00”>
16: </Order>
386

17: </Customer>
18: </ROOT>’
19: EXEC sp_xml_preparedocument @docID OUTPUT, @XmlDoc
20: SELECT *
21: FROM OPENXML (@docID, ‘/ROOT/Customer’,1)
22: WITH (CustomerID varchar(10),
23: ContactName varchar(20),
24: OrderDate DateTime ‘./Order/@OrderDate’,
25: ProductID int ‘./Order/OrderDetail/@ProductID’)
26: EXEC sp_xml_removedocument @docID
This query can be executed by placing it into SQL Query Analyzer or a similar tool. Doing so
returns the following results:
CustomerID ContactName OrderDate ProductID

DLWID Dan Wahlin 2000-09-09 00:00:00.000 11
JJDID John Doe 2000-09-04 00:00:00.000 72
Let’s break Listing 9.14 into pieces to better understand how OPENXML works. After defining
variables in lines 1 and 2, the XML document that will be parsed is loaded into the XmlDoc
variable. Line 19 makes a call to the sp_xml_preparedocument stored procedure, which loads
the XML document into memory and parses it. Next, a SELECT statement is created that uses
OPENXML to query the XML data. The handle to the XML document (@docID) is passed in along
with an XPath statement that identifies the nodes that will be converted to a rowset. Finally, a
flag is passed in to specify that the data for the fields in the rowset will be found in attributes
within the XML document. Table 9.4 shows the possible flag values.
TABLE 9.4 OPENXML Flag Values

Value Description
0 Defaults to attribute-centric mapping.
1 This flag specifies attribute-centric mapping. It can also be used with XML_
ELEMENTS, which will apply attribute-centric mapping first followed by element-
centric mapping for columns that have not been dealt with yet.
2 This flag specifies element-centric mapping. It can also be used with
XML_ELEMENTS, which will apply element-centric mapping first followed by
attribute-centric mapping for columns that have not been dealt with yet.
8 This flag allows XML_ATTRIBUTES or XML_ELEMENTS.
387
CHAPTER 9
Lines 22–25 create the schema declaration that is used to map attributes located within
@XmlDoc to fields that will be found in the rowset. This SQL code dictates that four fields will
be contained within the rowset. This is accomplished by listing the field name, data type, and
location within the XML document (an XPath statement). After these fields are configured
and the rowset is generated, the handle to @XmlDoc (named @docID) is passed to the
sp_xml_removedocument stored procedure to remove the XML document from memory.
OPENXML is most useful when you need to insert data located within an XML document into
relational tables. Listing 9.15 builds on the previous listing by allowing multiple inserts to
occur.
LISTING 9.15 SQL Code for Inserting XML Data into Tables Using OPENXML
2: <status>
3: <sql:header>
4: <sql:param name=”XmlDoc”/>
5: </sql:header>
6: <sql:query>
7: DECLARE @docID int
8: EXEC sp_xml_preparedocument @docID OUTPUT, @XmlDoc
9: INSERT INTO Customers
10: SELECT *
11: FROM OPENXML(@docID, ‘//Customers’)
12: WITH Customers
13: IF @@ERROR > 0
14: SELECT ‘Error’
15: ELSE
16: SELECT ‘Success’
17: EXEC sp_xml_removedocument @docID 9
18: </sql:query>
2000, XML, AND

19: </status>
SQL SERVER
ASP.NET
20: </Northwind>
The SQL code shown in Listing 9.15 performs the simple function of inserting the CustomerID,
ContactName, and Address information found in the XML document into the Customers table
in the Northwind database. This process is kicked off by an ASP.NET page that presents the
user with a form containing three fields. Upon submitting the form, the ASP.NET page
processes the values entered in the form fields and creates an XML document. This document is
then passed as a parameter to the template shown in Listing 9.15. The XML result returned
from the template is loaded into an XmlDocument class and then written out to the page. Listing
9.16 shows the ASP.NET page.
388
LISTING 9.16 ASP.NET Page Used to Send Information to the XML Template
3: void SubmitButton_Click(Object sender, EventArgs e) {
5: string xmlDoc = “<Northwind><Customers CustomerID=\”” +
6: CustomerID.Text;
7: xmlDoc += “\” ContactName=\”” + ContactName.Text;
8: xmlDoc += “\” CompanyName=\”” + CompanyName.Text +
9: “\”/></Northwind>”;
10: string url = “http://localhost/northwind/templates/” +
11: “listing9.15.xml?XmlDoc=” + Server.UrlEncode(xmlDoc);
13: doc.Load(url);
14: Response.Write(doc.DocumentElement.OuterXml);
15: Response.End();
16: }
17: </script>
18: <html>
19: <body>
21: <table cellpadding=”4”>
22: <tr>
23: <td>
24: <b>CustomerID:</b></td>
25: <td>
26: <asp:TextBox id=”CustomerID” Text=”DLWID”
27: runat=”server” />
28: </td>
29: </tr>
30: <tr>
31: <td>
32: <b>Name:</b>
33: </td>
34: <td>
35: <asp:TextBox id=”ContactName” Text=”Dan Wahlin”
36: runat=”server” />
37: </td>
38: </tr>
39: <tr>
40: <td>
41: <b>Company Name:</b>
42: </td>
389
CHAPTER 9

43: <td>
44: <asp:TextBox id=”CompanyName”
45: Text=”Tomorrow’s Learning” runat=”server” />
46: </td>
47: </tr>
48: <tr>
50: <asp:button text=”Submit” id=”SubmitButton”
51: runat=”server” onClick=”SubmitButton_Click” />
52: </td>
53: </tr>
54: </table>
55: </form>
56: </body>
57: </html>
XML Updategrams—Update, Insert, and Delete

Database Records with XML Using ASP.NET
Up to this point in the chapter, you’ve seen several ways to select data from SQL Server 2000.
These range from URL queries to templates to templates with embedded XPath statements.
You have also seen how data within an XML document can be inserted into the database using
OPENXML. Although you can perform updates, inserts, or deletes using XML template files or
URL queries, SQL Server 2000 also allows this type of functionality to be performed by using
Updategrams. At the time this section was written, Updategrams were in their first Web
release. As such, the information that follows may change in future releases.
9
work by presenting a before and after view of data that is marked up using XML
2000, XML, AND

Updategrams
SQL SERVER
syntax. If data in the before view is different from the data in the after view, an update is per-
ASP.NET
formed to the appropriate fields. For example, if the before view of a record shows a value of
Paul Allsing for the ContactName field, but the after view shows a value of Lance Edwards,
the Updategram will know that the ContactName field needs to be updated because the values
have changed. If the before view contains data, but the after view is empty, the appropriate row
will then be deleted. To understand this more fully, let’s take a look at how Updategrams are
structured in Listing 9.17.
LISTING 9.17 Updategram Structure
1: <ROOT xmlns:updg=”urn:schemas-microsoft-com:xml-updategram”>
2: <updg:sync>
390

3: <updg:before>
4: <TABLENAME [updg:id=”value”] col=”value” col=”value”…../>
5: </updg:before>
6: <updg:after>
7: <TABLENAME [updg:id=”value”] [updg:at-identity=”value”]
8: col=”value” col=”value”…../>
9: </updg:after>
10: </updg:sync>
11: </ROOT>
The first thing you should notice is that Updategrams look a lot like the XML templates dis-
cussed earlier in the chapter. They start with a root element tag and are followed by a name-
space declaration (line 1). However, instead of using the sql prefix, Updategrams use the updg
prefix and tie it to a URI of urn:schemas-microsoft-com:xml-updategram. Line 2 introduces
a new element tag named sync, which acts a container for the before and after views of the
data being evaluated. The sync block defines the boundaries for transactions. If any statements
within the sync block fail, the transaction will be rolled back. The before and after data views
are enclosed within the sync block using the before and after element tags (lines 3–9). The
table within the database that will be changed is used as the element name in both the before
and after data views (lines 4 and 7). The table name element is followed by attributes that list
the columns (and their respective values) that will be used to determine whether an insert,
update, or delete operation should be performed.
The table name may also be followed by an id attribute in the case of before views and an id
and/or at-identity attribute in after views. The id attribute can be used to identify elements
within the before and after data view sections in cases where more than one insert, update, or
delete is being performed in the Updategram. By manually specifying a unique id for each ele-
ment, the elements within the before and after data sections will be matched up properly.
The at-identity attribute can be used along with an attribute named returnid to return the
identity of a newly inserted record. Using this attribute is similar to calling @@Identity in a
stored procedure following the insertion of a new row.
Performing an insert, update, or delete to a database table using Updategrams is as simple as
providing the proper information to the before and after data sections. Table 9.5 gives an
overview of how to accomplish each operation using Updategrams.
Let’s take a look at performing an insert, update, and delete to the Customers table of the
Northwind database. Listings 9.18, 9.19, and 9.20 show each operation.
391
CHAPTER 9
TABLE 9.5 Performing Inserts, Updates, or Deletes Using Updategrams

Operation Before and After Section Status
Insert The before view is empty, whereas the after view contains the name of the
table to insert into along with the appropriate columns to be inserted.
Because no data exists initially, SQL Server 2000 knows to perform an
insert.
Update The before view contains the initial data (only the primary key for the row is
absolutely necessary). The after view contains the columns to be updated.
Delete The before view contains the initial data (only the primary key for the row is
absolutely necessary). The after view is left empty. Because the after view is
empty, SQL Server 2000 knows to delete the row.
LISTING 9.18 Performing an Insert Using Updategrams

1: <Northwind xmlns:updg=”urn:schemas-microsoft-com:xml-updategram”>
2: <updg:sync>
3: <updg:before>
4: </updg:before>
5: <updg:after>
6: <Customers CustomerID=”DLWID” CompanyName=”Wahlin Widget’s”/>
7: </updg:after>
8: </updg:sync>
9: </Northwind>
Hitting the URL for the template shown in Listing 9.18 results in an insert being performed
into the Customers table of the Northwind database because data does not appear in the before
section but does appear in the after section. 9
2000, XML, AND

In cases where a table’s primary key is determined by an auto-incrementing field, the identity
SQL SERVER
ASP.NET
of a newly added row can be obtained by adding the returnid attribute to the sync element
and the at-identity attribute to the appropriate table element, as shown next:
<Root xmlns:updg=”urn:schemas-microsoft-com:xml-updategram”>
<updg:sync updg:returnid=”x”>
<updg:before>
</updg:before>
<updg:after>
<MyTable updg:at-identity=”x” Column1=”Some Value”
Column2=”Some Other Value”/>
</updg:after>
</updg:sync>
</Root>
392
The identity of the newly added record will be returned in the following format:
<returnid>
<x>404</x>
</returnid>
LISTING 9.19 Performing an Update Using Updategrams

2: <updg:sync >
3: <updg:before>
4: <Customers CustomerID=”DLWID”/>
5: </updg:before>
6: <updg:after>
7: <Customers CompanyName=”Dan’s Golf Shack”/>
8: </updg:after>
9: </updg:sync>
10: </Northwind>
Referring back to Table 9.5, you can see that an update is performed when there is data within
the before and after sections. Listing 9.19 first uniquely identifies a row in the Customers table
by listing the CustomerID attribute’s value in the before section of the template. The after sec-
tion contains only the name and value of the field in the database that should be updated.
Because the CustomerID value was provided in the before section, it is automatically used in
the after section. Although this example shows only one column being updated, others can be
listed as necessary.
LISTING 9.20 Performing a Delete Using Updategrams

2: <updg:sync >
3: <updg:before>
4: <Customers CustomerID=”DLWID”/>
5: </updg:before>
6: <updg:after>
7: </updg:after>
8: </updg:sync>
9: </Northwind>
Using Updategrams to perform deletes is as simple as providing the table name along with the
value of the table’s primary key in the before data section. By leaving the after data section
blank, the database knows that this row should be deleted.
393
CHAPTER 9
Although the examples shown to this point have shown how easy it is to work with Updategrams
to perform inserts, updates, or deletes, they haven’t been very practical. How many times have
you found it useful to hard-code a value into a SQL statement or stored procedure to perform an
insert, update, or delete? This is because insert, update, and delete operations are normally
designed to accept variable input parameters.
Like XML templates, Updategrams also allow parameters to be passed in. Listing 9.21 shows
how to use parameters in Updategrams to update a record.
LISTING 9.21 Using Parameters in Updategrams

2: <updg:header>
3: <updg:param name=”CustomerID”></updg:param>
4: <updg:param name=”CompanyName”></updg:param>
5: </updg:header>
6: <updg:sync>
7: <updg:before>
8: <Customers CustomerID=”$CustomerID”/>
9: </updg:before>
10: <updg:after>
11: <Customers CompanyName=”$CompanyName”/>
12: </updg:after>
13: </updg:sync>
14: </Northwind>
Lines 2–5 take care of adding two input parameters named CustomerID and CompanyName.
These parameters are then dynamically added to the before and after data sections by append-
ing them with the $ character, as shown in lines 8 and 11. Passing the parameters into the
Updategram is accomplished in the same manner as passing parameters to regular template
9
2000, XML, AND

files:
SQL SERVER
ASP.NET
http://localhost/northwind/templates/listing9.21.xml?CustomerID=DLWID&
CompanyName=Wahlin+Consulting
Several more advanced aspects of Updategrams have not been covered in this section. For
example, multiple updates, inserts, or deletes can be performed in a single Updategram, and
updates involving multiple tables can be performed with the aid of mapping schemas.
Updategrams are a promising technology that can provide additional flexibility to ASP.NET
applications.
Updategrams can be especially useful in applications where end users do not always have con-
nectivity to the network or database (traveling sales representatives, for example). By storing
394
the necessary database changes in the form of Updategram files, when the user can connect to
the network, any database changes can be sent and processed automatically by the SQL Server
2000 database. There are, of course, many other potential uses; the sky’s the limit!
Using ADO.NET with SQL Server 2000

You may work on different ASP.NET projects that require ADO.NET be used to query SQL
Server rather than HTTP. In these situations, how can you leverage the XML features found in
SQL Server 2000? Fortunately, you can still retrieve XML directly from the database by using
the FOR XML keywords in your queries. However, the XML data returned from these types of
queries will not be well formed because the returned XML will not have a single root tag
wrapping all elements.
To work with non–well-formed XML returned from FOR XML queries, ADO.NET’s
SqlCommand class contains a method, named ExecuteXmlReader(), designed for this purpose.
This method returns an XmlReader object that can be used to parse through the XML, as shown
back in Chapter 5, “Using the XmlTextReader and XmlTextWriter Classes in ASP.NET”
Listing 9.22 demonstrates how to use this method.
LISTING 9.22 Using the ExecuteXmlReader() Method with FOR XML Queries
2: <%@ Import Namespace=”System.Data.SqlClient” %>
7: string sql = “SELECT * FROM Customers FOR XML AUTO”;
8: string connStr =”server=localhost;uid=sa;pwd=;database=Northwind”;
9: SqlConnection sqlConn = new SqlConnection(connStr);
10: sqlConn.Open();
11: SqlCommand cmd = new SqlCommand(sql,sqlConn);
12: XmlReader reader = cmd.ExecuteXmlReader();
13: Response.Write(“<?xml version=\”1.0\” encoding=\”utf-8\”?>”);
14: Response.Write(“<root>”);
16: Response.Write(“<” + reader.Name + “ “);
17: if (reader.HasAttributes) {
19: Response.Write(reader.Name + “=\”” +
20: new StringBuilder(reader.Value).Replace(“&”,”&”) +
21: “\” “ );
22: }
23: }
395
CHAPTER 9

24: Response.Write(“/>”);
25: }
26: Response.Write(“</root>”);
27: sqlConn.Close();
29: }
30: </script>
The code in Listing 9.22 creates well-formed XML that is simply written out to the browser.
This same type of process can be made more useful by dynamically loading the XML frag-
ment returned from SQL Server 2000 into an XPathDocument class and then transforming it
using XSLT. Listing 9.23 shows how this is accomplished.
LISTING 9.23 Transforming XML data returned by the ExecuteXmlReader() Method

9: string sql = @”SELECT * FROM Customers
12: FOR XML AUTO”;
13:
14:
string connStr =”server=localhost;uid=sa;pwd=;database=Northwind”;
SqlConnection sqlConn = new SqlConnection(connStr);
9
2000, XML, AND

15: sqlConn.Open();
SQL SERVER
17: XmlTextReader reader = (XmlTextReader)cmd.ExecuteXmlReader();
ASP.NET
18: //XPathDocument will add a root node for us
19: XPathDocument xmlDoc = new XPathDocument(reader);
22: xslDoc.Transform(xmlDoc,null,Response.Output);
25: }
26: </script>
396
Because the SQL query can return many customer records, you would think that these records
need to be wrapped with a root node before the XML could be passed to the XPathDocument’s
constructor. Fortunately, the XPathDocument class will take care of adding a root node automat-
ically, making it easy to transform the XML via XSLT.
The XmlParserContext Class

In cases where the XML data returned from a FOR XML query depends on portions of a DTD,
such as an entity definition, for instance, the XmlParserContext class can be used. One of this
class’s constructors allows internal DTD subsets to be “hooked up” with the XML fragment
returned from the SQL server query. In more understandable terms, a subset is simply part of a
DTD, such as an entity definition (<!ENTITY company “ABC Hardware”>) that must exist for
the XML fragment to be used properly.
The .NET SDK defines one of the XmlParserContext class’s constructors in the following
manner:
public XmlParserContext(
XmlNameTable nt,
XmlNamespaceManager nsMgr,
string docTypeName,
string pubId,
string sysId,
string internalSubset,
string baseURI,
string xmlLang,
XmlSpace xmlSpace
);
When namespace URIs not included in the XML fragment must be identified as the XML frag-
ment is parsed, the XmlNamespaceManager class can be created, filled, and passed in as an
argument. If this (along with other classes such as the XmlNameTable class) is not needed by
the application, NULL can be passed instead (in C#). For our purposes, we’ll focus specifically
on the docTypeName and internalSubset arguments. The docTypeName argument defines the
DOCTYPE name. The internalSubset argument defines one or more parts of a DTD that may
be referenced by the XML fragment.
The code shown in Listing 9.24 shows how the XmlParserContext class can be used to define
an entity named contact that is referenced by the XML fragment returned from the FOR XML
query. This entity definition is created in a string named entityString (line 10) that is added
to the constructor for the XmlParserContext class (lines 31–33). Any entity definitions in the
XML that are defined as &contact; will be resolved to a value of Anders.
397
CHAPTER 9
LISTING 9.24 Using the XmlParserContext Class

9: //Define the entity returned by the data
10: string entityString = “<!ENTITY contact ‘Anders’>”;
11: string sql = @”SELECT * FROM Customers
14: WHERE Customers.CustomerID = ‘ALFKI’
15: FOR XML AUTO”;
16: string connStr=”server=localhost;uid=sa;pwd=;database=Northwind”;
17: SqlConnection sqlConn = new SqlConnection(connStr);
18: sqlConn.Open();
20: XmlReader reader = cmd.ExecuteXmlReader();
21: reader.MoveToContent();
22: //SQL 2000 will automatically escape & with &
23: //so do a replace
24: string xmlString =
25: new StringBuilder(reader.ReadOuterXml()).Replace(“&”,
26: “&”).ToString();
27:
28: //Create the parser context. This specifies that the
29: //DOCTYPE root is Customers and the subset to add in
30: //(entityString in this case)
9
2000, XML, AND

31: XmlParserContext xmlContext =
SQL SERVER
32: new XmlParserContext(null, null, “Customers”, null,
ASP.NET
33: null, entityString,””, “”, XmlSpace.None);
34: XmlValidatingReader xmlReader =
35: new XmlValidatingReader(xmlString,XmlNodeType.Element,
36: xmlContext);
37: xmlReader.ValidationType = ValidationType.None;
38: XmlDocument xmlDoc = new XmlDocument();
39: XmlElement root = xmlDoc.CreateElement(“Northwind”);
40: xmlDoc.AppendChild(root);
41: while (!xmlReader.EOF) {
42: root.AppendChild(xmlDoc.ReadNode(xmlReader));
43: }
44:
398

45: //Do Transformation
48: xslDoc.Transform(xmlDoc,null,Response.Output);
50: if (xmlReader != null) xmlReader.Close();
52: }
53: </script>
Summary
SQL Server 2000 provides a rich set of useful features dedicated to working with data in the
form of XML. Through leveraging these features, layers of ASP.NET code can be eliminated.
Direct access to the database can be accomplished via URL queries, templates, XPath queries,
and through OPENXML and Updategrams. This allows XmlDocument, XPathDocument, or DataSet
objects to be loaded directly without going through the SQL managed provider. When used
with XSLT, this presents an efficient solution that can simplify your ASP.NET application
development.
In the next chapter, you’ll be presented with an in-depth look at Simple Object Access Protocol
(SOAP) and see how Web services can be created to expose your data or services as XML to
interested parties anywhere in the world. You’ll also see how you can utilize external Web ser-
vices to enhance your ASP.NET applications.
Working with ASP.NET, XML, CHAPTER
10
SOAP, and Web Services
IN THIS CHAPTER
Understanding SOAP 400
Understanding Web Services 421
Retrieving Customer Orders Through a Web

Service 444
XML Related Technologies
400
Understanding SOAP
Making calls to distributed objects to obtain data or perform specific functions has not been an
easy task in the past. For example, if your application needs to obtain address or credit card
information found on a remote system, accessing the appropriate object or objects on that sys-
tem has been somewhat problematic. Add the fact that many remote systems rely on a specific
vendor technology or language, and the problem is compounded.
Many alternatives to this problem have been presented over the years, including CORBA
(Common Object Request Broker Architecture), DCOM (Distributed Common Object Model),
and Java RMI (Remote Method Invocation), but none have resulted in simplifying remote
object calls without inventing some type of new technology or relying on vendor-specific tech-
nology. With the introduction of version 1.0 of the Simple Object Access Protocol (SOAP) in
September of 1999, a new mechanism for making Remote Procedure Calls (RPC) emerged that
combines the power of XML with the HTTP protocol. Version 1.1 of SOAP provides more
flexibility in working with other types of wire transport protocols, including FTP and SMTP.
In this chapter we’ll take a look at the importance of the SOAP specification and detail how
SOAP can be used in ASP.NET to provide applications with more power and flexibility. This
discussion includes information about the SOAP specification itself, alternatives to SOAP, and
what role SOAP plays in Web services. By the end of the chapter you’ll have a good under-
standing of how you can consume and expose a variety of services using SOAP along with
other description languages built in to the .NET platform. Before formally introducing SOAP,
let’s take a look at a few alternatives.
Alternatives to SOAP
The SOAP protocol is the latest in many attempts to simplify the process of making calls to
objects found in distributed environments. In this section we’ll take a high-level look at a few
of the alternative protocols to give you a better perspective on why the SOAP protocol is so
promising. Let’s start off by examining a protocol many Microsoft developers are already
familiar with: DCOM.
Distributed Component Object Model (DCOM)

When Microsoft first introduced the DCOM, it provided a tightly coupled component model
system that allowed client programs to make calls to objects located in remote network loca-
tions. This is accomplished by looking up the object’s location through calling a Service
Control Manager (SCM) or a system Registry. The functionality provided by DCOM is needed
by developers to allow for easier maintenance of COM-based applications and incorporation
of a tiered approach to application development. Although DCOM works well in a Windows
environment (a few non-Windows implementations of DCOM also exist), it is generally not
Working with ASP. NET, XML, SOAP, and Web Services
401
CHAPTER 10
considered to be a scalable solution because of how it handles garbage collection and connec-
tion management. DCOM clients instantiate a connection to a remote object by using a proxy.
After a reference to the object has been obtained, clients send ping messages to let the object
know that they are still acting as a consumer. This creates scalability problems as the number
of clients sending ping messages across the network increases.
DCOM also presents challenges dealing with firewalls and state management. With regard to
firewalls, if the firewall administrator has not opened ports used to pass through DCOM mar-
shaling calls, the calls will likely be blocked and therefore fail before reaching the intended
recipient object. DCOM also relies on state management to know when a particular client is no
longer connected (the pinging mechanism). This makes using it in a stateless arena such as the
Web more difficult.
For more information concerning DCOM, visit http://www.microsoft.com/com/tech/
dcom.asp.
Common Object Request Broker Architecture (CORBA)

CORBA offers another way of calling remote objects from a client. As with DCOM, CORBA
is connection based, making it less scalable and more difficult to work with in a stateless Web
environment. In general, CORBA works by sending a LocateRequest message across the wire
(IIOP is used with TCP/IP wire transmissions) to an information repository called an ORB to
determine an object’s location. After this location is determined, direct communication
between the client and remote object can occur. Although many implementations of the ORB
exist on a variety of systems, CORBA is subject to many of the same problems that restrict
DCOM from excelling in a platform-neutral distributed environment.
For more information concerning CORBA, visit http://www.corba.org.
Java Remote Method Invocation (RMI)

From a high-level perspective, Java RMI works in a manner similar to both DCOM and
CORBA in that it connects a client with a remote object and works by using a distributed
object architecture. The location of the object is found by making a call to an RMI Registry.
This registry must be known to clients and have a specific place on the network. When the
remote object location is found, the client can download a stub used to access the object. After
the stub has been downloaded to the client, access to the object’s properties and methods is
then available.
Compared to both DCOM and CORBA, Java RMI arguably represents a more scalable solu-
10
ASP, NET, XML,
tion because of its capability to use HTTP tunneling. However, because it focuses on the Java
WEBSERVICES
SOAP, AND
language, it doesn’t present an optimal solution for exposing services to clients written using
other languages.
For more information concerning Java RMI, visit http://java.sun.com/products/jdk/rmi/.
402
XML-RPC
XML-RPC was one of the first specifications to use XML structures in making remote calls. It
allows calls to ride on top of HTTP, which allows them to pass through firewalls on port 80.
The XML-RPC Specification revolves around the HTTP POST to transmit XML structures to
and from different distributed systems. Many high-level concepts described by XML-RPC have
been adopted into the SOAP protocol. An example of a simple remote method call using XML-
RPC is shown next:
<methodCall>
<methodName>GetName</methodName>
<params>
<param>
<value>
<string>ALFKI</string>
</value>
</param>
</params>
</methodCall>
Looking at this, you can see that it is an extremely verbose way of marking up or “serializing”
a method call. On the other hand, it’s very easy to understand and parse. However, you’ll see in
the next few sections that SOAP extends many of the XML-RPC concepts and minimizes some
of the more verbose XML-based calls. SOAP also refines data typing by leveraging the W3C
Schema Part II specification. Although XML-RPC does present a good starting point for using
XML in distributed environments, it has several areas that can be improved on. Sometimes
more is less. This is especially true when it comes to transmitting data over the wire with
HTTP.
For more information concerning XML-RPC, including information on where to download
XML-RPC–compatible implementations for a variety of platforms, visit http://
www.xmlrpc.com.
What Is SOAP?
Aside from XML-RPC, the SOAP alternatives previously listed (as well as many others that
were not covered) rely on complex distributed object architectures installed on tightly inte-
grated systems. SOAP provides a much simpler architecture. In fact, SOAP is simply a wire
protocol that can be used to transfer information between a client and a server using XML to
maintain structure. It does not rely on complex object architectures to make calls between
distributed objects. Rather, SOAP provides a framework for exchanging messages between
distributed environments without relying on a particular operating system, a programming
403
CHAPTER 10
language, or an object model architecture. The authors of the SOAP version 1.1 protocol pro-
vide the following definition:
SOAP provides a simple and lightweight mechanism for exchanging structured and typed infor-
mation between peers in a decentralized, distributed environment using XML. SOAP does not
itself define any application semantics such as a programming model or implementation spe-
cific semantics; rather it defines a simple mechanism for expressing application semantics by
providing a modular packaging model and encoding mechanisms for encoding data within
modules. This allows SOAP to be used in a large variety of systems ranging from messaging
systems to RPC.
One of the design goals first identified by the SOAP authors was to develop a protocol that
invented no new technology. After all, with the rise of XML and HTTP over the years, an
excellent foundation already existed that could be leveraged by the SOAP authors. As a result,
SOAP relies on XML and a variety of other protocols including HTTP to exchange calls
between objects.
SOAP works by serializing object calls into an XML structure that can then be passed to
another machine using a protocol such as HTTP. “What is serialization?” you ask. You saw a
simple example of this in the XML-RPC section, but let’s take a look at a more abstract sam-
ple. Assume a distributed object exposes the following method that is used to get a customer
name from a database:
GetName(int CustomerID) {}
Serializing this method call into an XML structure could result in the following XML
fragment:
<callAMethod>
<GetName>
<params>
<CustomerID type=”int”>1234</CustomerID>
</params>
</GetName>
</callAMethod >
As shown, serialization simply takes a given object and “serializes” particular aspects of the
object into a structure that can be easily transported to a remote system. After the remote sys-
tem receives it, processing must be done to extract the information, instantiate the correct
object, make the appropriate method call, and return a response to the caller. 10
ASP, NET, XML,
Although the previous serialized XML structure may work for your own objects, how do other
WEBSERVICES
SOAP, AND
objects know how to work with it? And if the remote object is required to return a response,
what structure should the response follow? This is where the SOAP protocol comes into play.
404
It provides a standard way to serialize calls made to remote objects and retrieve results
returned by the object. This allows any application with an understanding of SOAP to be
accessed remotely, no matter what language or operating system it relies on. Imagine the
possibilities!
Before you jump into more specifics on the SOAP protocol, Listing 10.1 provides a quick
look at how the GetName() method shown earlier can be serialized into a valid SOAP request
structure.
LISTING 10.1 Serialization of the GetName() Method into a SOAP Request Structure
1: <soap:Envelope xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance”
3: xmlns:soap=”http://schemas.xmlsoap.org/soap/envelope/”>
4: <soap:Body>
5: <GetName xmlns=”Some-URI”>
7: </GetName>
8: </soap:Body>
9: </soap:Envelope>
Details on the different pieces used to create this SOAP request follow in the next few sections.
After learning these details, you’ll see how SOAP can be used with Web services to enhance
your ASP.NET applications.
NOTE
Although the next few sections provide a detailed look at the different pieces that
make up a SOAP request or response, a more thorough look at SOAP is provided by
the book Understanding SOAP (ISBN 0-672-31922-5) by Kennard Scribner and Mark
Stiver, Sams Publishing.
Analyzing SOAP’s Structure

Now that you understand the basic purpose behind the SOAP protocol and have seen a few
of the alternatives, let’s take a look at the different pieces that are needed to construct a valid
XML-based SOAP request and response. Because SOAP relies on XML to serialize calls,
many of the concepts you learned in prior chapters, including the rules associated with XML
and the necessity of namespaces, will be applicable. You’ll find that your current knowledge of
XML will make learning to work with SOAP rather easy.
405
CHAPTER 10
SOAP messages can contain some or all of the following elements, depending on the function-
ality that a given SOAP structure includes:
• Envelope
• Header
• Body
• SOAP namespaces
• Attributes
• SOAP fault
To fully understand these elements and how they relate to each other in a SOAP message,
Listing 10.2 presents the schema that a SOAP message follows. Each of the items listed in the
schema will be discussed in turn in the next few sections.
NOTE
The schema shown in Listing 10.2 is based on an older schema specification. It is
shown here only to help you understand the structure and contents of SOAP mes-
sages, not to demonstrate how to create XML schemas.
LISTING 10.2 The SOAP Message Schema (http://schemas.xmlsoap.org/soap/

envelope/)
2: 
3: 
6: <schema xmlns=’http://www.w3.org/1999/XMLSchema’
7: xmlns:tns=’http://schemas.xmlsoap.org/soap/envelope/’
8: targetNamespace=’http://schemas.xmlsoap.org/soap/envelope/’>
9:
10: 
11:
12: <element name=”Envelope” type=”tns:Envelope”/>
13: <complexType name=’Envelope’>
14: <element ref=’tns:Header’ minOccurs=’0’/> 10
ASP, NET, XML,
15: <element ref=’tns:Body’ minOccurs=’1’/>

WEBSERVICES
SOAP, AND
16: <any minOccurs=’0’ maxOccurs=’*’/>

17: <anyAttribute/>
406

18: </complexType>
19:
20: <element name=”Header” type=”tns:Header”/>
21: <complexType name=’Header’>
23: <anyAttribute/>
24: </complexType>
25:
26: <element name=”Body” type=”tns:Body”/>
27: <complexType name=’Body’>
29: <anyAttribute/>
30: </complexType>
31:
32: 
35: <attribute name=’mustUnderstand’ default=’0’>
36: <simpleType base=’boolean’>
37: <pattern value=’0|1’/>
38: </simpleType>
39: </attribute>
40:
41: <attribute name=’actor’ type=’uri-reference’/>
42: 
47:
48: <simpleType name=’encodingStyle’ base=’uri-reference’
49: derivedBy=’list’ />
50: <attributeGroup name=’encodingStyle’>
51: <attribute name=’encodingStyle’ type=’tns:encodingStyle’/>
52: </attributeGroup>
53: 
54: <complexType name=’Fault’ final=’extension’>
55: <element name=’faultcode’ type=’qname’/>
56: <element name=’faultstring’ type=’string’/>
57: <element name=’faultactor’ type=’uri-reference’ minOccurs=’0’/>
58: <element name=’detail’ type=’tns:detail’ minOccurs=’0’/>
59: </complexType>
60: <complexType name=’detail’>
407
CHAPTER 10

62: <anyAttribute/>
63: </complexType>
64: </schema>
Let’s first take a look at the wrapper around SOAP messages: the Envelope.
The SOAP Envelope

Because the SOAP protocol adheres to the XML version 1.0 specification, all SOAP XML
structures must have a single root element that acts as a wrapper for all other content. The
Envelope element serves as this single root in SOAP. Although some parts of a SOAP message
are optional, the Envelope element is required and must appear for SOAP calls to function
properly. The Envelope element may contain namespace declarations and attributes, although
these items are optional. The following example shows the different elements that the
Envelope element may wrap:
<soap:Envelope
xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance”
xmlns:xsd=”http://www.w3.org/2001/XMLSchema”
xmlns:soap=”http://schemas.xmlsoap.org/soap/envelope/”>
<soap:Header>

</soap:Header>
<soap:Body>

</soap:body>
</soap:Envelope>
As shown, the Envelope element acts as a container for all content found within a SOAP mes-
sage. Notice that the element contains a SOAP namespace named soap that points to a URI of
http://schemas.xmlsoap.org/soap/envelop/ (the actual location of the SOAP message
schema). It also contains namespace declarations relating to the XML schema specification
and the schema instance.
Although Envelope element attributes are optional, any attribute that does appear in the
Envelope element must be namespace qualified. Before proceeding, let’s take a more in-depth
look at Envelope namespaces and attributes.
10
SOAP Envelope Namespaces and Attributes
ASP, NET, XML,
WEBSERVICES
SOAP, AND
The previous example of the SOAP message structure included the soap namespace. Looking
back at Listing 10.2, you’ll notice that the Envelope element definition includes the
<anyAttribute/> tag (line 17). This allows for custom attributes to be added to the
408
Envelope element as necessary. As mentioned earlier, these attributes must be namespace qual-
ified to be included in the SOAP message.
With regard to namespaces, an interesting aspect of the soap namespace shown in the previous
example is that it does not specify what version of SOAP we are working with. This is accord-
ing to the SOAP version 1.1 specification. The specification states the following:
SOAP does not define a traditional versioning model based on major and minor version num-
bers. A SOAP message MUST have an Envelope element associated with the “http://
schemas.xmlsoap.org/soap/envelope/” namespace. If a message is received by a SOAP
application in which the SOAP Envelope element is associated with a different namespace, the
application MUST treat this as a version error and discard the message. If the message is
received through a request/response protocol such as HTTP, the application MUST respond
with a SOAP VersionMismatch faultcode message (see section 4.4) using the SOAP
“http://schemas.xmlsoap.org/soap/envelope/” namespace.
What does this tell us about the namespace URI? To send properly structured SOAP messages,
the namespace URI must be included on the Envelope element exactly as shown:
http://schemas.xmlsoap.org/soap/envelope/
Keep in mind that as the SOAP specification goes through the W3C working group process,
the namespace may eventually change. But for now, changing the URI will result in the remote
application returning a faultcode message (discussed later). Although the example shown ear-
lier used the soap namespace prefix, you are free to use your own prefix name as long as it
points to the proper URI.
The SOAP Header

Referring back to the SOAP schema shown in Listing 10.2, you can see that the Header
element may optionally appear as the first child of the Envelope element. If it does appear, it
must be the first child in the SOAP message. The specification states that only direct children
of the Header element can be processed by an application. As is the case with the Head element
in HTML, the Header element can be used to specify special information that needs to be
embedded within the SOAP message.
The SOAP version 1.1 specification shows an example of embedding transaction information
within the Header element, as shown next:
<SOAP-ENV:Header>
<t:Transaction
xmlns:t=”some-URI” SOAP-ENV:mustUnderstand=”1”>
5
</t:Transaction>
</SOAP-ENV:Header>
409
CHAPTER 10
This example shows how independent elements such as the Transaction element must be
namespace qualified. Assuming the remote object being called understands the data contained
within the Transaction element, the appropriate action can be taken. What happens if the
object doesn’t understand how to deal with transactions properly? Should it go ahead and
process the request? This is where the mustUnderstand attribute comes into play.
SOAP Header Attributes

As with the Envelope element, the SOAP Header element schema definition contains the
<anyAttribute/> tag in its definition. This allows custom attributes to be used as long as they
are namespace qualified. The SOAP version 1.1 schema also includes two global attribute defi-
nitions named mustUnderstand and actor, as shown next:
<attribute name=’mustUnderstand’ default=’0’>
<simpleType base=’boolean’>
<pattern value=’0|1’/>
</simpleType>
</attribute>
<attribute name=’actor’ type=’uri-reference’/>
The global nature of these attributes allows them to be applied to any complexType definition
within a SOAP message, meaning that it can be used on any element. However, they will nor-
mally be used within the Header element section of a SOAP message.
When the mustUnderstand attribute is used on the Header element, the recipient of the mes-
sage must “understand” how to process the information contained within the Header element.
Because incorrect return values may result from a recipient that does not understand a particu-
lar section contained within the Header element, the mustUnderstand attribute provides a
safety net. The SOAP specification says that any recipient that cannot understand the informa-
tion within the Header element when the mustUnderstand attribute is true (1) must fail to
process the message and return the SOAP-ENV:MustUnderstand fault (discussed later). If this
attribute is not present, it defaults to a value of false (0).
The actor attribute is used in cases where a SOAP message may pass through one or more
intermediaries before reaching the final destination recipient. This attribute can contain a URI
value type that is used to identify the recipient of the Header element. The specification pro-
vides a special URI with a value of http://schemas.xmlsoap.org/soap/actor/next for
cases where a SOAP message header is intended for the first SOAP application that processes
the message. A Header element containing no actor attribute automatically infers that the 10
recipient is the end point or final recipient in the SOAP message path.
ASP, NET, XML,
WEBSERVICES
SOAP, AND
An intermediary serves the dual purpose of receiving and forwarding SOAP messages. As a
SOAP message starts along the path toward the final recipient, some of the message’s header
information may be directed to a particular intermediary involved in the transferal process.
410
After an intermediary receives a message with information directed to them (identified by the
actor attribute’s URI value), the SOAP specification states that they may not pass this infor-
mation along to other destinations in the path. This means that they may not forward the
Header element after they’ve consumed it. The specification does allow for intermediaries to
insert a new header element to pass along, but the “contract” for this newly added element will
be between the intermediary and the intended recipient. The originator of the SOAP message
will not have any knowledge nor ever see the newly added header.
The SOAP Body

Now that you’ve seen how the Envelope element is used to wrap SOAP messages and how the
Header element can be used to mark up special instructions, let’s introduce the focal point of
the SOAP message: the Body element. As with HTML, the Body element contains the bulk of
the information that is passed to a given SOAP recipient. Whereas the Header element is
optional, the SOAP version 1.1 specification states that the Body element must be present for a
SOAP message to be considered valid.
Within the Body element, you can find all the information needed to call a method. This
includes all the necessary parameter values that a particular method may be expecting. It is
also designed to contain response information and, in the case of errors, fault information.
Each of these different Body element types will be discussed in turn. Let’s first examine
making a SOAP call.
The Call Element

Serializing a given object’s information into a SOAP call is a fairly straightforward process.
Immediately following the SOAP Body element, the method to be called must be included
followed by any parameters that the method may be expecting. The following SOAP request
revisits Listing 10.1 to show this process:
<soap:Envelope
<soap:Body>
<GetName xmlns=”Some-URI”>
</GetName>
</soap:Body>
</soap:Envelope>
Looking at the previous SOAP message, you can see that the GetName() method introduced
earlier in the chapter is being called. The method name is listed immediately following the
411
CHAPTER 10
Body element and is namespace qualified. The CustomerID parameter is then listed as a child
element of the method name and contains the data to be passed to the method.
Listing 10.3 contains a slightly more complex method call. The method being serialized is
shown next:
GetOrders(string CustomerID,int SalesRepID,float OrderMax)
LISTING 10.3 Serializing the GetOrders() Method into a SOAP Message

1: <soap:Envelope
2: xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance”
5: <soap:Body>
6: <GetOrders xmlns=”Some-URI”>
8: <SalesRepID>85</SalesRepID>
9: <OrderMax>49.99</OrderMax>
10: </GetOrders >
11: </soap:Body>
12: </soap:Envelope>
The Body element structure for this method call is very similar to the structure shown earlier
aside from the additional parameters.
The Response Element

The capability to initiate calls to distributed objects by serializing them into SOAP messages
would certainly be useless without the capability to access the information returned by the
remote application. Listing 10.4 shows an example of a response generated by the SOAP
application called using the SOAP message shown in Listing 10.3.
LISTING 10.4 Serialized GetOrders() SOAP Response

1: <soap:Envelope
2: xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance”
5: <soap:Body>
6: <GetOrdersResponse xmlns=”Some-URI”> 10
ASP, NET, XML,
7: <return>
WEBSERVICES
SOAP, AND
8: <Orders>
9: <Order orderID=”1453” quantity=”1” />
10: <Order orderID=”1672” quantity=”3” />
412

11: </Orders>
12: </return>
13: </GetOrdersResponse>
14: </soap:Body>
15: </soap:Envelope>
Notice that the method name has been appended with the word “Response” to identify
this SOAP message as a response. Immediately under the method response element
(GetOrdersResponse), the return element is used to encapsulate the SOAP response.
This example shows how data records can be serialized into an XML structure that can then
be sent back to the calling SOAP application. A more simple SOAP response may have only
a single return value, as follows:
<return>1582</return>
The return element must always be the first one listed under the method response element. If
the method also contains out parameters, these elements may then follow the return element
in the SOAP body.
The Fault Element

Virtually any application has the potential to generate errors because of a variety of circum-
stances. This is especially true when calls are being made between distributed systems located
anywhere in the world. A remote system may not be available, may have changed a given
method’s signature, or may experience problems deserializing a SOAP message. Because of
these problems, no RPC mechanism framework would be complete without a way to return
errors that arise during the process.
SOAP has a built-in mechanism to return any errors encountered while SOAP messages are
transferred or manipulated. This is accomplished by embedding the Fault element within the
body section of a SOAP message. Before examining the different information that may be
contained when a fault occurs, let’s take a look at a sample SOAP message with embedded
fault information. Listing 10.5 shows an example of a SOAP response that contains fault
information.
LISTING 10.5 A SOAP Response Message with Embedded Fault Information

<soap:Envelope
<soap:Body>
413
CHAPTER 10

<soap:Fault>
<faultcode>soap:Server</faultcode>
<faultstring>Server Error</faultstring>
<detail xmlns=”some-URI”>
<ErrorDesc>Incorrect number of supplie parameters</ErrorDesc>
<ErrorCode>624</ErrorCode>
</detail>
</soap:Fault>
</soap:Body>
</soap:Envelope>
When an error occurs, the SOAP application is required to place the Fault element as the first
child of the Body element in the SOAP message. The Fault element may have some or all of
the child elements listed in Table 10.1, depending on the nature of the error that occurred.
TABLE 10.1 Fault Element Subelements
Element Name Description

faultcode Provides a mechanism for identifying the fault. The faultcode ele-
ment must be present as a child of the Fault element if a fault occurs
and its value must be qualified. Table 10.2 shows the four fault code
values defined by the SOAP specification.
faultstring The faultstring element provides a human-readable version of the
fault. If a fault occurs, this element must be present as a child of the
Fault element.
faultactor In cases where a fault was caused by an intermediary actor, the
faultactor element provides information about the actor causing
the fault. The value of the element will contain a URI that identifies
the actor. An actor that is not the end-point destination for a SOAP
message must provide this element as a child of the Fault element to
let the calling application know where the fault was caused.
detail The detail element provides application-specific information related
to faults caused within the body section of a SOAP message. Faults
caused by a header section of a SOAP message must not return a
detail element as a child of the Fault element. This allows calling
applications to know whether the body was processed. 10
ASP, NET, XML,
Children of the detail element are referred to in the SOAP version 1.1
WEBSERVICES
SOAP, AND
specification as detail entries. These child elements must be name-

space qualified.
414
TABLE 10.2 Fault Codes Defined in the SOAP Specification

Name Description
VersionMismatch This code is used when an application finds an invalid namespace
used for the SOAP Envelope.
MustUnderstand A child element of the Header element within a SOAP message
was not understood by an application.
Client This error specifies that the message was incorrectly formed or
didn’t contain proper information. An example of when this fault
code could be used is when information supplied by the client is
not enough to authenticate with the remote SOAP application. This
code normally indicates that a SOAP message will need to be
changed to be successful.
Server Well-formed and valid SOAP messages that are not processed cor-
rectly or are not able to be processed by the server application will
result in this fault code. Although the server may be functioning
properly, another “up-stream processor” queried by the server may
not be accessible. Sending the message at a later time may result in
success.
SOAP Encoding and Data Types

Up to this point, you have seen the different elements that make up a SOAP message. Although
these are very useful in providing a structure for a remote call, serialized method calls require
that parameters be properly encoded to be successfully used by a remote SOAP application. In
this section we’ll take a high-level look at the main encoding data types in the SOAP specifica-
tion. For a complete listing of the data types, refer to the SOAP version 1.1 specification or the
book Understanding SOAP (ISBN 0-672-31922-5) by Sams Publishing.
Encoding parameters within the body of a SOAP message involves specifying what data type
a parameter is associated with. What data types can be used, though? Can we arbitrarily pick
a type, and if so, how can we guarantee that a remote system will know how to handle it?
Fortunately, the SOAP specification includes a detailed section concerning this topic that is
based upon the W3C XML Schema Part 2. Let’s first take a look at simple types.
SOAP Simple Types

The SOAP version 1.1 specification includes the primitive and derived simple types defined in
the W3C XML Schema Part 2 document mentioned previously. These include all the data types
shown in Table 10.3.
415
CHAPTER 10
TABLE 10.3 SOAP Simple Data Types

Datatype Numeric
primitive string false
boolean false
float true
double true
decimal true
duration false
dateTime false
time false
date false
gYearMonth false
gYear false
gMonthDay false
gDay false
gMonth false
hexBinary false
base64Binary false
anyURI false
QName false
NOTATION false
derived normalizedString false
token false
language false
IDREFS false
ENTITIES false
NMTOKEN false
NMTOKENS false
Name false
NCName false 10
ASP, NET, XML,
ID false
WEBSERVICES
SOAP, AND
IDREF false
ENTITY false
416

Datatype Numeric
integer true
nonPositiveInteger true
negativeInteger true
long true
int true
short true
byte true
nonNegativeInteger true
unsignedLong true
unsignedInt true
unsignedShort true
unsignedByte true
positiveInteger true
The SOAP specification allows for single and multiple references to a particular type. A single
reference occurs when information appears only once within a SOAP message. All the previ-
ous SOAP message examples up to this point demonstrated single references. A multiple
reference occurs when the same piece of information occurs more than once within a SOAP
message. For example, if a string listed in two parameters contains the same value, each para-
meter can instead refer to a reference as shown next. Doing this conserves space by making the
serialized call less verbose, especially when an information item may appear multiple times
within a SOAP message:
<soap:Envelope
<soap:Body>
<CustomerID href=”#param”/>
<CustomerRef href=”#param”/>
</GetName>
<getNameData id=”param” xsi:type=”xsd:string”>
ALFKI
</getNameData>
</soap:Body>
</soap:Envelope>
417
CHAPTER 10
This example shows how the id and href attributes can be used to allow multiple parameters
to reference the same piece of information.
SOAP Compound Types

Although the encoding types shown in Table 10.3 will be sufficient in many situations, at times
more complex structures may be needed to properly serialize and encode parameter types.
After all, what if a remote SOAP application expects an array or struct to be passed to it? With
simple types alone this cannot be accomplished. The SOAP authors were aware of this and
included compound types in the SOAP specification.
The SOAP specification defines an array as “a compound value in which ordinal position
serves as the only distinction among member values.” In cases where an array needs to be
serialized into a SOAP message, the following XML structure can be used:
<soap:Envelope
xmlns:soap=”http://schemas.xmlsoap.org/soap/envelope/”
soap:encodingStyle=”http://schemas.xmlsoap.org/soap/encoding/”
xmlns:SOAP-ENC=”http://schemas.xmlsoap.org/soap/encoding/”>
<soap:Body>
<GetName xmlns”Some-URI”>
<CustomerArray href=”#arrayParam”/>
</GetName>
<SOAP-ENC:Array id=”arrayParam” SOAP-ENC:arrayType=”xsd:string[3]”>
<SOAP-ENC:string>ALFKI</SOAP-ENC:string>
<SOAP-ENC:string>GPKME</SOAP-ENC:string>
<SOAP-ENC:string>DLWID</SOAP-ENC:string>
<SOAP-ENC:string>PAAIQ</SOAP-ENC:string>
</SOAP-ENC:Array>
</soap:Body>
</soap:Envelope>
Notice that the array’s structure is qualified by the SOAP-ENC namespace (SOAP Encoding)
and has an attribute named arrayType. This attribute’s value specifies the type of array and the
number of members. After this information is specified, the actual members of the array are
listed sequentially. The array structure is referenced by using the href and id attributes as
shown earlier.
Serialized arrays can also be defined as sparse arrays, partially transmitted arrays, and multi- 10
dimensional arrays. Examples of these are shown next:
ASP, NET, XML,
WEBSERVICES
SOAP, AND
Sparse Array:
<SOAP-ENC:Array SOAP-ENC:arrayType=”xsd:string[,][4]”>
<SOAP-ENC:Array SOAP-ENC:position=”[2]”
418
SOAP-ENC:arrayType=”xsd:string[10,10]>
<item SOAP-ENC:position=”[2,2]”>Third row, third col</item>
<item SOAP-ENC:position=”[7,2]”>Eighth row, third col</item>
</SOAP-ENC:Array>
</SOAP-ENC:Array>
Partially Transmitted Array:

<SOAP-ENC:Array SOAP-ENC:arrayType=”xsd:string[5]” SOAP-ENC:offset=”[2]”>
<item>The third element</item>
<item>The fourth element</item>
</SOAP-ENC:Array>
Multidimensional Arrays:
<SOAP-ENC:Array SOAP-ENC:arrayType=”xsd:string[][2]”>
<item href=”#array1”/>
<item href=”#array2”/>
</SOAP-ENC:Array>
<SOAP-ENC:Array id=”array1” SOAP-ENC:arrayType=”xsd:string[2]”>
<item>r1c1</item>
<item>r1c2</item>
<item>r1c3</item>
</SOAP-ENC:Array>
<SOAP-ENC:Array id=”array2” SOAP-ENC:arrayType=”xsd:string[2]”>
<item>r2c1</item>
<item>r2c2</item>
<item>r2c3</item>
</SOAP-ENC:Array>
In addition to arrays, the SOAP specification also allows compound types such as structs to be
serialized. The specification defines a struct as “a compound value in which accessor name is
the only distinction among member values, and no accessor has the same name as any other.”
As an example, the SOAP version 1.1 specification provides the following example of serializ-
ing a struct named Book:
<e:Book>
<author>Henry Ford</author>
<preface>Prefatory text</preface>
<intro>This is a book.</intro>
</e:Book>
A more complex example of using structs is shown next:

<e:Book>
<title>My Life and Work</title>
<firstauthor href=”#Person-1”/>
<secondauthor href=”#Person-2”/>
419
CHAPTER 10
</e:Book>
<e:Person id=”Person-1”>
<name>Henry Ford</name>
<address xsi:type=”m:Electronic-address”>
<email>mailto:[email protected]</email>
<web>http://www.henryford.com</web>
</address>
</e:Person>
<e:Person id=”Person-2”>
<name>Samuel Crowther</name>
<address xsi:type=”n:Street-address”>
<street>Martin Luther King Rd</street>
<city>Raleigh</city>
<state>North Carolina</state>
</address>
</e:Person>
Finally, both arrays and structs can coexist in a SOAP message structure:
<xyz:PurchaseOrder>
<CustomerName>Henry Ford</CustomerName>
<ShipTo>
<Street>5th Ave</Street>
<City>New York</City>
<State>NY</State>
<Zip>10010</Zip>
</ShipTo>
<PurchaseLineItems SOAP-ENC:arrayType=”Order[2]”>
<Order>
<Product>Apple</Product>
<Price>1.56</Price>
</Order>
<Order>
<Product>Peach</Product>
<Price>1.48</Price>
</Order>
</PurchaseLineItems>
</xyz:PurchaseOrder>
SOAP HTTP Headers

The HTTP protocol works in conjunction with TCP/IP through the use of headers that contain 10
ASP, NET, XML,
information and methods specific to a request or response. These headers provide necessary
WEBSERVICES
SOAP, AND
functionality that allows SOAP messages to be transported to and from distributed systems
using HTTP. Within the headers, three methods can be used— GET, POST, and M-POST.
Headers can also contain Content-Length, Content-Type, and name/value pair information spe-
cific to the request or response.
420
The following example shows a simple HTTP POST request that contains the mandatory
SOAPAction field. According to the SOAP version 1.1 specification, this field must be included
within HTTP requests to allow firewalls to appropriately filter SOAP messages sent via HTTP.
The value of this field normally contains a namespace URI followed by the # character and the
name of the method being called within the SOAP body. A firewall can then match the name-
space URI and method name with those specified within the SOAP message:
POST /StockQuote HTTP/1.1
Content-Type: text/xml; charset=”utf-8”
Content-Length: nnnn
SOAPAction: “Some-URI#GetName”

<soap:Envelope
<soap:Body>
</GetName>
</soap:Body>
</soap:Envelope>
An HTTP response looks similar, although the status of the response needs to be returned to
the client. In this case a success code of 200 is being returned, although you’re more than
likely quite familiar with other return codes such as the infamous 404 Not Found error code:
HTTP/1.1 200 OK
Content-Type: text/xml; charset=”utf-8”
Content-Length: nnnn

<soap:Envelope
<soap:Body>
<GetOrdersResponse xmlns=”Some-URI”>
<return>
<Orders>
<Order orderID=”1453” quantity=”1”/>
<Order orderID=”1672” quantity=”3”/>
</Orders>
</return>
421
CHAPTER 10
</GetOrdersResponse>
</soap:Body>
</soap:Envelope>
For more information concerning the HTTP protocol, visit http://www.normos.org/ietf/

rfc/rfc2616.txt.
Understanding Web Services

Now that you’ve seen how SOAP messages are structured and the role that SOAP plays in
integrating distributed applications, let’s take a look at how SOAP can be used with Web ser-
vices. Web services can be described best as services or applications that expose programmatic
functionality through the Internet. This functionality allows virtually any application to either
expose or consume services that can perform a variety of tasks.
For example, a Web service may perform the task of validating an address. An ASP.NET
application could capture a customer’s shipping address through a Web form and send it to
the address validation Web service. In response to receiving the address, the service can vali-
date it and make any necessary adjustments to ensure that it follows local postal and shipping
regulations.
Another Web service may expose local weather data. By passing the service a regional identi-
fier, a response can be generated that provides the current temperature, five-day forecast, and
historical weather for the region. These are just two examples of Web services that can be used
by ASP.NET applications to allow for the dynamic inclusion of data. In reality, there is no limit
to the different types of Web services that can be developed for use today using the .NET
platform.
At this point you may wonder what the difference between a Web service and a regular HTML
page is? After all, you can hit a site such as http://www.weather.com and get weather infor-
mation! What advantages do Web services offer?
Although it’s true that you can get a lot of functionality out of regular HTML pages by using
forms to post data to the Web server, Web sites are geared to work with display devices such
as the browser. To manipulate the data contained within an HTML page and use it within your
own site, you would need to use string manipulation routines to parse through the entire
HTML page. Not only is this cumbersome (and similar to trying to find a needle in a hay-
stack), it is certainly more complex than directly interacting with an object to extract data in a 10
format that is easy to work with.
ASP, NET, XML,
WEBSERVICES
SOAP, AND
Web services allow you to work with objects and receive data in a nicely formatted XML
structure. This means that you don’t have to employ costly string-parsing routines or resort
422
to other screen-scraping methods. You simply invoke the remote object by sending a serialized
request, get a response, deserialize the response message, and utilize that response data as
appropriate.
NOTE
To see just how much work can be involved when screen scraping an HTML file to
gather the required data, take a look the following articles: http://
www.asptoday.com/articles/19991229.htm http://www.asptoday.com/
articles/19991231.htm.
Web Service Protocols (SOAP, HTTP-GET, HTTP-POST)

Before analyzing how to create or consume a Web service, it’s important to know what proto-
cols can be used for communication. Because the bulk of the chapter up to this point has
focused on SOAP, you can bet that SOAP can be used in communicating with Web services.
Aside from SOAP, Web services can also be exposed and consumed via HTTP-POST and HTTP-
GET commands. Offering these different protocols allows for a given Web service to maximize
its flexibility in working with a variety of applications from large-scale servers to smaller PDA
devices. You’ve already had the opportunity to see how SOAP can be used to work with dis-
tributed applications, but let’s take a quick look at the other two options.
HTTP-GET
You have more than likely been exposed to working with the GET verb in creating classic ASP
applications that utilize forms containing the method=”get” attribute. Forms that use this
method will pass all form values as name/value pairs on the QueryString. For example, a Web
form containing first name and last name as inputs could yield the following QueryString
when submitted:
http://www.someServer.com/processForm.aspx?firstname=Dan&lastname=Wahlin
Using C# and ASP.NET, you can extract name/value pairs found in the QueryString by doing
the following:
Response.Write(“First Name: “ + Request.QueryString[“firstname”]);
Response.Write(“<br />”);
Response.Write(“Last Name: “ + Request.QueryString[“lastname”]);
}
</script>
423
CHAPTER 10
When used with Web services, a QueryString containing the necessary name/value pairs can be
passed to the service. In response, the XML structure will contain return values wrapped
within XML elements that specify the value’s data type. For example, the following structure
could represent a response to an HTTP-GET request when first and last names are passed to a
Web service that returns the number of orders placed:
<int xmlns=”http://tempuri.org/”>4</int>
Although HTTP-GET requests are limited to name/value pairs, HTTP-GET does support returning
any COM+ type as a return value. This means that types such as arrays, structs, DataSets, and
arrays of structs can be returned. The following call to a Web service shows how an array of
names can be sent using name/value pairs:
http://www.someSite.com/arrays/returnArray?name=Dan&name=Todd&name=Elaine&
name=Danny&name=Michelle
The returned structure would look as follows:

<ArrayofString>
<string>Dan</string>
<string>Todd</string>
<string>Elaine</string>
<string>Danny</string>
<string>Michelle</string>
</ArrayofString>
HTTP-POST
The HTTP-POST verb works in a manner similar to the HTTP-GET, except that the name/value
pairs are not passed using the QueryString. Instead, this information is UUEncoded and passed
within the body of the message sent to the Web service. Like HTTP-GET requests, the response
contains return values wrapped within XML elements that specify the value’s data type.
It’s important to note that both HTTP-GET and HTTP-POST Web service requests and responses
are restricted to using name/value pairs. This is very limiting when more robust data types
need to be sent. When these types are needed, SOAP must be used because it provides built-in
support for sending complex types, as shown earlier in the chapter.
Web Service Architecture 10

ASP, NET, XML,
Web services provide a fairly simple architecture that can be consumed to add additional func-
WEBSERVICES
SOAP, AND
tionality and power to ASP.NET applications. Using the .NET framework, Web services can
also be written that non-Windows clients can consume using a variety of languages. Most of
the more difficult parts associated with using Web services, including the serialization and
424
deserialization of messages, occurs behind the scenes through using utilities provided by the
.NET platform. These utilities (discussed later) automatically create the appropriate support
files to expose or consume a Web service.
So what pieces are needed for Web services to become functional? On the client side, a proxy
can be created that allows the client application to serialize and send messages to the Web
service. This proxy must also be able to deserialize responses sent back by the Web service.
Before this proxy can be created, however, the client must know where to find the Web service
and know what interface(s) the Web service exposes, including what parameters must be
passed to it.
After these resources are known, the client proxy can be created and a request message can be
sent to the Web service, which can then send back the appropriate response. The relationship
between the client and the Web service is depicted in Figure 10.1. Notice that because mes-
sages are transmitted using HTTP and SOAP, firewalls become much less of an issue when
compared to other technologies such as DCOM and CORBA.
WSDL
Response
Client Web Service
Request
Proxy
UDDI
Firewall Firewall
FIGURE 10.1
Web service architecture.
In the next section, you’ll learn how to build the Web service portion shown in Figure 10.1.
Introducing Web Service Attributes

When the architects and programmers at Microsoft developed the .NET platform, they realized
that languages such as C# and VB.NET needed the capability to be extended to handle a vari-
ety of programmatic scenarios. Through the introduction of attributes, the .NET platform has
been infused with the capability to handle many different tasks, including that of creating Web
services. The .NET SDK documentation defines attributes in the following way:
[.NET] provides a mechanism for defining declarative tags, called attributes, which you
can place on certain entities in your source code to specify additional information. The
425
CHAPTER 10
information that attributes contain can be retrieved at run time through reflection. You can use
predefined attributes or you can define your own custom attributes.
Using attributes and processing directives along with the System.Web.Services namespace,
Web services can be constructed quite easily and existing ASP.NET code can quickly be con-
verted into a Web service that can be consumed by others. The main class used for Web
service specific attributes is named WebMethodAttribute. It contains the properties listed in
Table 10.4.
TABLE 10.4 WebMethodAttribute Class Properties
BufferResponse Determines whether the response from the Web service request is
buffered. Setting this property to true (the default) serializes the
Web service response into a buffer. After the entire response is
serialized, it is returned to the client. This property should be set to
false only when large messages must be serialized.
CacheDuration Determines the number of seconds the Web service response
should be held in the cache. The default value is 0, which results in
no caching at all. In cases where a Web service’s return value may
not need to be completely dynamic, using this property can result
in increased performance and efficiency. For large responses, turn-
ing on caching can reduce available memory.
Description This is useful for describing the Web service and methods it
exposes on the service description page.
EnableSession Determines if session state is enabled for a Web service method.
The default is false.
MessageName This property provides the name used for the Web service method
in the data passed to and returned from a Web service method. It
can be useful for aliasing methods and properties and when over-
loaded methods exist in a Web service class.
TransactionOption Used when a Web service method needs transaction support. This
property is disabled by default. Values can be members of the
TransactionOption enumeration:
• Disabled
• NotSupported 10
ASP, NET, XML,
• Supported
WEBSERVICES
SOAP, AND
• Required
• RequiresNew
426
Another class, named WebServiceAttribute, can also be used to provide a description of the
Web service and to provide a namespace in which the Web service operates. By walking you
through several examples, the next few sections show you how to use these attribute classes
and their properties.
Using Attributes and Directives in Web Services

The WebMethodAttribute class can be applied to methods in a Web service by using the
WebMethod keyword. For C#, the attribute needs to be wrapped in square brackets and placed
directly above the method you are exposing in the Web service:
[WebMethod]
public int myMethod(){}
VB.NET wraps the WebMethod attribute with < and > brackets and places it before the method
definition:
<WebMethod> Public Function MyMethod() as Integer
End Function
By applying this attribute, the method is made publicly accessible through HTTP-GET, HTTP-
POST, or SOAP requests. Aside from attributes, code destined to be used as a Web service must
also use a special processing directive named WebService. The use of this directive as well as
the WebMethod attribute is shown in Listing 10.6. This listing simply returns a string value
based on the input that is provided.
LISTING 10.6 The Greeting Web Service (Listing10.6.asmx)

1: <%@ WebService Language=”C#” class=”Greeting” %>
2: using System.Web.Services;
3:
4: [WebService(Namespace=”http://www.TomorrowsLearning.com/Greeting”)]
5: class Greeting: WebService {
6:
7: [WebMethod]
8: public string Greet(string name) {
9: return “Hi “ + name + “!”;
10: }
11: }
The WebService directive must be the first line of your Web service. It lets the compiler know
what type of component it is so that it can be compiled properly. Although the code references
a class within the asmx file, it could reference another class stored in the \bin directory, if
desired, by listing the namespace that the external class resides in:
<%@ WebService Language=”C#” Class=”Greeting.Greet” %>
427
CHAPTER 10
In the majority of applications, referencing external classes is recommended because it encour-

ages code reuse. For the samples that follow, the code will be shown within the asmx file for
simplicity.
Line 4 shows the WebService attribute. This attribute allows a namespace to be declared in
which the Web service is a member. The Greeting class derives from the WebService class
found in the System.Web.Services namespace (line 5). Doing this gives the class access to
intrinsic ASP.NET objects such as Application, Session, and Context. In the next few sec-
tions, you’ll see how these ASP.NET objects can be used in a Web service. The WebMethod
attribute (line 7) is applied to the Greet() method to make the method available to clients that
call the Web service.
By referencing the System.Web.Services namespace as well as adding the necessary directive
and attributes as shown in Listing 10.6, all serialization/deserialization of SOAP messages will
be taken care of automatically.
Adding Descriptions to Web Service Classes and Methods

Although optional, the WebService attribute has a property that can be used to provide a
description for a Web service. A description can also be associated with the WebMethod
attribute. The following code segment shows how to incorporate descriptions:
<%@ WebService Language=”C#” class=”Greeting” %>
using System.Web.Services;
[WebService(Namespace=”http://www.TomorrowsLearning.com/Greeting”,
Description=”This Web Service responds by saying ‘Hi’”)]
class Greeting: WebService {
[WebMethod(Description=”This method says ‘Hi’ and appends the input “ +
“param to the response”)]
public string Greet(string name) {
return “Hi “ + name + “!”;
}
}
Each description acts as a property of the appropriate attribute.
Handling Overloaded Methods

In cases where a class has overloaded methods that need to be exposed to Web service clients,
the potential name conflict that may arise when a client hits the Web service can be resolved 10
by adding a MessageName property to the WebMethod attribute, as shown in Listing 10.7:
ASP, NET, XML,
WEBSERVICES
SOAP, AND
428
LISTING 10.7 Providing Method Aliases to Overloaded Methods

1: <%@ WebService Language=”C#” class=”Greeting” %>
3:
5: class Greeting: WebService {
6: [WebMethod]
8: return “Hi “ + name + “!”;
9: }
10: [WebMethod(MessageName=”GreetVerbose”)]
11: public string Greet(string fname,string lname) {
12: return “Hi “ + fname + “ “ + lname + “!”;
13: }
14: }
Failing to uniquely identify the second Greet() method by using the MessageName property
would result in the following error:
Both System.String Greet(System.String, System.String) and System.String
Greet(System.String) use the message name ‘Greet.’ Use the MessageName property
of the WebMethod custom attribute to specify unique message names for the
methods.
Transaction Support in Web Services

The WebMethod attribute also has a property named TransactionOption that can be used in sit-
uations where Web services must execute a method within the context of transaction. Adding
this property is similar to adding the Description property shown earlier because you simply
append it to the WebMethod attribute and then set it equal to a custom enumeration value
(TransactionOption.RequiresNew, for example). Doing this causes the appropriate Web
service method to execute within the context of a transaction, as shown in Listing 10.8.
LISTING 10.8 Transaction Support in Web Services

1: <%@ WebService Language=”C#” class=”DeleteData” %>
2: <%@ Assembly name=”System.EnterpriseServices” %>
3:
5: using System.EnterpriseServices;
6: using System.Data;
7: using System.Data.SqlClient;
8:
429
CHAPTER 10

10: class DeleteData: WebService {
11: [WebMethod(TransactionOption=TransactionOption.RequiresNew)]
12: public int Delete(string sql) {
13: string connStr =”server=localhost;uid=sa;pwd=;database=Northwind”;
14: SqlConnection dataConn = new SqlConnection(connStr);
15: SqlCommand cmd = new SqlCommand(sql,dataConn);
16: try {
17: cmd.ExecuteNonQuery();
18: return 0;
19: }
20: catch {
21: ContextUtil.SetAbort();
22: return 1;
23: }
24: finally {
25: if (dataConn != null) dataConn.Close();
26: }
27: }
28: }
Notice that to use the TransactionOption property, you must reference the System.
EnterpriseServices assembly and namespace.
State Management in Web Services

State is not maintained by default when calls are made to Web services. As a result, the state of
variables or objects that are instantiated by the Web service are not maintained across multiple
Web service calls. Each object is instantiated on request and destroyed after the response has
been returned. In cases where state must be maintained, the Application and Session objects
can be used.
To use these objects, a Web service class must subclass the System.Web.Service.WebService
class. This was shown in the previous code examples. Although there may be many reasons
that require a Web service to have built-in state management, the .NET SDK documentation
provides an example similar to that shown in Listing 10.9 that serves the purpose of updating a
simple counter.
LISTING 10.9 Accessing Application State from a Web Service 10

ASP, NET, XML,
1: <%@ WebService Language=”C#” class=”Counter”%>

WEBSERVICES
SOAP, AND
2: using System;
4:
430

5: [WebService(Namespace=”http://www.TomorrowsLearning.com/Counter”)]
6: public class Counter : WebService {
7: [WebMethod]
8: public int UpdateAppCounter() {
9: if (Application[“HitCounter”] == null) {
10: Application[“HitCounter”] = 0;
11: }
12: else {
13: Application.Lock();
14: Application[“HitCounter”] = ((int) Application[“HitCounter”])
15: + 1;
16: Application.UnLock();
17: }
18: return Convert.ToInt32(Application[“HitCounter”]);
19: }
20: }
Because the Counter class has been subclassed, it now has access to Application state.
In cases where access to session state is needed with a Web service, the code shown in Listing
10.10 can be used:
LISTING 10.10 Accessing Session State from a Web Service

1: <%@ WebService Language=”C#” class=”Counter”%>
2: using System;
4:
5: [WebService(Namespace=”http://www.TomorrowsLearning.com/Counter”)]
6: public class Counter : WebService {
7:
8: [WebMethod(EnableSession=true)]
9: public int UpdateSessionCounter() {
10: if (Session[“HitCounter”] == null) {
11: Session[“HitCounter”] = 0;
12: }
13: else {
14: Session[“HitCounter”] = ((int) Session[“HitCounter”]) + 1;
15: }
16: return Convert.ToInt32(Session[“HitCounter”]);
17: }
18: }
431
CHAPTER 10
Line 8 shows a new attribute named EnableSession that is set to true. Because Session state
will oftentimes not be needed in Web services, its default value is false. It’s important to
keep in mind that state management is not a part of any Web service specification. Because
your Web services can be hit by a variety of clients on a multitude of platforms, it is recom-
mended that you use state-management techniques (such as the Session object) sparingly and
keep your Web services stateless when possible.
The .asmx Extension

Web services running within the .NET framework are associated with a special file extension
value of asmx rather than the standard aspx extension used for ASP.NET pages. Files with the
asmx extension reference the System.Web.Services namespace and contain special keywords
that allow classes within the file to be exposed as a Web service.
Making the Web services shown in the previous listings available to the rest of the world is
accomplished by naming each file with the .asmx extension. This extension is mapped to the
WebServiceHandlerFactory class in the machine.config file, which takes care of serializing
and deserializing messages. The section of the machine.config file that contains the appropri-
ate mapping is shown next:
<httpHandlers>

<add verb=”*” path=”*.asmx”
type=”System.Web.Services.Protocols.WebServiceHandlerFactory,
System.Web.Services” validate=”false”/>
</httpHandlers>
When a request comes to the server containing a path of *.asmx, the

WebServiceHandlerFactory class is instantiated to process the request. When
Listing 10.6 is saved with an .asmx extension and run, the following screen appears:
10
ASP, NET, XML,
WEBSERVICES
SOAP, AND
FIGURE 10.2
The Web service test screen.
432
Without any coding on your part, the asmx file is dynamically compiled and a test page is gen-
erated. To try out the Greeting Web Service, click the Greet link shown in Figure 10.2, and on
the next screen enter a name into the input box and hit the Invoke button. Doing this should
display a result similar to that shown in Figure 10.3.
FIGURE 10.3
Invoking a Web service.
TIP
A template file named DefaultWsdlHelpGenerator.aspx is used to generate the Web
service test screen shown in Figure 10.2. By editing this file, you can easily customize
the test screen to fit into the look and feel of your Web site.
Web Service Description Language (WSDL)

Exposing a Web service that client applications throughout the world can use is certainly useful
and full of potential. However, a method for querying what programmatic interfaces the Web
service exposes and what functions it is capable of performing is essential in order to be con-
sumed by a client. After all, sending messages to a remote object you know nothing about
would prove to be rather futile. What’s a Web service to do?
433
CHAPTER 10
This problem is solved through using an XML grammar titled Web Service Description
Language, or WSDL. WSDL provides consumers of the Web service with detailed information
about binding to the service. It also details what methods are accessible and the parameters and
data types expected by those methods. The data type(s) of the response issued from each Web
service method are also described.
So how is all this information structured? Using XML syntax, of course. Looking back to the
test Web service page shown in Figure 10.2, you’ll notice that a link is provided to the WSDL
document that describes the Greeting Web service. Clicking this link results in the document
shown in Listing 10.11.
NOTE
Direct access to a given Web service’s WSDL document can be obtained by adding
?WSDL after the name of the asmx file. For example, the following URL will pull up
the WSDL document for the Web service shown in Listing 10.6:
http://localhost/testbed/Chapter10/listing10.6.asmx?WSDL
LISTING 10.11 WSDL Document Describing the Greeting Web Service

1: <?xml version=”1.0” encoding=”utf-8”?>
2: <definitions xmlns:s=”http://www.w3.org/2001/XMLSchema”
3: xmlns:http=”http://schemas.xmlsoap.org/wsdl/http/”
4: xmlns:mime=”http://schemas.xmlsoap.org/wsdl/mime/”
5: xmlns:urt=”http://microsoft.com/urt/wsdl/text/”
6: xmlns:soap=”http://schemas.xmlsoap.org/wsdl/soap/”
7: xmlns:soapenc=”http://schemas.xmlsoap.org/soap/encoding/”
8: xmlns:s0=”http://www.TomorrowsLearning.com/Greeting”
9: targetNamespace=”http://www.TomorrowsLearning.com/Greeting”
10: xmlns=”http://schemas.xmlsoap.org/wsdl/”>
11: <types>
12: <s:schema attributeFormDefault=”qualified”
14: targetNamespace=”http://www.TomorrowsLearning.com/Greeting”>
15: <s:element name=”Greet”>
16: <s:complexType>
17: <s:sequence>
18: <s:element minOccurs=”1” maxOccurs=”1” name=”name”
10
ASP, NET, XML,
19: nillable=”true” type=”s:string” />

WEBSERVICES
SOAP, AND
20: </s:sequence>
21: </s:complexType>
434

22: </s:element>
23: <s:element name=”GreetResponse”>
24: <s:complexType>
25: <s:sequence>
26: <s:element minOccurs=”1” maxOccurs=”1” name=”GreetResult”
27: nillable=”true” type=”s:string” />
28: </s:sequence>
29: </s:complexType>
30: </s:element>
31: <s:element name=”string” nillable=”true” type=”s:string” />
32: </s:schema>
33: </types>
34: <message name=”GreetSoapIn”>
35: <part name=”parameters” element=”s0:Greet” />
36: </message>
37: <message name=”GreetSoapOut”>
38: <part name=”parameters” element=”s0:GreetResponse” />
39: </message>
40: <message name=”GreetHttpGetIn”>
41: <part name=”name” type=”s:string” />
42: </message>
43: <message name=”GreetHttpGetOut”>
44: <part name=”Body” element=”s0:string” />
45: </message>
46: <message name=”GreetHttpPostIn”>
47: <part name=”name” type=”s:string” />
48: </message>
49: <message name=”GreetHttpPostOut”>
50: <part name=”Body” element=”s0:string” />
51: </message>
52: <portType name=”GreetingSoap”>
53: <operation name=”Greet”>
54: <input message=”s0:GreetSoapIn” />
55: <output message=”s0:GreetSoapOut” />
56: </operation>
57: </portType>
58: <portType name=”GreetingHttpGet”>
60: <input message=”s0:GreetHttpGetIn” />
61: <output message=”s0:GreetHttpGetOut” />
62: </operation>
63: </portType>
64: <portType name=”GreetingHttpPost”>
435
CHAPTER 10

66: <input message=”s0:GreetHttpPostIn” />
67: <output message=”s0:GreetHttpPostOut” />
68: </operation>
69: </portType>
70: <binding name=”GreetingSoap” type=”s0:GreetingSoap”>
71: <soap:binding transport=”http://schemas.xmlsoap.org/soap/http”
72: style=”document” />
74: <soap:operation
75: soapAction=”http://www.TomorrowsLearning.com/Greeting/Greet”
77: <input>
78: <soap:body use=”literal” />
79: </input>
80: <output>
82: </output>
83: </operation>
84: </binding>
85: <binding name=”GreetingHttpGet” type=”s0:GreetingHttpGet”>
86: <http:binding verb=”GET” />
88: <http:operation location=”/Greet” />
89: <input>
90: <http:urlEncoded />
91: </input>
92: <output>
93: <mime:mimeXml part=”Body” />
94: </output>
95: </operation>
96: </binding>
97: <binding name=”GreetingHttpPost” type=”s0:GreetingHttpPost”>
98: <http:binding verb=”POST” />
100: <http:operation location=”/Greet” />
101: <input>
102: <mime:content type=”application/x-www-form-urlencoded” />
103: </input>
104: <output>
105: <mime:mimeXml part=”Body” />
10
ASP, NET, XML,
106: </output>
WEBSERVICES
SOAP, AND
107: </operation>
108: </binding>
109: <service name=”Greeting”>
436

110: <port name=”GreetingSoap” binding=”s0:GreetingSoap”>
111: <soap:address
112: location=”http://localhost/TestBed/Chapter10/listing10.6.asmx” />
113: </port>
114: <port name=”GreetingHttpGet” binding=”s0:GreetingHttpGet”>
115: <http:address
117: </port>
118: <port name=”GreetingHttpPost” binding=”s0:GreetingHttpPost”>
119: <http:address
121: </port>
122: </service>
123: </definitions>
This document describes three ways for interacting with the Web service:
• SOAP
• HTTP-GET
• HTTP-POST
Although each of the three sections described in the WSDL document exhibit different ways of
describing the service, they all serve the purpose of providing the structure for the Web service
request and response.
The <type> Element
The type element serves as a container for an XSD schema that is used to define parameter
names and data types used in the Web service. The schema it wraps is referenced by other ele-
ments within the WSDL document, as shown in the next few sections.
The <message> Element
The message element is used to describe the content of the messages sent to and from the Web
service. For the SOAP request, lines 34–39 are used to describe the input and response para-
meters using the message and part elements:
34: <message name=”GreetSoapIn”>
35: <part name=”parameters” element=”s0:Greet” />
36: </message>
37: <message name=”GreetSoapOut”>
38: <part name=”parameters” element=”s0:GreetResponse” />
39: </message>
437
CHAPTER 10
Looking at this section of code, you’ll see that each part element references an element
described in the XSD schema contained within the type element (lines 11–33). This allows
parameter names and data types to be defined using the XML schema standard.
The <portType> Element
The portType element defines the different operations that are supported by the Web service
along with the message(s) involved with each operation. Looking at lines 52–57 (shown
below), you will see that the message elements are referenced by a adding a message attribute
to elements named input and output. These elements are wrapped by an operation element
that defines the name of the method being called (Greet, in this example). All this information
is wrapped by a portType element that has a name attribute:
52: <portType name=”GreetingSoap”>
54: <input message=”s0:GreetSoapIn” />
55: <output message=”s0:GreetSoapOut” />
56: </operation>
57: </portType>
The <binding> Element

The binding element serves the purpose of specifying the protocol to be used with operations
defined in the portType element and how SOAP messages should look on the wire:
70: <binding name=”GreetingSoap” type=”s0:GreetingSoap”>
71: <soap:binding transport=”http://schemas.xmlsoap.org/soap/http”
74: <soap:operation
75: soapAction=”http://www.TomorrowsLearning.com/Greeting/Greet”
77: <input>
79: </input>
80: <output>
82: </output>
83: </operation>
84: </binding>
In line 70, the binding element’s type attribute references the GreetingSoap portType 10
ASP, NET, XML,
defined earlier. Lines 71 and 72 define the type of transport to be used (HTTP) and the “style”
WEBSERVICES
SOAP, AND
of the SOAP message. For this example a document style is being defined which lends itself
well to XML documents being embedded in SOAP messages. The alternative is rpc, which
would actually work fine with this simple Web service.
438
The soap:body element appears within the input and output elements and serves the purpose
of describing the content within the body of the SOAP message. This element can have use,
encodingStyle, parts, and namespace attributes. The use attribute determines whether the
content of the body is encoded. Optionally, a soap:header element could be included as a
child of the input and output elements as well.
The <service> Element
The service element references the different portType elements defined in the WSDL document
and provides the address for accessing each one. When Web services expose multiple methods,
several different portType and service elements may exist.
109: <service name=”Greeting”>
110: <port name=”GreetingSoap” binding=”s0:GreetingSoap”>
111: <soap:address
113: </port>
114: <port name=”GreetingHttpGet” binding=”s0:GreetingHttpGet”>
115: <http:address
117: </port>
118: <port name=”GreetingHttpPost” binding=”s0:GreetingHttpPost”>
119: <http:address
121: </port>
122: </service>
Now that you see how a client can learn about what capabilities a Web service has, it’s time to
see how to consume the services offered by the Web service.
NOTE
You can find the WSDL specification at http://msdn.microsoft.com/xml/general/
wsdl.asp.
Consuming a Web Service from an ASP.NET File

Connecting to a Web service from a client application involves knowing about the services that
the Web service is capable of performing. As shown in the previous section, this is accom-
plished through the creation of the WSDL document. After you know what the Web service
offers, how can you invoke the service and use it on the client end? As with many remote-
messaging architectures, a proxy can be created that emulates the remote object exposed by
439
CHAPTER 10
the Web service to marshal data back and forth between the client and the Web service. The
client application actually knows nothing about the remote Web service. Instead, it treats the
proxy as if it were the real object performing the task. In turn, the proxy contacts the Web ser-
vice and then passes the response back to the client. The process of creating a client proxy is
shown next.
Creating a Client Proxy to Consume a Web Service

Creating a proxy that acts as the middleman between a client and a Web service is accom-
plished by using a command-line utility named WSDL.exe, which ships with the .NET platform.
Although you could create a proxy by hand (if you were really bored!), the WSDL.exe utility
will import a WSDL file and create the necessary proxy for you. To do this, you pass the path to
the WSDL file and specify a few switch options:
wsdl [options] {URL | path}
Table 10.5 shows the switch options documented in the .NET SDK documentation.
TABLE 10.5 WSDL.exe Switch Options

Option Description
/appsettingurlkey:key Specifies the configuration key to use to read the default
or value for the URL property when generating code.
/urlkey:key
/appsettingbaseurl:baseurl Specifies the base URL to use when calculating the URL
or fragment. The tool calculates the URL fragment by
/baseurl:baseurl converting the relative URL from the baseurl argument to
the URL in the WSDL document. You must specify the
/appsettingurlkey option with this option.
/d[omain]:domain Specifies the domain name to use when connecting to a
server that requires authentication.
/l[anguage]:language Specifies the language to use for the generated proxy
class. You can specify CS (C#; default), VB (Visual
Basic), or JS (JScript) as the language argument. You can
also specify the fully qualified name of a class implement-
ing System.CodeDom.Compiler.CodeDomProvider.
/n[amespace]:namespace Specifies the namespace for the generated proxy or tem-
plate. The default namespace is the global namespace. 10
ASP, NET, XML,
/no[backup] Overwrites the existing output file. The default is to

WEBSERVICES
SOAP, AND
produce a backup (.bak) file.

440

Option Description
/nologo Suppresses the Microsoft startup banner display.
/o[ut]:filename Specifies the file in which to save the generated proxy
code. The tool derives the default filename from the Web
service name. The tool saves generated datasets in differ-
ent files.
/p[assword]:password Specifies the password to use when connecting to a server
that requires authentication.
/protocol:protocol Specifies the protocol to implement. You can specify SOAP
(default), HttpGet, HttpPost or a custom protocol speci-
fied in the configuration file.
/proxy:URL Specifies the URL of the proxy server to use for HTTP
requests. The default is to use the system proxy setting.
/proxydomain:domain Specifies the domain to use when connecting to a proxy
or server that requires authentication.
/pd:domain
/proxypassword:password Specifies the password to use when connecting to a proxy
/pp:password
/proxyusername:username Specifies the username to use when connecting to a proxy
/pu:username
/server Generates an abstract class for a Web service based on the
contracts. The default is to generate client proxy classes.
/u[sername]:username Specifies the username to use when connecting to a server
that requires authentication.
/? Displays command syntax and options for the tool.
To generate a proxy for the Web service shown back in Listing 10.6, the following command-
line code can be used. Because no language switch is specified, C# code will be generated by
default:
wsdl /out:greetingProxy.cs
➥http://localhost/testbed/chapter10/listing10.6.asmx?WSDL
➥/n:Greeting
Executing this statement on the command line will result in the creation of the C# document
shown in Listing 10.12.
441
CHAPTER 10
LISTING 10.12 Proxy Code Generated by the WSDL Utility

1: //-------------------------------------------------------------------
2: // <autogenerated>
3: // This code was generated by a tool.
4: // Runtime Version: 1.0.2914.11
5: //
6: // Changes to this file may cause incorrect behavior and
7: // will be lost if the code is regenerated.
8: // </autogenerated>
9: //-------------------------------------------------------------------
10:
11: //
12: // This source code was auto-generated by wsdl, Version=1.0.2914.11.
13: //
14: namespace Greeting {
15: using System.Diagnostics;
16: using System.Xml.Serialization;
17: using System;
18: using System.Web.Services.Protocols;
20:
21:
22: [System.Web.Services.WebServiceBindingAttribute(Name=”GreetingSoap”,
23: Namespace=”http://www.TomorrowsLearning.com/Greeting”)]
24: public class Greeting :
25: System.Web.Services.Protocols.SoapHttpClientProtocol {
26:
27: [System.Diagnostics.DebuggerStepThroughAttribute()]
28: public Greeting() {
29: this.Url =
30: “http://localhost/testbed/chapter10/listing10.6.asmx”;
31: }
32:
34: [System.Web.Services.Protocols.SoapDocumentMethodAttribute(
35: “http://www.TomorrowsLearning.com/Greeting/Greet”,
36: RequestNamespace=”http://www.TomorrowsLearning.com/Greeting”,
37: ResponseNamespace=”http://www.TomorrowsLearning.com/Greeting”,
38: Use=System.Web.Services.Description.SoapBindingUse.Literal,
39: ParameterStyle=
40: System.Web.Services.Protocols.SoapParameterStyle.Wrapped)]
10
ASP, NET, XML,

WEBSERVICES
SOAP, AND
42: object[] results = this.Invoke(“Greet”, new object[] {

43: name});
44: return ((string)(results[0]));
442

45: }
46:
48: public System.IAsyncResult BeginGreet(string name,
49: System.AsyncCallback callback, object asyncState) {
50: return this.BeginInvoke(“Greet”, new object[] {
51: name}, callback, asyncState);
52: }
53:
55: public string EndGreet(System.IAsyncResult asyncResult) {
56: object[] results = this.EndInvoke(asyncResult);
57: return ((string)(results[0]));
58: }
59: }
60: }
Although the specifics of this file will not be discussed, you can see that the code specifies the
URL of the Web service (line 30) and then emulates the remote Web service object by creating
a Greet() method structure (lines 41–45). Depending on how complex the Web service is that
you need to create a proxy for, you can see that the WSDL utility is a big time-saver after you
know how to work with the switches!
To compile the proxy code into an assembly that can be used within an ASP.NET page, the fol-
lowing statement can be run from the command line (C# compile switches shown):
csc /t:library /r:system.web.services.dll /r:system.xml.dll
➥/out:d:\inetpub\wwwroot\testbed\bin\greetingProxy.dll greetingProxy.cs
TIP
Although not shown in this section, the WSDL.exe utility can also be used to create a
WSDL file and perform other tasks. This is accomplished by using the switches shown in
Table 10.5.
Calling a Web Service from an ASP.NET Page

After a proxy has been created that can connect a client to a Web service, it can be used within
an ASP.NET page. Doing this simply requires that you import the proper namespace contained
within the proxy assembly that was generated earlier (greetingProxy.dll, which has a
Greeting namespace). Listing 10.13. shows how the proxy can be used to hit the Web service
and then write out the return results to an ASP.NET page.
443
CHAPTER 10
LISTING 10.13 Using the Proxy Assembly Within an ASP.NET Page

2: public void submitButton_Click(Object Src, EventArgs E) {
3: Greeting.Greeting objGreet = new Greeting.Greeting();
4: wsResults.InnerHtml = objGreet.Greet(myName.Value);
5: }
6: </script>
7: <html>
8: <head>
9: <title>Hitting the Greeting Web Service From an ASP.NET Page</title>
10: </head>
13: <b>Enter your Name:</b>
14: <input type=”text” name=”myName” id=”myName” runat=”server”/>
15: <input type=”button” id=”submitButton” value=”Hit the Web Service!”
16: onServerClick=”submitButton_Click” runat=”server”/>
17: <p>
18: <b>Value returned from Greeting Web Service:</b><p/>
19: <div id=”wsResults”
20: style=”width:300px;background:#efefef;color:#ff0000;font-weight:bold;
21: font-size:18pt;” runat=”server”/>
22: </form>
23: </body>
24: </html>
On entering a value into the text box and hitting the Submit button, the submitButton_Click
event handler is called (line 2). This handler instantiates the proxy created earlier and then
passes the value in the text box to the Greet() method (lines 3 and 4). From here, the proxy
takes care of serializing and deserializing the request and response message to and from the
Web service. This is shown in Figure 10.4.
10
ASP, NET, XML,
WEBSERVICES
SOAP, AND
FIGURE 10.4
Hitting a Web service from an ASP.NET page.
444
Web Service Discovery

Now that you’ve seen the different steps that must be taken to create a Web service and client
proxy, you may wonder how you can find someone else’s service to use within an ASP.NET
application. For Web services to be useful, their service description document must be located.
Although the .NET platform has support for the Discovery of Web Services (DISCO),
Microsoft is now collaborating with more than 36 other companies to create a universal direc-
tory of Web services. This directory service is being termed Universal Description, Discovery,
and Integration (UDDI). Although a complete discussion of UDDI is beyond the scope of this
chapter, more information on this evolving specification can be found at http://
www.uddi.org.
Now that you’ve seen all the pieces that come into play when building or consuming a Web
service, the following section provides a simple example of how Web services can be used to
solve legitimate business problems.
Retrieving Customer Orders Through a Web

Service
In this section of the chapter, you’ll be introduced to a fictitious Web service for a company
named ACME Distribution. After the service’s purpose has been outlined, you’ll be provided
with a description of the code used to construct it.
Web Service Description

Wahlin’s Widgets sells a variety of products through its Web site. The site allows customers to
browse through a product catalog, place orders, and check the status of orders. Rather than car-
rying an inventory and handling the shipment of customer orders, Wahlin’s Widgets has a con-
tract with ACME Distribution, Inc., a distribution center that takes care of stocking and
shipping the products.
Accessing the catalog of products supplied by ACME Distribution is accomplished via FTP
access to an XML document on a server owned by ACME Distribution. After the document is
retrieved, the data within it is imported into a SQL Server 2000 database owned by Wahlin’s
Widgets. This allows customers to access product information quickly and allows Wahlin’s
Widgets to have complete control over how the products are presented to the customers.
Because the product catalog is only updated on a weekly basis, this method is sufficient for the
time being.
After a customer views the products available on the Wahlin’s Widgets Web site, the customer
can place an order for the item. This order is then transmitted via FTP to a server located at
one of ACME’s distribution centers. Because of the time it takes to send this order information
445
CHAPTER 10
between the companies, Wahlin’s Widgets is not able to provide customers with instant feed-
back on whether a particular product is in stock. As a result, customer orders for products that
are not currently in stock at the distribution center may be delayed for lengthy amounts of
time. When this situation occurs, Wahlin’s Widgets receives notification from an ACME distri-
bution center and lets the customer know via e-mail that the shipment of their product may be
delayed. Although this methodology works, there have been many customer complaints and
order cancellations because of unsatisfied customers.
To remedy this situation, Wahlin’s Widgets has worked with ACME Distribution to develop a
new strategy for checking inventory in a more timely fashion. The two companies have also
agreed to provide customers with the capability to check the status of orders and double-check
that orders were shipped to the proper address. Because ACME Distribution acts as the distrib-
ution center for several other companies, it has agreed to create a Web service that can be used
to check product availability and shipping status so that customers can get immediate feed-
back.
Creating the ACME Distribution, Inc. Web Service

ACME realized that it needed to provide “live” feedback to different companies about the
availability of specific products within its warehouse. While building the Web service to
accomplish this task, ACME also provided functionality that allowed customers to check
the status of an order. The implementation of these features into a Web service is shown in
Listing 10.14.
LISTING 10.14 ACME Distribution Web Service

1: <%@ WebService language=”C#” class=”ACMEProducts” %>
2: using System.Data;
3: using System.Data.SqlClient;
5:
6: [WebService(Namespace=”http://www.ACMEDistribution.com/Products”)]
7: public class ACMEProducts: WebService {
8:
9: [WebMethod(Description=”This method allows access “ +
10: “ to product stock numbers”)]
11: public DataSet CheckStock(int productID) {
12:
13:
string sql = @”SELECT ProductName,UnitsInStock, UnitsOnOrder,
Discontinued FROM Products WHERE ProductID = ‘“ +
10
ASP, NET, XML,
14: productID + “‘“;

WEBSERVICES
SOAP, AND
15: DataSet dsProductStatus = getDataSet(sql,”ProductQuantity”);

16: return(dsProductStatus);
17: }
446

18:
19: [WebMethod(Description=”This method allows access to “ +
20: “ order status information”)]
21: public DataSet GetOrders(string CustomerID) {
22: string sql = @”SELECT ShipName,ShipAddress,ShipCity,ShippedDate
23: FROM Orders WHERE CustomerID = ‘“ + CustomerID + “‘“;
24: DataSet dsOrders = getDataSet(sql,”Orders”);
25: return(dsOrders);
26: }
27:
28: [WebMethod(Description=”This method allows selection of “ +
29: “products by name”)]
30: public DataSet GetProductByName(string productName) {
31: string sql = @”SELECT ProductID,ProductName,UnitPrice,
32: QuantityPerUnit FROM Products
33: WHERE ProductName LIKE ‘“ + productName + “%’”;
34: DataSet dsProducts = getDataSet(sql,”Products”);
35: return(dsProducts);
36: }
38: “products by ID”)]
39: public DataSet GetProductByID(int id) {
40: string sql = @”SELECT ProductID,ProductName,UnitPrice,
41: QuantityPerUnit FROM Products
42: WHERE ProductID = “ + id;
45: }
47: “all products”,CacheDuration=120)]
48: public DataSet GetProducts() {
49: string sql = “SELECT ProductID,ProductName FROM Products”;
52: }
53:
54: private DataSet getDataSet(string sql, string tableName) {
55: string SQLconnStr = “server=localhost;uid=sa;” +
56: “pwd=;database=Northwind”;
58: SqlConnection dataConn = new SqlConnection(SQLconnStr);
59: SqlDataAdapter dsAdapter = new SqlDataAdapter(sql,dataConn);
60: dsAdapter.Fill(ds, tableName);
447
CHAPTER 10

62: return(ds);
63: }
64: }
This Web service exposes several methods that return records from the ACME database,
including:
• CheckStock()
• GetOrders()
• GetProducts()
• GetProductByID()
• GetProductByName()
Consuming the ACME Distribution, Inc. Web Service

After ACME Distribution’s Web service was online and available to use, Wahlin’s Widgets ran
the following statements at the command prompt. The WSDL.exe portion of the commands use
ACME’s WSDL document to create a client proxy:
wsdl /out:d:\temp\acmeProxy.cs /n:ACME.Distribution
➥http://localhost/testbed/chapter10/listing10.14.asmx?WSDL
csc /t:library /r:system.web.services.dll /r:system.xml.dll

➥/r:system.data.dll /out:d:\inetpub\wwwroot\testbed\bin\acmeProxy.dll
➥d:\temp\acmeProxy.cs
After the proxy was completed, Wahlin’s Widgets created an ASP.NET file that used the proxy
to communicate with ACME Distribution’s Web service via SOAP messages. Listing 10.15
shows a portion of the ASP.NET page code.
LISTING 10.15 ASP.NET Application Using ACME’s Web Service

5: string id = Request.QueryString[“id”];
6: string order = Request.QueryString[“placeOrder”]; 10
ASP, NET, XML,
7: if (!IsPostBack && id==null && order==null) {

WEBSERVICES
SOAP, AND
8: ShowHidePanels(“pnlSplash”);
9: } else {
10: ShowHidePanels(“”);
448

11: }
12: if (id != null) {
13: CheckStock(Convert.ToInt32(id));
14: }
15: if (order != null) {
16: PlaceOrder(order);
17: }
18: }
19: public void GetOrderStatus_Click(Object Src, EventArgs E) {
20: ShowHidePanels(“pnlOrderStatus”);
21: ACME.Distribution.ACMEProducts custOrders =
22: new ACME.Distribution.ACMEProducts();
23: DataSet ds = custOrders.GetOrders(customerID.Value);
24: OrdersDataGrid.DataSource=ds.Tables[0].DefaultView;
25: OrdersDataGrid.DataBind();
26: }
27:
28: public void PlaceOrder(string order) {
29: ShowHidePanels(“pnlPlaceOrder”);
30: lblPlaceOrder.Text = “<p><b>Your order has been placed “ +
31: “for product ID: “ + order + “</b>”;
32: }
33:
34: public void CheckStock(int productID) {
35: ShowHidePanels(“pnlStock”);
38: DataSet ds = custOrders.CheckStock(productID);
39: StockDataGrid.DataSource=ds.Tables[0].DefaultView;
40: StockDataGrid.DataBind();
41: }
42: public void ProductSubmit_Click(Object Src, EventArgs E) {
43: ShowHidePanels(“pnlProducts”);
46: DataSet ds = custOrders.GetProductByName(txtProductName.Text);
47: ProductsDataGrid.DataSource=ds.Tables[0].DefaultView;
48: ProductsDataGrid.DataBind();
49: }
50: public void CheckOrderStatusLink_Click(Object Src, EventArgs E) {
51: ShowHidePanels(“pnlCheckOrder”);
449
CHAPTER 10

52: }
53: public void BrowseProductsLink_Click(Object Src, EventArgs E) {
54: ShowHidePanels(“pnlBrowseProducts”);
55: ACME.Distribution.ACMEProducts prods =
57: DataSet ds = prods.GetProducts();
58: DataView dv = ds.Tables[0].DefaultView;
59: lstProducts.DataSource = dv;
60: lstProducts.DataBind();
61: }
62: public void lstProducts_Select(Object Src, EventArgs E) {
63: ShowHidePanels(“pnlProducts”);
66: DataSet ds = custOrders.GetProductByID(
67: Convert.ToInt32(lstProducts.SelectedItem.Value));
68: ProductsDataGrid.DataSource=ds.Tables[0].DefaultView;
69: ProductsDataGrid.DataBind();
70: }
71: public void ShowHidePanels(string panel) {
72: Control ctl = null;
73: Control frm = Page.FindControl(“frmWidgets”);
74: int ctlsCount = frm.Controls.Count;
75: for (int i=0;i<ctlsCount;i++) {
76: ctl = frm.Controls[i];
77: if (ctl.Parent == frm) {
78: if (ctl.ID == panel) {
79: ctl.Visible = true;
80: } else {
81: if (ctl.ID != null && ctl.ID.IndexOf(“pnl”) != -1) {
82: ctl.Visible = false;
83: }
84: }
85: }
86: }
87: }
88: </script>
Because Wahlin’s Widgets has integrated the Web service into its Web site, customer satisfac- 10
tion has increased as customers hitting the site can check whether a particular product is in
ASP, NET, XML,
WEBSERVICES
SOAP, AND
stock before ordering and can check on the status of any pending orders. Figure 10.5 shows the
results of checking the availability of a product on the Wahlin’s Widgets Web site.
450
FIGURE 10.5
Integrating ACME’s Web service into the Wahlin’s Widgets Web site.
Many important aspects of Web services are not shown in this example. These range from
security implementations to asynchronous calls. Before creating or consuming any Web ser-
vice, it is highly recommended that you examine these and other Web service features. To
see a more full-featured demonstration of a Web service in action, visit http://
www.TomorrowsLearning.com and examine the golfer Tee-Times service.
Summary
As more and more distributed systems enter the network of computers throughout the world,
the capability to send messages that are used to programmatically manipulate remote objects
will become increasingly necessary. In this chapter you were provided with an in-depth look at
the SOAP specification and learned how SOAP messages are structured. Although their struc-
ture is fairly simplistic, SOAP messages provide a powerful way to exchange serialized infor-
mation between applications.
You were also presented with information on Web services and saw how they can be created
using attributes and directives built in to the .NET platform. After they are created, Web ser-
vices can be consumed by clients through using a proxy mechanism. Because Web services can
be hit by client applications written in any language capable of serializing and deserializing
SOAP messages (or sending messages through HTTP-GET and HTTP-POST), they present a pow-
erful and useful means for clients to access and consume distributed information program-
matically.
NUMBERS AND SYMBOLS
INDEX
#FIXED value, 99
#IMPLIED value, 99
#REQUIRED value, 99
$ (dollar sign) character, 263
% (percent) character, 40
& (ampersand) character, 40
&copy entity, 36
&nbsp entity, 35
&reg entity, 36
* (asterisk), 96
* (wildcard character), 58
. (period) character, 66
.asmx extension, 431-432
.Net platform, 10-11
/ (slash) character, 54, 66
/ value, 251
/? switch, 440
/appsettingbaseurl:baseurl switch, 439
/appsettingurlkey:key switch, 439
/baseurl:baseurl switch, 439
/d[omain]:domain switch, 439
/l[anguage]:language switch, 439
/n[amespace]:namespace switch, 439
/no[backup] switch, 439
/nologo switch, 440
/o[ut]:filename switch, 440
/p[assword]:password switch, 440
/pd:domain switch, 440
/pp:password switch, 440
/protocol:protocol switch, 440
/proxy:URL switch, 440
/proxydomain:domain switch, 440
/proxypassword:password switch, 440
/proxyusername:username switch, 440
/pu:username switch, 440
/server switch
452
/server switch, 440 reusable classes, Extensible

/u[sername]:username
A Stylesheet Language
switch, 440 abbreviations, XPath, 66, Transformations (XSLT),
/urlkey:key switch, 439 68-69 287-290
; (semicolon) character, 38, 40 abort( ) method, 213 simple links, XLink docu-
? (question mark), 22, 96 AcceptChanges( ) method, ments, 82
@ (at) symbol, 66 334 AddNode( ) method, 230-231
[ character token, 59 accessing AddParam( ) method, 277-278
] character token, 59 application state, Web ser- address element, 261
_ (underscore), 24 vices, 429-430 ADO. See ActiveX Data
_valid variable, 165 MSXML3 through Interop, Objects
{} (curly brackets), 263 179, 181 ADO.NET
“ (quotes), 20 nodes, XPointer , 76 classes, Extensible Stylesheet
+ (plus) character, 96 return values, Extensible Language Transformations
< (less than) character, 36, 40 Stylesheet Language (XSLT), 272-273, 275-277,
<a> tag, 24, 82 Transformations (XSLT) 279-282, 284-285, 287
<body> element, 251 templates, 266-267 DataSet class, 295, 306-308,
<body> tag, 24 session state, Web services, 310-314, 316-317, 319-324,
<body> tags, 9 430 326, 328-332, 334, 336-344,
<br> tag, 18-19 stored procedures, Command 346-348
<br> tags, 9 class, 300-301, 303, 305 managed providers, 296-301,
<div> tag, 24 ActiveX Data Objects (ADOs) 303, 305-307
<font> element, 25 vs. ADO.NET, 294-295 using with SQL Server 2000,
<font> tag, 4 actor attribute, 409 394-396, 398
<hr> tag, 24 actuate attribute, 81-82 vs. classic ActiveX Data
<hr> tags, 9 Add( ) method, 161 Objects (ADOs), 294-295
<html> element, 251 AddExtensionObject(name- XmlDataDocument class, 324,
<hyperlink> element, 24 space,URI,object) method, 326, 328-332, 334, 336-344,
<img> tag, 7 277 346-348
<img> tags, 9 adding after tag, 390
<li> tags, 9 comments, eXtensible Markup aliases, providing to over-
<menuItem> element, 24-25 Language (XML), 35 loaded methods, 428
<meta> tags, 9 database records, 389-394 all element, 122
<name> element, 24 descriptions, Web service Allow URL Queries check
<option> tags, 9 classes and methods, 427 box, 364
<p> tag, 18, 82 eXtensible Markup Language ampersand (&) character, 38,
<p> tags, 9 (XML) data into tables, 40
<root> elements, applying OPENXML rowset ancestor axis, 56
default namespaces to, provider, 387 ancestor-or-self axis, 56
29-30 external objects to annotations, XDR schema,
> (greater than) character, XsltArgumentList class, 369-370, 372-374
36, 96 283-284 AppendChild( ) method, 185,
0 value, 386 nodes, XmlDocument class, 191
1 value, 386 196-200 appending information to
2 value, 386 documents with standard
8 value, 386 ASP.NET classes, 46-48
attributes
453
application state, accessing sending data to eXtensible content, 106

in Web services, 429-430 Markup Language (XML) CustID, 380
applications templates, 388-389 dataFld, 217
building, ASP.NET objects, updating, inserting, and delet- dataSrc, 217
44-45, 47-50 ing database records, default, 110-111
converting from ASP to 389-394 described, 25, 27
ASP.NET, 180-181 XsltTransform class, 289 disable-output-escaping, 253
converting to true eXtensible ASP.NET objects, generating Document Type Definition
Markup Language (XML) eXtensible Markup (DTD), 97-99
applications, 225-227, Language, 44-45, 47-50 dt:type, 107, 111, 113
229-231 ASP.NET pages dt:values, 111
creating, Extensible Stylesheet calling SAXParser class from, element tag, 108-109
Language Transformations 158-159 elementFormDefault, 129
(XSLT), 278-281 calling Validator class from, ElementType tag, 106-107
hierarchical eXtensible 165 Enumeration, 98
Markup Language (XML) assemblies Envelope element, 407-408
menu, creating, 219, compiling proxy code into, eXtensible Markup Language
222-223, 225 442 (XML), mapping fields to,
sharing data between, 7 referencing, XmlText Reader 339-341
applying default namespaces class, 152 foreign-relation, 373
to <root> elements, 29-30 System.Xml, Document form, 129
arc element, 84-86 Object Model (DOM) from, 81, 85
arc keyword, 79 classes, 182-183 Header element, 409-410
architecture, Web services, asterisk (*), 58, 96 href, 80, 82, 417
423-424 at (@) symbol, 66 ID, 98, 229-230, 390, 417
arcrole attribute, 80 at-identity attribute, 390-391 IDREF/IDREFS, 98
arguments, AddNode( ) attaching event handlers, indent, 250
method, 231 XmlValidatingReader class, itemNumber, 25
ArrayList class, 221 161-162 key, 373
arrays, 417-419 attribute axis, 56 key-relation, 373
arrayType attribute, 417 attribute element, 116 label, 81, 85
ASP, converting applications attribute tag, 111 mapping-schema, 376
to ASP.NET from, 180-181 Attribute value, 339 match, 251
asp:Xml Web control, 289- AttributeCount property, 139 maxOccurs, 109
290 attributeFormDefault minOccurs, 108
ASP.NET attribute, 129 mode, 252
calling Web services from, attributes model, 106
442-443 actor, 409 mustUnderstand, 409-410
capturing data, 226-227, 229 actuate, 81-82 name, 106, 110, 252, 257,
consuming Web services, arcrole, 80 263, 266, 276
438-442, 444 arrayType, 417 naming, 25
converting applications from at-identity, 390-391 NMTOKEN/NMTOKENS, 99
ASP to, 180-181 attribute tag, 111 noNamespaceSchemaL
eXtensible Markup Language attributeFormDefault, 129 ocation, 130
(XML) templates, 367-368 AttributeType tag, 110-111 NOTATION, 99
CDATA, 97 omit-xml-declaration, 260
color, 25 order, 107
attributes
454
OrdID, 379-380 Auto member, comments, eXtensible Markup

passing automatically, 145 ValidationType enumera- Language (XML), 35
priority, 252 tions, 160-161 DataSet mappings, eXtensible
quoted, 10 axes, 56, 59, 66 Markup Language (XML),
quoting, 20 axis types, XPath, 55-57 343-345
recordsets with, 26 Document Object Model
ref, 116 (DOM) structures, 183
required, 110-111 Document Type Definition
returnid, 390-391 B (DTD) syntax, ease of, 90
role, 81 bare names, identifying frag- eXtensible Markup Language
schemaLocation, 130 ments with, 75 (XML) applications,
select, 252-253, 255-256, BaseURI property, 139, 184, ASP.NET objects, 44-45,
263-266, 276 190 47-50
setting to preserve and default, before tag, 390 eXtensible Markup Language
42 bin.base64 data type, 112 (XML) documents,
show, 81 bin.hex data type, 112 XmlTextWriter class, 166,
src, 7 BINARY BASE64 keyword, 168-169, 172-173
style, 256 363 generic validation classes,
targetNamespace, 128-130 binding DataSets to 162-165
test, 252, 256 DataGrids, 318-320 hierarchical eXtensible
title, 81 binding element, 437-438 Markup Language (XML)
to, 81, 85 bit shifting, 23 menu applications, 219,
type, 81-82, 85, 108, 111 blocks, catch and finally, 158 222-223, 225
value, 120 Body element, 410-414 hierarchical relationships,
version, 242-243 boolean data type, 112 DataSets, 345-348
vspace, 7 boolean functions, 65-66 Hypertext Markup Language
Web services, 424-432, boolean(value) function, 65 (HTML) pages to view user
434-438 brackets, curly ({}), 263 entries, 211, 214-219
WebMethod, 426-429 browsers nodes, XmlDocument class,
WebService, 426-427 &nbsp entity, 36 196-200
when required for elements, creating eXtensible Markup reusable classes, Extensible
XLink, 87-88 Language (XML) document Stylesheet Language
XLink, 80-81, 87-88 structures, 205, 207-209 Transformations (XSLT),
XML Data – Reduced (XML- Internet Explorer 287-290
DR) schema, 106-111 validating eXtensible SAX-style push models,
XML schema, 116 Markup Language XmlTextReader class, 145,
xml:space, 42 (XML) documents, 92 147, 149-154, 156-159
xsl:template element, 251-252 xml:space attribute, 42 structures, eXtensible Markup
Attributes class, 154 BufferResponse property, Language (XML) docu-
Attributes property, 184, 189 425 ments, 205, 207-209
AttributeType element, 116, building unique fields, keys, and rela-
121 applications, Extensible tionships with XML schema,
AttributeType tag, 110-111 Stylesheet Language 124-125, 127-128
authentication credentials, Transformations (XSLT), Web services, 445-447
passing, 166 278-281 XML fragments,
AUTO keyword, 354-355 client proxies, 439-442 XmlTextWriter class,
198-199
classes
455
C char data type, 112

character data (CDATA)
DataSet, 295, 306-308,
310-314, 316-317, 319-324,
CacheDuration property, 425 sections, 40-41 326, 328-332, 334, 336-344,
Call element, 410-411 Character Entities, 37-38 346-348, 358-362
calling character points, 71 Date/Time extension, 282
external objects within XSLT characters, Document Type Document Object Model
stylesheets, 284-286 Definition (DTD) elements, (DOM), System.Xml
SAXParser class from 96 namespace and assembly,
ASP.NET pages, 158-159 checking 182-183
stored procedures, eXtensible element nodes, EdiToXml, 173
Markup Language (XML) XmlTextReader class, 154, generic validation, creating,
templates, 366 156 162-165
Validator class from ASP.NET eXtensible Markup Language HashTable, 278
pages, 165 (XML) documents, 20-21 NetworkCredential, 166
Web services from ASP.NET validity, eXtensible Markup OleDbCommand, 299-300,
pages, 442-443 Language (XML) docu- 304-305
CanResolveEntry property, ments, 92 OleDbConnection, 297-298
139 child axis, 56, 59 OleDbDataAdapter, 306-307
capturing data, eXtensible child elements, 53. See also OleDbDataReader, 300
Markup Language (XML) nodes Relation, 347
and ASP.NET, 226-227, 229 child sequence numbers, 76 reusable, creating, 287-290
case sensitivity child sequences, identifying SAXParser, 152, 158-159
eXtensible Markup Language fragments with, 76 SqlCommand, 298-299,
(XML), 20 ChildNodes property, 184, 301-302
parameters, eXtensible 188, 190 SqlConnect, 310-311,
Markup Language (XML) Chinese language, 23 313-314, 317
templates, 366 choice element, 121-122 SqlConnection, 296-297
tags, 10 choosing nodes SqlDataAdapter, 306-307
case statement, 154, 156 SelectSingleNode( ) method, SqlDataReader, 300
catch block, 158 218-219 standard ASP.NET, appending
catching errors, XPath, 204-205 information to documents,
XmlTextReader class, classes 46-48
157-158 ADO.NET, Extensible StreamReader, 173, 318
CDATA attribute, 97 Stylesheet Language StreamWriter, 221
cdata directive, EXPLICIT Transformations (XSLT), Validator, 162-165, 205, 207-
mode queries, 382 272-273, 275-277, 279-282, 209
CDATA (character data) sec- 284-285, 287 Web services, adding
tions, 40-41 ArrayList, 221 descriptions to, 427
ceiling(number) function, 65 Attributes, 154 WebMethodAttribute, 425
changing Command, 298-303, 305-306 WebServiceAttribute, 426
color attribute to <font> ele- ContentHandler, 159 XmlAttribute, 182, 188
ment, 25 CredentialCache, 166 XmlAttributeCollection, 201,
database records, 389-394 DataAdapter, 306 204
links, Hypertext Markup DataReader, 300, 303, 306 XmlCDataSection, 188
Language (HTML) DataRelation, 361 XmlComment, 188
documents, 86
classes
456
XmlDataDocument, 273, 324, validating eXtensible code listings

326, 328-332, 334, 336-344, Markup Language asp:Xml Web control, 289-290
346-348 (XML) documents, classes
XmlDocument, 182, 189-195, 159-162 DataSet, 329-334
197-200, 205, 207-209, 272 vs. Document Object Date/Time extension, 282
XmlDocumentFragment, 188 Model (DOM), 179 generic validation, 163-
XmlDocumentType, 188 XmlTextWriter 164
XmlElement, 182, 189 creating eXtensible OleDbCommand, 299-300,
XmlEntity, 189 Markup Language 304-305
XmlEntityReference, 189 (XML) documents, 166, SAXParser, in ASP.NET
XmlMenu, 221 168-169, 172-173 pages, 158, 162-165,
XmlNamedNodeMap, 182, creating XML fragments, 167-168, 170-173
201-202, 204-205, 207-209 198-199 SqlCommand, 301-302,
XmlNameTable, 145 in-memory vs. forward- 298-299
XmlNode, 182-189, 205, only parsing, 135-137 Validator, calling from
207-209 push vs. pull models, ASP.NET pages, 165
XmlNodeList, 182, 200-201 137-138 XmlDataDocument, 325-
XmlNodeReader, 183, System.Xml assembly, 334
209-210 134-135 XmlDocument, 197-199
XmlNotation, 189 XmlValidatingReader, validat- XmlNamedNodeMap,
XmlParserContent, 396, 398 ing eXtensible Markup 202-203
XmlProcessingInstruction, Language (XML) docu- XmlNode, property exam-
189 ments, 159-162 ples, 187
XmlSchemaCollection, 161 XPathDocument, 273, 275, XmlNodeReader, 209-210
XmlText, 189 358 XmlParserContext, 397-
XmlTextReader XsltArgument List, 273 398
building SAX-style push XsltArgumentList, 276-278, XpathDocument, instanti-
models, 145, 147, 283-284 ating, 274-275
149-154, 156-159 XslTransform, 240-241, 273, XmlTextReader, 142-144,
creating eXtensible 275-276 149-152
Markup Language XsltTransform, ASP.NET, 289 XmlTextWriter class,
(XML) document struc- clauses, WHERE, 58, 365-366 198-199
ture, 205, 207-209 clearing namespaces, 33 XsltArgumentList, adding
in-memory vs. forward- Client code, 414 external objects to, 283-
only parsing, 135-137 client proxies, creating, 284
parsing eXtensible Markup 439-442 XslTransform, 240-241,
Language (XML), 138- Clone( ) method, 185, 191 276
145 CloneNode(( ) method, 185, XsltTransform, 289
passing authentication 191 creating Simple API for XML
credentials, 166 Close( ) method, 141 (SAX), 147-149
push vs. pull models, closed model, 107 creating Simple API for XML
137-138 closing tags, 9-10, 18-19 (SAX) parsers,149-152
System.Xml assembly, 134- COBRA (Common Object DataSets
135 Request Broker binding to DataGrids,
Architecture), 401 318-320
code listings
457
creating hierarchical viewing DataSets as, IContentHandler interface,

relationships, 346-347 315-317 146-147
filling with data, 309-310 eXtensible Markup Language methods
filling with multiple tables, (XML) documents CreateMenu( ) method,
311-313 creating structure of, 205- 222-223, 225, 227-229
mapping XSD schema to, 208 ExecuteXmlReader( ),
341-343 embedding XPath queries 394-395
viewing as eXtensible into, 375-376 overloaded, providing
Markup Language inferring DataSet schema aliases to, 428
(XML), 315-317 from, 344-345 Page Load event and
Deletegrams, 392 parsing, 181 FillDropDown( ), 280
directives in EXPLICIT mode samples of, 93-95, 102, SelectSingleNode( ),
queries, 382-383 124, 237-238, 240-241, 218-219
Document Type Definition 246-249, 258-259 WalkTree( ), 224-225
(DTD) documents, 104-105 schema, 103 nodes
EXPLICIT mode query, 378 shredding, OPENXML adding, XmlDocument
extending complex types, 123 rowset provider, class, 197-199
eXtensible Markup Language 385-386 selecting,
(XML) transforming Extensible SelectSingleNode( )
capturing data, 226-227, Stylesheet Language method, 218-219
229 Transformations (XSLT) operations
converting Electronic Data into, 259-260 Insert, 391
Interface (EDI) to, eXtensible Markup Language Delete, 392
172-173 (XML) templates Update, 392
creating fragments, ASP.NET page for sending parameters
XmlTextWriter class, data to, 388-389 in XPath query
198-199 calling stored procedures, templates, 376
data generated by 366 Updategrams, 393
WriteXml( ) method, embedding XPath queries properties, XmlNode
321-322 into, 375-376 class, 187
data generated by examples of, 364-365 proxy assembly in ASP.NET
WriteXmlSchema( ) in ASP.NET, 367-368 page, 443
method, 322-324 specifying parameters, proxy code generated by
generating, XmlTextWriter 365-366 WSDL.exe , 441-442
class, 168 Extensible Stylesheet purchase order Electronic
inserting data into tables, Language Transformations Data Interface (EDI)
OPENXML rowset (XSLT) documents, 170
provider, 387 calling external objects schema
loading data directly from within stylesheets, eXtensible Markup
SQL Server, 358-360 284-286 Language (XML)
loading element data, document examples, 260 documents, 103
drop-down boxes, 214- reusable classes, 287-288 eXtensible Markup
217 transforming from Language (XML),
mapping fields to attrib- eXtensible Markup 114-115, 125-127
utes, 339-341 Language (XML), Northwind mapping,
schema, 114-115, 125-127 259-260 370-372
transforming, 281, 395
code listings
458
Simple Object Access comma-delimited files, mark- eXtensible Markup Language

Protocol (SOAP) ing up in eXtensible (XML) applications,
messages, 405-407 Markup Language (XML), ASP.NET objects, 44-45,
Simple Object Access 5-6 47-50
Protocol (SOAP) Command class, 298-303, eXtensible Markup Language
responses with embedded 305-306 (XML) documents,
fault information, commands, 422-423 XmlTextWriter class, 166,
412-413 CommandType property, 302 168-169, 172-173
responses, serialized comment( ) node test, 58 generic validation classes,
GetOrders( ) method, comments, 34, 113 162-165
411-413 Common Object Request hierarchical eXtensible
serialization of GetName( Broker Architecture Markup Language (XML)
) method into structure (COBRA), 401 menu applications, 219,
of, 404 compiling proxy code into 222-223, 225
serialization of GetOrders( assemblies, 442 hierarchical relationships,
) method into messages complexType element, DataSets, 345-348
from, 411 declaring, 120-124 Hypertext Markup Language
stored procedures components, sharing data (HTML) pages to view user
accessing, between, 7 entries, 211, 214-219
OleDbCommand class, compound data types, nodes, XmlDocument class,
304-305 Simple Object Access 196-200
accessing, SqlCommand Protocol (SOAP), 417, 419 reusable classes, Extensible
class, 301-302 computers, text storage, 23 Stylesheet Language
calling, eXtensible Markup concat(string1,string2,string Transformations (XSLT),
Language (XML) 3, ...) function, 62 287-290
templates, 366 configuring SAX-style push models,
Updategrams, 389-392 handlers, XmlTextReader XmlTextReader class, 145,
URL-encoding, 356-357 class, 153 147, 149-154, 156-159
Web Service Description SQL Server virtual directo- structures, eXtensible Markup
Language (WSDL) docu- ries, Internet Information Language (XML) docu-
ment, 433-436 Server (IIS), 351-353 ments, 205, 207-209
Web services constructing unique fields, keys, and rela-
accessing states, 429-430 applications, Extensible tionships with XML schema,
ASP.NET applications Stylesheet Language 124-125, 127-128
with, 447-449 Transformations (XSLT), Web services, 445-447
example of, 445-447 278-281 XML fragments,
greeting, 426 client proxies, 439-442 XmlTextWriter class,
transaction support, comments, eXtensible Markup 198-199
428-429 Language (XML), 35 constructors
codes, Fault element, 414 DataSet mappings, eXtensible vReader, 165
coding eXtensible Markup Markup Language (XML), XPathDocument class, 274
Language (XML), 16-20 343-345 consuming Web services,
collapsed ranges, 71 Document Object Model 438-442, 447-450
colons (:), root elements, 54 (DOM) structures, 183 contactName node, 229
color attribute, 25 Document Type Definition contains(string1, string2)
ColumnMapping property, (DTD) syntax, ease of, 90 function, 62
339-341 content attribute, 106
data
459
content models, 96, 120 CreateWhitespace( ) method, structures, eXtensible Markup

ContentHandler class, 159 192 Language (XML) docu-
context node, 55, 57, 66, 253 CreateXmlDeclaration( ) ments, 205, 207-209
controls method, 192 unique fields, keys, and rela-
asp:Xml, 289-290 creating tionships with XML schema,
DataGrid WebControl, applications, Extensible 124-125, 127-128
318-320, 334 Stylesheet Language Web services, 445-447
converting Transformations (XSLT), XML fragments,
applications 278-281 XmlTextWriter class,
from ASP to ASP.NET, client proxies, 439-442 198-199
180-181 comments, eXtensible Markup CredentialCache class, 166
to true eXtensible Markup Language (XML), 35 credentials, authentication,
Language (XML) appli- DataSet mappings, eXtensible 166
cations, 225-227, Markup Language (XML), curly brackets ({}), 263
229-231 343-345 CustID attribute, 380
Electronic Data Interface Document Object Model customer orders, retrieving,
(EDI) and legacy data to (DOM) structures, 183 444-450
eXtensible Markup Document Type Definition CustomerID field, 379-380
Language (XML), 169, (DTD) syntax, ease of, 90 CustomerID parameter,
172-173 eXtensible Markup Language 366, 411
Count property, 200, 202 (XML) applications,
count( ) function, 60, 252 ASP.NET objects, 44-45,
courseDifficulty element, 122 47-50
CreateAttribute( ) method, eXtensible Markup Language D
191 (XML) documents, data
CreateCDataSection( ) XmlTextWriter class, 166, capturing, eXtensible Markup
method, 191 168-169, 172-173 Language (XML) and
CreateComment( ) method, generic validation classes, ASP.NET, 226-227, 229
191 162-165 Electronic Data Interface
CreateDocumentFragment( ) hierarchical eXtensible (EDI), converting to
method, 191 Markup Language (XML) eXtensible Markup
CreateDocumentType( ) menu applications, 219, Language (XML), 169,
method, 192 222-223, 225 172-173
CreateElement( ) method, 192 hierarchical relationships, element, loading into drop-
CreateEntityReference( ) DataSets, 345-348 down boxes, 214-217
method, 192 Hypertext Markup Language eXtensible Markup Language
CreateMenu( ) method, (HTML) pages to view user (XML)
221-224 entries, 211, 214-219 generated by WriteXml( )
CreateNavigator( ) method, nodes, XmlDocument class, method, 321-322
185, 192, 273 196-200 generated by
CreateNode( ) method, 192 reusable classes, Extensible WriteXmlSchema( )
CreateProcessingInstruction( ) Stylesheet Language method, 322-324
method, 192 Transformations (XSLT), inserting into tables,
CreateSignificantWhiteSpace 287-290 OPENXML rowset
( ) method, 192 SAX-style push models, provider, 387
CreateTextNode( ) method, XmlTextReader class, 145,
192 147, 149-154, 156-159
data
460
loading directly from SQL DataSet class, 295, 306-308, descendent-or-self axis, 56
Server into DataSet 310-314, 316-317, 319-324, description elements, 34, 113
class, 358-360 326, 328-332, 334, 336-344, description markup lan-
loading directly from SQL 346-348, 358-362 guages, 8-9
Server into DataSet property, 328 Description property, 425
XPathDocument class, dataSrc attribute, 217 descriptive tags, 7
358 datatype element, 113 detail subelement, 413
returned by date data type, 112 directives
ExecuteXmlReader( ) Date/Time extension class, EXPLICIT mode queries,
method, transforming, 282 381-384
395 dateTime data type, 112 Web services, 426-427
filling DataSets with, 309-310 dateTime.tz data type, 112 directories, virtual, 351-353
Hypertext Markup Language DCOM (Distributed disable-output-escaping
(HTML), embedding into Component Object Model), attribute, 253
eXtensible Markup 400-401 DISCO (Discovery of Web
Language (XML) elements, declarations Services), 444
41 DOCTYPE, 94-95 Discovery of Web Services
legacy, converting to eXtensible Markup Language (DISCO), 444
eXtensible Markup (XML), 21-23 displaying
Language (XML), 169, 172- External Entities, 39 DataSets as eXtensible
173 Parameter Entities, 40 Markup Language (XML),
sending from ASP.NET pages declaring 311-314, 316-317
to eXtensible Markup complexType element, 120- metadata files, 180
Language (XML) templates, 124 user entries,
388-389 entities, Document Type XMLHTTPRequest objects,
sharing between applications Definitions (DTDs), 38 211, 214-219
and components, 7 links, 82 Distributed Component
data types parameters and variables, 264 Object Model (DCOM), 400-
Simple Object Access XmlTextReader class, 153 401
Protocol (SOAP), 414, 416- default attribute, 110-111 DOCTYPE declaration, 94-95
417, 419 default namespaces, 29-31, Document Object Model
XML Data – Reduced (XML- 33 (DOM)
DR) schema, 111-113 default values, Document accessing MSXML3 through
XML schema, 116-118, Type Definition (DTD), 99 Interop, 179, 181
120-124 DefaultCredentials property, classes, System.Xml name-
DataAdapter class, 306 166 space and assembly, 182-
databases defaults, setting attributes 183
Northwind, 124-125, 127-128 to, 42 converting applications to true
updating, inserting, and delet- defining file structures, 7 eXtensible Markup
ing records, 389-394 DeleteDataSet( ) method, Language (XML) applica-
dataFld attribute, 217 337 tions, 225-227, 229-231
DataGrid WebControl, deleting database records, creating hierarchical
318-320, 334 389-394 eXtensible Markup
DataReader class, 300, 303, Depth property, 139 Language (XML) menu
306 derived data types, 117-118 application, 219, 222-223,
DataRelation class, 361 descendant axis, 56 225
Element value
461
creating structures, 183, 205, Electronic Data Interface linking, XLink, 76-77, 79-83,
207-209 (EDI), 170 85-88
described, 176-178 Extensible Hypertext Markup source, assigning to asp:Xml
how it works, 136 Language (XHTML), 43-44 Web control, 290
in-memory vs. forward-only eXtensible Markup Language Web Service Description
parsing, 178-179 (XML) Language (WSDL), 433-436
selecting nodes, XPath, 204- closing tags, 18-19 DocumentType property, 190
205 contents of, 14, 16 dollar sign ($) character, 263
troubleshooting memory prob- creating structures in DOM. See Document Object
lems, 136 browsers, 205, 207-209 Model
XmlDocument class, 189-195, creating, XmlTextWriter DOMDocument30 interface,
197-200 class, 166, 168-169, 181
XMLHTTPRequest object, 172-173 drop-down boxes, loading
211-213, 215-219 encoding types, 22-23 element data into, 214-217
XmlNamedNodeMap class, examples of, 93-95, 102, dsTables object, 314
201-202, 204 124, 258-259 dt:type attribute, 107, 111,
XmlNode class, 183-184, inferring DataSet schema 113
186-189 from, 344-345 dt:values attribute, 111
XmlNodeList class, 200-201 loading, 22, 194-196 DTD member, ValidationType
XmlNodeReader class, nesting tags, 19-20 enumerations, 161
209-210 nodes, 183 DTD. See Document Type
Document property, 290 parsing, 181 Definition
Document Type Definition proper coding of, 16-20
(DTD), 4, 7 referencing XML schema
attributes, 97-99 within, 129-130
declaring entities, 38 root elements, 17-18 E
DOCTYPE declaration, 94-95 samples of, 237-238, EDI (Electronic Data
elements, 95, 97 240-241, 246-249 Interface), 169-170, 172-173
entities, 38, 99-101 saving updates, 195 editing
external, 95 schema, 103 color attribute to <font> ele-
internal, 95 shredding, OPENXML ment, 25
notations, 101 rowset provider, database records, 389-394
reasons to use, 90-91 385-386 links, Hypertext Markup
sample eXtensible Markup templates, 236-237 Language (HTML) docu-
Language (XML) docu- transforming Extensible ments, 86
ments, 93-95 Stylesheet Language EdiToXml class, 173
validating documents with, 21, Transformations (XSLT) Electronic Data Interface
90-92 into, 259-260 (EDI), 169-170, 172-173
DocumentElement property, unique identifiers, 27-28 element data, loading into
190 validating, 20-21, 90-92, drop-down boxes, 214-217
documents 159-162 element directive, EXPLICIT
appending information to, versions of, 22 mode queries, 382
standard ASP.NET classes, Extensible Stylesheet element element, 116
46-48 Language Transformations ELEMENT keyword, 95, 97
Document Type Definition (XSLT), 260 element nodes, 154, 156
(DTD), 104-105 Hypertext Markup Language element tag, 108-109
(HTML), changing links, 86 Element value, 339
elementFormDefault attribute
462
elementFormDefault hyperLink, 96-97 xsl:copy, 244, 261

attribute, 129 key, 124-125, 127-128 xsl:copy-of, 244, 261
elements keyref, 124-125, 127-128 xsl:decimal-format, 244
<body>, 251 loc, 83 xsl:element, 244, 261
<font>, 25 locator, 86 xsl:fallback, 244
<html>, 251 maxInclusive, 118 xsl:for-each, 244, 261
<hyperlink>, 24 menuItem, 96-97 xsl:if, 244, 252
<menuItem>, 24-25 message, 436-437 xsl:import, 244
<name>, 24 minInclusive, 118 xsl:include, 245
<root>, applying default MyCustomer, 379-381 xsl:key, 245
namespaces to, 29-30 MyCustomerOrders, 380 xsl:message, 245
address, 261 name, 96, 121-122 xsl:namespace-alias, 245
all, 122 nesting, Hypertext Markup xsl:number, 245
arc, 84-86 Language (HTML), 10 xsl:otherwise, 245, 255
attribute, 116 pattern, 120 xsl:output, 245, 250, 260
AttributeType, 116, 121 portType, 437 xsl:param, 245, 257, 261,
binding, 437-438 recordsets with, 25 263-267
Body, 410-414 Response, 411-412 xsl:preserve-space, 245
Call, 410-411 return, 412 xsl:processing-instruction, 245
child, 53 returning from HTTP queries, xsl:sort, 245
choice, 121-122 362-363 xsl:strip-space, 245
complexType, declaring, root, 17-18, 53-54, 96 xsl:stylesheet, 242, 245, 250,
120-124 row, 261 258, 262, 264, 266
courseDifficulty, 122 selector, 124-125, 127-128 xsl:template, 245, 251-252,
datatype, 113 sequence, 121-122 264
described, 24-25 service, 438 xsl:text, 245
description, 113 simple, 84 xsl:transform, 242, 246, 250,
Document Type Definition simpleType, 118, 120 264, 266
(DTD), 95, 97 type, 436 xsl:value-of, 246, 252, 256,
element, 116 unique, 124-125, 127-128 261
ElementType, 116, 121 when required for attributes, xsl:variable, 246, 261,
embedding Hypertext Markup XLink, 87-88 263-267
Language (HTML) data XML Data – Reduced (XML- xsl:when, 246, 255
into, 41 DR) schema, 106-111 xsl:with-param, 246, 257, 265
Envelope, 407-408 XML schema, 116 ELEMENTS keyword, 362-363
extend, 84 xpath-query, 375 ElementType element, 116,
extendLink, 83 xsd:schema, 128-129 121
Extensible Stylesheet xsl:apply-import, 243 ElementType tag, 106-107
Language Transformations xsl:apply-templates, 243, 253, embedding
(XSLT), 242, 244-245, 255, 257, 265 Hypertext Markup Language
247-253, 255-256, 258-261, xsl:attribute, 244, 255-256, (HTML) data into
263-267 261, 263-264 eXtensible Markup
Fault, 412-414 xsl:attribute-set, 244 Language (XML) elements,
favoriteCourses, 121-122 xsl:call-template, 244, 257, 41
field, 124-125, 127-128 265-267 XPath queries into eXtensible
group, 109, 122 xsl:choose, 244, 255 Markup Language (XML)
Header, 408, 410 xsl:comment, 244 templates, 375-376
eXtensible Markup Language
463
EnableSession property, 425 SqlDbType, 302 eXtensible Markup Language

encoding Simple Object XmlNodeType, 183 (XML)
Access Protocol (SOAP), Envelope element, 407-408 attributes, 25, 27
414, 416-417, 419 EOF property, 139 benefits of, 7-8
Encoding property, 139 erasing database records, capturing data, 226-227, 229
encoding types, eXtensible 389-394 case sensitivity of, 20
Markup Language (XML) error messages character data (CDATA)
documents, 22-23 A name was started with an sections, 40-41
end element nodes, 156 invalid character, 25 comments, 34
End tag ‘Name’ does not End tag ‘Name’ does not converting applications from
match the start tag ‘name’ match the start tag ‘name’, ASP to ASP.NET, 180-181
error message, 20 20 converting applications to,
End tag ‘root’ does not End tag ‘root’ does not match 225-227, 229-231
match the start tag ‘p’ error the start tag ‘p’, 18 converting Electronic Data
message, 18 Incorrect syntax was used in a Interface (EDI) and legacy
end-point(location-set) func- comment, 35 data to, 169, 172-173
tion, 74 Whitespace is not allowed at creating DataSet mappings,
endDocument( ) method, 157 this location, 36 343-345
endElement( ) method, errors creating fragments,
155-156 catching, XmlTextReader XmlTextWriter class,
ending parsing, class, 157-158 198-199
XmlTextReader class, Resource, 75 creating hierarchical menu
157-158 Subresource, 75 applications, 219, 222-223,
ending-resource keyword, 79 Syntax, 75 225
entities XPointer, 75 creating hierarchical relation-
&copy, 36 event handlers, 161-162 ships, DataSets, 345-348
&nbsp, 35 events data generated by WriteXml( )
&reg, 36 onChange, 214-217 method, 321-322
Character, 37-38 Page Load, 280 data generated by
declaring, Document Type ValidationCallBack( ), 162 WriteXmlSchema( ) method,
Definitions (DTDs), 38 ValidationEventHandler, 322-324
described, 35-36 161-162 declarations, 21-23
Document Type Definition ExecuteXmlReader( ) method, documents
(DTD), 99-101 394-396 closing tags, 18-19
External, 38-39, 99 EXPLICIT mode queries, contents of, 14, 16
handling, XmlTextReader 377-384 encoding types, 22-23
class, 157 expressions examples of, 93-95, 102,
Internal, 38-39, 99 identifying fragments with, 124
Parameter, 38, 40, 100-101 XPointer, 76 loading, 22
Standard, 36-37 leaving axes out of, 66 nesting tags, 19-20
entries, users, 211, 214-219 extend element, 84 proper coding of, 16-20
Enumeration attribute, 98 extended link keyword, 79 referencing XML schema
enumerations extended links, 83, 85-86 within, 129-130
CommandType, 302 extendLink element, 83 root elements, 17-18
MappingType, 338 Extensible Hypertext Markup schema, 103
OleDbType, 303 Language (XHTML), 42-44 unique identifiers, 27-28
464
validating, 20-21 example of, 114 eXtensible Markup Language

validation of, 90-92 namespace support, 128- (XML) templates
versions of, 22 129 directives in EXPLICIT mode
elements, 24-25 referencing within queries, 382-383
entities, 35-40 eXtensible Markup querying SQL Server 2000
features of in SQL Server Language (XML) docu- through, 363-365, 367-368
2000, 350 ments, 129-130 sending data from ASP.NET
generating with ASP.NET transforming data returned by pages to, 388-389
objects, 44-45, 47-50 ExecuteXmlReader( ) Extensible Stylesheet
generating, XmlTextWriter method, 395 Language Transformations
class, 168 transforming Extensible (XSLT)
handling whitespaces, 41 Stylesheet Language ADO.NET classes, 272-273,
how ASP.NET developers can Transformations (XSLT) 275-277, 279-282, 284-285,
use, 11-12 into, 258 287
Hypertext Markup Language transforming to Hypertext creating hierarchical relation-
(HTML) vs., 8-10 Markup Language (HTML), ships, DataSets, 345-348
inserting data into tables, 281 creating reusable classes,
OPENXML rowset provider, updating, inserting, and delet- 287-290
387 ing database records, 389- described, 234-235
integrating with ADO.NET, 394 elements, 242, 244-245,
294 validating documents, 247-253, 255-256, 258-261,
loading DataSets, 318-319 XmlTextReader and 263-267
loading element data, drop- XmlValidatingReader examples of, 237-238,
down boxes, 214-217 classes, 159-162 240-241
manipulating, OPENXML viewing DataSets as, 311-314, functions, 267-269, 271
rowset provider, 385-389 316-317 templates, 236-237
mapping fields to attributes, why it’s needed, 5-7 transforming eXtensible
339-341 eXtensible Markup Language Markup Language (XML)
marking up comma-delimited (XML) documents into, 258
files, 5-6 creating structures in transforming Hypertext
namespaces, 27, 29-34 browsers, 205, 207-209 Markup Language (HTML)
origin of, 4-5 examples of, 258-259 into, 237-238, 240-241,
parsing, XmlTextReader class, inferring DataSet schema 246-253, 255-256, 258
138-145 from, 344-345 extension objects, Extensible
processing instructions (PIs), loading, XmlDocument class, Stylesheet Language
34 194-196 Transformations (XSLT),
saving DataSets as, 320-324 nodes, 183 282, 284-285, 287
schema parsing, 181 extensions, .asmx, 431-432
attributes, 116 samples of, 237-238, 240-241, external Document Type
creating unique fields, 246-249 Definition (DTD), 95
keys, and relationships, saving updates, 195 External Entities, 38-39, 99
124-125, 127-128 templates, 236-237 external linksets, XLink,
data types, 116-118, transforming Extensible 86-87
120-124 Stylesheet Language external schema, 104
elements, 116 Transformations (XSLT)
into, 259-260
generating
465
F firewalls, Distributed
Component Object Model
false( ), 66
floor(number), 64
false( ) function, 66 (DCOM), 401 here( ), 74
Fault element, 412-414 FirstChild property, 184, 188, id( ), 60, 75
faultactor subelement, 413 190 lang(string), 66
faultcode message, 408 fixed.14.4 data type, 112 last( ), 60
faultcode subelement, 413 flag values, OPENXML local-name( ), 61
faultstring subelement, 413 rowset provider, 386 name( ), 61
favoriteCourses element, float data type, 112 namespace-uri(node), 61
121-122 floor(number) function, 64 node-set, 60-61
field element, 124-125, following axis, 56 normalize-space(string), 63
127-128 following-sibling axis, 56 not(boolean), 65
fields FOR XML AUTO keyword, number, 64-65
CustomerID, 379-380 363 origin( ), 74
mapping to eXtensible FOR XML EXPLICIT keyword, position( ), 59, 60, 66
Markup Language (XML) 383 range( ), 72-74
attributes, 339-341 FOR XML EXPLICIT keywords, round(number), 65
SOAPAction, 420 380 start-point(location-set), 74
unique, creating with XML FOR XML keywords, query- starts-with(string1,string2), 62
schema, 124-125, 127-128 ing SQL Server 2000 string, 61-64
fileName parameter, 49 through HTTP, 353-358, string-length(string), 63
files 360, 362 string-range(location-
.asmx extension, 431-432 for..next loop, 201 set,string,number,number),
ASP.NET, consuming Web foreach loop, 201, 314 73
services, 438-442, 444 foreign-relation attribute, substring(string,number,num-
comma-delimited, marking up 373 ber), 63
in eXtensible Markup form attribute, 129 substring-
Language (XML), 5-6 forward-only parsing vs. after(string1,string2), 63
defining structures of, 7 in-memory parsing, 135- substring-
eXtensible Markup Language 137, 178-179 before(string1,string2), 62
(XML), building with fragment identifiers, 71 sum(node-set), 64
ASP.NET objects, 44-45, 47- fragments, identifying with translate(string,from,to), 63-64
50 XPointer, 75-76 true( ), 65
Hypertext Markup Language from attribute, 81, 85 XPointer, 72-74
(HTML), screen scraping, functions
422 boolean, 65-66
linkbases, 86-87 ceiling(number), 65
metadata, viewing, 180 concat(string1,string2,string G
schema, 7 3, ...), 62 GenerateXml( ) method, 173
Fill( ) method, 307, 311, 314 contains(string1, string2), 62 generating
FillDropDown( ) method, 280 count( ), 252 eXtensible Markup Language
filtering nodes, 58 count(node-set), 60 (XML)
finally block, 158 end-point(location-set), 74 ASP.NET objects, 44-45,
finding paths to nodes, 54-60 Extensible Stylesheet 47-50
Language Transformations XmlTextWriter class, 168
(XSLT), 267-269, 271 proxies, Web services, 440
generic validation class, creating
466
generic validation class, cre- creating pages to view user

ating, 162-165
H entries, 211, 214-219
GET method, 419 handlers elements, 24
getAllResponseHeaders event, attaching with Extensible Hypertext Markup
property, 213 XmlValidatingReader class, Language (XHTML) vs., 43
GetAttribute( ) method, 141 161-162 eXtensible Markup Language
GetChanges( ) method, 336 setting, XmlText Reader class, (XML) vs., 8-10
GetDateTime( ) method, 284, 153 limitations of links in, 77
287 handling screen scraping files, 422
GetElementByID( ) method, overloaded methods, 427-428 transforming eXtensible
192 processing instructions, white- Markup Language (XML) to
GetElementFromRow spaces, and entities, , 281
method, 329, 336-337 XmlTextReader class, 157 transforming Extensible
GetElementsByTagName( ) whitespaces, 41 Stylesheet Language
method, 192, 204 HasAttributes property, 139 Transformations (XSLT)
GetEnumerator( ) method, HasChildNodes property, into, 246-253, 255-256, 258
200, 202 184, 190 transforming into Extensible
GetExtensionObject( ) HashTable class, 278 Stylesheet Language
method, 277 HasValue property, 139 Transformations (XSLT),
GetName( ) method, 404, 410 Header element, 408, 410 237-238, 240-241
GetNamedItem( ) method, headers, Hypertext Transfer Hypertext Transfer Protocol
202 Protocol (HTTP), 419, 421 (HTTP)
GetNamespaceOfPrefix( ) here( ) function, 74 headers, Simple Object Access
method, 185, 192 Hidden value, 339 Protocol (SOAP), 419, 421
GetOrders( ) method, 411-413 hide directive, EXPLICIT querying SQL Server 2000
GetParam( ) method, 277 mode queries, 381 with templates, XPath, and
GetPrefixOfNamespace( ) hierarchical eXtensible XDR schema, 368-370, 372-
method, 185, 193 Markup Language (XML) 374, 376
GetRemainder( ) method, 141 menu applications, creat- querying SQL Server 2000
getRequest(( ) method, ing, 219, 222-223, 225 with, 351-352, 354-358,
214-217 hierarchies, creating DataSet 360, 362-365, 367-368
getResponseHeader( ) prop- relationships, 345-348
erty, 213 href attribute, 80, 82, 417
GetRowFromElement HTML. See Hypertext
method, 329 Markup Language I
getXML value, 218 HTTP. See Hypertext Transfer i1 data type, 112
GetXml( ) method, 317, 336, Protocol i2 data type, 112
338, 343 HTTP-GET command, 422-423 i4 data type, 112
GetXmlSchema( ) method, HTTP-POST command, 423 i8 data type, 112
317 hyperLink element, 96-97 IContentHandler interface,
greater than ( >) character, Hypertext Markup Language 146-147
36, 96 (HTML) ID attribute, 98, 229-230,
group element, 109, 122 changing links in documents, 390, 417
groups, XML Data – Reduced 86 ID directive, EXPLICIT mode
(XML-DR) schema, 106-111 queries, 381, 383
id variable, 217-218
keywords
467
id( ) function, 60, 75 inserting

identifiers database records, 389-394
J
fragment, 71 eXtensible Markup Language Japanese language, 23
unique, eXtensible Markup (XML) data into tables, Java Remote Method
Language (XML) docu- OPENXML rowset Invocation (RMI), 401
ments, 27-28 provider, 387
Universal Resource (URI), 29 instantiating
identifying fragments in XmlNode classes, 186
XPointer, 75-76 XmlTextReader class, 138, K
IDREF directive, EXPLICIT 153-154 key attribute, 373
mode queries, 381, 383 XmlValidatingReader class, key element, 124-125,
IDREF/IDREFS attribute, 98 160 127-128
IDREFS directive, EXPLICIT XPathDocument classes, key-relation attribute, 373
mode queries, 381 274-275 keyref element, 124-125,
IIS (Internet Information instructions, processing (PI), 127-128
Server), configuring SQL 34, 157 keys, creating with XML
Server virtual directories, int data type, 112 schema, 124-125, 127-128
351-353 interfaces keywords
IL Disassembler tool, viewing DOMDocument30, 181 arc, 79
metadata files, 180 IContentHandler, 146-147 AUTO, 354-355
Implementation property, internal Document Type BINARY BASE64, 363
190 Definition (DTD), 95 ELEMENT, 95, 97
ImportNode(node,deep) Internal Entities, 38-39, 99 ELEMENTS, 362-363
method, 193 Internet Explorer ending-resource, 79
in-memory parsing vs. for- validating eXtensible Markup extended link, 79
ward-only parsing, 135-137, Language (XML) docu- FOR XML, querying SQL
178-179 ments, 92 Server 2000 through HTTP,
Incorrect syntax was used in xml:space attribute, 42 353-358, 360, 362
a comment error message, Internet Information Server FOR XML AUTO, 363
35 (IIS), configuring SQL FOR XML EXPLICIT, 380,
indent attribute, 250 Server virtual directories, 383
InferXmlSchema( ) method, 351-353 inline link, 79
345 Interop, accessing MSXML3 linkbase, 79
information retrieval, through, 179, 181 locator, 79
XPointer, 69-76 IsDefault property, 139 multidirectional link, 79
inline link keyword, 79 IsEmptyElement property, out-of-line, 80
InnerText property, 184, 190 139 RAW, 355
InnerXml property, 184, 190 ISO-8859-1 standard, 23 Schema, using with name-
InsertAfter( ) method, 185, IsReadOnly property, 184, spaces, 105-106
193 190 simple link, 80
InsertBefore( ) method, 185, IsStartElement( ) method, standalone, 23
193 141 traversal, 80
InsertDataSet( ) method, 337 Item property, 139, 184, 190 UNION ALL, 379-381
Item( ) method, 200, 202 XLink, 79-80
itemNumber attribute, 25 xml, 22
XMLDATA, 363, 383
xmlns, 29
label attribute
468
L XmlDataDocument,
325-334
(XML)
label attribute, 81, 85 XmlDocument, 197-199 capturing data, 226-227,
lang(string) function, 66 XmlNamedNodeMap, 229
last( ) function, 60 202-203 converting Electronic Data
LastChild property, 184, 190 XmlNode, property exam- Interface (EDI) to, 172-
launching IL Disassembler ples, 187 173
tool, 180 XmlNodeReader, 209-210 creating fragments,
legacy data, converting to XmlParserContext, 397- XmlTextWriter class,
eXtensible Markup 398 198-199
Language (XML), 169, 1 XpathDocument, instanti- data generated by
72-173 ating, 274-275 WriteXml( ) method,
less than (<) character, 36, 40 XmlTextReader, 142-144, 321-322
LineNumber property, 139 149-152 data generated by
LinePosition property, 139 XmlTextWriter class, WriteXmlSchema( )
linkbase keyword, 79 198-199 method, 322-324
linkbases, XLink, 86-87 XsltArgumentList, adding generating, XmlTextWriter
linking external objects to, class, 168
documents, XLink, 76-77, 283-284 inserting data into tables,
79-83, 85-88 XslTransform, 240-241, OPENXML rowset
Hypertext Markup Language 276 provider, 387
(HTML), limitations of, 77 XsltTransform, 289 loading data directly from
resources, 77, 83, 85-86 creating Simple API for XML SQL Server, 358-360
links (SAX), 147-149 loading element data,
changing, Hypertext Markup creating Simple API for XML drop-down boxes,
Language (HTML) docu- (SAX) parsers,149-152 214-217
ments, 86 DataSets mapping fields to attrib-
declaring, 82 binding to DataGrids, utes, 339-341
extended, XLink, 83, 85-86 318-320 schema, 114-115, 125-127
simple, XLink, 82 creating hierarchical rela- transforming, 281, 395
linksets, external, 86-87 tionships, 346-347 viewing DataSets as,
listings filling with data, 309-310 315-317
asp:Xml Web control, 289-290 filling with multiple tables, eXtensible Markup Language
classes 311-313 (XML) documents
DataSet, 329-334 mapping XSD schema to, creating structure of,
Date/Time extension, 282 341-343 205-208
generic validation, viewing as eXtensible embedding XPath queries
163-164 Markup Language into, 375-376
OleDbCommand, 299-300, (XML), 315-317 inferring DataSet schema
304-305 Deletegrams, 392 from, 344-345
SAXParser, in ASP.NET directives in EXPLICIT mode parsing, 181
pages, 158, 162-165, queries, 382-383 samples of, 93-95, 102,
167-168, 170-173 Document Type Definition 124, 237-238, 240-241,
SqlCommand, 301-302, (DTD) documents, 104-105 246-249, 258-259
298-299 EXPLICIT mode query, 378 schema, 103
Validator, calling from extending complex types, 123
ASP.NET pages, 165
location sets
469
shredding, OPENXML nodes serialization of GetOrders(

rowset provider, adding, XmlDocument ) method into messages
385-386 class, 197-199 from, 411
transforming Extensible selecting, stored procedures
Stylesheet Language SelectSingleNode( ) accessing,
Transformations (XSLT) method, 218-219 OleDbCommand class,
into, 259-260 operations 304-305
eXtensible Markup Language Insert, 391 accessing, SqlCommand
(XML) templates Delete, 392 class, 301-302
ASP.NET page for sending Update, 392 calling, eXtensible Markup
data to, 388-389 parameters Language (XML) tem-
calling stored procedures, in XPath query templates, plates, 366
366 376 Updategrams, 389-392
embedding XPath queries Updategrams, 393 URL-encoding, 356-357
into, 375-376 properties, XmlNode class, Web Service Description
examples of, 364-365 187 Language (WSDL) docu-
in ASP.NET, 367-368 proxy assembly in ASP.NET ment, 433-436
specifying parameters, page, 443 Web services
365-366 proxy code generated by accessing states, 429-430
Extensible Stylesheet WSDL.exe , 441-442 ASP.NET applications
Language Transformations purchase order Electronic with, 447-449
(XSLT) Data Interface (EDI) docu- example of, 445-447
calling external objects ments, 170 greeting, 426
within stylesheets, schema transaction support,
284-286 eXtensible Markup 428-429
document examples, 260 Language (XML) docu- Load( ) method, 193-196, 208,
reusable classes, 287-288 ments, 103 275
transforming from eXtensible Markup loading
eXtensible Markup Language (XML), DataSets, 318-319
Language (XML), 114-115, 125-127 eXtensible Markup Language
259-260 Northwind mapping, 370- (XML) data from SQL
IContentHandler interface, 372 Server, 358-360
146-147 Simple Object Access eXtensible Markup Language
methods Protocol (SOAP) mes- (XML) documents, 22, 194-
CreateMenu( ) method, sages, 405-407 196
222-223, 225, 227-229 Simple Object Access eXtensible Markup Language
ExecuteXmlReader( ), Protocol (SOAP) (XML) element data, drop-
394-395 responses with embedded down boxes, 214-217
overloaded, providing fault information, LoadXML( ) method, 193, 196,
aliases to, 428 412-413 229
Page Load event and responses, serialized loc element, 83
FillDropDown(), 280 GetOrders( ) method, local-name( ) function, 61
SelectSingleNode(), 411-413 LocalName property, 139,
218-219 serialization of GetName() 184, 190
WalkTree( ), 224-225 method into structure LocateRequest message, 401
of, 404 location sets, 71
Location Steps, XPath
470
Location Steps, XPath, 54, members, ValidationType CreateDocumentFragment( ),

59-60 enumerations, 160-161 191
locations, 71 memory, troubleshooting, CreateDocumentType( ), 192
locator element, 86 136 CreateElement( ), 192
locator keyword, 79 menu applications, creating, CreateEntityReference( ), 192
logError parameter, 164 219, 222-223, 225 CreateMenu( ), 221-224
logFile parameter, 164 menuItem element, 96-97 CreateNavigator( ), 185, 192,
LookupNamespace( ) message element, 436-437 273
method, 141 MessageName property, 425 CreateNode( ), 192
loops, 201, 314 messages CreateProcessingInstruction( ),
error 192
A name was started with CreateSignificantWhiteSpace
an invalid character, 25 ( ), 192
M End tag ‘Name’ does not CreateTextNode( ), 192
M-POST method, 419 match the start tag CreateWhitespace( ), 192
managed providers, ‘name’, 20 CreateXmlDeclaration( ), 192
ADO.NET, 296-301, 303, End tag ‘root’ does not DeleteDataSet( ), 337
305-307 match the start tag ‘p’, endDocument( ), 157
manipulating eXtensible 18 endElement( ), 155-156
Markup Language (XML), Incorrect syntax was used ExecuteXmlReader( ), 394-
OPENXML rowset provider, in a comment, 35 396
385-389 Resource, 75 Fill( ), 307, 311, 314
mapping Subresource, 75 FillDropDown( ), 280
fields to eXtensible Markup Syntax, 75 GenerateXml( ), 173
Language (XML) attributes, Whitespace is not allowed GET, 419
339-341 at this location, 36 GetAttribute( ), 141
XSD schema to DataSets, XPointer, 75 GetChanges( ), 336
341-343 faultcode, 408 GetDateTime( ), 284, 287
mapping-schema attribute, LocateRequest, 401 GetElementByID( ), 192
376 Simple Object Access GetElementFromRow, 329,
mappings for DataSets, cre- Protocol (SOAP), 405-407, 336-337
ating with eXtensible 419 GetElementsByTagName( ),
Markup Language (XML), metadata, 4, 180 204
343-345 methods GetElementsByTagName( ),
MappingType enumeration, abort( ), 213 192
338 AcceptChanges( ), 334 GetEnumerator( ), 200, 202
marking up comma-delimited Add( ), 161 GetExtensionObject( ), 277
files, eXtensible Markup AddExtensionObject( ), 277 GetName( ), 404, 410
Language (XML), 5-6 AddNode( ), 230-231 GetNamedItem( ), 202
markup languages, presenta- AddParam( ), 277-278 GetNamespaceOfPrefix( ),
tion vs. description, 8-9 AppendChild( ), 185, 191 185, 192
markup tags, 6 Clone( ), 185, 191 GetOrders( ), 411-413
match attribute, 251 Close( ), 141 GetParam( ), 277
maxInclusive element, 118 CreateAttribute( ), 191 GetPrefixOfNamespace( ),
maxOccurs attribute, 109 CreateCDataSection( ), 191 185, 193
Megginson, David, 137 CreateComment( ), 191 GetRemainder( ), 141
models
471
getRequest( ), 214-217 343 WriteTo( ), 186, 194

GetRowFromElement, 329 RemoveAll( ), 186, 193 WriteXml( ), 48-49, 320-322,
GetXml( ), 317, 336, 338, 343 RemoveChild( ), 186, 193, 338, 343
GetXmlSchema( ), 317 337 WriteXmlDecl( ), 168
ImportNode( ), 193 RemoveExtensionObject( ), WriteXmlSchema( ), 320,
InferXmlSchema( ), 345 277 322-324
InsertAfter( ), 185, 193 RemoveNamedItem( ), 202 XmlDataDocument class,
InsertBefore( ), 185, 193 RemoveParam( ), 277 328-332, 334, 336-338
InsertDataSet( ), 337 ReplaceChild( ), 186, 194 XmlDocument class, 191-194
IsStartElement( ), 141 ResolveEntity( ), 142 XMLHTTPRequest object,
Item( ), 200, 202 ReturnDataSet( ), 313, 334, 213
Load( ), 193-196, 208, 275 337 XmlNamedNodeMap class,
LoadXML( ), 193, 196, 229 Save( ), 195 202
LookupNamespace( ), 141 Save( ), 194 XmlNode class, 185-186
M-POST, 419 SelectNodes( ), 186, 194, 204- XmlNodeList class, 200
MoveToAttribute( ), 141 205, 214-217, 229 XmlTextReader class, 141-142
MoveToContent( ), 141 SelectSingleNode( ), 186, 194, XmlTextWriter class, 168-169
MoveToElement( ), 141, 155 204-205, 218-219, 229, 336- XPathDocument class, 273
MoveToFirstAttribute( ), 141 337 XsltArgumentList class, 277
MoveToNextAttribute( ), 141, send( ), 217 XslTransform class, 275
154 setContentHandler( ), 153 Microsoft
Normalize( ), 186, 193 setErrorHandler( ), 153 article on exchanging docu-
open( ), 217, 300 SetNamedItem( ), 202 ments between computer
openConnection( ), 300, 310 SetParentRow( ), 362 systems, 23
overloaded, handling, 427-428 Skip( ), 142 Distributed Component Object
parse( ), 153 startDocument( ), 154 Model (DCOM), 400
ParseDoc( ), 208, 328 startElement( ), 155 System.Xml assembly, 102
POST, 419 Supports( ), 186, 194 Universal Description,
PrependChild( ), 193 Transform( ), 275, 278, 348 Discovery, and Integration
PrependNode( ), 186 Update( ), 307 (UDDI), 444
Read( ), 141, 154, 162 updateBindings( ), 217 XML Data – Reduced
ReadAttributeValue( ), 141 UrlEncode( ), 356-357 (XML-DR) schema, 101
ReadBase64, 141 Validate, 159, 164-165, 208 minInclusive element, 118
ReadBinHex, 141 WalkTheTree( ), 209 minOccurs attribute, 108
ReadChars, 142 WalkTree( ), 224-225 mixed content models, 97
ReadChars( ), 156 Web services, adding descrip- mode attribute, 252
ReadElementString( ), 142 tions to, 427 model attribute, 106
ReadEndElement( ), 142 WriteAttributeString(attName, models
ReadInnerXml( ), 142 value), 169 closed, 107
ReadNode( ), 193 WriteComment(comment), content, 96, 120
ReadOuterXml( ), 142 169 Distributed Component Object
ReadStartElement( ), 142 WriteContentTo( ), 186, 194 (DCOM), 400-401
ReadString( ), 142 WriteElementString( ), 169 Document Object (DOM)
ReadToEnd( ), 49 WriteEndElement( ), 169 accessing MSXML3
ReadXml( ), 318-319 WriteNodeName( ), 208 through Interop, 179,
ReadXmlSchema( ), 319, 341, WriteStartElement( ), 169 181
models
472
classes, System.Xml name- links, Hypertext Markup support, XML schema,

space and assembly, Language (HTML) docu- 128-129
182-183 ments, 86 System.Web.Services, 431
converting applications to MoveToAttribute( ) method, System.Xml, Document
true eXtensible Markup 141 Object Model (DOM)
Language (XML) appli- MoveToContent( ) method, classes, 182-183
cations, 225-227, 141 using Schema keyword with,
229-231 MoveToElement( ) method, 105-106
creating hierarchical 141, 155 xsd, 115, 119
eXtensible Markup MoveToFirstAttribute( ) Namespaces property, 139
Language (XML) menu method, 141 NamespaceURI property, 140,
application, 219, 222- MoveToNextAttribute( ) 184, 190
223, 225 method, 141, 154 NameTable property, 140,
creating structures, 183, MSXLM3, 10-11, 179, 181 190
205, 207-209 multidimensional arrays, 418 naming attributes, 25
described, 176-178 multidirectional link key- naming conventions, tags, 7
how it works, 136 word, 79 Nested property, 347
in-memory vs. forward- mustUnderstand attribute, nesting
only parsing, 178-179 409 elements, Hypertext Markup
selecting nodes, XPath, MyCustomer element, Language (HTML), 10
204-205 379-381 tags, 19
troubleshooting, 136 MyCustomerOrders element, NetworkCredential class, 166
XmlDocument class, 380 NextSibling property, 184,
189-195, 197-200 188, 190
XMLHTTPRequest object, NMTOKEN/NMTOKENS
211-213, 215-219 attribute, 99
XmlNamedNodeMap class, N node parameter, 48
201-202, 204 name attribute, 106, 110, node points, 71
XmlNode class, 183-184, 252, 257, 263, 266, 276 node tests, XPath, 57-58
186-189 name element, 96, 121-122 node( ) node test, 58
XmlNodeList class, Name property, 139, 184, 190 node-set functions, 60-61
200-201 name( ) function, 61 nodes
XmlNodeReader class, names, bare, 75 accessing, XPointer, 76
209-210 namespace axis, 56 contactName, 229
mixed content, 97 Namespace context, 55, 57, 66, 253
pull vs. push model, 137-138 Recommendation, 29 creating, XmlDocument class,
push namespace-uri(node) 196-200
building, XmlTextReader function, 61 element, checking with
class, 145, 147, 149-154, namespaces XmlTextReader class, 154,
156-159 clearing, 33 156
vs. pull model, 137-138 default, 29-31, 33 eXtensible Markup Language
modifying described, 27, 29 (XML) documents, 183
color attribute to <font> Envelope element, 407-408 filtering, 58
element, 25 qualified, 31-34 finding paths to, 54-60
database records, 389-394 soap, 408 request, 230-231
structure of, 29 root, 66
parameters
473
selecting RecordSet, 294-295, 308 OrdID attribute, 379-380

SelectSingleNode( ) Request, 230 origin( ) function, 74
method, 218-219 Response, 221 other value, 82
XPath, 204-205 SqlConnection, 314 out parameter, 180
text, reading with SqlDataAdapter, 314, 337 out-of-line keyword, 80
XmlTextReader class, stack, 154 OuterXml property, 185, 191
156-157 XMLHTTPRequest, 211-213, output values, SQL and OLE
See also child elements; root 215-219 DB managed providers,
elements XmlXMLHTTPRequest, 183 305-306
NodeType property, 140, 185, oDoc argument, 231 overloaded methods, han-
190 OLE DB Managed Provider dling, 427-428
noNamespaceSchemaLocatio Command class, 298-303, OwnerDocument property,
n attribute, 130 305-306 185, 191
None member, described, 296
ValidationType enumera- OleDbConnection class, 297-
tions, 161 298
none value, 82 OleDbDataAdapter class, P
Normalization property, 140 306-307 Page Load event, 280
Normalize(( ) method, 186, OleDbCommand class, 299- pages
193 300, 304-305 ASP.NET
normalize-space(string) OleDbConnection class, calling SAXParser class
function, 63 297-298 from, 158-159
Northwind database, OleDbDataAdapter class, calling Validator class
124-125, 127-128 306-307 from, 165
not(boolean) function, 65 OleDbDataReader class, 300 calling Web services from,
NOTATION attribute, 99 OleDbType property, 303 442-443
notations, Document Type omit-xml-declaration Hypertext Markup Language
Definition (DTD), 101 attribute, 260 (HTML), creating to view
NULL value, 379 onChange event, 214-217 user entries, 211, 214-219
number data type, 112 onLoad value, 82 Parameter Entities, 38, 40,
number functions, 64-65 oNodeToAppendTo argu- 100-101
number(value) function, 64 ment, 231 parameters
numbers, child sequence, 76 onreadystatechange prop- case sensitivity, eXtensible
erty, 211 Markup Language (XML)
onRequest value, 82 templates, 366
open( ) method, 217, 300
O open(method,url,async,useri
CustomerID, 366, 411
Extensible Stylesheet
objects d,password) property, 213 Language Transformations
ASP.NET, generating openConnection( ) method, (XSLT), 264-266
eXtensible Markup 300, 310 fileName, 49
Language, 44-45, 47-50 opening IL Disassembler logError, 164
dsTables, 314 tool, 180 logFile, 164
extension, Extensible OPENXML rowset provider, node, 48
Stylesheet Language 385-389 out, 180
Transformations (XSLT), order attribute, 107 QueryString, 354
282, 284-285, 287 orders from customers, recordID, 49
retrieving, 444-450
parameters
474
root, 48 POST method, 419 CanResolveEntry, 139

schemaCol, 164 preceding axis, 56 ChildNodes, 184, 188, 190
specifying, eXtensible Markup preceding-sibling axis, 56 ColumnMapping, 339-341
Language (XML) templates, predicate statements, 58-59, CommandType, 302
365-366 365-366 Count, 200, 202
Updategrams, 393 Prefix property, 140, 185, 191 DataSet, 328
Validate( ) method, 164 PrependChild( ) method, 193 DefaultCredentials, 166
version, 49 PrependNode( ) method, 186 Depth, 139
writeAtts, 49 presentation markup lan- Description, 425
writeEndRoot, 49 guages, 8-9 Document, 290
writeXML( ) method, 48-49 preserve, setting attributes DocumentElement, 190
xmlFilePath, 164 to, 42 DocumentType, 190
XPath query templates, 376 PreserveWhiteSpace prop- EnableSession, 425
parent axis, 56 erty, 42, 191 Encoding, 139
ParentNode property, 185, PreviousSibling property, EOF, 139
191 185, 191 FirstChild, 184, 188, 190
parse( ) method, 153 primitive data types, 117 getAllResponseHeaders, 213
ParseDoc( ) method, 208, 328 priority attribute, 252 getResponseHeader(header
parsers, MSXLM3, 10-11 processing instructions (PIs), name), 213
parsing 34, 157 HasAttributes, 139
eXtensible Markup Language processing-instruction( ) HasChildNodes, 184, 190
(XML), XmlTextReader node test, 58 HasValue, 139
class, 138-145 programs Implementation, 190
eXtensible Markup Language building, ASP.NET objects, InnerText, 184, 190
(XML) documents, 181 44-45, 47-50 InnerXml, 184, 190
in-memory vs. forward-only, converting from ASP to IsDefault, 139
135-137, 178-179 ASP.NET, 180-181 IsEmptyElement, 139
strings, 196 converting to true eXtensible IsReadOnly, 184, 190
partially transmitted arrays, Markup Language (XML) Item, 139
418 applications, 225-227, 229- Item, 184, 190
particles, 120 231 LastChild, 184, 190
passing creating, Extensible Stylesheet LineNumber, 139
attributes automatically, 145 Language Transformations LinePosition, 139
authentication credentials, (XSLT), 278-281 LocalName, 139
XmlTextReader class, 166 hierarchical eXtensible LocalName, 184, 190
paths, nodes, 54-60 Markup Language (XML) MessageName, 425
pattern element, 120 menu, creating, 219, 222- Name, 139
percent (%) character, 40 223, 225 Name, 184, 190
period (.) character, 66 sharing data between, 7 Namespaces, 139
PI (processing instructions), properties NamespaceURI, 140
Extensible Markup AttributeCount, 139 NamespaceURI, 184, 190
Language (XML), 34, 157 Attributes, 184, 189 NameTable, 140
plus sign (+) character, 96 BaseURI, 139 NameTable, 190
points, 71-72 BaseURI, 184, 190 Nested, 347
portType element, 437 BufferResponse, 425 NextSibling, 184, 188, 190
position( ) function, 59-60, 66 CacheDuration, 425 NodeType, 140
r8 data type
475
NodeType, 185, 190 XmlResolver, 140 purchase order Electronic

Normalization, 140 XmlResolver, 275 Data Interface (EDI) docu-
OleDbType, 303 XmlSpace, 140 ments, 170
onreadystatechange, 211 XmlTextReader class, 139- push models
open(method,url,async,userid, 140 building, XmlTextReader
password), 213 XslTransform class, 275 class, 145, 147, 149-154,
OuterXml, 185, 191 protocols 156-159
OwnerDocument, 185, 191 Common Object Request vs. pull models, 137-138
ParentNode, 185, 191 Broker Architecture
Prefix, 140 (COBRA), 401
Prefix, 185, 191 Distributed Component
PreserveWhiteSpace, 42, 191 Object Model (DCOM), Q
PreviousSibling, 185, 191 400-401 qualified namespaces, 31-34
QuoteChar, 140 Hypertext Transfer Protocol queries
ReadState, 140 (HTTP) EXPLICIT mode, 377-384
readyState, 212 headers, Simple Object Hypertext Transfer Protocol
Relations, 314, 361 Access Protocol (SOAP), (HTTP), returning elements
responseBody, 212 419, 421 and schema, 362-363
responseStream, 212 querying SQL Server 2000 SELECT, 379
responseText, 212 with templates, XPath, query languages. See XPath
responseXML, 212 and XDR schema, 368- querying SQL Server 2000
send( ), 213 370, 372-374, 376 EXPLICIT mode queries, 377-
setRequestHeader(header querying SQL Server 2000 384
name,value), 213 with, 351-352, 354-358, through Hypertext Transfer
SqlDbType, 302 360, 362-365, 367-368 Protocol (HTTP) with tem-
status, 212 Java Remote Method plates, XPath, and XDR
statusText, 212 Invocation (RMI), 401 schema, 368-370, 372-374,
TransactionOption, 425 Simple Object Access (SOAP) 376
Transform, 290 Body element, 410-414 with Hypertext Transfer
validateOnParse, 159 described, 400, 402, 404 Protocol (HTTP), 351-352,
ValidationType, 160-161 encoding, 414, 416-417, 354-358, 360, 362-365, 367-
Value, 140 419 368
Value, 185, 191 Envelope element, 407- QueryString parameter, 354
WebMethodAttribute class, 408 question mark (?), 22, 96
425 Header element, 408, 410 QuoteChar property, 140
WhitespaceHandling, 140 HTTP headers, 419, 421 quoted attributes, 10
XmlDataDocument class, retrieving customer quotes (“), 20
328-332, 334, 336-338 orders, 444-450 quoting attributes, 20
XmlDocument class, 189-191 structure of, 404, 406-407
XMLHTTPRequest object, Web services, 421-432,
211-212 434-442, 444
XmlLang, 140 XML-RPC, 402 R
XmlNamedNodeMap class, proxies, clients, 439-442
r4 data type, 112
202 pull models vs. push models,
r8 data type, 112
XmlNode class, 184-186 137-138
XmlNodeList class, 200
Ramdom Access Memory
476
Ramdom Access Memory recordsets, 25-26 SQL and OLE DB managed

(RAM), troubleshooting in ref attribute, 116 providers, 305-306
Document Object Model referencing assemblies, ReturnDataSet( ) method,
(DOM), 136 XmlTextReader class, 152 313, 334, 337
range(location-set) function, Reformulation of HTML 4 in returnid attribute, 390-391
73 XML, 42-43 returning elements and
range-inside(location-set) Relation class, 347 schema from HTTP queries,
function, 73-74 Relations property, 314, 361 362-363
range-to(expression) relationships, creating with reusable classes, creating,
function, 72 XML schema, 124-125, 127- 287-290
ranges, 71-72 128 RMI (Java Remote Method
RAW keyword, 355 RemoveAll( ) method, 186, Invocation), 401
Read( ) method, 141, 154, 162 193 role attribute, 81
ReadAttributeValue( ) RemoveChild( ) method, 186, root elements, 53, 96
method, 141 193, 337 colons (:) in, 54
ReadBase64 method, 141 RemoveExtensionObject( ) eXtensible Markup Language
ReadBinHex method, 141 method, 277 (XML) documents, 17-18
ReadChars( ) method, 142, RemoveNamedItem( ) Extensible Stylesheet
156 method, 202 Language Transformations
ReadElementString( ) RemoveParam( ) method, 277 (XSLT), 242
method, 142 removing database records, See also nodes
ReadEndElement( ) method, 389-394 root nodes, 66
142 ReplaceChild( ) method, 186, root parameter, 48
reading 194 round(number) function, 65
Document Type Definition request node, 230-231 row element, 261
(DTD), ease of, 90 Request object, 230 running IL Disassembler tool,
from streams, XmlTextReader required attribute, 110-111 180
class, 154 ResolveEntity( ) method, 142
text nodes, XmlTextReader Resource Error, 75
class, 156-157 resources, linking, 77, 83,
ReadInnerXml( ) method, 142 85-86 S
ReadNode( ) method, 193 Response element, 411-412 Save( ) method, 194-195
ReadOuterXm( ) method, 142 Response object, 221 saving
ReadStartElement( ) method, responseBody property, 212 DataSets as eXtensible
142 responseStream property, Markup Language (XML),
ReadState property, 140 212 320-324
ReadString( ) method, 142 responseText property, 212 schema, 373-374
ReadToEnd( ) method, 49 responseXML property, 212 updates to eXtensible Markup
ReadXml( ) method, 318-319 retrieving customer orders, Language (XML) docu-
ReadXmlSchema( ) method, 444-450 ments, 195
319, 341, 343 return element, 412 SAX. See Simple API for XML
readyState property, 212 return values SAXParser class, 152, 158-159
recordID parameter, 49 accessing, Extensible schema
records, databases, 389-394 Stylesheet Language external, 104
RecordSet object, 294-295, Transformations (XSLT) reasons to use, 90-91
308 templates, 266-267 returning from HTTP queries,
362-363
sID variable
477
sample eXtensible Markup schemaLocation attribute, serialized arrays, 417

Language (XML) docu- 130 Server code, 414
ments, 102 SCM (Service Control Service Control Manger
saving, 373-374 Manager), 400 (SCM), 400
Simple Object Access Protocol scope, default namespaces, service element, 438
(SOAP) messages, 405-407 30-31 services
XDR, querying SQL Server scraping Hypertext Markup consuming, 438-442, 447-450
2000 through HTTP, 368- Language (HTML) files, 422 creating, 445-447
370, 372-374, 376 screen scraping, Hypertext retrieving customer orders,
XML Markup Language (HTML) 444-450
attributes, 116 files, 422 Simple Object Access
creating unique fields, Scribner, Kennard, 404 Protocol (SOAP), 421-432,
keys, and relationships, searching paths to nodes, 434-442, 444
124-125, 127-128 54-60 session state, accessing in
data types, 116-118, sections, character data Web services, 430
120-124 (CDATA), 40-41 setContentHandler( ) method,
elements, 116 security, querying SQL Server 153
example of, 114 2000 directly through HTTP, setErrorHandler( ) method,
namespace support, 363-365, 367-368 153
128-129 select attribute, 252-253, SetNamedItem( ) method, 202
referencing within 255-256, 263-266, 276 SetParentRow( ) method, 362
eXtensible Markup SELECT query, 379 setRequestHeader(header
Language (XML) docu- SELECT statement, 379-381, name,value) property, 213
ments, 129-130 383-384 setting
XML Data – Reduced (XML- selecting nodes attributes to preserve and
DR) SelectSingleNode( ) method, default, 42
attributes, 106-111 218-219 handlers, XmlTextReader
data types, 111-113 XPath, 204-205 class, 153
described, 91 SelectNodes( ) method, 186, ValidationType property,
description element, 113 194, 204-205, 214-217, 229 XmlValidatingReader class,
elements, 106-111 selector element, 124-125, 160-161
example of, 102, 104-105 127-128 variables, Extensible
groups, 106-111 SelectSingleNode( ) method, Stylesheet Language
origin of, 101-102 186, 194, 204-205, 218-219, Transformations (XSLT),
Schema keyword, using 229, 336-337 261, 263-264
with namespaces, self axis, 56 SGML (Standard Generalized
105-106 semicolon (;) character, 38, Markup Language), 4
XSD, mapping to DataSets, 40 sharing data between appli-
341-343 send( ) method, 217 cations and components, 7
validating documents with, 21 send( ) property, 213 shifting, bit, 23
schema files, 7 sending data from ASP.NET Shift_JIS standard, 23
Schema keyword, using with pages to eXtensible show attribute, 81
namespaces, 105-106 Markup Language (XML) shredding eXtensible Markup
Schema member, templates, 388-389 Language (XML) docu-
ValidationType enumera- sequence element, 121-122 ments, OPENXML rowset
tions, 161 sequences, child, 76 provider, 385-386
schemaCol parameter, 164 serialization, 403-404 sID variable, 229-230
Simple API for XML (SAX)
478
Simple API for XML (SAX), creating, Extensible Stylesheet sql:field annotation, 369, 373
137, 145, 147, 149-154, 1 Language Transformations sql:id-prefix annotation, 369,
56-159 (XSLT), 278-281 373
simple data types, Simple hierarchical eXtensible sql:is-constant annotation,
Object Access Protocol Markup Language (XML) 369
(SOAP), 414, 416-417 menu, creating, 219, sql:key-fields annotation,
simple element, 84 222-223, 225 369, 373
simple link keyword, 80 sharing data between, 7 sql:limit-field annotation,
simple links, XLink, 82 source documents, assigning 370
Simple Object Access to asp:Xml Web control, sql:limit-value annotation,
Protocol (SOAP) 290 370
Body element, 410-414 sp xml removedocument sql:overflow-field annota-
described, 400, 402, 404 stored procedure, 385 tion, 370
encoding, 414, 416-417, 419 spaces, &nbsp entity, 36 sql:relation annotation, 369,
Envelope element, 407-408 sparse arrays, 417 373
Header element, 408, 410 specifying parameters, sql:relationship annotation,
HTTP headers, 419, 421 eXtensible Markup 370, 373
retrieving customer orders, Language (XML) templates, sql:target-namespace anno-
444-450 365-366 tation, 370
structure of, 404, 406-407 SQL Managed Provider sql:url-encode annotation,
Web services, 421-432, Command class, 298-303, 370
434-442, 444 305-306 sql:use-cdata annotation, 370
SimpleContent value, 339 described, 296 SqlCommand class, 298-299,
simpleType element, 118, 120 SqlConnect class, 310-311, 301-302
singletons, 71 313-314, 317 SqlConnect class, 310-311,
Skip( ) method, 142 SqlConnection class, 296-297 313-314, 317
slash (/) character, 54, 66 SqlDataAdapter class, 306- SqlConnection class, 296-297
sNodeName argument, 231 307 SqlConnection object, 314
sNodeText argument, 231 SQL Server SqlDataAdapter class, 306-
soap namespace, 408 EXPLICIT mode queries, 307
SOAP. See Simple Object 377-384 SqlDataAdapter object, 314,
Access Protocol eXtensible Markup Language 337
SOAPAction field, 420 (XLM) features in, 350 SqlDataReader class, 300
Social Security Numbers, OPENXML rowset provider, SqlDbType property, 302
eXtensible Markup 385-389 src attribute, 7
Language (XML) docu- querying stack object, 154
ments, 28 through HTTP with tem- standalone keywords, 23
software plates, XPath, and XDR standard ASP.NET classes,
building, ASP.NET objects, schema, 368-370, appending information to
44-45, 47-50 372-374, 376 documents, 46-48
converting from ASP to with HTTP, 351-352, Standard Entities, 36-37
ASP.NET, 180-181 354-358, 360, 362-365, Standard Generalized
converting to true eXtensible 367-368 Markup Language
Markup Language (XML) system tables, 4 (SGML), 4
applications, 225-227, Updategrams, 389-394 start-point(location-set)
229-231 using ADO.NET with, function, 74
394-396, 398
text modes
479
startDocument( ) method, Subresource Error, 75 tags

154 substring(string,number,num <a>, 24, 82
startElement( ) method, 155 ber) function, 63 <body>, 9, 24
starts-with(string1,string2) substring- <br>, 9, 18-19
function, 62 after(string1,string2) <div>, 24
state management, Web function, 63 <font>, 4
services, 429-431 substring- <hr>, 9, 24
statements before(string1,string2) <img>, 7, 9
case, 154, 156 function, 62 <li>, 9
predicate, 58-59 suffix characters, Document <meta>, 9
SELECT, 379-381, 383-384 Type Definition (DTD) <option>, 9
status property, 212 elements, 96 <p>, 9, 18, 82
status variable, 218 sum(node-set) function, 64 after, 390
statusText property, 212 support attribute, 111
Stiver, Mark, 404 namespaces, XML schema, AttributeType, 110-111
storage, text in computers, 128-129 before, 390
23 transactions, Web services, case sensitivity, 10
stored procedures 428-429 closing, 9-10, 18-19
accessing, Command class, Supports( ) method, 186, 194 descriptive, 7
300-301, 303, 305 Switches, WSDL.exe , element, 108-109
calling, eXtensible Markup 439-440 ElementType, 106-107
Language (XML) templates, sync tag, 390 markup, 6
366 syntax, Document Type naming conventions, 7
sp xml removedocument, 385 Definition (DTD), 90 nesting, 19
StreamReader class, 173, 318 Syntax Error, 75 nesting, Hypertext Markup
streams, reading from in system tables, SQL Server, 4 Language (HTML), 10
XmlTextReader class, 154 System.Web.Services name- sync, 390
StreamWriter class, 221 space, 431 targetNamespace attribute,
string functions, 61, 63-64 System.Xml, 11 128-130
string(value) function, 62 System.Xml assembly, 102, templates
string-length(string) func- 134-135, 182-183 eXtensible Markup Language
tion, 63 System.Xml namespace, (XML)
string-range(location- Document Object Model directives in EXPLICIT
set,string,number,number) (DOM) classes, 182-183 mode queries, 382-383
function, 73 querying SQL Server 2000
strings, parsing and loading through, 363-365, 367-
into Document Object 368
Model (DOM), 196 T Extensible Stylesheet
structs, 418-419 T-SQL (Transact Structured Language Transformations
style attribute, 256 Query Language), 52-53 (XSLT), 236-237, 266-267
stylesheets tables querying SQL Server 2000
described, 236 filling DataSets with, 311-313 through HTTP, 368-370,
XSLT, calling external objects inserting eXtensible Markup 372-374, 376
within, 284-286 Language (XML) data into, test attribute, 252, 256
sub-resources, 71 OPENXML rowset tests, nodes, 57-58
subelements, Fault element, provider, 387 text nodes, reading in
413 system, SQL Server, 4 XmlTextReader class,
156-157
text storage
480
text storage, 23 Stylesheet Language Universal Resource Identifier

text( ) node test, 58 Transformations (XSLT), (URI), 29
time data type, 112 237-238, 240-241 Update( ) method, 307
time.tz data type, 112 translate(string,from,to) updateBindings( ) method,
title attribute, 81 function, 63-64 217
Tlbimp.exe utility, 180 traversal keyword, 80 Updategrams, 389-394
to attribute, 81, 85 troubleshooting memory updates, saving to
tokens, 173 problems, Document eXtensible Markup
TomorrowsLearning.com, 289 Object Model (DOM), 136 Language (XML) docu-
tools, IL Disassembler and true( ) function, 65 ments, 195
Tlbimp.exe, 180 type attribute, 81-82, 85, updating
Transact Structured Query 108, 111 database records, 389-394
Language (T-SQL), 52-53 type element, 436 variables, Extensible
transaction support, Web Stylesheet Language
services, 428-429 Transformations (XSLT),
TransactionOption property, 261, 263-264
425 U uri data type, 113
Transform property, 290 UDDI (Universal Description, URI (Universal Resource
Transform( ) method, 275, Discovery, and Integration), Identifier), 29
278, 348 444 URL (Uniform Resource
transforming ui1 data type, 112 Locators)
eXtensible Markup Language ui2 data type, 112 Universal Resource Identifiers
(XML) ui4 data type, 113 (URIs), 29
data returned by ui8 data type, 113 XPath queries and schema,
ExecuteXmlReader( ) underscore ( _ ), 24 374-375
method, 395 Understanding SOAP, 404, UrlEncode( ) method, 356-357
into Extensible Stylesheet 414 user entries, viewing with
Language Unicode Consortium, 23 XMLHTTPRequest objects,
Transformations (XSLT), Unicode Transformation 211, 214-219
classes involved in, Formats (UTFs), 23 UTF (Unicode Transformation
272-273, 275-277, Uniform Resource Locators Formats), 23
279-282, 284-285, 287 (URLs) UTF-16 standard, 23
to Hypertext Markup Universal Resource Identifiers utilities, IL Disassembler and
Language (HTML), 281 (URIs), 29 Tlbimp.exe, 180
eXtensible Markup Language XPath queries and schema, uuid data type, 113
(XML) documents 374-375
examples of, 237-238, UNION ALL keywords, 379-
240-241 381
templates, 236-237 unique element, 124-125,
V
Extensible Stylesheet 127-128 validate( ) method, 159,
Language Transformations unique identifiers, 164-165, 208
(XSLT), 246-253, 255-256, eXtensible Markup validateOnParse property,
258 Language (XML) docu- 159
Hypertext Markup Language ments, 27-28 validating eXtensible
(HTML) into Extensible Universal Description, Markup Language (XML)
Discovery, and Integration documents, 20-21, 90-92,
(UDDI), 444 159-162
Web sites
481
ValidationCallBack( ) event, variables WalkTree( ) method, 224-225

162 default namespaces, 30-31 Web controls
ValidationEventHandler Extensible Stylesheet asp:Xml, 289-290
event, 161-162 Language Transformations DataGrid, 318-320, 334
ValidationType property, (XSLT), 261, 263-264 Web Service Description
setting with id, 217-218 Language (WSDL), 432, 434-
XmlValidatingReader class, sID, 229-230 438
160-161 status, 218 Web services
Validator class, 162-165, 205, _valid, 165 consuming, 438-442, 447-450
207-209 verifying creating, 445-447
value attribute, 120 element nodes, retrieving customer orders,
Value property, 140, 185, 191 XmlTextReader class, 154, 444-450
values 156 Simple Object Access
#FIXED, 99 eXtensible Markup Language Protocol (SOAP), 421-432,
#IMPLIED, 99 (XML) documents, 20-21 434-442, 444
#REQUIRED, 99 validity, eXtensible Markup Web sites
/, 251 Language (XML) docu- article about creating
0, 386 ments, 92 ASP.NET server control,
1, 386 version attribute, 242-243 289
2, 386 version parameter, 49 checking validity of
8, 386 VersionMismatch code, 414 eXtensible Markup
Attribute, 339 versions, eXtensible Markup Language (XML) docu-
default, Document Type Language (XML) documents, 92
Definition (DTD), 99 ments, 22 Common Object Request
Element, 339 viewing Broker Architecture
flag, OPENXML rowset DataSets as eXtensible (COBRA) information, 401
provider, 386 Markup Language (XML), Distributed Component Object
getXML, 218 311-314, 316-317 Model (DCOM) informa-
Hidden, 339 metadata files, 180 tion, 401
MappingType enumeration, user entries, DOM Level 2 Core XML
339 XMLHTTPRequest objects, Specification, 182
none, 82 211, 214-219 Extensible Stylesheet
NULL, 379 virtual directories, configur- Language Transformations
onLoad, 82 ing in Internet Information (XSLT) elements, 243
onRequest, 82 Server (IIS), 351-353 Extensible Stylesheet
other, 82 Visual Studio Magazine (ital- Language Transformations
output, SQL and OLE DB ics), 289 (XSLT) standard, 234
managed providers, 305-306 vReader constructor, 165 Java Remote Method
return vspace attribute, 7 Invocation (RMI), 401
accessing, Extensible Microsoft article on exchang-
Stylesheet Language ing documents between
Transformations (XSLT) computer systems, 23
templates, 266-267 W screen scraping articles, 422
SQL and OLE DB man- W3C. See World Wide Web Simple API for XML (SAX),
aged providers, 305-306 Consortium 137
SimpleContent, 339 WalkTheTree( ) method, 209 TomorrowsLearning.com, 289
Unicode Consortium, 23
Web sites
482
Universal Description, Reformulation of HTML 4 in writing

Discovery, and Integration XML, 42-43 applications
(UDDI) specification, 444 status of XLink language, 76 ASP.NET objects, 44-45,
Visual Studio Magazine (ital- XML 1.0 Recommendation, 47-50
ics), 289 24 Extensible Stylesheet
Web Service Description XML schema specification Language
Language (WSDL) specifi- attributes, 116 Transformations (XSLT),
cation, 438 creating unique fields, 278-281
World Wide Web Consortium keys, and relationships, hierarchical eXtensible
(W3C), 22 124-125, 127-128 Markup Language
XML schema specification, data types, 116-118, 120- (XML), 219, 222-223,
114 124 225
XML-RPC protocol informa- elements, 116 classes
tion, 402 example of, 114 generic validation, 162-
WebMethod attribute, 426- namespace support, 165
429 128-129 reusable, Extensible
WebMethodAttribute class, referencing within Stylesheet Language
425 eXtensible Markup Transformations (XSLT),
WebService attribute, Language (XML) 287-290
426-427 documents, 129-130 comments, eXtensible Markup
WebServiceAttribute class, XPath specification (Version Language (XML), 35
426 1.0), 53 Document Type Definition
WHERE clauses, 58, 365-366 XPointer Candidate (DTD) syntax, ease of, 90
Whitespace is not allowed at Recommendation, 69 WSDL (Web Service
this location error message, WriteAttributeString( ) Description Language), 432,
36 method, 169 434-438
WhitespaceHandling prop- writeAtts parameter, 49 WSDL.exe, creating client
erty, 140 WriteComment( ) method, proxies, 439-442
whitespaces, 22, 36, 41, 157 169
wildcard character (*), 58 WriteContentTo( ) method,
World Wide Web Consortium 186, 194
(W3C), 4-5 WriteElementString( ) X
definitions from, points, 71 method, 169 XDR member, ValidationType
document version recommen- WriteEndElement( ) method, enumerations, 161
dations, 22 169 XDR schema, querying SQL
DOM Level 2 Core XML writeEndRoot parameter, 49 Server 2000 through HTTP,
Specification, 182 WriteNodeName( ) method, 368-370, 372-374, 376
Extensible Stylesheet 208 XHTML (Extensible Hypertext
Language Transformations WriteStartElement( ) Markup Language), 42-44
(XSLT), 234 method, 169 XLink
finalization of XLink and WriteTo( ) method, 186, 194 attributes, 80-81, 87-88
XPointer languages by, 52 WriteXml( ) method, 48-49, described, 52, 76-77, 79
label attributes within 320-322, 338, 343 extended links, 83, 85-86
extended link grouping, WriteXmlDecl( ) method, 168 keywords, 79-80
example of, 85 WriteXmlSchema( ) method, linkbases and external
Namespace Recommendation, 320, 322-324 linksets, 86-87
29 simple links, 82
XPointer
483
XML 1.0 Recommendation, XmlMenu class, 221 ments, 159-162

24 XmlNamedNodeMap class, vs. Document Object Model
XML Data – Reduced 182, 201-202, 204-205, (DOM), 179
(XML-DR) schema 207-209 XmlTextWriter class
attributes, 106-111 XmlNameTable class, 145 creating eXtensible Markup
data types, 111-113 XmlNode class, 182-189, 205, Language (XML) docu-
described, 91 207-209 ments, 166, 168-169,
description element, 113 XmlNodeList class, 182, 172-173
elements, 106-111 200-201 creating XML fragments,
example of, 102, 104-105 XmlNodeReader class, 183, 198-199
groups, 106-111 209-210 in-memory vs. forward-only
origin of, 101-102 XmlNodeType enumeration, parsing, 135-137
Schema keyword, using with 183 push vs. pull models, 137-138
namespaces, 105-106 XmlNotation class, 189 System.Xml assembly,
xml directive, EXPLICIT mode xmlns keyword, 29 134-135
queries, 382 XmlParserContext class, 396, XmlValidatingReader class,
xml keywords, 22 398 validating eXtensible
XML. See eXtensible Markup XmlProcessingInstruction Markup Language (XML)
Language class, 189 documents, 159-162
xml:space attribute, 42 XmlResolver property, 140, XmlXMLHTTPRequest object,
XML-DR. See XML Data - 275 183
Reduced schema XmlSchemaCollection class, XPath
XML-RPC protocol, 402 161 abbreviations, 66, 68-69
XmlAttribute class, 182, 188 XmlSpace property, 140 axis types, 55-57
XmlAttributeCollection class, XmlText class, 189 described, 52-55
201, 204 xmltext directive, EXPLICIT functions, 60-61, 63-64
XmlCDataSection class, 188 mode queries, 382 Location Steps, 54, 59-60
XmlComment class, 188 XmlTextReader class node tests, 57-58
XMLDATA keyword, 363, 383 building SAX-style push predicate statements, 58-59
XmlDataDocument class, 273, models, 145, 147, 149-154, querying SQL Server 2000
324, 326, 328-332, 334, 156-159 through HTTP, 368-370,
336-344, 346-348 creating eXtensible Markup 372-374, 376
XmlDocument class, 182, Language (XML) document selecting nodes, 204-205
189-195, 197-200, 205, structure, 205, 207-209 XPointer as extension of, 72
207-209, 272 in-memory vs. forward-only XPath specification (Version
XmlDocumentFragment class, parsing, 135-137 1.0), 53
188 parsing eXtensible Markup xpath-query element, 375
XmlDocumentType class, 188 Language (XML), 138-145 XPathDocument class, 273,
XmlElement class, 182, 189 passing authentication creden- 275, 358
XmlEntity class, 189 tials, 166 XPointer
XmlEntityReference class, 189 push vs. pull models, 137-138 described, 52, 69-72
xmlFilePath parameter, 164 System.Xml assembly, errors, 75
XMLHTTPRequest object, 134-135 functions, 72-74
211-213, 215-219 validating eXtensible Markup identifying fragments with
XmlLang property, 140 Language (XML) docu- bare names, 75
XPointer
484
identifying fragments with element, 245

child sequences, 76 xsl:sort element, 245
identifying fragments with xsl:strip-space element, 245
expressions, 76 xsl:stylesheet element, 242,
xsd namespace, 115, 119 245, 250, 258, 262, 264, 266
XSD schema, mapping to xsl:template element, 245,
DataSets, 341-343 251-252, 264
xsd:schema element, 128-129 xsl:text element, 245
xsl:apply-import element, xsl:transform element, 242,
243 246, 250, 264, 266
xsl:apply-templates element, xsl:value-of element, 246,
243, 253, 255, 257, 265 252, 256, 261
xsl:attribute element, 244, xsl:variable element, 246,
255-256, 261, 263-264 261, 263-267
xsl:attribute-set element, 244 xsl:when element, 246, 255
xsl:call-template element, xsl:with-param element, 246,
244, 257, 265-267 257, 265
xsl:choose element, 244, 255 XSLT Programmer’s
xsl:comment element, 244 Reference (italics), 258
xsl:copy element, 244, 261 XSLT. See Extensible
xsl:copy-of element, 244, 261 Stylesheet Language
xsl:decimal-format element, Transformations
244 XsltArgumentList class, 273,
xsl:element, 261 276-278, 283-284
xsl:element element, 244 XslTransform class, 240-241,
xsl:fallback element, 244 273, 275-276
xsl:for-each element, 244, XsltTransform class, ASP.NET,
261 289
xsl:if element, 244, 252
xsl:import element, 244
xsl:include element, 245
xsl:key element, 245
xsl:message element, 245
xsl:namespace-alias element,
245
xsl:number element, 245
xsl:otherwise element, 245,
255
xsl:output element, 245, 250,
260
xsl:param element, 245, 257,
261, 263-267
xsl:preserve-space element,
245
xsl:processing-instruction

Dan Wahlin XML For ASP - NET Developers Kaleidoscope

Uploaded by

Copyright:

Available Formats

Dan Wahlin XML For ASP - NET Developers Kaleidoscope

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Dan Wahlin XML For ASP - NET Developers Kaleidoscope

Uploaded by

Copyright:

Available Formats

XML for ASP.

201 West 103rd St., Indianapolis, Indiana, 46290 USA

Copyright © 2002 by Sams ACQUISITIONS EDITOR

1 XML and ASP.NET Development 3

Using ASP.NET Objects to Generate XML ..........................................44

5 Using the XmlTextReader and XmlTextWriter Classes in

Sample Applications-Client Server-Side Hierarchical XML

9 SQL Server 2000, XML, and ASP.NET 349

Mail: Jeff Koch

• Why Do We Need XML? 5

• Differences Between XML and HTML 8

• MSXML3 Versus the .NET Platform’s

• How Can XML Be Used by an ASP.NET

Why Do We Need XML?

Listing 1.1 Marking Up a Comma-Delimited File Using XML

Listing 1.1 continued

Differences Between XML and HTML

Presentation Versus Description

XML requires that all attributes be quoted:

MSXML3 Versus the .NET Platform’s System.Xml

How Can XML Be Used by an ASP.NET

What’s in an XML Document?

LISTING 2.1 A Simple XML Document

Well-Formed XML Documents

• All attribute values must be quoted.

The incorrect code could be remedied by adding in a root element as shown:

<br> Versus <br/>

This code is a less verbose way of handling the following code:

This is easily fixed by adding quotes around the bgcolor attribute:

Valid XML Documents

The XML Declaration

<%@ Page Language=”C#” EnableSessionState=”False” %>

<%@ Import Namespace=”System.Xml” %>

An XML document can be loaded using Microsoft’s System.Xml assembly (discussed in

The xml Keyword

Correct: <?xml version=”1.0” encoding=”UTF-8” standalone=”yes”?>

The XML Document Version

The Encoding Type

The standalone Keyword

LISTING 2.2 XML Elements

Whereas the following are not legal elements:

LISTING 2.3 XML Recordset with Elements Only

LISTING 2.4 XML Recordset with Attributes and Elements

You could make birthName and nickName attributes of fname as well:

LISTING 2.5 Company A’s XML Document—Unique Identifier

LISTING 2.5 continued

LISTING 2.6 Company B’s XML Document—Social Security Number

LISTING 2.7 Working with Different Default Namespaces

LISTING 2.8 A Qualified Namespace Example

In Listing 2.8, the namespace URI (“http://www.anotherCompany.com”) is associated with

The default namespace (“http://www.myCompany.com”) applies to the <root>, <menuItem>,

<companyB:street>1234 Any Street</companyB:street>

XML Processing Instructions

LISTING 2.9 Adding Comments to XML

TABLE 2.1 XML Standard Entities

LISTING 2.10 Character Entity ASP.NET Script

Internal, External, and Parameter Entities

<!ENTITY companyName “Tomorrow’s Learning”>

In this simple example, the HTML (<font color=”#02027a”><b>The current number