0% found this document useful (0 votes)

78 views

Introduction To XML: A Universal Data Format

The document provides an overview of XML including its introduction, structure, editing, parsing and syntax. It discusses the drawbacks of earlier markup languages that led to XML's development. It describes the components of an XML document including the prolog, root element, and logical structure. It also explains XML editing, parsing, browsing and the steps to create a well-formed XML document. Finally, it covers XML syntax elements like comments, processing instructions, character data classification and entities.

Uploaded by

Phuong Le

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

78 views

Introduction To XML: A Universal Data Format

Uploaded by

Phuong Le

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 41

Introduction to XML

A Universal data format

Module Introduction

Welcome to the module, Introduction to XML.

The module describes drawbacks of earlier mark up languages that led to the development of XML.
The module also explains the structure and lifecycle of the XML document. This module covers more on the XML syntax and the various parts of the XML document. In this module, you will learn about: 1. Introduction to XML 2. Exploring XML 3. Working with XML 4. XML Syntax

#1 - Introduction to XML

Outline the features of markup languages and list their drawbacks. Define and describe XML. State the benefits and scope of XML.

Features and Drawback of Markup Languages

Evolution of markup languages: GML SGML HTML. Features SGML ensures to represent the data in its own way. HTML allows the user to use any text editor Drawbacks GML and SGML were not suited for data interchange over the web. HTML possesses instructions on how to display the content rather than the content they encompass.

Evolution of XML

The Extensible Markup Language (XML) was created in order to address the issues raised by earlier markup languages XML is a W3C recommendation. XML is a set of rules for defining semantic tags that Break a document into parts And identify the different parts of the document. XML was developed over HTML.

Features of XML

XML stands for Extensible Markup Language XML is a markup language much like HTML XML was designed to describe data XML tags are not predefined. You must define your own tags XML uses a Document Type Definition (DTD) or an XML Schema to describe the data

XML with a DTD or XML Schema is designed to be self-descriptive

XML Markup

XML markup defines the physical and logical layout of the document. XML's markup divides a document into separate information containers called elements.

A document consists of one outermost element, called root element that contains all the other elements, plus some optional administrative information at the top, known as XML declaration.

Benefits of XML

Data independence: separates the content from its presentation. Easier to parse: frameworks for data exchange. Reducing server load: using DOM to manipulate the data.

Easier to create: it is text-based.

Web site content: transforms to HTML using XSLT and CSS. Remote procedure call: allows distributed computing. Ecommerce: sends data from one company to another.

#2 - Exploring XML Lesson Overview

Describe the structure of an XML document. Explain the lifecycle of an XML document. State the functions of editors for XML and list the popularly used editors. State the functions of parsers for XML and list names of commonly used parsers. State the functions of browsers for XML and list the commonly used browsers.

XML Document Structure

XML documents are commonly stored in text files with extension .xml. The two sections of an XML document are: Document Prolog Root Element

$1- Document Prolog

Help XML parser to get information about the content in the document

Document prolog contains metadata and consists of two parts:

XML Declaration Specifies the version of XML being used Document Type Declaration. Defines entities' or attributes' values Checks grammar of markup Checks vocabulary of markup

$2 - Root Element

Also called a document element. It must contain all the other elements and content in the document. An XML element has a start tag and end tag.

Logical Structure

Gives information about the elements and the order in which they are to be included in the document. It shows how a document is constructed rather than what it contains.

Life cycle of an XML document

XML Editors

The main functions that editors provide are as follows: Add opening and closing tags to the code Check for validity of XML Verify XML against a DTD/Schema Perform series of transforms over a document Color the XML syntax Display the line numbers Present the content and hide the code Complete the word The popularly used editors are: XMLwriter XML Spy XML Pro XMLmind XMetal

Parsers

An XML parser/XML processor reads the document and verifies it for its well-formedness. After the document is verified, the processor converts the document into a tree of elements or a data structure. Speed and performance are the criteria against which XML parsers are selected. Commonly used parsers are: Crimson Oracle XML Parser JAXP (Java API for XML) MSXML

Browsers

After the XML document is read, the parser passes the data structure to the client application (web browser) The browser then formats the data and displays it to the user. Other programs like database, MIDI program or a spreadsheet program may also receive the data and present it accordingly. Commonly used web browsers are as follows: Netscape Mozilla Internet Explorer Firefox Opera

#3 - Working with XML

Explain the steps towards building an XML Define what is meant by well-form XML

Creating an XML document

An XML document has three main components: Tags (markup) and text (content) DTD or Schema Formatting or display specifications

The steps to build an XML document are as follows: Create an XML document in an editor. Save the XML document. Load XML document in a browser.

Exploring the XML document

The various building blocks of an XML document are:

1. 2.

XML Version Declaration Document Type Definition (DTD) Document instance in which the content is defined by the mark up

$1- XML Version Declaration

<?xml It indicates that the document is an XML document version="1.0 Specific the version of XML encoding = "iso-8859-l Characters are encoded using standalone="yes Indicates the presence of external markup declarations. yes" indicates no external mark up declarations no" indicate mark up declarations might exist.

$2 - Document Type Definition (DTD)

<!DOCTYPE student Declares and defines the elements used in the document Externally <!DOCTYPE student SYSTEM "studatabase.dtd"> Internally

$3 - Document instance

< student > This part defines the content of the XML document called as mark up. It describes the purpose and function of each element.

Meaning in Markup

Markup can be divided into following three parts:

Structure 1. Describes the form of the document by specifying the relationship between different elements in the document. 2. It emphasizes to specify a single nonempty, root element that contains other elements and the content
Semantic Describes how each element is specified to the outside world of the document. ex. Web browser assigns "paragraph" to the tags <P> and </P>

Style It specifies how the content of the tag or element is displayed.

Well-formed XML document

Well-formedness refers to the standards that are to be followed by the XML documents. Rules: Minimum of one element is required, XML tags are case sensitive. Every start tag should end with end tag. XML tags should be nested properly. XML tags should be valid. Length of markup names XML attributes should be valid. XML documents should be verified

#4- XML Syntax

State and describe the use of comments and processing instructions in XML. Classify character data that is written between tags. Describe entities, DOCTYPE declarations and attributes.

Comments

Give information about the code Can appear in the document prolog, DTD or in the textual content. Not appear inside the tags or attribute values. Syntax: <! -- <comments> -->

XML Elements

An XML element is everything from (including) the element's start tag to (including) the element's end tag. An element can contain other elements, simple text or a mixture of both. Elements can also have attributes. XML Naming Rules Names can contain letters, numbers, and other characters Names must not start with a number or punctuation character Names cannot contain spaces

Processing Instructions

Processing instructions are information which is application specific. These instructions do not follow XML rules or internal syntax. With the help of a parser these instructions are passed to the application. The main objective of a processing instruction is to present some special instructions to the application. Syntax

Classification of character data

An XML document is divided into markup and character data. Character data describes the document's actual content with the white space. The text in character data is not processed by the parser and thus not treated as a regular text. The character data can be classified into: CDATA PCDATA

PCDATA (parsed character data)

The data that is parsed by the parser The PCDATA specifies that the element has parsed character data. It is used in the element declaration. Escape character like "<" when used in the XML document will make the parser interpret it as a new element.

CDATA

The text inside a CDATA section is not parsed by the XML parser. A text is considered in a CDATA section if it contains '<' or '<&>' characters. The syntax for CDATA "<![CDATA[]]>

The CDATA sections: Cannot be nested. Does not accept line breaks or spaces inside the "]]>" string.

Entities

Entities are a construct that are referenced in the document Every entity consists: name - value. As the XML document is parsed, it checks for entity references.

For every entity reference, the parser checks the memory to replace the entity reference with a text or markup.
Syntax for an entity reference: &<entity name>;. All the entities must be declared before they are used in the document. An entity can be declared either in a document prolog or in a DTD.

Predefined entities

Entity Categories

Entities are used as shortcuts to refer to the data pages. The two types of entities are as follows: General Entity Parameter Entity

Entity Categories

General Entity These are the entities used within the document content. They refer to the content of a named entity. References to these entities: &<entity_name>;

Parameter Entity These types of entities are used only in the DTD. These type of entities are declared in DTD. References to these entities: %<entity_name>;

DOCTYPE declarations

Defines the elements to be used in the document. To indicate what DTD the document adheres to. It can be declared either: In the XML document (internal) Referenced to the external document (external)

Example of DOCTYPE declarations - Internal

Example of DOCTYPE declarations - External

DTD file (note.dtd)

XML file

Attributes

Additional information about the attributes can be given in the form of attributes. Attributes are created in the DTD along with the elements. Every attribute within an element is associated with a name-value pair. Attributes can be used to distinguish between the elements of the same name. Attributes occur in the start-tags after the element name.

Attribute values are always enclosed in single or double quotes.

Attributes are case sensitive and must start with a letter or underscore

Thats all for today !

Introduction to XML Exploring XML Working with XML XML Syntax

Thank you all for your attention and patient !

Components of Android
No ratings yet
Components of Android
36 pages
Bhi & Cae Assessment Cover Sheet
No ratings yet
Bhi & Cae Assessment Cover Sheet
16 pages
List of ISO Standards, 2016
No ratings yet
List of ISO Standards, 2016
32 pages
Security Consideration in Lotus Notes and Domino 7
No ratings yet
Security Consideration in Lotus Notes and Domino 7
244 pages
Linux Sea
No ratings yet
Linux Sea
223 pages
Practical PowerShell Security and Compliance Center
From Everand
Practical PowerShell Security and Compliance Center
Damian Scoles
No ratings yet
XML Quick Guide
No ratings yet
XML Quick Guide
30 pages
Extensible Markup Language
100% (1)
Extensible Markup Language
89 pages
XML Basics
No ratings yet
XML Basics
9 pages
The DOM
No ratings yet
The DOM
6 pages
21 Free E-Books On Linux Programming
No ratings yet
21 Free E-Books On Linux Programming
3 pages
Configuration Profile Reference PDF
No ratings yet
Configuration Profile Reference PDF
123 pages
Linux OS Basic Commands
No ratings yet
Linux OS Basic Commands
132 pages
LINUX Lab Experiments-Edited Version
100% (1)
LINUX Lab Experiments-Edited Version
85 pages
Win Powershell Command 5676
No ratings yet
Win Powershell Command 5676
4 pages
Components of An XML Document
100% (6)
Components of An XML Document
21 pages
Power Shell Scripts 1
No ratings yet
Power Shell Scripts 1
8 pages
Methods of Malware Persistence: On OS X Mavericks
No ratings yet
Methods of Malware Persistence: On OS X Mavericks
57 pages
Ejabberd User Guide
100% (2)
Ejabberd User Guide
102 pages
ZQ410 Unit 5 Transcript: Setting Up Server Security, Approvals, and Quality Gates
100% (1)
ZQ410 Unit 5 Transcript: Setting Up Server Security, Approvals, and Quality Gates
36 pages
Linuxfun PDF
No ratings yet
Linuxfun PDF
365 pages
Sudoers Installation in AIX
No ratings yet
Sudoers Installation in AIX
2 pages
XML Publisher User Guide
No ratings yet
XML Publisher User Guide
230 pages
Arkadi N Anywhere User Guide
No ratings yet
Arkadi N Anywhere User Guide
13 pages
IDA Pro Shortcuts
100% (2)
IDA Pro Shortcuts
1 page
Unix Shells Bash Fish KSH TCSH Zsh-Hyperpolyglot
No ratings yet
Unix Shells Bash Fish KSH TCSH Zsh-Hyperpolyglot
23 pages
Base IDE Netbeans 8.2
No ratings yet
Base IDE Netbeans 8.2
6 pages
How To Write A Bash Script To Run Commands - Linux Tutorials - Learn Linux Configuration
No ratings yet
How To Write A Bash Script To Run Commands - Linux Tutorials - Learn Linux Configuration
10 pages
An A-Z Index of Commands: Windows Powershell
No ratings yet
An A-Z Index of Commands: Windows Powershell
8 pages
Manual - Netgear ReadyNAS Pro 6 RNDP6000
No ratings yet
Manual - Netgear ReadyNAS Pro 6 RNDP6000
132 pages
Domino & Lotus Install 852
No ratings yet
Domino & Lotus Install 852
289 pages
Lotus Domino For Solaris 10
No ratings yet
Lotus Domino For Solaris 10
612 pages
Git-workshop-2024
No ratings yet
Git-workshop-2024
99 pages
Guide To UNIX Using Linux Fourth Edition Chapter 01
No ratings yet
Guide To UNIX Using Linux Fourth Edition Chapter 01
4 pages
Understanding Single Sign-On (SSO) Between IBM WebSphere Portal and IBM Lotus Domino
100% (9)
Understanding Single Sign-On (SSO) Between IBM WebSphere Portal and IBM Lotus Domino
26 pages
Windows+Sysmon+Logging+Cheat+Sheet Aug 2019
No ratings yet
Windows+Sysmon+Logging+Cheat+Sheet Aug 2019
9 pages
Linux Administration a Beginner s Guide Wale Soyinka 2024 Scribd Download
100% (3)
Linux Administration a Beginner s Guide Wale Soyinka 2024 Scribd Download
55 pages
Pick v10r3
No ratings yet
Pick v10r3
243 pages
FTP Command
No ratings yet
FTP Command
9 pages
Input, Process, Output (L)
100% (1)
Input, Process, Output (L)
31 pages
Implementation of Quick UDP Internet Connections - Paper
No ratings yet
Implementation of Quick UDP Internet Connections - Paper
6 pages
Ubuntu or MacOS Whitepaper
No ratings yet
Ubuntu or MacOS Whitepaper
13 pages
Pandas PDF
No ratings yet
Pandas PDF
171 pages
Unix Commands
No ratings yet
Unix Commands
4 pages
KDE Frameworks Cookbook
No ratings yet
KDE Frameworks Cookbook
52 pages
Case Study 6
No ratings yet
Case Study 6
5 pages
Notas Profesionales para Latex (Ingles)
No ratings yet
Notas Profesionales para Latex (Ingles)
60 pages
Mathematica 12. Insetting Objects in Graphics
No ratings yet
Mathematica 12. Insetting Objects in Graphics
3 pages
Python Console Application Development 2
No ratings yet
Python Console Application Development 2
27 pages
Bodhi Linux 3 for Beginners
From Everand
Bodhi Linux 3 for Beginners
Roger Carter
No ratings yet
How to Hack Like a Ghost: Breaching the Cloud
From Everand
How to Hack Like a Ghost: Breaching the Cloud
Sparc Flow
No ratings yet
Creating and Managing Virtual Machines and Networks Through Microsoft Azure Services for Remote Access Connection
From Everand
Creating and Managing Virtual Machines and Networks Through Microsoft Azure Services for Remote Access Connection
Dr. Hidaia Mahmood Alassouli
No ratings yet
Online Security: protecting your personal information
From Everand
Online Security: protecting your personal information
skyline
No ratings yet
A Comprehensive Guide About Computers and Technology
From Everand
A Comprehensive Guide About Computers and Technology
Dale Carnegie
No ratings yet
The complete guide to Hardware Technician Terminology: A simplified guide
From Everand
The complete guide to Hardware Technician Terminology: A simplified guide
Sumitra Kumari
No ratings yet
Email Spam: Fundamentals and Applications
From Everand
Email Spam: Fundamentals and Applications
Fouad Sabry
No ratings yet
How to Switch from Windows to Linux at Home without Fear of Change, Aimed at Users with No Experience in Linux and with Amazing Results
From Everand
How to Switch from Windows to Linux at Home without Fear of Change, Aimed at Users with No Experience in Linux and with Amazing Results
JUAN FERNANDO AVILÉS BLANCO
No ratings yet
The Ultimate Windows 10 Guide: Tips & Tricks to Save Time & Use Windows 10 Like a Pro
From Everand
The Ultimate Windows 10 Guide: Tips & Tricks to Save Time & Use Windows 10 Like a Pro
Jon Albert
No ratings yet
Let's Use Bash on Windows 10! The Lite version
From Everand
Let's Use Bash on Windows 10! The Lite version
John E. Meister, Jr
No ratings yet
Instant Migration from Windows Server 2008 and 2008 R2 to 2012 How-to
From Everand
Instant Migration from Windows Server 2008 and 2008 R2 to 2012 How-to
Santhosh Sivarajan
No ratings yet
OSPF A Clear and Concise Reference
From Everand
OSPF A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet
Foundations of Tensor Analysis For Students of Physics and Engineering With An Introduction To Theory of Relativity (Nasa)
No ratings yet
Foundations of Tensor Analysis For Students of Physics and Engineering With An Introduction To Theory of Relativity (Nasa)
92 pages
3GPP TR - 123975v110000p
No ratings yet
3GPP TR - 123975v110000p
43 pages
ACCP I7.1-Sem 4 Practical Paper Set3 Integrating XML With Java
No ratings yet
ACCP I7.1-Sem 4 Practical Paper Set3 Integrating XML With Java
2 pages
Map Xtreme 2008 Object Model Poster
No ratings yet
Map Xtreme 2008 Object Model Poster
1 page
Update 201 Activationtimeoffset 1 Update 201 Expectreorderingpdcp 2 Update 202 Activationtimeoffset 3 Update 202 Expectreorderingpdcp 4
No ratings yet
Update 201 Activationtimeoffset 1 Update 201 Expectreorderingpdcp 2 Update 202 Activationtimeoffset 3 Update 202 Expectreorderingpdcp 4
4 pages
2G SitesReHomingPlanTemplate
No ratings yet
2G SitesReHomingPlanTemplate
1 page
MapXtreme2008 DevGuide
No ratings yet
MapXtreme2008 DevGuide
580 pages
Manipulating Text
No ratings yet
Manipulating Text
13 pages
Manipulating Text
No ratings yet
Manipulating Text
13 pages
Printing Text File in C
No ratings yet
Printing Text File in C
5 pages
Debugging Microsoft
No ratings yet
Debugging Microsoft
15 pages
Adding Functionality To A
No ratings yet
Adding Functionality To A
19 pages
Binding Source For DataGridView From Linq To SQL Query
No ratings yet
Binding Source For DataGridView From Linq To SQL Query
4 pages
Web Form
No ratings yet
Web Form
25 pages
A Software Engineer Learns HTML5 JavaScript and Jquery Dane Cameron PDF
100% (1)
A Software Engineer Learns HTML5 JavaScript and Jquery Dane Cameron PDF
703 pages
FM-XML-Cookbook
No ratings yet
FM-XML-Cookbook
110 pages
RTU Solution 5CS4-04 Computer Graphics & Multimedia
No ratings yet
RTU Solution 5CS4-04 Computer Graphics & Multimedia
43 pages
Abdul Rahman
No ratings yet
Abdul Rahman
127 pages
Top 50 XML Interview Questions & Answers: 1. What Is A Markup Language?
No ratings yet
Top 50 XML Interview Questions & Answers: 1. What Is A Markup Language?
9 pages
Mil STD 3001 1
No ratings yet
Mil STD 3001 1
190 pages
Subject Name: Web Engineering Subject Code: CS-7003 Semester: 7
No ratings yet
Subject Name: Web Engineering Subject Code: CS-7003 Semester: 7
25 pages
Web Development Lab: Rajat Goyal 1 14IT056
No ratings yet
Web Development Lab: Rajat Goyal 1 14IT056
20 pages
WebTechnology Study Materials
100% (2)
WebTechnology Study Materials
143 pages
Mos Protocol PDF
No ratings yet
Mos Protocol PDF
167 pages
'O'-level-(M2.R4)-Chapter-8 (4) 123
No ratings yet
'O'-level-(M2.R4)-Chapter-8 (4) 123
4 pages
(Web Design / Web Technology / Web Engineering) : Follow Us On Facebook Join Our Telegram Channel Join Discussion Board
No ratings yet
(Web Design / Web Technology / Web Engineering) : Follow Us On Facebook Join Our Telegram Channel Join Discussion Board
31 pages
Document Management Techniques and Techn
No ratings yet
Document Management Techniques and Techn
9 pages
HTML Interview Questions and Answers
No ratings yet
HTML Interview Questions and Answers
28 pages
transactional-updates_en
No ratings yet
transactional-updates_en
23 pages
HyTime-Hypermedia-Time Document Structuring Language
No ratings yet
HyTime-Hypermedia-Time Document Structuring Language
17 pages
Chapter 7: Information Representation Method - XML Solutions
No ratings yet
Chapter 7: Information Representation Method - XML Solutions
5 pages
Introduction To Java Scripts
No ratings yet
Introduction To Java Scripts
5 pages
6-Evolution of Technical Publications Fulsunge
100% (1)
6-Evolution of Technical Publications Fulsunge
23 pages
Mincom: Release Notes
No ratings yet
Mincom: Release Notes
44 pages
Module 2 - XML
No ratings yet
Module 2 - XML
68 pages
3 ADVANCED MOBILE STORE DOCUMENTATION
No ratings yet
3 ADVANCED MOBILE STORE DOCUMENTATION
57 pages
IWT Unit-1 Notes: Dept. of CSE, PIEMR, Indore Prepared By: Er. Ankit Chopra, Asst. Prof., CSE
No ratings yet
IWT Unit-1 Notes: Dept. of CSE, PIEMR, Indore Prepared By: Er. Ankit Chopra, Asst. Prof., CSE
17 pages
Sathi A Das 2003
No ratings yet
Sathi A Das 2003
10 pages
Standards For CAD Data Exchange: S.Balamurugan
No ratings yet
Standards For CAD Data Exchange: S.Balamurugan
28 pages
Web Technology Handout
No ratings yet
Web Technology Handout
159 pages
Online Voting
No ratings yet
Online Voting
32 pages
Quest Homework - Get Your Assignments Done With
100% (1)
Quest Homework - Get Your Assignments Done With
106 pages
Unit-1 Web and Internet Technology
No ratings yet
Unit-1 Web and Internet Technology
21 pages

Introduction To XML: A Universal Data Format

Uploaded by

Introduction To XML: A Universal Data Format

Uploaded by

Introduction to XML

A Universal data format

Welcome to the module, Introduction to XML.

Features and Drawback of Markup Languages

XML with a DTD or XML Schema is designed to be self-descriptive

Easier to create: it is text-based.

#2 - Exploring XML Lesson Overview

XML Document Structure

$1- Document Prolog

Document prolog contains metadata and consists of two parts:

Life cycle of an XML document

#3 - Working with XML

Creating an XML document

Exploring the XML document

The various building blocks of an XML document are:

$1- XML Version Declaration

$2 - Document Type Definition (DTD)

Markup can be divided into following three parts:

Style It specifies how the content of the tag or element is displayed.

Well-formed XML document

#4- XML Syntax

Classification of character data

PCDATA (parsed character data)

Example of DOCTYPE declarations - Internal

Example of DOCTYPE declarations - External

DTD file (note.dtd)

Attribute values are always enclosed in single or double quotes.

Thats all for today !

Introduction to XML Exploring XML Working with XML XML Syntax

Thank you all for your attention and patient !

You might also like