.NET
Java
Open Source
Mobile
Database
Architecture
RIA & Web
- CSS
- Flash
- Flex
- HTML
- JavaScript
- Silverlight
- XML
Toolbox

Extensible Markup Language (XML) Tutorial

27 Jun 2003 | by Gez Lemon | Filed in

Comments
PDF

Using a Document Type Definition (DTD)

Structured Data

Each element in an XML document has a relationship with other elements, which defines the structure of the data. Structuring data ensures that data is found in the correct place, and adds context to the document. This results in self-describing, organised information, separating content from style. Explicit rules state where a specific part of the document structure may exist. Structured data is easily processed by search engines, as they're able to index only the relevant elements.

The explicit rules that state where elements may exist are defined in a Document Type Definition (DTD). The DTD provides a formal definition of the document structure and elements that may be used. An XML document is said to be valid if it contains a DTD, and the content conforms to the constraints expressed in the DTD. The DTD is part of the prolog, and must be placed before the root element of the document.

The following is the contents of an XML document called user.xml. If your browser has an XML Parser, you can View the XML Document here. If your browser doesn't have an XML Parser, you will just see the contents of the XML document.

user.xml <?xml version="1.0" ?> <!DOCTYPE user [ <!ELEMENT user (name,email)> <!ELEMENT name (forename, surname)> <!ELEMENT forename (#PCDATA)> <!ELEMENT surname (#PCDATA)> <!ELEMENT email (#PCDATA)> ]> <user> <name> <forename>Gez</forename> <surname>Lemon</surname> </name> <email>[email protected]</email> </user>

The above example is an XML Document defining a DOCTYPE of "user", where "user" is the top-level element. Following the document type declaration are the element declarations. The element declarations determines how often, and in what context the elements appear in the document. The document consists of the elements "name" and "email", in that order. The "name" element is defined as having the child elements, forename, and surname, in that order.

Character Data (CDATA)

AN XML document consists of markup, and character data, where the character data is the text of the document. The markup provides information about the character data, and is differentiated from the character data using special characters. The special characters used to differentiate markup from character data are angled brackets ("<", and ">"), ampersands ("&"), and semicolons (";"). Data specified as Character Data (CDATA), will not be parsed by the XML parser. The following example uses a CDATA section to define a JavaScript section in an XHTML document.

<script type="text/javascript"> <![CDATA[ function someFunction() { // Function definition } </script>

Parsed Character Data (#PCDATA)

The "forename", "surname", and "email" elements are defined as elements that can contain Parsed Character Data (#PCDATA). PCDATA is data validated to ensure it is valid. PCDATA may not contain the characters used to differentiate markup from character data.

The DTD can be stored in an external file. In this case, the DOCTYPE declaration contains the name of the external file.

user.dtd < !ELEMENT user (name,email)> <!ELEMENT name (forename, surname)> <!ELEMENT forename (#PCDATA)> <!ELEMENT surname (#PCDATA)> <!ELEMENT email (#PCDATA)>

The xml document then specifies the location for the external DTD.

user.xml < ?xml version="1.0" ?> <!DOCTYPE user SYSTEM "user.dtd"> <user> <name> <forename>Gez</forename> <surname>Lemon</surname> </name> <email>[email protected]</email> </user>

You might also like...

Comments

About the author

Gez Lemon

I'm available for contract work. Please visit Juicify for details.

www.juicystudio.com

Interested in writing for us? Find out more.

XML tutorials

XML books

Access 2010 Bible

The expert guidance you need to get the most out of Access 2010Get the Access 2010 information you need to succeed with this comprehensive reference. If this is your first encounter with Access, you'll appreciate the thorough attention to database fu...

XML forum discussion

Invitation to take part in an academic research study

by researchlab (0 replies)
How to insert & edit unique value using store procedure

by umeshdaiya (0 replies)
How to troubleshoot Epson laser printer?

by daisywyatt618 (0 replies)
view state is stored after the page post-back

by shriniwas.khatri852 (0 replies)
Transfer selected rows from one GridView to another GridView in aspxform(ASP.NET)

by dorsa (0 replies)

XML podcasts

Stack Overflow Podcast: SE Podcast #27 – Dave Winer

Published 9 years ago, running time 1h2m

Jeff & Joel are joined today by Dave Winer, who’s upset that we don’t have a jingle to start the show! He “invented” (well, pioneered, really) the XML-RPC protocol. Dave tells us the story of how and why the protocol came to be. Right now, Dave’s working on a “magnificent symphony of software

Managed hosting by Everycity

Extensible Markup Language (XML) Tutorial

Using a Document Type Definition (DTD)

You might also like...

Comments

About the author

Gez Lemon

XML tutorials

XML books

Access 2010 Bible

XML forum discussion

Invitation to take part in an academic research study

by researchlab (0 replies)

How to insert & edit unique value using store procedure

by umeshdaiya (0 replies)

How to troubleshoot Epson laser printer?

by daisywyatt618 (0 replies)

view state is stored after the page post-back

by shriniwas.khatri852 (0 replies)

Transfer selected rows from one GridView to another GridView in aspxform(ASP.NET)

by dorsa (0 replies)

XML podcasts

Stack Overflow Podcast: SE Podcast #27 – Dave Winer

Published 9 years ago, running time 1h2m

Contribute

Web Development

Developer Jobs

Our tools