Nearby: XMLSpec page
This report describes the XML specification ("XMLspec") DTD used for World Wide Web Consortium (W3C) specifications and notes related to XML. It is maintained by Eve Maler (<elm@east.sun.com>, +1 781 442 3190) of Sun Microsystems. This release of the report corresponds to the DTD with the formal public identifier "-//W3C//DTD Specification Version 2.1//EN".
This report describes the DTD used for World Wide Web Consortium (W3C) specifications and notes related to XML.
This report is available at http://www.w3.org/XML/1998/06/xmlspec-report-v21.htm. The DTD file is available at http://www.w3.org/XML/1998/06/xmlspec-v21.dtd. The latest version of the DTD is always available at http://www.w3.org/XML/1998/06/xmlspec.dtd, and the latest version of the report is always available at http://www.w3.org/XML/1998/06/xmlspec-report.htm.
The following people have been major contributors to the DTD design:
| Jon Bosak, Sun |
| Tim Bray, Textuality |
| Dan Connolly, W3C |
| Eve Maler, Sun |
| Gavin Nicol, Inso |
| C. Michael Sperberg-McQueen, W3C |
| Lauren Wood, SoftQuad |
| James Clark |
This report is organized as follows:
Chapter 1 lists the sources of input used during the DTD design effort, describes the project design parameters, and describes global outstanding issues. Read this to understand the basic principles underlying the design results and to review the issues.
Chapter 2 through Chapter 5 contain the markup models for the DTD, and Chapter 6 describes information linking relationships, nontextual data formats, and special symbols that the DTD encodes. Read these chapters to understand how the markup will be used with World Wide Web Consortium XML information, and the reasons for their design.
Note that, where appropriate, some processing expectations have been documented for the markup. This information is not to be considered a complete style specification; it simply records known requirements.
Chapter 7 contains the common categories of elements and the commonly used element mixtures in content models. Read this to understand the "mixture" content models described in Chapter 3, Chapter 4, and Chapter 5.
Appendix A contains a non-normative list of known applications that do formatting and other processing of XMLspec documents.
Appendix B contains a listing of the DTD.
To understand the graphical "elm tree diagrams" used in this report, use the following legend.

Following is information on the sources of analysis input, the design principles governing the markup model, the implementation principles governing its expression in DTD form, and outstanding issues.
The following have been used as analysis input in designing the DTD:
The original XML specification DTD, jointly developed and revised by Michael Sperberg-McQueen, Tim Bray, and Jon Bosak
Michael's SWEB documentation
The XML, XLink, XPointer, DOM, MathML, XSL, and Schema specifications and their customizations of XMLspec
The XML/SGML comparison note
Following are the design principles governing the markup model of the DTD.
Although the DTD has come to be called "XMLspec," it is intended for W3C working drafts, notes, recommendations, and all other document types that fall under the category of "technical reports."
The DTD is responsible for covering three main aspects of XML technical reports:
Basic W3C technical report structure and content, including paragraphs, lists, cross-references, and so on
Markup structures specific to the XML-related family of W3C technical reports, such as EBNF productions, validity constraints, and element and attribute names
Proper headers and metadata for W3C technical reports
The DTD is intended to support the following functions, in order of priority:
Production of technical reports
First and foremost, the DTD should facilitate hassle-free production and publication. Many of the documents in the scope are made available in several output forms, including source XML and derived HTML, RTF, and PostScript. It is important to produce these outputs in a form that meets W3C requirements, and produce them quickly in order to speed the W3C publication release process. Also, it may be useful to extract different parts of the document content (for example, just the productions) for distribution.
In recent revisions, the markup has evolved to capture an increasing amount of technical detail, for example, the IDL structures contributed from the DOM work. This is motivated entirely by considerations of successful publication from a single source to multiple outputs, rather than by abstract concerns about source purity; however, it makes an effective illustration of the power of XML.
Creation and modification of content
The DTD should provide an intuitive, efficient interface to the creation process. This means that the DTD shouldn't be overly large or complicated, but that it should provide support for information structures using the jargon, and to the depth, that authors will tend to understand the information.
Review of content
To a lesser degree, the DTD should support the informal workflow that goes on when co-editors pass around drafts for review. To this end, the DTD should provide markup for editor "communication" inside the document source.
Proof of concept of XML publishing
Finally, where possible, this DTD and its associated applications should use good XML practice and conforming XML tools, because many will look to this application as an example.
This DTD's first few revisions existed before there were any XML parsers, and indeed before XML 1.0 itself was standardized. In today's world, XML's future has been assured and XML tools are plentiful. However, while a "proof of concept" is no longer needed, XMLspec does in fact continue to serve as an example of best practices.
The DTD should avoid presentational markup where possible. Sometimes this principle comes into conflict with the production focus, but in general, presentation independence helps serve the goal of production of multiple outputs. In any case, egregious examples of formatting-specific markup should be avoided.
The following information gives background on implementation decisions.
This DTD was previously given a formal public identifier in the following pattern:
-//W3C//DTD XML Specification::yyyymmdd//EN
For easier identification, the DTD now uses a formal public identifier in the following pattern:
-//W3C//DTD XML Specification Version n.n//EN
The current version is identified as:
-//W3C//DTD Specification Version 2.1//EN
The policy, beginning with Version 2.0, is as follows.
Minor revisions (revising n.m to produce n.m+1, for example, going from V2.0 to V2.1 or from V2.1 to V2.2) can add to the markup model, but cannot change it in a backwards-incompatible way. For example, a new kind of list element could be added, but it would not be acceptable for the existing itemized-list model to start requiring two list items inside it instead of only one. Thus, any document conforming to an old version would also conform to the new minor revision.
Major revisions (revising n.m to produce (n+1).0, for example, going from V2.1 to V3.0) can both add to the markup model and make backwards-incompatible changes. Ideally these will be accompanied by XSLT transformation specifications that help legacy documents to conform to the new version.
Always review the change history in any new version of the DTD carefully before deploying it.
Currently, DTD changes are at the discretion of the maintainer and the heaviest users of the DTD within the W3C. A more formal procedure may be put in place later.
This DTD does not yet have an official schema. If you want to refer to the XMLspec vocabulary for namespace management purposes, use the following URI to identify it:
http://www.w3.org/XML/1998/06/xmlspec-v21.dtd
Note that this identifying URI may change in the future.
The element names from the original DTD on which XMLspec was based were mostly kept; changes were made in a few cases only to rationalize the naming scheme.
Hyphens are typically avoided, except in the "w3c-" prefix.
Whitespace and tabs are used relatively sparingly to enhance readability; excessive whitespace is avoided in the interest of a compact and "unthreatening" DTD.
Parameter entities are used in several different capacities in the DTD. To indicate their different roles, unique suffixes are used as follows:
The name, declared value, and default value specifications for a set of one or more attributes.
Some descriptions may have a sub-suffix, such as -req, which means that the attribute (or one of the attributes) in question is required.
A set of related elements that are typically available as options in certain content model "free mixtures" (repeatable-OR groups). These entities are referenced from within descrip.mix entity declarations and content models.
If you add a new standalone or phrase-level element, make sure that you add it to the appropriate class entity, or create a new class for it. If you create a new class, incorporate references to that class in the appropriate mixture entity declarations.
A set of elements that are available to writers in certain contexts as a "free mixture" (repeatable-OR group). These entities are referenced from within content models.
A set of #PCDATA and elements that are available to writers in certain contexts as a "free mixture" (repeatable-OR group). These entities are referenced from within content models. The presence of #PCDATA makes these "mixed" content models, which means that document creators can type regular text here and that downstream applications will need to take account of whitespace found here.
An empty placeholder that is available to be used in extending an attribute list.
An empty placeholder that is available to be used in extending an element class.
A content model fragment (other than a "free mixture") that is common or customizable.
The goal in naming the entities was to be consistent and brief, without losing readability. The keyword indicating the entity type always appears last because the location of an entity reference will already give a clue as to the entity type, and so this is not the information that needs to be seen first when the DTD is read. This naming scheme also allows for easier searching.
The DTD conforms to XML V1.0 and makes use of the XML and XLink namespaces. Because DTDs and namespaces don't mix well, the xlink namespace prefix has been hardwired. (The xml prefix is hardwired into the XML specification itself.)
While XLink is used for all URI-style linking, the IDREF addressing mechanism is still used heavily for internal links. As support for the XPointer construct xpointer(id("xxx")) grows and that specification stabilizes, we will consider moving to this style of addressing for internal linking.
This version of the DTD does not yet conform fully to the latest working draft of the XLink specification, because this would have required backwards-incompatible changes. When the XLink specification reaches Recommendation status and if such backwards-incompatible changes are still required, a new major version of this DTD will be published that will require changes to legacy documents.
The DTD is beginning to be used more widely by W3C contributors (as well as by authors of non-W3C specifications). While this DTD was designed with the needs of XML technical reports firmly in mind, quite a lot of the markup design would be useful for technical reports produced by others in the W3C. Therefore, the DTD has been parameterized to allow for:
Modification of certain content models that are likely to be subject to personal and Working Group preference
Addition of new elements at the "standalone" and "phrase" levels
Some limited removal of existing elements at the "standalone" and "phrase" levels
Addition of new attributes to an attribute list
If it is found that the DTD can be made more widely useful solely by heavier parameterization, it would probably be worth it to add the new parameters.
Heed the following advice if you plan to develop a variant of the DTD:
Plan and document both the substance of your changes and the reasons for them.
Build variants only by redefining the original parameter entities, if possible; don't edit any of the original DTD files.
If you plan to interchange your files with other DTD users, favor markup changes that place tighter validation restrictions on documents (subsets), rather than changes that would allow instances that no longer conform to the standard DTD (extensions).
Avoid confusion by using a different formal public identifier for your DTD variant if you have changed any element or attribute markup characteristics. You may want to indicate the derivation with an identifier that uses the following pattern:
-//owner-ID//DTD XML Specification Version n.n-Based your-descrip-and-version//lang
Following are outstanding global issues:
Consider adding an optional href to name and affiliation, and allowing them and email in regular text.
Revise XLink usage as necessary. Note that in the current version, all the XLink simple linking elements in this DTD do not conform to the latest XLink working draft; a major revision of XMLspec (along with conversions of legacy documents) will be needed to bring it back into sync. See the section called XLink-Related Attributes in Chapter 2 for more information.
According to W3C guidelines (http://www.w3.org/Guide/Reports), all references should be to a bibliography entry, and then the bibliography entry should point to the Web resource (if possible). This would suggest that we should freely allow bibref, but allow loc only in the special header fields such as "Latest Version" and in bibliography entries. Consider migrating over to this scheme at the next major revision (because it is backwards incompatible).
Research the proper form of Web resource bibliography entries, and consider adding to the structure of bibl to optimize for it.
Perhaps related is the fact that titleref is freely allowed in paragraphs as well as in bibl. Since titleref is like a restricted or subclassed form of loc, it may also be obsolescent. In addition, titleref appears to duplicate the hypertext function of bibl (or maybe it's the other way around, since it may be inappropriate to make the entire contents of bibl "hot").
Dan Connolly has requested that the element type names in this DTD match HTML wherever possible. The question is, how much is possible? About a dozen of the element types in this DTD are strongly reminiscent of element types in HTML. However, in all cases, there are subtle differences (sometimes simply amounting to different subelements allowed inside the element in question). Should the element type names be made to match?
Following are the potential correspondences:
| XML specification DTD | HTML | Comments |
|---|---|---|
| loc | a | The semantic is slightly different |
| p | p | No change needed |
| ulist | ul | |
| item | li | |
| olist | ol | |
| slist | sl | For consistency; not HTML-based |
| glist | dl | Contents are significantly different |
| gitem | dli | For consistency; not HTML-based |
| label | dt | |
| def | dd | |
| blist | bl | For consistency; not HTML-based |
| eg | pre | The semantic and contexts are different |
| graphic | img | The source attribute would also have to be renamed to src |
| emph | em |
If a glossary list is used to organize term definitions, how can termdef properly be used? Currently, at least in the XML Linking drafts, the contents of the label element are surrounded with a termdef element and a term element isn't provided, while the actual definition text in def goes unmarked-up as such.
Consider making the nt element be empty, and have its content generated by reference to the production's lhs. This would avoid redundancy.
Incorporate the XHTML 1.0 table model, now that it's a Recommendation.
Add an inline graphic element.
Consider adding attributes to division elements to record whether the division has WG consensus, whether it is obsolete, etc.
At the next major revision (because it is backwards-incompatible):
Require the abstract to go before the status section
Fix the order of prevlocs and latestloc
Revise XLink usage in backwards-incompatible ways as necessary
Require graphic alt text
Require issue heads?
Change over to use constraint and constraintnote exclusively, since only the XML spec needs wfc/wfcnote and vc/vcnote?
Consider changes to BNF internal structure? (Check against DocBook's use of the BNF structure)
Following is the change history for the DTD. Note that you can search the DTD file for "#" in the first column to see change history comments.
Added proto element, its arg subelement, and the %argtypes; entity.
Added function, var, sub, sup, phrase, el, att, attval elements.
Expanded emph to %p.pcd.mix;.
Allowed status and abstract to appear in the opposite order.
Updated XLink usage to the latest WD, except for href and source.
Removed the xml:attributes attribute from graphic.
Added %local.graphic.att; to graphic.
Added common diff attribute.
Added div5 element.
Broadened content models of publoc, prevlocs, and latestloc.
Added head, source, resolution, and status attribute to issue.
Added cr, issues, and dispcmts to w3c-doctype attribute on spec.
Added example element.
Added %loc.class; to all PCD mixes that didn't already have it.
Removed unused %loc.pcd.mix;.
Made version in spec header optional.
Added three new attributes to spec.
Broadened content of edtext.
Added optional copyright element to header.
Reorganized XLink-related parameter entities; added xmlns:xlink.
Changed edtext content from #PCDATA to %p.pcd.mix;.
Added show/actuate atts and default values to all href elements.
Changed versioning scheme from 8-digit dates to version numbers.
Added w3c-doctype, other-doctype, status atts to spec element.
Added prodrecap element inside scrap.
Added headstyle attribute to scrap.
Fixed character entities with respect to escaping of ampersands.
Added many more explanatory comments.
Added new unused elements to support DocBook translation.
Updated maler phone numbers.
Fixed %illus.class; to mention table instead of htable.
Added definitions to %illus.class; for DOM model.
Added DOM definitions element and its substructure.
Updated XLink usage in %href.att; to use xlink:form and #IMPLIED.
Added clarifying comments to HREF-using elements.
Declared generic constraint and constraintnote elements.
Added constraintnote to %note.class;.
Added constraint to %eg.pcd.mix; and prod content model.
Fixed mdash, ldquo, and rdquo character entities.
Switched to the full HTML 4.0 table model.
Removed htable/htbody elements and replaced them with table/tbody.
Added issue element to %note.class; and declared it.
Allowed prevlocs and latestloc in either order.
Added key-term, htable, htbody, and statusp as unused elements.
Removed real statusp element in favor of plain p.
Cleaned up some comments and removed some others.
Added xml:space semi-common attribute to eg and bnf elements.
Added show and embed attributes on all the uses of href.
Added %common.att; to all HTML table elements.
Added a real URI to the "typical invocation" comment.
Merged the character entity and table modules into the main file.
Added ldquo and rdquo entities.
Added common attributes to prodgroup.
Made the email element in header optional.
Removed reference to the SGML Open table model.
Added ednote element.
Added quote element.
Updated XLink usage to reflect 3 March 1998 WD.
Added "local" entities to the class entities for customization.
Parameterized several content models to allow for customization.
Moved #PCDATA occurrences to come before GIs in content models.
Removed use of the SGML Open table module.
Added xspecref element.
Ensured that all FPIs contain 4-digit year.
(Modified the character entity module.)
Added support for prodgroup and its attributes.
Added support for HTML tables.
Added loc and bibref to content of com.
Added loc to general p content models.
Allowed p as alternative to statusp in status.
Added non-null system IDs to external parameter entity declarations.
(Modified the SGML Open table module to make it XML-compliant.)
(Modified the character entity module.)
Added character entity module and added new entities.
Removed p from appearing directly in self; created %p.mix;.
Added inform-div (non-normative division) element.
Fixed xtermref comment to mention HREF, not ref.
Extended orglist model to allow optional affiliation.
Modified author to make affiliation optional.
Added %speclist.class; and %note.class; to %obj.mix; and %p.mix;.
Added %note.class; and %illus.class; to %termdef.pcd.mix;.
Added unused HTML elements.
Put empty system ID next to public ID in entity declarations.
Changed main attribute of xtermref from def to href.
Added termdef.class to label contents.
Allowed term element in general text.
Changed bibref to EMPTY.
Added ref.class to termdef.pcd.mix.
Updated FPI.
Removed namekey element and put key attribute on name element.
Made statusp element and supporting entities.
Added slist element with sitem+ content.
Required head on scrap and added new bnf subelement.
Added an xnt element and allowed it and nt in regular text and rhs.
Removed the ntref element.
Added back the com element to the content of rhs.
Added a key attribute to bibl.
Removed the ident element.
Added a term element to be used inside termdef.
Added an xtermref element parallel to termref.
Beefed up DTD comments.
This chapter describes the markup design for attributes that appear on multiple element types in substantially similar form.
The following attributes are truly "common"; they are available on every element type and have the same basic meaning everywhere.
The id attribute is for uniquely identifying an element so that it can be linked to from elsewhere. The id attribute is declared as type ID.
The id attribute appears on every element. Its value is optional on most elements; however, a value is required on the following elements because they are always meant to serve as the target of a link:
| issue |
| wfcnote |
| vcnote |
| constraintnote |
| prod |
| termdef |
The %common.att; entity is used for those elements that don't require an id attribute, and the %common-idreq.att; entity is used for those elements that do require an id attribute.
ID values may be linked to; each linking element has its own processing expectations.
IDs are generally useful in document management. Thus, they are made available on every element, just in case. For those elements that generally serve as targets of links, IDs are made mandatory.
Note that an attribute with type ID can be used both by IDREF attributes and by XPointers. The unique identification of an element does not presume a linking solution.
The role attribute is for extending an element type by giving it an additional descriptive keyword, which a stylesheet can act on if necessary. The role attribute is declared as NMTOKEN. The %common.att; and %common-idreq.att; entities both contain role, in optional form.
Roles help extend the life of a DTD between revisions and serve as a way to prototype new DTD extensions.
Note that the XMLspec DTD does not prescribe values for, or dictate usage of, the role attribute in any way. This attribute is intended for extensions to the "official" application. Thus, where an element type is expected to be subclassed on a regular basis, it is given an additional non-role attribute to serve this purpose.
The diff attribute indicates whether a particular element has changed since the last version of the document, which a stylesheet can act on if necessary. When a value is not provided for an element, it should inherit a value from its parent. If the root element has no value supplied, assume "off". The diff attribute can have one of four values: "add" (the element was added), "chg" (the element or its content was changed), "del" (the element was deleted), or "off" (even though a higher-level element has one of the other three values, this subelement has not changed). The %common.att; and %common-idreq.att; entities both contain diff, in optional form.
Difference values may or may not be operated on by stylesheets. For example, changed and added content may be presented in a different color or emboldened, and deleted content may have its text struck through or may be suppressed entirely. Alternatively, the values may be used to control marginal change bars.
This attribute is not intended to take the place of a full code versioning or content management system; rather, difference values will help specification authors manage the process of issuing new revisions and accepting comments. Currently, some specification authors use differencing tools to compare the generated HTML versions and produce difference formatting. This markup will allow them to reflect the differences directly in the XML form, which is sometimes used as a final form for viewing.
The following attributes each appear on a few similar elements, and generally have similar meaning in each case.
See the section called HTML-Style Table (table) in Chapter 4 for information about the common attributes associated with tables.
The key attribute provides a string that can be used in sorting, indexing, and general description, when it is suspected or known that the element content won't suffice.
The key attribute appears on the following elements:
| name |
| bibl |
The value of the attribute is used in sorting, indexing, or generating cross-reference text. See the sections on the individual elements for more information.
It was felt that a subelement solution to the problem of sorting (in the case of name) was not ideal, because you need to surround some existing element content with the sorting subelement, and the existing content may not be suitable for sorting. (This is the same problem that index-term markup has.) We decided that an attribute would be more effective and less intrusive.
The def attribute points to the element where the relevant definition can be found, using the IDREF mechanism. The %def.att; parameter entity is used for optional def attributes (of which there are currently none), and the %def-req.att; parameter entity is used for required def attributes.
The def attribute appears on the following elements, and is required on all of them:
| wfc |
| vc |
| constraint |
| nt |
| termref |
The content of this element should allow the user to link to the definition.
The IDREF mechanism was used for now, until the XPointer #id(xxx) mechanism is more widely supported.
Though %def.att; isn't used for now, it's coded in case it's needed later.
The ref attribute points to the element where additional information can be found, using the IDREF mechanism. The %ref.att; parameter entity is used for optional ref attributes (of which there are currently none), and the %ref-req.att; parameter entity is used for required ref attributes.
The def attribute appears on the following elements, and is required in all cases:
| bibref |
| specref |
| prodrecap |
For the "ref" elements, the content of the element should allow the user to link to the additional information. For the "recap" element, the link should embed the content of the target at the point of reference (much like an XLink "embed", though XLink is not being used here yet).
The IDREF mechanism was used for now, until the XPointer #id(xxx) mechanism is more widely supported.
Though %ref.att; isn't used for now, it's coded in case it's needed later.
A set of parameter entities helps to manage the XLink-related attributes. The %simple-xlink.att; entity sets up the xmlns:xlink and xlink:type attributes, which declare the following element types to be XLink simple links:
| bibl |
| graphic |
| loc |
| titleref |
| xnt |
| xspecref |
| xtermref |
The %href.att; and %href-req.att; entities set up the href attribute (in optional and required forms), which allows most of the simple XLink elements to point to their targets. When a value for href is supplied (as it will be in all the required cases), The simple links will be traversable. The href attribute is available on the following elements:
| bibl (optional) |
| loc |
| titleref (optional) |
| xnt |
| xspecref |
| xtermref |
Conformance Note: Because the current Working Draft of the XLink specification has removed certain features, proper usage now requires that XLink-related attributes have a namespace prefix. However, because this would require a disruptive backwards-incompatible change, the eight simple XLink elements in XMLspec are not quite conforming. In a future version, the seven elements using href will be changed to have a prefix, and the graphic element (which has a source attribute) will need to switch to a prefixed href attribute.
Several parameter entities are set up for convenience of applying XLink behavior semantics. The %auto-embed.att;, %user-replace.att;, and %user-new.att; entities set up the XLink show and actuate attributes. The simple XLink elements have these settings:
See the table above. For detailed information about processing expectations and how to distinguish among the various XLink elements, see the section called Linking Relationships in Chapter 6.
This section describes the major document hierarchy structures:
| spec |
| header |

The spec element contains, in order, a header; an optional front; a body; and an optional back.
The header provides metadata about the specification document (see the section called Specification Header (header) for more information). The front matter is for prefatory material. The body matter is primary content. The back matter is supplementary content. All three are organized into divisions.
The elements front and body contain one or more div1 elements. The main element for structuring content is div1, the equivalent of a preface, chapter, or appendix. It can be subdivided to four additional levels, down to div5. At each level, the division contains a head (title), optionally followed by a mixture of standalone elements (see Table 7-1), optionally followed by the next level of subdivision (except in the case of div4).
The back element contains div1 and/or inform-div1 (non-normative or informational division) elements. If both are present, the normative divisions must appear first. The back element cannot be empty.
In addition to the common attributes, the spec element has three unique attributes:
Indicates the type of document, so that the appropriate stylesheet or workflow routing can be applied. It can optionally have a value of "rec" (Recommendation), "pr" (Proposed Recommendation), "cr" (Candidate Recommendation), "wd" (Working Draft), "note" (Note), "issues" (Issues List), "dispcmts" (Disposition of Comments), or "other". It should not generate any text (such as the "REC-" or "NOTE-" prefix on the W3C designation content). It has no default. If this value is "other", other-doctype should be filled in.
If w3c-doctype is "other", this value should be filled in with a keyword chosen or negotiated by the document authors and rendering application developers. It has no default.
Indicates the stage of review of the document. It has no default.
Divisions are expected to be numbered, and a report of the numbers and heads should normally be made into a Table of Contents before any front content.
The w3c-doctype attribute should not be used to generate any text (such as the "REC-" or "NOTE-" prefix on the W3C designation content). The value of the status attribute might affect the stylesheet's treatment of editorial notes (for example, whether to output them).
Originally, divisions went down to four levels total (to div4), but the MathML specification required one more level.
Elements serving the same function were merged to make a cleaner design.
The original text element, which wrapped all the non-header content, was removed because it didn't add anything to the structure, and its meaning ("a text") doesn't seem very relevant to W3C specifications. The original header contents have been consolidated under the header element, which meant that the original front element was no longer required because its titlepage contents have been done away with. The original backtabs element was removed, as agreed.
The original type attributes on the division elements were removed, as agreed.
Here is where it becomes apparent that the original "special lists" were removed from their special place in the division content models.

The header contains an ordered series of metadata elements:
The title of the technical report, for example, "Extensible Cheese Language (ECL)".
This was previously the wd-title element. It contains character data (see Table 7-2).
The subtitle of the technical report, if it has one. It contains character data (see Table 7-2).
The version of the technical report, for example, "Version 4.0". It contains character data (see Table 7-2).
The code by which the technical report is known in URIs and such, for example, "WD-xcl-991231".
This was previously the wd-num element. It contains character data (see Table 7-2).
The full name for the type of W3C technical report, for example, "W3C Working Draft" or "W3C Note". It contains character data (see Table 7-2).
The day (optionally), month, and year of publication of the document, separated out into day, month, and year elements. This should be the date on which the final text has been handed over to the W3C for official publication. Note that in internal interim drafts, you may want to provide a notice explaining the situation. The day, month, and year elements contain character data (see Table 7-2).
This was previously the wd-date element.
A generic notice to readers, for example, "This draft is for public discussion." You can add as many notices as are required.
The notice element contains a mixture of standalone elements (see Table 7-1).
The one or more Web resources corresponding to the different published forms (for example, XML, HTML, and PostScript) of this technical report. It contains character data and phrase-level elements (see Table 7-2), typically just one or more loc elements.
This was previously the thisver element.
The one or more Web resources corresponding to the previously published versions of this technical report. It contains character data and phrase-level elements (see Table 7-2), typically just one or more loc elements. If there are no previous versions, the prevlocs element itself should not be provided.
This was previously the previousver element.
The one or more Web resources corresponding to the latest version of this technical report. It contains character data and phrase-level elements (see Table 7-2), typically just one or more loc elements.
This was previously the latestver element.
The list of editors contributing to the document. It contains one or more author elements, each of which contains a name (has an optional key attribute), followed by an optional affiliation, followed by an optional email. The last requires an href attribute. The name, affiliation, and email elements contain character data (see Table 7-2).
This was previously the authors element. The affiliation element is optional because some editors may not be affiliated with any organization. The email element is optional to help avoid spamming of the editors.
A copyright notice. W3C now requires that specifications have copyright notices.
The copyright element contains a mixture of standalone elements (see Table 7-1). Note that its content should be "cooked," that is, the author should not rely on a rendering process to produce a copyright symbol, punctuation, or other style elements that might make up a proper copyright notice. These should be provided directly in the XML source.
A brief description of the document contents. It contains a mixture of zero or more standalone elements (see Table 7-1). Putting the abstract after the status statement is deprecated.
A description of the status of the document, following W3C rules. This element contains a normal mixture of standalone elements (see Table 7-1). Putting the status statement before the abstract is deprecated.
A brief bibliographic statement about this publication according to Text Encoding Initiative rules, for example, "Burlington, Seekonk, et al.: World-Wide Web Consortium, XML Working Group, 1999." It contains a mixture of standalone elements (see Table 7-1), so the example text would have to be inside a p.
A brief statement about the original source for this document, for example, "Created in electronic form." It contains a mixture of standalone elements (see Table 7-1), so the example text would have to be inside a p.
A catalog of languages used in the document. It contains one or more language elements, each of which might have an id attribute on it so that it can be referenced from prod elements. The language element contains character data (see Table 7-2).
A catalog of changes made to the document, in more or less rigorous form. It contains a mixture of zero or many standalone elements (see Table 7-1); typically, an slist element would be used, with its sitem elements corresponding to individual change descriptions and dates.
The various parts of the header are used in creating a title page that follows W3C rules. The content of some elements is used twice or more, while the content of other elements is suppressed from display. Some of the elements (such as publoc) should cause heading text to be generated.
The element name has, in addition to the common attributes, a key attribute, which optionally provides a sort-key string for use in collecting and outputting names mentioned in a document.
The element email has common attributes and a required href attribute.
The content of publoc, prevlocs, and latestloc was broadened so that the various published forms and auxiliary resources of a specification can be mentioned. For example, the XML Schema specifications refer not only to the XML source and the HTML published form, but also to the special stylesheet used to produce the output and normative DTDs and schemas associated with the specification.
W3C publication guidelines require the abstract to come before the status statement; it is allowed after the status statement only for backwards compatibility with V2.0 of the DTD.
The content model of header has been parameterized so that the metadata can be customized, subsetted, and extended as necessary.
The metadata elements that were in the original DTD were cherry-picked, based on the data found in a survey of typical W3C technical report cover and title pages. Where an element is optional, generally content is required inside it to ensure that it's not abused or accidentally left empty. The version element was made optional for Version 2.0 of the DTD because versions are not required on Working Drafts.
The subtitle element was added unilaterally because it seemed like a generally good idea.
The w3c-doctype element should perhaps more properly be an attribute with a small set of enumerated values, if the DTD gets wider use and the types are quantified. So far, the element formulation has stood us in good stead because we began to publish "notes" in addition to "working drafts" and did not need to make any stylesheet changes.
The pubdate contains, in order, day (optional), month, and year elements so that different forms of the date could be published in different locations: "31 December 1999" on the title page and "December 1999" on the cover, for example. The content model of pubdate has been parameterized so that a different form of date information can be supplied if necessary.
The status element originally required the statusp variant of the p element because Dan indicated that HTML-style links should be allowed only where they're appropriate. Because inclusions are not allowed in XML and we wanted this DTD to be XML compliant, the only way to allow status to contain loc was to give it a special subelement where loc is allowed. However, we've since found that it's very difficult to excise all need for HTML-style links in the body of the spec, so we ended up extending p to allow it to contain loc and, in preparation for losing statusp entirely, allowed p inside status. Now, statusp is obsolete and has been removed from the DTD.
It was decided to make the copyright element contain "cooked" content for simplicity and so that it is most likely that various renderings of the document will contain the correct copyright notice, whole and uncut. It is expected that if the XML source itself needs to have explicit copyright protection, an XML comment (in each file making up the document) is the right tool for the job.
Following are the standalone element structures ("paragraph-level elements"), which make up the bulk of the content of divisions in a technical report. These structures fall into classes, as follows. (Note that the DTD makes slightly finer distinctions than these, for purposes of managing content models.)
Paragraphs
Regular lists
Special lists
Notes
Illustrations
Following is the sole member of the paragraph class:
| Paragraph |

The p element is a general-purpose paragraph which can contain regular character data, phrase-level elements, and some nested standalone elements (see Chapter 7).
The p element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
This element should be presented as vertically set off from other surrounding elements, and its inline text contents should be wrapped as appropriate to fit in the line size available.
Originally, only p was made available, and it contained loc . Dan requested that the loc element not be made generally available, because properly these should only occur in the status section of a technical report, and statusp was therefore created because SGML exceptions, which would have allowed for a clean implemention of the restriction, aren't available in XML-compliant DTDs. However, later all the editors came to the conclusion that it was too restrictive not to allow loc anywhere else, and we added loc to regular paragraphs and p to the status section.
The statusp element was a special version of a paragraph that was created to allow loc (see the section called URI Reference (loc) in Chapter 5) inside it. A statusp contained a mixture of one or more %statusobj.mix; and/or %statusp.pcd.mix;. However, this element has finally been removed for simplicity, since p can do the job itself.
The following are the members of the regular list class:
| Unordered list |
| Ordered list |
| Simple list |
| Glossary list |

The ulist element identifies unordered lists (for example, with items indicated with bullets) and the olist element identifies ordered lists (for example, with items indicated with arabic numbers). Both ulist and olist contain one or more item elements, which identifies a list item. An item contains one or more standalone elements (see Table 7-1). Thus, a list item intended to contain a simple text string must first contain a paragraph.
The ulist, olist, and item elements have the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
In addition to the common attributes, the ulist and olist elements have one unique attribute:
Specifies the vertical spacing between the list items. Use "normal" to get normal vertical spacing for items; use "compact" to get less spacing. The default is dependent on the stylesheet.
List style and formatting are not strictly dictated. An unordered list at the top level (not nested in another unordered list) should generate a bullet for each item. A nested unordered list should typically generate a distinct bullet (e.g., unfilled vs. filled). An ordered list at the top level (not nested in another ordered list) should generate sequenced numbers for its item. A nested ordered list should typically generate a distinct numbering style (e.g., lowercase roman vs. arabic).
The ulist element was previously list type="bullets". The olist element was previously list type="number". The element type was split out to achieve greater content model control, and the names were chosen for consistency.

The slist element identifies a simple list, in which the items are presumed to contain only a small word or phrase. The slist contains one or more sitem elements, which contains character data and phrase-level elements (see Table 7-2). Simple list items are unlike regular list items in that the simple version can't contain standalone elements.
The slist and sitem elements have the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
List style and formatting are not strictly dictated. Typically, simple list items are each on their own line, with no bullet or other explicit enumeration.
The slist element was previously list type="simple". The element type was split out to achieve greater content model control, and the name was chosen for consistency.

The glist element identifies a glossary list, in which terms or keywords are given a definition. The glist element contains one or more gitem elements. The gitem element is a pair of label and def. A label contains character data and phrase-level elements (see Table 7-2), and a def contains standalone elements (see Table 7-1)
The glist, gitem, label, and def elements have the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
List style and formatting are not strictly dictated. Typically, glossary list items are formatted as classic hanging-indent or two-column definition lists.
The gitem element is a container for paired items in glist elements. A container wasn't previously available; this should make formatting and other processing (such as sorting) easier.
The def element was previously the item element used in a "paired" context. It's easier to process these two uses of "items" if they have distinct element types.
The glist element was previously list type="gloss". The element type was split out to achieve greater content model control, and the name was chosen for consistency.
The following are the members of the special list class:
| Bibliography list |
| Organization list |
These elements are available in divisions and %obj.mix; content, but are not available inside (for example) paragraphs.

The blist element identifies a bibliography list. It contains one or more bibl elements, each of which optionally functions as a hypertext reference to the referred-to resource through its href attribute.
The bibl element contains character data and phrase-level elements (see Table 7-2). Its content model does not constrain authors to the use of a particular bibliographic format.
The blist and bibl elements have the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
In addition to the common attributes, the bibl element has the following semi-common attributes:
| key (see the section called Key Attribute in Chapter 2) |
| href (see the section called XLink-Related Attributes in Chapter 2) |
List style and formatting are not strictly dictated. Typically, bibliography list items are formatted on their own line, and may use a definition list format by putting the value of the key attribute as the "term."

The orglist element identifies an organization list (for example, a list of Working Group or Interest Group members). It contains one or more member elements. A member contains, in order, name, an optional affiliation, and an optional role.
The name, affiliation, and role elements contain character data (see Table 7-2).
The orglist, member, affiliation, role, and name elements have the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
The name element also has the following semi-common attribute:
| key (see the section called Key Attribute in Chapter 2) |
List style and formatting are not strictly dictated. Typically, organization list items are formatted as a "textual list," wrapped in the content of a paragraph with items and their constituent parts separated by appropriate punctuation.
The orglist element was previously the wglist element. The member element was previously the wgm element. The names were changed for consistency and clarity.
The following are the members of the note class:
| Regular note |
| Issue |
| Well-formedness constraint note |
| Validity constraint note |
| Generic constraint note |

The note element is for admonitions to readers. It contains one or more standalone elements (see Table 7-1).
The note element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
Although this element type was originally note place="inline", it was never used in its inline form as far as we can tell; plain notes should be formatted as vertically set-off content with some kind of generated "Note" heading.
This was previously note place="inline". The element was split out for greater content model and linking control. We don't expect that notes other than "constraint notes" will be used very often.

The issue element is for the text of outstanding issues related to the technical report. It contains an optional head, an optional series of source elements that describe the communication (typically a document or email message) in which the issue was raised, a mixture of one or more standalone elements (see Table 7-1), and an optional resolution element.
The issue element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id (required) |
| role |
| diff |
In addition to the common attributes, it also has the status attribute, which can have a value of "open" (the default) or "closed". It it possible for an issue with no resolution provided to be closed.
Issues should be formatted as vertically set-off content with some kind of generated "Issue" heading, the issue ID, and the issue head (if any) prominently displayed. The source and resolution information should be given generated heading as necessary. It is expected that an xref element referring to an issue will refer to it by its issue ID.
The status attribute may control, for Recommendation-track documents, whether the issue is output at all; closed issues could be suppressed. Issues-list and errata documents might want to print all issues.
James added this element for his own purposes in working on XSL, and it seems like a generally good idea.

The wfcnote element identifies a well-formedness constraint note and the vcnote element identifies a validity constraint note. The constraintnote element identifies a generic constraint note. All three contain, in order, a head followed by one or more standalone elements (see Table 7-1).
All three elements must each have an id attribute specified so that it can be pointed to from a wfc, vc, or constraint element, respectively, in a production (see the section called Code Scrap (scrap)).
The wfcnote, vcnote, and constraintnote elements have the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id (required) |
| role |
| diff |
In addition, the constraintnote element has a unique attribute, type, which the author must fill in to indicate the type of constraint being described.
These note elements should be displayed vertically set off, with a generated heading something like "Well-Formedness Constraint" or "Validity Constraint" followed by the specific head provided. The specific head should also be reproduced as part of the display of the related wfc, vc, or constraint elements. The type attribute on constraintnote should be used to trigger the appropriate generated heading and to contribute to the appearance of the related constraint elements.
These elements now require an ID so that a constraint element can link to a constraint note element from inside prod. There is no point having a note if there is not at least one corresponding constraint in a production pointing to it.
The two specific elements were previously note type="wf-check" and note type="v-check". The elements were split out for greater content model and linking control.
The generic constraint note element was created because we foresaw a need for additional kinds of constraints when the Namespaces in XML draft was written. (It invented "namespace constraints.") In order to avoid constantly needing to update the DTD to add new constraint types, we chose this solution. Because of the importance of well-formedness and validity constraints to base XML, these specialized types were retained.
The following are the members of the illustration class:
| Code example |
| Example |
| Graphic |
| Code scrap |
| HTML-style table |
| IDL definitions |
| Function prototype |

The eg element identifies technical examples. It contains character data and phrase-level elements (see Table 7-2).
The eg element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
It also has an xml:space attribute with a #FIXED value of "preserve" to indicate that all white space inside the example should be retained by applications.
The element should be displayed as vertically set off (even if it appears inside a paragraph) and be given a monospaced font to ensure that characters and white space inside the example line up correctly.
We expanded its content a bit from just #PCDATA, so that it can contain footnote and highlighting markup if necessary.

The example element identifies code examples accompanied by explanations. It contains an optional head and one or more standalone elements (see Table 7-1).
The example element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
The element should be displayed as vertically set off (even if it appears inside a paragraph), with a generated "Example" heading of some kind, followed by the supplied head text (if any). Typically, in HTML form it is displayed as a table with one column and a background color that distinguishes it from the surrounding content..
The XML Schema specifications, among others, use this heavily in the form of a note with a role on it, so it seemed important enough to have its own element.

The graphic element is used to reference external graphic files. The graphic data must reside at the location pointed to using the source attribute. The graphic element is declared EMPTY.
The graphic element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
In addition to the common attributes, the graphic element has the following unique attributes:
A required hypertext reference pointing to the graphic data content to be displayed. Other related attributes are also present. See the section called XLink-Related Attributes in Chapter 2 for more information.
An optional alternate string to display if the graphic data cannot be displayed or viewed.
The data content pointed to by the source attribute of the graphic element should be presented in place. The values for the XLink behavior attributes show and actuate are fixed to be "embed" and "auto", respectively.
There seems to be no need for a formal figure with a caption or head, nor for an additional layer of container element to allow for the later addition of graphic metadata.
XLink is the obvious mechanism for pointing to the graphic data content, but for now, the graphic element is not strictly XLink-conforming because the ability to do attribute remapping has gone away. The fixed-value attribute xml:attributes on this element was therefore removed.

The scrap element identifies a code scrap containing language productions. It contains, in order, a head element containing character data, followed by a free mixture of one or more bnf, prod, prodgroup, and prodrecap elements.
The main element for structuring productions is prod. It contains, in order, an lhs (left-hand side) element identifying the nonterminal that is being defined, followed by one or more groups of rhs (right-hand side fragments) and an optional mixture consisting of com (commentary on the production), wfc (indications of well-formedness constraints), vc (indications of validity constraints), and constraint (generic indications of language constraints). It has a required id attribute so that specref cross-references (see the section called Specification Reference (specref) in Chapter 5) and nt mentions of nonterminals (see the section called Nonterminal Reference (nt) in Chapter 5) can link to it.
The prodgroup element groups productions within a single scrap.
The bnf (Backus-Naur Format) element is for "raw," unformatted productions without internal markup. It contains the same mixture of character data and phrase-level elements as eg does (see Table 7-2).
The prodrecap element links to an existing production for the purpose of reproducing it in this location for the reader's convenience. It is empty and has a required ref attribute to the desired production.
The wfc, vc, and constraint elements are empty. These indications of constraints must each use their required def attribute to link to an actual wfcnote, vcnote, or constraintnote element that defines it.
The scrap, head, bnf, prod, prodgroup, prodrecap, lhs, rhs, com, nt, xnt, wfc, vc, and constraint elements all have the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id (an attribute value is required for prod) |
| role |
| diff |
In addition to the common attributes, the wfc, vc, and constraint elements have the following semi-common attribute:
| def (required; see the section called Definition Attribute in Chapter 2). |
In addition to the common attributes, the prodrecap element has the following semi-common attribute:
| ref (required; see the section called Definition Attribute in Chapter 2). |
In addition to the common attributes, the scrap element has two unique attributes:
An optional IDREF link to a description of the language used, found in a language element in the header (see the section called Specification Header (header) in Chapter 3).
A presentational attribute to control the output of the scrap head. The default is "show", the other possible value is "suppress". The head should be suppressed only if the scrap appears in like-named section and there is no other content in the section, such that the section head and the scrap head would merely repeat each other.
In addition to the common attributes, the prodgroup element has several unique attributes:
Presentational attributes to control the width of the "pseudo-table" columns used to output groups of productions. The first column is pcw1. It can contain up to four more columns, down to pcw5. Values are optional to supply.
Each scrap is expected to be displayed vertically set off, with its head used as the label for the whole scrap. Each production should be numbered, and in some presentations, it may be appropriate to produce a "table of productions" at the front that lists the scrap heads, production numbers, and the nonterminals (the content of the lhs) they define. The style of production we have been using involves the generated output of a ::= LHS/RHS equivalence separator. Comments (com) are typically displayed between BNF comment delimiters to the right of each RHS fragment, and possibly italicized. Each RHS fragment is displayed on a separate line. The wfc, vc, and constraint elements should generate in place either "WFC:" or "VC:" or text corresponding to the linked constraintnote element's type attribute, followed by the head of the note they link to. If the scrap element's headstyle attribute is set to "suppress", the scrap head should be suppressed from output.
We considered several different "depths" of production markup model, and settled on the current model as the best balance of functionality and presentational control. Modeling EBNF exactly would have required a very heavy markup burden, which most of the editors were not willing to live with, as well as a presenting a difficult formatting challenge, so we compromised by having (for example) several rhs elements per lhs to correspond to each display line.
The prodgroup element and its attributes were added to solve a thorny formatting problem involving the output of tables.
In general, the design here shows very clearly the tension between the design principles of presentation independence and efficient W3C document production.
Originally, the content model for a scrap called for either a bnf element or one or more prod elements or one or more prodgroup elements. This strong separation of each type of scrap content began to seem unwarranted when the prodrecap element was added, since it seems eminently possible to format the result even when all four subelements are mixed freely. Thus, the content model was broadened.

The table element is a full HTML Version 4.0-style table.
Note that the implementation of HTML 4.0 tables in the XMLspec DTD is a new one, with different parameterization, fewer comments, changes to make the DTD fragment compatible with XML, and slight changes to the model. Following are the significant changes made:
The names of the table and table body elements were changed back from htable and htbody to table and tbody.
The common attributes were added to all the table elements (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
All the markup names (element types, attribute names, attribute values, and so on) were made lowercase, to help in manual input since XML is case sensitive.
The content mixture used for paragraphs was also used for the content of the th and td elements (see Table 7-1 and Table 7-2).
The full-SGML OMITTAG flags were removed.
Uses of the full-SGML NUMBER attribute type were changed to NMTOKEN.
The HTML common, reserved, and datapagesize attributes were removed.
For a full description of the features of HTML 4.0 tables that have been retained in XMLspec, see http://www.w3.org/TR/REC-html40, which is the normative reference for table structure and processing expectations.
Note: The Version 4.0 model removes some attribute defaults that were in force in the simplified 3.2 model used previously. In general, if you want a particular value, make sure to set it yourself.
At first, the DTD offered only SGML Open Exchange tables, for which DSSSL formatting support already existed. However, HTML is the primary output for W3C documents, and HTML table formatting was written in DSSSL, so we added the HTML table model as well. More recently, we removed the SGML Open model because only the HTML model was actually being used.
We can easily add the SGML Open table model again if it is ever needed.
Most recently, we switched from an extremely simple HTML-style table model to full HTML 4.0 tables on request from the DOM group. This was to avoid unnecessary transformations in converting to HTML and to allow authors to take full advantage of the geometrical table capabilities that HTML 4.0 offers.
The definitions element structure was contributed by the W3C DOM Working Group. No detailed descriptions, processing expectations, or design rationales have been supplied. This section offers only a structural description.
Note: This model may change in backwards-incompatible ways in the future, to account for the manner in which the markup is actually being used by the DOM group. See the XML source for the DOM Recommendation for examples, and use this model with caution.
Following are the IDL element classes. Every class entity has the naming scheme %name.class;, and has an empty %local.name.class; entity in it for customization purposes.
Descriptive elements (%idl-desc.class;)
| Paragraph (p) |
| Note (note) |
Type definition elements (%idl-tdef.class;)
| typedef |
| constant |
| exception |
| reference |
| group |
Module/interface elements (%idl-mod.class;)
| module |
| interface |
Struct elements (%idl-struct.class;)
| struct |
| enum |
| sequence |
| union |
| typename |
Method/attribute elements (%idl-meth.class;)
| method |
| attribute |
Table 4-1 shows the element mixtures built up out of the IDL-related elements.
Table 4-1. IDL-Related Mixtures
| desc | tdef | mod | struct | meth | |
|---|---|---|---|---|---|
| %idl-grp.mix; (used in group) | X | X | X | X | X |
| %idl-defn.mix; (used in definitions, module) | X | X | X | ||
| %idl-intfc.mix; (used in interface) | X | X | X | ||
| %idl-type.mix; (used in typdef, component, case as mutually exclusive choices) | X |
The model for definitions is as follows.

The models for group, interface, module, reference, constant, and exception are as follows.

The model for typedef and its contents is as follows.

The models for method and attribute are as follows.


The proto element structure was contributed by the W3C XSL Working Group. Only limited descriptions, processing expectations, and design rationales are provided here.
The proto element describes functions available in XSLT and similar languages. It contains a series of zero or more arg elements.
The proto element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
In addition to the common attributes, the proto element has the following unique attributes:
The name of the function. A value is required.
The type of value returned by the function. It is required; the value can be "boolean", "expression", "location-set", "node-set", "number", "object", "point", "string", or "range".
Each function prototype is expected to be displayed vertically set off, with the function name emphasized. It should be flanked on the left by the return type and on the right by a list of argument types. If a particular argument is optional, is should be indicated so. Depending on the language being described, the list is typically parenthesized and the argument types separated by commas, with optional arguments being followed by question marks.
Following are the phrase-level element structures ("inline-level elements"), which are typically used along with character data. These structures fall into classes, as follows. (Note that the DTD makes slightly finer distinctions than these, for purposes of managing content models.)
Annotations
Terms and definitions
Emphasized text
References
Technical
Editorial note
The footnote element is the only member of the annotation class.
The footnote element serves as both a marker for the location of the footnote callout and a container for the footnote content. It contains one or more elements in the %obj.mix; mixture.
The footnote element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
For print, the location of the footnote element should be given a generated superscripted number or symbol that serves as a callout, and the footnote content should be presented along with the callout at the bottom of the page. In online presentation, the footnote could be presented as a pop-up dialog box keyed to an icon placed at the location of the footnote element.
The following are the members of the term/definition class:
| Defined term |
| Term definition |
The term element identifies a term being defined in text. It contains character data (see Table 7-2). It is mostly used as a substructure of termdef, though it may occasionally be used outside of a termdef context for an "informally" defined term. For information on cross-referencing a term, see the section called Term Reference (termref) and the section called External Term Definition Reference (xtermref).
The term element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
This element should be given some sort of typographical emphasis, for example italics.
This element exists mostly to allow control of the typographical emphasis, since the term attribute on termdef does the work of supplying a "canonical" form of the term for use in generating definitions. If the canonical term and the term as it appears in text are identical, there is some slight redundancy, but the overhead of using the canonical term in the flow of text (or modifying it if it's inappropriately pluralized or capitalized) isn't worth it.
The element termdef contains a term definition embedded in text. It contains a mixture of character data and phrase-level elements (see Table 7-2), including somewhere within it a mention of the term being defined (in a term element). Note that because the termdef element has mixed content, the presence of term within it can't be guaranteed by means of a validating XML processor. However, there is an editorial expectation that term will be present. (See issue 7 in the section called Issues in Chapter 1.)
The termdef element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2). It must have an id attribute so that it can be linked to from termref elements ( the section called External Term Definition Reference (xtermref)).
| id (required) |
| role |
| diff |
In addition to the common attributes, termdef has one unique attribute:
The canonical form of the term or phrase being defined must appear in this attribute, even if the term or phrase also appears in the element content in identical form (in the term element).
While no special behavior or formatting is required, there are some opportunities for clever definition handling. For example, the terms and definitions could be assembled into a generated glossary, or definitions could be given some sort of boxing or generated-text boundaries in running text.
Because we wanted to continue to allow definitions in running text, the mixed-content solution was the only reasonable choice even though it means that the DTD can't ensure that proper markup has been used.
The following are the members of the emphasized text class:
| Emphasized text |
| Phrase |
| Quoted text |
| Subscripted text |
| Superscripted text |
The emph element identifies text that should be given extra rhetorical emphasis. It contains character data (see Table 7-2).
The emph element has the common attributes (see the section called Attributes Appearing on Every Element in Chapter 2):
| id |
| role |
| diff |
The content of the emph element should be given typographical emphasis, typically italic or boldface.
If not abused, this element can be useful, and its presence probably forestalls abuse of other elements that happen to produce typographical emphasis. Since it is expected to contain only a word or two of natural language, it need only contain #PCDATA.
The phrase element identifies text that needs attribute values set on it, but is otherwise undistinguished from the surrounding text. It contains a mixture of character data and the same phrase-level elements that are allowed in paragraphs (see Table 7-2).