This document is also available in these non-normative formats: Single HTML file, PostScript version, PDF version, ZIP archive, or Gzip'd TAR archive.
Copyright ©2001-2008 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark and document use rules apply.
This document is version 1.1 of XHTML Modularization, an abstract modularization of XHTML and implementations of the abstraction using XML Document Type Definitions (DTDs) and XML Schemas. This modularization provides a means for subsetting and extending XHTML, a feature needed for extending XHTML's reach onto emerging platforms. This specification is intended for use by language designers as they construct new XHTML Family Markup Languages. This specification does not define the semantics of elements and attributes, only how those elements and attributes are assembled into modules, and from those modules into markup languages. This second version of this specification includes several minor updates to provide clarifications and address errors found in the first version. It also provides an implementation using XML Schemas.
This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.
This is a Proposed Recommendation of "XHTML Modularization 1.1". It reflects changes arising from comments during the Last Call period. A "Disposition of Comments" report is available at http://www.w3.org/MarkUp/2008/xhtml-modularization-lc-doc-20080221.html. An implementation report is available at http://www.w3.org/MarkUp/2008/xhtml-m12n-11-implementation.html. This specification is considered stable by the XHTML2 Working Group. W3C Advisory Committee Representatives are invited to submit their formal review per the instructions in the Call for Review (see Advisory Committee questionnaires). The review period ends on 15 July 2008. Members of the public are also invited to send comments on this Proposed Recommendation to www-html-editor@w3.org (archive). It is inappropriate to send discussion email to this address. Public discussion may take place on www-html@w3.org (archive).
Publication as a Proposed Recommendation does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
The Working Group invites implementation feedback during this period. Since this specification represents an incremental improvement to its predecessor, and since there is already substantial implementation experience, we expect to have adequate validation before the end of the Proposed Recommendation review period. While we welcome implementation experience reports, the XHTML2 Working Group will not allow early implementation to constrain its ability to make changes to this specification prior to final release.
If this document is approved as a W3C Recommendation, it will supercede version 1.0 as defined in http://www.w3.org/TR/2001/REC-xhtml-modularization-20010410/.
This document has been produced by the W3C XHTML 2 Working Group as part of the HTML Activity. The goals of the XHTML 2 Working Group are discussed in the XHTML 2 Working Group charter.
This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
Public discussion of HTML takes place on www-html@w3.org (archive). To subscribe send an email to www-html-request@w3.org with the word subscribe in the subject line.
Please report errors in this document to www-html-editor@w3.org (archive).
This section is informative.
XHTML is the reformulation of HTML 4 as an application of XML. XHTML 1.0 [XHTML1] specifies three XML document types that correspond to the three HTML 4 DTDs: Strict, Transitional, and Frameset. XHTML 1.0 is the basis for a family of document types that subset and extend HTML.
XHTML Modularization is a decomposition of XHTML 1.0, and by reference HTML 4, into a collection of abstract modules that provide specific types of functionality. These abstract modules are implemented in this specification using the XML Schema and XML Document Type Definition languages. The rules for defining the abstract modules, and for implementing them using XML Schemas and XML DTDs, are also defined in this document.
These modules may be combined with each other and with other modules to create XHTML subset and extension document types that qualify as members of the XHTML-family of document types. Note that while this specification only contains trivial examples of combining these modules into markup languages. For more complete examples, see [XHTMLBASIC] or [XHTML11].
The modularization of XHTML refers to the task of specifying well-defined sets of XHTML elements that can be combined and extended by document authors, document type architects, other XML standards specifications, and application and product designers to make it economically feasible for content developers to deliver content on a greater number and diversity of platforms.
Over the last couple of years, many specialized markets have begun looking to HTML as a content language. There is a great movement toward using HTML across increasingly diverse computing platforms. Currently there is activity to move HTML onto mobile devices (hand held computers, portable phones, etc.), television devices (digital televisions, TV-based Web browsers, etc.), and appliances (fixed function devices). Each of these devices has different requirements and constraints.
Modularizing XHTML provides a means for product designers to specify which elements are supported by a device using standard building blocks and standard methods for specifying which building blocks are used. These modules serve as "points of conformance" for the content community. The content community can now target the installed base that supports a certain collection of modules, rather than worry about the installed base that supports this or that permutation of XHTML elements. The use of standards is critical for modularized XHTML to be successful on a large scale. It is not economically feasible for content developers to tailor content to each and every permutation of XHTML elements. By specifying a standard, either software processes can autonomously tailor content to a device, or the device can automatically load the software required to process a module.
Modularization also allows for the extension of XHTML's layout and presentation capabilities, using the extensibility of XML, without breaking the XHTML standard. This development path provides a stable, useful, and implementable framework for content developers and publishers to manage the rapid pace of technological change on the Web.
An XHTML document type is defined as a set of abstract modules. A abstract module defines one kind of data that is semantically different from all others. Abstract modules can be combined into document types without a deep understanding of the underlying schemas that define the modules.
A module implementation consists of a set of element types, a set of attribute-list declarations, and a set of content model declarations, where any of these three sets may be empty. An attribute-list declaration in a module may modify an element type outside the element types defined in the module, and a content model declaration may modify an element type outside the element type set of the module.
One implementation mechanism is XML DTDs. An XML DTD is a means of describing the structure of a class of XML documents, collectively known as an XML document type. XML DTDs are described in the XML 1.0 Recommendation [XML]. Another implementation mechanism is XML Schema [XMLSCHEMA]. For purposes of discussion in this document, we refer to the various implementations of XHTML Modularization as "schema".
A hybrid document type is an document type composed from a collection of XML document types or Modules. The primary purpose of the modularization framework described in this document is to allow a document type author to combine elements from multiple abstract modules into a hybrid document type, develop documents against that hybrid document type, and to validate that document against the associated hybrid schema.
One of the most valuable benefits of XML over SGML is that XML reduces the barrier to entry for standardization of element sets that allow communities to exchange data in an interoperable format. However, the relatively static nature of HTML as the content language for the Web has meant that any one of these communities have previously held out little hope that their XML document types would be able to see widespread adoption as part of Web standards. The modularization framework allows for the dynamic incorporation of these diverse document types within the XHTML-family of document types, further reducing the barriers to the incorporation of these domain-specific vocabularies in XHTML documents.
The use of well-formed, but not valid, documents is an important benefit of XML. In the process of developing a document type, however, the additional leverage provided by a validating parser for error checking is important. The same statement applies to XHTML document types with elements from multiple abstract modules.
An XHTML-family document is an instance of one particular document type as specified by the document's author, either in the document's prologue or through some other mechanism (e.g., the
schemaLocation attribute). Validating the document is the process of testing whether the document complies with the rules in the associated schema.
One document can consist of multiple document fragments. Validating only fragments of a document, where each fragment is of a different document type than the other fragments in the document, is beyond the scope of this framework - since it would require technology that is not yet defined.
However, the modularization framework allows multiple schema to be integrated and form a new document type (e.g. SVG integrated into XHTML). The new schema can be used for normal XML 1.0 validation.
Earlier versions of HTML attempted to define parts of the model that user agents are required to use when formatting a document. With the advent of HTML 4, the W3C started the process of divorcing presentation from structure. XHTML 1.0 maintained this separation, and this document continues moving HTML and its descendants down this path. Consequently, this document makes no requirements on the formatting model associated with the presentation of documents marked up with XHTML Family document types.
Instead, this document recommends that content authors rely upon style mechanisms such as CSS to define the formatting model for their content. When user agents support the style mechanisms, documents will format as expected. When user agents do not support the style mechanisms, documents will format as appropriate for that user agent. This permits XHTML Family user agents to support rich formatting models on devices where that is appropriate, and lean formatting models on devices where that is appropriate.
This section is informative.
While some terms are defined in place, the following definitions are used throughout this document. Familiarity with the W3C XML 1.0 Recommendation [XML] is highly recommended.
This section is normative.
In order to ensure that XHTML-family documents are maximally portable among XHTML-family user agents, this specification rigidly defines conformance requirements for both of these and for XHTML-family document types. While the conformance definitions can be found in this section, they necessarily reference normative text within this document, within the base XHTML specification [XHTML1], and within other related specifications. It is only possible to fully comprehend the conformance requirements of XHTML through a complete reading of all normative references.
The keywords "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119].
It is possible to modify existing document types and define wholly new document types using both modules defined in this specification and other modules. Such a document type is "XHTML Host Language Conforming" when it meets the following criteria:
XHTML".It is also possible to define document types that are based upon XHTML, but do not adhere to its structure. Such a document type is "XHTML Integration Set Conforming" when it meets the following criteria:
This specification defines a method for defining XHTML-conforming modules. A module conforms to this specification when it meets all of the following criteria:
A conforming XHTML family document is a valid instance of an XHTML Host Language Conforming Document Type. For the avoidance of doubt, the behavior of User Agents in the presence of invalid documents is undefined.
A conforming user agent must meet all of the following criteria (as defined in [XHTML1]):
ID (e.g., the id
attribute on most XHTML elements) as fragment identifiers.xml:space, as defined in [XML], is set to "preserve". If the value of that attribute is set to "default", that is the same as if it were set to "preserve". On rendering, whitespace is processed
according to the rules of [CSS2].XHTML Host Language document types must adhere to strict naming conventions so that it is possible for software and users to readily determine the relationship of document types to XHTML. The
names for document types implemented as XML Document Type Definitions are defined through Formal Public Identifiers (FPIs). Within FPIs, fields are separated by double slash character sequences
(//). The various fields must be composed as follows:
W3C.EN).Using these rules, the name for an XHTML Host Language conforming document type might be -//MyCompany//DTD XHTML MyML 1.0//EN. The name for an XHTML family conforming module might be
-//MyCompany//ELEMENTS XHTML MyElements 1.0//EN. The name for an XHTML Integration Set conforming document type might be -//MyCompany//DTD Special Markup with XHTML//EN.
Each module defined in this specification is given a unique identifier that adheres to the naming rules in the previous section. Over time, a module may evolve. A logical ramification of such evolution may be that some aspects of the module are no longer compatible with its previous definition. To help ensure that document types defined against modules defined in this specification continue to operate, the identifiers associated with a module that changes will be updated. Specifically, the Formal Public Identifier and System Identifier of the module will be changed by modifying the version identifier included in each. Document types that wish to incorporate the updated functionality will need to be similarly updated.
In addition, the earlier version(s) of the module will continue to be available via its earlier, unique identifier(s). In this way, document types developed using XHTML modules will continue to function seamlessly using their original definitions even as the collection expands and evolves. Similarly, document instances written against such document types will continue to validate using the earlier module definitions.
Other XHTML Family Module and Document Type authors are encouraged to adopt a similar strategy to ensure the continued functioning of document types based upon those modules and document instances based upon those document types.
This section is normative.
An abstract module is a definition of an XHTML module using prose text and some informal markup conventions. While such a definition is not generally useful in the machine processing of document types, it is critical in helping people understand what is contained in a module. This section defines the way in which XHTML abstract modules are defined. An XHTML-conforming module is not required to provide an abstract module definition. However, anyone developing an XHTML module is encouraged to provide an abstraction to ease in the use of that module.
The abstract modules are not defined in a formal grammar. However, the definitions do adhere to the following syntactic conventions. These conventions are similar to those of XML DTDs, and should be familiar to XML DTD authors. Each discrete syntactic element can be combined with others to make more complex expressions that conform to the algebra defined here.
expr ?expr +expr *a , ba is required, followed by expression b.a | ba - b&).*).|), inside of
parentheses following the attribute name. If the attribute has a default value, that value is followed by an asterisk (*). If the attribute has a fixed value, the attribute name is
followed by an equals sign (=) and the fixed value enclosed in quotation marks.Abstract module definitions define minimal, atomic content models for each module. These minimal content models reference the elements in the module itself. They may also reference elements in other modules upon which the abstract module depends. Finally, the content model in many cases requires that text be permitted as content to one or more elements. In these cases, the symbol used for text is PCDATA; this is short for "parsed character data", denoting sequences of characters which are to be parsed for markup by an XML processor .(processed character data). A content type can also be defined as EMPTY, meaning the element has no content in its minimal content model.
In some instances, it is necessary to define the types of attribute values or the explicit set of permitted values for attributes. The following attribute types (defined in the XML 1.0 Recommendation) are used in the definitions of the abstract modules:
| Attribute Type | Definition |
|---|---|
| CDATA | Character data |
| ID | A document-unique identifier |
| IDREF | A reference to a document-unique identifier |
| IDREFS | A space-separated list of references to document-unique identifiers |
| NAME | A name with the same character constraints as ID above |
| NMTOKEN | A name composed of only name tokens as defined in XML 1.0 [XML] |
| NMTOKENS | One or more white space separated NMTOKEN values |
In addition to these pre-defined data types, XHTML Modularization defines the following data types and their semantics (as appropriate):
| Data type | Description | ||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Character | A single character from [ISO10646]. | ||||||||||||||||||||||||||||||||
| Charset | A character encoding, as per [RFC2045]. | ||||||||||||||||||||||||||||||||
| Charsets | A space-separated list of character encodings, as per [RFC2045]. | ||||||||||||||||||||||||||||||||
| Color |
The attribute value type "Color" refers to color definitions as specified in [SRGB]. A color value may either be a hexadecimal number (prefixed by a hash mark) or one of the following sixteen color names. The color names are case-insensitive.
Thus, the color values "#800080" and "Purple" both refer to the color purple. |
||||||||||||||||||||||||||||||||
| ContentType | A media type, as per [RFC2045]. | ||||||||||||||||||||||||||||||||
| ContentTypes | A comma-separated list of media types, as per [RFC2045]. | ||||||||||||||||||||||||||||||||
| Coords | Comma separated list of coordinates to use in defining areas. | ||||||||||||||||||||||||||||||||
| Datetime | Date and time information. | ||||||||||||||||||||||||||||||||
| FPI | A character string representing an SGML Formal Public Identifier. | ||||||||||||||||||||||||||||||||
| FrameTarget | Frame name used as destination for results of certain actions. | ||||||||||||||||||||||||||||||||
| LanguageCode | A language code, as per [RFC3066] or its successor. | ||||||||||||||||||||||||||||||||
| LanguageCodes | A comma-separated list of language ranges with optional q parameters, as defined in section 14.4 of [RFC2616] as the field value of the Accept-Language request header. Individual language codes should conform to [RFC3066] or its successors. | ||||||||||||||||||||||||||||||||
| Length | The value may be either in pixels or a percentage of the available horizontal or vertical space. Thus, the value "50%" means half of the available space. | ||||||||||||||||||||||||||||||||
| LinkTypes |
Authors may use the following recognized link types, listed here with their conventional interpretations. A LinkTypes value refers to a space-separated list of link types. White space characters are not permitted within link types. These link types are case-insensitive, i.e., "Alternate" has the same meaning as "alternate". User agents, search engines, etc. may interpret these link types in a variety of ways. For example, user agents may provide access to linked documents through a navigation bar.
Authors may wish to define additional link types not described in this specification. If they do so, they should use a profile to cite the conventions used to define the link types. Please see the profile attribute of the head element for more information. Note that in a future version of this specification, the Working Group expects to evolve this type from a simple name to a Qualified Name (QName). |
||||||||||||||||||||||||||||||||
| MediaDesc |
The MediaDesc attribute is a comma-separated list of media descriptors. The following is a list of recognized media descriptors:
Future versions of XHTML may introduce new values and may allow parameterized values. To facilitate the introduction of these extensions, conforming user agents must be able to parse the media attribute value as follows:
Note. Style sheets may include media-dependent variations within them (e.g., the CSS @media construct). In such cases it may be appropriate to use "media =all". |
||||||||||||||||||||||||||||||||
| MultiLength | The value may be a Length or a relative length. A relative length has the form "i*", where "i" is an integer. When allotting space among elements competing for that space, user agents allot pixel and percentage lengths first, then divide up remaining available space among relative lengths. Each relative length receives a portion of the available space that is proportional to the integer preceding the "*". The value "*" is equivalent to "1*". Thus, if 60 pixels of space are available after the user agent allots pixel and percentage space, and the competing relative lengths are 1*, 2*, and 3*, the 1* will be allotted 10 pixels, the 2* will be allotted 20 pixels, and the 3* will be allotted 30 pixels. | ||||||||||||||||||||||||||||||||
| MultiLengths | A comma separated list of items of type MultiLength. | ||||||||||||||||||||||||||||||||
| Number | One or more digits | ||||||||||||||||||||||||||||||||
| Pixels | The value is an integer that represents the number of pixels of the canvas (screen, paper). Thus, the value "50" means fifty pixels. For normative information about the definition of a pixel, please consult [CSS2] | ||||||||||||||||||||||||||||||||
| QName | An [XMLNS]-qualified name. See QName for a formal definition. | ||||||||||||||||||||||||||||||||
| QNames | A space-separated list of QNames, as defined above. | ||||||||||||||||||||||||||||||||
| Script |
Script data can be the content of the "script" element and the value of intrinsic event attributes. User agents must not evaluate script data as HTML markup but instead must pass it on as data to a script engine. The case-sensitivity of script data depends on the scripting language. Please note that script data that is element content may not contain character references, but script data that is the value of an attribute may contain them. |
||||||||||||||||||||||||||||||||
| Shape | The shape of a region. | ||||||||||||||||||||||||||||||||
| Text | Arbitrary textual data, likely meant to be human-readable. | ||||||||||||||||||||||||||||||||
| URI | A Uniform Resource Identifier Reference, as defined by the type anyURI in XMLSCHEMA. |
||||||||||||||||||||||||||||||||
| URIs | A space-separated list of URIs as defined above. |
This section is informative
This section defines a sample abstract module as an example of how to take advantage of the syntax rules defined above. Since this example is trying to use all of the various syntactic elements defined, it is pretty complicated. Typical module definitions would be much simpler than this. Finally, note that this module references the attribute collection Common. This is a collection defined in the XHTML Modularization specification that includes all of the basic attributes that most elements need.
The XHTML Skiing Module defines markup used when describing aspects of a ski lodge. The elements and attributes defined in this module are:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| resort | Common, href (CDATA) | description, Aspen+ |
| lodge | Common | description, (Aspen - lift)+ |
| lift | Common, href | description? |
| chalet | Common, href | description? |
| room | Common, href | description? |
| lobby | Common, href | description? |
| fireplace | Common, href | description? |
| description | Common | PCDATA* |
This module also defines the content set Aspen with the minimal content model lodge | lift | chalet | room | lobby | fireplace.
This section is normative.
This section specifies the contents of the XHTML abstract modules. These modules are abstract definitions of collections of elements, attributes, and their content models. These abstract modules can be mapped onto any appropriate specification mechanism. XHTML DTD Module Implementations, for example, maps these modules onto DTDs as described in [XML].
Content developers and device designers should view this section as a guide to the definition of the functionality provided by the various XHTML-defined modules. When developing documents or defining a profile for a class of documents, content developers can determine which of these modules are essential for conveying their message. When designing clients, device designers should develop their device profiles by choosing from among the abstract modules defined here.
Except when overridden in this document, the semantics of these elements and attributes are defined in [HTML4].
Many of the abstract modules in this section define the required attributes for elements. The table below defines some collections of attributes that are referenced throughout the modules. These expressions should in no way be considered normative or mandatory. They are an editorial convenience for this document. When used in the remainder of this section, it is the expansion of the term that is normative, not the term itself.
The following basic attribute sets are used on many elements. In each case where they are used, their use is identified via their collection name rather than enumerating the list.
Each of the attributes defined in an XHTML attribute collection is available for use when their corresponding module is included in an XHTML Host Language or an XHTML Integration Set. In such a
situation, the attributes are available for use in the definition of elements that are NOT in the XHTML namespace when they are referenced using their namespace-qualified identifier (e.g.,
xhtml:class). The semantics of the attributes remain the same regardless of whether they are referenced using their qualified identifier or not. It is an error to use an XHTML
namespace-qualified attribute on elements from the XHTML Namespace.
For the avoidance of doubt, if an attribute collection is referenced by an element's definition, and an attribute in that collection is also explicitly referenced by that element's definition, this does NOT cause a conflict. It is up to the markup language schema implementation to update the content models accordingly.
| Collection Name | Attributes in Collection |
|---|---|
| Core | xml:space ("default"* | "preserve"), class (NMTOKENS), id (ID), title (CDATA) |
| I18N | dir ("ltr" | "rtl"), xml:lang (CDATA) |
| Events | onclick (Script), ondblclick (Script), onmousedown (Script), onmouseup (Script), onmouseover (Script), onmousemove (Script), onmouseout (Script), onkeypress (Script), onkeydown (Script), onkeyup (Script) |
| Style | style (CDATA) |
| Common | Core + Events + I18N + Style |
Note that the Events collection is only defined when the Intrinsic Events Module is selected. Otherwise, the Events collection is empty.
Also note that the Style collection is only defined when the Style Attribute Module is selected. Otherwise, the Style collection is empty.
Finally, note that the I18N collection only contains the xml:lang attribute unless the Bi-directional Text Module module is selected.
The core modules are modules that are required to be present in any XHTML Family Conforming Document Type.
The Structure Module defines the major structural elements for XHTML. These elements effectively act as the basis for the content model of many XHTML family document types. The elements and attributes included in this module are:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| body | Common | (Heading | Block | List)* |
| head | I18N, id (ID), profile (URIs) | title |
| html | I18N, id (ID), version (CDATA), xmlns (URI = "http://www.w3.org/1999/xhtml") | head, body |
| title | I18N, id (ID) | PCDATA |
This module is the basic structural definition for XHTML content. The html element acts as the root element for all XHTML Family Document Types.
Note that the value of the xmlns attribute is defined to be "http://www.w3.org/1999/xhtml". Also note that because the xmlns attribute is treated specially by XML namespace-aware parsers [XMLNAMES], it is legal to have it present as an attribute of each element. However, any time the xmlns attribute is used in the context of an XHTML module, whether with a prefix or not, the value of the attribute shall be the XHTML namespace defined here. See Defining the Namespace of a Module for more on rules regarding namespace usage with XHTML family modules.
Implementations: DTD, XML Schema
This module defines all of the basic text container elements, attributes, and their content model:
| Element | Attributes | Minimal Content Model |
|---|---|---|
| abbr | Common | (PCDATA | Inline)* |
| acronym | Common | (PCDATA | Inline)* |
| address | Common | (PCDATA | Inline)* |
| blockquote | Common, cite (URI) | (Heading | Block | List)* |
| br | Core | EMPTY |
| cite | Common | (PCDATA | Inline)* |
| code | Common | (PCDATA | Inline)* |
| dfn | Common | (PCDATA | Inline)* |
| div | Common | (PCDATA | Flow)* |
| em | Common | (PCDATA | Inline)* |
| h1 | Common | (PCDATA | Inline)* |
| h2 | Common | (PCDATA | Inline)* |
| h3 | Common | (PCDATA | Inline)* |
| h4 | Common | (PCDATA | Inline)* |
| h5 | Common | (PCDATA | Inline)* |
| h6 | Common | (PCDATA | Inline)* |
| kbd | Common | (PCDATA | Inline)* |
| p | Common | (PCDATA | Inline)* |
| pre | Common | (PCDATA | Inline)* |
| q | Common, cite (URI) | (PCDATA | Inline)* |
| samp | Common | (PCDATA | Inline)* |
| span | Common | (PCDATA | Inline)* |
| strong | Common | (PCDATA | Inline)* |
| var | Common | (PCDATA | Inline)* |
The minimal content model for this module defines some content sets:
Implementations: DTD, XML Schema
The Hypertext Module provides the element that is used to define hypertext links to other resources. This module supports the following element and attributes:
| Element | Attributes | Minimal Content Model |
|---|---|---|
| a | Common, accesskey (Character), charset (Charset), href (URI), hreflang (LanguageCode), rel (LinkTypes), rev (LinkTypes), tabindex (Number), type (ContentType) | (PCDATA | Inline - a)* |
This module adds the a element to the Inline content set of the Text Module.
Implementations: DTD, XML Schema
As its name suggests, the List Module provides list-oriented elements. Specifically, the List Module supports the following elements and attributes:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| dl | Common | (dt | dd)+ |
| dt | Common | (PCDATA | Inline)* |
| dd | Common | (PCDATA | Flow)* |
| ol | Common | li+ |
| ul | Common | li+ |
| li | Common | (PCDATA | Flow)* |
This module also defines the content set List with the minimal content model (dl | ol | ul)+ and adds this set to the Flow content set of the Text Module.
Implementations: DTD, XML Schema
This module is deprecated. Similar functionality can be found in the Object Module.
The Applet Module provides elements for referencing external applications. Specifically, the Applet Module supports the following elements and attributes:
| Element | Attributes | Minimal Content Model |
|---|---|---|
| applet | Core, alt* (Text), archive (CDATA), code (CDATA), codebase (URI), height* (Length), object (CDATA), width* (Length) | (PCDATA | Flow | param)* |
| param | id (ID), name* (CDATA), type (ContentType), value (CDATA), valuetype ("data"* | "ref" | "object") | EMPTY |
When the Applet Module is used, it adds the applet element to the Inline content set of the Text Module.
Implementations: DTD, XML Schema
This section defines a variety of additional textual markup modules.
This module defines elements, attributes, and a minimal content model for simple presentation-related markup:
| Element | Attributes | Minimal Content Model |
|---|---|---|
| b | Common | (PCDATA | Inline)* |
| big | Common | (PCDATA | Inline)* |
| hr | Common | EMPTY |
| i | Common | (PCDATA | Inline)* |
| small | Common | (PCDATA | Inline)* |
| sub | Common | (PCDATA | Inline)* |
| sup | Common | (PCDATA | Inline)* |
| tt | Common | (PCDATA | Inline)* |
When this module is used, the hr element is added to the Block content set of the Text Module. In addition, the b, big, i, small, sub, sup,
and tt elements are added to the Inline content set of the Text Module.
Implementations: DTD, XML Schema
This module defines elements and attributes for use in editing-related markup:
| Element | Attributes | Minimal Content Model |
|---|---|---|
| del | Common, cite (URI), datetime (Datetime) | (PCDATA | Flow)* |
| ins | Common, cite (URI), datetime (Datetime) | (PCDATA | Flow)* |
When this module is used, the del and ins elements are added to the Inline content set of the Text Module.
Implementations: DTD, XML Schema
The Bi-directional Text module defines an element that can be used to declare the bi-directional rules for the element's content.
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| bdo | Core, dir* ("ltr" | "rtl"), xml:lang | (PCDATA | Inline)* |
When this module is used, the bdo element is added to the Inline content set of the Text Module. Selecting this module also adds the attribute dir*
("ltr" | "rtl") to the I18N attribute collection.
Implementations: DTD, XML Schema
The Basic Forms Module provides the form-related elements, but only in a limited form. Specifically, the Basic Forms Module supports the following elements, attributes, and minimal content model:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| form | Common, action* (URI), method ("get"* | "post"), enctype (ContentType) | (Heading | List | Block - form)+ |
| input | Common, accesskey (Character), checked ("checked"), maxlength (Number), name (CDATA), size (Number), src (URI), tabindex (Number), type ("text"* | "password" | "checkbox" | "radio" | "submit" | "reset" | "hidden" ), value (CDATA) | EMPTY |
| label | Common, accesskey (Character), for (IDREF) | (PCDATA | Inline - label)* |
| select | Common, multiple ("multiple"), name (CDATA), size (Number), tabindex (Number) | option+ |
| option | Common, selected ("selected"), value (CDATA) | PCDATA |
| textarea | Common, accesskey (Character), cols* (Number), name (CDATA), rows* (Number), tabindex (Number) | PCDATA |
This module defines two content sets:
When this module is used, it adds the Form content set to the Block content set and it adds the Formctrl content set to the Inline content set as these are defined in the Text Module.
The Basic Forms Module is a subset of the Forms Module. These modules may not be used together in a single document type.
Implementations: DTD, XML Schema
The Forms Module provides all of the forms features found in HTML 4.0. Specifically, the Forms Module supports:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| form | Common, accept (ContentTypes), accept-charset (Charsets), action* (URI), method ("get"* | "post"), enctype (ContentType) | (Heading | List | Block - form | fieldset)+ |
| input | Common, accept (ContentTypes), accesskey (Character), alt (Text), checked ("checked"), disabled ("disabled"), maxlength (Number), name (CDATA), readonly ("readonly"), size (Number), src (URI), tabindex (Number), type ("text"* | "password" | "checkbox" | "button" | "radio" | "submit" | "reset" | "file" | "hidden" | "image"), value (CDATA) | EMPTY |
| select | Common, disabled ("disabled"), multiple ("multiple"), name (CDATA), size (Number), tabindex (Number) | (optgroup | option)+ |
| option | Common, disabled ("disabled"), label (Text), selected ("selected"), value (CDATA) | PCDATA |
| textarea | Common, accesskey (Character), cols* (Number), disabled ("disabled"), name (CDATA), readonly ("readonly"), rows* (Number), tabindex (Number) | PCDATA |
| button | Common, accesskey (Character), disabled ("disabled"), name (CDATA), tabindex (Number), type ("button" | "submit"* | "reset"), value (CDATA) | (PCDATA | Heading | List | Block - Form | Inline - Formctrl)* |
| fieldset | Common | (PCDATA | legend | Flow)* |
| label | Common, accesskey (Character), for (IDREF) | (PCDATA | Inline - label)* |
| legend | Common, accesskey (Character) | (PCDATA | Inline)* |
| optgroup | Common, disabled ("disabled"), label* (Text) | option+ |
This module defines two content sets:
When this module is used, it adds the Form content set to the Block content set and it adds the Formctrl content set to the Inline content set as these are defined in the Text Module.
The Forms Module is a superset of the Basic Forms Module. These modules may not be used together in a single document type.
Implementations: DTD, XML Schema
The Basic Tables Module provides table-related elements, but only in a limited form. Specifically, the Basic Tables Module supports:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| caption | Common | (PCDATA | Inline)* |
| table | Common , width ( Length ), summary ( Text ) | caption?, tr+ |
| td | Common, abbr (Text), align ("left" | "center" | "right"), axis (CDATA), colspan (Number), headers (IDREFS), rowspan (Number), scope ("row" | "col"), valign ("top" | "middle" | "bottom") | (PCDATA | Flow - table)* |
| th | Common, abbr (Text), align ("left" | "center" | "right"), axis (CDATA), colspan (Number), headers (IDREFS), rowspan (Number), scope ("row" | "col" ), valign ("top" | "middle" | "bottom") | (PCDATA | Flow - table)* |
| tr | Common, align ("left" | "center" | "right"), valign ("top" | "middle" | "bottom") | (td | th)+ |
When this module is used, it adds the table element to the Block content set as defined in the Text Module.
The Basic Tables Module is a subset of the Tables Module. These modules may not be used together in a single document type.
Implementations: DTD, XML Schema
As its name suggests, the Tables Module provides table-related elements that are better able to be accessed by non-visual user agents. Specifically, the Tables Module supports the following elements, attributes, and content model:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| caption | Common | (PCDATA | Inline)* |
| table | Common, border (Pixels), cellpadding (Length), cellspacing (Length), frame ("void" | "above" | "below" | "hsides" | "lhs" | "rhs" | "vsides" | "box" | "border"), rules ("none" | "groups" | "rows" | "cols" | "all"), summary (Text), width (Length) | caption?, ( col* | colgroup* ), (( thead?, tfoot?, tbody+ ) | ( tr+ )) |
| td | Common, abbr (Text), align ("left" | "center" | "right" | "justify" | "char"), axis (CDATA), char (Character), charoff (Length), colspan (Number), headers (IDREFS), rowspan (Number), scope ("row" | "col" | "rowgroup" | "colgroup"), valign ("top" | "middle" | "bottom" | "baseline") | (PCDATA | Flow)* |
| th | Common, abbr (Text), align ("left" | "center" | "right" | "justify" | "char"), axis (CDATA), char (Character), charoff (Length), colspan (Number), headers (IDREFS), rowspan (Number), scope ("row" | "col" | "rowgroup" | "colgroup"), valign ("top" | "middle" | "bottom" | "baseline") | (PCDATA | Flow)* |
| tr | Common, align ("left" | "center" | "right" | "justify" | "char"), char (Character), charoff (Length), valign ("top" | "middle" | "bottom" | "baseline") | (td | th)+ |
| col | Common, align ("left" | "center" | "right" | "justify" | "char"), char (Character), charoff (Length), span (Number), valign ("top" | "middle" | "bottom" | "baseline"), width (MultiLength) | EMPTY |
| colgroup | Common, align ("left" | "center" | "right" | "justify" | "char"), char (Character), charoff (Length), span (Number), valign ("top" | "middle" | "bottom" | "baseline"), width (MultiLength) | col* |
| tbody | Common, align ("left" | "center" | "right" | "justify" | "char"), char (Character), charoff (Length), valign ("top" | "middle" | "bottom" | "baseline") | tr+ |
| thead | Common, align ("left" | "center" | "right" | "justify" | "char"), char (Character), charoff (Length), valign ("top" | "middle" | "bottom" | "baseline") | tr+ |
| tfoot | Common, align ("left" | "center" | "right" | "justify" | "char"), char (Character), charoff (Length), valign ("top" | "middle" | "bottom" | "baseline") | tr+ |
When this module is used, it adds the table element to the Block content set of the Text Module.
The Tables Module is a superset of the Basic Tables Module. These modules may not be used together in a single document type.
Implementations: DTD, XML Schema
The Image Module provides basic image embedding, and may be used in some implementations independently of client side image maps. The Image Module supports the following element and attributes:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| img | Common, alt* (Text), height (Length), longdesc (URI), src* (URI), width (Length) | EMPTY |
When this module is used, it adds the img element to the Inline content set of the Text Module.
Implementations: DTD, XML Schema
The Client-side Image Map Module provides elements for client side image maps. It requires that the Image Module (or another module that supports the img element) be included. The
Client-side Image Map Module supports the following elements:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| a& | coords (CDATA), shape ("rect" | "circle" | "poly" | "default") | n/a |
| area | Common, accesskey (Character), alt* (Text), coords (CDATA), href (URI), nohref ("nohref"), shape ("rect"* | "circle" | "poly" | "default"), tabindex (Number) | EMPTY |
| img& | usemap (IDREF) | n/a |
| input& | usemap (IDREF) | Note: Only when the Forms or Basic Forms module is included |
| map | I18N, Events, class (NMTOKEN), id* (ID), title (CDATA) | ((Heading | Block) | area)+ |
| object& | usemap (IDREF) | Note: Only when the object module is included |
When this module is used, the map element is added to the Inline content set of the Text Module.
Implementations: DTD, XML Schema
The Server-side Image Map Module provides support for image-selection and transmission of selection coordinates. It requires that the Image Module (or another module that supports the
img element) be included. The Server-side Image Map Module supports the following attributes:
| Elements | Attributes | Minimal Content Model | Notes |
|---|---|---|---|
| img& | ismap ("ismap") | n/a | |
| input& | ismap ("ismap") | n/a | When the Forms or Basic Forms Module is selected. |
Implementations: DTD, XML Schema
The Object Module provides elements for general-purpose object inclusion. Specifically, the Object Module supports:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| object | Common , archive (URIs), classid (URI), codebase (URI), codetype (ContentType), data (URI), declare ("declare"), height (Length), name (CDATA), standby (Text), tabindex (Number), type (ContentType), width (Length) | (PCDATA | Flow | param)* |
| param | id (ID), name* (CDATA), type (ContentType), value (CDATA), valuetype ("data"* | "ref" | "object") | EMPTY |
When this module is used, it adds the object element to the Inline content set of the Text Module.
Implementations: DTD, XML Schema
As its name suggests, the Frames Module provides frame-related elements. Specifically, the Frames Module supports:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| frameset | Core , cols ( MultiLength ), rows ( MultiLength ) | (frameset | frame)+, noframes? |
| frame | Core , frameborder ("1" | "0"), longdesc ( URI ), marginheight ( Pixels ), marginwidth ( Pixels ), noresize ("noresize"), scrolling ("yes" | "no" | "auto"*), src ( URI ) | EMPTY |
| noframes | Common | body |
When this module is selected, the minimal content model of the html element of the Structure Module is changed to (head, frameset).
Implementations: DTD, XML Schema
The content of a frame can specify destination targets for a selection. This module adds the target attribute to the area and link defining elements. This is defined as a separate
module so it can be included in documents that will be included in frames and documents that use the target feature to open a new window.
| Elements | Attributes | Notes |
|---|---|---|
| a& | target ( CDATA ) | |
| area& | target ( CDATA ) | When the Client-side Image Map Module is selected. |
| base& | target ( CDATA ) | When the Base Module is selected. |
| link& | target ( CDATA ) | When the Link Module is selected. |
| form& | target ( CDATA ) | When the Basic Forms or Forms module is selected. |
Implementations: DTD, XML Schema
The Iframe Module defines an element for the definition of inline frames. The element and attribute included in this module are:
| Elements | Attributes | Minimal Content Model |
|---|---|---|
| iframe | Core, frameborder ("1" | "0"), height (Length), longdesc (URI), marginheight (Pixels), marginwidth ( |