This draft includes a number of editorial changes based on the previous review draft. This satisfies the action item taken by Cynthia, Kerstin, and Wendy.
W3C published the Web Content Accessibility Guidelines 1.0 (WCAG 1.0) as a Recommendation in May 1999. This Working Draft for version 2.0 builds on WCAG 1.0. It has the same aim: to explain how to make Web content accessible to people with disabilities and to define target levels of accessibility. Incorporating feedback on WCAG 1.0, this Working Draft of version 2.0 focuses on checkpoints. It attempts to apply checkpoints to a wider range of technologies and to use wording that may be understood by a more varied audience.
This document is an editors' copy that has no official standing.
This document is prepared by the Web Content Accessibility Guidelines Working Group (WCAG WG) to show how more generalized (less HTML-specific) WCAG checkpoints might read. This draft is not yet based on consensus of the WCAG Working Group nor has it gone through W3C process. This Working Draft in no way supersedes WCAG 1.0.
This is a draft document and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress". A list of current W3C Recommendations and other technical documents is available.
The Working Group welcomes comments on this document at firstname.lastname@example.org. The archives for this list are publicly available. Archives of the WCAG WG mailing list discussions are also publicly available.
Patent disclosures relevant to this specification may be found on the WCAG Working Group's patent disclosure page in conformance with W3C policy.
This document has been produced as part of the W3C Web Accessibility Initiative (WAI). The goals of the WCAG WG are discussed in the Working Group charter. The WCAG WG is part of the WAI Technical Activity.
The text equivalent fulfills the same function as the author intended for the non-text content (i.e. it presents all of the intended information and/or achieves the same function of the non-text content). [I#332]
This includes (where audio descriptions and captions are provided per [@@ref checkpoint 1.2] a text document that merges all audio descriptions and captions into a collated script.
Editorial Note: CKW: we moved this from the additional note on checkpoint 1.2 because it seems to be fairly easy to do if audio descriptions and captions are provided. It is also the only way to make the audio/video accessible to someone who is deaf and blind. We are looking for feedback about the feasibility for authors and needs of users.
Individuals who are blind, have low vision, have cognitive disabilities or have trouble reading text for any reason can have the text read aloud to them.
Individuals who are deaf, are hard of hearing or who are having trouble understanding the audio information for any reason can read the text presentation or have it translated and presented as sign language by their assistive technology.
Individuals who are blind or deaf-blind can have the information presented in braille.
Example 1: an image used as a button. (short description of function)
A right arrow icon is used to link to the next slide in a slide show. The text equivalent is "Next Slide," so that what is read by a screen reader would be "link: Next Slide."
Example 2: a data chart. (short label + longer description)
A bar chart compares how many widgets were sold in June, July, and August. The short label says, "Figure one - Sales in June, July and August." The longer description identifies the type of chart or graph, provides a high-level summary of the data comparable to that available from the chart or graph, and lists the data themselves.
Example 3: an animation. (short label + longer description)
An animation shows how to tie a knot. The short label says, "An animation showing how to tie a square knot." The longer explanation describes the hand movements needed to tie the knot.
Example 4: an audio file of a speech. (short label + transcript)
An audio file is embedded in a Web page. The short label says, "Chairman's speech to the assembly." A link to a text transcript is provided immediately after the clip.
Example 5: an audio file of a symphony. (short label)
An audio file is embedded in a Web page. The short label says, "Beethoven's 5th Symphony performed by the Vienna Philharmonic Orchestra."
Editorial Note: CKW: example for "null" descriptions? e.g., null alt-text.
Editorial Note (06/10/03): There is discussion about moving some of the current success criteria from Required to Best Practice or to an Extended checkpoint. The issue stems from trying to apply the success criteria to every Web cam, newscast, and home broadcast. Another approach is to allow a conformance claim to state, for example, "All pages and applications on this site meet the Core checkpoints of WCAG 2.0 except the Web cam at http://example.org/webcam/."
an audio description is provided.
Editorial Note: CKW: Information about audio description was moved to the glossary since it seemed more like a definition of audio description than a constraint on the requirement.
all significant dialogue and sounds are captioned
If the Web content is real-time and audio-only and not time-sensitive and not interactive a transcript or other non-audio equivalent is sufficient.
Editorial Note: This exception also applies to item 3.
if the Web content is real-time video with audio, real-time captions are provided unless the content:
is a music program that is primarily non-vocal
Editorial Note: CKW: At what point does a music program become "primarily non-vocal?" Is it possible to caption real-time music programs that have vocals? For example, Shania Twain performing "live" at half-time of the Super Bowl is real-time video with audio and is a music program that has vocals. Is it required to caption the lyrics or is it ok to say, "Shania Twain performs her song XYZ?"
if the Web content is real-time non-interactive video (e.g., a Webcam of ambient conditions), either provide an equivalent that conforms to checkpoint 1.1 (e.g., an ongoing update of weather conditions) or link to an equivalent that conforms to checkpoint 1.1 (e.g., a link to a weather Web site).
Editorial Note: CKW: This checkpoint (1.2) is about synchronization of equivalents. Does it make sense to have a criteria about text equivalents that are not synchronized? Perhaps this fits best in Checkpoint 1.1?
if an audio-only or video-only presentation requires a user to respond interactively at specific times in the presentation, then a time-synchronized equivalent (audio, visual or text) presentation is provided.
Editorial Note: CKW: we weren't sure what this meant.
People who are deaf or have a hearing loss can access the auditory information through the captions.
People who are blind or have low vision as well as those with cognitive disabilities who have difficulty interpreting visually what is happening benefit from the audio descriptions of the visual information.
People without disabilities also benefit from the media equivalents.
People in noisy environments or with muted sound often use captions.
Captions are used by many to develop language and reading skills.
Audio descriptions also provide visual information for people who are temporarily looking away from the video presentation such as when following an instructional video and looking at their hands.
Captions and text descriptions can also be used to index and search media files.
Example 1: a movie clip with audio description and captions.
A clip from a movie is published on a Web site. In the clip, a child is trying to lure a puppy to the child's bedroom by laying a trail of crumbs. The child mumbles inaudibly to himself as he lays the trail. When not watching the video, it is not obvious that he is laying a trail of crumbs since all you hear is the mumbling. The audio description that is interspersed with the child's mumbling says "Charlie lays a crumb on each stair leading to his room." The caption that appears as he mumbles is, "[inaudible mumbling]."
Example 2: a video clip of a news story.
A video clip accompanies a news story about the recent flooding in a major city. The reporter describes what is seen, for everyone. No audio description is necessary. The captions display what the reporter is saying.
Example 3: a silent animation.
An animation shows a pantomime climbing a ladder. There is no audio track for this animation. No captions or audio description are required. Instead, a text equivalent is provided as described in checkpoint 1.1.
the following can be derived programmatically (i.e. through a markup or data model that is assistive technology compatible) from the content without requiring user interpretation of presentation. [I#337]
any hierarchical elements and relationships, such as headings, paragraphs and lists
any non-hierarchical relationships between elements such as cross-references and linkages, associations between labels and controls, associations between cells and their headers, etc.
any information presented using color is also available without color (e.g. through context or markup). [I#317]
text content that is presented over a background image or pattern is implemented using mechanisms that allow the user to display the text without the background image or pattern.
Editorial Note: CKW: We moved this success criteria from the extended visual/audio contrast checkpoint. That checkpoint has been divided into 2 contrast checkpoints: one for auditory contrast the other for visual contrast. However, this could be a benefit of separating content from presentation rather than a requirement. The difference between this criteria and the contrast criteria is that this applies to text as text (i.e., "text content") and the other applies to images (not photos) including text in raster images (used to display text in non-standard fonts).
Separating content and structure from presentation allows Web pages to be presented differently to meet the needs and constraints of different users without losing any of the information or structure. For example, information can be presented via speech or braille (text) that was originally intended to be presented visually.
It can also facilitate automatic emphasis of structure or more efficient navigation.[I#340]
All of these can benefit people with cognitive, physical, hearing, and visual disabilities.[I#340]
Example 1: a multi-column document.
A document is marked up with headings, paragraphs and other structural features. It is presented visually in three columns. The markup that creates the columns is separate from the markup that specifies the logical structure of the document.
Example 2: a scrolling list of stock prices.
Current stock quotes are scrolled horizontally across the screen. The data are separate from the methods used to scroll the text across the page.
Example 3: a 3-dimensional site map.
A custom user interface renders 3D visualizations of the pages on a site and how they relate to one another from a data source. Any hierarchical relationships, groupings, cross-references, etc. would originate in the data source so that alternate interfaces could be rendered (from the same source) that expose the structure of the site in an accessible form. (See also checkpoint 4.3)
Example 4: a list that allows users to sort information on a page according to preference.
A script allows a user to rearrange a categorical listing of music files by date, artist, genre, or file size. The script updates both the structure and the presentation accordingly when generating alternate views.
Editorial Note: CKW: Positioning and labels are not mentioned in the success criteria, thus delete from the checkpoint. "to more people" seems redundant with the underyling goal of the guidelines.
the structural elements present have a different visual appearance or auditory characteristic from each other and from body text.
Editorial Note: CKW: need definitions for "visual appearance" and "auditory characteristic."
Presentation that emphasizes structure:
enables users with cognitive and visual disabilities to orient themselves within the content,
enables all users to move quickly through the content and notice major content divisions
enables all users, but particularly users with visual or cognitive disabilities to focus on important content,
enables all users, but particularly users with visual or cognitive disabilities to distinguish the different types of content.
Example 1: documentation for a product.
Identifying chapters in the structure of a book is appropriate and accepted use of labeling the structure. Within the chapters, headings identify (label) changes in context and highlight ideas contained in the following text. Subtle differences between the appearance of the chapter title and the section headings helps the user understand the hierarchy and relationship between the title and headings. The only difference might be font size and margin indentation when presented visually, and spoken in a difference voice or preceded by a sound when presented auditorily.
Example 2: a data table.
Groups of rows or columns are labeled with headers.
Editorial Note: CKW: This example is about labels, but the checkpoint is about presentation of structure.
Example 3: an audio presentation.
An audio rendering of a document, generated according to a style sheet, uses a different, more formal voice to read titles and headers so the listener can easily identify the words as a title and not part of the running text.
Editorial Note: CKW: This checkpoint used to address auditory and visual contrast. We moved audio contrast to a separate checkpoint.
text that is presented over a background color or grayscale has a mechanism that allows the text to be presented in a fashion that has a contrast greater than ______ between text and background color as measured by ______.[I#344]
Editorial Note: CKW: Refer to recent threads about color contrast and color "closeness" algorithms. Propose including a list of color combinations to avoid or perhaps using one of the algorithms discussed or some combination of the two.
Individuals with low vision can easily make out characters in the content even if they don't have the wide field of view used by fully sighted persons to separate text from background images.
Editorial Note: CKW: Instead of "field of view" should it be "even if they do not have the ability to perceive the full range of colors needed to separate text from background images and colors" ?
Example 1: a background image on a page.
A background image and text are arranged so that there is no image behind the text or the image is so faint that the difference between the darkest part of the image and the text (which is dark) meets the standard foreground/background contrast requirements. The image behind the text also does not contain lines that are about the same width as the characters so they do not interfere with character recognition.
audio content does not contain background sounds OR the background sounds are at least 20 db lower than the foreground audio content.
A 20 db difference in sound level is roughly 4 times quieter (or louder).
Individuals with hearing impairments that limit their ability to hear all of the frequencies of speech can make out the words from the sounds they can hear because they are not mixed with residual sounds from the music.
Example 1: speech over background sounds.
Because speech is often naturally mixed with background sounds (movies, live news etc) and cannot be easily removed or separated, captions are provided (under checkpoint 1.2) to make dialog understandable. However not all people can see or read the captions. Where speech is mixed or recorded so that it is at least 20 db above any background sounds people do not need to rely on captions to understand the dialog.
all of the functionality of the content, where the functionality or its outcome can be expressed concisely in words, is operable at a minimum through a keyboard or keyboard interface.
refer to checkpoint 4.3 for information regarding user agent support.
Editorial Note: CKW: "expressed concisely in words" is subjective and not defined.
Individuals who are blind (and cannot use pointing devices) can have access to the functionality of the Web content or site.
Individuals with severe physical disabilities can use speech input (which simulates keystrokes) to both enter data and operate the interface elements on the page.
Example 1: operation with multiple input devices.
The content relies only on focus-in, focus-out, and activation events; these are defined in the API of the environment for which the content is written, and are intended to be operable by a variety of input devices, including pointing devices, keyboards and speech input systems.
Example 2: examples of Web content that would and would not be operable from a keyboard or keyboard interface
If it's written to be operable from a computer keyboard, it conforms. (because it is operable from the keyboard.)
If it's written to be used on a device that doesn't usually have a keyboard such as a cell phone and but it can be controlled by an optional keyboard for that device, it conforms. (A person who needs a keyboard - or alternate keyboard - can use it to control the application.)
If it's written to be used with a device that doesn't have a keyboard, but it could also be used by similar devices that do and it would work with their keyboard, it conforms. (A person who needs a keyboard would not buy the device without the keyboard. That device may itself not be considered accessible. But the content can be controlled from a device with a keyboard and therefore conforms to this checkpoint.)
If it's written to work with devices that do not have keyboards and it can not be used by any other devices that do have keyboards, then it does not conform. (It cannot be accessed via keyboard.)
at least one of the following is true for each time limit:
the user is allowed to deactivate the time limits,
or the user is allowed to adjust the time limit over a wide range which is at least ten times or default setting or average user's preference,[I#347]
or the user is warned before time expires and given at least 10 seconds to extend the time limit,
or the time limit is due to a real-time event (e.g. auction) and no alternative to the time limit is possible,
or the time limit is part of a competitive activity where timing is an essential part of the activity (e.g. competitive gaming or time based testing).
People with reading disabilities, cognitive disabilities, and learning disabilities often need more time than most people to read and comprehend written text. People with physical disabilities might not be able to move quickly or accurately enough to interact with moving objects.
Content that is updated often might not be processed and read in time or in the proper order by an assistive technology or voice browser.
Examples of content that requires comprehension or a response within a timed interval[I#348]:
blinking or scrolling text
dialog that disappears after a short period
shutdown or deactivation of page if activity is not received in a set amount of time
Example 1: blinking text.
Client-side scripting is used to create blinking text. The user can deactivate the use of scripting in his or her browser or override the use of scripts with a user style sheet.
Example 2: a news site that is updated regularly.
A news site causes its front page to be updated every 1/2 hour. The front page contains minimal text and primarily consists of links to content. A user who does not wish the page to update selects a checkbox. The checkbox is in the "user preferences" portion of the site which is one of the first links on each page.
Editorial Note (06/10/03): This Checkpoint is currently included in the Core set of Checkpoints because the WCAG WG expects that it will be possible to test content for flicker and the result will be a flicker rate in Hz that can be stored in a machine-readable format. The WG also expects that a user could, in the future, be able to configure a client to only render content that flickers at "a rate greater than X" by which the client could compare the content metadata and the user's preference. If the assumption regarding a testing tool does not hold at time of final review of these guidelines, this checkpoint will be moved to the Extended set of Checkpoints."
At least one of the following is true:
content was not designed to flicker (or flash) in the range of 3 to 49 Hz.
Editorial Note: CKW: how should we incorporate the information presented by Dr. Harding that flicker can occur in less than x degrees (or y percent) of the field of vision and it is not an issue?
if flicker is unavoidable, the user is warned of the flicker before they go to the page, and as close a version of the content as is possible without flicker is provided.
Editorial Note: We would like to include a third criteria here that would state that a test that was conducted and the pages passed. No test or tool exists yet though. We're looking into how such a test and/or tool might be designed.
Individuals with photosensitive epilepsy can have seizures triggered by flickering or flashing in the 3 to 49 flashes per second (Hertz) range with a peak sensitivity at 20 flashes per second.
Individuals who are easily distracted may not be able to focus on page content with flicker occurring in the same visual field.
a user is able to move through the content in an order that follows the visual layout or follows the order the page is read.
Editorial Note: CKW: trying this out instead of "logical" - hopefully a bit more testable? This was discussed at the 27 August telecon. Wendy took an action to create a proposal and has not completed it yet.
Editorial Note: CKW: The success critieria seemed to be techniques rather than testable criteria. We don't have proposed text at this time to replace what has been moved to techniques. There is clearly overlap between this checkpoint and 1.3 (separate presentation and structure). One side effect of good structure is the ability of a user agent to provide navigation mechanisms. Thus, the distinctions between this and 1.3 need to be clear. There are differences between page level, site level, and application level navigation that ought to be clarified or handled separately.
Editorial Note: CKW: The first group of benefits about logical structure, continue the confusion between this checkpoint and 1.3 and further make this feel like a technique for 1.3 rather than a separate checkpoint.
When the logical structure is provided in markup or a data model,
Users with physical disabilities can use structure to more easily jump between paragraphs, chapters, sections etc.
Users with cognitive disabilities can use structure (chapter titles, headers, etc.) to provide more context for the text that follows them. They also provide warning of a change in context and reorient the user to the new focus.
Users with blindness or low vision can jump from header to header to get an overview or to more quickly "skim" to the section they are interested in.
Readers with low vision can sometimes (depending on display technology) change how chapter titles and headers are displayed to make them more visible -and easier to use when skimming the document.
the content can be presented on a variety of devices because the device software can choose only those elements of the content that it is able to display and display them in the most effective way for that device.
Providing different navigation mechanisms can provide a better match between different people's skills, background knowledge, visual vs. text orientation, and the type of information they are seeking at the moment.
Individuals with cognitive disabilities may find it easier to ask for what they want than to deduce its location from categorical choices.
Individuals with low vision or blindness may find search techniques that fetch everything that relates to a topic of interest to be easier than techniques that require them to scan lists or pages for the items.
Example 1: a physics dissertation.
A dissertation contains well-defined sections such as "Abstract," "Table of Contents," "Chapter 1," etc. The pieces in each section (paragraphs, subheadings, quotes) are denoted with structural markup.
Example 2: a scalable image of a bicycle.
Lines and a circle (spokes and rim) are grouped into a "wheel." Lines in a triangle that attach to each wheel are grouped into a "frame."
Example 3: user interface.
User interface controls are divided into organized groups.
if an user error is detected, the error is identified and (where possible) suggestions for correction are provided (in an accessible form that meets core checkpoints). [I#349]
Editorial Note: CKW: Since this is an extended checkpoint, we included the Best Practice to provide suggestions for corrections. As it was, this checkpoint only required a notification of the error which could be satisfied with a "404" on a website.
where consequences are significant and time-response is not important, one of the following is true
actions are reversible where possible
where not reversible, actions are checked for errors in advance
where not reversible, and not checkable, a confirmation is asked before acceptance
Individuals with writing disabilities and people with dyslexia often have difficulty writing text in forms or other places that need text input.
Individuals with speech disabilities might not be recognized properly in voice input applications.
passages or fragments of text occurring within the content that are written in a language other than the primary natural language of the content as a whole, are identified, including specification of the language of the passage or fragment.
Editorial Note: CKW: We discussed moving this to extended since it is difficult to test and to generate, particularly for multi-national organizations. For example, a bank that does business in a variety of languages between an array of countries that is generating content from large databases.
Phrases from various languages, acronyms and abbreviations are often interspersed in writing. When these phrases are identified, a speech synthesizer can voice text with the appropriate accent and pronunciation. When they are not identified, the speech synthesizer will use the default accent and pronunciation of the language on the rest of the page, which can make the phrase unintelligible. Identifying changes in language and marking abbreviations and acronyms as such will also allow a tool to ask for automatic translations of that content. When editing content, authoring tools can switch between appropriate spelling dictionaries.
Editorial Note: CKW: This mentions abbreviations and acronyms, but that is covered in a separate checkpoint.
Editorial Note: CKW: We need the braille benefit that Jason has described. Also, what about automatic sign language interpretation as a future beneficiary?
Example 1: a French phrase in an English sentence.
In the following sentence, "And with a certain je ne sais quoi, she entered both the room, and his life, forever." the French phrase "je ne sais quoi" is marked as French. Depending on the markup language, English may either be marked as the language for the entire document except where specified, or marked at the paragraph level.
Editorial Note: CKW: Combined core 1.4 with this one and it is now an extended checkpoint; proposing that it become an extended checkpoint is a known and controversial issue with this proposal. However, the category of this checkpoint (core or extended) is an issue no matter which organization the working group decides to use. The wording of this checkpoint needs work.
abbreviations and acronyms are clearly identified each time they occur or are available in a glossary that is explicitly associated with the content.
Editorial Note: CKW: Were not sure what to do with these two controverisal phrases: "if they collide with a word in the standard language that would also logically appear in the same case (e.g. all caps). (See also checkpoint 3.1) [I#341]" and "do not appear first in standard unabridged dictionaries for the language or define the first time the first time they appear or are available in a glossary on the site.[I#330]."
symbols such as diacritic marks that are found in standard usage of the natural language of the content, and that are necessary for unambiguous identification of words, are present or another standard mechanism for disambiguation is provided.
Editorial Note: If a standard format for doing it can be achieved, we might require that linkages to glossaries for all abbreviations and acronyms that are created by the author or site be provided. We could also recommend that linkages to any abbreviations, acronyms, etc. used by the authors also be provided. We could also have a weaker recommendation for acronyms and abbreviations appearing on the site that linkages to glossaries explaining all abbreviations acronyms, etc. that appear in any documents on the site be provided.
Summarizing information that is difficult to understand helps people who do not read well.
Editorial Note: CKW: what happened to summaries? Seems to have moved to 3.3?
Providing a summary of the visual cues that show relationships between complex information helps people who do not use visual cues or who have difficulty using visual cues. For example, people who are completely blind do not use any visual cues, while people with dyslexia or with low vision might have difficulty interpreting visual cues.
Editorial Note: CKW: another summary benefit, but doesn't seem to apply to this checkpoint.
Defining key terms and specialized language will help people who are not familiar with the topic.
Providing the expansion of abbreviations and acronyms not only helps people who are not familiar with the abbreviation or acronym but can clarify which meaning of an abbreviation or acronym is appropriate to use. For example, the acronym "ADA" stands for both the American with Disabilities Act as well as the American Dental Association.
Facilitating unambiguous decoding of characters and words in content is also helpful for individuals who are learning to read or learning a second language.
Editorial Note: CKW: this was taken from the previous 1.4 checkpoint and may not fit in this new context.
Example 1: an abbreviation in a document.
The following appears in a paragraph, @@ example of an abbreviation that when a screen reader attempts to pronounce, it doesn't make sense.
Example 2: @@use of hebrew characters w/out vowels
@@include example about diacritic marks.
Editorial Note: CKW: no magic bullet (i.e., we don't have a proposed solution for this one. Still working on taking Lisa's last proposal and reworking it. There is overlap between several checkpoints and they all seem to depend on using semantic markup or a style guide. Jason took an action item at the 4 Sept telecon that might resolve part of this issue.
the content has been reviewed, taking into account the following strategies for evaluating the complexity of the content, applying them as appropriate.
familiarity of terms and language structure
reasonableness of length and complexity of sentences
coherence of paragraphs (and sensibility in length)
clarity of headings and linked text when read out of context
accuracy and uniqueness of page titles
care in the use of all-capital letters where normal sentence case might increase comprehension
inclusion of non-text content to supplement text for key pages or sections of the site where they felt it was appropriate.
the content has been reviewed, taking into account the following strategies for evaluating the complexity of the content, applying them as appropriate.
use of sentence structures that increase understanding
such as active voice in languages where this form helps convey information
length of noun phrases
strings of no more than three or four nouns are easiest to understand
clarity of reference with pronouns and anaphoric expressions (these refer back to something already said in the text)
example of potential ambiguity: "Scientists study monkeys. They eat bananas."
correct use of conjunction forms and adverbs to make explicit the relationship between phrases or parts of the text
such as "and," "but," "furthermore," "not only"
complexity of verb tenses
do the tenses used in a document seem overly complicated?
intelligibility of verb phrases
familiarity of idioms or slang
logic in the order and flow of information
consequences of ambiguity or abstraction
improved readability of vertical lists might offer in place of long paragraphs of information
use of summaries to aid understanding
thoroughness in the explanation of instructions or required actions
consistency in the use of names and labels
clarity where the document:
explains choices and options
labels options to get more information
instructs users how to modify selections in critical functions (such as how to delete an item from a shopping cart)
proper markup to highlight key information
goal-action structure for menu prompts
default settings (and the ease in re-establishing them)
two-step "select and confirm" processes to reduce accidental selections for critical functions
calculation assistance to reduce the need to calculate
testing with potential users for ease of accessibility
use of a controlled language
providing support for conversion into symbolic languages
adding non-text content to the site for key pages or sections specifically to make the site more understandable by users who cannot understand the text only version of the site.
All users, especially those with cognitive, learning, and/or reading disabilities benefit from the use of clear and simple writing. This should not discourage you from expressing complex or technical ideas.
Using clear and simple language also benefits people whose first language differs from your own, including those people who communicate primarily in sign language.
Sounds, graphics, videos and animations can help make concepts presented in a Web site easier to understand, especially for people with cognitive, reading, or learning disabilities or those who are unfamiliar with the language of the text of the site.
Designers need to be cautious in deciding when to use illustrations. Reading a picture is probably a learned activity that is easier for some than others. Some users skip the pictures; others read only the pictures. Designers must also recognize that visual conventions are not universal and that individuals develop their own mental schema and expectations in interpreting visual information.
Example 1: a description of a process.
A page describes how to learn to play soccer. Each step in learning the fundamentals of the game is illustrated with a photograph of a player doing what is described in the text.
Example 2: a concrete concept.
The primary concept on a page is concrete. It is discussing Mt. Pinatubo. It includes both a description of the 1991 eruption as well as photos of the eruption and the aftermath. It links to another site that contains video and another site that contains a 3D simulation of what happened underneath the crust and within the volcano during the eruption.
Example 3: child's report of school trip.
A child went with her school on a trip to a bicycle manufacturing plant. She wrote a report for her family and friends to post to the Web. In the report, she includes the company logo as well as a picture of a bicycle on the assembly line. She links to the company Web site for more information. She includes photos she took of the plant.
Example 4: stock trading data.
A news site is comparing the performance of the economy from 3rd quarter of this year with 3rd quarter from the last 3 years. They compare prices of the most popular stocks. They present the data in a bar graph with a link to the raw data they used to create the bar graph.
Example 5: history of music.
A grandfather's hobby is listening to and playing music. He creates a Web site that includes examples of many different types of music and musical instruments. When describing specific types of music, he links to a short sound clip.
Editorial Note: CKW: In particular, the current wording does not seem testable. Words such as, "key," "consistent," "predictable," "inconsistent," and "unpredictable" are subjective. There seems to be overlap with other checkpoints. Success criterion 1 seems to relate more to navigation mechanisms. Success criterion 2 seems to relate more to custom interfaces (4.3). "Extreme changes in context" is a slippery term that seems to refer primarily to pop-ups and to content changes that occur away from the current point of focus. The Additional Notes are littered with "similar," "same," "likely," and "familiar" however in some cases appear to be more testable.The idea of "user interface components" only seems to show up in the Additional Notes and should be moved up to required success criteria.
key orientation and navigational elements (such as navigation bars) are generally found in one or two consistent locations or their locations are otherwise predictable.
where inconsistent or unpredictable responses are essential to the function of the content (e.g. mystery games, adventure games, tests, etc.) the user is warned in advance of encountering them.
wherever there are extreme changes in context, one of the following is true:
an easy to find setting, that persists for the site visit, is provided for the user to deactivate processes or features that cause extreme changes in context or
extreme changes in context are identified before they occur so the user can determine if they wish to proceed or so they can be prepared for the change
Individuals who are unable to detect extreme changes in context or may not realize that the context has changed are less likely to become disoriented while navigating a site. This applies to people in the following ways:
Individuals who are blind or have low vision may have difficulty knowing when a visual context change, such as a new window popping up, has occurred. In this case, warning users of context changes in advance minimizes confusion when the user discovers that the back button no longer behaves as expected.
Using captions to note changes in speaker is beneficial for individuals who are deaf or hard of hearing and who may be unable to discern changes in speaker for audio-only presentations.
Some individuals with low vision, with dyslexia and who have difficulty interpreting visual cues may benefit from additional cues in order to detect extreme changes in context.
Providing consistent and predictable responses to user actions is important feedback for the user. This lets them know that your site is working properly and encourages them to continue interacting with the content. When the user receives an unexpected response, they might conclude that something is wrong or broken. Some people might get so confused they will not be able to use your site.
Example 1: a form to deactivate pop-up windows.
A checkbox is provided on a page of links to let the user select whether they want the resultant pages to appear in new windows or not.
Example 2: a warning given before a pop-up window.
At the end of a news story, several links are provided for more information. At the beginning of each link is an icon of an arrow with the text equivalent, "Link will open in new window."
Example 3: frames that do not track history making the back button behave unexpectedly.
Example 4: forms.
Editorial Note: Some of these examples are very brief. Should they be expanded and clarified with further details?
Editorial Note: CKW: This checkpoint only applies to markup. Should we clarify in some way?
except where a specification was violated for backward compatibility, the markup has:
passed validity tests of the language (whether it be conforming to a schema, Document Type Definition (DTD), or other tests described in the specification)
structural elements and attributes are used as defined in the specification
accessibility features are used
This checkpoint further emphasizes that following specifications increases the likelihood of accessible content. While other checkpoints refers to individual pieces of content, this checkpoint takes a step back to look at the broad picture. It also exists to help cover future technologies or issues that we did not anticipate at the time of writing this checkpoint. Thus, the benefits of following specifications are primarily that assistive technologies and user agents can render the content according to spec.
Editorial Note: CKW: This was checkpoint 4.3 ("Technologies used for presentation and user interface support accessibility or alternate versions of the content are provided that do support accessibility.") We moved it to core because direct accessibility of plug-ins and programmatic objects should be required and if not, provide an alternative.
any applications with custom user interfaces conform to at least Level A of the User Agent Accessibility Guidelines 1.0. If the application cannot be made accessible, an alternative solution is provided that conforms to WCAG 2.0. Determining which UAAG 1.0 checkpoints to apply is as follows:
If the application renders visual text, it should conform to the VisualText checkpoints.
If the application renders images, it should conform to the Image checkpoints.
If the application renders animations, it should conform to the Animation checkpoints.
If the application renders video, it should conform to the Video checkpoints.
If the application renders audio, it should conform to the Audio checkpoints.
If the application performs its own event handling, it should conform to the Events checkpoints.
If the application implements a selection mechanism, it should conform to the Selection checkpoints.
If the application implements voice or pointer input, it should conform to the Input Modality checkpoints.
accessibility conventions of the markup or programming language (API's or specific markup) are used (@@in UAAG somewhere?)
plug-ins required to access the content conform to at least Level A of UAAG 1.0. If required plug-ins are not accessible, an alternative solution is provided that conforms to WCAG 2.0.
Editorial Note: CKW: Passed this by Ian. He says it is headed in the right direction but needs some tweaking.
Editorial Note: CKW: needs to be updated to reflect changes in checkpoint and success criteria.
Authors who utilize technologies designed to support accessibility will:
encounter fewer challenges when implementing these guidelines
avoid the need to create custom solutions and workarounds to address accessibility concerns
avoid the need to provide accessible alternate versions for content rendered in a technology that does not fully address these guidelines
Individuals who rely on assistive technologies to access the Web will be able interact with the content.
Individuals who access the Web with older technologies or alternative browsing devices such as PDAs and cell phones also benefit from the inclusion of accessible alternatives to custom user interfaces.
Editorial Note: CKW: "widely" is ambiguous (note: there is a glossary entry for "widely available" that is not yet defined). The benefits talk about "baseline" but the idea has disappeared from the success criteria.
the Web resource includes a list of the technologies (other than standard HTML) users must have in order for its content to work as intended. Users who do not have one or more of these technologies can still access and use the resource, though the experience may be degraded.[I#351]
When determining your list of technological requirements, consider that assistive hardware and software is often slow to adapt to technological advances, and the availability of assistive technology varies across natural languages. Verify that assistive technology compatible with the technologies you choose is available in the natural language(s) of your content.
Editorial Note: This checkpoint is currently in the set of extended checkpoints. The implications of this are that there is no core checkpoint that says content must transform gracefully or that it must be backwards compatible. However, if the set of core checkpoints is designed well, core conformance would result in content that transforms gracefully. This checkpoint might be too subjective or difficult to test and may be deleted.
Benefits of determining and documenting baseline user agent requirements:
Individuals can identify (either through site documentation or automatically through metadata) whether or not they are likely to be able to use a site. In conjunction with a search engine or a proxy server, this could be used to automatically filter out sites a user can not access or to automatically filter to the top sites that would be most usable.
Requiring sites to document their baseline will cause them to evaluate assumptions about user agents and will minimize the number of sites that are inadvertently inaccessible because they are unaware of backward compatibility issues.
Benefits of designing for backward compatibility:
Individuals who must use alternative browsing technologies and devices will be able to access the content.
Individuals who can not afford or otherwise do not have access to newer technologies also benefit from backward compatibility in that they will not need to purchase upgrades or equipment as often.
Example 1: an online store.
By documenting minimum user agent requirements, the store makes it possible for people using particular technologies to determine if they are going to have trouble using the store or its checkout mechanism without having to go through the entire process of shopping and checkout only to find out that they are unable to complete their transaction at the end. They can, therefore, shop at stores they can be successful at.
Example 2: an Intranet site.
A large company was concerned about the ability to address individuals at many diverse sites that have different technology bases. They have, therefore, created two versions of their content and documented the requirements for each, making it easy for individual sites to determine which version would work for their technologies.
the markup has:
passed validity tests of the language (whether it be conforming to a schema, Document Type Definition (DTD), or other tests described in the specification)
structural elements and attributes are used as defined in the specification
accessibility features are used
no deprecated features are used
Editorial Note: The WCAG WG has not tackled the definitions of the terms that we are using and acknowledges that we sometimes use terms inconsistently. We need to coordinate our terms and definitions with the WAI Glossary and are working on proposals for a variety of definitions. We have been looking at the UAAG 1.0 glossary and other glossaries within the W3C.
content that can be expressed accurately and unambiguously in a reasonable number of words (for example, diagrams, charts, illustrations, etc.) Content such as a musical performance or visual artwork is considered "content that can not be expressed in words," since this type of content relies heavily on the visual (or auditory) experience.[I#320]
An audio description is a verbal description of all significant visual information in scenes, actions, and events that cannot be perceived from the sound track alone to the extent possible given the constraints posed by the existing audio track and limitations on freezing the audio visual program to insert additional auditory description. (( are equivalents of visual information from actions, body language, graphics, and scene changes that are voiced (either by a human or a speech synthesizer) and synchronized with the multimedia presentation.))
Note: When adding audio description to existing materials, the amount of information conveyed through audio description is constrained by the amount of space available in the existing audio track unless the audio/video program is periodically frozen to insert audio description. However, it is often impossible or inappropriate to freeze the audio/visual program to insert additional audio description.
@@ are text equivalents of auditory information from speech, sound effects, and ambient sounds that are synchronized with the multimedia presentation.
@@ that provides dialog, important sounds and important visual information in a single text document
A competitive activity is an activity where timing is an essential part of the design of the activity. Removal of the time element would change the performance of the participants. Versions of the activity (e.g. test) that have no time basis or time limits might be preferred and may be required for some venues but this would require a complete redesign of the activity (e.g. test) and may change the character and validation methodology and would therefore not fall under these guidelines.
Content is considered complex if the relationships between pieces of information are not easy to figure out. If the presentation of the information is intended to highlight trends or relationships between concepts, these should be explicitly stated in the summary.
Examples of complex information:
concepts that are esoteric or difficult to understand,
content that involves several layers.
Controlled languages use a restricted vocabulary taken from natural language. The purpose is to make texts easier to understand and translate. Standards generally limit words to a single meaning and prescribed part of speech. Complex syntax is avoided. Information about controlled language applications is available on the World Wide Web.
A feature is a specific component of a technology, for example an element in a markup language or a function call in an Application Programming Interface. Typically, a given feature may only be available in specific versions of the technology, and thus may need to be noted explicitly in the required list.
Functionality is the function or purpose of the content. The functionality or purpose of content can include presentation of information, as well as any other functionality such as data collection, securing a response from the user, providing user experience, testing, confirmation, purchasing, etc.[I#338]
A keyboard interface is the point where the application accepts any input that would come from the keyboard (or optional keyboard).
Mechanisms that cause extreme changes in context include:
opening a new browser window unexpectedly and without any nonvisual cue (back button suddenly appears nonfunctional)
in an auditory presentation, the speaker changes with no visual cue and no notation in captions
captions that do not identify a change in speaker
Common user actions include:
use of browser navigation buttons (e.g. back and forward)
opening new browser windows
Common responses to user actions include:
loading a new page
exposing/concealing content based on mouse position or keyboard focus
displaying the contents of a menu (auditorily or visually)
displaying pop-up menus or windows
submitting a form
It is important that responses to user actions be predictable and sensible to the end user and that interactions are consistent, both throughout the site and with commonly used interaction metaphors used throughout the Web.
Natural languages are those used by humans to communicate, including spoken, written, and signed languages.
non-text content includes but is not limited to images, text in raster images, image map regions, animations (e.g., animated GIFs), ASCII art, images used as list bullets, spacers, graphical buttons, sounds (played with or without user interaction), stand-alone audio files, audio tracks of video, and video.
Non-text content includes images, text in raster images, image map regions, animations (e.g., animated GIFs), applets and programmatic objects, ASCII art, scripts, images used as list bullets, spacers, graphical buttons, sounds (played with or without user interaction), stand-alone audio files, audio tracks of video, and video.
The term operable includes the concept of efficiency. That is, it implies that the device can be operated from the keyboard in a reasonably efficient fashion. Using MouseKeys or having to tab dozens of times to move through a small section of a document or page, or other unreasonably inefficient keyboard access would not qualify. If a document has a very large number of links, some mechanism other than tabbing through them one at a time needs to be provided. This might include provision of headers (for header navigation), the use of skip navigation, links, etc.[I#346]
Presentation is the rendering of the content and structure in a form that can be sensed by the user.
Real-time events are those that are based on the occurrence of events in real-time where the events are not under the control of the author.
A site navigation mechanism is a mechanism for easily orienting and moving about within the site. Site navigation mechanisms include but are not limited to:
A home page with hyperlinks on it and subsequent pages that link to the other pages at the site
dynamic fisheye views showing all linked pages or topics related to any page.
3-D virtual representations of site content
Structure includes both hierarchical structure of the content and non-hierarchical relationships such as cross-references, or the correspondence between header and data cells in a table.
The structure of content represents changes in context. For example,
A book is divided into chapters, paragraphs, lists, etc. Chapter titles help the reader anticipate the meaning of the following paragraphs. Lists clearly indicate separate, yet related ideas. All of these divisions help the reader anticipate changes in context.
A bicycle is divided into wheels and a frame. Further, a wheel is divided into a tire and a rim. In an image of the bicycle, one group of circles and lines becomes "wheel" while another group becomes "frame."
A technology is a
markup or programming language
application Programming Interface (API)
or communication protocol
A text equivalent
serves the same function as the non-text content was intended to serve.
communicates the same information as the non-text content was intended to convey.
may contain structured content or metadata.
Note: Text-equivalents should be easily convertible to braille or speech, displayed in a larger font or different colors, fed to language translators or abstracting software, etc.
A time-dependent presentation is a presentation that
is composed of synchronized audio and visual tracks (e.g., a movie), OR
requires the user to respond interactively at specific times in the presentation.
Content might be unfamiliar if you are using terms specific to a particular community. For example, many of the terms used in this document are specific to the disability community.
@@definition of "widely available" should be added here to include something which is low cost and available in many?/most? countries/languages.
Since the release of WCAG 1.0 in May 1999, the WCAG Working Group has received feedback on priorities of checkpoints, the usability of the set of documents, and requests for clarifications on the meaning of specific checkpoints and what is needed to satisfy them. Thus, it is intended that WCAG 2.0, when it eventually becomes a W3C Recommendation:
will be more efficiently organized,
may adjust the priority of some checkpoints,
may modify, remove, or add requirements due to changes in Web technologies since the publication of WCAG 1.0,
will incorporate the Errata from WCAG 1.0,
will reflect the experience gained in implementing WCAG 1.0.
For a checkpoint by checkpoint comparison, refer to the Checkpoint Mapping Between WCAG 1.0 and the WCAG 2.0 Working Draft.
We hope that WCAG 2.0 will have several improvements over WCAG 1.0. While the primary goal of WCAG 2.0 is the same as WCAG 1.0 (to promote accessibility of Web content) additional goals for WCAG 2.0 include improvements that will:
Ensure that requirements may be applied across technologies
Ensure that the conformance requirements are clear
Ensure that the deliverables are easy to use
Write to a more diverse audience
Clearly identify who benefits from accessible content
Ensure that the revision is "backward compatible" with WCAG 1.0
For more information about the intended improvements in WCAG 2.0 Working Draft, please refer to Requirements for WCAG 2.0.