Using the track element to provide captions



  • New technique.

This technique would be listed as one of the technology-specific techniques implementing G87, listed in the third bullet of sufficient techniques in Understanding 1.2.2.

  • Added to XML as H95


  • HTML5

This technique relates to:

User Agent and Assistive Technology Support Notes


The objective of this technique is to use the HTML5 track element to specify a captions timed text track for a video element. Caption timed text tracks contain transcription or translation of the dialogue, sound effects, relevant musical cues, and other relevant audio information, suitable for when sound is unavailable or not clearly audible.

The src attribute of the track element is a URL that is the address of the text track data.

The kind attribute of the track element indicates the kind of information in the timed text. captions text tracks provide a text version of dialogue and other sounds important to understanding the video. subtitles contain only the dialogue. If other audio information is important to understanding the video, a subtitle track will not be sufficient to meet the success criteria.

Note: Some regions use the term "subtitle" for any visible text representation of the audio track. An author may mark up a timed text track in the language of the audio track as kind=subtitles, instead of kind=captions, and may include additional relevant audio information. It is not best practice to use subtitles in this situation, since it may confuse users who are trying to find captions. But such a timed text track would meet the requirements of Success Criterion 1.2.2.


Example 1

A video element for a video in the English language with an English caption track. The captions are provided in the WebVTT format.

  <video poster="myvideo.png" controls>
    <source src="myvideo.mp4" srclang="en" type="video/mp4">
    <track src="myvideo_en.vtt" kind="captions" srclang="en" label="English">

See live example

Example 2

A video element for a video with both an English and French language source element, and with an English caption track and a French captions track. WebVTT (vtt) and Timed Text Markup Language (ttml) are different caption file formats.

  <video poster="myvideo.png" controls>
    <source src="myvideo.mp4" srclang="en" type="video/mp4">
    <source src="myvideo.webm" srclang="fr" type="video/webm">
    <track src="myvideo_en.vtt" kind="captions" srclang="en" label="English">
    <track src="myvideo_fr.ttml" kind="captions" srclang="fr" label="French">


Related Techniques



For the video element used to play a video:

  1. Check that the video contains a track element of kind captions in the language of the video.

Expected Results

  • Check that #1 is true