28235 – [WebVTT] Ignore metadata tracks for computed line position

This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 28235 - [WebVTT] Ignore metadata tracks for computed line position

Summary: [WebVTT] Ignore metadata tracks for computed line position

Status:	NEW

Alias:	None

Product:	TextTracks CG
Classification:	Unclassified
Component:	WebVTT (show other bugs)
Version:	unspecified
Hardware:	PC All

Importance:	P2 normal
Target Milestone:	---
Assignee:	This bug has no owner yet - up for the taking
QA Contact:	Web Media Text Tracks CG

URL:	http://dev.w3.org/html5/webvtt/
Whiteboard:
Keywords:

Depends on:
Blocks:

Reported:	2015-03-19 03:12 UTC by Philip Jägenstedt
Modified:	2015-04-19 03:30 UTC (History)
CC List:	2 users (show)

See Also:

Attachments

Description Philip Jägenstedt 2015-03-19 03:12:17 UTC

"Let n be the number of text tracks whose text track mode is showing and that are in the media element's list of text tracks before track."

This doesn't exclude metadata tracks, so if there are lots of metadata tracks and one captions track, that captions track could get a computed line position placing it where it makes no sense.

Comment 1 Philip Jägenstedt 2015-03-19 03:14:39 UTC

I filed this bug because I noticed that Blink only considers tracks which are showing *and* are either subtitles or captions, which is not per spec.

Actually, this is a problem if there are many captions track but only one has an active cue, too.

I wonder why this shouldn't simply return -1, the overlap avoidance will take care of the rest.

Comment 2 Silvia Pfeiffer 2015-04-19 03:30:59 UTC

Metadata tracks are definitely never rendered, so I agree that they should be excluded, no matter if they are showing or not.

The HTML spec has some text towards this interpretation where it defines "Showing":

"Showing - Indicates that the text track is active. [..] In addition, for text tracks whose kind is subtitles or captions, the cues are being overlaid on the video as appropriate; for text tracks whose kind is descriptions, the user agent is making the cues available to the user in a non-visual fashion; and for text tracks whose kind is chapters, the user agent is making available to the user a mechanism by which the user can navigate to any point in the media resource by selecting a cue."

It might be a good idea to more explicitly follow this in the WebVTT spec, too, and exclude metadata, descriptions and chapter tracks from this *visual* rendering algorithm.