14:56:49 RRSAgent has joined #me
14:56:49 logging to https://www.w3.org/2021/10/11-me-irc
14:56:53 Zakim has joined #me
14:57:10 Meeting: Media Timed Events: WebVTT unbounded cues
15:03:20 calvaris has joined #me
15:03:54 ChrisLorenzo has joined #me
15:04:19 zacharycava has joined #me
15:05:41 present+ Chris_Needham, Chris_Lorenzo, Zachary_Cava, Xabier_Rodriguez_Calvar, Rob_Smith
15:05:45 Chair: Chris_Needham
15:06:02 Topic: Agenda
15:07:53 ChrisN: We covered unbounded cues in segmented media last time
15:07:55 ... Are there features or syntax changes in WebVMT that should be added to WebVTT?
15:08:30 ... Discuss whether we need to identify cues across WebVTT documents, and if so, how and where to specify it
15:09:06 ... Anything more general on DataCue?
15:09:22 Topic: WebVMT features into WebVTT
15:09:48 Rob: This ties in to what's happening with DataCue. One feature worth porting across is aligning metadata between VTT and DataCue
15:10:16 ... The metadata is JSON and it's amorphous, with no formatting. A small amount of restriction on that would make it more useful and interoperable
15:11:10 ChrisN: In terms of other features, I'm not sure what Gary had in mind
15:11:24 Zack: How far along is WebVMT?
15:12:03 Rob: It's not on the standards track. The dash-cam market has use cases that would benefit from it, and currently uses proprietary formats
15:12:24 ... A way to export the data allows continued use of proprietary formats, but users can export to a common format to share data
15:12:37 Larry_Zhao has joined #me
15:13:41 ChrisN: Let's look at the relation to the DataCue API
15:14:43 Rob: DataCue has a 'type' field and content in a 'value' field, which can be anything
15:15:18 ... That's too open-ended. In the WebVMT case it's just a WebVTT document, so the metadata is only JSON
15:16:00 ... The simple addition of a 'type', such as a URN, allows you to recognise what it is
15:17:12 ChrisN: There are examples in the explainer: https://github.com/WICG/datacue/blob/main/explainer.md such as org.id3
15:20:00 ... What's the scope of the type field? Are the types globally defined, or do we have types specific to HLS defined in one place, WebVMT types defined elsewhere?
15:20:21 Rob: I'm thinking of an IANA type registration, to stop different uses stepping on each other's toes
15:21:53 Zack: The HLS spec includes the date range type. Prior to that, Disney and Hulu implemented their own and used com.hulu
15:22:06 ... The same functionality as the Apple-defined type, but non-conflicting
15:22:37 ... When things get more standardised and more adopted, the ability to change the name is helpful
15:22:42 ... Having a URN makes it flexible
15:22:54 present+ Larry_Zhao
15:24:03 Rob: When to do this? In the dash-cam market there'll be different variants, so having an open format is beneficial. There may be commonality, which may lead to something like MIME types being set up
15:26:05 ChrisN: Is it enough to say this cue is WebVMT data, or do you need to be more specific?
15:26:32 calvaris has joined #me
15:27:15 Rob: We don't really have WebVMT data. It started with location, but now it can be anything: speed, direction, acceleration. With drones, there's altitude, camera orientation, sensors
15:27:32 ... So the data is really sensor data
15:28:42 ChrisN: Thinking of use cases. In HLS, cues are surfaced by the browser to the web app
15:29:05 ... That's the in-band timed metadata cues, and the same applies to emsg boxes in DASH
15:29:39 ... The other case is where the web app creates cues after reading a WebVMT document
15:30:39 ... Where are the interop points?
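[Scribe note: for illustration only, a minimal sketch of the second case above, where a web app creates and consumes a typed metadata cue. It assumes the value/type constructor shape shown in the WICG DataCue explainer (not a shipping standard); the type string 'org.example.webvmt.sensor' and the payload fields are hypothetical.]

```ts
// DataCue is not yet standardised; this declaration mirrors the shape in the
// WICG explainer (startTime, endTime, value, type) for TypeScript's benefit.
declare class DataCue extends TextTrackCue {
  constructor(startTime: number, endTime: number, value: unknown, type?: string);
  readonly value: unknown;
  readonly type: string;
}

const video = document.querySelector('video')!;
// A separate 'metadata' track keeps these cues off the caption rendering path.
const track = video.addTextTrack('metadata', 'timed metadata');

// A URN-style 'type' (like 'org.id3' in the explainer) lets the app recognise
// the payload without parsing 'value'. The type and payload here are made up.
const cue = new DataCue(
  10.0,
  20.0,
  { lat: 51.5072, lng: -0.1276, speed: 4.2 },
  'org.example.webvmt.sensor'
);
track.addCue(cue);

// Consumers can then filter by type rather than inspecting the payload.
track.oncuechange = () => {
  for (const active of Array.from(track.activeCues ?? [])) {
    const dataCue = active as DataCue;
    if (dataCue.type === 'org.example.webvmt.sensor') {
      console.log('sensor sample', dataCue.value);
    }
  }
};
```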
15:31:24 Rob: WebVMT was designed for devices without connectivity, operating autonomously, recording data such as temperature with minimal overhead
15:31:53 ... Being able to record in a common format allows others to recognise what it is without converting
15:32:57 ChrisN: We can make setting up a registry for data types part of the proposal
15:34:32 ... Although as a developer I should be able to put any arbitrary data in a DataCue without registering
15:36:03 Zack: SCTE-35 has schemes in the spec, and people use it without registering their scheme. If people use URNs, you'd rarely get conflicts
15:36:58 ... Having a spec with optional registration encourages private adoption and proprietary uses. If they later need external interop, they'll be able to register at that time
15:38:16 ChrisN: Thinking about DataCue more generally, we've focused a lot on defining the emsg mapping, without really resolving it
15:39:08 ... We could perhaps usefully split DataCue into two parts: the API as it is in WebKit, and the mapping to emsg
15:39:27 ... If there's still interest in emsg, I'd like to come back to that
15:39:49 ... I'm unclear on the extent of interop there
15:41:49 ChrisN: If we end up with app-level emsg parsing, we'd still need a DataCue to put the data on the timeline. The current solution is using VTTCue
15:43:39 ... We still need a way to distinguish VTT caption cues from metadata cues
15:43:58 Rob: You can use different TextTracks, one for captions and one for metadata
15:44:40 ... If you have a metadata track, all the JSON objects would have top-level 'type' and 'data' objects, which would allow DataCues to be created
15:45:34 ... It would be easy to inspect the list of cues to extract the ones you're interested in
15:45:51 ChrisN: So the presence of the 'type' field is in itself helpful, and it isn't available with VTTCue
15:46:10 ... We can write this into the explainer
15:46:50 Rob: With the current VTTCue workaround, it's a more complex object than a DataCue, as it has presentation properties we don't need
15:49:41 ChrisN: The mapping of emsg box type information is complex, and depends on the DASH-IF event interop work
15:50:28 ... We need to summarise the open questions
15:50:45 Topic: Identifying cues across WebVTT documents
15:51:24 ChrisN: The context is segmented delivery. Each segment would have its own VTT document, specific to that section
15:56:31 ... Do we need cue identifiers, or identifiers for higher-level concepts?
15:56:42 ... e.g. { chapter: 1 } with consistent cue ids across VTT documents
15:57:01 ... or { id: 'some-id', chapter: 1 }
15:57:34 Rob: Do you need an identifier at the type level?
15:58:00 Zack: The answer could be driven by the need for de-duplication. I'd expect user agents to do the collapsing
15:58:41 ... Up-levelling anything needed to enable de-duplication is important, so pulling the id and type out, to bring them together
15:59:20 ... In DASH, you have event id, start time, and payload. The id allows de-duplication across period boundaries without parsing the data itself
16:00:07 Rob: So if the id is at the same level as the type, do we get the same issue, that cue ids should be unique within a document but not between documents?
16:01:05 Zack: From a media perspective, I'd expect it to be unique within the track it's operating in. There could be multiple documents making up the track
16:01:27 ... It's not an unheard-of property. It happens for audio and video, where track descriptions are shared across segments
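[Scribe note: a sketch, for illustration only, of the VTTCue workaround and id-based de-duplication discussed above, assuming JSON cue text with top-level 'type' and 'data' fields. The helper name, the type 'org.example.chapter', and the payload mirror the { id: 'some-id', chapter: 1 } example and are hypothetical.]

```ts
// Tracks created via addTextTrack() default to mode 'hidden', so their cue
// list is populated without anything being rendered.
const video = document.querySelector('video')!;
const metadataTrack = video.addTextTrack('metadata', 'app metadata');

function addMetadataCue(
  id: string,
  start: number,
  end: number,
  type: string,
  data: unknown
): void {
  // De-duplicate by cue id, e.g. when the same cue reappears in the VTT
  // document delivered with each media segment.
  if (metadataTrack.cues?.getCueById(id)) {
    return;
  }
  // Carry a top-level 'type' and 'data' in the cue text, so the app can later
  // recognise and unpack the payload (or build DataCues from it).
  const cue = new VTTCue(start, end, JSON.stringify({ type, data }));
  cue.id = id;
  metadataTrack.addCue(cue);
}

addMetadataCue('chapter-1', 0, 600, 'org.example.chapter', { chapter: 1 });
addMetadataCue('chapter-1', 0, 600, 'org.example.chapter', { chapter: 1 }); // ignored: same id
```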
16:05:41 zacharycava has left #me
16:06:31 ChrisN: Next steps: follow up with Gary on WebVTT cue ids, revisit the emsg mapping proposal, and talk with DASH-IF about emsg cue id scope
16:06:35 [adjourned]
16:06:39 rrsagent, draft minutes
16:06:39 I have made the request to generate https://www.w3.org/2021/10/11-me-minutes.html cpn
16:06:44 rrsagent, make log public