14:54:54 RRSAgent has joined #me 14:54:54 logging to https://www.w3.org/2022/08/15-me-irc 14:54:58 Zakim has joined #me 15:00:04 Meeting: Media Timed Events TF 15:00:16 Present: Chris_Needham, Yuhao_Fu 15:00:23 Chair: Chris_Needham 15:00:29 scribe: cpn 15:00:43 present+ Kazuyuki_Ashimura 15:00:54 xfq has joined #me 15:01:19 present+ Fuqiao_Xue 15:03:29 Topic: Introduction 15:03:42 ChrisN: Main topic is preparations for upcoming TPAC meeting 15:04:31 ... Agenda for Media WG and MEIG is in planning, we could include an update on the work here 15:05:19 Topic: TPAC plans 15:05:20 RobSmith has joined #me 15:05:32 ChrisN: I've started preparing some slides 15:06:03 Yuhao has joined #me 15:06:08 ... We have open questions, particularly about SEI events 15:06:29 ... It's not clear that DataCue is the right solution in all cases 15:06:39 agenda: https://lists.w3.org/Archives/Public/public-web-and-tv/2022Aug/0002.html 15:07:11 Could someone post a link to join the meeting please. 15:08:33 ... For example, if we have metadata on every video frame, then DataCue not the right solution 15:09:07 ... Need something more like the proposal from Yuhao 15:09:51 Yuhao: Do we need additional description for this scenario? 15:10:09 ChrisN: I think so, yes. Is that a use case you are interested in? 15:11:43 Yuhao: Yes, sometimes, but I don't think it's a good case. If video is 60FPS, each frame takes 16ms to trigger the metadata, frequency is a problem 15:12:20 q+ 15:13:35 Kaz: Wondering about TPAC agenda 15:14:11 ... Should we concentrate on SEI event and clarify use cases, we can look into some expected template for that description 15:14:33 ... For device and network settings, expected behaviour and pain points, e.g., time resolution 15:14:50 ack k 15:14:55 present+ Rob_Smith 15:15:18 Thanks for the link 15:15:21 ChrisN: We do need a clear written description for what we propose for SEI event, linked to application use cases 15:17:34 ChrisN: For TPAC, what do we want to say? What do we want input on, what do we want from browser vendors, etc? 15:18:03 ... There's interest here in adding support for SEI events 15:18:31 ... We've discussed, separately from these meetings, about possible alignment with DataCue proposal 15:18:46 ... But it's not clear DataCue is the right solution 15:28:25 ChrisN: [talks through draft slides] 15:29:08 ... Overall, I think we can separate the DataCue API part from the in-band event surfacing parts 15:29:36 ... There's interest from DASH-IF on MSE and DASH emsg events and MPEG timed metadata tracks 15:30:04 ... But that needs further exploration, and that may depend on how much implementer interest this has 15:30:52 ... Do we do more analysis work on figuring out the MSE implementation algorithms? Or do we seek indications of level of interest before going ahead with that? 15:31:14 ... MSE issue: https://github.com/w3c/media-source/issues/189 15:31:49 ... Question about the SEI events, and what do we want to report there? 15:32:34 ... Needs its own consideration, may be different solution proposals than DataCue? 15:32:41 ... https://github.com/leonardoFu/video-sei-event/blob/main/explainer.md 15:33:26 ... I need your help to drive that forward 15:34:46 ... Link to draft slides: https://docs.google.com/presentation/d/1Xy6nb2RIxMCdbiO6rxRYbmjFug1cvESjWdVdXhiujkY/edit 15:35:05 ... What additional information to include for SEI event support? 15:36:14 ... Options: We could have a fully developed proposal, go into the detail of the problem space and requirements 15:36:57 ... Or we can mention briefly but say that this is ongoing work and we would develop a proposal in the future 15:37:23 ... Or we present the existing proposal 15:37:28 q+ 15:38:25 RobSmith: So I think that's a good approach. The two parts, the API (accessing the data), and in-band handling (whichi is processing the data) 15:40:07 ChrisN: The browser would only parse to extract the events, then present them to the JS layer 15:40:18 ... Otherwise the browser would need to understand the data format in the in-band event 15:40:32 RobSmith: There's a type field 15:40:45 ... So it can present the bytes and the type? 15:41:10 s/whichi is/which is/ 15:41:32 ChrisN: Yes. In the emsg case, the type field could say "this is a DASH emsg" event, and here are the bytes. 15:41:42 q+ 15:41:46 ack r 15:41:55 ... But within the emg there's its own type infomation 15:42:13 RobSmith: Nested type information 15:42:37 ... But if it's exposed as the top-level type, with the data, it's useful, and as a first step to explore how to process further 15:43:24 ... If we want the UA to do something as a result, could handle in some automatic way. It would allow developers and browsers to explore further on what's needed 15:44:01 ... On VTTCue vs DataCue, it'll work, but there's at least two overheads: stringifying data, and the unnecessary rendering attributes 15:44:57 ChrisN: There are different kinds of tracks, metadata. Does using a metadata track prevent rendering of a VTTCue? 15:45:47 RobSmith: If you have a VTTCue on a metadata track, it's still valid. Does it see the text and apply region and other rendering attributes? It could be an overhead 15:45:56 ... DataCue is a cut down version of VTTCue 15:47:37 ... Another thought, if we expose data through the API, we'd quickly find out the response issues of processing overhead of VTTCues and their attributes (processing time, memory, latency) 15:48:36 ChrisN: That would be part of the further exploration, e.g., around MSE integration 15:49:07 RobSmith: May help gauge interest and use cases 15:50:09 Kaz: I think the two-step proposal might be OK. We might want to check the existing possible solutions and workarounds used by vendors and broadcasters, as basis for further discussion 15:50:25 ... What has been done to cope with the existing problems? 15:51:13 ChrisN: Many media players have this 15:51:50 ... Ready to present progress? 15:52:28 q+ 15:52:32 ack k 15:54:30 RobSmith: Ready to present a first draft, we're fairly happy that that's well designed. That would be a talking point for feedback on the next stage 15:54:54 ack r 15:55:40 ChrisN: I agree, the second stage, MSE integration for DASH emsg, is a lot of work. It seems dependent on whether browser vendors are interested, in principle, in a solution. Then, it's dependent on the group coming up with a proposed solution 15:55:51 ... I see it similarly with SEI events 15:56:28 ... Potential interest in WebCodecs 15:57:10 ... But that's only a small part of a solution. More generally, it's also MSE playback but also HLS playback on devices that don't support MSE 15:57:44 q+ 15:57:47 ... I don't think there's time to document it all ahead of TPAC 15:58:20 Kaz: I'm ok with starting with the initial draft proposal, and have more discussion at TPAC 15:59:19 ChrisN: I was thinking of presenting this at the Media WG meeting. We may only have 30 minutes, not sure we'll have a full hour 15:59:39 ... WG agenda should be finalised during this week 16:00:16 ... ChrisN: At this stage, I'd like to include the key points for SEI events in the presentation 16:00:25 s/... ChrisN/ChrisN/ 16:00:38 Yuhao: What's the most important thing to explain? 16:01:46 ChrisN: We currently have two possible proposals. One is to use DataCue. The other is to define an event listener on the