13:58:14 <RRSAgent> RRSAgent has joined #me
13:58:19 <RRSAgent> logging to https://www.w3.org/2026/04/07-me-irc
13:58:19 <Zakim> Zakim has joined #me
13:58:53 <cpn> Meeting: Media & Entertainment Interest Group
13:58:57 <cpn> Agenda: https://www.w3.org/events/meetings/f61979af-ab49-43a8-a35c-30b92c2e1571/20260407T150000/
14:00:19 <ohmata> ohmata has joined #me
14:00:41 <kaz> kaz has joined #me
14:01:44 <song> song has joined #me
14:02:08 <wschildbach> wschildbach has joined #me
14:02:13 <wschildbach> present+
14:02:17 <song> present+
14:02:43 <cpn> present+ Song Xu, Wolfgang Schildbach, Chris Needham, Bernd Czelhan, Kazuyuki Ashimua, Shunsuke Iwamura, Hisayuki Ohmata, Paul Adenot, Hiroki Endo, Niko Farber, Rob Smith
14:02:49 <kaz> rrsagent, make log public
14:02:53 <kaz> rrsagent, draft minutes
14:02:54 <RRSAgent> I have made the request to generate https://www.w3.org/2026/04/07-me-minutes.html kaz
14:03:01 <cpn> Chair: Song Xu, Chris Needham, Wolfgang Schildbach
14:03:01 <cpn> scribe+ cpn
14:03:45 <tidoust> present+ Francois Daoust
14:03:58 <song> walk through of the use case and req. document
14:04:08 <cpn> scribe+ song
14:04:11 <kaz> agenda: https://lists.w3.org/Archives/Public/public-web-and-tv/2026Apr/0003.html
14:04:54 <song> two topics: webtransport API entering final stage. looking for wide review feedback
14:05:27 <kaz> rrsagent, draft minutes
14:05:29 <RRSAgent> I have made the request to generate https://www.w3.org/2026/04/07-me-minutes.html kaz
14:05:35 <song> in web app., ppl. has experience to using it.
14:05:49 <song> sugeestions for changes or improvement is very welcome on webTransport
14:06:04 <song> Group set a deadline, end of this month.
14:06:13 <cpn> https://www.w3.org/TR/webtransport/
14:06:18 <kaz> present- song
14:06:32 <kaz> present- wschildbach
14:06:36 <song> Video delivery and streaming in combination with Web Codex for lower latency
14:06:44 <cpn> https://github.com/w3c/ColorWeb-CG/blob/main/hdr-big-picture.md
14:06:49 <kaz> present+ Atsushi Shimono
14:06:52 <Niko> Niko has joined #me
14:06:53 <kaz> rrsagent, draft minutes
14:06:54 <RRSAgent> I have made the request to generate https://www.w3.org/2026/04/07-me-minutes.html kaz
14:07:00 <cpn> https://github.com/SMPTE/st2094-50
14:07:09 <song> The spec there, follow the spec through to Github issues, ppl to send feedback.
14:07:22 <song> A couple od doc for review and feedback
14:07:41 <song> You're an HDR speciallist, doc interesting for you.
14:07:45 <RobSmith> RobSmith has joined #me
14:08:06 <song> Might need to be adapted to support HDR. including sort of canvas, and CSS
14:08:28 <song> processing model for it is going through simply spec.
14:08:50 <song> sort of proposed a necessarily unless anybody has sort of clarifying questions.
14:09:09 <song> For the color stuff, recommend joning thit on the web community
14:09:31 <song> leave a bit of time at the end if AOB
14:09:50 <cpn> Topic: Next Generation Audio
14:09:53 <cpn> scribe+ cpn
14:09:58 <song> Next Generation Audio. Wolfgang will lead it.
14:10:14 <cpn> Wolfgang: The document is a group draft note
14:10:15 <song> thanks
14:10:24 <cpn> ... I suggest opening the document as I talk you through it
14:10:34 <cpn> ... I have a presentation too that summarises the document
14:11:29 <cpn> ... https://w3c.github.io/me-next-generation-audio
14:11:48 <kaz> i|walk through|topic: Announcements|
14:11:59 <cpn> ... As some history, we decided in TPAC to make a Group Note
14:12:07 <cpn> ... Made a draft in December
14:12:29 <cpn> ... I'm hoping we can have a call for consensus
14:13:16 <cpn> ... The Note has use cases, requirements, gap analysis, and privacy considerations
14:13:30 <kaz> rrsagent, draft minutes
14:13:32 <RRSAgent> I have made the request to generate https://www.w3.org/2026/04/07-me-minutes.html kaz
14:13:46 <cpn> ... What use cases do we want to enable? Dolby and Fraunhofer have developed this together
14:13:55 <cpn> ... And what would an API need to provide?
14:14:26 <cpn> ... The requirements describes cross-cutting concerns, e.g., it should work for all codecs
14:14:46 <cpn> ... The gap analysis answers why we think existing APIs can't be used to support the use cases
14:15:03 <cpn> ... The privacy considerations is a collection of thoughts, regarding privacy implications
14:15:26 <cpn> ... Please interrupt to ask questions
14:16:14 <cpn> ... The first use case is selecting a preselection. Interacting with the gain or volume, e.g., to increase the dialog
14:16:42 <cpn> ... Related is position interactivity, where you could put the dialog in a position where it doesn't overlap other audio elements
14:17:14 <cpn> ... Then, selecting individual audio elements in the mix. e.g., musical instruments to apply gain to them
14:17:25 <cpn> ... Another is where all these elements are controlled in conjunction
14:17:42 <cpn> ... The document has more detail
14:18:21 <cpn> ... Requirements - the first is to be codec agnostic. We're not asking for an API that's specific to one company's codec
14:18:47 <cpn> ... It should work for protected media. If the API only works in non-protected use cases, we think it doesn't solve the commercially relevant use cases
14:19:14 <cpn> ... It should work where there are multiple media streams. At a minimum, you'll have audio and video. (By the way, some of these concepts could also apply to video)
14:19:39 <cpn> ... With multiple audio streams, the personalisation can apply to one of those, so apply to the media stream, not just the device
14:20:08 <cpn> ... Controls should happen in realtime. If a user selects a specific preselection, they want it to be active right away, no perceivable latency
14:20:52 <cpn> ... Non-blocking hardware access: what we mean is that users will interact with the media, as it plays, and the APIs should be asynchronous, and not require waiting until a presentation is done. It should be async to the media playback
14:21:15 <cpn> ... Gap Analysis. In meetings so far, we were asked to explain why existing web APIs can't be used
14:21:58 <cpn> ... There's HTMLMediaElement, WebCodecs, e.g., in conjunction with AudioNode. Or implement it all in WASM or JS?
14:22:55 <cpn> ... HTMLMediaElement has an audioTracks attribute. It could conceivably be used to select audio preselections. But doing this confuses tracks and preselections. Some subtle problems are described in the Note
14:23:15 <cpn> ... Some audio tracks have very few attributes, e.g., language and kind, not enough to select by the user
14:23:38 <cpn> ... Selection semantics may not work, selecting audio tracks is mutually exclusive and may not work for preselections
14:24:05 <cpn> ... Use WebCodecs and AudioNodes to mix and process the result? A limitation here is that WebCodecs output is in the clear, and AudioNode input is in the clear
14:24:46 <cpn> ... So would work for non-protected content. And no object audio support. If we want to support spatial and object audio, moving it in space, it would have to be built in. It's not available today.
14:25:23 <cpn> ... If the pipeline is built by the application developer, the content creator has no control of the end result. This is important to creatives to ensure their product is presented
14:26:07 <cpn> ... JS and WASM implementations have a performance concern. It might work on a PC, but not on a TV set. Battery life is related to performance limitations. Also this doesn't work with content protection
14:27:48 <cpn> ... Finally, the privacy considerations. It's hard to compartmentalise these. Everything comes down to fingerprinting mechanisms. Not all of these are really germane to this API, they apply to any new API. For example, if the API is supported on one platform but not others, that's fingerprinting surface. It might also provide information about the
14:27:48 <cpn> media the user consumes
14:28:27 <cpn> ... That might happen, through same-origin information leakage. If you open a media stream and query what's available, or what the default is, it might leak information about the user's default
14:28:48 <cpn> ... If the user sets preferences and those are shared between sessions, another session might query those set in the first session.
14:29:34 <cpn> ... This might happen implicitly, e.g., a smart implementation might pre-filter the personalisation options available against some preferences. So if you are able to get at the list of pre-filtered choices, it reveals something about what was filtered out
14:29:46 <cpn> ... These are considerations for implementers
14:30:03 <cpn> ... Data persistence, as preselection choices might persist beyond one session
14:30:32 <cpn> ... Any questions?
14:31:11 <cpn> (none)
14:31:16 <cpn> ... Can we do the CfC?
14:31:34 <cpn> Paul: Might need some wider exposure on the mailing list. So stakeholders can read through it
14:33:09 <RobSmith> Is there a relevant audio CG from which we could seek feedback?
14:33:41 <song> In essence, IG cannot publish spec. we could go through the same process as WG
14:35:26 <song> The mental model would be if a browser implementer looks at it, the doc expose enough info.
14:35:27 <kaz> q+
14:37:03 <cpn> Chris: Is the document comprehensive, in terms of which codecs it includes? We've had a liaison in Media WG on 3GPP
14:37:13 <cpn> s/3GPP/3GPP IVAS/
14:37:27 <song> in terms of the set of codecs that it's considering, the next generation audio codecs describing in the group note.  like 3GPP EVAS
14:37:54 <song> API approach would work across all of those codecs
14:38:07 <cpn> Chris: So we could reach out to those groups
14:39:31 <kaz> q+ RobSmith
14:39:37 <kaz> q- later
14:39:38 <cpn> Bernd: Could we make a list? We could put a first version out there?
14:40:59 <cpn> RobSmith: A suggestion, would putting together a demo be a good idea, to show how it works? It invites others to review how it fits their own model
14:42:20 <cpn> Wolfgang: We've made demos, e.g., one at a previous TPAC
14:43:00 <cpn> Bernd: There are standards around world using NGA with a certain toolset. We could do another demo, but I feel we don't make progress on the formal status of this note
14:44:08 <cpn> Wolfgang: We'll need support of browser vendors, so that will need demos. I don't think demos progress the Group Note
14:47:04 <kaz> q+ PaulAdenot
14:47:07 <kaz> ack R
14:47:10 <kaz> q- later
14:47:22 <tidoust> -> https://github.com/w3c/webcodecs/issues/41 Support for content protection in WebCodecs
14:48:57 <cpn> q?
14:49:20 <cpn> Paul: Worth reading the WebCodecs issue on protection in WebCodecs
14:49:44 <kaz> q+ RobSmith
14:49:48 <kaz> ack P
14:49:50 <kaz> q- later
14:50:23 <cpn> Rob: WebVMT was published as a Note 3 years ago, the process we used was to invite a 6 or 8 week deadline, and advertising it. I got some good feedback
14:51:22 <kaz> ack R
14:52:45 <tidoust> q+
14:54:29 <tidoust> q-
14:59:17 <tidoust> cpn: I think next step is getting stakeholders feedback
14:59:28 <tidoust> i/cpn: I think/scribe+ tidoust/
14:59:31 <tidoust> q+
14:59:41 <tidoust> ... and then do the formal publication later on.
15:01:08 <tidoust> wolfgang: I'll send this around and set a deadline for review. 4-6 weeks.
15:01:19 <tidoust> ... It's understood that we need to have more discussions with implementers.
15:02:53 <tidoust> q-
15:05:27 <cpn> Kaz: As Rob mentioned, there's a chicken and egg question. Asking Audio WG makes sense. The requirements in Section 4 is broader than Web Audio. So I suggest we ask the other W3C groups, like WoT and Voice Interaction for comments also. Voice Interaction guys organised a workshop on smart voice agents, and discussion included time synchronisation.
15:05:27 <cpn> Talking with them would make sense
15:05:46 <kaz> -> https://www.w3.org/2025/10/smartagents-workshop/report.html fyi, report from the Smart Voice Agents Workshohp
15:06:07 <kaz> s/The requirements/However, the requirements/
15:06:10 <cpn> Topic: Next meeting
15:07:22 <cpn> Chris: It's scheduled for 5 May, but I'll be away
15:08:07 <kaz> s/synchronisation/synchronisation among multiple data streams/
15:09:46 <kaz> q-
15:09:46 <cpn> [adjourned]
15:16:04 <cpn> rrsagent, draft minutes
15:16:06 <RRSAgent> I have made the request to generate https://www.w3.org/2026/04/07-me-minutes.html cpn
18:05:44 <cabanier> cabanier has joined #me
18:28:12 <Zakim> Zakim has left #me