14:57:24 RRSAgent has joined #webmachinelearning
14:57:29 logging to https://www.w3.org/2024/12/19-webmachinelearning-irc
14:57:29 RRSAgent, make logs Public
14:57:30 please title this meeting ("meeting: ..."), anssik
14:57:30 Meeting: WebML WG Teleconference – 19 December 2024
14:57:35 Chair: Anssi
14:57:44 Agenda: https://github.com/webmachinelearning/meetings/blob/main/telcons/2024-12-19-wg-agenda.md
14:57:44 Scribe: Anssi
14:57:49 scribeNick: anssik
14:57:58 Present+ Anssi_Kostiainen
14:58:04 RRSAgent, draft minutes
14:58:05 I have made the request to generate https://www.w3.org/2024/12/19-webmachinelearning-minutes.html anssik
14:58:52 Present+ Zoltan_Kis
15:00:15 zkis has joined #webmachinelearning
15:00:16 ningxin has joined #webmachinelearning
15:00:29 Present+ Dwayne_Robinson
15:00:52 Present+ Christian_Liebel
15:01:00 Present+ Ningxin_Hu
15:01:28 Present+ Yuichiro_Tachibana
15:01:32 Present+ Dominique_Hazael-Massieux
15:01:36 christianliebel has joined #webmachinelearning
15:01:38 jsbell has joined #webmachinelearning
15:01:45 Present+ Joshua_Bell
15:01:50 present+
15:02:14 McCool has joined #webmachinelearning
15:02:29 anssik: Welcome to our new participants Yang Gu, Jiajia Qin and Shaobo Yan from Microsoft
15:02:46 ... also a warm welcome to Yuichiro Tachibana from Hugging Face joining as a guest!
15:04:05 Yuichiro: I met W3C members at a conference discussing Transformers.js, thanks for inviting me, interested in client-side ML execution
15:04:54 Topic: WebML Working Group Charter update
15:05:04 gb, this is w3c/machine-learning-charter
15:05:05 anssik, OK.
15:05:14 anssik: the Working Group charter is up for renewal in 2025
15:05:38 ... today we'll review the current charter, discuss and solicit input on proposed changes, triage open charter issues, and confirm the timeline
15:05:49 anssik: the key principle for WG rechartering is "if it ain't broken, don't change it"
15:05:56 ... this concerns scope expansion specifically
15:06:08 ... if the scope expands, existing participants need to re-join
15:06:40 Subtopic: Current WG Charter
15:06:45 -> WG Charter 2023-2024 https://www.w3.org/2023/04/web-machine-learning-charter.html
15:07:22 anssik: Motivation and Background is informational
15:07:29 RafaelCintron has joined #webmachinelearning
15:07:56 ... Scope is an important section and is future-proofed with "well-known model architectures" and "major platform APIs"
15:08:12 DwayneR has joined #webmachinelearning
15:08:32 Yuichi has joined #webmachinelearning
15:08:35 ... the Out of Scope section includes training, hardware features and hardware algorithms
15:09:03 ... Deliverables are the WebNN API, the Model Loader API as tentative, and the Ethical Principles WG Note
15:09:29 ... Success Criteria is the standard one: two independent interoperable implementations
15:10:00 ... Coordination enumerates groups we usually work with, but does not prevent us from working with other groups or projects outside this list
15:10:08 ... the rest is standard charter boilerplate
15:10:54 Dom: good overview Anssi, Scope and Deliverables are the most important
15:11:12 ... think there's no need to change Scope or add new Deliverables
15:11:38 anssik: Dom submitted PR #39 to align with the latest charter template, thanks!
15:11:39 https://github.com/w3c/machine-learning-charter/pull/39 -> Pull Request 39 Align with latest charter template (by dontcallmedom)
15:11:45 -> Diff https://pr-preview.s3.amazonaws.com/w3c/machine-learning-charter/39/dd0dbc1...429a03a.html
15:12:01 q+
15:12:03 ... any questions or comments about the current charter or Dom's proposed changes?
15:12:08 ack jsbell
15:12:24 anssik: next we'll look at open charter issues
15:12:27 -> WG Charter open issues https://github.com/w3c/machine-learning-charter/issues
15:12:37 Subtopic: Model Loader API, keep as tentative or remove from scope?
15:12:41 anssik: issue #38
15:12:42 https://github.com/w3c/machine-learning-charter/issues/38 -> Issue 38 Model Loader API, keep as tentative or remove from scope? (by anssiko)
15:12:53 ... Model Loader API incubation has been on a pause since Feb 2023 and its known implementation was removed from Chromium after the experimentation phase
15:13:07 ... unless there's interest, I'd propose to remove this from the WG Charter and revive this work in the WebML CG as appropriate
15:13:52 q?
15:14:15 jsbell: not currently being pursued, supportive of removing
15:14:44 Subtopic: Core operator set, scope and coordination
15:14:50 anssik: issue #37
15:14:51 https://github.com/w3c/machine-learning-charter/issues/37 -> Issue 37 Core operator set, scope and coordination (by anssiko)
15:15:10 ... the current WG charter scope section is abstract enough to not warrant a revision to allow work on the core op set to happen
15:15:28 ... we could update the informative list of major platform APIs if there are changes:
15:15:35 "The APIs in scope of this group are not tied to any particular platform and are implementable on top of existing major platform APIs, such as Android Neural Networks API, Windows DirectML, and macOS/iOS Metal Performance Shaders and Basic Neural Network Subroutines."
15:15:54 ... this is an open-ended list; not being on this list does not exclude any platform APIs
15:16:23 anssik: related, in Coordination we note StableHLO, while we also consider MLIR Linalg, PyTorch Prims IR, and TOSA for our compositional fundamentals research
15:16:44 ... we probably don't want to enumerate all possible projects (MLIR, PyTorch, TOSA), so perhaps it is more balanced to remove the StableHLO reference?
15:16:55 q?
15:17:50 Dom: being open-ended and informative, the only thing is this can be useful as a reminder of which communities to seek wide review from
15:18:12 q+
15:18:16 ... perhaps under that lens, consider removing
15:18:30 Christian: seems good to remove, that one seems quite specific
15:18:31 ack jsbell
15:18:52 jsbell: we normally list WHATWG in charters, most specs have dependencies on WebIDL, Infra and other specs and standards
15:19:02 Dom: we usually do that when there's a specific requirement
15:19:29 ... e.g. new types required for WebIDL; any browser WG has a WHATWG relationship, would not put that as a hard requirement, can put it in if wanted
15:19:31 q?
15:20:06 q?
15:20:17 Subtopic: Task-based APIs and Prompt API
15:20:20 anssik: issue #36
15:20:21 https://github.com/w3c/machine-learning-charter/issues/36 -> Issue 36 Task-based APIs and Prompt API (by anssiko)
15:20:24 ... Task-based APIs and Prompt API were adopted into the WebML Community Group earlier this month
15:20:29 ... these APIs are now incubated in the WebML CG
15:20:46 ... the proposal is to check for readiness for WebML WG adoption from time to time, considering implementation experience and end user feedback
15:21:29 Dom: we could list them as tentative if we feel they are a likely target for adoption; the concern is this space is very active with interesting IP questions
15:21:48 ... bringing them into the charter might create friction in the AC review and require everyone to re-join
15:22:07 ... the proposal is to wait and see the traction
15:22:08 q?
15:22:37 q+
15:22:46 Christian: personally, would support adding the APIs, but understand and appreciate Dom's perspective
15:23:12 ... compared to the Model Loader API these APIs seem compatible
15:23:58 Dom: we are free to charter again 6 months from now when the incubations have made more progress
15:24:26 ... Model Loader API has a different IPR scope from the task-based APIs
15:24:49 q+
15:24:55 q-
15:25:11 jsbell: given what Dom said, it has a very specific API called out; maybe 6 months from now we would be in a better position to see what would make sense to capture as Tentative then
15:25:48 q?
15:26:22 Dom: as part of our communication to the AC, I will make sure we mention these APIs have been adopted in the CG and are under consideration for a future WG charter revision
15:26:36 q?
15:26:39 ack christianliebel
15:27:22 christianliebel: what Dom says makes sense; having a publicly visible commitment that we are looking at these new APIs, it makes sense to not add them to the WG charter right now, to avoid friction
15:27:23 q?
15:27:32 Subtopic: Speech Synthesis
15:27:36 anssik: issue #31
15:27:37 https://github.com/w3c/machine-learning-charter/issues/31 -> Issue 31 Speech synthesis and machine learning (by r12a) [deferred]
15:27:48 ... suggestion to mention Speech Synthesis for symmetry with Speech Recognition in the informative Motivation and Background
15:27:52 ... I propose we update the informative text e.g. as follows:
15:28:03 "Speech Recognition and Speech Synthesis enable computers to recognize and translate spoken language into text and vice versa."
15:28:08 q?
15:28:17 Subtopic: On-device training
15:28:27 anssik: no feedback that'd suggest we should bring on-device training in scope for the WG
15:28:47 ... issue #27
15:28:47 https://github.com/w3c/machine-learning-charter/issues/27 -> Issue 27 On-device training (by anssiko) [deferred]
15:29:14 Subtopic: Charter development timeline
15:29:22 ... I'll prepare an updated charter with Dom early in the new year for your review
15:29:51 ... then complete horizontal review; around mid-Feb we will initiate the AC review
15:29:57 ... the new charter start date would be 2025-05-01
15:30:34 q?
15:31:06 Topic: Device selection abstractions update
15:31:09 gb, this is webmachinelearning/webnn
15:31:09 anssik, OK.
15:31:15 anssik: PR #784
15:31:16 https://github.com/webmachinelearning/webnn/pull/784 -> Pull Request 784 Add device selection explainer (by zolkis)
15:31:44 anssik: Zoltan has updated the explainer proposal: added new use cases and known implementation limitations, and added an MVP solution to remove the explicit deviceType to make contexts device agnostic and allow multiple devices per context
15:31:55 -> Updated explainer (preview) https://github.com/webmachinelearning/webnn/blob/f45266fb1223988894b0ccad7701c41aa753c5f1/device-selection-explainer.md
15:32:41 Zoltan: folks will need some time to digest the space; we have documented the intro, history, key use cases and requirements, and considered alternatives
15:32:54 ... recently added a Minimum Viable Solution for your review
15:33:20 ... one pain point is we were tied to one device per context, while some platforms can execute on multiple devices
15:33:42 ... also our device selection mechanism did not map well to platform APIs
15:33:52 ... the proposal is to:
15:33:54 ... - Remove MLDeviceType as an explicit context option.
15:34:20 ... - Update MLContext so that it becomes a device-agnostic, or default/generic, context. Allow supporting multiple devices with one context.
15:34:45 ... - Add notes to implementations on how to map power preference to devices.
15:35:11 ... - Improve the device selection hints in context options and define their implementation mappings.
15:35:21 ... - Check if requesting a certain device type or combination of devices is still a use case.
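To illustrate the proposed MVP shape, a minimal sketch, assuming the explainer's direction is adopted (the current spec still accepts an explicit deviceType):

```js
// Sketch of the device selection explainer's proposed MVP, not current
// spec behavior: no MLDeviceType in the options, so the resulting
// context is device agnostic.
const context = await navigator.ml.createContext({
  // powerPreference is an existing MLContextOptions hint; under the
  // proposal the implementation maps it to one or more devices.
  powerPreference: 'low-power',
});
// A graph built on this context may then run on CPU, GPU, NPU, or a
// combination of devices chosen by the implementation.
```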
15:36:15 ... please review and chime in on the PR if this direction has issues, otherwise I will proceed with a spec PR per this design
15:36:50 q+
15:37:03 ack jsbell
15:37:18 jsbell: the explainer looks great, support getting the explainer merged
15:37:27 q+
15:37:31 ... where to capture feedback to support the explainer, is there an issue?
15:37:59 ... the Google team is on vacation for the next few weeks
15:38:14 Zoltan: I can wait over the holiday for feedback
15:39:02 q?
15:39:49 ack RafaelCintron
15:40:03 RafaelCintron: thanks for putting this together, couple of questions about the explainer
15:40:16 ... at the end it says "allow supporting multiple devices in one context"
15:40:39 ... does "other devices" mean a new type?
15:41:00 Zoltan: this would be abstracted away; defining a context as a combination of devices should also be possible
15:41:12 Rafael: low-latency, what kind of device would be low latency?
15:41:15 +1 that I had the same question as Rafael at first, so clarifying the text would be great.
15:41:47 Zoltan: for low latency, if you want to optimize e.g. LLM throughput, tell that to the implementation and it will do its best to satisfy that; consider it a hint
15:42:50 ... just a suggestion on which direction to extend the context options; I found a low-latency hint in OpenVINO, we need to validate the use cases and craft hints based on that
15:43:24 https://blog.openvino.ai/blog-posts/automatic-device-selection-and-configuration
15:43:53 Rafael: high-performance and low-power I know how to deal with, they exist on Windows; low-latency I'm not familiar with, CPU could be low-latency
15:44:27 Zoltan: I'd spec these as hints: if they cannot be fulfilled, they're not errors
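To make the "hints, not errors" semantics concrete, a minimal sketch assuming the explainer's proposed behavior (a low-latency hint is only an idea from the discussion above, not a specified option):

```js
// Under the proposed semantics, hints are best effort: createContext()
// resolves even when the platform cannot honor the preference, rather
// than rejecting because of an unsatisfiable hint.
const context = await navigator.ml.createContext({
  powerPreference: 'high-performance',
  // A hypothetical future low-latency hint would behave the same way:
  // ignored when it cannot be fulfilled, never an error.
});
```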
15:45:30 Zoltan: PR #784 would be the perfect place for feedback
15:45:31 https://github.com/webmachinelearning/webnn/pull/784 -> Pull Request 784 Add device selection explainer (by zolkis)
15:45:36 +1 to provide feedback on the PR
15:46:22 q?
15:46:44 Topic: WebNN Operator Update Wave 3
15:47:09 anssik: Dwayne has a WIP PR for op set Wave 3, thanks Dwayne!
15:47:29 ... it's OK to push the WIP PR to the upstream repo and mark it as a Draft
15:47:39 ... this enables PR Preview and CI checkers that can be helpful for development
15:47:45 ... and folks can help contribute
15:48:39 Dwayne: Austin has a few questions on uint4 on CoreML, does not block the spec PR
15:49:04 Topic: Core op set: MLIR Linalg findings revisited
15:49:15 anssik: we discussed Dwayne's extensive mapping table a month ago, Linalg specifically
15:49:18 -> Mapping table https://onedrive.live.com/view.aspx?resid=EE82F5C6F06C7371%21345450&authkey=!AK8f-RDTleqlLXE
15:49:38 ... I wanted to bump this topic to get feedback on the 6 primitive ops proposed for inclusion into the WebNN API informed by this research, they are:
15:50:43 -> 1-D convolution with no channels https://mlir.llvm.org/docs/Dialects/Linalg/#linalgconv_1d-linalgconv1dop
15:50:46 -> 3-D convolution with no channels conv_3d https://mlir.llvm.org/docs/Dialects/Linalg/#linalgconv_3d-linalgconv3dop
15:50:49 -> Fill output with random numbers fill_rng_2d https://mlir.llvm.org/docs/Dialects/Linalg/#linalgfill_rng_2d-linalgfillrng2dop
15:50:53 -> Sum pooling pooling_nchw_sum https://mlir.llvm.org/docs/Dialects/Linalg/#linalgpooling_nchw_sum-linalgpoolingnchwsumop
15:50:56 -> Min pooling pooling_nhwc_min https://mlir.llvm.org/docs/Dialects/Linalg/#linalgpooling_nhwc_min-linalgpoolingnhwcminop
15:50:59 -> Round(x) elementwise round https://mlir.llvm.org/docs/Dialects/Linalg/#linalground-linalgroundop
15:51:27 Dwayne: this set of 6 is implementable across backends
15:51:43 ... there's no direct mapping for all of them, but most are directly implementable
15:52:06 ... I can fill in the table with all the ops, take the subset of three backends Chromium uses and show how they map to these
15:53:03 jsbell: following the usual process: if implementable across backends, with use cases, sounds good
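As a concrete example of the compositional angle, a rough sketch of how one of these primitives, sum pooling, can be emulated with existing WebNN ops today (assuming an MLGraphBuilder `builder` and a float32 NCHW operand `x`; window sizes are illustrative and padding is ignored):

```js
// Emulate sum pooling (cf. linalg.pooling_nchw_sum) by composing
// existing WebNN ops: a window's sum equals its mean times the number
// of elements in the window.
const windowH = 2, windowW = 2;
const mean = builder.averagePool2d(x, {
  windowDimensions: [windowH, windowW],
});
// Scalar constant holding the window element count.
const count = builder.constant(
  { dataType: 'float32', shape: [1] },
  new Float32Array([windowH * windowW])
);
const sumPool = builder.mul(mean, count); // mean * count == sum
```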
15:54:00 Topic: Get to know Task-specific APIs and Prompt API
15:54:04 -> Incubations landing page https://webmachinelearning.github.io/incubations/
15:54:49 anssik: the companion WebML Community Group now incubates selected task-specific APIs to enable reuse of the built-in models that are distributed as part of the browser or the underlying software platform
15:55:03 q+
15:55:07 ... as an observation, it seems the Prompt API, as a more general-purpose and flexible API, has received the most feedback
15:55:20 ... at our last meeting we discussed how to version the models; adapters and LoRAs, and model management were suggested as topics to be discussed in this group
15:55:20 q?
15:55:23 q+
15:55:24 ack dom
15:55:54 Dom: open-ended question, one topic I'd expect to be brought up is questions on ethical considerations for ML
15:56:23 ... would a model that is used by these APIs be documented in a way developers can learn how it was trained, its bias, and other qualities, a la Model Cards style info
15:56:43 ... as we think about bringing specific models as part of the API surface in browsers, thinking about ethical aspects is important
15:56:58 q?
15:57:05 ack christianliebel
15:57:36 Christian: good questions, tough to answer; I have given a dozen presentations on these APIs, can confirm the Prompt API has the most interest
15:57:36 q+
15:57:48 ... restricting it to extensions only is a problem for developers
15:58:00 ... on the ethical part, how to make sure the answers are safe?
15:58:10 ... is there filtering, can I query it?
15:58:21 ... overall I'm happy we have these APIs here, and look forward to seeing more
15:59:00 ... for specific issues, output constraining is important, our customers use function calling and multi-modal; the Prompt API represents what LLMs were a year ago
15:59:09 q+
15:59:14 q?
15:59:16 ack jsbell
15:59:45 jsbell: +1 to what Christian said, we want to work through the Prompt API issues; also, Dom, thanks for the ethical considerations
15:59:58 +1 that WebNN delegates that question rather than solving it :)
16:00:21 ... in the WebNN API there's transparency, but it pushes complexity to the developer on how "responsible" the model is
16:00:56 jsbell: this is not the first time we have ML-backed APIs in the browser, we have the Web Speech API
16:01:01 q?
16:01:04 ack McCool
16:01:37 McCool: STT and TTS, use cases for the Prompt API?
16:01:51 jsbell: we'll explore multimodal in 2025, the current APIs are only text-to-text
16:01:52 q?
16:02:04 Topic: Happy Holidays!
16:02:18 anssik: Thank you everyone for your significant contributions during 2024!
16:02:25 ... our Working Group accomplished a lot this year
16:02:30 ... a few highlights:
16:02:53 ... the WebNN API evolved, driven by research into popular, more advanced models and more diverse implementations
16:03:03 ... the WebNN API Candidate Rec Snapshot milestone was met in 2Q 2024
16:03:16 ... the WG made in total ~100 spec publications and merged 180+ PRs this year
16:03:39 ... many new active contributors joined, Dwayne as a co-editor; the group grew and diversified further
16:03:58 ... we converged on new API abstractions for tensors and device selection, and defined op set principles
16:04:13 ... we improved the spec quality significantly with expert advice
16:04:38 ... we organized our first F2F in Anaheim and it was a blast
16:04:55 ... we made strong progress on the implementations across 3 backends, XPUs, and multiple OSes
16:05:11 ... we witnessed positive buzz in the tech industry around WebNN and made a few keynote appearances
16:05:26 ... a lot of exciting demos and samples were published, and WPT test coverage improved
16:05:31 ... and much more!
16:05:39 ... we're entering an exciting phase of development in 2025
16:05:57 ... the WebNN API is expected to get into the hands of more developers for large-scale trials, and more
16:06:03 ... feedback from developers and users will help guide our priorities
16:06:20 ... Happy Holidays everyone -- please relax, disconnect, and recharge
16:06:29 ... see you on our next call 16 Jan 2025!
16:07:02 RRSAgent, draft minutes
16:07:03 I have made the request to generate https://www.w3.org/2024/12/19-webmachinelearning-minutes.html anssik
17:46:39 zkis has joined #webmachinelearning
18:30:50 Zakim has left #webmachinelearning