13:51:13 RRSAgent has joined #webmachinelearning
13:51:13 logging to https://www.w3.org/2022/10/20-webmachinelearning-irc
13:51:15 RRSAgent, make logs Public
13:51:19 please title this meeting ("meeting: ..."), anssik
13:51:19 Meeting: WebML WG Teleconference – 20 October 2022
13:51:20 Chair: Anssi
13:51:24 Agenda: https://github.com/webmachinelearning/meetings/blob/main/telcons/2022-10-20-wg-agenda.md
13:51:30 Scribe: Anssi
13:51:34 scribeNick: anssik
13:55:34 Present+ Anssi_Kostiainen
13:55:38 Regrets+ Dominique_Hazael-Massieux
13:55:46 RRSAgent, draft minutes
13:55:46 I have made the request to generate https://www.w3.org/2022/10/20-webmachinelearning-minutes.html anssik
14:00:08 bruce_dai has joined #webmachinelearning
14:00:30 ningxin_hu has joined #webmachinelearning
14:00:48 Present+ Zoltan_Kis
14:00:57 Present+ Bruce_Dai
14:01:03 Present+ Ningxin_Hu
14:03:35 zkis has joined #webmachinelearning
14:03:39 Present+ Chai_Chaoweeraprasit
14:03:45 Present+ Rafael_Cintron
14:03:46 RafaelCintron has joined #webmachinelearning
14:04:29 chai has joined #webmachinelearning
14:05:34 Topic: WebML WG Charter 2023-2025 under development
14:06:22 anssik: the Web Machine Learning Working Group Charter for 2023-2025 is now under development.
14:07:16 ... Please review the draft PR and open issues, provide your comments, and open new issues as appropriate to help shape the WG's technical scope.
14:07:21 -> Announcement https://lists.w3.org/Archives/Public/public-webmachinelearning-wg/2022Oct/0002.html
14:07:27 -> Charter PR https://github.com/w3c/machine-learning-charter/pull/19
14:07:36 -> Charter open issues https://github.com/w3c/machine-learning-charter/issues
14:08:05 anssik: in the charter PR I included the expected timeline:
14:08:12 ... - Q4 '22: Charter development
14:08:25 ... - Q1 '23: W3C Advisory Committee review
14:08:50 ... - Q2 '23: Charter approved
14:09:13 anssik: I propose we quickly touch on each of the issues recorded based on feedback from the WG
14:09:29 ... if you have immediate feedback or comments, feel free to queue yourself; otherwise please provide your feedback in the GH issues
14:09:37 q+
14:09:38 ... we expect to have a good draft charter ready by EOY 2022
14:09:41 ack RafaelCintron
14:09:56 RafaelCintron: how is the charter changing from last time?
14:10:26 anssik: up to the WG
14:11:21 ... we can also ask for an extension
14:11:30 RafaelCintron: what is the proposal then, keep going or change?
14:12:50 RafaelCintron: I'm happy with our current charter, but happy to review any adjustments too
14:13:59 q?
14:14:29 Subtopic: Features deferred to WebNN v2
14:15:04 -> Features deferred to WebNN v2 https://github.com/w3c/machine-learning-charter/issues/25
14:15:41 https://github.com/webmachinelearning/webnn/labels/v2
14:15:53 anssik: "WebNN v2" is a construct that refers to the WebNN API spec post initial Candidate Recommendation
14:16:11 ... v1 in itself is expected to be a useful API
14:16:37 ... the WG has labeled a few issues as "v2", so please check those out and let us know if you have other suggestions
14:17:18 https://github.com/webmachinelearning/webnn/issues/128
14:18:18 Subtopic: Dedicated ML hardware accelerators: NPU, VPU, xPU
14:18:28 -> Dedicated ML hardware accelerators: NPU, VPU, xPU https://github.com/w3c/machine-learning-charter/issues/24
14:18:43 anssik: The initial version of WebNN specifies two device types, "cpu" and "gpu".
14:19:01 ... However, the API is extensible with new device types, and in our discussions support for NPU, VPU, or xPU has come up as a new "v2" feature.
14:19:26 ... The initial charter refers to "dedicated ML hardware accelerators" in its Motivation and Background, but if this is important we could be more explicit regarding NPU/VPU/xPU device type support.
14:19:55 q?
14:20:14 q+
14:20:19 ack chai
14:21:55 chai: I think NPU is quite important, especially for mobile scenarios
14:22:09 ... if we can make it part of the charter, it is quite important
14:23:23 q?
14:24:35 anssik: is a unified hardware-agnostic WebNN API the WG's primary goal?
14:24:41 chai: I think so
14:25:27 q+
14:25:30 ack ningxin_hu
14:25:59 ningxin_hu: I talked with Chai about this unified hardware-agnostic behaviour
14:26:27 q+
14:26:35 ... some hardware platforms may not support e.g. certain data types; the WebNN spec should define whether it allows feature detection so web apps can adapt to hardware differences
14:26:46 ... also backwards compatibility and versioning considerations
14:26:58 q+
14:27:01 ... when we go to WebNN v2, how do we deal with this versioning and feature detection
14:27:17 q?
14:27:32 ack zkis
14:28:31 zkis: to reinforce Chai's point, depending on the underlying platform we can run an op without an error, but it is dynamic, we cannot know it a priori
14:28:36 ack chai
14:29:09 chai: re fallback, there are two use cases: WebNN is a backend, the framework is on top
14:29:29 ... in the 1st case the framework can handle fallback, e.g. CPU fallback
14:30:10 ... in the 2nd use case, NPUs may get more popular but can only support a subset of ops
14:30:33 ... there the fallback needs to happen below the WebNN API
14:30:47 ... in the 1st use case the fallback happens above WebNN, in the framework
14:31:12 ... for example, given the "auto" device type, the fallback happens underneath the WebNN implementation
14:31:36 ... the second use case is important for v2 I think, especially when the app says "I don't care, handle this for me"
14:31:48 ... e.g. the app does not want to deal with error codes
14:32:07 ... if we make explicit which errors make WebNN fail, we have a possible fingerprinting issue
14:33:02 q?
14:34:26 q+
14:34:32 ack zkis
14:36:54 q?
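[Editor's sketch] Chai's first fallback case above — the framework sitting on top of WebNN tries device types in preference order and falls back itself when one is unavailable — can be sketched as follows. This is illustrative, not from the minutes: only `navigator.ml.createContext()` with the "cpu" and "gpu" device types is in the initial WebNN spec; the "npu" device type and the `createContextWithFallback` helper are hypothetical, and `ml` stands in for `navigator.ml` so the pattern can be shown outside a browser.

```javascript
// Framework-level ("above WebNN") fallback: try preferred device types
// in order, moving to the next one when the implementation rejects it.
// "npu" is a hypothetical future device type; "gpu" and "cpu" are the
// two defined by the initial WebNN spec.
async function createContextWithFallback(ml, deviceTypes = ["npu", "gpu", "cpu"]) {
  let lastError;
  for (const deviceType of deviceTypes) {
    try {
      // Assumed WebNN shape: createContext(options) resolves to an MLContext.
      return await ml.createContext({ deviceType });
    } catch (err) {
      lastError = err; // device type unsupported here; try the next one
    }
  }
  throw lastError ?? new Error("no usable device type");
}
```

Chai's second case (the app says "handle this for me") would instead hide all of this beneath the API, e.g. behind an "auto" device type, which is where the fingerprinting concern about exposing per-device failure reasons comes in.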
14:37:02 Subtopic: Set of ops supported must be more comprehensive
14:37:07 -> Set of ops supported must be more comprehensive https://github.com/w3c/machine-learning-charter/issues/23
14:37:14 anssik: Chai shared that external partners are looking for a more comprehensive set of ops
14:37:30 ... The current charter Scope enumerates a few common ones: "convolution, pooling, softmax, normalization, fully connected, activation, recurrent neural network (RNN) and long short-term memory (LSTM)".
14:37:52 ... This is not meant to be an all-inclusive list, and it gives the WG the ability to adapt to changes in this landscape.
14:38:10 ... At minimum, we should review the bullets in the Scope section and see whether to explicitly mention some of the more recent work such as transformers.
14:38:29 ... We want to give enough detail to give good direction without constraining the WG too much. The list of ops mentioned in the charter would be open-ended.
14:39:31 chai: transformer is a huge class of models
14:40:03 ... this is a big class of emerging models, but the term is not very informative
14:40:18 ... natural language processing is friendlier to the audience
14:40:30 ... NLP represents current transformer models
14:41:04 ... in the future transformers may become less popular when the next hotter architecture comes around, similarly to LSTMs in the past
14:41:39 q?
14:41:54 q+
14:41:57 ack ningxin_hu
14:42:32 ningxin_hu: I want to clarify what Chai said, do you mean the charter should focus more on usages, e.g. NLP or computer vision?
14:42:38 ... usage can change from time to time
14:42:48 ... we use different ML techniques to address these usages
14:43:17 ... architectures change, do you suggest we focus more on usages?
14:44:48 chai: correct, given the popularity of these more recent models
14:45:24 q?
14:45:32 Subtopic: Level of abstraction for neural net operations
14:45:39 -> Level of abstraction for neural net operations https://github.com/w3c/machine-learning-charter/issues/22
14:45:54 anssik: the WebNN explainer has a nice section that explains the rationale for the chosen level of abstraction for the neural network operations in the WebNN API.
14:46:07 ... Chai proposed we could integrate some of this explainer text into the next charter to provide more context on the level of abstraction. This could fit into the Scope section.
14:46:49 ... I recall past discussion around this topic; for example, Google was interested in XLA (Accelerated Linear Algebra) domain-specific compiler compatibility. The XLA project seems to be moving to an open governance model and is being decoupled from the TensorFlow project.
14:47:10 ... I'd welcome someone from Google to talk about their plans and expectations with XLA and its input language HLO IR (High Level Operations), and how they see it being part of the WebNN implementation story.
14:47:23 q?
14:47:33 ack zkis
14:47:34 q-
14:49:00 q+
14:49:11 q+
14:49:15 ack ningxin_hu
14:49:34 ningxin_hu: XLA moved to open governance, OpenXLA
14:50:07 ... Google previously proposed that; we investigated it with Chai, mapping ops to ONNX and XLA, and the gap did not seem big to me
14:50:23 ... I'd look forward to a concrete proposal from that community to understand the mapping to that abstraction and the gap
14:50:35 ... there was another proposal, TOSA (Tensor Op Set Arch)
14:51:11 ... we also investigated that; my question probably cannot be answered right now, but I'd like to understand whether we should follow up closely with one of these
14:51:40 q?
14:51:51 q+
14:51:57 ack chai
14:52:24 chai: just quickly, I was aware of OpenXLA when they started, not sure what they intend to do with it
14:52:38 ... it seems to be split from the TensorFlow project
14:52:50 q-
14:52:59 q+
14:53:02 ... it is good for us to point out to them that they should strive for compat with WebNN
14:53:04 q?
14:53:12 ack ningxin_hu
14:53:43 ningxin_hu: WebNN positions itself as a backend for frameworks
14:54:18 ... another issue is the graph compiler; we should make clear what WebNN's position is in this stack regarding DL compilers
14:54:49 ... is a compiler an implementation backend, does it use WebNN for codegen, or is it complementary to WebNN?
14:55:24 ... Google raised an issue about MLIR to this WG in the past; that question did not last long, but there are some DL compilers that are actively developed
14:56:32 q?
14:56:53 Subtopic: WebRTC coordination
14:56:57 -> WebRTC coordination https://github.com/w3c/machine-learning-charter/issues/21
14:57:13 anssik: We added an "Integration with real-time video processing" use case to the WebNN API spec based on learnings from our experimentation
14:57:25 ... For the next charter, we could be more explicit and confident in Coordination re WebNN and WebRTC
14:57:32 ... I made a suggestion in the issue and Dom +1'd it, so I'm thinking of tweaking the WebRTC coordination accordingly.
14:57:37 ... any further suggestions for perhaps even more explicit text for our WebRTC integration interests?
14:57:57 q?
14:58:06 Subtopic: WebGPU interoperability
14:58:12 -> WebGPU interoperability https://github.com/w3c/machine-learning-charter/issues/20
14:58:41 anssik: We discussed WebGPU interoperability expectations on our 6 October 2022 call and concluded working with WebGPU contributors is important for the success of the WebNN spec. I'd want us to revise the charter language around WebNN-WebGPU interoperability expectations accordingly.
14:58:56 ... The initial charter mentions WebGPU in the context of Out of Scope and Coordination
14:59:02 ... I think this needs to be revised to reflect our evolved thinking. For example:
14:59:10 ... "to avoid overlap with existing work, generic primitives used by traditional machine learning algorithms such as base linear algebra operations are out of scope. The WebGL and WebGPU shaders and WebAssembly SIMD are expected to address these requirements, see the Coordination section for details."
14:59:45 q?
15:00:04 q+
15:00:06 ack ningxin_hu
15:00:51 ningxin_hu: the current charter addresses the usage of custom ops; an early WebNN issue re custom ops was resolved by saying custom ops can be implemented with more generic APIs, Wasm SIMD etc.
15:01:18 ... Rafael mentioned some use cases, e.g. super resolution, that require WebNN to interact with WebGPU via resource and buffer sharing
15:01:35 ... this is not mentioned in the current charter; if this usage is important, I propose to make it more explicit
15:02:45 RafaelCintron: WebGPU interop is critical, and it should not compromise the perf of the API; details subject to discussion
15:03:16 RRSAgent, draft minutes
15:03:16 I have made the request to generate https://www.w3.org/2022/10/20-webmachinelearning-minutes.html anssik
16:33:34 zkis_ has joined #webmachinelearning
16:59:48 dom has joined #webmachinelearning
17:27:44 Zakim has left #webmachinelearning