14:58:58 <RRSAgent> RRSAgent has joined #webmachinelearning
14:59:02 <RRSAgent> logging to https://www.w3.org/2023/11/30-webmachinelearning-irc
14:59:02 <Zakim> RRSAgent, make logs Public
14:59:03 <Zakim> please title this meeting ("meeting: ..."), anssik
14:59:06 <anssik> Meeting: WebML WG Teleconference – 30 November 2023
14:59:11 <anssik> Chair: Anssi
14:59:15 <anssik> Agenda: https://github.com/webmachinelearning/meetings/blob/main/telcons/2023-11-30-wg-agenda.md
14:59:19 <anssik> Scribe: Anssi
14:59:23 <anssik> scribeNick: anssik
14:59:30 <anssik> gb, this is webmachinelearning/webnn
14:59:30 <gb> anssik, OK.
14:59:35 <anssik> Present+ Anssi_Kostiainen
14:59:43 <chai> chai has joined #webmachinelearning
14:59:53 <anssik> Present+ Chai_Chaoweeraprasit
14:59:59 <anssik> Present+ Zoltan_Kis
15:00:13 <anssik> Present+ Rachel_Yager
15:00:19 <anssik> RRSAgent, draft minutes
15:00:20 <RRSAgent> I have made the request to generate https://www.w3.org/2023/11/30-webmachinelearning-minutes.html anssik
15:00:28 <etiennenoel> etiennenoel has joined #webmachinelearning
15:00:28 <anssik> Regrets+ Dominique_Hazael-Massieux
15:00:30 <jsbell> jsbell has joined #webmachinelearning
15:00:30 <anssik> Present+ Etienne_Noel
15:00:52 <anssik> Present+ Joshua_Bell
15:01:09 <Ningxin_Hu> Ningxin_Hu has joined #webmachinelearning
15:02:00 <anssik> Present+ Joshua_Lochner
15:02:07 <anssik> Present+ Rafael_Cintron
15:02:22 <anssik> Present+ Dwayne_Robinson
15:02:30 <Joshua_Lochner> Joshua_Lochner has joined #webmachinelearning
15:02:57 <anssik> Topic: Announcements
15:03:01 <anssik> Subtopic: Implementation status
15:03:06 <anssik> -> Implementation Status of WebNN Operations https://webmachinelearning.github.io/webnn-status/
15:03:10 <anssik> anssik: implementation status has been updated for:
15:03:19 <anssik> ... - WebNN CPU XNNPack backend
15:03:19 <anssik> ... - WebNN GPU DirectML backend
15:03:19 <anssik> ... - ORT
15:03:23 <anssik> ... for details, please see the webnn_status.json diff at https://github.com/webmachinelearning/webmachinelearning.github.io/pull/58/files
15:03:30 <anssik> anssik: this was a team effort, thanks @lisa0314 @miaobin @Honry @mingmingtasd @BruceDai @shiyi9801!
15:03:30 <gb> https://github.com/lisa0314 -> @lisa0314
15:03:30 <gb> … https://github.com/miaobin -> @miaobin
15:03:30 <gb> … https://github.com/Honry -> @Honry
15:03:30 <gb> … https://github.com/mingmingtasd -> @mingmingtasd
15:03:31 <gb> … https://github.com/BruceDai -> @BruceDai
15:03:34 <gb> … https://github.com/shiyi9801 -> @shiyi9801
15:03:42 <RafaelCintron> RafaelCintron has joined #webmachinelearning
15:04:03 <RachelY> RachelY has joined #webmachinelearning
15:04:03 <anssik> Subtopic: Upcoming discussion with the Web LLM author Tianqi Chen
15:04:17 <dwayner> dwayner has joined #webmachinelearning
15:04:19 <anssik> anssik: I had a discussion with Tianqi Chen from CMU, OctoML, creator of Web LLM, a JS library that accelerates select LLMs in browsers with WebGPU
15:04:27 <anssik> -> Web LLM repo https://github.com/mlc-ai/web-llm
15:04:35 <anssik> ... Tianqi shared he's very supportive of our work in this WG and is interested in working with us more closely
15:04:47 <anssik> ... given Tianqi's highly relevant expertise and interest, I've initiated the process to bring him on board the WG as an Invited Expert
15:04:52 <anssik> ... this will allow him to contribute in a full capacity
15:04:58 <anssik> ... I've tentatively scheduled a WG discussion with Tianqi on 11 January 2024
15:05:08 <anssik> ... Tianqi has already shared use cases with this WG for:
15:05:11 <anssik> ... - hybrid execution of models (i.e. WebGPU for custom ops + WebNN)
15:05:26 <anssik> ... - JSON schema of the WebNN declaration i.e. the compiler projects can generate a schema and invoke executions without explicitly doing so in JS
15:05:39 <anssik> ... if you have questions for Tianqi e.g. about his work on Web LLM, that will be a great opportunity to ask them
15:05:43 <anssik> ... we have opened a dedicated issue for the hybrid execution use case, currently a high-level description, but can be appended to with more details
15:05:46 <anssik> -> Hybrid execution use case issue https://github.com/webmachinelearning/webnn/issues/480
15:05:50 <gb> https://github.com/webmachinelearning/webnn/issues/480 -> Issue 480 Hybrid execution use case from Web LLM project (by anssiko) [use case]
15:06:10 <anssik> Topic: WebNN v2: Review transformer ops spec contributions (continued)
15:06:15 <anssik> anssik: issue #375 and PR #478
15:06:16 <gb> https://github.com/webmachinelearning/webnn/issues/478 -> Pull Request 478 Add support for operations needed for well-known transformers e.g. Segment Anything, Stable Diffusion, etc. (by wchao1115)
15:06:16 <gb> https://github.com/webmachinelearning/webnn/issues/375 -> Issue 375 Support for transformers (by dontcallmedom) [v2] [operation set]
15:06:32 <anssik> ... on our last call I drew the WG's attention to this major PR #478 that was looking for everyone’s review and feedback
15:07:04 <anssik> ... I also shared my expectation that by this meeting on 30 Nov we are in a position to make a merge decision, let's discuss now whether we're there yet or whether we want some additional time for further refinements
15:07:22 <anssik> ... first, I want to thank the entire group for your active review and Chai for responding to the review comments that reflect the group's consensus
15:07:28 <anssik> ... this major PR has been a great group effort, thank you all!
15:07:42 <anssik> ... I produced a hand-rolled IDL diff between the latest published version from 26 October 2023 and this PR
15:07:45 <anssik> -> Hand-rolled IDL diff https://github.com/webmachinelearning/webnn/pull/478#issuecomment-1833600693
15:08:19 <anssik> ... I compiled a list of opens from the PR review, some of these may require no action, so we can go through quickly
15:08:45 <zkis> zkis has joined #webmachinelearning
15:08:45 <anssik> Subtopic: NavigatorML mixin
15:08:51 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#discussion_r1410542879
15:09:04 <anssik> anssik: I suppose the NavigatorML mixin was removed by accident, without this mixin, the ML object is not exposed via navigator.ml
15:09:10 <anssik> ... a fix is to bring this back
15:10:54 <anssik> chai: thanks for the review everyone, especially Ningxin and Dwayne for careful comments
15:11:13 <anssik> ... all reviewers please resolve discussions in the GH PR that have been resolved
15:11:56 <anssik> ... for changes unrelated to this PR, please open a separate issue and link to this big PR
15:14:19 <anssik> Subtopic: Hard to tell whether an MLOperand is a constant or not
15:14:26 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#discussion_r1410103395
15:14:31 <anssik> -> https://www.w3.org/TR/webnn/#dom-mlgraphbuilder-constant
15:14:53 <anssik> Ningxin: It's hard to tell whether an MLOperand is a constant or not. The current algorithm of constant only creates an implementation-defined platform operand and sets values to it.
15:15:01 <Ningxin_Hu> q+
15:15:43 <anssik> ack Ningxin_Hu
15:16:06 <anssik> Ningxin_Hu: this is for the gather op validating index parameter
15:17:09 <anssik> ... a step in the validation algorithm: only if the operand is a constant can the implementation access the data of that operand, otherwise it is runtime behavior
15:18:10 <anssik> ... current algorithm step only works with constant operation, Chai wanted me to propose some text to address this, we don't mark op as constant or not, need some text ensure
15:19:00 <anssik> ... in Chromium impl discussion on how to address OOB, proposed for discussion in a separate issue
15:19:43 <anssik> Chai: I think this should be tracked as a separate issue
15:19:55 <anssik> Zoltan: platform tests for this that it is always a constant?
15:20:06 <anssik> Ningxin_Hu: it does not need to always be constant I think
15:20:49 <anssik> anssik: proposal to create a separate issue for this
15:20:53 <anssik> Ningxin_Hu: I'll do that
15:20:56 <anssik> anssik: thanks!
15:21:06 <anssik> Subtopic: Make standalone argMax and argMin operators
15:21:15 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#discussion_r1410104309
15:21:21 <anssik> -> https://pr-preview.s3.amazonaws.com/webmachinelearning/webnn/pull/478.html#mlgraphbuilder-reduce-op
15:21:39 <anssik> Ningxin: two issues of output operand creation:
15:21:46 <anssik> ... - The output shape should be calculated instead of just copying input's.
15:21:59 <anssik> ... - The output data type of reduceArgMax and reduceArgMin should be unsigned integer rather than setting to input's.
15:22:34 <anssik> Chai: I agree these should be separated out
15:23:05 <anssik> ... I will commit that change soon to this branch
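A minimal sketch of the first point above (output shape calculated rather than copied from the input), assuming the reduced axes are either dropped or kept as size 1 depending on keepDimensions. The function name and the "uint32" placeholder are hypothetical illustrations, not spec text; the discussion only says the arg reductions should produce an unsigned integer type.

```python
def reduce_output_shape(input_shape, axes=None, keep_dimensions=False):
    """Compute the output shape of a reduction over the given axes."""
    rank = len(input_shape)
    if axes is None:
        # Assumption: omitting axes means reducing over every dimension.
        axes = list(range(rank))
    for axis in axes:
        if not 0 <= axis < rank:
            raise ValueError(f"axis {axis} is out of range for rank {rank}")
    reduced = set(axes)
    if keep_dimensions:
        # Reduced dimensions are retained with size 1.
        return [1 if i in reduced else d for i, d in enumerate(input_shape)]
    # Reduced dimensions are removed from the shape.
    return [d for i, d in enumerate(input_shape) if i not in reduced]


def arg_reduce_output_descriptor(input_shape, axes=None, keep_dimensions=False):
    """Describe the output of an argMax/argMin style reduction."""
    return {
        # Hypothetical placeholder for "unsigned integer"; never the input type.
        "data_type": "uint32",
        "dimensions": reduce_output_shape(input_shape, axes, keep_dimensions),
    }
```

For example, reducing a [2, 3, 4] input over axis 1 gives [2, 4], or [2, 1, 4] when the reduced dimension is kept.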
15:23:28 <anssik> Subtopic: Gather op implementation considerations for out-of-bound indices
15:23:33 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#discussion_r1396672041
15:23:44 <anssik> anssik: Jiewei has a proposal for an informative section:
15:23:53 <anssik> ... 1. Runtime out-of-bounds indices should be explicitly handled by either the browser implementation or the platform implementation, to avoid OOB memory accesses.
15:23:53 <anssik> ... 2. If the platform implementation doesn't handle out-of-bounds indices, the browser implementation should take steps to ensure the platform operator doesn't receive out-of-bound indices
15:24:35 <anssik> anssik: I propose these bullets to be added in gather section as an informative note and a link added to https://www.w3.org/TR/webnn/#security
15:24:48 <anssik> ... for the third bullet, a separate GH issue should be opened:
15:24:52 <anssik> ... 3. Mention what caller should expect as a result of list item 2: 0, NaN, first/last indices (if implemented with clamp)
15:25:46 <anssik> Dwayne: discussing this in context of the separate gather issue sounds good
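A minimal sketch of list item 2 above, assuming the browser implementation chooses clamping as its policy before handing indices to a platform operator that does no bounds checking. The function names are hypothetical, and clamping is only one of the outcomes mentioned in list item 3 (0, NaN, first/last indices).

```python
def clamp_indices(indices, axis_size):
    """Clamp every index into the valid range [0, axis_size - 1]."""
    if axis_size <= 0:
        raise ValueError("cannot gather from an empty axis")
    return [min(max(i, 0), axis_size - 1) for i in indices]


def safe_gather_1d(values, indices):
    """Gather from a 1-D list after clamping, so no out-of-bounds read can occur."""
    safe = clamp_indices(indices, len(values))
    return [values[i] for i in safe]
```

With this policy an index past the end reads the last element and a negative index reads the first, which is the "first/last indices" result a caller would observe.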
15:26:22 <anssik> Subtopic: MLReduceOptions.keepDimensions scan direction
15:26:27 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#discussion_r1396829866
15:26:31 <anssik> -> https://pr-preview.s3.amazonaws.com/webmachinelearning/webnn/pull/478.html#dictdef-mlreduceoptions
15:26:51 <anssik> anssik: Dwayne notes that for PT/TF compat, should support both increasing and decreasing axis scan directions for tied values
15:27:08 <anssik> ... this issue preexists this PR, proposed as a separate issue
15:28:19 <anssik> Ningxin_Hu: this is related to reduceArgMax and reduceArgMin, they don't have separate signatures
15:28:40 <anssik> Dwayne: will discuss with Chai to come up with a solution
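A minimal sketch of what the two scan directions mean for tied values, assuming a 1-D input. The select_last_index flag is a hypothetical parameter used for illustration; the group has not agreed on a signature.

```python
def arg_max_1d(values, select_last_index=False):
    """Return the index of the maximum, resolving ties by scan direction."""
    best = max(values)  # raises ValueError on an empty input
    positions = [i for i, v in enumerate(values) if v == best]
    # Increasing scan keeps the first tied index, decreasing scan keeps the last.
    return positions[-1] if select_last_index else positions[0]
```

For the input [1, 7, 3, 7] the two directions return index 1 and index 3 respectively, which is why frameworks that differ on this cannot both be matched without an option.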
15:28:50 <anssik> q?
15:28:59 <anssik> Subtopic: Rename MLOperand.type() to dataType()
15:29:08 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#pullrequestreview-1745700960
15:29:12 <anssik> -> https://www.w3.org/TR/webnn/#dom-mloperanddescriptor-datatype
15:29:19 <anssik> anssik: to align with MLOperandDescriptor.dataType
15:29:32 <anssik> Chai: already fixed
15:29:47 <anssik> Subtopic: Examples of how gather works in different slicing schemes
15:29:54 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#discussion_r1402915672
15:29:58 <anssik> -> https://pr-preview.s3.amazonaws.com/webmachinelearning/webnn/pull/478.html#example-3d538e0c
15:30:02 <anssik> anssik: Jiewei proposes a more extreme example as the last one
15:30:07 <anssik> input.shape = (2,3,2)
15:30:07 <anssik> indices.shape = (3,4,5)
15:30:07 <anssik> output.shape = (2,3,4,5,2)
15:30:35 <anssik> Chai: for gather there are quite a few samples, all major cases covered probably
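A minimal sketch of how the shapes in the proposed example relate, assuming the usual gather rule that the indices shape replaces the gathered axis. The example above does not state the axis; axis=1 is an assumption that happens to reproduce the quoted output shape. The function name is hypothetical.

```python
def gather_output_shape(input_shape, indices_shape, axis=0):
    """Output shape = input dims before axis + indices dims + input dims after axis."""
    rank = len(input_shape)
    if not 0 <= axis < rank:
        raise ValueError(f"axis {axis} is out of range for rank {rank}")
    return tuple(input_shape[:axis]) + tuple(indices_shape) + tuple(input_shape[axis + 1:])
```

Calling gather_output_shape((2, 3, 2), (3, 4, 5), axis=1) yields (2, 3, 4, 5, 2), matching the output shape listed in the example.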
15:30:58 <anssik> Subtopic: Naming logicalNot or not
15:31:04 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#discussion_r1406997415
15:31:30 <anssik> Dwayne: OK to make this a separate issue
15:31:50 <anssik> Subtopic: Naming where arguments
15:31:55 <anssik> -> https://github.com/webmachinelearning/webnn/pull/478#discussion_r1408773430
15:32:10 <anssik> anssik: proposal from CL review, rationale "both input and other are 'inputs'"
15:32:17 <anssik> ... From: MLOperand where(MLOperand condition, MLOperand input, MLOperand other);
15:32:21 <anssik> ... To:   MLOperand where(MLOperand condition, MLOperand trueValue, MLOperand falseValue);
15:33:31 <jsbell> q+
15:33:38 <anssik> Chai: naming arguments is hard
15:33:40 <anssik> q?
15:33:50 <anssik> ack jsbell
15:34:46 <anssik> Chai: discoverability is important, in that if someone implements this API in their framework they need to be able to find the thing and map it to their implementation
15:34:54 <jsbell> q-
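A minimal sketch of the element-wise selection that where performs, written with the proposed trueValue/falseValue naming to show how the names read at a call site. It assumes flat lists of equal length and ignores broadcasting; it is an illustration, not the spec algorithm.

```python
def where(condition, true_value, false_value):
    """Pick true_value[i] where condition[i] is non-zero, else false_value[i]."""
    if not (len(condition) == len(true_value) == len(false_value)):
        raise ValueError("this sketch requires equal-length inputs (no broadcasting)")
    return [t if c else f for c, t, f in zip(condition, true_value, false_value)]
```

For example, where([1, 0, 1], [10, 20, 30], [-1, -2, -3]) selects [10, -2, 30].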
15:35:39 <anssik> Subtopic: Merge readiness check
15:36:19 <jsbell> q+
15:36:45 <anssik> ack jsbell
15:38:06 <anssik> jsbell: regarding resolving comments in the PR review, my comments are addressed, I just couldn't resolve those (due to GH permissions thing)
15:38:33 <anssik> ... looks like this PR tries to resolve all in one PR, rather than multiple smaller PRs
15:39:05 <anssik> q?
15:39:41 <Ningxin_Hu> +1 to have small PRs after this one
15:39:44 <anssik> Chai: agree we should do incremental PRs going forward
15:40:21 <Ningxin_Hu> q+
15:41:04 <anssik> anssik: once all PR review discussions have been resolved we are ready to merge the PR, agreed?
15:41:29 <anssik> ... either by spinning off into separate open issues or resolving discussion on the spot in the PR
15:42:29 <anssik> q?
15:42:44 <anssik> Dwayne: I don't see resolve conversation button either
15:43:54 <anssik> anssik: please thumb up on the very last comment to signal you're OK to resolve
15:44:28 <anssik> q?
15:44:30 <anssik> ack Ningxin_Hu
15:44:41 <Ningxin_Hu> https://github.com/webmachinelearning/webnn/pull/478#issuecomment-1815864650
15:44:47 <anssik> Ningxin_Hu: to respond to jsbell re conformance tests, see the link above
15:45:00 <anssik> ... the baseline implementation is pure JS
15:45:18 <anssik> ... we have WIP wpt tests too for these ops
15:45:29 <anssik> ... I hope that will unblock this PR
15:45:50 <anssik> jsbell: I saw the table, no strong opinion where this lives, someplace where it is maintained properly
15:46:06 <anssik> Ningxin_Hu: I'll move the table to a separate issue
15:46:22 <anssik> q?
15:46:31 <chai> q+
15:46:34 <anssik> ack chai
15:47:13 <anssik> chai: I have a separate ask, I think that at some point the spec should have a way for the people to identify what are the ops they are talking about
15:47:28 <anssik> ... the browser needs to identify the implementation to be compliant to something
15:48:00 <anssik> ... in the early HTML days, there was a notion of CSS Level 1 and Level 2, sounded dubious in the beginning, don't know what it might look like here
15:48:01 <anssik> q?
15:48:18 <anssik> anssik: Living Standard is the trend
15:49:18 <anssik> jsbell: there's no perfect answer, one approach some Chrome DevRel folks are working on is called baseline, that's not at the level of individual methods etc. but saying in 2024 these APIs work across browsers
15:49:39 <anssik> ... not sure if they've looked at lower level "these 5 methods are supported in 2024 across X, Y and Z"
15:50:21 <anssik> ... the v1 of the spec, with multiple implementations across browsers, we want that browsers only claim compliance when they pass all wpt tests
15:50:50 <anssik> ... as for approaches in my specs, I've called out things that are new and tracked them as implementations adopt those
15:51:07 <anssik> ... new functionality could be advertised and communicate what is implemented and where
15:51:19 <anssik> https://webmachinelearning.github.io/webnn-status/
15:52:15 <anssik> jsbell: some non-Chromium browsers often say they have no support for a feature unless they pass all the wpt tests
15:52:40 <anssik> ... sometimes tests miss something obvious, so this model fails in that aspect sometimes
15:53:02 <anssik> ... good wpt coverage is important for our team
15:53:04 <anssik> q?
15:53:42 <anssik> https://github.com/webmachinelearning/meetings/blob/main/telcons/2023-11-30-wg-agenda.md
15:54:02 <zkis_> zkis_ has joined #webmachinelearning
15:54:05 <anssik> q?
15:54:57 <anssik> Subtopic: Should scale and bias be required inputs for batchNormalization op?
15:54:57 <anssik> anssik: issue #481
15:54:57 <gb> https://github.com/webmachinelearning/webnn/issues/481 -> Issue 481 Should `scale` and `bias` be required inputs for `batchNormalization` op? (by huningxin)
15:54:58 <anssik> anssik: Ningxin did a very thorough investigation into this issue, summary:
15:55:10 <anssik> ... - currently in batchNormalization scale and bias operands are optional members of MLBatchNormalizationOptions dictionary
15:55:30 <anssik> ... - current algorithm: if scale is not present, the element-wise multiplication can be eliminated, and if bias is not present, the element-wise addition can be eliminated too
15:55:56 <anssik> anssik: Ningxin notes, however, there's an issue: "the optional scale and bias are not widely supported across frameworks and native ML APIs. This would cause the implementation more complex for those native ML APIs which don't support optional scale and bias"
15:56:30 <anssik> ... for details of the frameworks and native ML APIs, see the GH issue
15:56:30 <anssik> ... Ningxin proposes a solution: "make the two operands required"
15:56:30 <anssik> ... and notes that for models that don't use scale and bias, the frameworks can set scale to 1 and bias to 0
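A minimal sketch of why setting scale to 1 and bias to 0 is equivalent to omitting them, assuming the common batch normalization formula scale * (x - mean) / sqrt(variance + epsilon) + bias applied to a single channel with scalar statistics. The function name and the epsilon default are assumptions for illustration.

```python
import math


def batch_normalization(x, mean, variance, scale=None, bias=None, epsilon=1e-5):
    """Normalize one channel; the multiply and add are skipped when not given."""
    out = [(v - mean) / math.sqrt(variance + epsilon) for v in x]
    if scale is not None:
        out = [o * scale for o in out]  # element-wise multiplication
    if bias is not None:
        out = [o + bias for o in out]   # element-wise addition
    return out
```

Passing scale=1.0 and bias=0.0 returns the same values as leaving both out, so requiring the operands costs a framework two trivial tensors, while keeping them optional lets an implementation skip two element-wise steps.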
15:56:38 <anssik> q?
15:56:51 <anssik> Dwayne: makes sense, don't know why these were originally optional
15:57:02 <anssik> q?
15:57:10 <jsbell> q+
15:57:13 <anssik> ack jsbell
15:57:35 <anssik> jsbell: I'm relaying a comment from someone from my team who pointed out there may be some confusion re optional and required
15:58:14 <anssik> ... if we give scale and bias default values, do we want to force developers to pass values? Or use common defaults?
15:58:16 <anssik> q?
15:58:59 <anssik> ... i.e. make default scale be 1 and bias be 0
15:59:12 <anssik> Dwayne: these will be called from frameworks with tensors lying around
15:59:31 <anssik> Chai: I need some time to read this issue one more time
15:59:56 <Ningxin_Hu> q+
15:59:56 <anssik> ... the idea of optional in the API is to not make the API signature cluttered and help with future revisions
16:00:13 <anssik> ... for this specific issue, scale and bias as required, need to inquire how strong this feedback is
16:00:24 <anssik> ... you have something optional, you can always ask to be more explicit about it
16:00:34 <anssik> ack Ningxin_Hu
16:01:04 <anssik> Ningxin_Hu: SGTM, I think there's an opportunity to make this an optimization opportunity
16:01:26 <anssik> ... if some native ML API can make use of this optimization then keeping this optional is reasonable
16:01:31 <anssik> ... open for discussion
16:01:40 <anssik> q?
16:01:40 <jsbell> I'll discuss in more detail w/ my folks, see if they want to add comments to the issue
16:01:47 <anssik> RRSAgent, draft minutes
16:01:48 <RRSAgent> I have made the request to generate https://www.w3.org/2023/11/30-webmachinelearning-minutes.html anssik
16:03:11 <anssik> q?
16:07:15 <anssik> s|https://github.com/webmachinelearning/meetings/blob/main/telcons/2023-11-30-wg-agenda.md|Topic: Enhancements
16:07:18 <anssik> RRSAgent, draft minutes
16:07:20 <RRSAgent> I have made the request to generate https://www.w3.org/2023/11/30-webmachinelearning-minutes.html anssik
16:10:26 <anssik> s|@shiyi9801|@shiyi9801 @ibelem
16:10:26 <gb> https://github.com/shiyi9801 -> @shiyi9801
16:10:26 <gb> … https://github.com/ibelem -> @ibelem
16:10:31 <anssik> RRSAgent, draft minutes
16:10:32 <RRSAgent> I have made the request to generate https://www.w3.org/2023/11/30-webmachinelearning-minutes.html anssik