13:58:36 RRSAgent has joined #webmachinelearning 13:58:40 logging to https://www.w3.org/2023/08/10-webmachinelearning-irc 13:58:40 RRSAgent, make logs Public 13:58:41 please title this meeting ("meeting: ..."), anssik 13:58:42 Meeting: WebML WG Teleconference – 10 August 2023 13:58:47 Chair: Anssi 13:59:11 Agenda: https://github.com/webmachinelearning/meetings/blob/main/telcons/2023-08-10-wg-agenda.md 13:59:15 Scribe: Anssi 13:59:21 scribeNick: anssik 14:00:38 dom__ has joined #webmachinelearning 14:01:18 ningxin_hu has joined #webmachinelearning 14:01:31 Present+ Dominique_Hazael-Massieux 14:01:41 Regrets+ Dominique_Hazael-Massieux 14:01:41 Present+ Joshua_Bell 14:02:09 JoshuaBell has joined #webmachinelearning 14:02:28 Rachel has joined #webmachinelearning 14:02:28 Present+ Anssi_Kostiainen 14:02:37 Present+ Zoltan_Kis 14:02:43 Present+ Ningxin_Hu 14:02:46 Joshua_Lochner has joined #webmachinelearning 14:02:48 Present+ Chai_Chaoweeraprasit 14:03:06 Present+ Rachel_Yager 14:03:13 Present+ Vivek_Sekhar 14:03:22 Present+ Joshua_Lochner 14:03:37 Regrets+ Dominique_Hazael-Massieux 14:03:47 RRSAgent, draft minutes 14:03:48 I have made the request to generate https://www.w3.org/2023/08/10-webmachinelearning-minutes.html anssik 14:04:04 Vivek has joined #webmachinelearning 14:04:21 chai has joined #webmachinelearning 14:04:36 anssik: Welcome back after the summer break to an amazing second half! 14:04:39 zkis has joined #webmachinelearning 14:05:54 Joshua_Bell: thanks for the intro, Anssi! 14:06:47 ... very new to ML space and diving in to learn and contribute on behalf of Google here, working with other Googlers like Vivek, on how to improve the Web Platform with ML, enhancements to Wasm, WebGPU, Storage APIs near to my heart, also WebNN and work this group has done so far 14:07:37 AramZS has joined #webmachinelearning 14:08:36 Topic: Announcements 14:08:53 -> Implementation Status of WebNN Operations updated https://webmachinelearning.github.io/webnn-status/ 14:09:09 anssik: in response to the WG's feedback the implementation status page has been updated, notably changes: 14:09:16 ... - List implementation status for all 60 v1 ops 14:09:36 ... - Clearer distinction between browser implementation and framework 14:09:41 ... - Add "Browser version" column 14:10:04 ... - Add DirectML backend as "WIP" while in a Chromium fork 14:10:17 ... - MDN Browser Compat Data 14:10:34 ... this status page fetches its data from webnn_status.json that contains the compat data in a machine-readable browser-compat-data ("BCD") format 14:10:38 -> webnn_status.json https://webmachinelearning.github.io/assets/json/webnn_status.json 14:10:55 anssik: This BCD data is used in web apps and software such as MDN Web Docs, CanIUse.com, Visual Studio Code, WebStorm and more 14:11:02 RafaelCintron has joined #webmachinelearning 14:11:17 ... this will help web developers write websites easier with WebNN, including auto-complete in IDE, checking for supported features per browser version 14:11:36 ... huge thanks to Belem and the team for putting this data together, maintaining this as the implementation and spec evolve is important 14:12:02 ... a future work item is to feed this data to MDN Web Docs and other sites to integrate this BCD data format 14:12:12 ... questions, comments? 14:12:36 RRSAgent, draft minutes 14:12:37 I have made the request to generate https://www.w3.org/2023/08/10-webmachinelearning-minutes.html anssik 14:12:43 q+ 14:13:25 q? 14:13:28 ack chai 14:13:41 Chai: thank you Anssi! 14:14:04 ... I noticed we refer to first 60 ops as "v1" and what comes after as "v2" 14:14:35 ... do we want to relate our internal "v1" and "v2" with spec milestones 14:16:40 anssik: how was IndexDB 2.0 story, Joshua? 14:17:14 Joshua_Bell: we used versioning for that spec as a best effort and it led to a bit of confusion and we moved to a Living Standard model 14:17:41 ... each version for IndexedDB is *not* incompatible with its previous major version 14:17:50 ... changes are additive 14:18:06 ... this is a little different on how specs work in some other WGs such as CSS 14:18:38 ... I was going to suggest we use the Living Standard model here, as this maps to how browsers evolve features and what developers expect 14:20:43 Chai: we say "The total number of WebNN v1 ops is 60" in the status page 14:21:32 anssik: let's remove all versioning related labeling from our web developer-facing docs 14:21:36 q? 14:21:47 ... other questions, comments? 14:21:51 RRSAgent, draft minutes 14:21:52 I have made the request to generate https://www.w3.org/2023/08/10-webmachinelearning-minutes.html anssik 14:22:26 Topic: WebNN v2: text-to-image and text-to-text use cases and requirements 14:22:33 anssik: As a post-break thing, I want to come back to transformer-based generative AI use cases for text-to-image and text-to-text models. 14:23:31 ... we choose these use cases as our initial v2 targets and are now soliciting input on new ops and data types required to support selected models informed by prototyping efforts. 14:23:42 ... As a refresher, we've had a few great presos in the group lately: 14:23:49 -> Transformers.js presentation by Joshua Lochner https://lists.w3.org/Archives/Public/www-archive/2023Jun/att-0000/Transformers_js.pdf 14:23:58 -> Transformer models presentation by Dwayne Robinson https://lists.w3.org/Archives/Public/www-archive/2023Jun/att-0005/2023-06-29_WebNN_and_Transformers_Progress_W3C.pdf 14:24:46 anssik: Transformer-based models and use cases discussion happens in #375 14:24:47 https://github.com/webmachinelearning/webnn/issues/375 14:24:55 dwayner has joined #webmachinelearning 14:25:16 Present+ Dwayne_Robinson 14:25:56 ... you can filter these v2 op proposals with "operation set" and "v2" label: 14:26:03 -> v2 op proposals https://github.com/webmachinelearning/webnn/issues?q=is%3Aopen+label%3A%22operation+set%22+label%3A%22v2%22 14:26:25 anssik: we have received one v2 op proposal, dedicated square root operator #438, a joint effort bt Dwayne and Ningxin. Thank you! 14:26:54 https://github.com/webmachinelearning/webnn/issues/438 14:27:40 ... you can use this issues #438 as an example on how to propose other v2 ops. Following our contribution guidelines this issues lists use case/sample model, discusses cross-framework support, cross-platform implementability. I expect us to have a similar GH issue for every new v2 op where to solicit input from all stakeholders 14:27:53 -> Contribution guidelines for proposing and adding a new operation https://github.com/webmachinelearning/webnn/blob/main/CONTRIBUTING.md#proposing-and-adding-a-new-operation 14:28:28 anssik: an action to everyone interested in v2 ops is to create similar issues for v2 op proposals, I'll apply GH labels to them so help aid discoverability 14:28:32 RRSAgent, draft minutes 14:28:33 I have made the request to generate https://www.w3.org/2023/08/10-webmachinelearning-minutes.html anssik 14:28:47 q? 14:30:30 q+ 14:30:55 ningxin_hu: re dedicated square root operator #438, XNNPACK only support a dedicated square root so we map that to the squareroot of XNNPACK, if we have this dedicated op we can map explicitly instead of indirectly, should be done by the user code 14:31:22 ack chai 14:32:15 chai: a reminder, from the spec review point of view, we'll also review the PR that is sent out to make changes to the spec after issue discussion 14:32:30 ... a PR is expected in the coming weeks 14:32:50 q? 14:33:44 Topic: WebIDL and Infra standard conventions 14:34:23 anssik: The `zk-conventions-integration` integration branch is now ready for review as announced on our last call. Let's discuss how to do the final review of this branch to get it merged into main soon as discussed. 14:34:38 ... earlier today Zoltan submitted a catch-all PR to integrate these changes to the main branch: 14:35:12 RafaelCintron_ has joined #webmachinelearning 14:35:12 ... PR https://github.com/webmachinelearning/webnn/pull/446 14:35:12 Present+ Rafael_Cintron 14:35:44 anssik: ... This PR is composed of 111 commits, and includes also commits that address the review feedback received to date. Much thanks Zoltan for this pushing this effort! 14:36:01 ... as discussed earlier, to recap, these changes align the entire specification with modern specification conventions and add stylistic improvements 14:36:41 ... now that we have a PR against main we can use the familiar tools for preview and diffing, here are the URLs: 14:36:46 -> PR preview https://pr-preview.s3.amazonaws.com/zolkis/webnn/pull/446.html 14:36:49 -> PR diff https://pr-preview.s3.amazonaws.com/webmachinelearning/webnn/446/8e63e07...zolkis:b262354.html 14:37:07 https://zolkis.github.io/webnn/ 14:37:19 anssik: I want us discuss and agree on how the editors would like to integrate these changes and how extensively we want to review this PR and for how long we expect it to take 14:37:33 ... I'm understood the WG wants to rebase its v2 op work on top of this and thus would like to merge this rather soon 14:37:37 ... thoughts? 14:38:29 Zoltan: a full PR is temporary so it includes everyone's commits, earlier there was a request to squash everything into a single commit, that's doable after 14:38:50 ... current commits are clean adding algorithms one by one 14:39:38 Chai: thank you Zoltan, this is massive set of work! 14:40:03 ... I helped review the integration branch during the break, it looks pretty nice, only one change to integration branch that is approved and merged now 14:40:10 ... related to note section styling 14:40:26 ... this is looking pretty good now! 14:40:44 ... if there are no more updates on this change, I'll let Ningxin look at it too 14:41:01 ... I think we can merge soon, additional changes on top of this should be rather small and we can take this to main directly 14:41:12 Ningxin: echo Chai, thanks a lot Zoltan for this work! 14:41:32 ... discussed this work with Zoltan, and agree Zoltan sent this PR to make it easier to review and preview as Anssi shared 14:42:11 ... I talked with other implementers e.g. Chrome code contributors who implemented DML and XNNPACK backend, most of the algorithms are parameter validation clarification 14:42:32 ... the implementer team plans to review these changes in the coming weeks and can come back in the next meeting perhaps 14:42:47 ... we provide our feedback in the PR as usual 14:43:04 ... we have already provided review comments and Zoltan has addressed them in his branch already 14:43:41 ... summarize, we have started review work and want to ensure algorithms are matching implementations 14:44:36 anssik: proposal is to check on our next meeting how far we are with the review and decide if we're ready to merge to main or need more time 14:45:15 Zoltan: want to merge soon to allow new work on top using the latest style 14:45:27 ... can we do a basic sanity check by the next meeting? 14:45:59 ningxin_hu: I expect it is more about sanity check, translation from declarative to algorithmic prose 14:46:22 ... we'll report back on our review progress, I believe our next meeting in 2 weeks we know better 14:47:24 Chai: the concern I had was that the size of the change and the impact on the entire document, we resolved this by working on a branch until we have an atomic change that we can apply at once 14:48:14 ... would it be better for Zoltan to merge the whole change and then have incremental updates to the content that is still pending review? 14:49:12 Zoltan: can manage either way, many algorithms haven't received any review yet, the rest would benefit from a sanity check, unless we want to merge a number of PRs soon 14:50:04 ningxin_hu: if this is a blocker for new PRs we can do a sanity check review and merge 14:52:19 anssik: are you OK to make a decision on whether we are ready to merge at our next meeting? 14:52:22 Chai: OK with that 14:52:40 RRSAgent, draft minutes 14:52:41 I have made the request to generate https://www.w3.org/2023/08/10-webmachinelearning-minutes.html anssik 14:52:59 Topic: New features 14:53:10 anssik: I triaged a few issues from June we did not have time to discuss then, let's check them out 14:53:14 Subtopic: Allow 0 size dimensions 14:53:37 https://github.com/webmachinelearning/webnn/issues/391 14:54:04 ... Dwayne spotted in Chromium review that 0 size dimensions really should just be treated as "no operation" (aka NOP, no-op, NOOP... etc.) 14:54:27 ... we did have quite extensive look into native ML APIs for this in the issue including some test cases: NumPy, TensorFlow, PyTorch, ONNX / ONNX Runtime, XNNPack, DirectML 14:54:38 ... it looks like Apple's CoreML / MPS / BNNS documentation is not clear on what the behavior is for 0 size dimensions 14:55:21 Dwayne: re implementation difficulties, CoreML not sure due to documentations lacking detail 14:56:10 anssik: anyone able to check support on Apple's platform? 14:56:34 q+ 14:56:39 ack ningxin_hu 14:58:08 ningxin_hu: question re some backends disallowing 0 size dimensions, is this a framework or WebNN API responsibility? Web frameworks could also apply a workaround for this. 14:58:09 q? 14:58:33 Dwayne: it's going to be more challenging for frameworks to adopt WebNN if they need to do the work 14:58:44 ... burden on higher-level framework vs backend 14:59:59 anssik: (per https://www.w3.org/TR/design-principles/#priority-of-constituencies the implementation should take the burden, not higher-level frameworks) 15:01:27 anssik: plan to do some more prototyping and come back? 15:01:27 Dwayne: correct 15:01:58 q? 15:03:29 RRSAgent, draft minutes 15:03:30 I have made the request to generate https://www.w3.org/2023/08/10-webmachinelearning-minutes.html anssik 17:03:08 Zakim has left #webmachinelearning