14:57:52 RRSAgent has joined #webmachinelearning
14:57:52 logging to https://www.w3.org/2022/03/10-webmachinelearning-irc
14:57:55 RRSAgent, make logs Public
14:57:55 please title this meeting ("meeting: ..."), anssik
14:58:01 Meeting: WebML WG Teleconference – 10 March 2022
14:58:06 Chair: Anssi
14:58:11 Agenda: https://github.com/webmachinelearning/meetings/blob/main/telcons/2022-03-10-wg-agenda.md
14:58:17 Scribe: Anssi
14:58:24 scribeNick: anssik
14:58:32 scribe+ dom
14:58:33 Present+ Anssi_Kostiainen
14:58:38 RRSAgent, draft minutes
14:58:38 I have made the request to generate https://www.w3.org/2022/03/10-webmachinelearning-minutes.html anssik
14:58:50 Rama has joined #webmachinelearning
14:59:59 Present+ Ganesan_Ramalingam
15:00:17 Present+ Wan_Xiaojian
15:01:13 Present+ Jonathan_Bingham
15:01:31 Present+ Rafael_Cintron
15:01:36 ningxin_hu has joined #webmachinelearning
15:01:38 Present+ Ningxin_Hu
15:02:11 Present+ Dominique_Hazael-Massieux
15:02:21 Present+ Chai_Chaoweeraprasit
15:03:28 Jonathan has joined #webmachinelearning
15:03:40 RafaelCintron has joined #webmachinelearning
15:03:50 chai has joined #webmachinelearning
15:03:54 Topic: Security considerations - last call for review
15:04:36 -> issue: General Security Questions: https://github.com/webmachinelearning/webnn/issues/241
15:04:41 -> PR: Update Security Considerations per review feedback: https://github.com/webmachinelearning/webnn/pull/251
15:04:46 -> All security-tracker issues: https://github.com/webmachinelearning/webnn/issues?q=label%3Asecurity-tracker+
15:05:44 -> Op metadata that helps avoid implementation mistakes (issue #243): https://github.com/webmachinelearning/webnn/issues/243
15:06:37 Anssi: PR #251 addresses most questions of #241, but doesn't address #243
15:06:45 ... propose we leave that for later
15:07:30 dom: happy to review PR #251
15:07:41 dom: +1 to leave #243 for later
15:08:58 q?
15:09:54 Topic: Ethical considerations update
15:11:19 RRSAgent, draft minutes
15:11:19 I have made the request to generate https://www.w3.org/2022/03/10-webmachinelearning-minutes.html anssik
15:11:42 James: we conducted an internal / external literature review, which led to the writing of a draft consultation document
15:11:51 ... not complete, but enough for people to engage with and react to
15:12:10 ... please take a look at the document and bring comments
15:12:24 ... it contains material that may or may not end up in the final WG note
15:12:40 ... including the thinking process, background on ethics and ML
15:12:52 ... still incomplete and work in progress
15:13:20 ... A summary of the process: I looked at existing principles rather than developing our own
15:13:41 Present+ James_Fletcher
15:13:43 ... we want these principles to be universal given the reach of the Web
15:13:58 ... align with W3C values & principles
15:14:14 ... which led to recommending the UNESCO values & principles - with more justification in the doc
15:14:19 RRSAgent, draft minutes
15:14:19 I have made the request to generate https://www.w3.org/2022/03/10-webmachinelearning-minutes.html anssik
15:14:53 ... UNESCO has a set of 4 values and 10 principles, developed through very wide review and approval, globally
15:15:27 ... confirmed their fitness through meta-analysis about completeness and focus
15:15:53 ... We're looking for feedback on the process, and where it has led in terms of proposed principles
15:16:09 ... this is leading to the next phase where we want to hear from experts and stakeholders
15:16:36 ... The principles are very high level - one challenge is how to turn them into practices, which the document wants to tackle
15:16:52 ... We're running group sessions early April to kickstart that process
15:17:18 ... between principles and risks/mitigations, there may be guidance that elaborates on principles with more context, with more details and more specific to the W3C context
15:17:58 ... e.g. mapping to the W3C TAG ethical principles, which include "autonomy" or "decentralization" principles that don't emerge in UNESCO
15:18:53 ... We'll look to synthesize this into a single guidance per principle, shorter than what we've extracted and specific to the W3C context
15:19:01 ... that guidance would then lead to risks & mitigations
15:19:24 ... The document also presents case studies that illustrate typical issues in ML ethics
15:19:45 ... Re risks & mitigations, the doc will contain high-level considerations, not yet at the level of individual specs
15:20:04 ... the document has an example of a possible risk & mitigation to illustrate this
15:20:51 ... We'll update this version of the document by March 21st, including guidance, feedback received incl. on issues & case studies, and high-level risks & mitigations
15:21:33 ... The week of April 4 we'll run group review & brainstorm sessions to feed risks & mitigations
15:21:45 q?
15:21:48 ... and we're targeting April 21st as the time to approve this as a WG Note
15:21:58 -> Draft consultation document https://docs.google.com/document/d/1n55liw3cAcrIdMlvRPEAdV1ANWT9QzgOZ6R0pUaSVY4/
15:22:23 dom: thanks for turning our plans into this document
15:22:24 Dom: THANKS!
15:23:14 bbcjames has joined #webmachinelearning
15:23:27 q?
15:23:40 anssi: we'll want participants from this group to be involved in the live sessions; got lots of interest from other W3C groups, e.g. "horizontal" groups
15:23:43 Topic: Graph execution methods used in different threading models: immediate, async, queued
15:24:00 -> issue #230: Should WebNN support async APIs? https://github.com/webmachinelearning/webnn/issues/230
15:24:02 anssi: we started discussing this in issue #230
15:24:14 ... Chai produced 2 alternative designs up for review
15:24:14 -> PR #255: Define graph execution methods used in different threading models https://github.com/webmachinelearning/webnn/pull/255
15:24:21 -> PR #257: Context-based graph execution methods for different threading models https://github.com/webmachinelearning/webnn/pull/257
15:24:56 Anssi: this is a substantial change to the API - I want us to get it right
15:25:00 q?
15:25:40 Chai: the two pull requests are both trying to do the same thing; I recommend starting with #255 - #257 builds on top of it
15:26:00 ... the core change in #255 is trying to add several execution methods that the user can use to execute the compiled graph
15:26:24 ... and based on the requirements we collected over the past few months, there are 3 ways people want to use WebNN
15:26:41 ... the summary in #255 describes what we want to do: we want to enable:
15:27:03 ... 1- immediate execution from the calling thread, where you wait until the result is available in the output buffer - a blocking call
15:27:15 ... this is the simplest way to execute a graph
15:27:43 ... this is needed in scenarios where it runs on the CPU
15:28:32 ... 2- the second method allows async execution with a promise, so the UI thread is not blocked
15:29:00 ... 3- the 3rd method is specific to WebGPU; with WebGPU you can buffer commands before they get executed in order
15:29:36 ... sync or async doesn't help here, you wouldn't get a deterministic execution
15:30:11 ... the key difference is that it doesn't run the graph, it records the commands in the command buffer, and leaves it to the caller to execute the buffered commands
15:30:50 ... in terms of API shape, in #255, I tried to not change too much of the existing API - 90% of the API is GraphBuilder
15:31:16 q+ to ask if this interop method suggests any changes to the WebGPU API, whether we should seek explicit WebGPU WG review
15:31:25 ... because the execution methods have a strong dependency on the kind of context they're creating, I tried to separate the various modes of execution into execution interfaces
15:31:39 ... MLExecution, MLAwaitedExecution, and MLCommandEncoder
15:31:50 ... the latter name is directly inspired by the WebGPU spec
15:32:31 ... on #255, Dom & Ningxin pointed out that having the MLContext be the thing that calls the execution makes more sense
15:32:51 ... a separate Execution interface makes it harder to see the dependency
15:32:57 ... #257 addresses that
15:33:11 q+
15:33:34 ... it builds on #255 - it no longer has a separate Execution interface, but instead these become methods on MLContext
15:33:55 ... with a compute method and a computeAsync method
15:34:15 ... the runtime dependency is still there - if you try to use compute to execute on the GPU context, it's not allowed
15:35:22 ... The caller that tries to execute the graph should know a lot about the context - we're not supporting a mode where the context is created independently from the execution
15:36:30 ... when it comes to WebGPU, I chose to use a createCommandEncoder that creates an MLCommandEncoder with an interface consistent with the WebGPU command queue
15:36:42 q?
15:37:09 ack anssik
15:37:09 anssik, you wanted to ask if this interop method suggests any changes to the WebGPU API, whether we should seek explicit WebGPU WG review
15:37:35 anssik: re MLCommandEncoder command buffer, does it require any change to WebGPU?
15:37:50 ... should we seek explicit WebGPU WG review on the proposal?
15:38:00 q+
15:38:01 chai: it doesn't require any change to WebGPU
15:38:04 q- later
15:38:46 q?
15:38:57 i|James: we:|Slideset: https://lists.w3.org/Archives/Public/www-archive/2022Mar/att-0001/0310_W3C_Ethical_Web_ML_Update.pdf
15:39:24 q?
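A minimal JavaScript sketch of the first two execution paths described above, assuming the method names proposed in PR #257 (compute, computeAsync) and an already-built MLGraph with pre-allocated input/output buffers; the exact signatures, option names, and context creation are still under review, so this is an illustration only. The third, WebGPU-specific path is sketched further below, after the command-encoder discussion.

  // Assumed shape: a CPU-preferring context; per the discussion, the
  // blocking compute() path is meant for CPU execution, e.g. in a worker.
  const context = navigator.ml.createContext({devicePreference: 'cpu'});

  // ... build `graph` with an MLGraphBuilder over this context, and
  // allocate `inputBuffer` / `outputBuffer` typed arrays ...

  // 1. Immediate execution: blocks the calling thread until the result
  //    has been written into outputBuffer.
  context.compute(graph, {'input': inputBuffer}, {'output': outputBuffer});

  // 2. Async execution: returns a promise, so the UI thread is not blocked;
  //    the promise resolves once outputBuffer has been written.
  await context.computeAsync(graph, {'input': inputBuffer}, {'output': outputBuffer});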
15:39:27 ack RafaelCintron
15:39:59 RafaelCintron: re command encoder - initialize and dispatch take a Graph; initialize should be called only once
15:40:11 ... could initialize be a constructor for an MLCommandEncoder?
15:40:32 ... what happens if someone calls dispatch with a different graph than the one used to initialize?
15:40:49 chai: this is similar to what ningxin asked on #255
15:41:09 ... initializeGraph records the commands we need to initialize the graph; it's not initializing the encoder
15:41:19 ... if you put it in the constructor, it would be misleading
15:41:35 ... in many systems that we know, before you want to process the model, you want to pre-process the weights
15:41:42 ... e.g. on GPUs and on some NPUs
15:42:05 ... at the driver level, when you have the weights, they want the opportunity to process and cache them in their driver in their layout format
15:42:24 ... passing the weights in initializeGraph, the command encoder will record a copy into the GPU buffer
15:43:12 ... that will send this down to the GPU driver with a flag that some systems would use to indicate the opportunity to initialize them at least once
15:43:38 q?
15:43:43 ... the actual commands get dispatched when the inference happens
15:43:51 RafaelCintron: what happens if you initialize multiple times?
15:43:59 Chai: wouldn't be efficient, but wouldn't fail
15:44:08 RafaelCintron: what about multiple dispatches?
15:44:24 Chai: the encoder is reusable, it doesn't carry state
15:45:13 ... compared to compute, it's a lower-level API, matching the WebGPU approach
15:45:20 ... compute could be implemented on top of it
15:45:40 RafaelCintron: what if you initialize A with some input, and then dispatch it with other input?
15:45:53 Chai: they're different inputs - only constant weights for initialize
15:46:34 ... this preprocessing step matches the approach taken by several low-level model APIs (incl. DirectML)
15:46:58 anssi: does the spec talk about these 2 inputs being different?
15:47:22 chai: feedback welcomed in the PR, which could use more explanation in places
15:47:38 anssi: maybe name them differently to help improve the ergonomics
15:47:59 chai: would also like a section with a sample showing WebGPU usage
15:48:09 q+
15:48:10 ... but probably done in a separate PR
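Pending that sample, a hedged JavaScript sketch of the WebGPU-specific path as described in the discussion above, assuming the MLCommandEncoder shape from PR #257 (createCommandEncoder, initializeGraph, dispatch, finish); how constant weights are supplied to initializeGraph and how the recorded commands are handed back to the WebGPU queue are still open points in the PR, so those details are assumptions for illustration only.

  // Assumed shape: an MLContext created from an existing WebGPU GPUDevice,
  // and an MLGraph `graph` built against that context.
  const encoder = context.createCommandEncoder();

  // Record one-time graph initialization; per the discussion, only the
  // constant weights are involved here, giving the driver a chance to
  // preprocess and cache them in its preferred layout.
  encoder.initializeGraph(graph);

  // Record an inference; nothing executes yet - the commands are only
  // buffered, and the inputs/outputs are GPU buffers owned by the caller.
  encoder.dispatch(graph, {'input': inputGPUBuffer}, {'output': outputGPUBuffer});

  // Execution order is left to the caller: the recorded work is submitted
  // via the application's own WebGPU queue (exact mechanism TBD in the PR).
  gpuDevice.queue.submit([encoder.finish()]);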
15:48:26 q?
15:48:30 ack dom
15:48:41 dom: thanks for this piece of work!
15:49:00 ... I prefer #257 over #255, the API shape is explained better in that one
15:49:16 ... not sure still about the CPU-only compute() method, but will comment on the PR
15:49:35 ... Anssi raised a question about the WebGPU intersection, we will need the WebGPU WG to chime in
15:50:06 ... we need WebGPU WG review for the intersection, there was a GH thread that pointed out some gaps, this PR might start to address those
15:50:07 https://github.com/gpuweb/gpuweb/issues/2500
15:50:36 anssi: not requiring changes to WebGPU is definitely a big +
15:51:26 dom: we have Rafael as a bridge between the WebML and WebGPU WGs
15:51:44 ... if someone can give us a reliable review on this PR from WebGPU, let's check with them
15:51:51 q?
15:51:58 ack ningxin_hu
15:52:03 Anssi: would Bryan be able to give a WebGPU-angled review of this PR?
15:52:24 Ningxin: +1 to ask Bryan, in addition to Rafael's review
15:52:37 ... Also thanks again to Chai - very significant contribution
15:53:08 ... the PR brings both sync/async, and integration with WebGPU
15:53:32 ... If we were to interact with the WebGPU people, we would want to highlight the latter - MLCommandEncoder
15:54:24 ... the discussion on constant weights reminds me of an open comment I made on the first PR
15:54:42 ... we have 2 surfaces to upload constants / weights for a context built on a WebGPU device
15:55:18 ... the MLContext via GPUBuffer; with MLCommandEncoder, this gives another path to provide the weights
15:55:42 q?
15:55:53 ... Do we need to remove the constant method for the GPUBuffer binding, and move that to the initializeGraph method?
15:56:48 ... if we do so, the graph building code gives 2 different paths for graph building based on different contexts, builder vs initializeGraph
15:57:04 q?
15:57:05 ... this isn't ideal; can we find a way to combine them?
15:57:20 chai: I understand that feedback; let's iterate on the PR
15:57:39 ... re integration with WebGPU, a lot of these ideas came from Bryan
15:58:24 q+
15:59:12 chai: my original idea was to have the ML path in WebGPU - but it creates a hard dependency on the WebGPU spec & implementation
15:59:40 q-
15:59:53 q?
16:00:02 anssi: I'm hearing #257 as the PR to continue with
16:00:24 ... thanks for the good progress
16:00:45 ... summarizing: reviews expected on #257, including looping in people from WebGPU (Bryan, RafaelCintron)
16:01:01 ... and maybe later seek review from the broader WebGPU WG (possibly after the PR has landed)
16:01:01 q?
16:01:15 RRSAgent, draft minutes
16:01:15 I have made the request to generate https://www.w3.org/2022/03/10-webmachinelearning-minutes.html dom
16:11:02 RRSAgent, draft minutes
16:11:02 I have made the request to generate https://www.w3.org/2022/03/10-webmachinelearning-minutes.html anssik
16:11:44 i|James: we conducted an internal / external literature review, which led to the writing of a draft consultation document|->UPDATE Ethical Web Machine Learning https://lists.w3.org/Archives/Public/www-archive/2022Mar/att-0002/0310_W3C_Ethical_Web_ML_Update.pdf
16:11:46 RRSAgent, draft minutes
16:11:46 I have made the request to generate https://www.w3.org/2022/03/10-webmachinelearning-minutes.html anssik
18:01:02 Zakim has left #webmachinelearning