14:45:07 RRSAgent has joined #webmachinelearning
14:45:07 logging to https://www.w3.org/2022/03/24-webmachinelearning-irc
14:45:10 RRSAgent, make logs Public
14:45:11 please title this meeting ("meeting: ..."), anssik
14:45:11 Meeting: WebML WG Teleconference – 24 March 2022
14:45:21 Chair: Anssi
14:45:21 Agenda: https://github.com/webmachinelearning/meetings/blob/main/telcons/2022-03-24-wg-agenda.md
14:45:26 Scribe: Anssi
14:45:30 scribeNick: anssik
14:45:34 scribe+ dom
14:45:41 Present+ Anssi_Kostiainen
14:45:52 RRSAgent, draft minutes
14:45:52 I have made the request to generate https://www.w3.org/2022/03/24-webmachinelearning-minutes.html anssik
15:00:56 ningxin_hu has joined #webmachinelearning
15:01:44 Present+ James_Fletcher
15:01:52 Present+ Ningxin_Hu
15:01:59 Present+ Dominique_Hazael-Massieux
15:02:49 Present+ Ping_Yu
15:03:05 Present+ Rachel_Yager
15:03:35 Present+ Dominique_Hazael-Massieux
15:04:06 Topic: Announcements
15:04:43 jonathanbingham has joined #webmachinelearning
15:05:00 Present+ Jonathan_Bingham
15:05:05 -> Ethical Principles for Web Machine Learning https://webmachinelearning.github.io/ethical-webmachinelearning/
15:05:09 -> 10-minute intro video https://www.w3.org/2022/03/ml-ethics/
15:05:29 -> https://github.com/webmachinelearning/ethical-webmachinelearning Repo for Ethical Principles for Web Machine Learning
15:05:39 james: the consultation doc has been migrated to a draft Note, now maintained on GitHub
15:05:50 ... it solidifies the principles (based on UNESCO's) and the guidance attached to them
15:05:58 ... the guidance makes the principles more concrete
15:06:21 ... risks & mitigations are the next step - they'll be even more concrete and will need input from you all
15:06:40 ... either via GitHub, or through the live sessions the week of Apr 4 which Dom is running
15:07:09 anssi: thanks, amazing work
15:07:22 ... I invite you all to watch James' intro video
15:07:56 q?
15:07:58 Present+ chai
15:08:11 Present+ Chai_Chaoweeraprasit
15:08:58 Topic: Security considerations - last call for review
15:09:25 -> PR: Update Security Considerations https://github.com/webmachinelearning/webnn/pull/251
15:09:33 anssi: I've been bugging you all to make progress on this work - security is a key component of our wide review commitment
15:09:49 ... thanks for the feedback received, which I've incorporated in PR #251
15:10:06 anssik: changes per review feedback:
15:10:14 ... - Drop the "avoid reshaping tensors" guideline
15:10:19 Rachel_ has joined #webmachinelearning
15:10:25 ... - Note shape inference is done during the graph building stage
15:10:39 ... - Note concrete device selection is left to the implementation
15:10:56 ... - Update timing attack mitigation considerations
15:11:51 anssi: should we go ahead and merge this for another round of review?
15:11:54 dom: +1
15:12:10 RafaelCintron has joined #webmachinelearning
15:12:18 +1
15:12:21 anssi: let's circle back with the Chrome review and then bring it to W3C security review
15:12:41 ... in any case, this is a first draft we'll keep iterating on, as with the rest of the document
15:12:42 q?
15:12:57 Topic: Context-based graph execution methods for different threading models
15:13:09 -> Context-based graph execution methods for different threading models https://github.com/webmachinelearning/webnn/pull/257
15:13:29 anssi: #257 is the pull request on which we agreed to converge during our previous meeting
15:14:38 chai: update from last time: there has been additional feedback and we're converging on the idea of making the default device option more explicit
15:14:40 Present+ Geunhyung_Kim
15:14:42 Geun-Hyung has joined #webmachinelearning
15:14:47 Present+ Rafael_Cintron
15:14:54 ... with the default being the CPU
15:14:59 present+
15:15:09 ... this should resolve the problem of determining when the sync method can be used
15:15:30 ... another area of progress is the interop with WebGPU
15:15:59 ... Rafael, Brian and I have spent quite a bit of time discussing this
15:16:26 ... we're converging on a somewhat more abstract layer that would be friendlier to implementations based on Vulkan, CoreML or Linux
15:16:44 ... this hasn't been brought to the PR yet
15:17:12 ... it comes down to having the context take the WebGPU queue and populate the ML workload into the WebGPU queue
15:17:21 ... which would then be submitted and executed async
15:17:32 q?
15:18:04 q+
15:18:04 Rafael: chai has been doing a great job pushing this forward
15:18:54 anssi: I notice Brian shared comments in PR #255 with a response from ningxin - not sure if that was incorporated
15:19:11 chai: yes, he's on board with our converged direction
15:19:19 q?
15:19:37 -> Device selection with MLDevicePreference and MLPowerPreference https://github.com/webmachinelearning/webnn/issues/169
15:19:46 anssi: re device selection, #169 is an existing issue on the topic
15:20:06 dom: I think Chai summarized my point re device selection
15:20:32 ... it has been an explicit choice to make device selection a hint, for privacy reasons
15:20:38 ... I'm thinking defaulting to CPU might actually be OK
15:20:53 ... if you ask for a CPU you'll get a CPU-backed context
15:21:31 -> "default" v.s. "auto" in MLDevicePreference and MLPowerPreference https://github.com/webmachinelearning/model-loader/issues/30
15:21:31 anssi: model loader is also considering reusing the device preference and power preference from our spec, with similar questions
15:21:45 q?
15:21:46 q?
15:21:52 ack ningxin_hu
15:22:11 ningxin_hu: thanks to chai, Rafael and Brian for helping move the design forward
15:22:28 ... I've added a comment based on my investigation of the GPU-only processing pipeline
15:22:45 ... the WebGPU/WebNN interop plays a very important role there to make sure data stays on the GPU for efficient processing
15:23:11 ... I've left some questions in PR #255 on resource sharing and execution order
15:23:17 ... I look forward to the updated PR
15:23:22 -> https://github.com/webmachinelearning/webnn/pull/255#issuecomment-1077444422 Ningxin's response based on prototype investigation findings (in context of the older PR #255)
15:23:38 ... my scenario is that the WebGPU compute shader is used for pre- and post-processing around the WebNN compute
15:24:15 ... my comment has pseudo-code to illustrate that usage in the WebGPU/WebNN background blur
15:25:21 anssi: let's land this in our spec before seeking formal WebGPU review
15:25:21 q?
15:25:41 q?
15:26:04 Topic: Integration with real-time video processing
15:27:10 -> Video processing with insertable streams main thread version https://huningxin.github.io/webrtc-samples/src/content/insertable-streams/video-processing/
15:27:13 anssi: ningxin_hu has developed a media capture transform-based pipeline for background blur, one version using WebGL, the other using WebGPU/WebNN, which needs a prototype implementation of WebNN in Chromium
15:27:17 -> Video processing with insertable streams worker version https://huningxin.github.io/webrtc-samples/src/content/insertable-streams/video-processing-worker/
15:27:25 -> Video processing with insertable streams main thread version https://huningxin.github.io/webrtc-samples/src/content/insertable-streams/video-processing/
15:28:24 -> Details of the processing pipelines in issue #226 https://github.com/webmachinelearning/webnn/issues/226#issuecomment-1074968279
15:29:18 ningxin: this follows up on Dom's suggestion to investigate the integration of media capture transform (a processor/generator design pattern that allows video processing via a VideoFrame object), as illustrated in webrtc-samples both on the main thread and in a worker
15:30:09 ... the demo I created is based on these two examples to construct a pipeline that uses only the GPU for ML processing, with background blur as our example
15:30:25 ... as I explain in my description of the pipeline, it requires several steps:
15:30:35 ... - get a GPU buffer for a video frame
15:30:41 ... - a shader to blur an image
15:31:01 ... - a step to segment the input image into background and other objects, using machine learning
15:31:49 ... - based on the segmentation map that annotates background/foreground, another shader blurs the background and leaves the foreground alone
15:31:55 RRSAgent, draft minutes
15:31:55 I have made the request to generate https://www.w3.org/2022/03/24-webmachinelearning-minutes.html anssik
15:32:25 ... - this produces an output texture that can be drawn on an OffscreenCanvas to produce a VideoFrame fed into the media capture transform generator
15:32:46 ... that can be used for video playback or for sending over a WebRTC connection
15:34:02 ... I implemented that pipeline with WebGL: a WebGL shader and the WebGL TF.js backend for segmentation with the TF.js DeepLabV3 model (it can be used on many objects, but this example focuses on background detection)
15:34:08 ... this serves as a baseline
15:34:51 ... I initially wanted to make a WebGPU-only pipeline, but the WebGPU backend of TF.js has issues for segmentation (which I reported to the TF.js team)
15:35:11 ... once that's resolved, I'll be able to update the sample with the WebGPU-only version
15:35:44 ... the final pipeline I published uses WebGPU shaders with WebNN segmentation, which illustrates the interop with WebGPU
15:36:19 ... WebNN uses the output of the WebGPU shader as input to the compute
15:36:45 ... the output of the WebNN compute is fed into another WebGPU shader for post-processing (blending the input image with the output segmentation map)
15:37:36 ... in this prototype, I still use the existing MLGraph.compute that takes GPUBuffer as input/output (I found some related issues which I documented in PR #255)
15:37:55 q?
15:37:56 ... as I alluded to before
15:39:35 ... I also ran into issues: the WebGPU backend of TF.js limitation; a WebGPU implementation bug in Chromium; and on entry-level GPUs, the WebGL pipeline can freeze the browser UI
15:39:58 Present+ Daniel_LaLiberte
15:40:21 ... Dom commented on the CPU usage, and asked whether main thread vs. worker will help reduce the CPU usage
15:40:31 ... it shouldn't, based on my observation
15:41:02 ... the worker approach will mostly help with running sync versions of the frameworks
15:41:27 q+
15:41:31 q+
15:41:54 ack RafaelCintron
15:42:13 RafaelCintron: congratulations ningxin_hu - this is really informative
15:42:40 ... being able to seamlessly use the GPU buffer from WebGPU in WebNN is a great signal
15:43:15 ack dom
15:43:20 anssi: +1 - getting interop right is harder but critical
15:43:38 dom: thanks for this amazing work, really impressive
15:44:50 ... I wonder if you have a sense whether this estimated CPU usage (~20% with WebGPU and ~40% with WebGL) is what we should expect or whether there are optimizations to be done
15:44:53 q+ to ask pixel formats/color space
15:45:05 ningxin_hu: re CPU usage, it probably depends on the device driver & implementations
15:45:28 ... I didn't observe a big difference in CPU usage between the two pipelines on my own setup
15:45:55 ... in terms of CPU usage, looking at WebGPU samples, they trigger similar CPU usage
15:46:11 ... e.g. when running the image blur sample
15:46:36 ... so this is probably a question to discuss with WebGPU folks
15:47:05 ack dom
15:47:05 dom, you wanted to ask pixel formats/color space
15:47:53 ningxin: I observed significant performance benefits using WebNN
15:47:56 dom: did you get a sense whether WebNN contributes to the better perf of the WebNN/WebGPU pipeline?
15:48:26 ningxin_hu: yes
15:48:27 ningxin_hu: the WebGL pipeline runs into freezes of the browser UI, in which case the FPS goes down to 0 or 1
15:48:41 ... with an entry-level GPU
15:48:56 q?
15:49:09 ... WebGPU/WebNN runs at over 10 FPS in that situation
15:49:16 q+ to ask pixel formats/color space
15:49:44 q+
15:49:52 q- later
15:49:56 ack RafaelCintron
15:50:24 RafaelCintron: if you run your analysis on Windows, we can trace the source of CPU usage
15:50:36 ningxin_hu: that would be great - it is on Windows; I'll follow up with you
15:50:39 ack dom
15:50:39 dom, you wanted to ask pixel formats/color space
15:51:45 dom: in issue #226 we discussed that VideoFrame does not let you pick the pixel format/color space you get from the camera - does that impact the shaders?
15:51:57 ningxin_hu: I noted that discussion in the thread
15:52:18 ... in my implementation, I use the WebGPU copy to external texture which turns it into RGB format
15:52:38 ... at this stage, this hasn't been a problem
15:53:11 q?
15:53:16 ... @@@ importExternalTexture @@@ will be a good next step to explore the performance implications of this
15:54:09 anssi: this started from a request from the WebRTC WG - is it time to report back to that WG?
15:54:55 dom: a good question, I'll bring this up with the WebRTC WG chairs on our regular call next week, we should present the outcome of this prototyping on the WebRTC WG's mid-April call
15:55:22 ... we're pushing the limits of many bleeding-edge web features in development with this prototyping
15:55:26 s/we should present/I'll suggest we present/
15:55:37 q?
15:56:09 q?
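[Editor's note: the post-processing step ningxin_hu describes above - blending the input image with the blurred image based on the segmentation map - can be sketched on the CPU for clarity. In the actual demo this runs as a WebGPU shader; the function name and flat-RGBA array layout below are illustrative assumptions, not code from the sample.]

```javascript
// Illustrative CPU sketch of the blend step in the background blur pipeline.
// The real demo does this in a WebGPU shader; this is not code from the sample.
// original, blurred: flat RGBA pixel data of equal length.
// mask: one foreground probability in [0, 1] per pixel, from the segmentation model.
function blendWithSegmentation(original, blurred, mask) {
  const out = new Uint8ClampedArray(original.length);
  for (let p = 0; p < mask.length; p++) {
    const fg = mask[p]; // 1 = foreground (keep sharp), 0 = background (use blur)
    for (let c = 0; c < 4; c++) {
      const i = p * 4 + c;
      out[i] = Math.round(fg * original[i] + (1 - fg) * blurred[i]);
    }
  }
  return out;
}
```

[A per-pixel linear interpolation like this is why the segmentation output and the shader output must live in the same GPU memory in the real pipeline: copying either back to the CPU per frame would dominate the frame budget.]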
15:56:23 Topic: Candidate Recommendation proposed new features
15:56:36 Subtopic: WebNN should support int8 quantized models
15:56:44 -> WebNN should support int8 quantized models https://github.com/webmachinelearning/webnn/issues/128
15:57:37 anssik: looking for input on int8 support as part of our first release of WebNN
15:58:17 chai: +1 - quantized int8 model support for CR is important due to new NPUs coming to the market; this would otherwise be a serious shortcoming
15:58:31 ... it also impacts device selection re NPUs
15:58:40 ... they should be considered together
15:59:12 q?
15:59:16 anssik: not hearing pushback, I'll mark it for CR
15:59:19 Subtopic: WebNN / WebGPU interop
15:59:32 -> WebGPU issue: WebNN / WebGPU interop https://github.com/gpuweb/gpuweb/issues/2500
15:59:57 anssik: what are the remaining investigation items once #257 lands?
16:01:19 ningxin_hu: my plan is to review Chai's updated PR #257 and prototype it in Chromium, then update the samples accordingly; that would be a good milestone for discussion with the WebGPU WG
16:02:14 chai: one of the WebGPU topics is that we're wondering how the Vulkan/Linux community and Apple CoreML will implement it; when we push for WebGPU review, I assume we'll get explicit review from those communities
16:02:20 q+
16:02:43 ack RafaelCintron
16:03:15 RafaelCintron: in the WebGPU CG, Apple is present and attends all meetings
16:03:38 ... no Vulkan reps in the CG, but people who are familiar with it are participants
16:04:39 q?
16:05:16 RRSAgent, draft minutes
16:05:16 I have made the request to generate https://www.w3.org/2022/03/24-webmachinelearning-minutes.html anssik
17:02:04 anssik has joined #webmachinelearning
18:06:42 Zakim has left #webmachinelearning
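[Editor's note: for context on the int8 discussion in issue #128 - quantized models store weights and activations as 8-bit integers plus a scale and zero point, reconstructing real values as (q - zeroPoint) * scale. The sketch below shows only that affine arithmetic; it is not WebNN API code, and the operator surface for quantization was still under discussion at the time of this meeting.]

```javascript
// Minimal sketch of the affine int8 quantization scheme that quantized models
// rely on. Not WebNN API code - just the arithmetic, for context on issue #128.
function quantizeInt8(x, scale, zeroPoint) {
  const q = Math.round(x / scale) + zeroPoint;
  return Math.max(-128, Math.min(127, q)); // saturate to the signed 8-bit range
}

function dequantizeInt8(q, scale, zeroPoint) {
  return (q - zeroPoint) * scale;
}
```

[Values outside roughly scale * [-128 - zeroPoint, 127 - zeroPoint] saturate at the int8 limits; that range/precision trade-off is what lets NPUs run these models efficiently, and why first-class int8 support was flagged as important for CR.]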