14:47:31 RRSAgent has joined #ai-model 14:47:35 logging to https://www.w3.org/2024/09/25-ai-model-irc 14:47:35 RRSAgent, do not leave 14:47:36 RRSAgent, make logs public 14:47:37 Meeting: AI Model Management 14:47:37 Chair: Michael McCool 14:47:37 Agenda: https://github.com/w3c/tpac2024-breakouts/issues/15 14:47:37 Zakim has joined #ai-model 14:47:38 Zakim, clear agenda 14:47:38 agenda cleared 14:47:39 Zakim, agenda+ Pick a scribe 14:47:41 agendum 1 added 14:47:41 Zakim, agenda+ Reminders: code of conduct, health policies, recorded session policy 14:47:41 agendum 2 added 14:47:41 Zakim, agenda+ Goal of this session 14:47:42 agendum 3 added 14:47:42 Zakim, agenda+ Discussion 14:47:42 agendum 4 added 14:47:42 Zakim, agenda+ Next steps / where discussion continues 14:47:43 agendum 5 added 14:47:43 tpac-breakout-bot has left #ai-model 19:02:17 alcooper has joined #ai-model 20:03:27 lambda has joined #ai-model 20:09:42 sebastian has joined #ai-model 20:12:32 scribe sebastian 20:12:45 Max has joined #ai-model 20:13:20 McCool has joined #ai-model 20:13:43 gkellogg has joined #ai-model 20:14:25 HXavier has joined #ai-model 20:16:01 slides: https://github.com/webmachinelearning/hybrid-ai/blob/main/presentations/WebML%20Discussion%20-%20Hybrid%20AI%20for%20the%20Web%20-%20AI%20Model%20Management%20TPAC%202024%20Breakout.pdf 20:17:05 present+ Gregg_Kellogg 20:17:06 anatoly-scherbakov has joined #ai-model 20:17:11 present+ 20:17:12 Evan has joined #ai-model 20:17:38 asully has joined #ai-model 20:17:57 20:18:16 ora has joined #ai-model 20:18:23 present+ 20:18:26 ayu has joined #ai-model 20:18:49 Problem Statement ... Issues and Constaints ... Discussions ... Next steps 20:19:01 topic: Problem statements 20:19:16 chrisp has joined #ai-model 20:19:20 McCool: AI models ... 20:19:26 ... can be very large 20:19:35 ... will often be shared 20:19:43 ... are often updated 20:19:57 kenji_baheux has joined #ai-model 20:20:30 reillyg has joined #ai-model 20:20:33 present+ 20:20:44 sql has joined #ai-model 20:20:46 McCool: There is also the current same-origin storage portioning policy 20:21:02 ... this preserves privacy issues 20:21:22 zolkis has joined #ai-model 20:21:56 dezell has joined #ai-model 20:22:04 present+ 20:22:10 present+ 20:22:33 ... this works for images (not shared) and for software libraries (often shared) 20:23:12 tomayac has joined #ai-model 20:23:33 topic: Use Cases for Large Models 20:23:56 McCool: Language translation, ... 20:24:07 ... meeting captions 20:24:20 ... background removal 20:24:33 ... video creation and editing 20:24:49 ... written language recognition 20:24:55 ... personal assistant 20:26:05 who is speaking? 20:27:28 XXX: Shall we specify a shared repository which only has my personal information? 20:28:25 @@@ 20:28:39 topic: Why run AI? 20:29:14 McCool: There are pros such as latency and being used offline. 20:29:39 ... cons are size limitations, download time, and storage costs 20:29:58 topic: Model Size vs Download Time 20:30:40 McCool: Average home networks speeds are between 45-216 Mbps 20:32:01 ... downloading Phi-3-mini will result,e.g, to 22mins for baseline 20:32:12 topic: Existing APIs and Experiments 20:33:08 McCool: For the same-origin there exists HTTP Cache, ... 20:33:13 ... Cache API 20:33:20 IndexedDB API, 20:33:44 s/IndexedDB API,/... IndexedDB API, 20:33:56 ... Origin Private File System API 20:34:26 McCool: For the cross-origin there exists the File System Access API 20:34:48 topic: Caching Desired Properties 20:35:34 McCool: we need to reduce latency, ... 20:35:41 ... Bandwidth 20:35:46 ... Storage 20:36:00 ... and preserve privacy 20:36:11 topic: Security and Privacy Considerations 20:36:14 Domenic has joined #ai-model 20:36:55 McCool: browsers implement only per-origin local caches 20:37:34 ... the cross-site privacy risk based on cache timing analysis 20:37:42 ErikAnderson has joined #ai-model 20:37:48 dsinger has joined #ai-model 20:38:01 kirkwood has joined #ai-model 20:38:24 ... the per-origin caches tolerable for “typical” (non-AI) web resources. But AI Models are large and potentially shared 20:38:36 topic: Issue Starter Pack 20:39:13 RafaelCintron has joined #ai-model 20:39:16 kadirtopal has joined #ai-model 20:39:31 -> https://github.com/w3c/tpac2024-breakouts/issues/15 20:40:32 McCool: Here are first issues for discussions: 20:40:53 ... background model download and compilation, 20:41:03 ... model naming and versioning 20:41:19 ... allowing for model substitution when useful 20:41:48 ... common interface for downloadable and “platform” models 20:41:58 ... storage deduplication 20:42:10 ... model representation independence 20:42:17 ... API independence 20:42:27 ... browser independence 20:42:42 ... offline usage, including interaction with PWAs 20:42:50 ... cache transparency 20:43:13 s/discussions/discussion 20:44:58 q? 20:45:03 q+ 20:45:10 q+ 20:46:43 ack de 20:46:44 anatoly-scherbakov has left #ai-model 20:46:51 q+ to mention similar problems 20:47:39 David: Seems a big issue. Are interested are you working on specific APIs? 20:48:04 q+ 20:48:27 ack kenji_baheux 20:49:51 ack ds 20:49:51 dsinger, you wanted to mention similar problems 20:50:40 q+ 20:50:51 q+ 20:51:15 ack reillyg 20:52:15 estade has joined #ai-model 20:52:33 andrewnolan has joined #ai-model 20:52:37 topic: Alternatives 20:53:56 McCool: One option can be that we define model-aware caches 20:54:39 ... use 'fake misses' to avoid redundant downloads 20:55:02 q+ 20:56:28 ... identify cache items by content-dependent hashes 20:56:32 q+ 20:57:58 McCool: the idea is that model caches would behave as if they were per-origin caches 20:59:07 McCool: there is another alternative, the auto-expedite common models 21:00:06 ... the more common a model is, the less of a tracking risk it is 21:01:12 q? 21:01:18 ack tomayac 21:02:06 ack ErikAnderson 21:03:56 q+ 21:04:02 ack Domenic 21:07:31 ack kenji_baheux 21:10:32 ack estade 21:11:32 q+ arthur_coleman 21:12:08 echoing Erik's point on finding if there is an actual problem; want to share what I've heard so far from partners. Enterprise / Edu customers would like to use a custom LLM that speaks their users' language / lingo across different origins (typically internal websites and/or popular 3p solutions to Enterprise/Edu needs). Other extreme, some 21:12:08 partners want a custom LLM that speaks to their community but they wouldn't share it with other sites. Likely it would be running on the server as most of their users wouldn't necessarily want the big download. 21:13:33 It's a proposal supported only by Chrome and Edge, but Related Website Sets is an example of a technology that may be applicable here _if_ we hear from customers that it's a common use case to share a company-specific model across multiple top-level eTLD+1s. 21:14:08 ack arthur_coleman 21:17:24 adjourn 21:32:43 gkellogg has joined #ai-model 21:41:15 gkellogg has joined #ai-model 21:41:49 gkellogg has left #ai-model 21:46:49 reillyg has left #ai-model 21:51:13 dsinger has joined #ai-model 21:53:20 dsinger has left #ai-model 22:32:08 alcooper has joined #ai-model 22:33:05 alcooper has left #ai-model 23:19:40 kirkwood has joined #ai-model 23:51:44 Lei_Zhao has joined #ai-model 23:57:01 kirkwood has joined #ai-model