14:00:11 RRSAgent has joined #webmachinelearning
14:00:15 logging to https://www.w3.org/2023/05/11-webmachinelearning-irc
14:00:15 RRSAgent, make logs Public
14:00:16 please title this meeting ("meeting: ..."), anssik
14:00:17 Meeting: WebML WG Teleconference – 11 May 2023
14:00:21 Chair: Anssi
14:00:26 Agenda: https://github.com/webmachinelearning/meetings/blob/main/telcons/2023-05-11-wg-agenda.md
14:00:33 Scribe: Anssi
14:00:37 scribeNick: anssik
14:00:42 ningxin_hu has joined #webmachinelearning
14:00:53 ghurlbot, this is webmachinelearning/webnn
14:00:53 anssik, OK.
14:00:58 Present+ Anssi_Kostiainen
14:01:05 Present+ Zoltan_Kis
14:01:13 Present+ Ningxin_Hu
14:01:23 Present+ Rafael_Cintron
14:01:40 RafaelCintron has joined #webmachinelearning
14:01:43 Present+ Wanming_Lin
14:02:38 RRSAgent, draft minutes
14:02:39 I have made the request to generate https://www.w3.org/2023/05/11-webmachinelearning-minutes.html anssik
14:03:00 Topic: WebNN - WebIDL and Infra standard conventions
14:03:30 wm has joined #webmachinelearning
14:04:03 Subtopic: The constant() method steps
14:04:08 anssik: meta issue #210 and PR #365
14:04:15 https://github.com/webmachinelearning/webnn/issues/210 -> Issue 210 Use modern WebIDL and Infra standard conventions (anssiko) enhancement, Editorial
14:04:15 https://github.com/webmachinelearning/webnn/issues/365 -> Pull Request 365 Add the constant() method steps. (zolkis)
14:04:21 ... this PR now has the required approvals, ready to merge
14:04:31 ... thanks Zoltan for iterating on this PR carefully, and Chai and Ningxin for review and comments!
14:05:18 Zoltan: it looks mergeable; this is the first PR, a dependency for the rest
14:06:32 ningxin_hu: thanks Zoltan, do we want to squash the commits?
14:06:54 anssik: editors to decide whether to squash the commits on merge
14:07:10 Subtopic: Sync and async algorithms
14:08:01 anssik: issue #316 and PR #329
14:08:01 https://github.com/webmachinelearning/webnn/issues/329 -> Pull Request 329 Rework the sync async algorithms based on #323 (zolkis)
14:08:02 https://github.com/webmachinelearning/webnn/issues/316 -> Issue 316 Review sync vs async compute differences (zolkis) Editorial
14:08:08 ... I proposed in the PR to remove the "During asynchronous execution ..." note
14:08:14 ... because RFC 2119 terminology is not to be used in an informative note
14:08:21 ... Zoltan updated the PR accordingly
14:08:33 ... it looks like that was the remaining open conversation on this PR?
14:08:55 Zoltan: yes, that was the only open one
14:09:21 anssik: ready to merge after Chai's final review
14:09:32 Subtopic: The builder.input() method steps
14:09:36 anssik: meta issue #210 and PR #364
14:09:36 https://github.com/webmachinelearning/webnn/issues/364 -> Pull Request 364 Add the builder.input() method steps (zolkis)
14:09:47 ... it looks like this PR has the required approvals, ready to merge
14:09:53 ... thanks again Zoltan for this PR, and Chai and Ningxin for your review
14:10:01 Zoltan: this needs a rebase before merge
14:10:19 ... I will do the rebase
14:10:42 ... editors are free to merge after that
14:11:00 Subtopic: Review the rest of the standard conventions PRs
14:11:07 anssik: my summary of the status of the remaining standard conventions PRs
14:11:13 ... #337 MLOperand and MLActivation internal slots - 1 change requested by Chai
14:11:14 https://github.com/webmachinelearning/webnn/issues/337 -> Pull Request 337 Add internal slots to MLOperand and MLActivation (zolkis)
14:11:18 ... #348 clamp() algorithm - 1 change requested by Chai & addressed by Zoltan
14:11:18 https://github.com/webmachinelearning/webnn/issues/348 -> Pull Request 348 Add the clamp() algorithm (zolkis)
14:11:22 ...
#364 builder.input() method steps - 2 approvals, ready to merge
14:11:26 ... #366 concat() algorithm - 1 change requested by Chai & addressed by Zoltan
14:11:26 https://github.com/webmachinelearning/webnn/issues/366 -> Pull Request 366 Add the concat algorithm (zolkis)
14:11:29 ... #339 batchnorm() algorithm - re-review requested from Chai and Ningxin
14:11:30 https://github.com/webmachinelearning/webnn/issues/339 -> Pull Request 339 Fix #334: Improve the batch norm algorithm (zolkis)
14:12:21 Zoltan: #337 is being taken apart by other PRs because it is the dependency PR, so I'll leave it for last
14:12:54 ... the rest can be merged after review
14:13:39 Zoltan: after merging these open PRs we do the stylistic PR; these introduce building blocks for the stylistic PR
14:14:44 ... as soon as reviews are cleared I'll make the stylistic change PR and we can merge Chai's outstanding PR #322
14:14:45 https://github.com/webmachinelearning/webnn/issues/322 -> Pull Request 322 Simplify MLContext creation (wchao1115)
14:15:06 Topic: WebNN - enhancements, editorials, questions
14:15:28 Subtopic: float16 support
14:15:40 anssik: I wanted to discuss prototype findings to inform v2 feature work
14:15:52 ... I invited Wanming to join us for this call; working with Zesong, he added float16 support to ONNX Runtime Web
14:15:56 -> PR for ONNX Runtime float16 support https://github.com/Honry/onnxruntime/pull/1
14:16:01 anssik: we discussed earlier that ECMA TC39 has a Float16Array proposal, see issue #373
14:16:01 https://github.com/webmachinelearning/webnn/issues/373 -> Issue 373 heads up re: proposal to add Float16Array to JavaScript (bakkot) v2
14:16:05 -> https://tc39.es/proposal-float16array/ Float16Array proposal (ECMA Stage 2 ~proposal)
14:16:28 ...
we don't yet have Float16Array in the official JS standard, so one workaround is to pass the raw bits via Uint16Array, a typed array that is an ECMA standard
14:16:32 -> https://tc39.es/ecma262/multipage/indexed-collections.html#table-49 Uint16Array (ECMA Stage 4 ~ formal standard)
14:16:47 ... Wanming, thanks for your work on this, I see the PR has been merged
14:17:06 ... did you get a chance yet to try float16 support with some of the models that benefit from float16? Any other insights from your implementation work?
14:18:14 Wanming: this implements translation of float16 to Uint16Array
14:19:37 ... performance is 8% better than the float32 model
14:20:49 ningxin_hu: thanks Wanming, this PR is not merged in the official ONNX Runtime repo yet; it is in Wanming's fork as a staging repo for the experiment
14:21:39 ... there's an open issue in ONNX Runtime about float16 support, various options discussed, and this workaround is one of them
14:21:59 ... Wanming demonstrated this workaround works
14:22:04 ... even if in a downstream fork
14:22:57 ... for the WebNN spec we have a mapping table where we note this is not yet natively supported but there's a workaround; this work demonstrates it is doable
14:23:16 ... we can share this prototype with the audience
14:23:36 ... the performance improvement is good, and the memory footprint is reduced by ~half
14:23:49 ... Btw. I submitted a small PR #386 to add a provisional reference to the upcoming TC39 Float16Array type into the MLOperandType and ArrayBufferView compatibility table
14:23:50 https://github.com/webmachinelearning/webnn/issues/386 -> Pull Request 386 Add float16 to MLOperandType and ArrayBufferView compatibility table (anssiko)
14:24:54 anssik: thanks Wanming for this work!
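[ Scribe note: the Uint16Array workaround discussed above can be sketched in a few lines of JavaScript: convert each float32 value to its IEEE 754 half-precision bit pattern and store the raw bits in a Uint16Array. This is an illustrative sketch, not the ONNX Runtime Web code; the function names are ours, mantissas are truncated, and denormals are flushed to zero for brevity. ]

```javascript
// Convert a float32 value to its IEEE 754 half-precision (float16)
// bit pattern, stored as a uint16. Truncating, denormals flushed to zero.
function float32ToFloat16Bits(value) {
  const f32 = new Float32Array(1);
  const u32 = new Uint32Array(f32.buffer);
  f32[0] = value;
  const bits = u32[0];
  const sign = (bits >>> 16) & 0x8000;
  let exp = (bits >>> 23) & 0xff;
  const mant = bits & 0x7fffff;
  if (exp === 0xff) return sign | 0x7c00 | (mant ? 0x200 : 0); // Inf / NaN
  exp = exp - 127 + 15;                      // re-bias exponent: 127 -> 15
  if (exp >= 0x1f) return sign | 0x7c00;     // overflow -> +/-Infinity
  if (exp <= 0) return sign;                 // underflow -> +/-0
  return sign | (exp << 10) | (mant >>> 13); // truncate 23-bit mantissa to 10
}

// Pack a Float32Array into a Uint16Array of float16 bit patterns,
// suitable for passing where Float16Array is not yet available.
function packAsFloat16(f32array) {
  const out = new Uint16Array(f32array.length);
  for (let i = 0; i < f32array.length; i++) {
    out[i] = float32ToFloat16Bits(f32array[i]);
  }
  return out;
}
```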
14:26:04 ningxin_hu: related announcement on ONNX Runtime Web support: the WebNN Execution Provider was merged in the official repo this week; float16 is a downstream experiment
14:26:31 https://github.com/microsoft/onnxruntime/pull/15698
14:27:02 Subtopic: Subclass MLGraph based on the context that creates it
14:27:07 anssik: issue #344
14:27:07 https://github.com/webmachinelearning/webnn/issues/344 -> Issue 344 Subclass MLGraph based on the context that creates it (huningxin) question
14:27:16 ... we deferred this from our previous call; I was asking whether this is mainly an API ergonomics improvement?
14:27:29 ... does the WG feel like pursuing this proposal further?
14:28:04 ningxin_hu: no update at this time
14:28:18 Subtopic: Error handling of MLNamedArrayBufferViews transfer algorithm
14:28:22 anssik: issue #351
14:28:22 https://github.com/webmachinelearning/webnn/issues/351 -> Issue 351 Need to define error handling of MLNamedArrayBufferViews transfer algorithm (huningxin)
14:28:26 ... Ningxin has a very good description of this in the issue comment, not repeating it here
14:28:44 ... the proposal is for the WebNN spec to define how to handle this exception and what the impact is on the MLNamedArrayBufferViews
14:28:51 ... this issue was identified as part of implementation review by Jiawei
14:29:31 ningxin_hu: we addressed the previous related issue and introduced this one
14:29:51 ... main and worker threads access the ArrayBufferViews for input and output, transferring them in a loop
14:30:01 ... it is possible that an error happens when transferring a buffer in the middle of the loop
14:30:34 ... leaving earlier buffers in a detached state; this is what the Chromium impl does today. Jiawei feels this is not ideal so we brought this to the WG for input
14:30:44 ... what is the ideal way to handle this error?
14:31:00 ... should the implementation do something with the already detached ArrayBufferViews?
14:31:44 anssik: is this a regression or a clarification?
14:32:07 ningxin_hu: before the previous PR there was no transfer defined, so no such issue existed before
14:32:23 q+
14:32:29 https://github.com/webmachinelearning/webnn/issues/318
14:32:34 ... this was introduced in PR #318
14:32:35 https://github.com/webmachinelearning/webnn/issues/318 -> Issue 318 [closed] The input and output resources race condition issue of asynchronous execution (huningxin)
14:33:14 anssik: this was a TODO in the implementation?
14:33:48 ningxin_hu: I will need to check that
14:34:47 TODO in the implementation: https://source.chromium.org/chromium/chromium/src/+/main:third_party/blink/renderer/modules/ml/webnn/ml_graph_xnnpack.cc;l=440
14:35:24 q?
14:35:26 ack RafaelCintron
14:36:04 RafaelCintron: is the problem that when you detach an ArrayBuffer on the foreground thread it says sure, then in the worker when we open it we see problems and we don't know how to surface the problem to the main thread?
14:36:21 ningxin_hu: not exactly, there's a loop that iterates one ArrayBuffer at a time
14:36:52 ... the issue happens when we iterate in the middle of the loop and find an ArrayBuffer that is not detachable, e.g. Wasm heap memory or memory mapped to a WebGPU buffer that cannot be detached
14:37:00 ... the loop breaks out and reports an error
14:37:26 ... several earlier ArrayBuffers are already detached and cannot be used in the main thread, and there is no recovery mechanism in the current implementation
14:37:46 RafaelCintron: could we loop through everything before detaching, to make sure everything can be detached?
14:38:12 ... put a pointer to say "this cannot be detached" and only when we know everything can be detached do we detach everything
14:38:45 ningxin_hu: a validation step beforehand is a good idea, that would make it an atomic operation; thanks for mentioning that approach, we can put that in spec language
14:39:14 ... we want to hear input from implementers on this issue about what makes for the best design
14:39:37 ...
two loops would make it an atomic operation; I want to confirm that is possible and doable, and probably we can address this issue that way
14:40:03 RafaelCintron: probably that is good; could also check with Domenic for feedback, to see if there are other APIs that have a similar problem
14:40:22 ningxin_hu: I will summarize what RafaelCintron suggested and put that in the issue
14:41:23 q?
14:41:45 Subtopic: Support depth_multiplier > 1 for a depthwise conv2d op
14:41:52 anssik: issue #353
14:41:53 https://github.com/webmachinelearning/webnn/issues/353 -> Issue 353 Support `depth_multiplier > 1` for a depthwise conv2d op (Honry)
14:41:56 ... an issue from Wanming
14:42:06 ... it suggests a fix to a note in the conv2d op
14:42:10 -> https://www.w3.org/TR/webnn/#api-mlgraphbuilder-conv2d The conv2d() method
14:42:15 ... the note now reads "A depthwise conv2d operation is a variant of grouped convolution, used in models like the MobileNet, where the options.groups = input_channels = output_channels"
14:42:36 ... suggested fix: "options.groups = input_channels = output_channels / depth_multiplier"
14:43:02 q?
14:44:12 Wanming: we found this issue when implementing the TFLite WebNN delegate
14:44:32 q+
14:44:34 q?
14:44:39 ack ningxin_hu
14:45:06 ningxin_hu: this is probably a framework compatibility issue; Wanming works on the TFLite delegate
14:46:17 ... for WebNN we try to cover the depthwise conv2d op; we can emulate other variants
14:47:12 ... another aspect: in the implementation, XNNPACK has a depthwise conv kernel
14:48:27 ... we could have a note in the spec; this issue is motivated by frameworks
14:49:00 ningxin_hu: there's a real model that requires depth_multiplier; my open question is that we need a survey on the implementability side, how to implement this with native ML APIs
14:49:53 Wanming: I can do this investigation
14:49:58 anssik: Thanks!
14:50:08 q?
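[ Scribe note: the suggested note fix pins down the shape relations: with a depth_multiplier, a depthwise conv2d has output_channels = input_channels * depth_multiplier and options.groups = input_channels. A tiny illustrative helper (ours, not part of the WebNN API) makes the arithmetic concrete. ]

```javascript
// Shape bookkeeping for a depthwise conv2d per the suggested note fix:
// options.groups = input_channels = output_channels / depth_multiplier.
function depthwiseConv2dShapes(inputChannels, depthMultiplier) {
  const outputChannels = inputChannels * depthMultiplier;
  const groups = inputChannels; // equals outputChannels / depthMultiplier
  return { groups, inputChannels, outputChannels };
}

// With depth_multiplier = 1 this reduces to the current note's
// groups = input_channels = output_channels; depth_multiplier = 2
// doubles the output channels while groups stays input_channels.
```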
14:50:17 Subtopic: Clarify interpolation algorithm for resample2d
14:50:27 anssik: issue #358 and related issue #270
14:50:27 https://github.com/webmachinelearning/webnn/issues/358 -> Issue 358 Please clarify interpolation algorithm for resample2d (BruceDai) enhancement
14:50:27 https://github.com/webmachinelearning/webnn/issues/270 -> Issue 270 Support coordinate transformation modes for Resample2d (Honry)
14:51:08 https://www.w3.org/TR/webnn/#enumdef-mlinterpolationmode
14:51:28 anssik: resample2d() resamples the tensor values from the source to the destination spatial dimensions according to the scaling factors
14:51:42 ... resample2d() supports two interpolation algorithms to fill the output tensor values:
14:51:48 enum MLInterpolationMode {
14:51:48 "nearest-neighbor",
14:51:48 "linear"
14:51:48 };
14:52:00 anssik: the issue asks for clarifications on these modes and how to properly implement them
14:52:15 ... it appears there are various interpretations of these algorithms depending on the domain
14:52:20 ... Dwayne gives an example for nearest neighbor sampling
14:52:27 ... the banking industry uses "round halves up"
14:52:38 ... in graphics, "round to nearest with X.5 halves toward negative infinity"
14:52:47 ... thoughts?
14:54:13 q?
14:54:55 [ no comments at this time ]
14:55:03 anssik: input on GH welcome
14:55:22 Topic: Support for transformers
14:56:16 anssik: Continue discussing transformers and related requirements and gaps. Test our improved contribution guidelines with these new ops: explore key use cases, sample models, cross-framework support, cross-platform implementability.
14:56:26 ... Status: The WG decided to start exploring support for transformers in WebNN.
14:56:42 ... Next step: Identify use cases. Contributions welcome via issue #375.
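[ Scribe note on the resample2d discussion above: the two nearest-neighbor rounding conventions mentioned ("round halves up" vs "halves toward negative infinity") agree everywhere except at exact .5 source coordinates, which is why the spec needs to pick one. A minimal illustration: ]

```javascript
// Nearest-neighbor source index under the two rounding conventions
// mentioned in the discussion. They differ only when the source
// coordinate lands exactly on x.5.
const roundHalfUp = (x) => Math.floor(x + 0.5);      // "banking" convention
const roundHalfToNegInf = (x) => Math.ceil(x - 0.5); // common graphics convention

// At x = 2.5 the first picks source index 3, the second picks 2;
// at x = 2.4 both pick index 2.
```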
14:56:43 https://github.com/webmachinelearning/webnn/issues/375 -> Issue 375 Mention transformer in use cases (dontcallmedom) v2
14:57:56 q+
14:58:05 anssik: for example, is there interest to look at the gaps in Stable Diffusion?
14:58:16 ack RafaelCintron
14:58:40 RafaelCintron: I have no objection to exploring this space of transformers, they are the upcoming cool thing in the industry
15:00:15 q?
15:00:46 ningxin_hu: Stable Diffusion would be a good one
15:01:17 ... Transformers.js is another good one; it supports many HuggingFace models
15:01:17 ... Segment Anything and segmentation usages we can also check out
15:02:31 q?
15:02:32 RRSAgent, draft minutes
15:02:33 I have made the request to generate https://www.w3.org/2023/05/11-webmachinelearning-minutes.html anssik