21:58:30 RRSAgent has joined #mediawg
21:58:34 logging to https://www.w3.org/2024/01/09-mediawg-irc
21:58:37 Zakim has joined #mediawg
21:58:53 Meeting: Media WG
21:59:08 Agenda: https://www.w3.org/events/meetings/45b57f00-30f0-498b-ae71-b9241a515314/
21:59:35 RRSAgent, make logs public
22:02:11 present+ Chris_Needham, Jan-Ivar_Bruaroey, Bernard_Aboba, Mark_Watson, Youenn_Fablet, Francois_Daoust, Sun_Shin, Johannes_Kron, Thomas_Guilbert, Jean-Yves_Avenard
22:02:33 Chair: Chris_Needham, Marcos_Caceres
22:02:44 present+ Marcos_Caceres, Jer_Noble
22:03:26 present+ Mark_Foltz
22:03:29 markw has joined #mediawg
22:03:37 present+ Eugene_Zemtsov
22:03:42 Present+ Tommy_Steimel, Mark_Foltz
22:03:48 mfoltzgoogle has joined #mediawg
22:03:54 Present+ Mark_Foltz
22:03:59 present+ markw
22:04:14 present+ Tommy_Steimel
22:05:28 scribe+
22:05:41 marcos has joined #mediawg
22:05:50 present+ Dale_Curtis
22:07:07 Topic: Media Session
22:07:27 subtopic: Relationship between toggle microphone/camera actions and MediaStreamTrack mute/unmute events
22:07:54 jan-ivar: See -> https://github.com/w3c/mediasession/issues/307 issue #307
22:08:09 ... To my knowledge, no relationship for now, which is a bit unfortunate.
22:08:45 ... There is also an implementation in Chrome. Chrome never mutes the track.
22:09:17 ... When the user clicks the buttons, these buttons are 100% JavaScript controlled. Which might be fine for picture-in-picture.
22:09:51 ... Users may have some expectation that a web page cannot toggle the camera when they used the buttons.
22:10:47 ... User agents could decide to mute tracks based on whether the user clicked the toggle buttons, but there are some challenges with the API because the API can maintain the state.
22:10:58 ... Can we trust the web page with that?
22:11:17 ... If we don't, we end up with the double mute issue, which has some advantages and disadvantages.
22:11:22 ... Some proposals in the issue.
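The Chrome model jan-ivar describes (the toggle buttons only invoke page script; the UA never mutes the track) can be sketched as follows. This is a hedged illustration: `micTrack`, `handleToggleMicrophone`, and the module-level state are illustrative names, not from the minutes; only `navigator.mediaSession.setActionHandler('togglemicrophone', ...)` and `setMicrophoneActive()` come from the Media Session API under discussion.

```javascript
// Sketch of the current Chrome-style model: the toggle buttons are
// "100% JavaScript controlled", so the page owns the microphone state
// and the UA never mutes the MediaStreamTrack itself.
// `micTrack` is a hypothetical track obtained from getUserMedia().
let micActive = true;

function handleToggleMicrophone(micTrack) {
  micActive = !micActive;
  micTrack.enabled = micActive; // page-side mute; the track is never UA-muted
  if (typeof navigator !== 'undefined' &&
      navigator.mediaSession?.setMicrophoneActive) {
    navigator.mediaSession.setMicrophoneActive(micActive); // sync the UA's UI
  }
  return micActive;
}

// In a browser, this would be wired up with:
//   navigator.mediaSession.setActionHandler('togglemicrophone',
//       () => handleToggleMicrophone(micTrack));
```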
22:12:19 Youenn: I would maybe add that it would be interesting for us to use the Media Session API to mute/unmute capture. That seems reasonable. Right now, focused on PiP. The Safari UI is pretty similar though.
22:12:51 ... The Safari UA does not trust the web page though. This is different from the implementation in Chrome.
22:13:10 ... One model where the UA would trust the web page, one model where the UA would not trust the web page.
22:13:34 ... I would prefer to move to a world where the UA does not trust the web page, but that may be for another time.
22:14:05 ... It seems that we are mostly aligned on extending the API for the model where the UA does not trust the web page.
22:14:40 Tommy: So, setMicrophone(active) would return a Promise and maybe fail?
22:14:47 Jan-Ivar: Right?
22:14:52 s/Right?/Right.
22:14:58 Tommy: That seems fine.
22:15:38 Youenn: We might want to tighten the spec too, e.g., mute/unmute events might fire after the callback, these kinds of things.
22:16:19 ... When you get tracks, each track has a muted flag, with a capture flag as well.
22:16:50 Tommy: So the mute event is really there to tell the web page that you're about to mute, but there's nothing they can do.
22:16:57 Youenn: They could stop capture.
22:17:22 Tommy: In the Chrome case, the application would be responsible. In the Safari case, the UA would be.
22:17:25 Youenn: Right.
22:18:03 Jan-Ivar: So the goal is to synchronize the mute cases to reduce the double mute problem.
22:18:09 Youenn: Yes.
22:18:21 Tommy: I don't have a problem with that.
22:18:51 s/setMicrophone(active)/setMicrophoneActive()
22:19:03 Youenn: OK, then I may prepare two PRs.
22:19:49 Tommy: Sometimes, web sites will continue to listen to understand when someone is speaking while muted.
22:20:30 Youenn: The longer-term plan would be to have a dedicated event on muted tracks to alert when users are speaking while the track is muted. Some support at the OS level. Not spec-ed yet though.
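The promise-returning shape discussed above (setMicrophoneActive(), per the scribe's correction) could be consumed by a page roughly like this. The wrapper function and its return shape are illustrative assumptions, since the PRs had not yet been written; only the rejection semantics and the note that mute/unmute events may fire after the promise settles come from the discussion.

```javascript
// Sketch of the model where the UA does not trust the page: the page
// *requests* a microphone state change via the proposed promise-returning
// navigator.mediaSession.setMicrophoneActive() and must handle refusal.
// The wrapper and its return shape are illustrative, not specified.
async function requestMicrophoneState(active) {
  if (typeof navigator === 'undefined' ||
      !navigator.mediaSession?.setMicrophoneActive) {
    return { applied: false, reason: 'media-session-unavailable' };
  }
  try {
    await navigator.mediaSession.setMicrophoneActive(active);
    return { applied: true };
  } catch (err) {
    // The UA declined the change; the actual state is reported through the
    // track's mute/unmute events, which (per the discussion) are allowed to
    // fire after this promise settles.
    return { applied: false, reason: String(err) };
  }
}
```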
22:20:39 Tommy: OK, that seems reasonable to me.
22:22:31 Youenn: The toggleMicrophone thing might become ambiguous. When you toggle, you might not know what the current state is. In the future, we may want to expose additional info, especially in the mode where the page is not trusted.
22:22:52 Tommy: Yes, there could be a race here.
22:22:59 Youenn: I'll file another issue for that.
22:23:25 ... One PR for the API change setMicrophoneActive, and one PR for mentioning that the mute events can fire after.
22:23:44 cpn: This issue was spawned as part of the double mute issue on media capture.
22:23:54 ... To be followed up in the WebRTC WG.
22:24:25 Youenn: Yes, I think the WebRTC WG should discuss this.
22:25:13 Jan-Ivar: The PR should also address -> https://github.com/w3c/mediasession/issues/279 #279
22:25:41 Youenn: To be closed once the PR is ready and merged.
22:25:47 Topic: Media Capabilities
22:26:27 cpn: I did an issue triage before Christmas. I added labels to some of these. Happy to adjust things based on your own prioritization.
22:26:48 ... For the V1 milestone, really clarification issues.
22:27:17 ... For the V2 milestone, extensions to the current API, such as the transition API, text track capabilities, and audio rendering capabilities.
22:27:33 ... Depending on priorities, we may want to move issues around between V2 and V1.
22:28:49 cpn: The next thing we talked about last time was the privacy issue raised by PING. We need to review that and answer their question, more specifically about why we have a capabilities API to start with, as opposed to letting the user agent pick the right option.
22:29:20 ... Call for people to help get started on this.
22:30:02 Bernard: I can help with some of it. Some differences, but some similarities. In WebRTC, media is negotiated.
22:30:26 cpn: Thanks, that would be useful. I haven't looked at the current questionnaire.
22:30:39 Bernard: I've had the same questions come up in the WebRTC SVC case.
22:31:15 cpn: If you can start something, then we can look at it from an adaptive streaming perspective.
22:31:25 Bernard: Yes, these two areas may have slightly different answers.
22:32:12 ... Did the same questions come up in WebCodecs with isTypeSupported()?
22:32:25 Eugene: I don't think we requested review from PING yet.
22:32:41 cpn: I'll check the status.
22:34:01 cpn: Given that we have you and Youenn, I'd like to explore whether it makes sense to have separate capabilities APIs.
22:34:34 Jer: Do they return the same information?
22:35:02 Youenn: Some differences between isConfigSupported and Media Capabilities. Related, but not exactly the same thing.
22:35:18 ... Smoothness and power efficiency are good examples.
22:35:48 ... We could add a note on how developers could approach this (real-time or not real-time scenarios).
22:37:24 Eugene: Example: encode video at 60fps in high definition. Most devices won't support that. Currently, the spec does not say that it should be rejected because the frame rate won't be supported.
22:37:28 ... But it could be extended.
22:38:17 ... WebCodecs allows people to do lots of different things. Different people mean different things when they use WebCodecs. I wouldn't add anything to Media Capabilities related to WebCodecs.
22:39:00 Jer: From a quick overview of isConfigSupported, it seems that it could be possible to define it in terms of Media Capabilities.
22:39:33 ... With the update on smoothness linked to real-time scenarios.
22:39:53 Bernard: Except that it returns a configuration.
22:40:22 Eugene: If the UA does not support some new key, it will return the configuration that it understands without the options that the developer requested.
22:40:58 Jer: OK, that seems similar to an issue we discussed last time, where Safari returns the dictionary parameters the way it understood them.
22:41:57 Bernard: In WebCodecs, you can have codec-specific stuff that won't be in Media Capabilities. Per-frame QP, for instance.
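The overlap being discussed can be made concrete by asking both APIs about the same decode scenario. The dictionary shapes below follow the published Media Capabilities and WebCodecs specs; the codec string, resolution, and bitrate are arbitrary examples, not values from the minutes.

```javascript
// Sketch querying both capability surfaces for one decode scenario.
// Media Capabilities answers smooth/powerEfficient for a playback use case;
// WebCodecs isConfigSupported() answers supported plus an echo of the config
// as the UA understood it (unknown keys dropped, as noted in the discussion).
async function probeDecode() {
  if (typeof navigator === 'undefined' || !navigator.mediaCapabilities ||
      typeof VideoDecoder === 'undefined') {
    return null; // not running in a browser with both APIs
  }
  const mc = await navigator.mediaCapabilities.decodingInfo({
    type: 'media-source',
    video: {
      contentType: 'video/mp4; codecs="avc1.64001F"', // arbitrary example
      width: 1280,
      height: 720,
      bitrate: 3_000_000,
      framerate: 30,
    },
  });
  const wc = await VideoDecoder.isConfigSupported({
    codec: 'avc1.64001F',
    codedWidth: 1280,
    codedHeight: 720,
  });
  return {
    mediaCapabilities: {
      supported: mc.supported,
      smooth: mc.smooth,
      powerEfficient: mc.powerEfficient,
    },
    webCodecs: { supported: wc.supported, echoedConfig: wc.config },
  };
}
```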
22:42:09 ... Not exposed through MIME type parameters.
22:42:19 ... It gives information at a more detailed level.
22:42:32 ... About what an encoder/decoder can do.
22:42:45 Jer: OK, I think I haven't looked into this in detail.
22:43:29 Bernard: Side example of a feature for AV1 that we may want to revisit.
22:43:53 Jer: Another API that encodes media may want to expose the same information that we expose in WebCodecs.
22:44:17 Eugene: WebCodecs is designed to be low-level. Other media APIs are more for people who want video to just work.
22:44:45 Jer: I understand that. But it would be problematic to force people into WebCodecs if they need the detailed information for some reason.
22:44:52 ... We can talk about that in the future.
22:45:20 Bernard: Yes, for example contentHints in WebRTC and WebCodecs, and they're not quite the same in both contexts.
22:46:10 Dale: The Media Capabilities API could perhaps need to ingest WebCodecs stuff.
22:46:35 ... Because there are so many WebCodecs parameters.
22:46:59 Jer: Unifying places where you can get answers about media capabilities seems good to me.
22:47:20 Bernard: The input could look very different for WebRTC and WebCodecs.
22:47:42 ... It's not clear to me that we wouldn't complicate Media Capabilities.
22:48:38 ... An example from WebRTC is resolution, handled by the user agent; you might get a lower resolution because the CPU is busy, for instance.
22:48:47 ... A little different from what you would get from WebCodecs.
22:49:43 Youenn: If I understand things, we're thinking that adding WebCodecs to Media Capabilities is something big and potentially difficult. Perhaps worth doing in the future.
22:50:16 ... We may still get developers who may want to express media capabilities in terms of WebCodecs parameters.
22:51:26 cpn: The next issue that I prioritized is around AudioConfiguration, and whether we should be distinguishing the decode capabilities from the rendering capabilities, as done for video.
22:52:00 ...
... For the channel configuration, right now it's a string, and I don't know what you can query through this.
22:52:25 ... Some developers need to tell whether they can deliver multi-channel or whether the browser is going to downmix that to stereo.
22:52:49 ... Chris Cunningham put together a document at the time.
22:52:56 ... Is this something that we want to work on?
22:53:15 ... Perhaps for today, we can just look at it from a prioritization point of view.
22:54:00 ... The document explores how we might add this through the Audio Output Devices API.
22:55:38 Dale: The stuff about channel count is more on the Web Audio side. Developers may want to have something where they can specify through a dictionary which channel is which.
22:56:22 jer: We heard through Netflix the need to know support for advanced Dolby channels.
22:57:02 ... We have received feedback on spatial rendering but not on the number of channels per se. Multichannel or stereo, essentially.
22:58:45 jan-ivar: If there's a need to expose more information in the Audio Output Devices API, that would be in the WebRTC WG.
22:59:25 cpn: Also an issue on trimming down AudioConfiguration, with a proposal to remove features not used in Chromium.
22:59:46 ... Other implementations may still find them useful; my suggestion is to leave them in.
23:01:03 cpn: If anybody has editorial capacity to work on some of these things, there are a number of things where it's just about working on spec text.
23:01:20 ... I labeled them with a "pr-needed" label.
23:01:59 Marcosc has joined #mediawg
23:02:10 ... There are a number of other issues for which we may need wider feedback. I propose to bring that to the Media & Entertainment IG to collect industry input.
23:02:45 Topic: Next call
23:03:11 cpn: Next call on 13 February.
23:03:15 RRSAgent, draft minutes
23:03:16 I have made the request to generate https://www.w3.org/2024/01/09-mediawg-minutes.html tidoust
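As a footnote to the AudioConfiguration discussion earlier in these minutes: the under-specified `channels` member could be probed as below. The probe function, codec string, and channel values are illustrative assumptions; what is accurate to the spec is that `channels` is a string, and (the gap cpn describes) that `supported: true` does not reveal whether the UA will downmix multi-channel audio to stereo.

```javascript
// Sketch probing multi-channel audio decode support via Media Capabilities.
// AudioConfiguration.channels is a DOMString (e.g. '2' or '5.1'), and the
// answer says nothing about downmixing, which is the gap discussed above.
async function probeAudioChannels(channels) {
  if (typeof navigator === 'undefined' || !navigator.mediaCapabilities) {
    return null; // not running in a browser
  }
  return navigator.mediaCapabilities.decodingInfo({
    type: 'file',
    audio: {
      contentType: 'audio/mp4; codecs="ec-3"', // illustrative codec string
      channels,                                // e.g. '2' or '5.1'
      bitrate: 640_000,
      samplerate: 48_000,
    },
  });
}
```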