IRC log of textcueapi on 2019-09-18
Timestamps are in UTC.
- 01:46:59 [RRSAgent]
- RRSAgent has joined #textcueapi
- 01:46:59 [RRSAgent]
- logging to https://www.w3.org/2019/09/18-textcueapi-irc
- 01:47:01 [wseltzer]
- rrsagent, make logs public
- 01:47:04 [wseltzer]
- wseltzer has left #textcueapi
- 02:07:06 [nigel]
- nigel has joined #textcueapi
- 03:04:38 [nigel]
- nigel has joined #textcueapi
- 04:32:07 [nigel]
- nigel has joined #textcueapi
- 05:30:13 [nigel]
- nigel has joined #textcueapi
- 05:35:53 [gkatsev]
- gkatsev has joined #textcueapi
- 05:36:25 [atai]
- atai has joined #textcueapi
- 05:36:44 [nigel_]
- nigel_ has joined #textcueapi
- 05:36:58 [tidoust]
- tidoust has joined #textcueapi
- 05:37:54 [tidoust]
- scribe: tidoust
- 05:38:19 [tidoust]
- -> https://www.icloud.com/keynote/0GJUbJwWfA2i77M2JysKjj45w#Generic_Text_Cue_-_FOMS_2019 Slides
- 05:38:28 [nigel_]
- Present+ Nigel_Megitt
- 05:38:35 [tidoust]
- Tess: [introduces the problem space]
- 05:38:41 [pal]
- pal has joined #textcueapi
- 05:39:08 [takio]
- takio has joined #textcueapi
- 05:39:11 [tidoust]
- ... If you use the browser's built-in captioning system, you get things such as user stylesheets for free, but if you don't, you're on your own.
- 05:39:12 [MasayaIkeo]
- MasayaIkeo has joined #textcueapi
- 05:39:34 [tidoust]
- ... Also you don't get to participate in platform integration such as PiP.
- 05:40:08 [tidoust]
- ... Why? We have a large corpus of existing captions. Also, WebVTT does not handle all cases.
- 05:40:28 [tidoust]
- ... All this suggests that a smart thing to do would be to decouple delivery from display.
- 05:40:54 [tidoust]
- ... So that the user agent can at least participate in the rendering of captions.
- 05:41:43 [tidoust]
- ... Essentially, the goal is to insert a cue model before the cue gets displayed so that, in the future, you can add support for a variety of formats.
- 05:42:12 [tidoust]
- ... In-band captions come in a variety of formats. If the browser supports them, this proposal would allow them to be handled.
- 05:42:40 [tidoust]
- ... What requirements for the data model?
- 05:43:04 [tidoust]
- ... It has to be reasonably expressive, and it should be really easy to use for the common caption formats out there, starting with WebVTT.
- 05:43:07 [cyril]
- cyril has joined #textcueapi
- 05:43:18 [tidoust]
- ... Also the data model itself should be easy to manipulate in JS.
- 05:43:57 [tidoust]
- ... [showing an example in WebVTT and IMSC1, and how a data model could represent that]
- 05:44:13 [tidoust]
- ... Web app should be able to create these things by hand
- 05:44:48 [tidoust]
- ... Basic proposal is to extend the basic TextTrackCue interface and to restructure parts of WebVTT and TTML.
- 05:45:46 [tidoust]
- ericc: [showing a demo of a modified version of WebKit]
- 05:46:25 [tidoust]
- ... TextTrackCue is an abstract interface in HTML. No constructor. I have modified that so that it has a constructor that takes a start/end time and an object that follows the data model that describes the cue
- 05:47:12 [tidoust]
- ... [Big Buck Bunny demo]
- 05:48:14 [tidoust]
- [Polyfill demo at https://sandflow.com/ttapi-demo/big-buck-bunny.html]
- 05:49:33 [tidoust]
- ericc: The data model is verbose when there are styles. But caption can be text only, with default styles.
- 05:50:33 [tidoust]
- ... [showing demo with more styles and regions]
- 05:50:46 [tidoust]
- ... It's obviously possible to apply very complex styles.
- 05:51:15 [tidoust]
- pal: The basic idea is to use the TTML model constrained by IMSC, because that seems to be a good place to start.
- 05:51:28 [tidoust]
- ... If gaps are found, we can backport them in TTML and IMSC.
- 05:51:47 [tidoust]
- ... It's exactly the same model. Both the content model and the style properties.
- 05:51:58 [tidoust]
- ... Most of the styles are direct mappings to their CSS counterparts.
- 05:52:32 [tidoust]
- ... Some people may ask how to define the HTML rendering. There's some open source code that can show how to do it.
- 05:53:23 [tidoust]
- ericc: What I've done in this version of WebKit is to take a JS library that understands this data model and outputs a document fragment.
- 05:53:40 [tidoust]
- glenn: Could you represent the data model as a document fragment directly?
- 05:54:11 [tidoust]
- ... Instead of going through the process of translating from this data model to a document fragment, could you use a serialized version of a document fragment?
- 05:54:23 [tidoust]
- ericc: Allowing that as an input?
- 05:54:42 [tidoust]
- ... Allowing that as input might make some people nervous.
- 05:54:49 [tidoust]
- glenn: Yes, with scripts disabled.
- 05:55:29 [tidoust]
- nigel: We discussed this last year and the years before that. These concerns were raised.
- 05:56:18 [tidoust]
- glenn: The reason I ask is that there are other CSS properties than those defined in this list and if I want to construct a TextTrackCue object that uses them, it would be good to have a mechanism that makes it possible.
- 05:56:36 [tidoust]
- ... E.g. TTML2.
- 05:57:14 [tidoust]
- pal: There are pseudo-classes in the proposal that follow WebVTT and that address a lot of use cases.
- 05:57:41 [tidoust]
- ... The model is easily extensible.
- 05:57:51 [tidoust]
- glenn: Adding means writing more code and landing the changes.
- 05:58:13 [tidoust]
- ericc: If CSS can be used, I don't see why you wouldn't be able to use your own stylesheet directly.
- 05:58:53 [tidoust]
- gary: WebVTT only allows certain CSS. Do we want to open it up?
- 05:58:59 [tidoust]
- ericc: No, I don't think so.
- 05:59:48 [tidoust]
- glenn: IMSC is evolving. We're adding new properties to it. I just don't want to be restricted. It would be good to have a built-in extensibility mechanism.
- 06:00:00 [tidoust]
- pal: In my mind, this evolves as well.
- 06:00:25 [tidoust]
- glenn: Yes, but I'm trying to avoid changing the code in the browsers. The way you suggest it is to have browser vendors update their code. I'd like to avoid that.
- 06:00:35 [tidoust]
- ericc: We should talk to find a safe way to do it.
- 06:00:47 [tidoust]
- nigel: What about the metadata?
- 06:00:52 [tidoust]
- pal: No metadata for now.
- 06:01:18 [tidoust]
- nigel: Would it be useful to have an API that allows the author to access metadata?
- 06:01:50 [tidoust]
- ericc: Since this is a JS API, the cues need to be created from script. You can add any attribute you want; when the event fires, you'll get it back.
- 06:02:06 [tidoust]
- ... There is a "content" attribute, I think.
- 06:02:18 [tidoust]
- ... If you want to add something else to the cue, you can certainly do that.
- 06:03:55 [tidoust]
- mounir: We talked about that at FOMS. I don't have a strong opinion on the API. Feedback is the same as for WebVTT: people got fed up because of slightly different implementations. I think the same would happen here.
- 06:04:11 [tidoust]
- tess: How do you do captions in PiP?
- 06:04:17 [tidoust]
- mounir: We don't support captions in PiP.
- 06:05:09 [tidoust]
- nigel: Question about positioning.
- 06:05:30 [tidoust]
- ... Page gets some video in there and cues. How do you relate the pixels of what you draw to the video?
- 06:06:07 [tidoust]
- pal: Each cue renders a rectangular region which typically overlaps the video.
- 06:06:24 [tidoust]
- nigel: How does that relate to PiP?
- 06:07:07 [shu]
- shu has joined #textcueapi
- 06:07:17 [tidoust]
- mounir: People cannot do that by design if they use WebVTT. Assuming we can have web sites that want to make use of this solution, that would be much easier.
- 06:08:08 [tidoust]
- greg: At Netflix, we'd be interested from a rendering, accessibility and performance perspective in this solution.
- 06:08:53 [cyril]
- s/and performance/, synchronization, and performance/
- 06:09:00 [tidoust]
- markw: For accessibility, we do have site-wide customization. If the customization in the browser has default values, there may be a conflict between Netflix default values and UA default values.
- 06:09:11 [tidoust]
- ericc: Right, that may be an issue.
- 06:09:36 [tidoust]
- greg: In this model, would there be device settings?
- 06:09:58 [tidoust]
- pal: Regulatory requirements whereby users can select particular styles?
- 06:10:22 [nigel]
- i/ericc: Right/nigel: Same issue for BBC
- 06:10:44 [tidoust]
- ... Generally providing hooks allows these styles to apply in the first place.
- 06:11:37 [tidoust]
- ericc: [showing Safari style UI]. A checkbox lets the video override some of the settings.
- 06:11:44 [tidoust]
- ... We honor that.
- 06:11:51 [tidoust]
- s/Safari/Mac OS/
- 06:12:08 [tidoust]
- pal: Having a common API gives us an opportunity to have a common approach.
- 06:12:27 [tidoust]
- greg: OK, it looks like we can, we just have to do things in JS.
- 06:14:19 [tidoust]
- james: I want to clarify that this is not specific to Mac OS. That's written in the FCC mandate that the user style should be able to override.
- 06:14:34 [tidoust]
- greg: Not one by one, as a whole.
- 06:15:20 [tidoust]
- pal: My point is that by having a common model, we can at least have a common ground for discussion.
- 06:15:29 [tidoust]
- andreas: I agree with Pierre.
- 06:16:07 [tidoust]
- ... In general, I think that's a really great proposal. Speaking from a German / Swiss perspective, we have trouble bringing captions from broadcasters to the Web.
- 06:16:15 [tidoust]
- ... Different formats, etc.
- 06:16:35 [tidoust]
- ... That leads to accessibility issues.
- 06:16:45 [tidoust]
- ... Really important that we try to work on this.
- 06:17:21 [nigel]
- i/pal: My point is that/nigel: Please don't force FCC requirements on the whole world through standards - they're not accepted as good everywhere.
- 06:19:03 [tidoust]
- chcunningam: Can you find a way to work with HTML directly to avoid a JSON format?
- 06:19:17 [tidoust]
- pal: That's where I started, but it doesn't work in the end.
- 06:19:39 [tidoust]
- ericc: The browser needs to understand exactly what's what so that we can apply the user styles to the right portions of the captions.
- 06:19:58 [tidoust]
- cyril: Having pre-defined classes.
- 06:20:04 [tidoust]
- pal: That's how WebVTT works.
- 06:20:34 [nigel]
- i/greg: OK/nigel: You could probably spec how the OS settings are applied by defining a place where OS style settings are inserted into the JSON structure before presentation.
- 06:20:47 [tidoust]
- chcunningam: By adding an API that also doesn't respect Netflix's default styles, we're not solving the issue, right?
- 06:21:12 [tidoust]
- ericc: It depends on the perspective. For us, it's extremely important to respect user's accessibility settings.
- 06:21:38 [tidoust]
- ... This proposal is intended to make it easier for people who feel that they need to render their captions themselves.
- 06:22:29 [tidoust]
- greg: Having a proposal that allows user to go to the site or to the device is good.
- 06:23:03 [cyril]
- q+
- 06:23:18 [tidoust]
- ericc: We can't use styles outside of the shadow dom because that would be a massive fingerprinting issue.
- 06:23:32 [Zakim]
- Zakim has joined #textcueapi
- 06:23:41 [tidoust]
- s/use styles/expose user's styles/
- 06:23:46 [nigel]
- q+ cyril
- 06:24:04 [nigel]
- q+ to ask about exposing the document fragments to AT
- 06:24:46 [tidoust]
- chcunningam: If you had class names that say "this is the text" or "this is the speaker name", then perhaps you could handle some of it that way
- 06:25:05 [tidoust]
- ericc: You mean an alternative that uses a document fragment instead of this?
- 06:25:11 [tidoust]
- chcunningam: yes.
- 06:25:42 [tidoust]
- tess: I would just have to see it work with concrete examples.
- 06:25:45 [tidoust]
- ack cyril
- 06:25:46 [gkatsev]
- q+ to say that webvtt already has an example with their color classes like .blue and .bg_blue
- 06:25:58 [tidoust]
- cyril: I heard support for this activity. Where do we work on it?
- 06:26:34 [tidoust]
- tess: I understand that there's a Timed Text WG that meets tomorrow. That's a start. We could also start with WICG and take it from there.
- 06:26:56 [tidoust]
- ... The shape of the current proposal is patches to WebVTT and TTML specs.
- 06:27:13 [tidoust]
- ... This might be a temporary document.
- 06:27:24 [nigel]
- s/WebVTT and TTML/WebVTT and HTML
- 06:27:40 [tidoust]
- andreas: I would definitely support proposing this to WICG. We need more experts than we have in the Timed Text WG.
- 06:28:08 [tidoust]
- tess: I'm hearing we should start a WICG Discourse thread.
- 06:28:53 [tidoust]
- pal: Any new feature that may arise would need to be backported in TTML.
- 06:29:05 [tidoust]
- glenn: As this stands, the defined classes are close to IMSC
- 06:29:51 [tidoust]
- pal: If we need something that is neither in CSS nor TTML, then we need to take the discussion back to TTWG to understand why it needs to be added.
- 06:30:13 [tidoust]
- glenn: It could also add new entropy to the process.
- 06:30:33 [tidoust]
- mounir: The main point is the API design, not really the accessibility part of it.
- 06:30:46 [tidoust]
- pal: yes, just join the TTWG if you want to talk about accessibility.
- 06:31:04 [nigel]
- q?
- 06:31:34 [tidoust]
- cyril: If the WICG says that a Document fragment can be used, then it opens up other possibilities such as graphics overlay.
- 06:31:47 [tidoust]
- tess: We're out of time, thanks!
- 06:31:54 [tidoust]
- RRSAgent, make logs public
- 06:32:02 [tidoust]
- Meeting: Next Generation TextTrackCue
- 06:32:05 [MasayaIkeo]
- MasayaIkeo has joined #textcueapi
- 06:32:22 [tidoust]
- Chair: Eric Carlson, Tess O'Connor, Pierre-Anthony Lemieux
- 06:32:29 [tidoust]
- RRSAgent, draft minutes
- 06:32:29 [RRSAgent]
- I have made the request to generate https://www.w3.org/2019/09/18-textcueapi-minutes.html tidoust
- 06:44:06 [MasayaIkeo]
- MasayaIkeo has joined #textcueapi
- 07:04:35 [nigel]
- nigel has joined #textcueapi
- 07:05:11 [pal]
- pal has joined #textcueapi
- 07:18:47 [tidoust]
- tidoust has joined #textcueapi
- 07:21:58 [nigel]
- nigel has joined #textcueapi
- 07:22:32 [nigel_]
- nigel_ has joined #textcueapi
- 07:31:48 [pal_]
- pal_ has joined #textcueapi
- 07:31:52 [tidoust_]
- tidoust_ has joined #textcueapi
- 07:31:59 [nigel]
- nigel has joined #textcueapi
- 07:33:16 [nigel]
- zakim, list attendees
- 07:33:16 [Zakim]
- As of this point the attendees have been (no one)
- 07:33:24 [nigel]
- zakim, list participants
- 07:33:24 [Zakim]
- As of this point the attendees have been (no one)
- 07:35:35 [pal_]
- pal_ has left #textcueapi
- 07:45:25 [atai]
- atai has joined #textcueapi
- 07:47:17 [MasayaIkeo]
- MasayaIkeo has joined #textcueapi
- 07:51:57 [tidoust_]
- tidoust_ has joined #textcueapi
- 07:59:56 [tidoust]
- tidoust has joined #textcueapi
- 08:00:07 [nigel_]
- nigel_ has joined #textcueapi
- 08:00:52 [atai1]
- atai1 has joined #textcueapi
- 08:31:24 [tidoust]
- tidoust has joined #textcueapi
- 08:35:01 [nigel_]
- nigel_ has joined #textcueapi
- 08:35:46 [nigel]
- nigel has joined #textcueapi
- 08:37:41 [MasayaIkeo]
- MasayaIkeo has joined #textcueapi
- 08:38:14 [MasayaIkeo]
- MasayaIkeo has joined #textcueapi
- 09:31:45 [nigel]
- nigel has joined #textcueapi
- 09:33:21 [MasayaIk_]
- MasayaIk_ has joined #textcueapi
- 09:57:30 [Zakim]
- Zakim has left #textcueapi
- 10:06:29 [MasayaIkeo]
- MasayaIkeo has joined #textcueapi
- 11:55:32 [tidoust]
- tidoust has joined #textcueapi