15:55:31 RRSAgent has joined #webfonts 15:55:36 logging to https://www.w3.org/2023/06/06-webfonts-irc 15:55:43 Zakim has joined #webfonts 15:58:08 zakim, start meeting 15:58:08 RRSAgent, make logs Public 15:58:10 Meeting: Web Fonts Working Group Teleconference 15:58:23 present+ 15:58:37 chair: Vlad 15:58:42 chair: Garret 15:59:11 Garret has joined #webfonts 15:59:22 skef has joined #webfonts 16:00:46 bberning has joined #webfonts 16:01:56 present+ 16:01:56 present+ 16:02:01 present+ 16:02:09 JHudson has joined #webfonts 16:02:49 Skef: re: alternate substitutions 16:03:07 Skef: a mechanism for choosing substitutions. 16:03:19 Skef: eg. aalt 16:03:40 Skef: often have the kitchen sync, their not really normal features. When you turn them on you're not really turning them on. 16:03:59 Skef: there's a selection phase and a use phase. You might present a user a UI with the alternates for a glyph. 16:04:24 Skef: in the use phase you put in notation saying i want a substitution for a codepoint at this index. 16:04:37 Skef: this is slightly complicated. Sometimes there are duplicate tables 16:04:55 Skef: same feature name seperate tables. 16:05:23 Skef: question is, when we are supporting these. Do we require you subset with the kitchen sink. Or do we have the option of picking the particular index. 16:05:41 John: I can give additional background. 16:06:08 John: aalt palt are no a good example of this. Not user facing. 16:06:19 John: one we should look at are the character variant features. 16:06:36 John: where each feature is associated with a base character and some alternate variants of it. 16:06:56 John: eg. font Athena Ruby. Has 25 different forms of the greek Alpha 16:07:30 John: eg. font Athena Ruby. Has 25 different forms of the greek Alpha or do we try to find the specific variant. 16:08:13 Skef: I think it's true that aalt/palt is close to zero, but since we're making a protocol that has general use we need to consider how their generally used across applications. 16:08:24 Skef: probably the largest problem in terms of use and size. 16:08:33 Skef: can increase the size significantly. 16:08:51 John: my point was that there are non-zero cases of user features. 16:09:03 John: we need a general protocol principle for type 3 lookups. 16:09:23 Skef: I have some evidence as features everything works by virtue of the fact the subsetter keeps everything. 16:09:31 Skef: harfbuzz will happily include all of those. 16:10:19 Skef: but that can be a big set. Is it worth providing the option in the protocol to set an index. Either requiring or giving an option to the server side to restrict the subset. 16:10:45 Skef: and what that means in terms of the tables that will be generated. You'd fake the earlier substitutions and have the correct set of values. 16:11:02 Skef: for earlier values do an identity substitution. 16:12:47 Garret: from a technical perspective: the first request doesn't know what the substitution indices are and so must grab everything (for the requested codepoints). 16:13:17 Skef: the first request may match a font that the indices are already known. 16:13:38 Skef: with these tables there may be some earlier point where you've requested the full subset. 16:13:52 Skef: with character variant it's possible that's not true. 16:14:06 Skef: they may have a specification for the font that says what they are. 16:15:28 Garret: another issue is that the indices are implementation details. They may change when the font changes. 16:16:03 Skef: indices are somewhat of a convention. 16:16:15 John: indices are into the lookup tables. 16:17:03 Skef: so when you're using one of these things and you want to render the glyph. They pick out of a list and you remember the index and in subsequent calls use it. 16:17:18 Skef: these indices are low positive integers are the interface. 16:17:28 John: CSS did define ways to address these. 16:17:38 John: not sure if they are still in there. 16:18:29 Garret: would be great to find it and review. 16:18:50 Vlad: earlier we had discussions on glyph indices vs codepoints as the basis. 16:19:19 Vlad: which addresses this, but the argument against requires a mandatory roundtrip to grab layout data in order to grab glyph indices. 16:19:33 Vlad: and that roundtrip kills the benefit of reducing extra data. 16:19:44 https://www.w3.org/TR/css-fonts-4/#ref-for-character-variant 16:19:54 Vlad: we might run into the same problem, by saving time we might require an additional round trip. 16:20:09 [Link for CSS Fonts Module details on character variants] 16:21:26 Skef: I don't think so, we'd have to have a way of saying we want the full table. The reality of this table is they are not commonly used and they're unusual in the way that you use them. Just as the best practice for storing state is to start with the codepoints and set of features. Because that allows you to move between versions of fonts. 16:21:41 Skef: the best practice with these is to keep track of codepoints + features + index. 16:22:20 Vlad: when a client starts rendering a page is the character set used from the page and which features are used. There is no font information. 16:22:46 Vlad: so right now the client asks the server and it gives anything that's relevant. 16:23:31 Vlad: the rest of the implementation would be up the server. 16:24:53 John: fonts that have alternate substitution lookups is still a relatively small number. 16:25:07 John: looking at pretty small benefits. The difference between 2 and 4 glyphs. 16:26:16 John: Skef's point because you can use CSS to turn on lower level open type features. If someone did turn on the aalt feature I'm not sure what it would do to the text. It's a dumping ground for all the substitutions in their UI. 16:26:26 John: a lot of tools automatically build it. 16:26:43 John: it could involve a lot of glyphs being downloaded. 16:27:06 John: my inclination is if someone makes a bad decision in CSS we don't need to worry about it too much at the protocol level in IFT> 16:27:35 Vlad: depending on the byte savings may not give us noticeable improvements. 16:28:01 John: because CSS does provide a mechanism targeting character variants. 16:31:00 Garret: It's good to see the CSS feature for this, that addresses some of the concerns as it provides the indices up front. 16:31:09 JHudson_ has joined #webfonts 16:31:20 Garret: also it might be reasonable to just keep range request as the option for dealing with fonts with large alternate sets. 16:31:37 Link for GSUB type 3 test font (Athena Ruby, unclear license) : https://www.doaks.org/resources/athena-ruby 16:31:40 Vlad: might be worth verifying the protocol can communicate the feature sets. 16:33:32 Garret: also worth noting these features are optional and not sent by default, so the standard use cases won't get them. 16:34:28 Update on noise privacy 16:34:29 topic: privacy simulations 16:34:58 (Addresses concern about codepoint sets revealing content) 16:36:13 zakim, who is here? 16:36:13 Present: Vlad, skef, Garret, bberning 16:36:15 On IRC I see JHudson_, skef, Garret, Zakim, RRSAgent, Vlad, Mike5Matrix, drott, emilio, MariaAngelaPileri, Github 16:36:29 present+ JHudson 16:36:46 Added to the simulation: Making the additional codepoints added proportional to the needed number of codepoints 16:37:07 The variable metric works better than the fixed metric in the simulation 16:37:11 scribenick: skef 16:37:55 Roughly doubles the size (with Latin) but is still well under the size of a conventional font 16:38:50 Also tested some additional languages, including Arabic 16:39:46 Also worked well: didn't need to add too many codepoints to match many pages 16:40:38 Similar with Devanagari and Japanese (although the page set for the former was small) 16:40:54 Chinese and Korean results were similar to Japanese 16:43:47 Question: To what extent will noise reduce the need for future requests. 16:48:18 Skef: one concerning thing is the impact this has on caching. This would break the ability to do regional caching. We likely want to have the randomization seeded by the content. 16:48:23 Re: noise - specifically because the noise is frequency weighted 16:48:42 Garret: Agreed, and this may actually be better from a privacy perspective since you won't have multiple requests adding new noise to the same content. 16:48:54 Garret: I'll look into this more and see if we should include it. 16:49:26 next agenda item 16:52:50 topic: caching [issues #119](https://github.com/w3c/IFT/issues/119) 16:57:37 Garret: started a tag review. There was a comment that there was still some open issues from the HTTP review. I clarified only patch subset is in scope forthe review and caching concerns are the only open issue on the patch subset side. Planning to add additional spec text to explain caching concerns/implementation approaches to enable caching server side, client side, and with a cdn in the middle. 16:59:59 Vlad: next issue is about the expiring brotli spec. 17:00:12 Garret: I'll check in on that and see if we can get it renewed. 17:00:26 Garret: we have a fallback of copying the relevant paragraph from the specification if we need to. 17:02:16 zakim, who is here? 17:02:16 Present: Vlad, skef, Garret, bberning, JHudson 17:02:18 On IRC I see JHudson_, skef, Garret, Zakim, RRSAgent, Vlad, Mike5Matrix, drott, emilio, MariaAngelaPileri, Github 17:02:45 rrsagent, make minutes 17:02:46 I have made the request to generate https://www.w3.org/2023/06/06-webfonts-minutes.html Vlad