Publishing Working Group Telco — Minutes

Date: 2018-08-27

Attendees

Present: Deborah Kaplan, Avneesh Singh, Teenya Franklin, George Kerscher, Tzviya Siegman, Gregorio Pellegrino, Caitlin Gebhard, Dave Cramer, Ivan Herman, Matt Garrish, Rachel Comerford, Tim Cole, Romain Deltour, Ric Wright, Jeff Buehler, Derek Jackson, Garth Conboy, Marisa DeMeglio, Hadrien Gardeur, Ben Schroeter, Bill Kasdorf, Brady Duga, Charles LaPierre, Zheng Xu, Chris Maden, Juan Corona, Karen Myers

Regrets: Luc Audrain, Laurent Le Meur, Nick Ruffilo, Murata Makoto, Adam Sisco, Jean Kaplansky, Franco Alvarado, Benjamin Young, Josh Pyle

Guests:

Chair: Tzviya Siegman

Scribe(s): Matt Garrish

Tzviya Siegman: https://www.w3.org/publishing/groups/publ-wg/Meetings/Minutes/2018/2018-08-20-pwg.html

Resolution #1: last week’s minutes accepted

Tzviya Siegman: minutes approved

0.1. issue 291, TOC

Tzviya Siegman: take a look at Rachel’s summary style and please try to adopt it for issues

Tzviya Siegman: https://github.com/w3c/wpub/issues/291#issuecomment-415510019

Tzviya Siegman: Differing opinions about how to specify the ToC in WPUB. Some are of the opinion that a ToC can include anything as long as the HTML includes aria role=”doc-toc” to signal to the UA that it is the ToC. Some are of the opinion that the ToC must be more strictly defined so that the ToC can be processed in a standard way by UAs.

Garth Conboy: there is contingent of people who think we shouldn’t put substantial constraints on the toc to work with the web, but for reading systems we need a toc that can be machine processable. We ought to have the ability to provide a machine-readable toc similar to the epub 3 nav doc that can also be pointed to

George Kerscher: There would be 0 or 1 file with DocTOC?

Deborah Kaplan: when there is a toc that cannot be machine readable there should be the option for a nav doc

Dave Cramer: all html documents are machine readable because that’s what UAs do - the question is more are we trying to create something they will be happy with - a user agent can do something just by taking all the anchor elements, for many books that will be useful - may not get you nesting, but could look at placement in the dom - better to look at what machines can do than constrain authors

Tzviya Siegman: one of the things we’ve seen with epub is that the restrictions on epub table of contents are not awful - don’t restrict people too much - but when there are multiple tables of contents I use the machine one - I think the simplest way to consensus is to reduce the restrictions and still allow authors to include what they need - you can still have two tables of contents

Tzviya Siegman: the one that is processed by reading systems will be the one with somewhat restricted html

Deborah Kaplan: +1 tzviya

Garth Conboy: the epub restricted navigation document is successful but also need the option to allow more expansive tocs

Dave Cramer: i think styling is orthoganal to the issue - what css uses to make it look nice is independent of the markup

Bill Kasdorf: it’s not about making the toc look pretty but needing more flexibility - in education you need a longer table of contents with sidebars and other content

Tzviya Siegman: the proposal is not to restrict the number of tables of contents - only the markup for one

Deborah Kaplan: +1 dauwhe.

Tzviya Siegman: +1 to dauwhe

Dave Cramer: if we were to adopt something like epub 3’s restrictions we should look at whether we can relax the restrictions - like allowing either ul or ol as the root list element

Deborah Kaplan: But maybe allowing ul is one thing, but not allowing, say, sliders. :)

Dave Cramer: what will give us the most benefit instead of just demanding one way

Bill Kasdorf: +1 to both Tzviya and Dave

Tzviya Siegman: it sounds like we’re coming to consensus that restrictions should be placed on the referenced table of contents - #2 in the github issue

Romain Deltour: putting restrictions on the html would be a way to help implementers extract the toc data - goes against the priorities of constituencies - authors should have the flexibility and the implementer should have to figure out the algorithms to extract - i would rather come up with an algorithm to extract the toc in a predictable manner

Ivan Herman: i am trying to understand where we are - at this moment in the manifest we have a way to address the toc - are we saying that that element must have a structure we have to define - or is it required? if it is not required we can acknowledge the fact that there are two types we can provide - if the author doesn’t want to use one she can use a different kind of toc but don’t expect some

Matt Garrish: regularity in the reading system

Hadrien Gardeur: Edge has a native UI for ToC as well

Deborah Kaplan: I disagree, dauwhe: I think it’s arguably also AT centric

Dave Cramer: I agree with what romain said and this discussion is very epub-centric - we are expecting that the UA will want to process and display the toc in an entirely different way - does not normally happen on the web

Bill Kasdorf: +1 to Dave. This is an issue that should probably be a point of difference between a Web Publication and an EPUB.

Tzviya Siegman: unless we’re saying there is a way to structure the content, it makes it harder to trigger an algorithm

Tzviya Siegman: i like the idea of relaxing the restrictions, though

Garth Conboy: i don’t think authors need to worry about this because tools will handle - if user agents don’t do special things with WPs then I think we’re wasting our time - having two pointers to tocs might be the best solution

Deborah Kaplan: i don’t want a machine to have to compile a toc

Dave Cramer: i would like to require that everything in the linear reading order be in the table of contents - we need examples of tocs and see whether they can be machine processed and discover why not

Avneesh Singh: from accessibility point of view the hierarchy of the publication is what makes it different from the web general - beyond machine readability some kind of restriction would help

Romain Deltour: what happens when the author doesn’t respect the restrictions? we have to specify what a UA must do then. But when we specify that, and unless we tell the UA it can ignore the toc, why do we need to specify the restrictions in the first place?

Tzviya Siegman: in the real world we could get all kinds of tag soup in the toc

Brady Duga: are you looking for things that will break? that’s very easy - just include a picture of a table of contents

Dave Cramer: so a table of contents must have links

Tzviya Siegman: which is an example of a restriction

Garth Conboy: it needs to have structure - nesting - so list elements - we’ve learned this lesson in epub

Romain Deltour: I don’t see what kind of restriction would be useful - if the toc is useless that’s the author’s choice

Deborah Kaplan: I’d like to be able to tell AT how to navigate a wpub’s TOC

Tzviya Siegman: the concern comes from UAs - they will be required to generate some kind of navigation - we need to have accessibility and multiple ways to access the content - if the author provides an image it won’t help the UA

Romain Deltour: we don’t require accessibility - accessibility is something the author has to ensure - handling a toc that doesn’t respect the restrictions will have to be handled no matter what

Romain Deltour: so long as we define how to extract data why do we need restrictions

Avneesh Singh: wcag talks about multiple ways but it doesn’t define how the hierarchy must be presented - in epub we have lists - but if publisher is providing a toc that is clear to a sighted reader and not to blind then it is not accessible - having some rules ensure consistency

George Kerscher: having no doc-toc for a publication would be fine in some cases - if there is one pointed to in the manifest then it should be possible to evaluate what the UA is going to do with the markup

Tzviya Siegman: validators on the web are nice to use but the content has to work even if content isn’t validated

Ivan Herman: at the moment we have the concept of a toc in the manifest - it is in the reading order or resource list - has to point to an html document which contains an element marked with role=doc-toc - i propose that we have two: one with doc-toc and another with some other semantic

Hadrien Gardeur: +1 to what ivan said, but need to discuss if this is in the context of WP or EPUB4

Ivan Herman: the user agent has the option to put one or both tocs into the manifest - we then have the idea to play with restrictions - epub 4 might have stronger restrictions for reading systems

Hadrien Gardeur: also need to discuss if the second (optimized for machine readability) is HTML or directly in the manifest

Ivan Herman: both could be marked as doc-toc but one has a stricter structure

Garth Conboy: both would be possible in WP

Ivan Herman: in epub perhaps we only want the restricted one

Tzviya Siegman: what is the purpose of having two tables of contents?

Ivan Herman: the user agent could decide if the restricted version isn’t available if does nothing

Ivan Herman: we could have a different rel value for the loosely-structured toc

Hadrien Gardeur: I’m in favour of having two tocs, but what is the context - do we need two html nav elements or should the machine-readable one be purely in json

Tzviya Siegman: if authors want to create a separate toc that’s fine and it can be anywhere

Ivan Herman: authors still have choice to provide one, none or both of the toc types - I’m not in favour of putting the toc in json as putting it in html will be simpler for authoring

Deborah Kaplan: Hadrien, Tzviya wasn’t saying more confusing for UAs, but for authors

Hadrien Gardeur: having two documents shouldn’t make things more complex - with the current toc you can only provide a way to get to it - with a machine readable the UA can present it - not complicated for UAs

Garth Conboy: I tend to think having them both html leaves open the possibility of having a toc that is both displayable and machine-readable

Tzviya Siegman: so the proposal is to have two table of contents - one structured and one with an open structured - you can have any of these, including none

Garth Conboy: +1

Deborah Kaplan: +0

Hadrien Gardeur: +1

Tim Cole: +1

Bill Kasdorf: +1

Ivan Herman: +1

Jeff Buehler: +1

Caitlin Gebhard: +1

Romain Deltour: -0.1

Derek Jackson: +1

Garth Conboy: (max one structuredd, max one unstructuredd)

Avneesh Singh: +1

Ric Wright: +1

Marisa DeMeglio: -1

Zheng Xu: +1

EvanOwens: +1

Dave Cramer: -0

Deborah Kaplan: people are already confused by having too many choices in epub - worried that no one will get it right or understand why the possibilities exist

Romain Deltour: don’t understand why the structured one when an algorithm can process the html

Ivan Herman: outlining algorithms are a bottomless pit

Romain Deltour: not to compile an outline but only to process the toc markup

Tzviya Siegman: need to consider in the future that validation cannot be expected

Garth Conboy: a pull request to match the the proposal would be useful

1. Resolutions

Resolution #1: last week’s minutes accepted