EPUB 3 Working Group Telco — Minutes
Date: 2021-06-10
See also the Agenda and the IRC Log
Attendees
Present: Toshiaki Koike, Matthew Chan, Masakazu Kitahara, Wendy Reid, Dave Cramer, Shinya Takami (高見真也), Matt Garrish, Ben Schroeter, Dan Lazin, Marisa DeMeglio
Regrets:
Guests:
Chair: Dave Cramer
Scribe(s): Matthew Chan
Content:
1. Removing the PLS section
See github issue #1690.
Matt Garrish: basically we created this for 3.0 without W3C input, and as it turns out there hasn’t been wide uptake
… if we really want this to work it shouldn’t be the epub content document spec
… is it time to remove this until there is some real world implementation? possibly after taking this to W3C
Dave Cramer: so the spec text basically just describes how to use the link element, which is really an HTML thing, not an epub specific thing
… do we keep statements about these things not requiring fallbacks?
Matt Garrish: we already say link element doesn’t need fallback, nothing changes by cutting this
Dave Cramer: yes, i’m also concerned about people seeing this and thinking that it does more than it really will
Matt Garrish: right, and if we really want this to work, we should take it from here and make a standard for it, or turn it into a note
Marisa DeMeglio: good in theory, but i’ve never seen it used in the wild
Matt Garrish: there are also other avenues to pursue to get to the same goal, parts of ARIA for example
Proposed resolution: Remove PLS section from the authoring document, close issue 1690 (Wendy Reid)
Ben Schroeter: +1
Toshiaki Koike: +1
Shinya Takami (高見真也): +1
Matt Garrish: +1
Masakazu Kitahara: +1
Wendy Reid: +1
Dave Cramer: +1
Resolution #1: Remove PLS section from the authoring document, close issue 1690
Dave Cramer: do we need to have the CG look at it? No?
Marisa DeMeglio: https://www.w3.org/TR/2021/WD-spoken-html-20210518/
Wendy Reid: if any issues come up, we can redirect to one of the WG or CG that are working on this already
Marisa DeMeglio: interesting work done with SSML in HTML
2. URLs and the package document
See github issue #1681, #1374, #1688, #1686.
Dave Cramer: this is a bunch of issues that revolve around how you interpret URLs in the package document, especially if they’re absolute URLs
… came from an issue in epubcheck
… and there’s also an older issue about what the IRI of the package document is
… or what if there are file scheme URLs in the manifest
… and what happens if two URLs resolve to the same item in the manifest?
Matt Garrish: in epubcheck there was a root-relative URL that caused an error, and that spawned all of this
… e.g. “/something/thing”
… so what is the root of the epub?
… to me it doesn’t make sense that we even allow these root-relative URLs
… the root differs based on the RS
… and Romain mentioned that we require that all resources resolve to something inside container, but depending on what RS does, there is even ambiguity about what that even is
Dave Cramer: in issue 1688 Romain he suggests that manifest items should have one of the special schemes (except file:
)
Matt Garrish: there are edge cases where file scheme items make sense, but not generally for epub
Dave Cramer: it goes against epub as a portable format, and the file scheme ties the epub to a specific file system
… how much out there does have file URLs on purpose, not by accident?
Matt Garrish: never heard of one
… and they’d end up being remote resources
Dave Cramer: okay, so what if we just say no file URLs in epub?
… what is the risk that we break something?
… maybe this is something where we try to enforce it and see if anyone complains
Matt Garrish: most RS probably won’t do anything with file URL
… probably security concern
Wendy Reid: depending on platform you might not even be able to access parts of the file system (e.g. iOS apps)
Dave Cramer: can we start by resolving on this point from 1688?
Proposed resolution: Absolute URLs for manifest items should have a special scheme that is not
file:
, close issue 1688 (Wendy Reid)
Dave Cramer: +1
Matthew Chan: +1
Matt Garrish: +1
Wendy Reid: +1
Toshiaki Koike: +1
Shinya Takami (高見真也): +1
Ben Schroeter: +1
Resolution #2: Absolute URLs for manifest items should have a special scheme that is not
file:
, close issue 1688
Dan Lazin: is there a use case for some of these other schemes? Why would you have an FTP in your epub?
Matt Garrish: if we go too far, do we prevent future stuff? will we have to come back and re-add this in the future?
… FTP kind of fits within the web framework
… maybe we just leave it to authors to stick with HTTP, HTTPS, etc.
Ben Schroeter: is the idea that if we disallow file scheme, then we also disallow “slash URLs”?
Matt Garrish: not sure those are the same
… i think 1681 is contingent on us forcing RS to unpack epub in a certain way
… otherwise we can’t say there is a single consistent root that can be referenced
… and we don’t tell RS how/where to unpack right now
… this kind of came up 5 years ago with multiple rendition, but we left it buried in the discussions we had
Dave Cramer: what would be the consequences of forbidding root-relative paths?
Matt Garrish: not sure there are any, because epubcheck had forbidden these until a recent update
… we’re reasonably safe from backwards compatibility point of view
Dave Cramer: and this is just for href on manifest?
Matt Garrish: no, this would be anywhere, e.g. in content docs too
… all the “../” stuff would still be okay
… i proposed somewhere that we say all content must be below the packat document
… if we could enforce an authoring requirement that made a root, then we could enforce these relative paths
… but maybe its cleaner to just disallow them
Dan Lazin: do we support the base tag?
… and does that have implications for the handling of these issues?
Dave Cramer: we’ve been phasing out xml:base
, its been forbidden from package file for example
Dan Lazin: the base tag allows you to define what the relative path is relative to
… so if we’re allowing or disallowing certain types of URLs, maybe we should take a stance on base too
… not sure what stance though
Matt Garrish: base would force you to have all external resources, right? It exists, but I don’t imagine anyone really going there
Dan Lazin: https://developer.mozilla.org/en-US/docs/Web/HTML/Element/base
Marisa DeMeglio: there was a resolution a few weeks ago about dumping xml:base
from the spec
Dave Cramer: and that’s separate from the HTML base element
… i think i just want to say no root-relative URLs
Dan Lazin: if you set base to some website, and then use root-relative URLs, your URLs would appear to be relative, when they are actually absolute
… but maybe that’s too far of a stretch
Dave Cramer: but can we really say anything about base because its part of HTML?
Matt Garrish: so you must not use root-relative URLs unless you use a base?
… but it also applies to SVGs, to the package document…
Dan Lazin: what was the harm in not banning root-relative?
Matt Garrish: because the RS might treat zip root as the root, but they could also treat location of package doc as root
… so no consistent root
Dan Lazin: maybe permit it, but use SHOULD NOT?
… is it acceptable for an author to write an epub for a specific RS?
… and where it has undefined behavior for other RS
… probably acceptable, right?
Dave Cramer: yes, e.g. with books that only work with iBooks because of scripting support
Matt Garrish: maybe just a note that root-relative could cause issues if authors use it?
Dave Cramer: so does that mean that there are epubs that could be built to work in some RS, but expose an interop issue if opened in another RS?
Matt Garrish: right
… usually this happens in epubs that try to go from one folder to a sibling folder
… but when all content is below the package document its fine
… but we don’t specify that right now, only that content must be below the root
Dave Cramer: not sure what the right course of action is, but maybe we can continue this another time with Romain present
Wendy Reid: we need RS people here on next call that know exactly what RSes are doing right now
Marisa DeMeglio: one of the github threads has a sample, but I wasn’t able to download it
… maybe if we wrote to the mailing list Romain could provide samples
… would also love to have a list of epubs that must absolutely continue to work
Dan Lazin: I have filed https://github.com/w3c/epub-specs/issues/1699
Matt Garrish: also, there’s not much hand authoring, and most tools will put all the content into one folder
… we only ran into an issue with this with multiple renditions, and that hasn’t really gone anywhere
… so is this maybe more of a theoretical issue
3. AOB?
Wendy Reid: the agenda item about the a11y note is postponed
Dave Cramer: and a reminder that we have TFs
Wendy Reid: locators TF has slightly changed times
… moved to the same week, but friday at 10 am
… eastern time
… boston
Dan Lazin: do we know what the fallback algorithms are in JP RS?
… in most RS we’re familiar with, they have a proprietary way of counting position
… e.g. every 1000 bytes, or every 1024 bytes of the zipped file size
… would love to know what JP RS are doing
Shinya Takami (高見真也): I’ll encourage JP RS people to join the TF
Ben Schroeter: i think there’s an issue with the TF invite
Wendy Reid: okay, i’ll fix
Dave Cramer: RRSAgent: bye
4. Resolutions
- Resolution #1: Remove PLS section from the authoring document, close issue 1690
- Resolution #2: Absolute URLs for manifest items should have a special scheme that is not
file:
, close issue 1688