Publishing Working Group F2F, 1st day — Minutes

Date: 2017-11-06

Attendees

Present: Ivan Herman, Mateus Teixeira, Luc Audrain, Garth Conboy, Leonard Rosenthol, Tzviya Siegman, Dave Cramer, Charles LaPierre, Ben Schroeter, Brady Duga, Chris Maden, Ric Wright, Rachel Comerford, Takeshi Kanai, Laurent Le Meur, George Kerscher, Matt Garrish, Romain Deltour, Reinaldo Ferraz, Benjamin Young, Tim Cole, Cristina Mussinelli, JunGamo, Evan Yamanishi, Marisa DeMeglio, Avneesh Singh, Liisa McCloy-Kelley, Toshiaki Koike, Tess O’Connor, David Wood, Bill McCoy, Lillian Sullam, Hadrien Gardeur, Daniel Weck, Baldur Bjarnason, Dave Cramer, Paul Belfanti

Regrets: Chris Maden

Guests: Dave Browning, Tess O’Connor, Dan Sanicola, Jeff Jaffe, Richard Ishida, Addison Phillips, Fuqiao Xue, Kenneth Rohde Christiansen, Jasmine Mulliken

Chair: Tzviya Siegman, Garth Conboy

Scribe(s): Ric Wright, Brady Duga, Lillian Sullam, Rachel Comerford, Mateus Teixeira, Leonard Rosenthol, Cristina Mussinelli

Content:

1. Work Mode
2. FPWD update
3. Lifecycle and Manifests 1
- 3.1. Flow chart for WPUB, Lifecycle
4. I18N
5. Packaging
- 5.1. PDF as PWP Profile
- 5.2. Zip and packaging
6. Resolutions

Ric Wright: Introductions first

Garth Conboy: Dinner: https://docs.google.com/document/d/1X2BgP5mP1xuwQvT-IK1xNTnsw73wrJB_fdtxbHEOp34/edit

Garth Conboy: Dinner plan is in your email, across the freeway, an easy walk (see link above)
… Room holds 22, please sign up. Everyone needs to kick in $30 Give money to Garth, but be sure to get a receipt!

Tzviya Siegman: welcome everyone. We are going to go over protocols, general plans, etc.
… for tomorrow note that we are swapping the morning and afternoon agendas

Garth Conboy: Dinner location: https://www.fattoriaemare.com/

Garth Conboy: MOVED PUBLISHING WG DINNER TO 7:00p — to make room for folks attend most of the Benetech reception 6:00p to 7:00p at the Hyatt 3SIXTY bar.

1. Work Mode

Tzviya Siegman: https://www.w3.org/publishing/groups/publ-wg/WorkMode/

Tzviya Siegman: same old procedures. Everybody should know the drill. But make sure you are signed up to watch all the repositories on github
… patent policy is the same as always (W3C)

Ivan Herman: We need to be very careful to keep github issues focused so don’t mix issues. If in doubt, open a new issue
… Note that if we get to recommendation point, then if we keep all the issues/repos in good shape we will be better off
… Query: Are there still workflow problems we need to look at

Tim Cole: Bear in mind that we work/interact with other groups, so we need to keep those interactions visible

Ivan Herman: we have a good set of labels for the issues, but if you feel that others are needed, please let Ivan know

George Kerscher: note that more people will be joining the WG, so if we have a guide to help the on-boarding process

Ivan Herman: there is such a “newbie” guide. If anyone sees that anything is wrong or missing there, please generate a PR to correct that

2. FPWD update

Ivan Herman: Just to remind everyone, the first working draft is extremely important. So it is critical that the first draft be clear and precise

Tzviya Siegman: FPWD https://w3c.github.io/wpub/

Ivan Herman: This is especially important as this group is new and there is some concern in other WGs that we may try to “re-invent web”
… this doesn’t mean that we have to solve all the technical issues in the first public draft.
… we need to publish that first public draft by the end of 2017
… the FPWD is also important because it may have patents, implicit or explicit
… we need both the WP and PWP documents, though the latter is likely to be much shorter
… also need a first draft of the EPUB-Aria-2 spec by March
… also we need a FPWD of EPUB4 by end of Q1 2018 (yikes!)

George Kerscher: what are the expectations for the samples that will be in FPWD?

Tzviya Siegman: This is TBD, depending on what the actual content of the FPWD is

Luc Audrain: Perhaps we need to create some of those examples

Ivan Herman: That is a goal but not a requirement for FPWD

Tim Cole: We also are on the hook for a draft on locators. What is the expectation for that?

Ivan Herman: in the charter we wave our hands so it is OK to, for example, to have a separate Locator document

Leonard Rosenthol: Are we going to discuss Locators tomorrow?

Ivan Herman: yes

Tzviya Siegman: Do we think we are on track?

Leonard Rosenthol: yes

Tzviya Siegman: Most of the people in the WG are from the publishing industry. We need to ensure that we garner support from other members/WGs of the W3C

Ivan Herman: This is one of the reasons we need to get the FPWD out, so we can beat the bushes to get that support and input
… Also, we need to reach out to the browser community and try to get participation or at least input from the big 4 browsers
… They have said they are waiting to see the FPWD

David Wood: Has there been preliminary discussions with the browser-folks?

Ivan Herman: We first need indicate what our direction is, i.e. the FPWD THEN it will be time to engage in that conversation

David Wood: I believe that engagement takes a long time so we need to get started

Leonard Rosenthol: +1 on waiting till the FPWD is published

Ivan Herman: The browsers have indicated they are waiting for the FPWD

David Wood: In our experience it took years to get that dialog going and see any fruit (i.e. implementation in the browser)

Luc Audrain: It is important to start testing our proposals against the browsers. This can start the dialog

Ivan Herman: That is correct, this is part of starting the dialog. We have seen this in the CSS WG

Tim Cole: Note that we need to ensure that that the specs are written in such a way that they are testable

Ivan Herman: this is very true, we are still looking for a testing champion

Tzviya Siegman: Yes, this is very important. We need someone to step up

Mateus Teixeira: willing to help but cannot lead the effort

Takeshi Kanai: I would be able to help out but cannot take the lead

Ivan Herman: We really need some one to step up to do this.
… there are some extensive tests in the W3C for testing, but it isn’t clear that this will be sufficient

Tim Cole: One of the aspects we need to pay attention to is to verify that the features we spec are implementable (as opposed to testable)

Benjamin Young: We are not ready to write tests because we don’t have the spec, but we need people who will keep track whether the spec is both implementable and testable
… You don’t need to be a developer for this

Romain Deltour: Do we have any concrete requirements for what the tests will be like? API? Declarative? JS?

Ivan Herman: It is up to us to decide what the testing environment and the tests themselves are like
… The key is that when we get to point of a recommendation, that we have a concrete plan to test the specification

David Wood: In the absence of tests which we don’t have yet we do need to layout a skeletal plan for what those tests might be.

Benjamin Young: +1 to dwood’s thinking here

Tzviya Siegman: Are there existing resources in the W3C to help us in this process?

Ivan Herman: not really. there is a test harness but there are no dedicated resources
… and it is clear that the web-platform tests will not be sufficient

Tzviya Siegman: https://github.com/w3c/web-platform-tests

Laurent Le Meur: EDRLab has no experience in testing a specification. But as EDRLab will implement the spec in Readium, we’ll fully test our implementation of the spec, which is already a good asset.

David Wood: Specifically, I suggested that GitHub issues should be created against spec sections. Further, GitHub tags for the various types of tests (DOM, API, manual, etc) should be added to the issues in order to assist the eventual test coordinator.

Tess O’Connor: I have some experience.

Tzviya Siegman: ac krk

Ric Wright: I am happy to help but can’t commit to leading it. I do have some TIME available, but can’t afford to work for free (my management objects)

3. Lifecycle and Manifests 1

Matt Garrish: https://github.com/w3c/wpub/wiki/Options-for-Processing-a-Manifest

Benjamin Young: We are calling these Bill of Lading

Benjamin Young: well…it’s clearer anyhow :)

Garth Conboy: Or even Waybill

Benjamin Young: garth++ much shorter

Matt Garrish: Questions - if we go with Web App manifest, how does the work withthin what we want in WP
… App itself is self contained
… not true with publications
… web app manifest and WP are different use cases
… web app is installation, WP has pagination, etc, etc
… Can we extend web app manifest for our needs? [WAM here out]
… Use WAM as is, or extend WAM, or make our own thing?

Garth Conboy: This link? https://github.com/w3c/wpub/wiki/Options-for-Processing-a-Manifest

Leonard Rosenthol: @hadrien - IIF? Image operability?

Benjamin Young: +1 Matt!
… WAM doesn’t really manifest anything

Hadrien Gardeur: @leonardr yes

Benjamin Young: points to a URL, and has some stuff about icons.
… We need something bigger, with, say, a Manifest (or Bill of Lading, etc)

Luc Audrain: Waybill : http://www.businessdictionary.com/definition/waybill.html

Benjamin Young: We need a lot more in our manifest

Leonard Rosenthol: One thing we are divided on is whether a pub can also be an app

David Wood: +1 to bigbluehat’s comments. We might also need support for authentication components in the WAM on a per-resource basis.

Leonard Rosenthol: Can the user experience be bundled with the publication itself

Benjamin Young: You can use a WAM to point at a single web page
… They are starting to hook in service workers

Garth Conboy: Seems like a bad direction to allow bundling of publications

Luc Audrain: +1

Garth Conboy: Content can last a long time, but Reading software will stop working

Tzviya Siegman: +1 to garth

Ivan Herman: WAM is registered as an official media type
… browser will have built in behavior for that media type, even if we don’t want it to
… So if a built in behavior conflicts with what we want to do, that can bite us
… We can not tell the browser to not do certain things

Benjamin Young: Web App Manifest (WAM) https://w3c.github.io/manifest/

Ivan Herman: We have a divergence, what do we do?
… . 1) diverge entirely
… 2) ask them to accept we diverge and get them to split the spec
… We could share some data structures, but actual processing diverges.
… Need the WAM people here to help us decide if 2 is a reasonable path forward

Bill McCoy: Bikeshedding on names
… Waybill isn’t publication specific
… find something that expresses structure

Benjamin Young: Table of contents is good!
… Other topic: what happens when a browser sees a Waybill by itself?

Ivan Herman: What does it mean then that there is a claim that several browsers support WAM?

Benjamin Young: The manifest is irrelevant to the browser until the user says “download”
… The presence of download UI is triggered by the reference, not the media type
… WAM and Waybill are very different models
… not sure WAM is ready to have us cram all our stuff in their spec

Tim Cole: typically the trigger is the rel type in the link, e.g.,

Benjamin Young: WAM may be a bad foundation

Romain Deltour: +1 bigbluehat

Hadrien Gardeur: @timCole and the fact that the rel is the trigger is IMO a bug, not a feature of the WAM spec

Ivan Herman: Welcome Kenneth from WAM

Benjamin Young: Summarizing for Kenneth

Luc Audrain: FWIW : https://www.w3.org/TR/appmanifest/

Kenneth Rohde Christiansen,: We try to keep WAM as simple as possible, to avoid adding things we later regret
… using other tech (eg service workers) to provide functionality (like file lists)

Benjamin Young: We want something between appcache and service workers

Garth Conboy: Things in our manifest: metadata, default reading order, list of other resources (not primary), and a fourth one? Which he forgets

Benjamin Young: The Lifecycle section of our spec https://w3c.github.io/wpub/#wp-lifecycle

Ivan Herman: Our view of WAM is, you install something that is an app by itself
… publishing is much more declarative view
… the app part is outside the manifest

Leonard Rosenthol: We disagree

Ivan Herman: Ok, then “not necessarily” included
… our manifest is data, full stop

Kenneth Rohde Christiansen,: WAM has metadata about the app
… like should there be a status bar, etc
… in WAM, pushing as much as possible to service workers

Laurent Le Meur: WP manifest could be a subclass of WAM. But personally, don’t see it that way
… We may have multiple pubs in one app
… many people will want to choose what app to read the pub in (eg a11y case)
… we don’t want to rely on the web, like service workers, since we may not always have the web case (eg reading an book)

Kenneth Rohde Christiansen,: Don’t see a problem using WAM for that
… currently, the manifest points to service worker, but you could point at something else

Benjamin Young: You can do that today, did it for my demo
… Made a service worker that had an html doc that pointed at the actual resources. Then could have a WAM that points at that doc
… doable now, but question is, what is the Bill of Lading? Where does it live? Do we need a service worker?

David Wood: WAMs are a good idea
… We have a need for an app specific purpose, not something that is general purpose
… A little scared by: caching. Have very special needs for that in WP
… we also have reading order
… To get something into WAM, have a publishing section of WAM, or have a special profile of WAM
… either way, we are in a bad place, because we won’t know if a browser that supports WAM supports our specific WAM
… Browser understands WAM, grabs our data, then can’t do anything with it
… Better to have a Waybill specific to us, so we know the browser supports it

Tim Cole: https://www.w3.org/TR/appmanifest/#linking

Kenneth Rohde Christiansen,: You can specify it so we know what the WAM is [but I missed the details because bigbluehat was messing around]

Tim Cole: One possibility, you can link differently

David Wood: Relying on the browsers to implement whether it supports a specific route or not

Tim Cole: Already has to do that

David Wood: If you have a content type that is unsupported, the default behavior will be different
… browser will know if it supports a type or not
… Different than the link following behavior

Ivan Herman: Is there an expected behavior is a browser has a WAM link?

Kenneth Rohde Christiansen,: If you are adding to home screen then it is defined

Benjamin Young: what Kenneth just described is what we attempted in the HTML First demo https://dauwhe.github.io/html-first/MobyDickNav/index.html

Kenneth Rohde Christiansen,: Likes the link, because browsers could do something different
… like show a book icon
… could have two links, one for the Book, the other for a web app to install a book reader

David Wood: My concern is that the default behaviour of a browser for a WAM might not be appropriate for a WebPub manifest (waybill). Thus, we need a mechanism to differentiate how a browser will react (including in the face of differentiated browser actions/bugs).

Tzviya Siegman: Pubs have lots of metadata

Ivan Herman: Want to make clear what the info is. Some overlaps with WAM, but we have a lot more
… There should also be a property to say, if you want more metadata or “here”

Kenneth Rohde Christiansen,: The only fear I have is if you extend it, it would need prefixes
… To avoid conflict
… Also good to decide if our metadata makes sense in general for WAM to pull it back

Ivan Herman: Say we use the link mechanism, the content of what we refer can be the same as what a WAM refers to?
… content, structure, etc?

Kenneth Rohde Christiansen,: With a WAM, most things are optional
… you would just add some more stuff to it

Ivan Herman: Current WAM, data that is stored is one section, installation is separate section
… can we split this into two specs?
… Since we may not care about installation

Kenneth Rohde Christiansen,: You could also make a new spec
… And say it is a WAM, then say what is different
… just point to the section you care about
… splitting might be confusing

Romain Deltour: We are talking about pubs, and browser behavior, and how it fits with what we want
… thinking about WAM, looking at what is progressive on top of WAM
… has several pubs installed as WAMs
… works great
… now we need to install in another app
… There are some details, like must install in a top level browsing context
… but in general we are not in conflict and we need progressive enhancement

Kenneth Rohde Christiansen,: Browsing contexts are there for security reasons
… . but maybe could relax for pubs

Benjamin Young: Has a demo
… shows relationship between WAM and Bill of Lading and the experience of what happens when you use them
… Bill of Lading is additive to WAM, much larger in scope in various needs (like offlining and sandbox)

Liisa McCloy-Kelley: +1

Tzviya Siegman: Next steps?
… we have a way of working with WAM
… but Marcos said “show us demos!”

Benjamin Young: What we build is orthogonal to WAM

Kenneth Rohde Christiansen,: But we should coordinate
… and we can decide to move some things to WAM

Tim Cole: Tend to agree but concerned the only commonality is there is a list of things
… is that enough to make this reusable?
… Are using most of WAM and adding? Or is it just the manifest we are using? Is it too much of a headache to use WAM?

David Wood: bigbluehat, In your conception, would each publisher define their own service worker?

Ivan Herman: Still wondering about next steps
… We have kept away from putting into specific serialization

Benjamin Young: dwood: I’d hope there’s a browser/reading-system one, but it could potentially be overridden by a publication-specific one (perhaps)

Ivan Herman: wanted to clarify what information we really need for a WP

David Wood: bigbluehat That makes sense to me. Thanks.

Ivan Herman: what would be the next step? Define what have and put it into JSON, then prefix wpub, then look at WAM
… if there is an existing term, then use WAM
… And WAM WG looks at our list and decides to incorporate it
… Then what we have, is it the same media type as WAM?

Kenneth Rohde Christiansen: Depends. If you want both app vs pub installs then different media types, if you don’t care then use the same.

Benjamin Young: That is the distinction. WAM is additive, but our doc is mandatory
… lifecycle needs to be done first

David Wood: +1 to Hadrien, a rel shouldn’t be a different media types. That would be mixing approaches.

Benjamin Young: do I need to visit the about page 3 times before we get the WP reading experience
… If it is something you don’t care about then can reuse WAM type

Kenneth Rohde Christiansen: Not everyone looks at web apps the same way. MS is crawling the web, looking for apps

Benjamin Young: If we need a different UI the first time you enter a WP on the web you get the WP experience, then we need something other than just WAM model
… If not, then WAM is fine
… Basically, is Bill of Lading required or optional?

Kenneth Rohde Christiansen: Could be when you click on the book icon, you just get a new UI. Or it could be you download the book and add it to your library, etc
… nice that it is up to the UA

David Wood: It is “nice” for it to be up to the UA only if the UA is well-behaved.

Benjamin Young: Do you ever check to see if the WAM has changed?

Kenneth Rohde Christiansen: Up to the UA, Chrome checks on occasion
… but not defined

Takeshi Kanai: Concerned about post processing
… when I install thousands of titles, how do I find a specific title?
… For instance, how do we sort by name, etc?

Leonard Rosenthol: Seems like it is up to UA

Kenneth Rohde Christiansen: Seems like this is out of scope for the manifest

Tzviya Siegman: Wrapping up - what now?

Ivan Herman: Don’t want to plan on JSON format for FPWD
… but once we have a draft, how do we work together?

Kenneth Rohde Christiansen: Github, meetings

Ivan Herman: Love github, great for specific issues. But this is a bit higher level

Kenneth Rohde Christiansen: We can have meetings. Not on regular basis, but when we need to

Toshiaki Koike: toshiaki-koike has joined #pwg

Tzviya Siegman: https://w3c.github.io/wpub/#wp-lifecycle

3.1. Flow chart for WPUB, Lifecycle

Ivan Herman: See flow chart image

Matt Garrish: differences between web application and how it is launched and instillation
… good basis for moving forward

Benjamin Young: would like to see flow charts for web publication

Rachel Comerford: +1 bigbluehat - I was curious about the same thing

Benjamin Young: something that says this is the web publication on the web, the moment the user sees it, the moment the publication goes offline
… WAM is totally additive, we need to define when a collection of web pages becomes a publication

Leonard Rosenthol: I hear where you’re going Ben and I don’t disagree
… I just want to make sure we don’t jump too far into the user experience
… what is the tipping point from regular web experience to pub experience

Benjamin Young: for annotation we had to imagine user experiences
… some of the life cycle stuff has to need to know what the experience is
… what are those moments and how do we enable those

Matt Garrish: we launched into adapting the web app, pulled back too detailed, now we’re looking at going forward without getting into the technical details
… let’s run through the use cases to get more details

Rachel Comerford: I 100% support what you’re saying
… identifying the key points in the process helps know what the consumer/publisher want

Liisa McCloy-Kelley: are we always thinking about it as predefined
… we could see where someone wants something defined by the publisher, but also when someone would want to define it themselves

Leonard Rosenthol: keep the packaging stuff separate, as long as everything lives in one location, all you’re doing is building a manifest
… you could bundle up anything in a manifest that exists on the web

Tzviya Siegman: we need to define the life cycle events
… ben made a good point that we haven’t defined these moments yet

Rachel Comerford: offers to project

Leonard Rosenthol: whether we think that every web publication should have those things defined…

Tzviya Siegman: keep in mind we’re going to have to describe the things that Rachel is projecting
… we have a few events that we can define, so let’s think about these life cycle events are
… publish is an event

Benjamin Young: it doesn’t really matter if you put it on the web if the user can’t get it
… offlining means that the user can keep it

Tzviya Siegman: print publishing has dates, I don’t want to get there

Liisa McCloy-Kelley: you’re put something out in the universe

Leonard Rosenthol: there is something before you publish and that is you author

Matt Garrish: discovery how does the user agent even know how to find the publication

Ivan Herman: I can’t care about the thing with authoring or publishing
… I don’t a care about the workflow that lead to the point of finding the publication

Benjamin Young: matt do you mean that you are discovering it’s a publication

Ivan Herman: that’s our starting point and what the browser does and what events lead to that situation are irrelevant

Ivan Herman: discovery is done what happens next with the browser

Tzviya Siegman: I think we’re trying to put ourselves in mind of the user agent
… how do we make it a functional piece of the web

Leonard Rosenthol: we should tell the user agent how define one of these things

Benjamin Young: we can’t tell the browsers of the past what to do with it
… what are we adding to the browsers of the future
… what does the browser do?

Ivan Herman: the question of whether it needs an active user to display as a book is not up to us to decide
… it’s up to the browser, as the user of the browser I can set a preference, but not up to us in the working group to define this

Kenneth Rohde Christiansen: WAM made an analysis document

Ivan Herman: the web app manifest doesn’t impose on the browser to do this and this steps
… the same holds here…

Kenneth Rohde Christiansen: browser may show ambient batching, microsoft, google, I think it is nice there is innovation

Benjamin Young: we’ve discovered we have a publication…browser decides options…

Tim Cole: discovery, be able to identify what you found, select, obtain
… if your manifest supports all of those things
… that is what the record is about…those 4 things…we can talk about after obtain

Benjamin Young: that is a good start, there is more beyond that, there needs to be a reading order

Tim Cole: enough information for those 4 things

Benjamin Young: I’d prefer to decouple from the manifest

Tim Cole: I’m talking about what the record has to do, the web page has to have enough information to know that’s the 1984 I’m talking about

Ivan Herman: can we document what those actions are?

Leonard Rosenthol: Discover, Identify, Select, Obtain

Tim Cole: Here’s the publication: https://www.ifla.org/publications/functional-requirements-for-bibliographic-records

Benjamin Young: rachel can you take tim’s 4 words and put to right of discover

Ivan Herman: so discover, identify, select, obtain, what do these things mean?

Tim Cole: Select means there is enough in there to determine it’s what I want
… title is not enough
… would also need publisher

Benjamin Young: when you get it how much do you want

David Wood: you might not get the whole thing beyond the subset

Tim Cole: Chapter 6 is the relevant Chapter (User Tasks)

Tzviya Siegman: can I interupt? we only have kenneth for part of the day
… let’s use Kenneth’s time wisely
… we need to come back to this
… parallel road to WAM…let’s go back to next best step
… we can figure this out in a timely way

Kenneth Rohde Christiansen: it sounds good, WAP, do everything on github
… we don’t want to create a different working group for manifest

Ivan Herman: I still need to understand how the work we do connect to the subgroup

Leonard Rosenthol: until we know everything we need and the life cycle we can’t go back to the web app

Kenneth Rohde Christiansen: give us a list of what you have, and we can see if it makes sense and if have what you need
… we can have a more thorough review of what you need

Benjamin Young: once we’ve determined what we want and where we want it, we can figure out where WAM fits in the life cycle

Ivan Herman: yes, but what do we end up with at the end
… do we have to different manifest that try to synchronize

Kenneth Rohde Christiansen: you have a specific manifest that refers to WAM
… we have an example with web payments

Ivan Herman: that would be interesting to see

Kenneth Rohde Christiansen: payment method manifest
… it’s very very simple

Tzviya Siegman: https://w3c.github.io/payment-method-manifest/

Ivan Herman: what’s your timeline?

Kenneth Rohde Christiansen: living timeline
… google might have additions…more work discussing android, ios

Ivan Herman: for us where it becomes a problem, when we have a normative dependency, by the time we can to our standard…
… there is a need to have a recommendation where there is a “version 1”
… if you look at the planning there are recommendations very two years

Kenneth Rohde Christiansen: I don’t see any problem with that… living standard

David Wood: how to manage the interaction…argument against putting in a publication sub to WAM

Kenneth Rohde Christiansen: we’ll if it’s an extension and you’re extensions have a prefix I don’t see a problem

Leonard Rosenthol: as long as the WAM doesn’t have more ids…

Kenneth Rohde Christiansen: one user agent can say if there is no service worker we can’t put in our microsoft store, but how to handle that and if that’s a problem is invoation in the browser space
… we don’t say anything is mandatory, it’s more for the user agents
… up to the user agent

Tzviya Siegman: so how do we continue, next steps

Ivan Herman: come up with our view of manifests, clear manifest…these are our terms…once we have a clear idea we can synchronize
… whenever is a term in the WAM we would use, we use that, clear naming scheme

Kenneth Rohde Christiansen: I think it’s still good to create an issue on the manifest…other people might see it and think it’s interesting

Ivan Herman: flow chart is important let’s matt start to write things
… we have to talk about packaging…

Tzviya Siegman: half of the pagination people are not here

Benjamin Young: what comes after the four stages of discovery, identify, select, obtain?

David Wood: is keeping different from archiving?

Ivan Herman: yes, but we’re not looking at that now
… what is the difference between identify and discovery?

Benjamin Young: your browser affords the user that you’re within a publication; now the UA needs to identify what the publication is

Tim Cole: so, in some context I discover that I am within a “web publication,” then I identify the things that make it up, then I select certain items that are important to it (what are technical requirements, for example?)

Ivan Herman: that’s fine, but that’s not identification

Tim Cole: right, that’s selection

Liisa McCloy-Kelley: a practical example: looking at a website with an instruction manual that has several options (search, nav, etc.), but it’s not done as a “publication”
… you’re trying to find this manual and you somehow link deeply into a page within it
… “identifying” is finding that you’re part of a bigger object
… “select” is trying to figure out whether that piece is a part of something larger and selecting your context and range of information
… “obtain” is “how do I get this”?

Tim Cole: right, including any constraints that go with it

David Wood: so, you get linked off to some place inside a web pub, and then your UA knows that you have discovered the web pub. is that always true?

Liisa McCloy-Kelley: that goes back to how we define this “thing” so that the UA knows it is a web pub; the user doesn’t care, they’re just looking for information

David Wood: the linkage won’t happen unless we make it happen

Liisa McCloy-Kelley: i can make the choice to have the full thing so I can reference it in the future

Benjamin Young: now that I have it, what can I do with it? that might be the next step after “obtain”
… is this making any sense (@ivan)?

Ivan Herman: I’m not skeptical, just trying to connect our diagram with the record that we have (i.e., in the FPWD)
… “discover” is within a publication; “select” means I need to find out what pub I’m part of; “obtain” is “can I get the publication?”; still don’t see the role of “identify”
… we have to be careful so each part of the workflow has the same “weight”, that each has a specific role

Benjamin Young: identify, select, obtain are all in the “manifest” – this is the identifier, this is the metadata, these are the components of the publication

Ivan Herman: I understand what Rachel added — “find my parent” under “identify” — but then “select” becomes unclear to me

Benjamin Young: I think we have an understanding that each step is additive to the one before it, it becomes “larger”; we’re giving us the steps needed to understand the publication

Tim Cole: the important thing is to recognize that there are certain steps that need to happen

Rachel Comerford: on the education level, i think of “select” as a way to choose a specific version of a single publication
… the element of “selection” is deciding where I’m going with that publication–what version I need

Ivan Herman: so it’s the same publication with the same identifier, but different ways of reading it?

Benjamin Young: there’s nothing in “identify” and “select” that we haven’t already identified in the metadata conversations

Ivan Herman: except that, it may happen that i get on a page, but in fact that page may belong to three or four different web pubs, so it’s shared by several publications, meaning it can link out to several manifests; it become sup to the user to choose which one they want to use

Liisa McCloy-Kelley: yes, but we then need to be clear that individual asset within a package can/cannot have multiple parents

Ivan Herman: this is not reflected in our current info items

Benjamin Young: even if not possible, “select” needs to give the UA/human enough info so that they can make an intelligent decision about it being something they want

Ivan Herman: i want a better word

Liisa McCloy-Kelley: choose!

Benjamin Young: “obtaining” is the hard part, and beyond that is even harder–what does the UA do then? does it tell me how far along I am in a web pub? are there interface changes in my browser?

Ivan Herman: this is true, but we shouldn’t define it

Benjamin Young: right, but we need to acknowledge it
… this happened in the annotations wg also

Ivan Herman: just to repeat, I haven’t seen anything that would not be covered by the data we have in our document, which is a good thing

Benjamin Young: once we get past “obtain”, do we afford only a linear experience, “nonlinear”, etc.? we need to prescribe something to the UA after it obtains the publication
… for example, a PDF affords a “reading” context; you have search, nav, forward-backward progression, etc.

Ivan Herman: right, so the data is defined so that the browser can use it

Benjamin Young: yes, but what sort of thing do we want to be done with that data?
… what is it supposed to afford the user? think about audio books, a11y, etc.
… what does the user get from knowing there are 10 pages instead of just the 1 they discovered?
… need to define what we “hope” the UA will do

Ivan Herman: i’m not sure

George Kerscher: once you get from a chunk of info, you can’t know what its parent is… too unpredictable, but a browser can find the manifest, and the UA can switch state into an intelligent browser that knows it is accessing a publication

Ivan Herman: exactly, but the problem is we can spend days trying to come up with complicated ways of handling wpubs… i think what we have in the document is there for covering what we know at this moment, but we at some point need to become specific; for the time being it’s an abstract set of info; i’m afraid of “that’s useful, but we need to be more tangible”

Benjamin Young: offline becomes a question here too
… if offlining is a requirement, is it human-centric UAs, or more general?
… yes, we’re all talking about a collection of documents and how it’s obtained, but we don’t know what those mean
… personally, i think in a more book-like manner
… but what happens when that web pub involves other resources than itself?

Tim Cole: problem is, can we resolve any of those issues and how comprehensive does it need to be?

David Wood: the issues around linking to/from manifests are there for a reason; the term is “document transclusion” when you have resources used in more than one publication
… there are good reasons not to link from a single item to all the others that include it; it’s fragile, metadata might conflicts, etc.
… unless we want to break the web, i suggest “issue 76” be closed as unresolved because we probably won’t do that
… let’s not spend valuable f2f time on that; would love to fix it, but it’s theoretically unfixable

Benjamin Young: so, what if I have a copy of “1984” and the publisher redacts it? I want that history, but I want access to an original copy that I obtained. Are we saying all of these copies should be replicated with separate URLs?

David Wood: that’s fine, but the individual document doesn’t need to link to many different manifests

Garth Conboy: given that perspective, can you not mandate the “find my parent” part of “identify”?

David Wood: it’s occasionally useful, but we can’t do it

Benjamin Young: if we release many versions of a manifest, what does the publication then reference? is everything kept in sync, is the publisher required to always keep the publication manifestations updated in all their different versions?

David Wood: in some context, publisher might choose to transclude and keep the data updated, but that’s not going to be true in all cases; it should be allowed, but not mandated

Ivan Herman: trying to translate in a practical way–in HTML file, e.g., “part 22 of the manual”, i may link to the manifest that identifies the full publication, but that’s where I stop–it’s not mandated, but it’s allowed

David Wood: not only should it be a “may”, but there should be a caveat that it’s dangerous if the versions aren’t maintained

Brady Duga: this is true any time you place a link in a document

George Kerscher: if this is in the metadata, in the manifest about this resource, and a discover tool can find all the related manifests, that can be done through search, but not listed in every related file

Ivan Herman: we agree

Liisa McCloy-Kelley: so, stepping back to what bigbluehat said—books get updated all the time, so when the book gets offlined, you get a snapshot of the book, just like buying a physical book
… oftentimes the same changes are made across many files (e.g., ad cards)

Laurent Le Meur: right, and what we’re saying is that this is nice, but not a requirement

Ivan Herman: so can we close the issue as unresolved?

Benjamin Young: yes, but which resource SHOULD reference the manifest? all of them “may”? so what if none of them do?
… if a resource is found by itself, what is its manifest? what does it point to?

Garth Conboy: the children don’t know its parent, but the parent knows all its children; that’s where we are at

Ivan Herman: right

Benjamin Young: if we remove the findability of the “total thing”, the only way we can really find the publication is via metadata and URLs within the document–that’s weird

Tzviya Siegman: example, assembling journal articles in a journal, in print we use pages; in digital, we call them “e-locators”; people want a “boundedness” to a publication, a way to find the whole from a part

Ivan Herman: well, we know a manifest MUST have a URL; the server returns what? we haven’t completely decided, but the URL is there… I will get the manifest in one way or another
… once I get that, I can access the publication, search, whatever; the only problem we have now, is what happens when we encounter a smaller part, and where I go back to, what is its parent?
… that’s what we are talking about

Tzviya Siegman: publishers often link to a ToC

Liisa McCloy-Kelley: but that’s bad, it’s clunky and ugly
… the body of the book must still somehow know it’s part of a whole, a user would want to know it’s in a specific book

Benjamin Young: in a web context, if you’re on “page 25”, that’s all you would know. you lose the context of the publication as a whole

Ivan Herman: that’s what we’re saying, we MAY place such metadata links, but we can’t require it because it is an unreasonable burden on authors/publishers

Liisa McCloy-Kelley: that’s fine

David Wood: we can’t in general in a web context link from child to parent due to several technical concerns; it’s better to make it a choice to maintain these links instead of mandating them

Ivan Herman: so, ok to close with these comments?

group: yes

David Wood: we achieved consensus–this is it!

Proposed resolution: close issue 76, with the comment that a link to the WP MAY be part of the resources, but WP does not require it (Ivan Herman)

David Wood: +1

Mateus Teixeira: +1

Luc Audrain: +1

Brady Duga: -1

Tzviya Siegman: https://github.com/w3c/wpub/issues/76

Ivan Herman: https://github.com/w3c/wpub/issues/76

Benjamin Young: that doesn’t solve 76 which is about multiple parents

Proposed resolution: close issue 76, with the comment that a link to the WP(s) MAY be part of the resources, but WP does not require it (Ivan Herman)

Benjamin Young: we still need to consider multiples

Leonard Rosenthol: the user chooses

Garth Conboy: Yes, the “(s)” makes this what we agreed, I think.

Tzviya Siegman: we’re all confused

Ivan Herman: so can we agree on the current proposal with the “(s)” added?

Benjamin Young: fine, but i think it introduces a messy problem: which one of the manifests is THE one that signifies the publication to the UA?

George Kerscher: there’s a discovery state, the browser can choose among many manifests, but not necessarily moving into a “reading” state?

Resolution #1: close issue 76, with the comment that a link to the WP(s) MAY be part of the resources, but WP does not require it

Benjamin Young: yes, but there is still that other issue, which we can open later

Brady Duga: the issue is that, given that now there are multiple parents, how to pick the right one? this is a harder issue to solve

Benjamin Young: now that there’s the opportunity to pick the publication, the browser can’t enter the state by default, because a default/parent cannot be identified

Matt Garrish: right, what are the expectations? this is the problem we’ve had since the beginning with multiples

Ivan Herman: we may be in a situation where we may have no idea, upon encountering a resource, what publication it belongs to. we probably won’t have a solution for this

Brady Duga: yes, but that’s not issue 76

Garth Conboy: you might just not be able to do it; that’s all; we accept that

Matt Garrish: it’s not the end of the world, but this introduces a divergence

Brady Duga: issue 76 is not about finding a parent, but about picking among multiples

Garth Conboy: yes, but we seem to be agreeing not to specify it
… that’s the final resolution we seem to come upon

Benjamin Young: but i need to test this stuff!
… how do we do that?

Tim Cole: we don’t need to test if it’s just a “may”

Tzviya Siegman: let’s each state our case

Benjamin Young: i just want to know how we will test it

Ivan Herman: i’m confused… we’re trying to redefine the web

David Wood: if you read issue 76, allow resources to allow to multiple manifests? YES. solve the problem about linking to multiple pubs? NO.

Garth Conboy: yep, ivan is frantically typing the resolution now!

Ivan Herman: comment added, the wp will not define how the choice among multiples

Garth Conboy: time for a 15 minute break since we argued through the scheduled break

4. I18N

Tzviya Siegman: a few folks from the I18n group have come to join us…

Richard Ishida: who’s heard of us? A few… And how many run away? right
… I’m going to do an intro and background about us and why you should care
… no slides, no structure - just ask!
… (shows home page - www.w3c.org/international

Addison Phillips: https://www.w3.org/International

Richard Ishida: everything you need to know is there incl. info, how to participate, current activities, etc.
… three major divisions - requirements, developer support & education/outreach

Fuqiao Xue: https://w3c.github.io/i18n-activity/projects/

Richard Ishida: requirements incl. things needed on various languages
… dev support reviews documents for other groups. there is even a review “radar” on where things are in process

Fuqiao Xue: Review radar: https://github.com/w3c/i18n-activity/projects/1

Richard Ishida: making sure more groups start early (FPWD) rather than at last call. (Note to selves: make sure we do this!)
… also developing a self-review checklist (with github magic incl)
… lots of informative documents, test suites and more for DevSupport
… for education & outreach, explanations about how the web works so that authors can do the right thing
… (walked us through some examples such as vertical text)
… let’s talk more about the requirements

Richard Ishida: https://github.com/w3c/i18n-discuss/wiki/Analysing-support-for-text-layout-on-the-Web

Richard Ishida: showing off the Japanese layout requirements (which are not necessary the same ones used in Word docs)
… emphasis, H&J etc. all included in the doc. And working on the same thing for other scripts
… use these, and don’t think just about English/Roman
… but - we’ve changed our emphasis recently

Fuqiao Xue: https://w3c.github.io/typography/gap-analysis/language-matrix.html

Richard Ishida: working on an attempt to show off what is still missing on the web, for each language
… table shows language/scripts vs. various layout features that might be relevant
… these features come from a document about international typography, https://w3c.github.io/typography/
… and with this document other specs (such as CSS) can refer to it to be the single source for explantation of various features

Leonard Rosenthol: could this document also be used as a referenced for non-web publications (but still digital)?

Richard Ishida: this could be used for non-web scenarios as well by defining the definitions and requirements

George Kerscher: does it also track text to speech?

Richard Ishida: no, it does not

Richard Ishida: there are links in the document to github issues on things that have not yet been solved
… which hopefully people will pick up and help us define them
… also links to spec issues, browser bugs and even type samples
… and anyone can contribute samples

Tzviya Siegman: can I just take photos and submit?

Richard Ishida: yes take pictures of those Hebrew books and upload them!
… the document also describes a new way of approaching the lreq specs. For J, we had a great set of experts who wanted to work on it
… and for Arabic and Hebrew, they are excited to start with and then peter out a bit
… so for J, we didn’t have links to CSS/SVG but they are working on it
… so we think that lreq isn’t the right approach but instead ot find the gaps in a given standard (eg. CSS) and then documenting those
… esp with specific tests
… and once you have pointed out that it doesn’t work, write the section of the lreq that talks about how it should work
… and then working with spec and implementation folks to get it done!
… This is a big message from the I18n committee about how you might want to consider going about your process as well

Ivan Herman: one thing that can be useful for us would be to help fill out the typography table, for language that we speak, so that we can help see where problems are and where we might be able to help
… I spent an afternoon working on Hungarian

Garth Conboy: yes and now Hungarian is greener than English :)

Ivan Herman: please try to help contribute and it will help us in the long run

Tzviya Siegman: list from 2015 https://www.w3.org/TR/dpub-css-priorities/

Tzviya Siegman: back in 2015, the CG produced a document about requirements for CSS (aka missing things) incl H-rules, etc. I bet some of that work would align well here. Perhaps @dauwhe can take a look

Richard Ishida: any other questions?

Brady Duga: this is very helpful and we push back on the UA anyway. one area which we could use guidance on is metadata. format selection, content etc.

Ivan Herman: metadata is important because content is the web, but we ned extra stuff

Richard Ishida: I would need some more info but here you go…
… 1) don’t require plain text for metadata (allow for markup)
… 2) don’t stick it in attribute values, use elements
… 3) make sure that you allow for Hebrew/Arabic issues such as direction

Brady Duga: isn’t that defined by markup?

Richard Ishida: it can be but not always

Richard Ishida: you need additional language info for things like text to speech, semantic search, etc.

Ivan Herman: in web annotations, we identified issues with JSON for doing such things and ask the I18n folks to solve this for JSON.

Richard Ishida: this is an open issue in our group and we are continuing to work on it
… string-meta

Richard Ishida: https://w3c.github.io/string-meta/

Tim Cole: if you have objects, you can add that information. Is there a way to have a default?

Richard Ishida: we don’t know all the details here, but having a default (for each language!) is OK…but make sure you have a way to override.
… in most JSON setups thats not possible, but yes it can be done
… you need to specify the details of fallbacks…which is why we want an interchange model

Ivan Herman: yes! we don’t want to invent this, just use it!

Richard Ishida: we are still working on it as there are pros and cons and not agreement

Benjamin Young: we referred to that spec from Annotations

Richard Ishida: because you come from publishing community, you will find things that authors struggle with and therefore “hack around”. So knowing those things will help better understand requirements and commonalities

Tzviya Siegman: sum up - PLEASE help!!

Liisa McCloy-Kelley: where are we going with emojis as language? (images, alt-text, etc)

Richard Ishida: Yes!!
… Chemical symbols are an issue too
… please let us know specific issues around emojis
… I would like to avoid them, but it is getting harder to do

Benjamin Young: if you can get poop in a spec, anything is possible :)

Leonard Rosenthol: <clap><clap>

5. Packaging

Leonard Rosenthol: https://docs.google.com/document/d/15oig71-ybZDt7G-C0CGmf7KG9k35y0ggu7sXLRadH2k/edit

Ivan Herman: if I take an EPUB and unzip is a web publication who cares

Hadrien Gardeur: sorry Ivan, but this doesn’t make a whole lot of sense

Laurent Le Meur: if you do not use zip as packaging what happens?

Hadrien Gardeur: +1 to what Leonard said

Hadrien Gardeur: that’s my point

Leonard Rosenthol: we want to use a content that doesn’t not have had an absolute url when they were packaged

Ivan Herman: any EPUB4 is something I can put on the web

Laurent Le Meur: this is not the issue

Tzviya Siegman: do we need to support anything that is not on the web?

Brady Duga: is a packaged publication something that was based on Html only?

Brady Duga: will browsers will be able to manage packaged files that are portion of the web?

David Wood: Paraphrased from duga - Does a PWP just use Web technologies or must it be on the Web? If the latter, then there is a lot to do, such as CORS/Fetch.

Benjamin Young: there’s this packaging format the supports both, fwiw…it’s called MIME

Leonard Rosenthol: it depends on how we package, web agent may be indipent from the origin
… packaging from the web or packaging the web, how do you package a data base or a stream?

Ivan Herman: which packaging technology is the best to be used and we need to unpackaged every format we recive

Benjamin Young: one (perhaps the only current) example of browsers consuming packaged, non-Web HTML/JS (etc) https://browserext.github.io/browserext/

Ivan Herman: is a set or requirements on how you put the content in the package, if a publisher create an EPUB4 and follow some guidelines it is fine

Liisa McCloy-Kelley: goal of the group is to make EPUB publication advance as firs class web publications

Tim Cole: depends from use cases

Ivan Herman: today I can create a web site that does not work on the web, you have to follow rules, if you create a publication you also have to follow some rules

Leonard Rosenthol: we do want choices for different publication may have different packages depending on different use cases

Liisa McCloy-Kelley: what happens if something is changed in a publications

5.1. PDF as PWP Profile

Leonard Rosenthol: PDF as PWP Profile (slides)

Leonard Rosenthol: how to move PDF in the future ecosystem: leverage the Open Web Platform - but no commonly accepted portable container
… PDF file Container: series of alternative presentation, a set of resources, using media queries for alternate presentation (for print/Screen/etc)
… different features (CSS/presentation, etc)
… for every resource there is a relative url in pdf, regular resource in PDF or external one, PDF already have locations for all resources
… this is not from Adobe but from PDF association
… the advantage is all their community already know PDF and user find something that already know

Garth Conboy: alternative renditions did not had success in EPUB, what is the PDF container?

Leonard Rosenthol: is not zip, it an serialization of a group of objects

Ivan Herman: what you discuss is very similar to what we are discussing in PWP

Cristina Mussinelli: .. alternative media query is an interesting feature, I would be more happy if PWP if the packaging format can be technically used for PWP?

Leonard Rosenthol: yes

David Wood: I can understand why you propose this, from the Open Web Platform there are at least 3 issues
… would we be happy with binary components and not international?

David Wood: PDF is 7-bit ASCII with binary components

David Wood: - What happens when a default PDF content is not provided because only HTML is created?

David Wood: - What would Richard Ishida say to a 7-bit ASCII wrapper?

David Wood: - Would we be happy with a core binary format instead of OWP?

Leonard Rosenthol: i do not think there is any problem if PDF is a PWP publication

Leonard Rosenthol: media query can be in your normal web content

Cristina Mussinelli: .. our viewer works an any EPUB readers

Leonard Rosenthol: we assume the presentation is in the document

Laurent Le Meur: if the working group define a packaging, what impede you to use it?

Leonard Rosenthol: the backward incompatibility for old PDF content

Luc Audrain: starting from PWD you can produce a classic PDF, do we need to use a pagination tool, that is something we do not need to do with EPUB.
… paginated EPUB did not succeed

Liisa McCloy-Kelley: it is true but there is a request from publisher to have it

5.2. Zip and packaging

Cristina Mussinelli: Tzviya; we must go back to zip and packaging

Ivan Herman: I heard that there are major security issues I’d like to know more

Brady Duga: there are on signing and multiple files

Benjamin Young: zip are hard to create well. zip have restricted metadata

Brady Duga: you can handle with external metadata

Laurent Le Meur: we do not need a streamable content, we can expose it via a extension

Leonard Rosenthol: non easily updatable, for other languages that are not English or latin there are issues
… in Japan there are issues, zip it is not standardized at international level,

David Wood: zip in a web content, there is an other standard used to compress in the web gzip

David Wood: Broad tent publishers don’t typically use multi-part mime because it is “too much trouble” - it works well when people use it, though.

Brady Duga: you may have multiple copies or location of the same content

David Wood: Multi-part mime is widely used in email

Leonard Rosenthol: if our goal is to package content from the web we need it

Garth Conboy: OCF it seems it doesn’t have file restrictions

Ivan Herman: what part of this discussion should go in the working group? we will need to make a choice, which packaging solution is the best or maybe we can invest in a new one.

Leonard Rosenthol: https://tools.ietf.org/html/rfc2557

Laurent Le Meur: CBOR is a binary format, then we need to create a parser for the different platforms?

Cristina Mussinelli: http://cbor.io

6. Resolutions

Resolution #1: close issue 76, with the comment that a link to the WP(s) MAY be part of the resources, but WP does not require it