12:17:20 RRSAgent has joined #pmwg 12:17:25 logging to https://www.w3.org/2026/05/21-pmwg-irc 12:17:25 RRSAgent, make logs Public 12:17:26 Meeting: Publishing Maintenance Working Group 12:17:57 ivan has changed the topic to: Meeting Details 2026-05-14: https://lists.w3.org/Archives/Public/public-pm-wg/2026May/0019.html 12:17:58 Chair: susan 12:17:58 Meeting: Publishing Maintenance Working Group Telco 12:17:58 Agenda: https://lists.w3.org/Archives/Public/public-pm-wg/2026May/0019.html 12:17:59 regrets+ brady, gautier, toshiakikoike, gpellegrino 12:54:26 shiestyle has joined #pmwg 12:56:19 DaleRogers has joined #pmwg 12:59:29 MasakazuKitahara has joined #pmwg 12:59:56 ajellinek has joined #pmwg 13:00:00 present+ 13:00:10 present+ shiestyle 13:00:11 sueneu has joined #pmwg 13:00:12 present+ 13:00:16 present+ 13:00:16 present+ sueneu 13:00:52 present+ 13:01:07 present+ MasakazuKitahara 13:01:17 present+ 13:01:24 present+ 13:02:19 mgarrish has joined #pmwg 13:02:42 regrets+ wendyreid 13:03:42 present+ mgarrish 13:06:57 Hadrien has joined #pmwg 13:07:01 present+ hadrien 13:07:27 LaurentLM has joined #pmwg 13:07:33 present+ 13:08:18 scribe+ 13:08:24 present+ charles 13:08:31 Topic: Do we need the concept of Annotation Sets? https://github.com/w3c/epub-specs/issues 13:09:20 https://github.com/w3c/epub-specs/issues/3000 13:10:13 laurent: do we replace the annotation structure with a zip that could contain any annotation type plus any information about the book 13:10:26 q+ 13:10:38 LaurentLM: it makes sense to have information about the book outside of the structure 13:10:50 ack Hadrien 13:10:51 …what shape should this take 13:11:14 Hadrien: even if we have a zip we want a way of grouping the annotations together 13:11:16 q+ 13:11:31 gman has joined #pmwg 13:11:33 …the concept of the set will continue to exist. we want to have a well known location 13:11:36 present+ 13:11:47 chair+ 13:12:08 …we've agreed that there is a usefulness but a limited one for metadata 13:12:37 …mostly for services that need the metadata and for matching but this might not be so useful 13:12:52 …only when we have to present the metadata outside of the book 13:13:11 …so we should have few pieces of metadata 13:13:23 q+ 13:13:29 ack ivan 13:13:30 …citation styles would be a useful influence on the kinds of metadata we present 13:14:09 ivan: we are talking about standardizing the interchange format 13:14:18 q+ 13:14:28 …at that level, the set creates one single file for import and export 13:14:41 …when we accepted images, etc. we decided that zip is the format 13:14:57 …then later we decided that zip is the only accepted format 13:15:59 …we do not need the grouping function, one file that lists all the annotations is not needed 13:16:23 …if the annotations sets role is to have collective metadata, there are two things there, 13:16:34 …information about the generator and the publication 13:16:42 current ebook metadata relative to the book: https://w3c.github.io/epub-specs/epub34/annotations/#about 13:16:55 …the generator isn't a necessary item in the interchange format 13:17:15 …the metadata for the publication, is mostly copies of data in the package document 13:17:38 …so we are repeating those values. 13:18:18 …which seems superfluous 13:19:05 …my conclusion is that the simplest way to do this is to make the metadata package include a copy of the metadata in the publication 13:19:11 ack LaurentLM 13:21:16 LaurentM: there will be import and export of files plus push and pull of sets of annotaions in the rest API 13:21:29 …I don't think there is an issue choosing zip as a container 13:21:40 …it is also possible that our format should support this 13:22:19 …about the generator, many files include this kind of information, though I don't suggest including the date 13:22:41 …this can be useful to differentiate annotations that don't conform to the specifcations 13:23:04 q+ 13:23:21 …I would be worried about totally tieing our format to epub by using the opf in annotations 13:23:37 …and it mixes xml and json which is never great 13:23:54 …we just need descriptive information about the book, and no list of annotations 13:24:17 …just title, author, etc or something that links the annotation to the book 13:24:19 ack Hadrien 13:24:46 Hadrien: we recently released Thorium on IOS and we have academic users 13:24:53 …we have almost 40% of everything that is read is in PDF 13:25:14 …so when we implement the highlight feature we will be asked to do it for PDF as well. 13:25:29 …if we want to implement this feature, it has to work for more than epub 13:25:45 …we need something as format agnostic as we can 13:26:39 …when we look at what we have for interchange, de-paged annotations, we define that we use a zip, that we have an extension and profile, and a list in .json 13:27:11 …so we have one document in which we have a list of all documentations even if we have a set 13:27:30 …by definition we need a set if we have mulitiple files 13:27:50 …now I think we have too much data, do we need dc date, dc creator? 13:28:12 …at a minimum we need something to identify annotations when they are separate from the book 13:28:16 ack ivan 13:28:47 ivan: we have currently a json structure for the set. I feel an itemized list isn't necessary 13:29:04 q+ 13:29:14 …we can have a sub directory for organizing the files 13:29:39 …coming back to metadata, we have never formalized that this should work outside of epub 13:29:57 …assuming this is a requirement now, it becomes a different discussion 13:30:18 …we should have a new pr about which metadata is necessary 13:30:53 …we must make clear how the identifiers are related to each other 13:31:12 …we cannot be silent about how this is related to the epub version 13:31:38 …I propose a pr that removes the itemized data from the annotation set/metadata 13:32:00 …we can discuss the exact name later, and we can further discuss it in the pr 13:32:07 q? 13:32:14 ack Hadrien 13:32:58 LaurentM: you will open a PR on the metadata? and we will leave the generator for now 13:33:19 Hadrien: I think having one file will all the information is more efficient 13:33:33 …I think there is a utility in having it all in one place 13:33:58 …that means if you look at annotation sets, we have metadata including items 13:34:11 …which is necessary because you need names in json 13:34:25 …i'm not sure we need type, generator, etc 13:34:37 …we need a minimal set of metadata 13:34:53 …I would go for a light weight approach but keep the set 13:35:13 q? 13:35:27 LaurentM: it would be useful to have the advice of other reading systems. I will try to get some feedback 13:35:39 ivan: then I will hold off on the PR for now 13:35:59 Topic: Proposing a text on merging [annotations] https://github.com/w3c/epub-specs/pull/3001 13:36:07 q+ 13:36:14 ack ivan 13:36:30 topic [new] 13:37:06 ivan: I have a question about when annotaions are merged. What can we describe normatively 13:37:31 …it turns out there is not much consistency in book metadata so we have to rely on the heuristics in the rs 13:37:46 LaurentLM has joined #pmwg 13:37:51 …so we need a paragraph to make clear to readers what they can expect on import 13:37:54 q? 13:38:01 q+ 13:38:10 q- 13:39:17 …if I am a user and not an implementor, I may never see it 13:39:39 q? 13:39:41 …I don't want to hide the text from reading systems either 13:40:18 s/[new]/Proposing a text on merging [annotations] https://github.com/w3c/epub-specs/pull/3001 13:40:48 q+ 13:40:59 ack LaurentLM 13:41:02 …if we are able to separate the information into implementer and user sections, then this text should go into the user part 13:41:20 LaurentM: most of section six is about what a user can expect 13:41:41 q+ 13:41:42 …there are sentences that are aimed at developers 13:42:16 …we can change the language to make it clear that implementers should pay attention to 13:42:25 ack DaleRogers 13:42:25 …rather than duplicating information 13:43:32 q+ 13:43:33 q? 13:43:40 ack LaurentLM 13:43:40 DaleRogers: I notice section 6 is called "best practices for reading systems" we could in each section say what audience a section is aimed at 13:43:54 q+ 13:44:00 LaurentM: the risk is duplication and being unclear 13:44:35 …developers know how to deal with a use case, if we reorient this section more like use cases we can avoid splitting and duplicating 13:44:36 ack ivan 13:45:15 ivan: perhaps we could rewrite the section as use cases, and then have other sections or notes for implementors 13:45:27 …most of the sections would be use case oriented 13:46:05 …editorially, the simplest thing is to merge the PR and then we will move it during rewriting 13:46:11 q+ 13:46:41 LaurentM: I can take a run on rewriting it, I'll show you the branch before the PR 13:46:52 ack sueneu 13:47:47 q+ 13:48:36 Do users look at the reading system spec? I expect not, so rewriting it with a use case framing makes sense 13:48:44 ack LaurentLM 13:48:58 s/Do users/sueneu: Do users/ 13:49:28 LaurentM: I think users won't read the spec but there is an interim party, people who make articles and usage notes who would need this information 13:49:41 …and they will pick up on this vocabulary 13:49:47 q? 13:50:29 Topic: Add some context to the use of the term Segment in the Target section https://github.com/w3c/epub-specs/pull/2990 13:51:02 LaurentM: we know we can create bookmarks and annotations. A bookmark is a placeholder in a text or image but not so precise for a user 13:51:23 …an annotation is a highlight of text or other media, and has some range in the content 13:51:54 …the selectors we define can isolate a range even for a bookmark, since there is no specific marker for a book mark 13:51:56 q+ 13:52:23 …will we accept that the selector of a bookmark can be a range or must be a single point? 13:52:30 ack Hadrien 13:52:52 Hadrien: reading systems usually have a specific affordance for bookmarks 13:53:16 …like an icon on the corner of the page, we look at what is currently displayed to the users 13:53:25 …and check if there is a bookmark there 13:53:33 q+ 13:54:01 …if there is a superlong range, the text could be displayed across many screens and cause problematic behavior 13:54:22 q+ 13:54:26 ack ivan 13:54:28 …i suspect rs will want to define a bookmark that won't cross the boundary of multiple screens 13:54:56 ivan: this is related to the previous discussion because th choice of selectors is done behind the scenes by the rs 13:55:24 q+ 13:55:31 …so this is a matter of the affordance that LaurentM was talking about. So the situation you describe won't happen 13:56:19 …so reading systems should do this properly. We should say if a readers intent is to set a bookmark, we should use a single selector 13:56:57 LaurentM: a text position selector has a beginning and end, and is a range 13:57:17 q+ 13:57:20 ack LaurentLM 13:57:33 …do you agree that a bookmark should be a single location we should add that to the spec 13:57:35 ask 13:57:38 ack 13:57:42 ack Hadrien 13:58:31 LaurentLM: if reading system A generates a bookmark, it will still work on reading system A, but export that annotation to reading system B and B handles bookmarks differently, we could have problems 13:59:06 …even if we include a specification, reading system B will have a sanitisation process 13:59:24 …that will then decide if the bookmark would be at the beginning or end of a range 13:59:40 …we can try to force whatever and people will do what they want 14:00:01 …I don't think we will be able to control what people will do 14:00:22 ack DaleRogers 14:00:28 …it might be good to have some recommendations but reading systems shouldn't expect it 14:01:16 DaleRogers: a reading system can do what it wants with our declarations, and we can make those declarations 14:01:42 rrsagent, draft minutes 14:01:44 I have made the request to generate https://www.w3.org/2026/05/21-pmwg-minutes.html ivan 14:43:32 LaurentLM has joined #pmwg 14:55:41 mgarrish has left #pmwg 14:56:59 LaurentLM has joined #pmwg 15:01:47 gpellegrino has joined #pmwg 16:01:12 Zakim has left #pmwg