14:46:17 RRSAgent has joined #epub-fxl
14:46:21 logging to https://www.w3.org/2024/02/05-epub-fxl-irc
14:46:21 RRSAgent, make logs Public
14:46:22 please title this meeting ("meeting: ..."), wendyreid
14:46:32 meeting: FXL Accessibility Task Force
14:46:37 date: 2024-02-05
14:46:42 chair: wendyreid
14:59:31 JonasLillqvist has joined #epub-fxl
15:01:48 Hadrien has joined #epub-fxl
15:02:16 gpellegrino has joined #epub-fxl
15:02:19 present+
15:02:20 jgriggs_prh has joined #epub-fxl
15:02:25 present+
15:02:29 present+
15:04:25 present+
15:04:59 CharlesL has joined #epub-fxl
15:05:14 scribe+
15:05:33 wendyreid: I'm working on the final revision of the main document
15:05:39 present+
15:06:28 https://github.com/w3c/publishingcg/issues/69
15:06:45 ... I would like to speak about the issue Hadrien opened last week
15:07:28 gautierchomel has joined #epub-fxl
15:07:56 Koko: I work at Penguin Random House
15:08:49 wendyreid: while discussing TTS we realized that there is no standard on top of which we can develop TTS engines
15:08:58 q+
15:09:03 ack Hadrien
15:09:33 Hadrien: the issue I created covers only some of the issues in creating a TTS engine
15:10:07 I left out highlighting, virtual highlighting, switching voices between languages, etc.
15:11:13 ... I think we need to document what we see in real files
15:12:01 wendyreid: when I looked at the Speech API, I saw that it is a CG Note; it's not a REC, it's not official
15:12:28 q+
15:12:52 ack Hadrien
15:12:56 ... I think there are issues about the resources needed to create audio with good voices
15:13:07 Hadrien: in my experience it's widely supported
15:13:29 ... but it is quite inconsistent
15:14:49 ... it's difficult to get feedback from the engines
15:15:17 ... actually you can use native APIs if you're developing native apps
15:16:21 ... I don't think the status of the document is an issue for us
15:16:41 ... 
because our work should be on what to do before passing data to the Speech API
15:17:10 wendyreid: I prefer to work on stable specs
15:18:01 ... Hadrien is right, this work is about "pre-API" work
15:18:03 q+
15:18:11 ack gpellegrino
15:18:13 scribe+
15:18:39 q+
15:18:39 gpellegrino: As I understand the issue, the idea is to define a way to extract text from EPUBs into an abstract layer that you can then pass to the TTS
15:18:47 ... or display to the end user as simple text
15:18:54 ... meaning for FXL, we can have a reflow version
15:18:58 ack Hadrien
15:19:10 Hadrien: It's more than text
15:19:23 ... it's also semantics: language + context
15:19:45 ... may also intercept hidden elements (like page breaks)
15:20:33 ... the output should be an object with the text plus metadata
15:21:02 wendyreid: a lot of TTS engines in reading apps do not read alt-text for images
15:21:40 Hadrien: in Readium alt-text is supported (no way to skip it, for the moment)
15:22:37 wendyreid: the basis on which we can build this engine is having good semantics in the text
15:23:00 ... I'm researching how browsers handle reading mode
15:23:35 q+
15:23:40 ... I've found an article about bugs in the browser reading mode
15:23:57 https://www.sarasoueidan.com/blog/tips-for-reader-modes/
15:24:02 ack Hadrien
15:24:43 Hadrien: I'm less concerned about that, I think the biggest issue we may have with FXL is similar to liquid mode with a well-known fixed format for digital publications
15:25:25 ... even if the reading mode is correct, we may still have problems with text split across multiple spans
15:25:49 ... I think we should reconstruct the content
15:26:50 ... 
problems with semantics may arise with content split across multiple pages
15:28:38 wendyreid: I think that we may have a pause across different pages (in the middle of a paragraph)
15:29:23 Hadrien: if we create an object for the whole publication, and we reconstruct the content using the ICU tokenizer, we may be able to recreate sentences
15:29:54 wendyreid: we may have performance issues
15:30:27 Hadrien: it depends, we may have different approaches
15:31:54 wendyreid: In FXL we may use the TOC to reconstruct part of the content
15:34:21 Hadrien: it's not that simple, since you have things split across multiple pages
15:34:36 wendyreid: maybe we can benefit from using DPUB ARIA roles
15:35:18 Hadrien: sure, maybe also having something to say where a sentence ends
15:37:24 ... I think documenting what developers are already doing
15:37:40 ... is a good starting point for creating this document
15:37:56 wendyreid: which questions could we ask?
15:38:19 Hadrien: I think we can start by asking how they break up the content (based on HTML, etc.)
15:38:35 ... and how they handle non-textual content
15:38:40 with semantics
15:38:42 q+
15:39:29 ... I think this system may also be useful for the remediation process
15:39:42 ack gpellegrino
15:39:52 gpellegrino: It might be worth asking about MathML
15:40:02 ... TTS breaks on MathML, might also be interesting
15:40:16 ... reader mode and TTS, we might need multiple approaches to MathML
15:40:20 q+
15:40:29 ack CharlesL
15:40:53 cant find the window
15:42:15 CharlesL: will we have FXL ebooks with MathML?
15:42:40 q+
15:42:40 gpellegrino: I think the scope of this document also covers reflowable
15:42:48 ack CharlesL
15:43:03 q+
15:43:12 wendyreid: I think we can mention MathML without speaking about the format
15:43:28 ack Hadrien
15:43:37 CharlesL: I think there should be an option for reading "invisible" things like alt-text
15:44:04 Hadrien: sure, maybe we should then discuss skippability and escapability
15:45:11 wendyreid: I think another important element is personalization
15:47:08 Hadrien: we have a Readium community call this Wednesday, I may add an agenda item about this
15:47:45 ... for example Readium mobile and Readium desktop have two different approaches and we may start documenting them
15:48:10 s/cant find the window//
15:48:35 wendyreid: AOB?
15:49:59 Hadrien: the more I look at it, the more it seems to me useful for different use cases
15:51:10 CharlesL has left #epub-fxl
15:52:50 zakim, end meeting
15:52:50 As of this point the attendees have been Hadrien, jgriggs_prh, gpellegrino, wendyreid, CharlesL
15:52:53 RRSAgent, please draft minutes
15:52:54 I have made the request to generate https://www.w3.org/2024/02/05-epub-fxl-minutes.html Zakim
15:53:01 I am happy to have been of service, wendyreid; please remember to excuse RRSAgent. Goodbye
15:53:01 Zakim has left #epub-fxl
17:00:55 rrsagent, bye
17:00:55 I see no action items
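
Illustration (not part of the meeting record): the "pre-API" step discussed above, where text split across spans and pages is reconstructed into sentences and handed on as an object with text plus metadata, could look roughly like the sketch below. Everything in it is an assumption made for illustration: the names Fragment, Utterance and reconstruct, the role values, and the rule of joining fragments with a single space. The regular expression is only a naive stand-in for a real sentence tokenizer such as the ICU one mentioned in the discussion, and the sketch does not describe how Readium or any other reading system works.

```python
import re
from dataclasses import dataclass
from typing import List

# Naive stand-in for a real sentence tokenizer (the log mentions ICU):
# a boundary is any whitespace that follows '.', '!' or '?'.
SENTENCE_BOUNDARY = re.compile(r"(?<=[.!?])\s+")


@dataclass
class Fragment:
    """One extracted run of content, e.g. a single positioned span."""
    text: str
    page: int
    lang: str = "en"
    role: str = "text"  # hypothetical roles: "text", "alt-text", "pagebreak"


@dataclass
class Utterance:
    """Text plus the metadata a TTS layer might need."""
    text: str
    lang: str
    pages: List[int]
    role: str = "text"


def reconstruct(fragments: List[Fragment]) -> List[Utterance]:
    """Merge fragments into sentences, even across spans and page breaks."""
    utterances: List[Utterance] = []
    run: List[Fragment] = []  # consecutive body-text fragments in one language

    def flush() -> None:
        if not run:
            return
        # Join the run, remembering which page each character came from.
        joined = ""
        page_of: List[int] = []
        for frag in run:
            piece = frag.text.strip()
            if not piece:
                continue
            if joined:
                joined += " "  # assumption: fragments are space-separated
                page_of.append(frag.page)
            joined += piece
            page_of.extend([frag.page] * len(piece))
        # Cut the joined text at sentence boundaries.
        spans = []
        start = 0
        for match in SENTENCE_BOUNDARY.finditer(joined):
            spans.append((start, match.start()))
            start = match.end()
        spans.append((start, len(joined)))
        for begin, end in spans:
            if end > begin:
                pages = sorted(set(page_of[begin:end]))
                utterances.append(Utterance(joined[begin:end], run[0].lang, pages))
        run.clear()

    for frag in fragments:
        if frag.role == "pagebreak":
            continue  # hidden marker: must not interrupt a sentence
        if frag.role != "text":
            flush()  # e.g. alt-text becomes its own, skippable utterance
            utterances.append(
                Utterance(frag.text.strip(), frag.lang, [frag.page], frag.role)
            )
            continue
        if run and run[-1].lang != frag.lang:
            flush()  # a language switch ends the current run
        run.append(frag)
    flush()
    return utterances
```

Keeping the role and the page list on each utterance is one possible way to support the skippability of alt-text and the cross-page sentences raised in the discussion; hyphenation and words split mid-span are deliberately not handled here.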