15:27:18 RRSAgent has joined #pronunciation 15:27:23 logging to https://www.w3.org/2023/12/11-pronunciation-irc 15:27:23 RRSAgent, make logs Public 15:27:54 please title this meeting ("meeting: ..."), PaulG 15:27:55 zakim, agenda? 15:27:55 I see nothing on the agenda 15:27:55 Agenda+ Agenda Review, Membership & Announcements 15:27:55 Agenda+ Review Use Case 15:27:55 Agenda+ Action Items 15:27:55 Agenda+ Github Issues and examples 15:28:00 Agenda+ Other Business 15:28:15 presnt+ 15:28:19 present+ 15:51:53 S_Wood has joined #pronunciation 16:00:53 Alan has joined #pronunciation 16:01:43 matatk has joined #pronunciation 16:01:43 present+ 16:01:49 present+ 16:03:01 present+ 16:03:13 https://github.com/w3c/pronunciation/wiki/Assessment-Examples-for-the-Break-Tag 16:04:15 Dee has joined #pronunciation 16:04:49 present+ 16:04:58 zakim, take up next item 16:04:58 agendum 1 -- Agenda Review, Membership & Announcements -- taken up [from PaulG] 16:05:44 matatk: we are trying to check attendance at CSUN 16:06:06 ...is anyone attending? 16:11:07 zakim, take up next item 16:11:07 agendum 2 -- Review Use Case -- taken up [from PaulG] 16:11:30 Alan: is there a reason we're using both commas and breaks? 16:12:20 q+ 16:12:52 Dee: sometimes the speech engine doesn't pronounce "A." but pronounces a short vowel 'a' 16:13:25 S_Wood: for clarity we should remove the comma 16:13:33 ack matatk 16:14:19 matatk: these being from real world examples is very valuable and I think we need to be careful to separate the broader context from the specific asks that we have from implementors 16:14:47 matatk: we don't want them to pinpoint what could be attributable to a bug in the TTS engine 16:15:07 matatk: the break in example 4 is not trying to be the comma 16:15:40 matatk: so we don't need to drop it except that it was being used as a workaround for a bug 16:15:59 Dee: totally makes sense to me 16:16:07 ack me 16:16:21 scribe+ 16:16:30 PaulG: We use @time in all of these examples. 16:16:30 https://github.com/time -> @time 16:16:42 ... but users' AT/TTS have different rates of speech. 16:17:06 ... There are encoded strengths for breaks of different lengths, e.g. after a comma would be a 'weak' break; after a sentence would be a 'strong' break. 16:17:21 ... Devs may be more amenable to coding along the lines of those types of breaks, rather than times. 16:17:42 ... This is an opportunity to use SSML as a starting point, but make the names for the breaks more intiutive. 16:17:52 ... We could come up with our own terms, or use the ones they have. 16:18:13 ... What are your thoughts about using strength over time? 16:18:31 Alan: Is strength relative - you could adjust browser settings (baseline for them). 16:19:03 PaulG: My understanding is it's flexible, so that if you adjust TTS base speed, then breaks are affected proportionally to speech speed factor (e.g. 1.5x) 16:19:29 PaulG: A half-second break could sound like the end of the content to someone who's listening to 450wpm vs 100wpm 16:19:44 Alan: Is 'weak' in Chrome going to be the same as 'weak' in Firefox? 16:19:50 PaulG: Undetermined. 16:20:01 rrsagent, make minutes 16:20:02 I have made the request to generate https://www.w3.org/2023/12/11-pronunciation-minutes.html matatk 16:20:09 PaulG: I think it's based on the platform 16:20:14 ... Even the spec isn't clear about what those tokens are. 16:20:45 PaulG: If we only put @time in our examples, that's all we'll get. 16:20:45 https://github.com/time -> @time 16:21:02 ... When someone's using their own AT, we want their AT to be able to honor that. 16:21:14 ... Time may be OK with a built-in AT (maybe). 16:21:28 ... Amazon implements it, and Google has strength. 16:21:58 Alan: I've not used strength, nor rate, in our SSML. Most of our SSML is hand-rolled. When you use Web Speech API, you send a rate separately to SSML. 16:22:12 Alan: Are we saying change some, because we want both? 16:22:14 PaulG: Yes 16:22:17 Alan: Great 16:22:50 PaulG: If someone's concerned because these times aren't well specified, or for any reason, let us know 16:23:02 Alan: I don't think Web Speech was designed to work with SSML 16:23:03 q+ 16:23:24 PaulG: It may be that strength maps to something inside the TTS engine, and could be well receied 16:23:37 s/receied/received/ 16:23:49 Dee: These examples were real-world examples relating to Amazon Polly use in ETS. 16:24:13 S_Wood: I should talk to mark about all this 16:24:36 ... for this document, we need a varied example base 16:25:21 PaulG: Maybe not specified (strength) to avoid the spec being too English-centric. 16:25:53 ... This could be a way to pivot away from time being a make-or-break feature for us. 16:26:13 ... If we could get by with pauses being based on existing breaks, and you don't have to add a timer to your software, can we work with that? 16:26:15 q? 16:26:26 https://github.com/w3c/pronunciation/wiki/Reasons-for-Delays-in-Spoken-Content 16:26:55 PaulG: We'll have a meeting next week (unless nobody available) and then not for the following 2 weeks. 16:27:03 When we come back, we should take some time to collate these examples. 16:27:19 s/When we come back/PaulG: When we come back/ 16:27:34 PaulG: Then we can try to restart conversations with the vendors. 16:27:49 Dee: Would be helpful to know how the browsers are implementing things like break strength in relation to prossidy. 16:27:57 PaulG: Fantastic idea. Maybe we can do some outreach. 16:28:30 ... We need devs who have knowledge of the inner workings. 16:28:41 Dee: Important to understand how that works, and how time plays into that. 16:28:56 ... If you look up the docs, both time and strength are mentioned. 16:29:09 ... Time being optional attribute, strength not. 16:29:19 Dee: That was from 2010's SSML REC 16:29:36 present+ Dee 16:29:41 rrsagent, make minutes 16:29:43 I have made the request to generate https://www.w3.org/2023/12/11-pronunciation-minutes.html matatk 16:29:54 Dee: Could ask Mark? 16:30:04 rrsagent, make minutes 16:30:05 I have made the request to generate https://www.w3.org/2023/12/11-pronunciation-minutes.html matatk 16:30:42 PaulG: Is everyone here now able to be next week? 16:30:42 Alan: +1 16:30:42 Dee: +1 16:30:42 matatk: +1 16:30:42 PaulG: For next week, could you talk to Mark? Maybe he knows someone who knows? 16:31:05 ... We need to be prepared to find different AT does it different ways. 16:32:12 matatk: we need to preserve the "real world" examples but filter it for the people we want to focus on specific implementation details 16:32:42 matatk: the two audiences, our internal folks and the external implementors we need to influence. 16:33:19 ...Alan mentioned how web speech API doesn't fully implement SSML 16:33:30 ...is that a path to solve our problems? 16:35:07 Alan: I just wrote a higher-level tool to help us with some of the issues 16:35:29 ...in some ways Web Speech gives more control 16:35:41 ...over rate and timing 16:36:31 matatk: Can you share the code or examples for that higher-level tool? 16:42:01 matatk: I was just thinking, whenever we show people we should use the TAG explainer format 16:42:36 ...they focus on the user problem being solved 16:43:18 ...and compare how the various solutions (ssml, web speech, etc) deliver on those solutions 16:44:01 https://tag.w3.org/explainers/ 16:44:20 zakim, next item 16:44:20 I see a speaker queue remaining and respectfully decline to close this agendum, PaulG 16:44:24 ack me 16:44:28 q? 16:44:32 zakim, next item 16:44:33 agendum 3 -- Action Items -- taken up [from PaulG] 16:45:08 zakim, next item 16:45:08 agendum 3 was just opened, PaulG 16:45:15 zakim, close item 3 16:45:15 agendum 3, Action Items, closed 16:45:16 I see 2 items remaining on the agenda; the next one is 16:45:16 4. Github Issues and examples [from PaulG] 16:45:19 zakim, next item 16:45:19 agendum 4 -- Github Issues and examples -- taken up [from PaulG] 16:45:25 zakim, close item 4 16:45:25 agendum 4, Github Issues and examples, closed 16:45:26 I see 1 item remaining on the agenda: 16:45:26 5. Other Business [from PaulG] 16:45:27 zakim, next item 16:45:28 agendum 5 -- Other Business -- taken up [from PaulG] 16:45:57 Alan: clarification: web speech is not a solution for these problems 16:47:33 https://www.w3.org/TR/pronunciation-gap-analysis-and-use-cases/#gap-analysis 16:48:17 PaulG: We mentioned Web Speech in the gap analysis, but we didn't include it in the table (probably because it's very separate from the content) 16:48:23 rrsagent, make minutes 16:48:25 I have made the request to generate https://www.w3.org/2023/12/11-pronunciation-minutes.html matatk 16:48:31 PaulG: Maybe we just call that out, and make it more clear. 16:48:50 PaulG: We can explain why AT users are left behind if things are not moved into the AX tree 16:49:17 S_Wood: I may have some slides that explain this. 16:49:34 rrsagent, make minutes 16:49:35 I have made the request to generate https://www.w3.org/2023/12/11-pronunciation-minutes.html matatk