14:19:35 RRSAgent has joined #ag 14:19:39 logging to https://www.w3.org/2026/06/02-ag-irc 14:19:39 RRSAgent, make logs Public 14:19:40 Meeting: AGWG Teleconference 14:19:43 agenda? 14:19:51 rrsagent, make minutes 14:19:52 I have made the request to generate https://www.w3.org/2026/06/02-ag-minutes.html kevin 14:49:49 GreggVan has joined #ag 14:49:57 GreggVan has joined #ag 14:52:31 q+ to say this is the onboarding 14:52:33 q- 14:53:29 CClaire has joined #ag 14:54:01 janina has joined #ag 14:55:03 Stephanie has joined #ag 14:55:38 laura has joined #ag 14:56:29 HaTheo has joined #AG 14:57:05 Helen has joined #ag 14:59:51 NatTarnoff has joined #ag 15:00:14 Helen has joined #ag 15:00:25 giacomo-petri has joined #ag 15:00:27 stevef has joined #ag 15:00:30 wendyreid has joined #ag 15:00:37 present+ 15:00:43 Heather has joined #ag 15:00:55 Present+ 15:01:04 present+ 15:01:09 shadi has joined #ag 15:01:09 present+ 15:01:09 present+ 15:01:12 present+ 15:01:15 AWK has joined #ag 15:01:18 present+ 15:01:20 ryomtoob has joined #ag 15:01:23 Present+ 15:01:24 present+ 15:01:25 bbailey has joined #ag 15:01:28 present+ 15:01:31 present+ 15:01:31 ahick has joined #ag 15:01:35 present+ 15:01:43 present+ 15:01:55 present+ 15:01:55 present+ Laura_Carlson 15:01:55 present+ 15:01:55 scott has joined #ag 15:01:58 Scribe: Laura_Carlson 15:02:05 Scribe: laura 15:02:11 Jennie_Delisi has joined #ag 15:02:26 present+ 15:02:30 Charles has joined #ag 15:02:46 present+ 15:02:51 julierawe has joined #ag 15:02:54 present+ 15:03:19 AP: the first thing that we will start the meeting with is introductions. 15:03:31 present+ 15:03:39 zakim, start meeting 15:03:39 RRSAgent, make logs Public 15:03:41 Meeting: AGWG Teleconference 15:03:42 agebda> 15:03:43 agenda? 15:03:50 I'd like to give an intro 15:03:55 present+ 15:04:10 q+ 15:04:17 BrianE has joined #ag 15:04:17 wendyreid has joined #ag 15:04:17 laura has joined #ag 15:04:17 GreggVan has joined #ag 15:04:17 denkeni has joined #ag 15:04:17 alice has joined #ag 15:04:17 alastairc has joined #ag 15:04:17 kirkwood has joined #ag 15:04:17 hdv has joined #ag 15:04:17 jyasskin has joined #ag 15:04:17 JeroenH has joined #ag 15:04:17 Adam has joined #ag 15:04:17 jedi has joined #ag 15:04:17 Rachael has joined #ag 15:04:17 mgifford2 has joined #ag 15:04:37 Zakim, next item 15:04:37 agendum 1 -- Introductions -- taken up [from Adam] 15:05:09 Zakim, next item 15:05:09 I see a speaker queue remaining and respectfully decline to close this agendum, Adam 15:05:12 q? 15:05:15 LoriO has joined #ag 15:05:16 sam-estoesta has joined #ag 15:05:18 RY: I'm Rachael Yomtoob. I'm no longer with Deque, so l've rejoined as an invited expert. 15:05:21 ack Rachael 15:05:21 present+ 15:05:25 ack ryomtoob 15:05:25 present+ 15:05:28 q? 15:05:32 Zakim, next item 15:05:32 agendum 1 was just opened, Adam 15:05:32 present+ 15:05:37 present+ 15:05:41 Zakim, close item 1 15:05:41 agendum 1, Introductions, closed 15:05:41 I see 3 items remaining on the agenda; the next one is 15:05:41 2. Announcements [from Adam] 15:05:50 BrianE has joined #ag 15:05:50 wendyreid has joined #ag 15:05:50 laura has joined #ag 15:05:50 GreggVan has joined #ag 15:05:50 denkeni has joined #ag 15:05:50 alice has joined #ag 15:05:50 alastairc has joined #ag 15:05:50 kirkwood has joined #ag 15:05:50 hdv has joined #ag 15:05:50 jyasskin has joined #ag 15:05:50 JeroenH has joined #ag 15:05:50 Adam has joined #ag 15:05:50 jedi has joined #ag 15:05:50 Rachael has joined #ag 15:05:50 mgifford2 has joined #ag 15:05:50 Ben_Tillyer has joined #ag 15:05:53 present+ 15:05:55 Zakim, take up 2 15:05:55 I don't understand 'take up 2', Adam 15:05:58 Zakim, take up item 2 15:05:58 agendum 2 -- Announcements -- taken up [from Adam] 15:06:38 probably all day mon & tue 15:06:51 with joint meetings on the thu / fri 15:06:56 ShawnT has joined #ag 15:06:59 present+ 15:07:03 present+ 15:07:06 present+ 15:07:10 present+ 15:07:19 q+ 15:07:57 Katherine has joined #ag 15:07:57 Patrick_H_Lauke has joined #ag 15:07:57 Ben_Tillyer has joined #ag 15:07:57 BrianE has joined #ag 15:07:57 wendyreid has joined #ag 15:07:57 laura has joined #ag 15:07:57 GreggVan has joined #ag 15:07:57 denkeni has joined #ag 15:07:57 alice has joined #ag 15:07:57 alastairc has joined #ag 15:07:57 kirkwood has joined #ag 15:07:57 hdv has joined #ag 15:07:57 jyasskin has joined #ag 15:07:57 JeroenH has joined #ag 15:07:57 Adam has joined #ag 15:07:57 jedi has joined #ag 15:07:57 Rachael has joined #ag 15:07:57 mgifford2 has joined #ag 15:07:57 q? 15:07:59 jtoles has joined #ag 15:08:04 q+ Patrick_H_Lauke 15:08:07 scribe+ 15:08:08 AP: TPAC 2026, will be a hybrid event. located in Dublin. AG meetings Monday or Tues. 15:08:08 present+ 15:08:22 Makoto_U has joined #ag 15:08:25 Kevin: re-chartering- short extension, which we got late in the day because there were, we got, because there were some, uh, last minute changes requested 15:08:46 Dirk has joined #ag 15:09:03 ... have requested permission to progress that for AC review. 15:09:31 Monica has joined #AG 15:09:37 Gez has joined #AG 15:09:40 present+ 15:09:40 HV: I'm pleased to say that we had a great working session last Friday, and that's me and Jeroen Holscher and Steve Faulkner. 15:09:48 present+ 15:09:48 present+ 15:09:54 Poornima has joined #ag 15:09:59 ack Patrick_H_Lauke 15:10:02 ack me 15:10:05 ...we're at a version that feels like we're ready for note. So we're currently in discussion with with chairs. 15:10:24 https://lists.w3.org/Archives/Public/w3c-wai-gl/2026AprJun/0035.html 15:10:42 PL: I just wanted to draw people's attention to you've received yesterday my usual WCAG 2.x Task Force backlog meeting email. 15:11:11 present+ 15:11:15 Francis_Storr has joined #ag 15:11:15 laura has joined #ag 15:11:15 Patrick_H_Lauke has joined #ag 15:11:15 Ben_Tillyer has joined #ag 15:11:15 BrianE has joined #ag 15:11:15 wendyreid has joined #ag 15:11:15 GreggVan has joined #ag 15:11:15 denkeni has joined #ag 15:11:15 alice has joined #ag 15:11:15 alastairc has joined #ag 15:11:15 kirkwood has joined #ag 15:11:15 hdv has joined #ag 15:11:15 jyasskin has joined #ag 15:11:15 JeroenH has joined #ag 15:11:15 Adam has joined #ag 15:11:15 jedi has joined #ag 15:11:15 Rachael has joined #ag 15:11:15 mgifford2 has joined #ag 15:11:20 Patrick_H_Lauke++ 15:11:22 ... one's a tiny bit special, because it includes an either-or set of pull requests. 15:11:22 at the start. So we've been deliberating 15:11:31 joryc has joined #ag 15:11:33 present+ 15:11:52 imirfan has joined #ag 15:12:03 ... we've got two PRs that basically fall on either side of the fence on how to interpret it. So yeah, at this stage we'd be interested if people could have a look at it, thumbs up or comment. just so we can get a feel 15:12:16 .. for it. 15:12:55 zakim, next Item 15:12:55 agendum 3 -- Spike results: Progress to conformance score — https://docs.google.com/presentation/d/1XshX43F4167UtOU1XKDt7CHcpJyEm4iVhmZXJBp9840/edit?slide=id.p#slide=id.p -- 15:12:58 ... taken up [from Adam] 15:13:03 rm: we are, uh, implementing some of the comments back from the retrospective last week I did want to request that as people queue today to please go ahead and queue two, so queue plus to say, and put your topic in to help chairs manage topics. 15:13:45 Slideset: https://docs.google.com/presentation/d/1XshX43F4167UtOU1XKDt7CHcpJyEm4iVhmZXJBp9840/edit?slide=id.p#slide=id.p 15:13:46 Eloisa has joined #ag 15:13:46 Francis_Storr has joined #ag 15:13:46 laura has joined #ag 15:13:46 Patrick_H_Lauke has joined #ag 15:13:46 Ben_Tillyer has joined #ag 15:13:46 BrianE has joined #ag 15:13:46 wendyreid has joined #ag 15:13:46 GreggVan has joined #ag 15:13:46 denkeni has joined #ag 15:13:46 alice has joined #ag 15:13:46 alastairc has joined #ag 15:13:46 kirkwood has joined #ag 15:13:46 hdv has joined #ag 15:13:46 jyasskin has joined #ag 15:13:46 JeroenH has joined #ag 15:13:46 Adam has joined #ag 15:13:46 jedi has joined #ag 15:13:46 Rachael has joined #ag 15:13:46 mgifford2 has joined #ag 15:13:47 Detlev has joined #ag 15:13:52 present+ 15:14:07 [Slide 1] 15:14:56 [Slide 2] 15:15:05 Glenda has joined #ag 15:15:05 Eloisa has joined #ag 15:15:05 Francis_Storr has joined #ag 15:15:05 laura has joined #ag 15:15:05 Patrick_H_Lauke has joined #ag 15:15:05 Ben_Tillyer has joined #ag 15:15:05 BrianE has joined #ag 15:15:05 wendyreid has joined #ag 15:15:05 GreggVan has joined #ag 15:15:05 denkeni has joined #ag 15:15:05 alice has joined #ag 15:15:05 alastairc has joined #ag 15:15:05 kirkwood has joined #ag 15:15:05 hdv has joined #ag 15:15:05 jyasskin has joined #ag 15:15:05 JeroenH has joined #ag 15:15:05 Adam has joined #ag 15:15:05 jedi has joined #ag 15:15:05 Rachael has joined #ag 15:15:05 mgifford2 has joined #ag 15:15:05 Patrick_H_Lauke has left #ag 15:15:35 ac: We had two day virtual meeting a few weeks ago on conformance. So, various ideas were discussed in how the WCAG 3 conformance could be, could be different from WCAG 2. 15:15:42 ...3 of those ideas were given to small groups, and this is one of them. It was a progress towards conformance score. 15:16:26 ac: And this is literally copy and pasted from, you know, uh, some rough drafting earlier. 15:16:26 that it could be... the idea was it would show progress on the way to conformance. It would be a score, probably a percentage score. 15:16:34 [Slide 3] 15:18:01 [Slide 4] 15:18:30 [Slide 5] 15:18:58 [Slide 6] 15:19:58 [Slide 9] 15:21:33 [Slide 10] 15:21:44 Jen_G has joined #ag 15:22:04 Present+ 15:22:07 [Slide 11] 15:23:28 [Slide 13] 15:23:35 https://docs.google.com/spreadsheets/d/1tmh4NkJxur7zGCip3vtBd0utDrhN1nR5aZ_EAGnxDkE/edit?gid=0#gid=0 15:24:32 q? 15:25:41 ac: Each person could take, uh, would create a couple of tabs. We would copy across the, um, all the whole list of provisions. 15:25:47 ...We would have, columns per page or per view within whatever product it was we were looking at. 15:25:53 ...Each row had a provision. We were excluding assertions for the purpose of this 15:25:58 ...each cell would be, uh, probably numbered, actually, but we were focusing on pass or fail, or not applicable was counted as not a fail. 15:26:09 ... So, passing a not applicable were equivalent, um, so basically put in a 1 if it failed. 15:26:30 ... Giacomo, who had instance data, he was also putting in instances, rather than just a pass or fail. 15:27:02 q? 15:27:03 q+ to ask the audience or context for: ‘Show progress on the way to conformance’ 15:27:12 q+ 15:27:15 q+ 15:27:17 ack Charles 15:27:17 Charles, you wanted to ask the audience or context for: ‘Show progress on the way to conformance’ 15:27:24 ...so in general, this, the average score, we tried calculating using the average number of fails across conformance units. 15:27:29 ...So if you had 6 pages, how many pages did that fail across? 15:27:42 ...And then we've created an overall percentage for the scope of testing. 15:27:57 present+ 15:28:19 ch: The context question for this whole spike of work is, who is the audience or context in which this progress toward conformance would be made? 15:29:04 ac: this would be building on top of conformance. 15:29:58 ... I think quite useful for internal teams, if you're working on your product, what have you got left to do? 15:30:00 ack GreggVan 15:30:08 ... And, you know, how far away from that are you? 15:30:08 It can also be used for people like procurement 15:30:33 ... But yeah, there are various audiences. 15:31:05 Gregg: On the data sheet, that's the one that broke down all the provisions and tried to list their benefits for each different disability and give them a weight. 15:31:20 ... The goal there was to address a couple of issues 15:31:45 ... including - What if you solve the easy stuff, but the stuff that's really important you skip? So that's why, you know, 1, 2, 3 rating was included. 15:31:53 ... And the other one is, what if you solve all the problems for one group and leave the other group hanging? 15:32:00 ack Detlev 15:32:04 ... And that's the reason why it has 4 columns. 15:32:40 q? 15:33:14 detlev: Wondering whether this spike group considered having a graded result? 15:33:28 thought: in the US, orgs are risk averse. hence accessibility statements have “we strive to meet WCAG 2.2 AA” to strategically avoid any specific claim of conformance. it would take significant culture change to convince orgs to publicly disclose the progress toward conformance. 15:33:56 ac: yeah. So we did look at the first two case studies I'm will show , how to break down the instances. 15:34:05 ... we did talk about that kind of severity issue, and I will come to that in the conclusions. 15:34:24 [Slide 17] 15:35:24 Charles, it wouldn't be for use to ask organizations to disclose anything. This is really just providing a way to answer the questions around distance from conformance. It might be the case that regulators request that any such scoring be made public. 15:36:00 q+ to ask about the category weighted scores 15:36:05 [Slide 18] 15:36:57 q+ 15:37:29 ac: including the proportional instances seems to help. It correlates more closely to the user experience, 15:38:18 ack Ben_Tillyer 15:38:18 Ben_Tillyer, you wanted to ask about the category weighted scores 15:38:20 ... Given that this is just two sites, we would need to rerun this across more sites 15:38:34 ... from the other sites we were working on, that most sites score above 50% by default, even pretty awful ones. 15:38:45 ... You need to include a lot of different types of content in order to score worse than 50%. 15:38:54 q+ to speak to the categories use 15:39:06 ack GreggVan 15:39:43 BT: Amazing. The range that you've you've got for the per disability categories. And I was thinking that if I had those ratings of any product. 15:39:52 ... Beyond, I would probably look to ignore the the higher percentage points. If I was using the disability category. 15:40:28 gregg: Yeah, the question was, you said it had poor experience. and the, what does poor experience meant? 15:40:40 ... Is it for someone with a visual, or a cognitive, or a hearing, or physical, or with no disability? 15:41:06 ack Rachael 15:41:06 Rachael, you wanted to speak to the categories use 15:41:38 q+ 15:41:45 w3 has joined #ag 15:41:46 GP: So essentially I have created in the other sheet in the website sheet a visual score because the audience of this user testing was primarily people with visual disabilities. 15:41:49 graham has joined #ag 15:41:54 present+ 15:42:02 Present+ 15:42:07 ack GreggVan 15:42:25 q+ to ask about the weighting 15:42:34 I guess my impression of the categories was that they weren't going to so much be weighted as they were setting minimums to ensure. 15:42:46 rm: My impression of the categories was that they weren't going to so much be weighted as they were setting minimums to ensure. 15:43:06 ack Jennie_Delisi 15:43:06 Jennie_Delisi, you wanted to ask about the weighting 15:43:31 gregg: it might be useful to have two scores if you're doing this. One of them is overall, including pass if you don't have this, and one which is. 15:43:59 gregg: don't count it if you don't have it, um, because that would help you to not have somebody think it's really good when, in fact, it really... nobody's even tried to make it accessible. 15:47:30 w3 has joined #ag 15:47:30 Eloisa has joined #ag 15:47:30 Francis_Storr has joined #ag 15:47:30 laura has joined #ag 15:47:30 Ben_Tillyer has joined #ag 15:47:30 BrianE has joined #ag 15:47:30 wendyreid has joined #ag 15:47:30 GreggVan has joined #ag 15:47:30 denkeni has joined #ag 15:47:30 alice has joined #ag 15:47:30 alastairc has joined #ag 15:47:30 kirkwood has joined #ag 15:47:30 hdv has joined #ag 15:47:30 jyasskin has joined #ag 15:47:30 JeroenH has joined #ag 15:47:30 Adam has joined #ag 15:47:30 jedi has joined #ag 15:47:30 Rachael has joined #ag 15:47:30 mgifford2 has joined #ag 15:47:45 q+ to ask about option to amplify the difference 15:47:54 ack Jennie_Delisi 15:47:54 Jennie_Delisi, you wanted to ask about option to amplify the difference 15:48:11 ac: it's worth saying, so this website, the first website, is a big content site with a lot of stuff on each page, which is why you've got so many checks. 15:48:11 And why, you know, because you're having to count every instance that could have failed. 15:48:23 ahick has left #ag 15:48:57 q+ to mention the opposite of these "high" numbers 15:49:08 jd: If we are using the scale of 100. for the percentage-wise into limiting away. So if you're saying the biggest range is between 60 and 95%. 15:49:10 ack giacomo-petri 15:49:10 giacomo-petri, you wanted to mention the opposite of these "high" numbers 15:49:13 ...If that number could then have a way of getting amplified in terms of its granularity, we might notice a greater difference between certain ones, and I'm not sure if this. 15:50:14 This is fantastic information - really helpful 15:50:35 GP: If we look at the, not the average value, but the final success criteria that are passing, they are 19 out of 55, which in terms of conformance, it is a very low score. 15:50:48 ......But this is not reflected by the user testing. So it's a tricky one because, yeah, probably with those with distance towards conformance, numbers are quite high. 15:51:10 ...But on the other end, the current conformance score based on pass-fail success criteria, it is the opposite, so you are scoring very low, even though your website is not so bad. 15:51:51 ac: the way I'm thinking about that is because this site has really big pages with a few issues on it that are spread out, but there's quite a variety of issues across the pages as well. 15:51:58 ...It sort of hits all these triggers, but somebody who's actually using the site is only going to be encountering a subset of those, because they don't explore the whole of every page. 15:52:39 [Slide 21] 15:53:50 q? 15:53:59 rrsagent, make minutes 15:54:00 I have made the request to generate https://www.w3.org/2026/06/02-ag-minutes.html Adam 15:54:50 ac: Andrew Hick had an unweighted score. We've got the average pass rate, and then the weighted score. 15:55:30 [Slide 23] 15:56:56 [Slide 24] 15:58:18 [Slide 25] 15:59:21 ac: the judgment call on severity was the hardest part of training. Although the results were fairly consistent. 16:00:01 [Slide 26] 16:00:30 q+ 16:00:35 q+ 16:00:37 ac: unless it was built without responsive design, by the time the team reached 4.8, all of the functional needs were supported except for blind and motor impaired people 16:00:45 Heather has joined #ag 16:00:55 scribe+ 16:03:07 alastairc: [Slide 29] When trying to align the score with user experience for pwd, the following factors: Severity, proportionality, provision level, and disability categories. Thinks we may need to add a 'safety' level. 16:03:26 q+ to ask about positive gamification 16:04:40 is the link to the presentation in IRC? 16:05:18 thanks, Adam! 16:06:05 q? 16:06:08 alastairc: Reads [slide 30-31] Adding complexity to the site, you are more likely to have barriers. There's a potential, depending on how the provisions are written to create reverse incentive for things. This would handle cases for not applicable. 16:06:11 ack GreggVan 16:06:16 anton has joined #ag 16:06:38 present+ 16:06:43 GreggVan: One slide said "Except people who are blind and have physical disabilities" - does this mean one or the other, or people that have both. 16:06:51 alastairc: Good question, unsure, but can find out. 16:07:23 GreggVan: Idea of subtracting 50 from the score, thinks it might be a good idea because it seems that anything that scored less than 50 would be pretty much inaccessible. 16:07:26 ack wendyreid 16:07:26 wendyreid, you wanted to ask about positive gamification 16:08:21 q+ to say I think gamification is outside of conformance 16:08:33 q+ 16:08:36 q+ on gamification and positive efforts 16:09:39 wendyreid: Noticed that we put notes about gamification, and wonders if there's a positive side to gamification that could be considered. Knowing that gamification can positively influence motivate organizations to strive to achieve. One of the possible conclusions is around severity, and maybe give examples and guidance. for example, checkout flow 16:09:39 has a checkout button that's not labeled is more severe than an unlabeled banner. People need to understand how to get the highest gain in both usability and in the score, this might be motivating. It's hard to determine where to put effort to get those numbers up. 16:09:43 ack Rachael 16:09:43 Rachael, you wanted to say I think gamification is outside of conformance 16:09:48 q+ to say that Gamification and Gaming are two completely different things. (the fact that they use the same root "game" is a quirk of English) Gamification is great as Wendy pointed out. Gaming the system is not. 16:09:48 q+ to ask if subgroup considered tiers? Namely, 50%=F 60%=D 70%=C 80%=B 90%=A and then S tier 16:10:30 present+ 16:10:37 ack GreggVan 16:10:37 GreggVan, you wanted to say that Gamification and Gaming are two completely different things. (the fact that they use the same root "game" is a quirk of English) Gamification is 16:10:41 ... great as Wendy pointed out. Gaming the system is not. 16:10:56 Rachael: Chair hat off. I think gamification downside for strictly conformance hooks would be naturally there if we decided to go with a scoring conformance model. Gamification really does support organizational motivation, it's a little harder to build into law, but it would be a great place of a policy document also applies to orgs to talk about 16:10:56 how to use the conformance model to gamify and promote accessibility improvements. 16:11:27 GreggVan: Wanted to point out that gamification and gaming the system are completely different concepts. 16:11:34 ack alastairc 16:11:34 alastairc, you wanted to comment on gamification and positive efforts 16:11:36 GreggVan: and +1 to Bruce comment 16:12:03 [Slide 30] 16:12:04 q+ 16:12:42 q+ to mention flows as part of this too 16:13:31 q+ to say have some standardized severity items and room for more 16:13:36 ack bbailey 16:13:36 bbailey, you wanted to ask if subgroup considered tiers? Namely, 50%=F 60%=D 70%=C 80%=B 90%=A and then S tier 16:13:39 Did subgroup considered tiers? Namely, 50%=F 60%=D 70%=C 80%=B 90%=A and then S tier for assertion of full conformance plus supplemental best practices. 16:13:51 alastairc: Chair hat off to reply to Wendy. Severity: we can take is the most useful thing in terms of helping align whether we can gamify that. The problem I see is that it's very hard to consistently say what things would be considered severe across the completely different kinds of interface that this gets applied to the different sizes. 16:13:51 Suggestion would be to keep the conformance scoring as simple as possible, but to build on that. You may sit down as an organization and determine what are the most important tasks (within the product) for us and the users, and document if there's a legal case, or regulator knocking on your door. You need to take that into account severity and 16:13:51 provide recommendations on what to do next. 16:14:22 s/Suggestion would be/... Suggestion would be/ 16:14:24 s/provide recommendations/... provide recommendations/ 16:14:34 s/how to use the conformance model/... how to use the conformance model/ 16:14:44 bbailey: Love the distance from conformance, and is fascinated about how much work all the math is. People going to go full conformance claiming some of the supplemental best practices, and they could be put in X tier. 16:14:44 s/has a checkout button/... has a checkout button/ 16:14:51 ack Ben_Tillyer 16:15:00 alastairc: Focus on the distance from conformance, and think about ranges within. 16:15:45 q+ 16:15:48 in practice that ‘severity” can depend on the number of occurrences on a site. or the “importance” of the path blocked. even the constiuant group blocked etc. 16:15:52 q+ to say keep the math simple and visible 16:16:07 q+ on Ben's comment, and how you differentiate large interfaces from small 16:16:11 ack wendyreid 16:16:11 wendyreid, you wanted to mention flows as part of this too 16:16:11 q+ 16:16:17 Ben_Tillyer: Thinking about how to widen this number of what range we're looking at. Wonders if there's something we can do in terms of looking at the number of defects on a site and the disabilities that may be impacted for the site. Perhaps a progressive exponential scoring for each defect, instead of them being weighed all the same? 16:16:26 q+ 16:16:38 q+ to talk about "counting" issues 16:18:18 ack Jennie_Delisi 16:18:18 Jennie_Delisi, you wanted to say have some standardized severity items and room for more 16:18:19 wendyreid: Thinks there are implications, and challenges is implying things like flows. There's "like" things, such as occurrences, location of occurrences. There may be repetitive elements across pages (such as a website header), and it may be frustrating to continually come across the same defect. How could we account for the types of things? Are 16:18:19 they additive so they are weighted? Segment it? We want to make sure, especially in large organizations that have different teams split into different parts, that those are communicated and accounted for. 16:19:26 q+ to also ask about severity definitions and how they apply across interfaces 16:19:36 ack Rachael 16:19:36 Rachael, you wanted to say keep the math simple and visible 16:20:00 Jennie_Delisi: Want to make a point on a phrase that was in the severity levels. There is the internal reporting around this, but then there's the view reporting from multiple products as a comparison method to see which product would be more accessible to their users. If we can at least have a standardized list of severity definitions that would 16:20:00 reduce the requirement to have nondisclosure agreements gathered prior to being able to view this kind of information. I think we can better support people choosing accessible products will help the goal long-term. 16:20:25 ack me 16:20:25 alastairc, you wanted to comment on Ben's comment, and how you differentiate large interfaces from small and to also ask about severity definitions and how they apply across 16:20:28 ack alastairc 16:20:29 ... interfaces 16:20:35 Rachael: Chair hat off. The more complex the math, the higher the risk of non-adoption. Want to continually consider this as we figure out how to score. 16:20:53 s/they additive so they are weighted/... they additive so they are weighted/ 16:20:59 +1 to Alastair 16:21:02 s/reduce the requirement to have/... reduce the requirement to have/ 16:22:38 q? 16:22:42 ack HaTheo 16:22:51 alastairc: Chair hat off, on Ben's comment about increasing the weighting by the number of instances. It increases the complexity, but it could also distort things because of the size of the product. It also applies to apps, such as watch apps, which have a more constrained interfaces, or a govUK page, There's such a variation of results. If you 16:22:51 went down the weighting, then you might be penalizing products that have a much larger interface. Also about Jenny's point about severity defitnions, there's a question around how you would have a definition of severity. I think severity is one of those things that you need to sit down and define how that should work in your context. 16:23:05 I am also worried about the maths, but that the numerical results maps to ABCDF seems like a happy coincidence that makes scoring more approachable. 16:23:56 q+ on needing a buckload of data to work out a normalised score. 16:24:15 ack giacomo-petri 16:24:15 giacomo-petri, you wanted to talk about "counting" issues 16:24:20 +1 to theo's offer ! 16:24:22 HaTheo: Understanding Rachael's point, if you increase the complexity of the way you calculate the score, you might influence adoption. Another question is hard is it for companies to calculate the score, and interpret the score? In the sense that looking at the score; offered to write down the comment (yes please do HaTheo) 16:25:30 ack me 16:25:30 alastairc, you wanted to comment on needing a buckload of data to work out a normalised score. 16:25:31 +1 to Giacomo 16:25:32 giacomo-petri: Want to echo Alastair said about the number of issues is definitely a key factor. There is a risk of disproportionate penalizing larger pages. and also severity is important, but it is again subjective - there are just too many factors that affect the counting and the severity thing, and it's not easy. 16:25:39 +1 to context in severity, want to add disability to context list 16:26:11 Also the severity can depend on many factors, including the business priorities of the content owner. 16:26:28 q+ 16:26:35 ack GreggVan 16:26:51 alastairc: Wanted to reply to Theo, chair hat off. Having a normalized average would require a bucket load of data. We'd have to have good test results across a wide range of sites. Would need a plan and somebody to pay for it. Getting a rant of text data of sites and working out what those ranges would be to give a more useful representation. 16:26:59 severity examples could include: possible loss of life issues, severe weather information, etc. 16:27:05 agree C passes 16:27:05 q+ 16:27:12 +1 to clearly defining what level passes/conforms fully 16:27:14 The regulator can say what grade is good enough. 16:27:14 ack JeroenH 16:27:14 GreggVan: +1 to Bruce's idea, and thought about using grades, but caution using grades after thinking through it. 16:28:07 JeroenH: We used the ABCDE scale in the Netherlands as well. We don't have the misinterpretation of people that C might be good enough. It depends on how you interpret the scale. We use an interpretation more like energy label on the scale. 16:28:13 q? 16:28:23 alastairc: Pulled up an image showing ABCDE on an energy scale. 16:30:04 alastairc: Wrapping this up, we did briefly consider some potential reporting requirements, and to emphasize, this isn't conformance. This is separate from conformance and building on that to allow people to differentiate between not passing at all and being fully conformant. It might help to include a could have factors. As part of a conformance 16:30:04 claim, you're going to state which pages are included and for this scoring, it might help to include things like media count, component count, or other things which show how much surface area you've got as part of your interface. 16:30:13 s/claim, you're going/... claim, you're going/ 16:30:38 Potential metrics. Also used in architecture: Percentage of Compliant Elements. Distribution Ratios. Accessible Route Coverage. Task Success Rate. Wayfinding Efficiency 16:30:50 q+ 16:30:57 ack kirkwood 16:31:04 alastairc: Invites others to contribute, especially those with testing data, and emphasized this is a very generic thing. This is for the next editor's draft. It could be its own section within WCAG 3 or a completely separate document. 16:32:19 GSA has quite a bit of data from 2024 and 2025 annual survey: https://www.section508.gov/manage/section-508-assessment/ 16:32:40 https://www.section508.gov/manage/section-508-assessment/2025/assessment-data-downloads/ 16:32:55 q? 16:33:12 kirkwood: Comment in IRC above. May want to consider paralleling some of the metrics around architectural accessibility. Percentage of compliant elements, distribution ratio of the compliance. Tests the success rate and way finding efficiency, and usability and the accessible path. Maybe somethings that we're overlooking that can be included here. 16:33:12 A little concerned that we maybe have not had quite enough feedback within the blind community for what we're doing. 16:33:45 bbailey: Has quite a bit of data with downloads with WCAG 2 criteria type. The federal gov settled on compiling. 16:34:07 Poll: Should we continue to test the Progress to Conformance score? 16:34:16 +1 16:34:16 alastairc: Wants to take a general temperature of the group. 16:34:20 Glenda has joined #ag 16:34:31 +1 16:34:31 +1 16:34:35 +1 16:34:37 +1 16:34:39 +1 16:34:41 +1 16:34:48 +1 16:34:50 alastairc: +1 if you think this has been useful and we should carry on testing it (getting a wider set of data) or minus one if you don't think we should. 16:34:53 0 because based on previous conversations I am not sure the group is going to be OK with some core provisions not being "required" 16:34:54 +0 Interesting, but I have concerns that might be called concerns over category confusions 16:34:59 +1 testing will reveal something not yet considered 16:35:02 if this builds on conformance and we do not knoe yet how to measure that, this might better be on the back burner? 16:35:02 SydneyColeman has joined #ag 16:35:18 +0 16:35:23 0, I don't know if more data would be beneficial to 'prove' anything with this approach. 16:35:24 w3 has joined #ag 16:35:25 +0 16:35:40 q+ 16:35:48 q+ to answer 16:35:57 ack giacomo-petri 16:35:57 giacomo-petri, you wanted to answer 16:36:03 wendyreid: what about usability testing as part as a reference point? 16:36:04 +1 16:36:59 0 share Delev's concern. 16:37:05 q+ 16:37:12 +0 too confusing or complex for general public 16:37:38 q+ on difficulties 16:37:51 +1 to this direction, it’s a really solid starting point. However, to actually apply this at the scale of the broader audience reading WCAG, we need to introduce more rigor I think. Without data normalization and tracking Inter-Rater Reliability (IRR), the current testing approach just isn't ready to handle that kind of volume and variance... I 16:37:51 also would be happy to help think about this. 16:38:20 giacomo-petri: Essentially these are testing subdivided in tasks. Don't want to guide the user in a defined path. In one of the sheets, there is our internal score which is really a progress toward conformance which is not intended to measure usability or the effective accessibility of the products, since we count all the issues and the total of 16:38:20 numbers of checks that we perform. It is a metric that helps you understand how many things you're doing right and how many things you're doing bad. The difference is the effort that you have to put essentially to reach conformance, but has nothing to do with the usability of the product. There's a lot of subjectivity aspects that are tricky to 16:38:20 manage objectively. 16:38:22 q? 16:38:23 FYI regarding building accessibility metrics: Building accessibility conformance is typically measured using physical design standards, spatial metrics, and functional evaluations rather than a single numerical score. The specific metrics used depend on the regulatory framework being enforced. 16:38:28 ack Detlev 16:38:39 s/also would be happy to help/... also would be happy to help/ 16:38:46 giacomo-petriThe user testing is performed by PWD following specific tasks without too much guidance that would jeopardize the user testing itself. 16:38:48 s/numbers of checks that we perform/... numbers of checks that we perform/ 16:38:53 q+ to ask about required provisions 16:38:56 s/manage objectively/... manage objectively/ 16:39:06 ack alastairc 16:39:06 alastairc, you wanted to comment on difficulties 16:39:08 s/giacomo-petriThe/giacomo-petri The/ 16:39:25 s/giacomo-petri The/giacomo-petri: The/ 16:39:52 (unknown); consern that with this score towards conformance. I feel we are dancing around measuring conformance for a number of years now, and if this builds on conformance it might be wiser to try to make difficult decisions regarding how we're going to measure conformance and then move into the things around this. 16:40:32 q+ to ask about the core goal of conformance 16:41:10 s/(unknown); consern/Detlev: concern/ 16:41:42 q+ 16:41:47 Heather, "unknown" was Detlev speaking 16:41:54 q? 16:42:02 ack Rachael 16:42:02 Rachael, you wanted to ask about required provisions 16:42:04 alastairc: There's a starting assumption that conformance is meeting conformance. There's a basic conformance and assumptions. Comments about more data 'proving' anything. We might be able further understand the barrier scores with numbers that are close. We had the correlation between a fairly simple score and the subjective, if we had the 16:42:04 usability testing score, and had the correlation it could be useful. 16:42:12 s/usability testing score/... usability testing score/ 16:42:12 alastairc Would be interesting to see the barrier score correlation for each major disability group 16:43:09 q+ to say "if there is a good correlation between disabilty fails and conformance -- would there not also be the same correlation between passes and conformance? If not can you explain: (or is it too mathy) 16:43:31 AWL That would be possible for the 4 categories we use (keyboard / low-vision / screenreader / cog + deafness) 16:43:39 q? 16:43:47 s/ disability fails/provision fails/ 16:43:48 ack wendyreid 16:43:48 wendyreid, you wanted to ask about the core goal of conformance 16:44:06 Rachael: Thinks this data is helpful and it shows some promise with scoring. We had a conversation about a year ago that basically said the group was not comfortable with having every provision up for grabs. Basically what the said is in the current version is what the group was fairly comfortable with as must pass. With that baseline, wants to 16:44:06 clarify if there is a different way of looking at conformance where we focus on the core, and weight them higher, if there's another way to look at it from a reporting standpoint separate from conformance. (chair hat half on, half off). 16:44:16 s/clarify if there is a different/... clarify if there is a different/ 16:44:19 I feel like the conformance model is being pass fail today is not loved as it makes things like filling out ACRs really nebulous. I think the issue is that these are really hard to run a program on, as it's not motivating to compare across products and track progress if you don't know what is "normal" vs "exceptional" 16:44:47 q+ to talk about "stop when you reach a level" 16:46:12 +1 to exploring a strict "conformance model" and additional approaches as part of a policy/reporting document 16:46:33 q+ 16:46:51 wendyreid: Questioning how we frame this, or how we frame this for ourselves. We all want the web to be accessibility, but being completely realistic is that it's really hard to be 100% conformance. There are different pathways to get there. Framing this is critical. Like with any other spec, to conform to WCAG 3, you must achieve all the 16:46:51 applicable criteria to your product. Knowing this is difficult, here are the ways you can approach that: leveling, grading, tiers of criteria, etc. To track progress, measure progress or measure achievement, here are the different grads and stages to get to, and the challenge to make them approachable and possible? 16:46:52 ack GreggVan 16:46:52 GreggVan, you wanted to say "if there is a good correlation between disabilty fails and conformance -- would there not also be the same correlation between passes and conformance? 16:46:53 +1 to Wendy. Been working out how to put this into words. Give people different routes that we know have some benefits, ask them to pick one and to justify why? 16:46:55 ... If not can you explain: (or is it too mathy) 16:47:03 s/applicable criteria to your product/... applicable criteria to your product/ 16:47:03 +1 to Wendy 16:47:06 +1 to wendyreid 16:47:30 q+ to agree with Wendy and dealing with flashing 16:47:50 q+ to mention federal expectation is full conformance 16:48:09 GreggVan: +1 to Wendy's comments. If we strive for perfection, anything short is a failure. If we can combine Rachael's, let's make sure we can get all the critical showstoppers and provide a path for people to keep moving up that would be good. Another question: You said there's a high correlation between fails and fonromance or something. 16:48:09 Wouldn't there not be the same correlation with passes since fails and passes where you can submit one by subtract the other or am I missing something? 16:49:04 ack giacomo-petri 16:49:04 giacomo-petri, you wanted to talk about "stop when you reach a level" 16:49:11 s/Wouldn't there not/... Wouldn't there not/ 16:49:17 alastairc: Comparing two things, one was WCAG fails, and to reverse that. The barrier score is subjective based on impact of those fails. So to me this is saiying the more WCAG files you have, the more likely there are to be more impactful issues. Subjective by the auditor who looks at each issues and determines that it's going to block you, or its 16:49:17 not that important. 16:49:24 s/not that important/... not that important/ 16:50:04 +1 to Giacomo 16:50:21 klea has joined #ag 16:50:24 giacomo-petri: Agrees with Wendy's point, understands that you need to ensure that companies are making progress and they don't stop when they reach the phantomatic legal requirement. This can be addressed by policies but especially in the context of large companies, we need to acknowledge that maintaining a consistent level is not as 16:50:24 straightforward as it might seem due to frequent updates. 16:50:25 ack Stephanie 16:50:33 s/straightforward as it might/... straightforward as it might/ 16:51:41 +1 to taking the training aspect and novice perspective in mind 16:52:02 q? 16:52:03 ack me 16:52:04 alastairc, you wanted to agree with Wendy and dealing with flashing 16:52:15 Stephanie: Agrees with Giacomo and Wendy. From my perspective, we know we're talking to companies who provide products for Sydney government who have non accessibility specialists, people need to be told what to do. 250 departments and 600k staff is a huge scale. The more rigid conformance model the better. Not everyone is reading the WCAG spec. 16:53:46 +1 on supporting, sorry Adam 16:53:55 q? 16:54:29 ack bbailey 16:54:29 bbailey, you wanted to mention federal expectation is full conformance 16:54:39 alastairc: Anytime we try to define severity, or which parts of the product should be included, we're going to run into the problem of the infinite variety. What this means is the variation of digital interfaces. It's hard to define that kind of thing, and is better left of to people to make those judgement calls. The case of judgement doesn't 16:54:39 really sit in the conformance model, so I agree with that. The questions are really: Do these numbers actually help? Do the scores actually make sense. Not sure at the beginning if they did, but then considering the granularity. Thinks there should be immediate non-conformant failures (like flashing). 16:54:47 present+ 16:54:51 s/really sit in the conformance model/... really sit in the conformance model/ 16:55:29 qq+ to answer alastair 16:55:34 bbailey: Want to plug the Section 508 data again. It's a question of quality. Self-reported versus audited. 16:55:36 ack Rachael 16:55:36 Rachael, you wanted to react to bbailey to answer alastair 16:56:15 q? 16:56:20 alastairc: We don't have the audit results, we could create scoring for those, but we've got some usability measure, and it's tricky to compare the data. 16:56:46 Rachael: Side conversation with Bruce and I was having us get the usability side of ti to be able to compare it. thinks we can get some of it. 16:56:56 q+ 16:56:58 bbailey: There are some cover pages, summary sheets for each, I think it's there. 16:57:07 ack Rachael 16:57:13 bbailey: There are some cover pages, summary sheets for each agency. 16:57:13 agenda? 16:57:45 present+ 16:57:50 Charles has left #ag 16:58:00 present+ 16:58:04 present+ 16:58:15 present+ 16:58:22 anton has left #ag 16:58:28 alastairc End of question, end of presentation, from the poll, it looks like we're supporting continuing. Asks for other members to contribute to the conformance subgroup. 16:58:49 janina has left #ag 16:58:50 Adam: Thank you to everyone for participating. see you next week. 16:59:40 zakim, end meeting 16:59:40 As of this point the attendees have been stevef, CClaire, Heather, giacomo-petri, Rachael, NatTarnoff, shadi, Helen, AWK, ryomtoob, wendyreid, kevin, bbailey, HaTheo, 16:59:43 ... Laura_Carlson, ahick, Jennie_Delisi, hdv, julierawe, Charles, Stephanie, sam-estoesta, LoriO, BrianE, GreggVan, Ben_Tillyer, janina, alastairc, ShawnT, Adam, jtoles, Monica, 16:59:43 ... kirkwood, Gez, Poornima, Francis_Storr, Detlev, Jen_G, Eloisa, graham, w, anton, JeroenH, Glenda, Makoto_U 16:59:43 RRSAgent, please draft minutes v2 16:59:44 I have made the request to generate https://www.w3.org/2026/06/02-ag-minutes.html Zakim 16:59:51 I am happy to have been of service, Adam; please remember to excuse RRSAgent. Goodbye 16:59:51 Zakim has left #ag 18:41:17 laura has joined #ag 20:06:24 klea has joined #ag 20:20:21 Glenda has joined #ag