14:54:03 RRSAgent has joined #voiceinteraction
14:54:03 logging to https://www.w3.org/2021/03/24-voiceinteraction-irc
14:54:21 meeting: Voice Interaction Community Group
14:54:27 chair: debbie
14:54:33 scribe:ddahl
14:55:55 agenda:https://lists.w3.org/Archives/Public/public-voiceinteraction/2021Mar/0005.html
15:04:44 topic: dirk's walkthrough updates
15:05:00 https://w3c.github.io/voiceinteraction/voice%20interaction%20drafts/paArchitecture-1-1.htm#walkthrough
15:05:36 dirk: does walkthrough match description?
15:06:30 ...this is just the subset that matches part of the text
15:07:08 action: debbie to update text description in Section 4
15:07:48 dirk: reviews Figure 3
15:09:15 dirk: combination of local stuff and remote
15:09:58 ...knowledge graph can be used to find location given GPS
15:12:13 dirk: ASR is sent in parallel to both NLU's (local and remote) (step 6)
15:13:16 jim: what information is given to provider registry?
15:13:34 ...how does the provider registry work?
15:13:48 dirk: doesn't know anything about utterance semantics
15:14:05 ...it just tries to determine the number of registered IPA providers
15:14:14 ...it can use context
15:14:55 ...or user preferences
15:15:26 jim: what do input and output from provider registry look like?
15:17:25 dirk: provider selection service gets the ASR result
15:18:49 debbie: what information is in step 9
15:20:11 dirk: it's always the same, just a query for all providers
15:21:24 ...the provider registry could be more or less smart
15:22:02 jim: after provider selection service gets an ASR result it queries the Provider Registry for all providers
15:22:41 debbie: what about steps 10 and 11?
15:24:28 dirk: for each provider returned from the registry, authentication information is returned
15:25:14 jim: suggest sending NLU to provider selection service instead of ASR result
15:25:28 dirk: or could just send the raw audio
15:26:08 jim: now the Provider Registry sends all the providers, if we use NLU we could be more selective about what providers we return
15:26:30 dirk: this is the provider selection selection strategy
15:26:47 s/selection selection/selection
15:27:27 dirk: this is the minimum and the overall system could be smarter
15:28:33 dirk: then arrow 6 could be after the NLU
15:28:59 dirk: providers could use their own NLU
15:29:42 jim: what's the difference between the orange (Dialog) NLU and the specific NLU's in the IPA providers
15:30:43 jon: what if IPA provider 1 is American, then it could have full complement of components and functionality.
15:31:25 ...we now have text coming from the Dialog box
15:31:58 dirk: could come up with different walkthroughs describing different alternatives
15:32:28 jim: is one of the goals to enable IPA providers to support any input
15:33:05 ...are the inputs in step 13 in a standardized format
15:33:32 dirk: 1 -- raw audio, raw ASR or NLU
15:34:04 jim: suggest restricting to one alternative
15:34:46 debbie: I agree
15:36:59 debbie: maybe rule out NLU as a format
15:37:13 dirk: raw audio would be the most common
15:37:44 ...or text from a chatbot or ASR
15:38:16 jim: future enhancement would be to standardize NLU, but not covered yet
15:39:52 jim: what is the purpose of the NLU in the Dialog box?
15:40:07 dirk: that's the local NLU that processes some of the queries
15:41:08 action: dirk to change arrows to reflect the primary use case
15:41:32 topic: output side of walkthrough
15:41:58 dirk: arrow 1 is the various NLU results
15:42:34 dirk: asks dialog registry for the best dialog
15:42:57 dirk: may need to query for missing slots
15:43:50 ... gets next dialog move
15:44:20 ...NLG comes up with text, doesn't repeat information that's already known to the suer
15:44:24 s/suer/user
15:44:49 jim: we have a distributed semantic processing, some done in orange and some in blue
15:45:23 dirk: yest
15:45:28 s/yest/yes
15:45:40 jim: this might be too confusing
15:46:04 dirk: there are some implementations but not standarized
15:47:13 debbie: how do you find the Dialog that goes with the NLU
15:48:16 ...if the intent is "book flight" then there could be lots of dialog that could handle that intent
15:49:20 ...how do you associate that intent with the appropriate dialog
15:51:39 ...so actually intents and entities come back
15:52:01 ... and that's mapped to dialogs
15:52:44 dirk: that's correct
15:54:00 debbie: when do you actually get your information from the database
15:55:53 dirk: we haven't gotten to that yet, I could add that
15:56:40 dirk: things need to be clearer
16:00:10 action: debbie to pull together list of outstanding comments
16:01:18 dirk: also collected some suggestions from a presentation
16:01:42 ...what should we do to prove that we did what you describe
16:02:27 jim: eventuall there will be conformance tests
16:02:43 s/eventuall/eventually
16:03:15 rrsagent, make logs public
16:03:22 rrsagent, format minutes
16:03:22 I have made the request to generate https://www.w3.org/2021/03/24-voiceinteraction-minutes.html ddahl
18:40:02 RRSAgent has joined #voiceinteraction
18:40:02 logging to https://www.w3.org/2021/03/24-voiceinteraction-irc
18:40:18 present:debbie,dirk,jon,jim
18:40:26 rrsagent, format minutes
18:40:26 I have made the request to generate https://www.w3.org/2021/03/24-voiceinteraction-minutes.html ddahl
18:43:28 ddahl has left #voiceinteraction