This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 23559 - Clarify recognition when not in focus
Summary: Clarify recognition when not in focus
Status: RESOLVED FIXED
Alias: None
Product: Speech API
Classification: Unclassified
Component: Speech API (show other bugs)
Version: unspecified
Hardware: All All
: P2 normal
Target Milestone: ---
Assignee: Glen Shires
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2013-10-18 01:04 UTC by Glen Shires
Modified: 2013-11-06 21:48 UTC (History)
0 users

See Also:


Attachments

Description Glen Shires 2013-10-18 01:04:33 UTC
There are some very useful and important use cases for allowing speech recognition in a tab that has lost focus; including accessibility, dictating notes while viewing other tabs, and command-and-control.

To alert users when speech recognition is active, Section 4 already states "User agents must only start speech input sessions with explicit, informed user consent" and "User agents must give the user an obvious indication when audio is being recorded."

Thus I propose we delete the sentence "To minimize the chance of users unwittingly allowing web pages to record speech without their knowledge, implementations must abort an active speech input session if the web page lost input focus to another window or to another tab within the same user agent." This sentence only adds use case restrictions without adding any real protection, particularly when considering that a malicious web page can enable recognition without providing any visual indication on the web page.  The real protection is through the "obvious indication" provided by the user agent.

Also, specifications such as WebRTC do not place similar restrictions on web page focus.
Comment 1 Glen Shires 2013-10-23 20:27:53 UTC
Based on the above reasoning, I propose the following errata.
If there's no disagreement, I'll add this to the errata page on November 6.

Section 4 Bullet 4: Delete the following sentence:
 "To minimize the chance of users unwittingly allowing web pages to record speech without their knowledge, implementations must abort an active speech input session if the web page lost input focus to another window or to another tab within the same user agent."
Comment 2 Glen Shires 2013-11-06 21:48:54 UTC
I've updated the errata with the above change (E14):
https://dvcs.w3.org/hg/speech-api/rev/c4b3451ce85c

As always, the current errata is at:
http://dvcs.w3.org/hg/speech-api/raw-file/tip/speechapi-errata.html