This charter is written in accordance with the W3C Process, section 4.2.2 (Working Group and Interest Group Charters).
$Date: 2002/09/20 13:48:21 $Author: dsr $
— Voice enabling the Web!
The Voice Browser Working Group was originally chartered in February 1999 with the goal of extending the Web to support access from any telephone to suitably designed applications. Users would be able to use their voice for input and their ears to listen to recorded and synthetic speech, music and other sounds. The Working Group is now being rechartered on a royalty free basis under the terms of W3C's Current Patent Practice; see section 14 of this charter for details. This follows the 13 June 2002 Director's Decision (Members only) on the results of the Voice Browser Patent Advisory Group. The Working Group will focus on driving VoiceXML and associated specifications through to Recommendation status, as well as continuing work on new features, based upon extensive industry experience with interactive voice response systems. Scope and deliverables for the Working Group are identified in sections 2 and 3.
Far more people today have access to a telephone than to a computer with an Internet connection. In addition, sales of mobile phones are booming, so that many of us already have, or soon will have, a phone within reach wherever we go. Voice Browsers offer the promise of allowing everyone to access Web based applications from any phone, making it practical to access the Web any time and anywhere, whether at home, on the move, or at work.
It is common for companies to offer services over the phone via menus traversed using the phone's keypad. Voice Browsers offer a great fit for the next generation of call centers, which will become Web portals to the company's services and related websites, whether accessed via the telephone network or via the Internet. Users will be able to choose whether to respond by a key press or a spoken command. Voice interaction holds the promise of naturalistic dialogs with Web-based applications using speech synthesis, pre-recorded audio, and speech recognition. Voice interaction can escape the physical limitations on keypads and displays as mobile devices become ever smaller.
By switching to markup and Web-based technologies, it becomes much cheaper and easier to develop interactive voice response applications. Users dial into voice browsers that in turn download VoiceXML and other resources from web servers. Information supplied by authors can increase the robustness of speech recognition and the quality of speech synthesis. Text to speech can be combined with pre-recorded audio material, which can be used to enliven the user experience in a similar manner to the use of images in visual content. The lessons learned in designing for accessibility can be applied to the broader voice browsing marketplace, making it practical to author content that is accessible to people using Braille based browsers, even if they are unable to hear or see.
W3C held a workshop on "Voice Browsers" in October 1998. The workshop brought together people involved in developing voice browsers for accessing Web based services. The workshop concluded that the time was ripe for W3C to bring together interested parties to collaborate on the development of joint specifications for voice browsers. As a response, W3C set up the "Voice Browser" Working Group. The Working Group is now being rechartered on a royalty free basis to drive the existing work through to Recommendation status, and to develop support for new features based upon extensive industry experience.
The Working Group started by developing a suite of requirements and followed up with work on the corresponding specifications.
The Working Group suspended work on several areas to free up teleconference time for VoiceXML. Work currently suspended includes stochastic grammars (N-Grams), pronunciation lexicon, and voice browser interoperation. In addition, the natural language semantics markup language (NLSML) specification has been transferred to the multimodal interaction activity. The Voice Browser Working Group is cooperating with the CSS Working Group to develop a replacement for the CSS2 aural properties. This is expected to result in a couple of modules for CSS3, one for speech synthesis, based upon SSML, and another for adding aural effects to visual web pages.
The Voice Browser Working Group is tasked with the development of specifications covering the following goals:
The Working Group is free to prioritize these goals as appropriate, and to drop individual goals, e.g. if there is insufficient interest or there are not enough resources to meet them in the timeframe set out in Section 7.
The Working Group is expected to cooperate with other W3C Working Groups (see Section 9). The Working Group will also serve as a coordination body with existing industry Groups working on related specifications, and will provide a pool of experts on voice browsers, some of whom will participate in the other W3C Working Groups relevant to voice browsers.
This Section describes an initial set of deliverables for achieving the goals stated in Section 2. At the discretion of the Chair, the Working Group can adapt this set as needed during the course of its work. However, all deliverables must fall within the scope of this charter, and sufficient resources to address them need to be available within the Working Group.
The Voice Browser Working Group is expected to advance the following specifications along the W3C Recommendation track. The milestones in Section 7 show the estimates for progressing the high priority items. Low priority items may be dropped if the resources for working on them aren't realised:
High priority items:
Low priority items:
In parallel with work on VoiceXML 2.0, the Working Group is expected to start work on the next version of VoiceXML, drawing upon public comment and extensive industry experience with earlier versions. This dual track approach is essential to maintaining the flow of innovation.
This Working Group is scheduled to last for slightly more than two years, from September 25th, 2002 to December 31st, 2004.
The Working Group will have fulfilled its mission if it succeeds in developing W3C Recommendations covering the goals stated in Section 2.
By default, all documents under development by the Working Group are available to W3C Members from the Working Group's web page. Selected documents will be made publicly available via the W3C's technical reports page after approval from W3C management. The types of documents (Notes, Working Drafts etc.) are defined by the W3C Process.
Documents must have at least one editor and one or more contributors. Documents should have a date by which they will be declared stable. Any remaining issues at this date will be described in the document to avoid delaying its wider release.
This is a provisional list of milestones for the deliverables identified in section 3, and is liable to change. The Voice Browser Working Group will be tasked with maintaining publicly accessible information describing the documents under development and the schedule for their standardization. The table below uses the following abbreviations: Q for Quarter, WD for Working Draft, LCWD for Last Call Working Draft, CR for Candidate Recommendation, PR for Proposed Recommendation, and REC for Recommendation.
Date | VoiceXML 2.0 | SRGML | SSML | Semantic Interpretation | CCXML |
---|---|---|---|---|---|
2002Q3 | WD2 | ||||
2002Q4 | LCWD | PR,REC | LCWD | WD | WD3 |
2003Q1 | LCWD | LCWD | |||
2003Q2 | CR | CR | |||
2003Q3 | PR | PR | CR | CR | |
2003Q4 | REC | REC | PR | PR | |
2004Q1 | REC | REC | |||
2004Q2 | |||||
2004Q3 | |||||
2004Q4 |
A note describing the goals for future versions of dialog markup might be released in 2003 Q2 or Q3. A first Working Draft might follow in 2004 Q1 (after VoiceXML 2.0 reaches Candidate Recommendation status) with a goal of LCWD by the end of 2004. Work on N-gram and Lexicon might begin in 2003 after Grammar has reached Recommendation status. The work on Voice Browser Interoperation will be put on a slow track.
Access to email discussions and to documents developed by the Working Group will be limited to W3C Members and Invited Experts, until released for publication by the joint agreement of the Working Group and the W3C management team. Working Group members are required to honor the confidentiality of the Group's discussions and working documents, until such time as the work is publicly released. Invited experts are bound by the W3C Invited Expert and Collaborators Agreement. Participants working for W3C Member organizations are bound by their contract with W3C.
The Voice Browser Working Group will have to take into account technologies developed by other Groups within W3C, advise them about the requirements for Voice Browsers, and ask them to review specifications prepared by the Working Group, covering proposals for extensions to existing or future Web standards. At the time this charter was written, the following ongoing W3C activities were concerned (listed in alphabetical order):
The following is a list of Groups that are known or presumed to be working on, or interested in, standards relating to voice browsers, with pointers to the respective projects. The W3C Voice Browser Working Group will need to liaise with these Groups.
The archived member-only mailing list w3c-voice-wg@w3.org is the primary means of discussion within the Group.
Certain topics need coordination with external Groups. The Chair and the Working Group can agree to discuss these topics on a public mailing list. The archived mailing list www-voice@w3.org is used for public discussion of W3C proposals for Voice Browsers, and Working Group members are encouraged to subscribe. As a precaution against spam, you must be subscribed in order to send a message to the list. To subscribe, send a message with the word subscribe in the subject line to www-voice-request@w3.org.
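As an informal illustration only, the subscription request described above could be composed as in the following Python sketch. The function name and the sender address user@example.org are hypothetical, and the sketch only builds the message; actually sending it would require a mail server, which is assumed and not shown.

```python
from email.message import EmailMessage


def build_subscribe_request(sender: str) -> EmailMessage:
    """Compose (but do not send) a subscription request for www-voice.

    The recipient address and the subject word come from the charter text;
    `sender` is whatever address the subscriber wants to use.
    """
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = "www-voice-request@w3.org"
    # The charter asks for the word "subscribe" in the subject line.
    msg["Subject"] = "subscribe"
    return msg


# Hypothetical sender address, used purely for illustration.
request = build_subscribe_request("user@example.org")
```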
The Working Group meets by phone on Tuesdays and Thursdays. The exact details, dates and times are published in advance on the Working Group page. Additional phone conferences may be scheduled as necessary on specific topics.
Face to face meetings will be arranged 3 to 4 times a year. Meeting details are made available on the W3C Member Calendar and from the Working Group page. The Chair is responsible for providing publicly accessible summaries of Working Group face to face meetings, which will be announced on www-voice@w3.org.
The Voice Browser Activity will maintain public pages on the W3C website to describe the status of work and pointers to the Working Group, charter, Activity statement, and email archives.
The Group works by consensus. In the event of failure to achieve consensus, the Chair may resort to a vote as described in the Process Document. Each Member company that has at least one Group member in good standing may vote. There is one vote per W3C Member company. Votes are held by email to allow all participants a chance to vote; there is a two week voting period followed by a period of two working days for the announcement of the result. W3C staff and invited experts do not vote; however, in the event of a tie, the Chair has a casting vote. If the issue is resolved by consensus during the voting period, the vote is cancelled.
Note: the term good standing is defined in the W3C Process.
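The voting rule above can be illustrated with a short, informal Python sketch. It is not part of the W3C Process. The ballot layout, the role labels, and the choice to count only the first ballot received from each Member company are assumptions made for this example, since the charter does not say how multiple ballots from one company are reconciled.

```python
from collections import Counter


def tally(ballots, chair_choice):
    """Return the winning choice, or None if no eligible ballot was cast.

    ballots: iterable of (company, role, in_good_standing, choice) tuples.
    role is "member", "staff" or "invited_expert" (labels are assumptions).
    chair_choice: the Chair's casting vote, used only to break a tie.
    """
    company_votes = {}
    for company, role, in_good_standing, choice in ballots:
        # W3C staff and invited experts do not vote.
        if role != "member":
            continue
        # Only Group members in good standing make their company eligible.
        if not in_good_standing:
            continue
        # One vote per Member company (assumption: first ballot counts).
        company_votes.setdefault(company, choice)

    ranked = Counter(company_votes.values()).most_common()
    if not ranked:
        return None
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        # Tie between the leading choices: the Chair has a casting vote.
        return chair_choice
    return ranked[0][0]


# Hypothetical ballots: two companies split, so the Chair decides.
example = [
    ("Acme", "member", True, "yes"),
    ("Acme", "member", True, "no"),      # second ballot from Acme is ignored
    ("Beta", "member", True, "no"),
    ("W3C", "staff", True, "yes"),       # staff do not vote
    ("Gamma", "invited_expert", True, "yes"),
]
result = tally(example, chair_choice="yes")
```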
The W3C staff contact and Activity lead will be Dave Raggett (W3C Fellow on assignment from Openwave Systems). Resources of additional W3C team members will be required for some of the deliverables, should the conditions for starting these deliverables be met.
Requirements for meeting attendance and timely response are described in the Process document. Participation (meetings, reviewing, and writing drafts) is expected to consume time amounting to one day per week for the lifetime of the Group. Working Group participants are required not to disclose information obtained during participation, until that information is publicly available.
W3C Members may also offer to review one or more Working Drafts from the Group for clarity, consistency, technical merit, fitness for purpose and conformance with other W3C specifications. The only participation requirement is to provide the review comments by the agreed-to date.
As decided on a case by case basis, invited experts may attend a single meeting or a series; they may in some cases be subscribed to the Group mailing list. For the duration of their participation, invited experts are encouraged to adopt the same requirements for meeting attendance and timely response as are required of W3C Members. Invited experts are subject to the same requirement for information disclosure as is required of W3C Members.
The W3C team will be responsible for the mailing lists, public and Working Group pages, for the posting of meeting minutes, and for liaison with the W3C communications staff for the publication of Working Drafts. W3C team members are expected to adopt the same requirements for meeting attendance, timely response and information disclosure as are required of W3C Members. The W3C staff contact will be expected to devote 40% of his time to this Activity.
W3C promotes an open working environment. Whenever possible, technical decisions should be made unencumbered by intellectual property right (IPR) claims.
This is a Royalty Free Working Group, as described in W3C's Current Patent Practice; see also the Director's decision of 13th June 2002 (W3C Members only).
Working Group participants disclose patent claims by sending email to <patent-issues@w3.org>; please see Current Patent Practice for more information about disclosures.
The Director's Decision on the Voice Browser PAG Recommendation makes provision for work on RAND extensions to an RF core specification:
The Working Group will be rechartered as a royalty-free Working Group as defined by the Current Patent Practice note (CPP). The core specifications ought to enable basic interoperability for voice browser applications across the Web, but might not include certain advanced or specialized features over which participants hold patents that they will not currently make available RF as defined in the CPP.
Should any part of a specification be removed from the core version because it is not available on a royalty-free basis, and should the WG decide to continue to work on this part, a PAG should be formed that could recommend issuing the particular part as a RAND specification coming out of W3C, or another organization.