Roy Ran
W3C Team contact
International standards harmonization in China
Slides available on line:
Text-to-speech pronunciation is often inaccurate and inconsistent because of technology limitations.
Accurate, consistent pronunciation of content spoken by text-to-speech (TTS) synthesis
SSML features critical for implementation:
Two front runners:
Examples on last page of Slides
Email Roy Ran: ran@w3.org
In-line SSML in an HTML fragment is shown below:
The farm was used to produce produce
The farm was used to <speak><phoneme ph="prəˈd(j)us">produce</phoneme></speak> <speak><say-as phoneme ph="ˈproʊd(j)us">produce</phoneme></speak>
Attribute based model of SSML:
Train stopped at that station: "Two to two to two two"
<span data-ssml='{"say-as" : {"interpret-as":"time: 2 minutes to 2"}}'>two to two</span> to <span data-ssml='{"say-as" : {"interpret-as":"time: 2 past 2"}}'>two two</span>.