[OK?] Re: SPARQL: format based on Unicode?

On Mon, Aug 01, 2005 at 08:34:37PM +0200, Bjoern Hoehrmann wrote:
> 
> Dear RDF Data Access Working Group,
> 
>   In http://www.w3.org/mid/43254eca.231195140@smtp.bjoern.hoehrmann.de
> I noticed that http://www.w3.org/TR/2005/WD-rdf-sparql-query-20050721/
> does not seem to state that the format is based on Unicode; this makes
> character classes in the EBNF like [^#xD#xA] ambiguous. Please change
> the draft to clearly indicate that the format is based on Unicode and
> which characters expressions like [^#xD#xA] refer to.  See also:
> <http://www.w3.org/TR/charmod/#sec-RefProcModel>, specifically C070,
> C077, C079, and C078.
> 
> (Reference to Unicode has been added since, but it seems the current
> editor's draft is still unclear about whether e.g. U+0000 may appear
> in a query literally or escaped, as there are portability issues for
> some of these characters, this needs to be defined more explicitly.)

After a discussion on IRC, I have hope that the textual changes
proposed in http://www.w3.org/mid/20060126021444.GZ17752@w3.org
will address your concerns. I would like to add that I prefer
to define SPARQL characters directly in terms of Unicode rather
than in terms of XML characters, which are in turn defined in
terms of Unicode.
-- 
-eric

office: +81.466.49.1170 W3C, Keio Research Institute at SFC,
                        Shonan Fujisawa Campus, Keio University,
                        5322 Endo, Fujisawa, Kanagawa 252-8520
                        JAPAN
        +1.617.258.5741 NE43-344, MIT, Cambridge, MA 02144 USA
cell:   +81.90.6533.3882

(eric@w3.org)
Feel free to forward this message to any list for any purpose other than
email address distribution.

Received on Thursday, 26 January 2006 08:31:19 UTC