Re: [ISSUE-53] (Empty string for allowedCharacters): Empty string for allowedCharacters [MLW-LT Standard Draft]

On 9.10.2012 15:17, Yves Savourel wrote:

> During the call we mentioned: allowedCharacters="." (which is currently in the specification as an example)
> And allowedCharacters=".*"
> 
> I think both are wrong.
> 
> I don't think ".*" is a character class as defined here:
> http://www.w3.org/TR/xmlschema-2/#charcter-classes

Hi, you are right. I forgot that we are using classes only, so * is not
allowed.

> And "." is a valid pattern for a character class, but corresponds to [^\n\r] no to 'any characters'. (see http://www.w3.org/TR/xmlschema-2/#nt-WildcardEsc)
> 
> Could we then use ".[\n\r]" for "any characters?

We can't use .[\n\r] as it is not valid class. What we can do is to
redefine "." as matching any character including \n and \r. This should
not pose any harm to implementors as RE libraries usually had flag which
controls whether \n and \r are covered by . (whole string matching, and
line-by-line matching). I think this could be the easiest and cleanest
solution both from the spec and implementation PoV.

Of course I'm looking for Yves counterarguments as he is already typing
relevant Java code to prove that my thinking was wrong :-D

    Jirka

-- 
------------------------------------------------------------------
  Jirka Kosek      e-mail: jirka@kosek.cz      http://xmlguru.cz
------------------------------------------------------------------
       Professional XML consulting and training services
  DocBook customization, custom XSLT/XSL-FO document processing
------------------------------------------------------------------
 OASIS DocBook TC member, W3C Invited Expert, ISO JTC1/SC34 member
------------------------------------------------------------------

Received on Wednesday, 10 October 2012 09:14:32 UTC