00:45:27 timbl has joined #tagmem 01:19:55 plinss has joined #tagmem 02:35:40 timbl has joined #tagmem 03:24:28 plinss has joined #tagmem 04:23:13 timbl has joined #tagmem 13:57:29 RRSAgent has joined #tagmem 13:57:29 logging to http://www.w3.org/2011/02/09-tagmem-irc 13:58:40 scribenick: Ashok 13:58:43 DKA has joined #tagmem 13:58:55 zakim, who is here? 13:58:55 sorry, jar, I don't know what conference this is 13:58:56 On IRC I see DKA, RRSAgent, Zakim, timbl, Ashok, ht, jar, trackbot, Yves 13:59:01 zakim, this will be tag 13:59:01 "tag" matches TAG_f2f()9:00AM, and TAG_f2f()8:30AM, jar 13:59:17 scribe: Ashok 14:00:32 Ashok has joined #tagmem 14:00:47 plinss has joined #tagmem 14:01:03 scribenick: Ashok 14:03:00 johnk has joined #tagmem 14:04:33 masinter has joined #tagmem 14:05:23 noah has joined #tagmem 14:05:24 http://norman.walsh.name/2011/02/08/html-xml 14:05:45 http://www.w3.org/wiki/HTML_XML_Use_Cases 14:14:56 First draft of "product" page for privacy drafts: http://www.w3.org/2001/tag/products/PrivacyFriendlyWeb.html 14:16:04 johnk has joined #tagmem 14:16:09 Norm has joined #tagmem 14:16:28 Topic: HTML-XML-Divergence-67: HTML / XML Unification 14:18:09 http://appliedlife.blogspot.com/2009/08/markup-languages-family-tree.html - I did this when I was reading the HTML5 spec last year 14:20:18 Noah: Norm is chairing the XML/NTML unification taskforce 14:20:43 s/NTML/HTML 14:21:51 Noah: Issue-120 on HTML is on distributed extensibility 14:22:06 ... there is also an issue on RDFa prefixes 14:23:27 Norm: We consituted the taskforce with a mixture of XML and HTML folks 14:24:02 Norm's blog entry on the state of play in the HTML/XML Unification subgroup: http://norman.walsh.name/2011/02/08/html-xml 14:24:15 ... strted tofigure out what the problem was ... didn't get very far 14:24:26 ... then started on usecases 14:24:26 http://www.w3.org/wiki/HTML_XML_Use_Cases 14:24:47 Use cases wiki: http://www.w3.org/wiki/HTML_XML_Use_Cases 14:24:53 s/tofigure/to figure 14:25:21 Norm: We can discuss the usecases 14:25:41 Larry: They are usecase categories 14:26:14 ... you say XML Toolcahin but there are many flavors of Toolchains with different requirements 14:26:35 ... I don't see roundtripping 14:27:04 Norm: Roundtripping was something we talked about but did not make it as a usecase 14:28:03 Larry: Some may notn agree with usecases ... relating them to commercial use nay be very helpful 14:28:18 s/noyn/not/ 14:28:31 s/notn/not/ 14:28:51 s/may not agree with usecases/may not think usecases are important/ 14:29:27 ht: The guy from Deuthe bank talked about how XHTML had been very helpful 14:29:33 s/commercial/successful/ 14:29:44 s/nay/may/ 14:30:33 ht: discusses another commercial usecase 14:30:51 i think our feedback that going down to get more concrete examples that would increase credibility 14:30:54 ht: Such commercial usecases would be useful 14:31:16 s/Deuthe/Deutche 14:32:19 HTML is not good for data scraping.... 14:32:30 ht: Many colleagues scrape data and waste lots of time with HTML ... XHTML is much better for then 14:32:38 s/then/them/ 14:34:37 Larry: Details usecases -- analysis and extraction 14:34:44 s/Deuthe bank/Deutsche Telekom AG Kai Scheppe/ 14:35:05 s/AGKai/AG Kai/ 14:35:51 Noah: Norm, could you talk about the mindset of the group and where it is going 14:36:02 different detailed use cases have different requirements... requirements of "scraping" might be performance requirements, while those of "processing" care about fidelity 14:36:13 round-tripping has high requirement for fidelity 14:37:08 Noah: Says group members ready to leave 14:37:38 ... if we refine usecases that may convince some people to stay and work on the issue 14:37:58 Larry: We need to take requirements from real commercial users 14:38:57 Norm: Roundtripping may be a new usecase 14:40:32 Larry: usecase is starting with HTML, doing some XML processing abd then enitting HTML 14:41:47 Tim: The common DOM does not work because you don't add new TBody elements 14:42:25 q+ to wonder about scripts 14:42:38 ack next 14:42:40 timbl, you wanted to wonder about scripts 14:42:46 Larry: Using an XML Toolcahin to produce HTML -- new usecase 14:44:05 TBL: If the task force just nourishes and maintains the concept of polyglot, that would be very userful 14:45:12 Ashok has joined #tagmem 14:46:08 Norm: The hTML folks were quick o reject the Polyglot spec as too brittle ... too strict about angle brackets etc. 14:46:49 Norm: Polyglot is perceived as fragile for the same reasons as any XML, I.e. too strict about perfect syntax 14:47:08 masinter has joined #tagmem 14:47:17 "race to the bottom" 14:47:26 Noah: I don't buy that, because I think the #1 use case for polyglot is for people who are using XML tool chains or are happy to produce "perfect" syntax, but whose users require content served text/html... 14:47:52 ...so, they want a spec that tells them just what they can and can't put into that perfect syntax and have it work right when served text/html 14:47:59 q+ to talk about race to the top 14:48:23 ht: Producing polyglt is hard so once someone starts using a single language everyone goes to that -- race to the bottom 14:48:28 "I think "use XML toolchain to produce HTML" is the most common use case in the industry, and that polyglot is likely the most appropriate direction for them 14:50:36 Note to scrivbe: ht didb't assert "Producing polyglt is hard so once someone starts using a single language everyone goes to that -- race to the bottom" he was quoting it from the wiki 14:50:43 Larry: There may be changes to HTML ... the api to the DOM may have some options ... e.g. not failing in some way 14:51:12 ... some guidance about what not to use 14:51:24 ht: That's the polyglot document 14:51:59 Larry: No, it can have unbalanced brackets but does not use some features 14:52:12 Norm: I think there is a single DOM 14:52:33 New use case wiki page (very rough): http://www.w3.org/wiki/HTML_XML_Use_Case_08 14:52:35 document.write is the leading example 14:53:01 Tim: For many people the DOM is an API ... supports the same methds 14:53:25 Noah: XML and HTML processors working on the DOM 14:53:42 q? 14:53:45 ack next 14:53:47 timbl, you wanted to talk about race to the top 14:53:50 ht: There is html in conversation and the html out conversation 14:54:38 q+ to push back a bit on Tim's claim about polyglot 14:54:43 Tim: It is easy to produce polyglot documents ... avoids document.write 14:55:24 ... run it thru tidy ... if you produce polyglot you gaet 2 sets of people using it ... html folks and xml folks 14:55:41 ... so there will be a 'race to the top' 14:56:21 Noah: Sympathetic to polyglot 14:56:27 Norm has joined #tagmem 14:56:48 polyglot is useful for use cases that weren't in the set of use cases written up 14:56:49 ... useful for simple cases ... what about using external libraries, etc. 14:57:15 ... these may use document.write 14:57:39 ... so does Polyglot apply in these cases 14:57:42 "document.write" isn't the entire set of things that are "HTML specific DOM operations", but it's a good poster child for it 14:58:22 Norm: The vast majority of Web docs are string concatenetaion and they don't want to run tidy 14:58:32 s/are/are using/ 14:59:29 Larry: People may be discounting Polyglot because they are not looking at right usecases 15:00:45 Noah: Added usecase 8 15:01:45 the task force should be looking at creating a document that is acceptable to the W3C and web community... their local agreement is ok 15:01:49 Noah: Should we invest in improving the Polyglot document 15:02:16 Norm: I thought the Polyglot document went as far as it could 15:02:55 Larry: There is a large community of people with toolchain who needs to satisfied 15:03:24 Norm: The taskforce will produce a report and that will be reviewed 15:03:48 ... I was unable to persuade people to make technical changes 15:04:18 Noah: Talks about the taskforce and peoples motivations 15:04:36 A good faith participation in a task force would be to agree on a problem statement for the task force. 15:05:16 larry: What is the task? 15:05:36 Norm: It proved to be difficult to state the problem 15:05:52 ... so people moved on to usecases 15:06:40 Larry: Now that you have usecase are you going to try and define the prooblem again 15:07:39 Noah: The tone of the taskforce has been constructive 15:08:28 Larry: My experience is that when you are at loggerheads, bring in more people 15:09:03 ... bring in people who need the solution 15:09:13 DKA has joined #tagmem 15:10:03 Noah: Will the real users come to the taskforce and explain their usecases? 15:11:23 Larry: Document in the report where there is not consensus and why 15:12:50 Norm: Usecase number 4 is most bizzare 15:12:51 q+ to ask about FPML 15:13:06 the XML -> (XML/HTML polyglot ) -> XML or HTML tool chain 15:13:18 and the use case of "scraping" as a kind of consuming 15:16:35 Noah: Some folks claim no changes are needed ... HTML is the answer and XML is not helpful 15:17:11 q+ to make the XSLT-in-the-browser poiint 15:18:02 Norm: I think taskforce has gone as well as it could 15:18:30 ... no usecase has convinced the HTML folks that they need to change 15:18:51 Peter: What changes are you thinking of 15:19:20 NW: Even the script hack can be useful. 15:19:28 TBL: What's the script hack? 15:19:47 NW: