08:02:30 RRSAgent has joined #tagmem 08:02:30 logging to http://www.w3.org/2011/09/14-tagmem-irc 08:02:32 RRSAgent, make logs public 08:02:32 Zakim has joined #tagmem 08:02:34 Zakim, this will be TAG 08:02:34 I do not see a conference matching that name scheduled within the next hour, trackbot 08:02:35 Meeting: Technical Architecture Group Teleconference 08:02:35 Date: 14 September 2011 08:02:46 masinter has joined #tagmem 08:02:50 zakim, this is tag or something 08:02:50 sorry, DKA, I do not see a conference named 'tag or something' in progress or scheduled at this time 08:03:05 Scribe: Dan 08:03:11 ScribeNick: DKA 08:03:21 zakim, what is an easter egg? 08:03:21 I don't understand your question, jar. 08:03:21 Chair: Noah 08:07:50 Topic: URI Definition Discover; Metadata Architecture 08:08:09 http://www.w3.org/2001/tag/2011/09/13-agenda#metadata 08:09:02 Noah: aiming for a major piece of work on this in July time-frame. 08:11:34 JAR: 3 documents - important one is the http://www.w3.org/2001/tag/2011/09/referential-use.html - meant as an introduction to the other two. 08:14:46 JAR: I've come at this HTTP-RANGE14 issue from 2 directions. First - can I use a URI for a reference to a journal article. Is that allowed by the range14 rule? 08:16:21 JAR: Second way - writing statements about licenses. Is there a standard way to refer to the work that's licensed in a way that people will understand? Answer is no, but http-range14 is close... 08:17:38 … also, large numbers of people in linked data / SW community are discussing... 08:17:43 … for a long time ... 08:18:03 … so HTTP-Range14 needs to be fixed. 08:18:08 parts of this were issues in the early '90s discussions of URL standards in the URI working group in IETF 08:19:12 Ashok has joined #tagmem 08:19:41 JAR: so one option is that [CC] and others could do this privately. But there is enough interest that we should have a shared solution. 08:20:53 JAR: The question is: what is the relationship between what the URI refers to and the representations retrieved by the URI? 08:22:35 JAR: My definition of metadata is that it is information about information, not just information about anything. 08:22:48 JAR: [goes to the whiteboard] 08:23:35 timbl has joined #tagmem 08:25:09 [unminutable discussion on con-neg] 08:26:52 JAR: idea that URIs are used in particular contexts but used in practice referentially. Maybe that's only used in RDF. Premise is that URIs are being used referentially. 08:27:03 Tim: RDF should not be inconsistent with Web Architecture. 08:27:05 JeniT has joined #tagmem 08:27:09 JAR: Other people want it to be inconsistent. 08:28:05 … notion that a URI refers to something. In the case of a license statement, a URI would refer to the work, a URI that refers to the license terms and a URI that refers to the relationship of "being licensed." 08:29:00 … from an engineering tool that could be used in a remix tool. You do a copy-paste and the tool could just check the license. 08:30:31 Larry: in the context of RDF there is some ambiguity in whether you're referring to the document retrieved or the referent, and resolving this ambiguity [depends on the context.] 08:30:54 … the license relationship could be more explicit. 08:31:08 JAR: That's not the way RDF work - RDF has referential transparency. 08:31:46 Tim: you could have one property that says "I like this page" and then "ogp:like" - these say things about the topic of the page. 08:33:55 [discussion on sockets and plugs] 08:35:57 JAR: …being precise about what the subject of the license is… is the question. What is the relationship by convention - the agreement - between the way people are using URI referentially and the way people are using them for retrieval? Is there anything we can say or agree on ahead of time about what that relationship is? 08:36:41 JAR: I'm making a statement and I want the URI to refer to [e.g.] a rabbit. 08:36:56 Noah: And the retrievable is meant to be data about the rabbit? 08:37:05 JAR: That's what the [discussion] is over. 08:37:25 … even if it's zero, there's no reason a-priori to assume any relationship between the two. 08:37:54 HT: Is it or is it not yet legitimate for me to understand retrievable as "200 retrievable"? 08:38:14 HT: I want to make sure that e.g. 303 are not covered... 08:38:28 JAR: Right. 08:39:00 JAR: I'm talking about RFC-3986. That talks about retrieval. 08:39:26 Noah: 206 is being debated. 08:39:49 JAR: I'm using it in the sense of - would it be correct for a http server to deliver a 200 response? 08:40:26 … RDF makes no connection between the thing the URI is referring to and what gets retrieved. It leaves it up to the context. 08:40:55 Ashok: Earlier you and I reviewed the link header draft - that presents a solution to this, doesn't it? 08:41:29 JAR: There's a description which could be bound to the URI through a variety of methods - a link header, another header, a SPARQL query, etc... 08:41:49 Ashok: the link header would typically... 08:41:52 JAR: Give you a URI. 08:42:00 Ashok: What else do you require? 08:42:57 JAR: What is required is a way to go from a URI to this description that can be done on a hosted platform where they cannot change http headers or exotic response codes - also one round trip instead of two. Unless we relax these two things then semantic web will be out-competed by other standards. 08:43:44 +1s JAR's last statement 08:43:52 Tim: There's things you can do - change the response code, add a header, add a fragment, a fragment id syntax... 08:45:00 JAR: [clarifying] some people require … I can't argue with them. 08:47:52 "if you get a 2XX response when you request a URI, that URI refers to a document" 08:50:39 Tim: [on OGP] people looked at OGP and said "if Facebook made this mistake then others will make this mistake and so we should make the mistake legal." 08:50:40 ht has joined #tagmem 08:51:16 *lots* of people make the mistake 08:51:19 JAR: There are other places where range14a has not been observed. FlickR is one of them. 08:52:25 JAR: There is no enforcement point for httprange-14. So people are not going to be aware of it. 08:56:13 JAR: The statement that such a page is licensed with a license is false if the URI refers to the landing page. 08:58:11 they'll say no, because they all require server configuration 08:59:09 JAR: you could sit down with everyone who is doing it in this way you could try to convince people to use one of the approved range-14 solutions. [but that's not scalable] 08:59:49 Tim: you could make a validator. People will want to know what they said in RDFa. There will be problems e.g. licensing the wrong thing by accident. 09:00:31 JAR: We could got back to FlickR and ask them to change but I don't want to do that if haven't resolved this issue here. 09:02:11 [discussion on how flickr is using CC licenses] 09:04:37 Noah: let's say the UI says "by posting a photograph here you grant a license…" Now the UI could come back and inform the user allowing them to choose which meaning… Let's say I added comments on the landing page. Did I mean to license the comments as well as the photo? 09:05:56 HT: A bunch of different proposals exist for how to get from a URI to a description about the URI... 09:06:34 Ashok: if you use the link header you can get multiple descriptions... 09:06:41 s/URI to a description about the URI/URI for a thing to a URI for a description about the thing/ 09:07:14 JAR: We can amend range-14a. 09:07:38 … the whole point of the ISSUE-57 document is to sweeten the proposition of allowing people to live with range-14a. 09:07:59 … the outcome could be that people say "yes [for example] hash URIs are OK". 09:08:14 … another outcome could be that people are not satisfied by any of the solutions. 09:08:25 … we could ignore them and push something through... 09:08:59 HT: Do you have any interest in all at exploring the opposite outcome? That we should say "yes, you're right, you should go ahead and use 200"? 09:09:02 JAR: Yes. 09:09:09 HT: That would amount to a retraction of 14a. 09:10:04 JAR: I've come to a better appreciation for the alternatives. One alternative is to withdraw 14a and let the community decide. In this case, [e.g.] Creative Commons would develop our own approach. 09:10:39 I made some proposals which got some good feedback at http://www.jenitennison.com/blog/node/159 09:10:51 JAR: You'd have to write in RDF something to disambiguate. 09:12:29 HT: example: Pat is the owner of a domain. He puts up a page about himself. Then he says "use this URI to refer to me" - and now we can't make assertions about the document. 09:13:05 Tim: When my RDF tool takes one graph and merges it with another graph that make different assumptions then we break RDF. 09:15:13 q+ to ask JeniT about "The problem some people (including me) have with this is that hash URIs are primarily used to indicate portions of a web page, and using them for things that aren’t page fragments overloads them. It’s also an inflexible method, because the server isn’t told what the fragment identifier is, and therefore it can’t be used as the basis for a redirection, for example." 09:15:59 JeniT: talks about different ways of disambiguating statements. The analogy that I draw in that article is between how we think about persistence of URIs and how we deal with the fact that they don't persist over time. We should be aiming for people to use different URIs for people and documents but we need to deal with the fact that they don't all the time. 09:16:46 Larry: you've left out link relations. 09:17:38 Larry: you could have two license relationships. 09:18:25 JAR: yes - that's the Facebook solution... 09:18:46 [discussion on the meaning of meaning] 09:20:22 HT: "larry has five letters" vs. "larry has five children" - you have no issue understanding that but RDF does. 09:21:15 Tim: in the databases out there on the net right now there is no ambiguity - the semantics are well defined. 09:22:54 s/five children/three children/ 09:23:00 Tim: [some] say they must use another solution because the solutions [given to them by] web architecture are not acceptable. 09:26:17 There is masses of data out there where the people running the system don't have any problem when a table name happens to be the same as a value in a cell, don't have a problem when the same string happens to be used as an ID in one column and as a value in a different column. This data is all waiting t be put ion the web. It it clean and unabiguous and to suggest that when it goes onto the web we should necessarily introduce ambiguities becuase people always 09:26:17 to make a major step backwards. 09:26:49 so if databases don't have a problem, then the problem is with RDF, no? So fix RDF 09:29:00 JAR: what I want to know - how can I set expectations as I go into conversations with the community about this? 09:29:40 … I would like to say "I want to work through this tree with you" if I get a yes, then great but if I get a no then what expect ion can I set with people about this outcome? 09:30:28 iand's said why he won't use hashes http://lists.w3.org/Archives/Public/www-tag/2011Aug/0127.html 09:31:33 I have responded to all those points 09:32:27 HT: Should we review and prirotize the candidate amendments? Should we have some preferences in the TAG? 09:32:27 timbl, so your argument is that we go back and say to all the people who have used non-hash URIs for the last 5+ years and say that they were wrong? 09:32:55 timbl, that the httpRange-14 decision should never have encouraged people to do that 09:34:16 303 works and is fine -- it is inefficient 09:35:34 Tim: If there are architectural issues with the current solutions then we should design architecture to address those issues… 09:35:35 personally, i really dislike 303 09:37:13 Tim: When I look at Ian's arguments they don't look sound to me. It may be that's he's got one. 09:37:14 http://lists.w3.org/Archives/Public/www-tag/2011Sep/0004.html 09:37:23 JAR: I think they're as sound as anything else. 09:37:28 http://lists.w3.org/Archives/Public/www-tag/2011Sep/0006.html 09:37:45 HT: It's fair to say that the way in which the hash solution works is not consistent with the RFCs. 09:38:57 Tim: I'm really fond of the hash as a piece of punctuation between a global and local identifier. I want to use this in many contexts. 09:39:09 personally, I really dislike using # for this disambiguation too 09:40:10 The phrase "fragment identifier" is a historical and unfortunate. "Local Idenifier" is much better 09:42:28 JAR: if I have a flickr page with a photo on it, is the photo part of the page in the case where the license refers to the page? Some might say no but others might say yes. 09:44:07 JAR: we have an evolution where these things are turning into applications… if people are talking about what they see when the page is rendered, that might have very little relation to what is delivered in the 200 response. There's enough vagueness about what the document is that we're going to continue to have ambiguity even if we do get agreement on range-14. 09:44:35 Larry: so you need it to be precisely referring to "what you see when you get all the data and render the page." 09:44:40 JAR: That's one approach. 09:45:38 JAR: [why a:b solves the flickr problem] 09:46:44 Tim: FlickR could put some RDFa in there which says "landing page#photo" has CC license *whatever* and etc... 09:46:45 HT: So a:b is _necessary_ for a solution to the flickr problem, but not sufficient -- WebArch today doesn't even get us that far 10:04:35 Scribe: Ashok 10:04:42 ScribeNick: Ashok 10:05:20 Topic: Can publication of hyperlinks constitute copyright infringment? 10:05:56 Noah: Some writing was done. Next step we decided was to get some legal advice. 10:06:15 ... perhaps Thinh Nguyen may help 10:06:28 ... but no legal advice so far 10:07:31 masinter has joined #tagmem 10:07:41 Noah: quotes from product page: http://www.w3.org/2001/tag/products/PublishingLinking.html 10:08:31 Discusses success criteria 10:08:32 ... document should have impact 10:09:06 HT: I want to write a popular press version 10:09:33 Noah: Goal is PR in June. 10:09:54 ... FPWD in October. Is that realistic? 10:10:40 DKA: I sent out 3 request to people for legal review. Not heard back. 10:10:48 noah has joined #tagmem 10:10:51 s/request/requests/ 10:10:52 ACTION-541? 10:10:52 ACTION-541 -- Jeni Tennison to helped by DKA to produce a first draft of terminology about (deep-)linking etc. -- due 2011-07-26 -- OPEN 10:10:52 http://www.w3.org/2001/tag/group/track/actions/541 10:11:41 Jonathan: we need to bump the dates on ACTION-201 and ACTION-282, I think. Right? 10:11:45 DKA: case in UK touched on this ... remains a current topic 10:12:00 s/case/Case/ 10:12:29 Jeni: We should continue working on the document. 10:14:16 jar: I followed up with my legal contact. He referred me to a Professor of Law. 10:15:21 ... In conversation with him 10:18:33 q+ to say I'm confused about dealing with Ashok's question now 10:18:55 Ashok: If we go the direction of a popular press article what would our messages be? 10:19:28 ack next 10:19:29 timbl, you wanted to ask JeniT about "The problem some people (including me) have with this is that hash URIs are primarily used to indicate portions of a web page, and using them 10:19:34 ... for things that aren’t page fragments overloads them. It’s also an inflexible method, because the server isn’t told what the fragment identifier is, and therefore it 10:19:36 ... can’t be used as the basis for a redirection, for example." 10:19:37 ack next 10:19:38 noah, you wanted to say I'm confused about dealing with Ashok's question now 10:19:45 DKA: Users should have a right to link ... parallel to freedom of speech 10:19:48 q+ to cavill at the use of 'link' in our headline 10:20:49 Noah: Let's wait until we crisp up the finding before we answer that question 10:20:52 q? 10:20:54 ack next 10:20:55 ht, you wanted to cavill at the use of 'link' in our headline 10:21:26 HT: at the popular level, linking is a confusing concept 10:21:38 http://jeffersonsmoose.org/?p=90 10:21:39 HT: Too high-level to be implicated by anything a lawyer says. 10:22:11 ... the word "link" is in itself confusing 10:22:21 q? 10:22:32 DKA: Linking vs. Transclusion 10:23:20 HT: Depends on whether you use "image" or "object". 10:23:51 Noah: The document says the terminology is subtle even for experts 10:24:12 ... so we need to explain 10:24:24 ... we ask legal community what would help them 10:25:26 Discussion of "fair use" 10:26:45 Music students must buy the music. For plays they must rent a copy of the play 10:27:04 http://en.wikipedia.org/wiki/Fair_use 10:27:33 Yves: Understanding of free speech is different in different countries 10:28:03 Noah: Discusses material in the cache 10:29:25 Tim: A link can be seen as aiding and abetting 10:29:46 ... here is where you can find all your favorite TV series 10:29:54 I thought we had gone down the road that the TAG would document common practice and terminology, rather than documenting what should or shouldn't be legal or even legal practice 10:30:01 q+ 10:30:17 q? 10:30:32 q+ jar to expectations 10:31:23 Noah: Discusses nature of links ... 10:31:39 I thought we were going to look at "expert witness" contributions to copyright cases in order to summarize those as community consensus about terminology and concepts 10:31:52 ... web depends on linking and network effects 10:32:30 i don't want to make recommendations about what kinds of laws should be passed 10:32:46 Noah: Restricting linking makes the Web a less useful place 10:33:43 Larry: We should restrict ourselves to factual technical discussion 10:34:24 ... need expert testimony on copyright cases that is representative of community consensus 10:35:28 Noah: Value of web comes from kaing information resources available to others 10:36:01 s/kaing/making/ 10:36:11 q? 10:36:18 ack next 10:36:19 Larry: We should not try and assess value 10:36:20 ack next 10:36:21 jar, you wanted to expectations 10:38:13 or at least separate this into two documents: (a) technical terminology and use cases, for community consensus and Recommendation status (b) a policy document which represents what the TAG would like the W3C to advocate, as a TAG finding 10:38:15 jar: We are talking about technical aspects of web ... 10:38:23 q+ to advocate two documents 10:38:32 ... what are the expectations? How will people use these links 10:39:00 ... if you see a link you should be able to follow it 10:39:17 ... do you have to read and understand the surrounding text? 10:39:49 ack next 10:39:51 masinter, you wanted to advocate two documents 10:39:55 DKA: Could you send the editors some guidance 10:40:20 Larry: Separate policy and technology ... two documents 10:40:57 ... Get community consensus on the technical document 10:41:17 DKA: Not sure we need the policy document 10:41:33 encourage you to keep policy advocacy section separate 10:42:57 Noah: We should not say much about policy ... stress the archhitectural/technical aspects 10:44:36 s/archh/arch/ 10:44:58 Noah: what I actually said was that there's a middle ground between pure technology and policy, and that's to explain a bit about how the Web is used, and where they get value from it. That informs people who set policy, so they have the opportunity to support such uses, and to avoid inadvertently breaking things people value. 10:45:12 AM: I'd go further: I think we need to advocate policy. 10:45:25 Tim: Resaonable to point out the importance of things. Not waht laws should be written. 10:45:29 s/they get/people get 10:45:31 s/they get/people get/ 10:45:48 ... We can point out the value of things 10:47:14 http://www.w3.org/2001/tag/doc/deeplinking-20030911 10:47:16 Tim: I can see contries making it illegal to supply links to bomb-making material, etc. 10:47:40 ... what got us going was websites that say "do not link to this?" 10:48:18 DKA: Should not have legal standing to say "You cannot link to this website." 10:48:50 From deep linking finding: 10:48:53 "The Web is at the risk of damage. The hypertext architecture of the Web has brought substantial benefits to the world at large. The onset of legislation and litigation based on confusion between identification and access has the potential to impair the future development of the Web." 10:49:21 jar: Depends on whether a contract has been made and whether it can be enforced. 10:49:41 HT: Linking is like printing in a paper 10:50:15 Noah: Link is a capability for retrieving the material 10:50:42 Noah: We have published a Deep Linking finding 10:51:10 ... we have already made a policy statement 10:52:03 Noah: Don't prohibit linking, put access controls if you want to restrict linking. We say that in the finding. 10:53:16 jar: Terms of use should not be interpreted as entering into a contract 10:54:22 Noah: I wonder if there is a point to be made about fragment identifiers ... can make you miss the terms of use 10:55:02 ... if you link, you could like to fragments of a page 10:55:40 NM: Right, we should point out that fragment identifiers, for good reasons, can cause a user following a link to wind up in the middle of a page or work, which means material like terms of use at the top or bottom might not be seen. 10:55:48 DKA: Is there legal precedent? 10:55:58 NM: We should show use cases of where fragment references are valuable. 10:56:18 Tim: When I sent messages to the bank I got terms of use at the bottom of the page. 10:56:46 What will be helpful to the community? How would a TAG policy statement have effect? Who would refer to it? 10:57:06 please be careful about 'prevention' vs 'establish consequences' 10:58:59 DKA: Jonathan can you provide text that further elaborates the terms of use situation. 10:59:09 yes 10:59:35 Tim: Their messages sent "by messaging with the sender you accept the sender's terms which are on the web here and may change at any time". I added a similar disclaimer to my own messages to them and so by that measure they would be bound my terms which are on the web and may change at any time. 10:59:46 DKA: Jeni and I will meet first week of October to collaborate 10:59:57 on the same subject as Tim: having an HTTP header on get DoNotLog: yes, and a link to term of services "by responding to this http request, you agree not to log this interaction" 11:00:29 DKA: Jeni and I will meet first week of October to discuss further work on the document. We can discuss at TPAC. 11:01:55 DKA: We could have a BOF at TPAC to discuss 11:02:05 which should as difficult to defend as "do not link" ToS 11:02:49 Larry: We could use this as a away to bring in more of the community 11:03:14 ... W3C management could use this to attract community interest 11:04:15 Larry: Community group could be created around this topic 11:05:06 http://people.ischool.berkeley.edu/~pam/ 11:05:15 Noah: How about inviting Pam Samuelson to this session? 11:06:25 HT: Her article from 10 years ago is what I recommend people read to understand Copyright on the Web 11:06:39 ACTION: Appelquist to propose TPAC breakout on copyright and linking Due: 2011-09-27 11:06:40 Created ACTION-604 - Propose TPAC breakout on copyright and linking Due: 2011-09-27 [on Daniel Appelquist - due 2011-09-21]. 11:06:51 ACTION-541? 11:06:51 ACTION-541 -- Jeni Tennison to helped by DKA to produce a first draft of terminology about (deep-)linking etc. -- due 2011-07-26 -- OPEN 11:06:51 http://www.w3.org/2001/tag/group/track/actions/541 11:07:34 ACTION-541 Due 2011-10-11 11:07:34 ACTION-541 Helped by DKA to produce a first draft of terminology about (deep-)linking etc. due date now 2011-10-11 11:08:17 http://www.w3.org/2001/tag/products/PublishingLinking.html 11:12:53 rrsagent, pointer 11:12:53 See http://www.w3.org/2011/09/14-tagmem-irc#T11-12-53 12:14:22 plinss_ has joined #tagmem 12:20:29 jar has joined #tagmem 12:24:40 Ashok has joined #tagmem 12:33:13 very cool 12:33:56 masinter has joined #tagmem 12:36:46 action-478? 12:36:46 ACTION-478 -- Jonathan Rees to prepare a second draft of a finding on persistence of references, to be based on decision tree from Oct. 2010 F2F -- due 2011-12-06 -- OPEN 12:36:46 http://www.w3.org/2001/tag/group/track/actions/478 12:38:16 scribenick: timbl 12:38:21 noah has joined #tagmem 12:38:25 topic: Persistence 12:39:27 scribenick: timbl 12:39:34 my thought lately has been that it's really important to be more precise about what you want to be 'persistent' 12:40:19 jar: We know persistence is primarily a social question, not really a techniucal one. To get it you need lots of copies or 12:40:32 ... else a trustworthy institution. 12:41:18 ... there is no way to test for it: it is a lot about whether someone else believes that the a document is persistent enough for their purposes. 12:42:21 ... Who do you know, who do you trust? I have seen many policy sttaments by jounals saying they do not accept HTTP URIs as references. Some accept them only from webcitation.org for example. 12:42:27 a "confidence" game in both sense of the word 12:42:53 DKA has joined #tagmem 12:43:22 ... the right document about persirtent URIs has ot realy been written yet. 12:44:03 ... A lot of people have been puzzled by this, and there have been a lot of analyses. People grok the social nature of the issue. 12:44:46 ... Give that we are the TAG, and that in fact the perstence of a dereference is Somebody Else's Problem, should we have anything to say about it? 12:45:02 Larry: I think Jonathan's mention of "persistent actionability of a reference" is pretty close to the answer to your challenge about what is persistent 12:45:15 ...as far as it goes 12:45:50 noah: i think it's leaving out the endpoint 12:46:01 HT: There was a crucial point when the DOI people realized that "actionable" URIs (ie dereferencable, i.e. HTTP) were a good idea. 12:46:14 actionable URI as a reference to _what_ ? 12:46:14 You mean 'endpoint' as in endpoint of a connection, or something more like end state in some sense. 12:46:16 jar: There is not a big audience for this document. 12:46:29 I infer, e.g., reference to some published work. 12:46:51 is the work allowed to change, or do you mean the exact representation? 12:47:22 ... Some people just use DOIs. Some apply to be DOI registrars. 12:47:55 I assume that whoever establishes the reference string answers that question, but presumably making a reference to an "exact representation" needs to be an option when that's what you intend. 12:48:30 Zakim has left #tagmem 12:48:49 Zakim has joined #tagmem 12:48:50 ... Crossref has developed with time. The social contract with new members of crossref are now designed to support persistece. If you become a member of crossref, that means crossref has the right to keep th e metadata. They are backing up out-of-organization. (They are a non-profit). 12:49:58 ... Orchid and Datasite were not forced to use DOIs, they are using them because the social contract works for them, and becoming registrars. 12:50:16 orcid 12:50:42 masinter: Does the persistence of the DOI depend on these orgs -- or is the DIO in the ocumnet itself? 12:50:57 jar: Normally it is in the document, ut that isnot a requirement. 12:51:25 s/DIO/DOI/ 12:51:41 s/the ocumnet/the document/ 12:54:05 q+ to remind about (deployed) xmp.did / xmp.iid URIs 12:54:13 jar: The common practice is to publish in the web the DOI hyperlinked to the HTTP URI. 12:54:29 tim: Nice compromise. 12:55:25 jar : the funny thing is that they are not worried about the dor.org domain name dying - they have this redundant form n the DOI itself. 12:55:40 NM: I still find http://...some link to what you meant seems troublesome. Maybe because analgous techniques are used maliciously in phishing attacks 12:56:22 NM: I think users mostly want to trust that when the link text appears to be an absolute URI, then the link should be to the same URI. Not a disaster, but somewhat troubling. 12:56:32 jar: Then w3.org and things liek it are another story. They could make effort tohang onto the domain name. There are persistent identifiers for for example dated tech reports. 12:56:59 s/liek/like/ 12:57:54 Tim: What about new top-level domains with differet properties, where you can buy forever a domain? 12:58:59 jar: There is an important function web(r,u) as to whether r is a valid representation for u. If a proxy is in the way, ow do I know this functions till works? well, HTTP-bis has a pretty good story. 13:00:40 jar: Now URN specs ofetn discuss how you can deref them. 13:00:48 ... They are often wrong in my view. 13:02:31 ... What you want to say is that the only way to determine which , say IETF RFC, is actualy a valid one., by going to those in charge who have the definitive say. 13:03:31 jar: As to the TLD idea, I'm npt sure there is a demand for it. 13:04:58 ... note that w3.org's persistent policy is still in draft form. 13:08:08 larry: For persistent refefences, one way is to embed GUIDs withing hte object itself. 13:08:50 ... Most PDFs ahve two, a document id and an instance ID (whcih changes any tie the doc is edited). 13:09:33 ... The resoltion service is you searc on Google, and it works. It reuires Google instead of a custom service. 13:10:11 Noah: There are wrap date issues with GUIDs in many implementations/ 13:10:52 jar: There is a urn:guid: scheme being discussed. 13:11:01 Tim: What about uuid: scheme? 13:11:48 xmp.iid and xmp.did differ from uuid: in that there's a specific semantics 13:12:58 jar: The curation community seems to be going in that direction, and there is wih Tom Baker for example there is work with persistence for ontologies. 13:14:09 ... The typically take copies of ontologies they use in RDF. 13:15:40 jar: I have also been tinking of PIR a lot, the non-profit who run .org 13:15:47 ... They could be involved. 13:16:06 ... That woudl reduce the number of vulnerabilities by one. 13:16:27 ht: Or we could go to ICANN with PIR's support. 13:19:25 ht: The workshop dcc.ac.uk 13:22:14 ht: There is no acrhive anywhere which has an historical record of domain name ownership -- docmian registras are not required to keep archives. 13:24:39 s/docmian/domain/ 13:24:55 s/registras/registrars/ 13:25:48 s/acrhive/archive/ 13:26:01 s/woudl/would/ 13:26:09 s/tinking/thinking/ 13:26:48 ht: Maybe in order o get things to happen we need to be alarmist. 13:27:35 s/ o / to / 13:28:07 ... Actually if w3.org goes offline, a lot of people get W3C stuff from the waybackmachine. 13:29:31 ht: In fact what we do is all a question of how we estimate the risk. 13:29:57 ... We could get people to talk about that at the workhop. I am happy to write up a proposal at hte workshop. 13:30:25 FWIW, Wayback Machine copyright policy http://www.archive.org/about/faqs.php#20 13:37:21 ht; Business Coninuity Management -- what happes if a crisis hits -- 13:39:18 Tim: Interested to develop system which allows HTTP to failover to p2p, when there is a fail to get B linke dfrom A, ask A to boostratop move to p2p. A bit of protocol is necessary t boostraop this. 13:39:37 peter: I am interested in this, failover fro mhttp to p2p 13:42:08 s/linke dfrom/linked from/ 13:42:17 s/boostratop/bootstrap/ 13:42:27 s/t boostraop/to bootstrap/ 13:43:12 Tim: Importnat to look separately to short term and long term threats. Short term may be web server breakage, net breakage, crisis damage, or attack by government eg Egypt. 13:43:54 ht: Log term, orgs amy be gone. Short term, they are around but can't do their jobs. 13:44:04 s/Importnat/Important/ 13:44:19 s/amy/may/ 13:44:22 s/Log/Long/ 13:52:42 ht has joined #tagmem 13:53:00 [Discussion of workshop] 13:53:21 [Discussion of product page] 13:55:29 i think this is a lot less important for the TAG to work on than MIME and the web 13:55:49 .RESOLVED: The TAG agrees to endorse a workshop proposal on domain persistence for IDCC11 on 4 or 8 December. This probably means no more than that the workshop publicity would include some form of attribution to the TAG 13:56:43 I don't endorse a workshop proposal from the TAG because it doesn't raise above my threshold for TAG priorities 13:58:04 and i see no evidence of a community that wants to come to W3C 13:58:42 . RESOLVED: The TAG agrees to endorse a workshop proposal on domain persistence for IDCC11 on 4 or 8 December. This probably means no more than that the workshop publicity would include some form of attribution to the TAG. This is contingent on suitable approval from and coordination with W3C staff. 14:00:11 i'm not sure what 'endorse' means if it has no commitment at all for any follow-on work 14:01:44 i wouldn't mind a proposal for a breakout session at TPAC, on the other hand 14:01:58 I don't mind a proposal for a breakout session at TPAC, on the other hand 14:02:11 I don't mind a breakout session at TPAC, on the other hand 14:02:23 +1 on noah's proposed resolution. 14:05:21 Votes: 7 yes, 1 no, 1 abstain. 14:05:26 Chair rules no consensus. 14:05:34 Larry says OK to do it anyway. 14:05:44 Chiar rules no consensus, but Larry allows going forward. 14:08:29 ACTION: To talk to Ian about whether a 15 min plenary presentation on TAG status would be appropriate at TPAC. 14:08:29 Sorry, couldn't find user - To 14:08:39 ACTION: Noah to talk to Ian about whether a 15 min plenary presentation on TAG status would be appropriate at TPAC. 14:08:39 Created ACTION-605 - Talk to Ian about whether a 15 min plenary presentation on TAG status would be appropriate at TPAC. [on Noah Mendelsohn - due 2011-09-21]. 14:30:29 DKA has joined #tagmem 14:33:04 Topic: Unicode Normalization 14:33:17 http://lists.w3.org/Archives/Public/www-tag/2011Jun/0188.html 14:33:17 Noag: This started with an emil from Addison Philiips (sp?) 14:33:53 Noah: I missed this, then Peter asked about it in July.. We had a discusssion on Sept 1. 14:34:11 We assigned a couple of actions, one for me to follow up with Addison. 14:34:25 ACTION-592? 14:34:25 ACTION-592 -- Peter Linss to draft possible TAG position statement on Unicode, and alert Addison Phillips of our intention to attempt to get agreement starting in October after the F2F -- due 2011-09-08 -- OPEN 14:34:25 http://www.w3.org/2001/tag/group/track/actions/592 14:34:27 See also ACTION-592 of Peter 14:34:48 Peter: Started, not ready. 14:34:57 ACTION-590? 14:34:57 ACTION-590 -- Noah Mendelsohn to follow up with Addison Phillips on Unicode normalization http://lists.w3.org/Archives/Public/www-tag/2011Jun/0188.html -- due 2011-08-30 -- PENDINGREVIEW 14:34:57 http://www.w3.org/2001/tag/group/track/actions/590 14:35:07 close ACTION-590 14:35:07 ACTION-590 Follow up with Addison Phillips on Unicode normalization http://lists.w3.org/Archives/Public/www-tag/2011Jun/0188.html closed 14:35:50 Ashok: Could you pleas recap the problem? 14:36:29 Peter: Unicode has many ways of representing the same thing. These look identical on the screen. Eg accented character vs accent and charcaeter. 14:36:56 The result is users are confused, as the visible glyphs are identical. 14:37:21 This i snot a problem if your Unicode is normalized. There are normalization forms. 14:38:21 There are though some ways things can hav multiple encodeings (eg vietnames emultiple accents( which are not tackled by thhose normalizarions algorithms. 14:38:40 Then you have JS APIs for these things, accessing using say class ames etc 14:39:24 Surprisinglly this has nto been a proble yet. Maybe because people just give up using non-ascii. 14:39:37 Is it the case that Vietnamese doesn't normalize? I don't think that's what Peter said 14:39:48 The I18n group have been pushing formore people to pay more attention oth thi sproblem. 14:40:10 Within CSS group, much pushback e.g. from implementers, as therer are perfoamce costs. 14:40:30 jar: Why do peope thinks it ok to not do his? 14:40:52 peter: Because they haven't seen the problem ye. Bu absence of evidence is not evidnce of absence. 14:41:21 Noah: do we have a community of mostly people working in ASCII -- like the TAG? 14:41:53 Maybe we need non-engligh speaking input. 14:42:01 jar; This can't just be a web issue. 14:42:59 Larry: I worked on a system which had this problem. You can use non-ascii names without normalization. 14:43:56 Noah: Eg someone edits a style sheet in an editor. 14:45:19 masinter: Do any apps normalize 14:45:21 ? 14:46:07 masinter: The "visual cognates" problem is a lot more tha this -- also o and 0, l and 1 for example. 14:46:45 plinss: Typos are beyond our control 14:47:04 plinss: Meta-question - should tag be involved at all? 14:48:15 "Programs should always compare canonical-equivalent Unicode strings as equal" http://unicode.org/faq/normalization.html 14:49:17 Ashok: Call Martin Davis, of Unicode fame. 14:49:56 s/Martin/Mark/ 14:50:32 masinter: The 118n group have been askign people to tke this up with no effect, and have asked the TAG to push. 14:52:43 peter: Questions abound as to whether to check just at input time, or what. There are illegal strings, what happens if you chop a string between a character and its accent then recombine it? 14:53:02 http://www.w3.org/TR/2004/CR-charmod-resid-20041122/ is Candidate rec from 2004, hasn't progressed? 14:53:26 http://www.w3.org/TR/2005/WD-charmod-norm-20051027/ is Working Draft, from 2005 14:53:41 Yves: The language all have unicode normalization in their runtime. 14:54:26 ht: The XML spec I just checked does not mention normaization at all, except in a XML name section, not normative, it says use normal form 3. 14:55:48 The XML spec has the following non-normative guidance on XML names: "Characters in names should be expressed using Normalization Form C as defined in [UnicodeNormal]." 14:56:30 ht: In the charcter model spec, there was no one asking for it, so no one paid attention to it. 14:57:02 masinter: ther is a WD dates 2004, and normalization WD dated 2005 14:57:31 http://www.w3.org/International/wiki/CharmodNormSummary 14:57:37 http://www.w3.org/TR/charmod-norm/#sec-NormalizationApplication 14:58:39 ht: [reads] ... characters with muliple possible representatioins are compared code point by code pont. 14:59:05 http://www.w3.org/TR/charmod-norm/#sec-IdentityMatching 15:00:22 I think we should find that the I18N group should bring charmod-norm to rec, and address this problem by removing theory that doesn't correspond to practice 15:00:53 XML requires that processors _not_ normalize when comparing e.g. start and end tags (http://www.w3.org/TR/REC-xml/#dt-match): "Two strings or names being compared [*match* if they] are identical. Characters with multiple possible representations in ISO/IEC 10646 (e.g. characters with both precomposed and base+diacritic forms) match only if they have the same representation in both strings." 15:01:29 TBL: The TAG is at it's best when helping multiple groups. This is a slightly unusual case in which the i18n group themselves is in a way TAG-like in their role. I think we can support >them< in resolving this, but not clear we should be getting in at the level of detailed technical analysis. 15:03:33 TBL: I think one could imagine a string-compare-xxxx, so not do it in each API. Do not ask each subsystem to it. 15:03:53 NM: What about CSS? 15:04:19 reading http://www.w3.org/TR/charmod-norm/#sec-IdentityMatching, it actually looks like this recommends exact string match. That is, why is there a problem? 15:04:49 TBL: (scribe still struggling) 15:04:56 Don't change the data models 15:05:05 Change the way equality is tested for 15:05:56 TBL: What I'm suggesting is doing the smart check at comparison time. 15:06:49 I thought we were asked to review http://www.w3.org/International/wiki/CharmodNormSummary ? 15:07:41 AM: This is a bigger problem due to the IRI situation. It behooves us to think about this. 15:07:49 Tim: Clearly if yo just introduce canonicalization into one subsystem, then things break: for example, if th CSS system does and the XML DOM doesn't then the match between CSS and XML will fail wher it used to work. Insteda, good to introduce a compare function which compares blind to differences in encoding of accents. This will helkp and will not cause the same damage -- very rare damage. 15:08:16 If I was building a system from scratch, I would probably canonicalize on input. 15:08:29 I read the above from charmod-norm as at variance with TimBL's suggestion: " In accordance with section 3 Normalization, this step [normalization] must be performed by the _producers_ of the strings to be compared." [emphasis added] 15:08:48 q+ to ask about charmod-norm draft quoted above 15:08:54 But all this hodulebr architected by the i18n group not the TAG. 15:09:16 Yves: We could ask the i18n group why there advice did not take off. 15:09:17 "we would like to request that TAG schedule time in about four weeks to review I18N WG's proposed recommendations concerning Unicode Normalization" - Addison to www-tag on 6/29 15:09:48 Peter: They have talked to CSS group. There are disagreements within the CSS group. 15:10:12 Deal with it/Don't deal with it/worry about performance 15:11:35 jar: It in our charter to get involved. 15:13:28 jar: If this is a disagreement between working groups, it is in our charter. It needs to be more clearly laid out -- thjat os it seems in progress. 15:13:36 I don't understand why they have 2004 working draft with no progress.... where are the heartbeats documents, for example? 15:13:46 Peter: I volunteer to champion this within the TAG. 15:14:24 Jar: I'd like to ask various outside people for their opinion. 15:14:49 Peter: I'm not an expert on the unicode side. 15:16:44 their working draft disagrees with the wiki page, but without justification for why the change 15:17:27 right. 15:18:22 http://www.w3.org/TR/charmod-norm/#sec-IdentityMatching 15:20:14 If the i18n stuff were rec track, then thei CR phase would involve egtting eth CSS people etc to implement it, maybe. 15:20:25 s/If/Noah: If/ 15:21:12 masinter: It may be that the i18n group needs to think about the breadth of applicability of thsi work. 15:22:07 i18n core charter seems to be http://www.w3.org/International/core/ 15:22:26 Noah: As anindividual TAG member, I m conflicted. resonating, tim sayinthis is i18n charter, and jar syin this is in our charter, and larry sayung not clear that he i18n wg is using the w3c process nromally. 15:22:35 that is, the distinction between internationalization for presentation forms and protocol elements, and that the broad charter of I18N applies to presentation forms, and that they might want to be more modest in trying to internationalize protocol elements, against other priorities of reliability, implementation, performance 15:22:48 .. either thei rown material, and in their approcach to CSS. 15:23:59 ... We do think the TAG doesn't scale -- normally there are processes by which WGs give each other a heads up of direction, and I don't think the TAG should be used as a short cut. 15:25:20 DKA has joined #tagmem 15:25:25 Noah: We have no consensus. Propose: the TAG is not convinced or ready to commit to a deep dive. We would like to do a small 4-8 week exercise -- mayeb a telcon with i18n -- and ask the question again after that time. 15:26:14 jar: Not happy. This is no a very big thing. We can't do thinsg until they have done thei homework,. We can't help now -- they need to get sorted out. 15:28:56 not a big thing *yet*. what I mean is that there's not much for us to do until it's better prepared for us so that we can help efficiently… seems like a jumble to me. 15:30:05 ACTION-592? 15:30:05 ACTION-592 -- Peter Linss to draft possible TAG position statement on Unicode, and alert Addison Phillips of our intention to attempt to get agreement starting in October after the F2F -- due 2011-09-08 -- OPEN 15:30:05 http://www.w3.org/2001/tag/group/track/actions/592 15:32:04 jar: First steo should be they provide a odcument for us to go from 15:33:09 i'd like an enumerated set of issues and options, ini writing, as prep for a telcon 15:33:16 s/ini/in/ 15:35:30 a brief should ideally summarize the opposing views 15:38:11 ACTION: plinss to invite I18N and other concerned groups to provide written technical input as prep to discussion with the TAG regarding unicode normalization 15:38:11 Created ACTION-606 - Invite I18N and other concerned groups to provide written technical input as prep to discussion with the TAG regarding unicode normalization [on Peter Linss - due 2011-09-21]. 15:40:48 ht: The wiki is actually not consistent with what it recommends -- do you or do you not compare canonicaliation-aware? 15:41:54 topic: Minimization 15:42:04 ACTION-590? 15:42:04 ACTION-590 -- Noah Mendelsohn to follow up with Addison Phillips on Unicode normalization http://lists.w3.org/Archives/Public/www-tag/2011Jun/0188.html -- due 2011-08-30 -- CLOSED 15:42:04 http://www.w3.org/2001/tag/group/track/actions/590 15:43:10 Noah: Commit or cancel? 15:43:45 we have a commitment to have a finding in 1 month, hoped to have an initial draft in July. 15:43:50 Worried about timing. 15:44:15 DanA said he thought he could get it together in time for TPAc and it willb e contentious. 15:44:30 s/willb e/will be/ 15:46:14 DanA; This has alwasy been a really small thing anyway. The device API folks are happy they say to implement it, and we are in some cases alreday doing it -- but we would liek to se more examples of where ti has been a good approach applied in the wild. 15:46:30 Where has this been applied and resulted in better privacy? 15:46:44 jar: (But this is a basic tennant of capability design!) 15:47:38 dka: We have had stron geolocation wg partcipation since it begain, and upoming will be civc address objects, as an enhancement to geo lat long, (street/city/region/country etc). 15:47:58 ... If tyou apply thi s design, you should be able to just ask for say th e state. 15:48:20 q+ 15:48:24 ack 15:48:29 q- 15:48:33 .. But implementers Google, Moz in geo wg, and Opera push back. 15:48:33 s/tennant/tenet/ 15:48:35 q+ 15:48:43 q- ht 15:49:17 "We did privacy in geopriv, a can of words .. " They never dealt with minimization -- if users and edevs are not asking for this, then why do it... OWTTE. 15:55:33 ack next 15:56:28 ack next 15:59:38 In Lyon last year I sat in on the geo location group 16:00:18 .. supported thei r approach to get out the document, an dnot be drawn to mcuh into the privacy issue. 16:00:47 I think we have some responsibility to push on policy-based requirements 16:01:59 masinter: I am trying to understand the responability of the TAG .. relationship with the privacy stuff going on ... you giot you said push back from implementers, but this in not necessariy a maret-based requirement, not a policty-based one. 16:02:46 ccessibility and princacy fit into reas where there may be other forces beyond the ones the product manager immediately sees, whcih can'y be justofied on the basis that this is what the user s are demanding. 16:03:26 Noah: I don't have a kind warm feeling about that geopriv history at the moment. 16:03:38 Ashok: I don't thin you can find good use cases. 16:04:03 i think i understand some of the geopriv / geolocation history 16:04:08 DKA: I cand find lots of academics who will talk about tis but I need people in the real world, selling in hte market 16:04:57 Noah: If I can do a getCity, getState() call etc, thats's oen thing ... i cn also do a different call getAddress() whic hmay return blanks. 16:05:02 i think it's an API design concern that has only a little bit to do with access control 16:05:13 dka: That is not the questin -- this i not about access control. 16:06:11 the shape of the API treats 'granularity' an input parameter, and where the result has options for returning multiple granularity 16:06:43 i'm disagreeing with Noah that 'access control' is the right way to approach this problem 16:06:59 Dan: The model we are dealing ith in htis minimization issue, is that it is good for a developer to ask for the minimum data tha they can use, to avoid ahving m mroe private data around than ncessary. It is not about access control by the user. 16:07:11 q? 16:07:16 Noah; [stuff about user permisssion access control] 16:07:42 Yves: All users know what an app will access, as it is displayed in big letters. They have seen that. 16:07:44 in fact, what i like about 'data minimization' is that it gets around some of the access control problems 16:07:54 In access control someone decides who gets to access what ... in this case the user decides what is disclosed about him 16:08:35 ... Howver, as the aops use it, there are many possibilities of leaks by which the data can get out. So minimization is interesting to just reduce the likelihood of damage by a leak. 16:09:11 alternative is a separate part of API to 'fuzz' or 'reduce granularity' ... might more be more effective than data minimization 16:09:45 dan: The assumption is that the user should not be given lots of prompts. Minimization as a best practice does not involve prompts. 16:10:40 q+ 16:11:03 noah: Two ways of doing this --inspecting the code of an app, or doing it at run time. 16:11:29 q? 16:11:32 ack next 16:15:09 [discussiono various forms of attak] 16:16:20 larry: You can get teh same effect as minimization by fuzzing the geo data. 16:16:57 dan: This draft supports that sort of approach. 16:17:28 Dan: The document is supposed to just state the arch'l principle, which is why it is good as a TAG doc. 16:17:51 could the privacy activity take this forward without the TAG owning it? 16:17:55 I would hope we would get agreement with the WGs which would be most connected, such as geolocation , for exampel. 16:18:10 larry: What part ofthis could we push onto the princay activity? 16:18:37 dan: I'd like to complete this, and then hopefully have the rivacy group support that document and point to it. 16:19:31 jar: These are not really a spec, just guidelines. 16:19:37 dan: Yes. 16:19:48 ... I would like this to be a finding. 16:20:14 Larry: I like it enough that I would like you to make it into something which could be a finding. 16:20:29 jar: It is a design pattern. 16:20:39 tim: I like the "pattern" language. 16:21:02 s/a spec, just guidelines/guidelines, just an article/ 16:22:06 Noah: Do we know eher ethis fist in the system? It is API mini'n 16:22:20 dan: Actually Data Minimization. 16:25:20 Tim: DO you wanto to extend it to Sensitive Data Minimization in Code ? 16:29:43 Ashok: Would like to publish this as a Note, not a Finding. 16:33:17 rrsagent, pointer? 16:33:17 See http://www.w3.org/2011/09/14-tagmem-irc#T16-33-17 17:11:32 timbl has joined #tagmem 17:22:15 noah has joined #tagmem 18:25:02 Zakim has left #tagmem