13:39:46 RRSAgent has joined #csvw 13:39:46 logging to http://www.w3.org/2015/04/22-csvw-irc 13:39:48 RRSAgent, make logs public 13:39:48 Zakim has joined #csvw 13:39:50 Zakim, this will be CSVW 13:39:50 ok, trackbot; I see DATA_CSVWG()10:00AM scheduled to start in 21 minutes 13:39:51 Meeting: CSV on the Web Working Group Teleconference 13:39:51 Date: 22 April 2015 13:59:34 gkellogg has joined #csvw 14:00:03 DATA_CSVWG()10:00AM has now started 14:00:10 +??P0 14:00:17 zakim, ??P0 is me 14:00:17 +jtandy; got it 14:00:21 zakim, mute me 14:00:21 sorry, jtandy, muting is not permitted when only one person is present 14:00:42 zakim, code? 14:00:42 the conference code is 2789 (tel:+1.617.761.6200 sip:zakim@voip.w3.org), gkellogg 14:00:49 zakim, dial ivan-voip 14:00:49 ok, ivan; the call is being made 14:00:50 +Ivan 14:00:53 +??P1 14:01:01 zakim, I am ??P1 14:01:01 +gkellogg; got it 14:01:10 JeniT has joined #csvw 14:01:37 zakim, mute me 14:01:37 jtandy should now be muted 14:01:40 +[IPcaller] 14:01:47 +[IPcaller] 14:01:53 zakim, [IPcaller] is me 14:01:54 +danbri1; got it 14:02:22 zakim, unmute me 14:02:22 jtandy should no longer be muted 14:02:59 =Structuring issue handing 14:03:14 ivan: issues so far came in via email lists, am moving them towards github 14:03:47 … have to be careful all issues that come up should have a clear reference when things are done and we get a hopefully positive reply from orig commentator 14:03:52 q+ re github accounts 14:04:12 ivan: you all might have other tricks esp re github 14:04:24 … isssues relating to our oblig horizontal reviews eg i18n 14:04:35 q+ i18n asked about schedule 14:04:46 q+ to note i18n asked about schedule 14:04:57 ivan: we might want dedicated labels for these 14:05:07 also 2 things require something from us 14:05:11 eg. yakov ietf 14:05:23 ack me 14:05:23 danbri, you wanted to discuss github accounts and to note i18n asked about schedule 14:06:31 ivan: re schedule, what we said we'll try to do is move to CR in June so that's probably our answer 14:06:55 'therefore comments towards end of may would be great' 14:07:38 jenit: can someone take resp for putting all issues into github, otherwise it is likely to [bad things] 14:07:45 ivan: i have been doing that past few days 14:07:53 q+ to suggest a weekly duty rosta 14:08:04 ivan: everything so far ought to be in github 14:08:07 but what details? 14:08:10 https://github.com/w3c/csvw/issues/507 14:08:18 consider for example #507 clipboard stuff 14:09:12 ivan: if i'm not around someone else could cover 14:09:21 orig mail should be put into, copied, into issue, with a link 14:09:48 if a thread on list, … we ought to take a look at both, … when we answer to the mailing list maybe we should also link/copy in github 14:10:02 jeni: great if you can take responsibility 14:10:08 agree we don't need to copy them all 14:10:16 ref to 1st + summary text is great 14:10:33 responding to mail saying thank you, … point to github, may help direct relevant conversation into github 14:10:44 we should try to keep our discussion in the github issue as much as poss 14:11:00 and when we reach a conclusion after github discussion, reply in orig thread 14:11:01 +1 to JeniT 's suggestion about discussion in github and pasting conclusion into email thread 14:11:07 q+ to suggest changing thread titles 14:11:29 ivan: at end of may i'll be out in florence, then to nyc, … 14:11:49 q? 14:11:53 ack me 14:11:53 danbri, you wanted to suggest a weekly duty rosta and to suggest changing thread titles 14:12:31 https://github.com/w3c/csvw/issues 14:13:55 there is also the "Milestone" mechanism 14:14:05 we could have "PreCR horiz review" there 14:15:07 I can swap labels for milestones 14:15:20 (if we agree that would be useful) 14:15:30 csvw 14:17:13 jtandy: … still #205 to rework requirements 14:17:24 that issue is currently assigned to davide 14:18:04 jtandy: at tail end of f2f, he volunteered 14:18:09 jenit pinging in github 14:18:29 jenit, let's go thru issues 14:18:36 clipboard formats… https://github.com/w3c/csvw/issues/507 14:18:38 https://github.com/w3c/csvw/issues/507 14:18:54 clear that this is not relevant as we are not doing clipboard formats for csv files 14:19:03 jenit: propose that i respond to list making that point 14:19:11 q? 14:19:22 https://github.com/w3c/csvw/issues/507#issuecomment-94857565 14:19:27 ivan: after i asked it, … one more comment i put here, … he did reply to my mail saying 'yes indeed' 14:19:28 q+ 14:20:25 gkellogg: does relate somewhat to an issue Ivan and I discussed, w.r.t. transformations if there is no source. In which case it nominally uses the annotated model. However there is no official API for that model to use. 14:20:33 But perhaps there is a way to serialize the data model back 14:20:54 … in which case a clipboard format for that annotated data model would be relevant 14:21:00 danbri: exciting work for v2! 14:21:02 jenit: +1 14:21:15 ... 14:21:25 jenit: for me, all of those transformations is completely impl defined 14:21:30 e.g. how any json is passed in 14:21:38 or rdf etc, stdin, params or whatever 14:21:41 we say nothing currently 14:22:05 also applies to what happens when someone specifies a transform with some format that we have not defined any kind of handling for 14:22:49 … 14:23:09 gkellogg: we haven't defined any way to access that abstract model 14:23:11 q+ 14:23:19 ack gkellogg 14:23:22 jenit: i think we can't, it is outside our scope and we don't know yet what will be wanted/useful 14:23:26 … let's look in a year's time 14:23:41 can stdardize later 14:23:49 +1 to JeniT ... once there are implementations in place, we could construct (at least) a Note to describe 14:24:28 q- 14:24:33 q? 14:24:59 back re #507 14:25:19 ivan: let's suggest we don't need to do anything, hopefully response is fine, then we can close the issue 14:25:22 rrsagent, pointer? 14:25:22 See http://www.w3.org/2015/04/22-csvw-irc#T14-25-22 14:25:31 rrsagent, please make logs public 14:25:44 https://github.com/w3c/csvw/issues/506 Relationships to RDB tables 14:25:57 ivan: not sure how closely Ashok is following our group/charter etc. 14:26:36 ivan: this is where the issue arose that gregg raises - the only thing that I think may make sense. In case of the transformation, there would be a way for the external process to get hold of the merged metadata. 14:26:44 … merged metadata has a clear syntax as we define it 14:26:48 hmph, Skype died 14:26:51 -JeniT 14:27:02 … if a way to get hold of that, i could imagine a way for processors to use it w.r.t. a relational database 14:27:30 ivan: it would be possible to create a processor that takes merged metadata and provides a relational database schema 14:27:36 and then from that point on, that's all we can do. 14:27:43 +[IPcaller] 14:27:48 what it would require, in spec of the metadata doc, … 14:28:14 ivan: the only thing it would req in the metadata doc is one more flag to transform to say that it takes merged metadata as its imput 14:28:20 jenit: how does that help? 14:29:02 ivan: only thing i was wondering about … just idea … i may imagine a processor that takes the metadata doc, the way we define it, and can turn the metadata doc into a relational schema. Could provide a hook in transformation, ... 14:29:16 jenit: if i was doing an import i'd want the data not just the schema 14:29:24 [so, for example, the metadata defined foreign keys that would materialise in a relational schema] 14:29:47 q+ to say i expected import/export tooling not schema gen 14:30:08 ack me 14:30:08 danbri, you wanted to say i expected import/export tooling not schema gen 14:31:05 q+ 14:31:17 ack gkellogg 14:31:34 gkellogg: other thing still open re rdb relationship, was bit about not being able to export cols that are from a reference 14:32:14 ivan: it would add a relatively complex thing to a slightly secondary feature 14:32:28 … adding a ref into spec to a wiki page that you can create is fine 14:32:44 gkellogg: do we need to document that we dont fully match rdb model in that regard 14:32:58 jenit: couldn't we have an appendix on the rdb mapping? 14:33:07 (in cvs2rdf presumably?) 14:33:15 ivan: we can turn wiki page into an appendix 14:33:15 (think so) 14:33:30 … i would not think that adding additional sparql engine etc is needed there 14:34:02 jenit: can we put an action on jtandy or gregg to add some appendix to the rdf doc? 14:34:12 ivan: let me do that, i wrote the wiki page. i'll make a 1st run. 14:34:16 turn it into appendix 14:34:22 rrsagent, pointer? 14:34:22 See http://www.w3.org/2015/04/22-csvw-irc#T14-34-22 14:34:35 (jenit makes a dedicated issue) 14:34:50 (lots of fast typing noises!) 14:35:25 ivan: doing anything more we consider beyond our charter, but we'll integrate that wiki page into final rdf doc. 14:35:34 dan to respond to ashok on this 14:35:46 ivan: should we cc tag list 14:36:20 back to https://github.com/w3c/csvw/issues 14:36:25 https://github.com/w3c/csvw/issues/505 14:36:36 #505 minor, ivan's editorial that gregg/jeni could handle 14:36:50 https://github.com/w3c/csvw/issues/509 14:36:58 q+ 14:37:01 Relationship to Data Cube #509 14:37:07 q+ to suggest virtual cols help 14:37:10 q? 14:37:14 ack jtandy 14:37:38 jtandy: i have a reasonably well worked out example with real data, if i publish that into wiki i can refer to it 14:37:43 q? 14:37:58 ack me 14:37:58 danbri, you wanted to suggest virtual cols help 14:39:07 https://github.com/w3c/csvw/issues/179 14:39:18 danbri: v useful having a datacube-based example on the value of virtual cols 14:39:54 ivan: my #179 issue i asked if we needed virtual columns. Doing the rdbrdf text i found i needed virtual columns. So we need it by charter. 14:39:58 [virtual columns are necessary for mapping CSV to RDF Data Cube] 14:40:03 q+ Do we even need metadata merge? 14:40:14 so i am ok closing that issue 14:40:28 jenit: can you ivan make that comment within the issue? 14:40:33 so we can close it within that issue 14:40:39 ivan: ok, will do today 14:41:29 https://github.com/w3c/csvw/issues/508 14:41:58 q? 14:42:08 Comments on registration template for "appplication/csvm+json" #508 14:42:14 q+, Do we even need metadata merge? 14:42:24 ivan: [missed] 14:42:28 will start things moving 14:42:38 ivan: process says this should be done before candidate rec 14:42:45 new process doesn't say much else 14:42:48 we should do it 14:43:18 gkellogg: it became apparent in my blog post writing, that metadata merge is a big issue for implementors. Complexity .... 14:43:27 only reason for it right now is to pull in titles from CSVs 14:43:48 it is possible that some embedded metadata could be defined, but as-is now, it only really exists for adding titles. Only there for verification purposes. 14:44:12 should we be rethinking notion of metadata merge, replace it with validation phase, checking CSV's cols to those described in metadata. 14:44:20 jenit: what of issue of user-provided metadata? 14:44:28 q+ to comment about 3rd party usage of CSV files 14:44:28 gkellogg: yes it might merge with some other metadata ... 14:44:52 … if what the user is providing is on the minor nature, e.g. a tab-separated values option, then sure some need to modify 14:44:57 but otherwise it is pretty minor 14:45:03 e.g. dialect description 14:45:12 q+ 14:45:15 ack jtandy 14:45:15 jtandy, you wanted to comment about 3rd party usage of CSV files 14:45:50 ack ivan 14:46:00 jtandy: (missed detail, sorry) 14:46:11 ivan: there is user metadata that jeni mentioned 14:46:25 we do not have evidence right now that gregg's point is true 14:46:29 [(or false :)] 14:46:46 there is option that the csv might carry metadata 14:46:51 which would also be merged with whatever is out there 14:47:01 any metadata that is part of the doc is only on trivial matters 14:47:26 [detail: for people re-using CSV files published by third parties, we can override any metadata defined at source because we can always start the CSV "import" by pointing to _my_ local metadata file] 14:47:32 q+ to ask re Spreadsheets support (excel, drive) 14:48:19 ack danbri1 14:48:56 q? 14:48:58 ack me 14:48:58 danbri, you wanted to ask re Spreadsheets support (excel, drive) 14:49:07 q+ admin issue 14:49:28 q+ 14:49:33 ack admin 14:49:36 ack issue 14:49:46 jenit: Rufus has mentioned that (alongside virtual cols) the merge could be a barrier for implementors 14:49:53 we have had evidence that it is putting people off 14:50:04 I'm in favour of people defining formats with embedded metadata within them 14:50:36 we could say: it is down to them to say how that embedded metadata is converted to other, … we provide vocabulary, but not necssarily the exact processing rules 14:51:01 jenit: we have simplified around saying 'you use the 1st metadata file you find', even when that is user supplied, that reduces the times when we need merge 14:51:08 it is really now only focussed on titles from CSV files 14:51:13 and the user supplied options 14:51:18 and for them it will be impl defined 14:51:31 so because we're not defining what options, apis, commandlines etc , … 14:51:41 q+ to ask if there were any requirements demanding metadata merge 14:51:42 with all of that evidence i am also in favour of removing the need for merge 14:51:54 ack ivan 14:51:57 and instead have specialized step for checking titles supplied within CSV 14:52:09 +1 from me (the metadata merge discussions confused me) 14:52:12 ivan: 1st of all, admin issue. FOr all our processing, it would be good if this went into issue list. 14:52:12 +1 14:52:32 ivan: this seems a significant change, we should open/close it in github. 14:52:36 gkellogg: I will do that 14:52:46 would we need another pre-lccr pub? 14:52:57 [thoughtful silence] 14:53:05 ivan: let me come back to that 14:53:14 there is another related admin issue. 14:53:20 list of changes... 14:53:43 ivan: we do not have an *obligation* to issue another doc, since this is not formally a LC 14:53:49 since we are not in LC 14:54:12 i think that what we can do, … we make the change, and if it is documented on the issue list, we can send around an email on our own lists, we can draw attention to it 14:54:19 maybe contact implementors that we know about directly 14:54:25 q+ 14:54:32 ack jtandy 14:54:32 jtandy, you wanted to ask if there were any requirements demanding metadata merge 14:54:34 ack JeniT 14:54:55 jtandy: checking reqs to see if anything demanded a merge, … didn't find on a quick skim 14:55:07 ack danbri 14:55:18 danbri: test case that passes now and will fail or v-versa? 14:55:48 gkellogg: multiple relevant tests. will have to see which are affected. over 100 distinct tests… 14:56:26 gkellogg: as part of my extended action i'll investigate via my implementation 14:56:47 [a quick skim through the requirements indicates nothing demanding metadata merge] 14:57:21 -gkellogg 14:57:23 -jtandy 14:57:24 -Ivan 14:57:25 -JeniT 15:04:43 rrsagent, draft minutes 15:04:43 I have made the request to generate http://www.w3.org/2015/04/22-csvw-minutes.html ivan 15:05:00 disconnecting the lone participant, danbri1, in DATA_CSVWG()10:00AM 15:05:03 DATA_CSVWG()10:00AM has ended 15:05:03 Attendees were jtandy, Ivan, gkellogg, JeniT, danbri1