14:58:39 RRSAgent has joined #csvw 14:58:39 logging to http://www.w3.org/2015/01/14-csvw-irc 14:58:41 RRSAgent, make logs public 14:58:41 Zakim has joined #csvw 14:58:43 Zakim, this will be CSVW 14:58:43 ok, trackbot, I see DATA_CSVWG()10:00AM already started 14:58:44 Meeting: CSV on the Web Working Group Teleconference 14:58:44 Date: 14 January 2015 15:00:20 zakim, code? 15:00:20 the conference code is 2789 (tel:+1.617.761.6200 sip:zakim@voip.w3.org), gkellogg 15:00:27 jtandy has joined #csvw 15:01:22 +??P18 15:01:27 zakim, I am ??P18 15:01:27 +gkellogg; got it 15:01:28 +??P22 15:01:30 -??P22 15:01:57 just waiting for fire alarm test to complete :-( 15:02:32 +[IPcaller] 15:02:50 -[IPcaller] 15:03:06 zakim, IPcaller is danbri 15:03:06 zakim, who is on the phone? 15:03:07 sorry, danbri1, I do not recognize a party named 'IPcaller' 15:03:07 On the phone I see JeniT, gkellogg 15:03:30 +??P0 15:03:41 zakim, I am ??P0 15:03:41 +jtandy; got it 15:04:09 danbri2 has joined #csvw 15:04:44 +[IPcaller] 15:04:48 zakim, IPcaller is danbri 15:04:48 +danbri; got it 15:04:51 danbri has joined #csvw 15:05:21 ScribeNick: JeniT 15:05:42 JeniT & al, sorry I massively blanked on the call - can't believe it's weds already 15:05:57 jtandy: we can simplify conversions if we’re just basing off the tabular data model 15:06:23 gkellogg: yes, precedence etc all gets taken care of in the model generation 15:07:33 JeniT: I’ve added standing agenda on the main page 15:07:36 ScribeNick: danbri2 15:07:49 gkellogg: it would be good to still get emails to confirm that it’s going ahead 15:08:20 https://github.com/w3c/csvw/issues?q=is%3Aopen+label%3A%22Requires+telcon+discussion%2Fdecision%22+sort%3Aupdated-asc 15:08:23 jenit: prioritising issues for today, trying to check off our big list by working thru from either old-to-new or least-recently-updated, ... 15:08:57 -danbri 15:08:57 https://github.com/w3c/csvw/issues/55 15:09:34 JeniT: syntax for regular expressions 15:10:16 … any thoughts? 15:10:37 gkellogg: every language has a regex syntax, which big overlaps 15:10:49 … there are edge cases in particular languages 15:11:00 … this concerns me 15:11:12 … we should probably say a best practice for interop is to be conservative in use 15:11:17 q+ 15:11:31 … the only reason for a regex is to determine whether the value of a cell is used or not 15:11:42 … there’s no componentisation 15:12:04 ack jtandy 15:12:12 jtandy: my impression is compatible with gkellogg’s 15:12:24 … I’m struggling where we actually decided to use regular expressions within the metadata 15:12:33 … we were talking about using regexs to parse difficult bits of strings 15:12:38 … so is this time-expired? 15:13:23 ok - so regexp is used for validating CSV files ... 15:13:48 danbri1 has joined #csvw 15:13:58 jtandy: so the main point is for validation, to make sure that CSV meets a schema 15:14:59 +[IPcaller] 15:15:07 zakim, IPcaller is danbri 15:15:07 +danbri; got it 15:15:43 gkellogg: we have too many 'MAYs' in specs, w.r.t. testing 15:15:44 JeniT: there’s the option of making it ‘implementation defined’ and encouraging authors to be conservative 15:15:46 scribenick: danbri1 15:16:04 gkellogg: don't think we can say its impl defined, ... 15:16:14 q+ 15:16:19 … but also don't want to go too overboard in testcases, cooking up weird ways of breaking things via regexes 15:16:25 jenit: where does this leave us? 15:16:37 … which of the regex options do we say that impl has to conform with? 15:16:52 gkellogg: i expect the XML Schema one will be the most broadly available 15:17:08 … versus probably need to load a js env within ruby 15:17:16 … not v appealing 15:17:49 ack jtandy 15:17:51 … don't want to repeat their test suites to ensure all and every detail followed 15:18:33 -jtandy 15:18:59 gkellogg: i share j's concern w/ datetime strings and formats 15:19:13 jenit: propose we go fwd with speccing 15:19:23 … i don't think it's true that everyone has an xml schema impl handy 15:19:29 … it is an extra impl on top 15:19:34 (+1 on that —dan) 15:19:41 jenit: my pref is to go for ecmascript 15:19:42 +1 15:19:49 gkellogg: ok 15:20:01 jenit fills out github issue 15:20:29 sorry - lost connection via zakim - trying to get back in! 15:20:52 https://github.com/w3c/csvw/issues/65 15:20:59 jenit: next one is about datetime picture string 15:21:07 +??P0 15:21:07 "Register of recognised date-time picture string formats" 15:21:12 zakim, I am ??P0 15:21:12 +jtandy; got it 15:21:16 gkellogg: i was unable to find an impl that uses them for parsing 15:21:24 they are more often used for formatting 15:21:44 jenit: interesting. that suprises me a bit. Doesn't moment.js use it? 15:21:47 …for parsing 15:22:01 http://momentjs.com/docs/#/parsing/string-format/ 15:22:02 gkellogg: I didn't find one (but there may be impl out there) 15:22:48 https://github.com/w3c/csvw/issues/54 15:23:21 issue: it would be better to just have a list of known datetime formats 15:23:22 Created ISSUE-2 - It would be better to just have a list of known datetime formats. Please complete additional details at . 15:23:29 gah, bot. 15:23:53 jenit: this github i**ue is about the idea that it would be better  to just have a list of known datetime formats 15:24:12 … some that were hardcoded, this is what the format means, ... 15:24:41 jtandy: yes, spec would identify a registry/page, which says "this is how to identify the … datetime string, moment.js etc" 15:24:50 q+ 15:24:57 jenit: i was misinterpreting. This is about the syntaxes you recognise for the strings 15:25:05 jtandy: if you look at - 15:25:09 { "datatype" : "date", "format" { "python" : "%m/%d/%Y", # standard python format "javascript" : "M/D/YYYY", # javascript's moment.js format etc. } } 15:25:58 … register of known values effectively saying e.g. that the key 'python' refers to the python datetime string as spec'd at [url] 15:26:05 ack gkellogg 15:26:06 gkellogg: this didn't appeal to me. ugly! 15:26:26 … my u/standing of it has just changed, … but it is not even clear that these, the python and js, do same thing 15:26:54 … trying to enumerate all the ways, … editors won't do this, will only specify the ones of interest to them 15:27:28 … my main concern is w.r.t. interoperability 15:27:34 "datatype": "date", 15:27:34 "format": { 15:27:34 "picture-strings": [ 15:27:34 "unicode": "dd MMM yyyy", 15:27:34 "xpath": "[D01] [MN,*-3] [Y0001]" 15:27:34 ] 15:27:34 } 15:27:41 jtandy: there's a better eg in the csv2json doc, pasted above here. 15:28:00 … we assume that the metadata publisher cares enough about unicode version, xpath version, perhaps not about other options 15:28:22 q in terms of software impl, "do i understand the unicode thing? xpath?" "i can't read the datetime format anyway" 15:28:47 jenit: reading datetime format is pretty important for conversions, unlike regex which is more for validating 15:29:02 jtandy: same applies for number formats that we also discussed 15:29:14 gkellogg: I disagree that the regex stuff is just for validators 15:29:33 … that was my interpretation of it. If cell doesn't match some regex, it was … use a null or default value. 15:29:39 jenit: i don't think it says that anywhere 15:30:29 …rather than try parse all datetime formats have list of popular ones, e.g. lists from excel, google spreadsheets etc. 15:30:45 gkellogg: I think that is more likely to get good impl 15:30:54 … perhaps ns doc could contain registry of these formats? 15:31:01 … so we could update that without updating the spec? 15:31:23 jenit: is*ue there is impl conformance. … how often would it need to check the registry? 15:31:53 jtandy: are we anticipating then, that validating + parsing software would try to detect which of the blessed formats it used? 15:32:04 “datatype”: “date”, “format”: “ISO” 15:32:17 jenit: you'd have something like this [above], … col is a datetype date, format is ISO 15:32:18 YYYY-MM-DD 15:32:25 (iso8601?) 15:32:53 jenit: there would be a builtin list within each impl for such formats, we could name them after the unicode picture string 15:33:03 maybe later we could move towards fully supporting 15:33:43 jtandy: we give a particular list of pic strings, … beyond those you might be stuck 15:33:48 jenit: or fall back to regex 15:33:51 jtandy: seems workable 15:33:56 gkellogg: agree, best way fwd 15:33:57 +1 15:34:15 jenit: maybe not enough of us to make a formal resolution but i'll update issues 15:34:31 [no audio] 15:34:37 -danbri 15:34:43 https://github.com/w3c/csvw/issues/54 15:35:02 [skype has kicked me off.] 15:35:26 danbri has joined #csvw 15:35:32 +[IPcaller] 15:35:41 zakim, IPcaller is danbri1 15:35:41 +danbri1; got it 15:36:10 -jtandy 15:36:11 jtandy: conversion doc will need to reflect [metadata doc] when that's done 15:36:29 jenit: conv doc will need to say to use the value parsed per metadata spec 15:36:31 sorry - booted off the call again!!! 15:36:34 … use the semantic value of the cell 15:37:06 q? 15:37:08 +??P0 15:37:09 zakim, I am ??P0 15:37:09 +jtandy; got it 15:37:26 gkellogg: place to handle this is in the metadata doc 15:37:35 … transform docs should just reference that 15:37:36 +1 15:37:49 jenit: putting that as a comment on the is*ue 15:38:12 summary: metadata doc will talk about how string values are parsed into a semantic value, i.e. an actual structured date 15:38:28 …based off these formats. All the conversion docs must say is that you're using this 15:38:48 gkellogg: following this logic, if [something regex] there'd be a null value 15:39:07 jenit: maybe needs to be added to model, if an error on cell, … 15:39:18 …then conversion doc can define error behaviour for cells 15:39:37 (dan: I like this granular validity model) 15:39:49 jenit: adding to issue 15:39:57 *an 15:40:45 jtandy: if can't recognise, then the literal value will need to be used as no other options 15:41:12 gkellogg: e.g. if it was a datetime, … and we didn't recognise it, would rdf conversion use a datetime datatype, vs as a literal 15:41:20 … i think in rdfa it would've gone to plain literal 15:41:30 jenit: makes sense 15:41:37 jtandy: same issue is relevant to parsing numbers 15:41:48 jenit: the issue we put it on (#54) covers that 15:42:06 …reasonable middle ground 15:42:16 https://github.com/w3c/csvw/issues/66 15:42:27 #66 "Composite primary keys and foreign key references" 15:43:24 nice fiddly issue! 15:43:39 skipping for now. 15:44:19 ACTION: JeniT to add foreign key discussion to F2F agenda 15:44:19 Created ACTION-60 - Add foreign key discussion to f2f agenda [on Jeni Tennison - due 2015-01-21]. 15:45:14 https://www.w3.org/2013/csvw/wiki/F2F_Agenda_2015-02 15:45:14 https://github.com/w3c/csvw/issues/49