14:52:32 RRSAgent has joined #csvw 14:52:32 logging to http://www.w3.org/2015/02/25-csvw-irc 14:52:47 rrsagent, this will be csvw 14:52:47 I'm logging. I don't understand 'this will be csvw', danbri. Try /msg RRSAgent help 14:52:54 Zakim has joined #csvw 14:53:01 zakim, this will be CSVW 14:53:01 ok, danbri; I see DATA_CSVWG()10:00AM scheduled to start in 7 minutes 15:00:45 JeniT has joined #csvw 15:01:25 hi dan 15:01:29 hi 15:01:39 about to dial in. don't see anyone else in irc yet 15:01:59 I didn’t send out an agenda :/ 15:02:00 skype tells me "You have lost internet connectivity or your status is Offline. Please check your connection or change your status and try again." 15:02:20 I didn't either - sorry - california trip on tail end of that cold you saw 15:02:58 DavideCeolin has joined #csvw 15:03:04 DATA_CSVWG()10:00AM has now started 15:03:12 +[IPcaller] 15:03:26 zakim not taking my codes now :-| 15:03:33 gkellogg has joined #csvw 15:04:16 +[IPcaller] 15:04:19 zakim, IPcaller is danbri 15:04:19 +danbri; got it 15:04:24 jtandy has joined #csvw 15:04:26 zakim, code? 15:04:26 the conference code is 2789 (tel:+1.617.761.6200 sip:zakim@voip.w3.org), gkellogg 15:05:05 agenda: https://www.w3.org/2013/csvw/wiki/Meetings#Standing_Agenda 15:05:07 +[IPcaller] 15:05:13 zakim, IPcaller is me 15:05:13 +DavideCeolin; got it 15:05:16 +??P2 15:05:23 zakim, ??P2 is me 15:05:23 +gkellogg; got it 15:05:48 jumbrich has joined #csvw 15:05:49 +??P5 15:06:00 zakim, ??P5 is me 15:06:00 +jtandy; got it 15:06:07 scribnick: danbri 15:06:09 +??P0 15:06:19 scribenick: danbri 15:06:19 ??P0, zakim is me 15:07:19 regrets: Ivan 15:07:33 https://github.com/w3c/csvw/issues?q=is%3Aopen+is%3Aissue+label%3A%22Requires+telcon+discussion%2Fdecision%22 15:07:37 jenit: standing issue-driven agenda 15:07:45 https://github.com/w3c/csvw/issues/223 15:07:49 we started last week around unions of datatypes 15:08:02 Allowing "unions" of datatypes? #223 15:08:06 logger, pointer? 15:08:12 rrsagent, pointer? 15:08:12 See http://www.w3.org/2015/02/25-csvw-irc#T15-08-12 15:08:35 jumbrich: we tried to get some hints on frequency of datatypes in cols 15:08:47 we found our code slow, small/strange bugfeatures 15:09:04 Starting at 93000 files, parsed until lunchtime around 10k. 15:09:27 Most prominently noticed that datatypes are mixed with just strings and integer / numerical values, and that we had diff types of numerical values e.g. integers and floats 15:09:34 jenit: how freq or common is that? 15:09:53 jumbrich: roughly 1/3 of docs has a mix of char strings vs numerical values 15:10:11 we parsed 9000 docs. In 2082 we found an alpha value i.e. chars, and a float combination 15:10:32 danbri: how many (indep.) srcs are these things from? 15:11:03 jumbrich: good point, couple files are from same host, … eg. with same date. A lot from some brazilian portal who have a file per month. We looked at some gov uk… 15:11:25 … for now most of the files they probably belong to a small set of providers. Might be biased in that way. 15:12:00 as I wrote in the mail there is some risk of bias, hope for more next week 15:12:19 nothing in https://lists.w3.org/Archives/Public/public-csv-wg/2015Feb/ yet 15:12:58 [aside: jumbrich can you take a look at http://danbri.org/words/2012/03/08/776 … I tried to workaround the bias issue with some rdf vocab stats] 15:14:02 https://github.com/w3c/csvw/issues/223 15:14:52 sorry, doorbell brb 15:15:34 back 15:15:36 [discussion of null] 15:15:48 q+ to ask about multiple null values 15:15:54 ack jtandy 15:15:54 jtandy, you wanted to ask about multiple null values 15:16:02 q+ 15:16:19 jtandy: sometimes the reason I'll use multiple null values, … each letter I use for a null is for a diff reason. 15:16:24 would be useful to have this in the data 15:16:36 …if it says M it means null for this reason, N for this reason, … etc 15:16:56 but of course if it is a null value, the uri template gets to work on the canonicalized value, so M becomes null field, … 15:16:59 see https://github.com/w3c/csvw/issues/218 15:17:04 in a URI template, null is always null 15:17:23 jenit: union datatypes coudl give some way out 15:17:31 … define a datatype w/ some string value 15:19:04 gregg: how about literal from some other col? 15:19:25 … could imagine source something or other, including name of col as way of retrieving, e.g. virtual variables 15:19:31 … seems like a lot of complexity 15:19:45 ack gkellogg 15:20:19 jenit: let's open a separate issue for this. 15:20:29 gkellogg: impl of multiple datatypes is straightforward 15:20:40 the complexity maybe the merge language since a datatype is an atomic value 15:20:45 which does not otherwise take an array 15:21:02 that's the potential complexity - i.e. how it complicates the merge language 15:21:17 jenit: we'll revisit this again next week w/ more data from j. and ivan back 15:21:21 https://github.com/w3c/csvw/issues/260 15:21:27 -??P0 15:22:03 jtandy: we have core annos defined for tables 15:22:17 … e.g. to define the source, rows, cols, tables, … but not for table groups. 15:22:38 … given that resources is a mandatory property should it not be handled as a table group annotation? 15:22:41 jenit: makes sense 15:22:48 +1 15:22:52 +1 15:22:53 +1 15:22:55 +1 15:22:58 +1 15:23:03 +??P0 15:23:06 sorry 15:23:17 zakim, ??P0, is me 15:23:17 sorry, jumbrich, I do not recognize a party named '??P0,' 15:23:26 zakim, ??P0 is me 15:23:26 +jumbrich; got it 15:23:27 https://github.com/w3c/csvw/issues/261 15:23:56 jtandy: re serializations, we talked about standard mode, and a minimal mode 15:24:12 latter with just the resources described _in_ the table, without all the other paraphanalia 15:24:23 … for latter we needed a property linking table to the things described by the table 15:24:30 e.g. describes 15:24:34 https://tools.ietf.org/html/rfc6892 15:24:39 q+ 15:25:00 … i didn't find one, but there must be similar out there. 15:25:05 maybe rdf dataa cube has such? 15:25:06 ack gkellogg 15:25:23 gkellogg: my interpret. of the minimal mode, is that it would not include any table data at least in the rdf 15:25:25 http://www.w3.org/TR/powder-dr/#assoc-linking 15:25:28 it would simply be the results 15:25:41 q+ 15:25:48 q+ 15:26:26 jtandy: i expected min mode to be a block describing table as dataset, and then have something saying 'all of these resources are described in this dataset' 15:26:43 jenit: either way you'll need this relationship 15:26:45 ack danbri 15:27:58 danbri: yup powder or something from skos/schema/foaf/whatever for the non-minimal. For minimal, focus on 'what the table tells you about the world'. Row links aren't minimal. 15:28:04 jenit: let's open an issue on this 15:28:13 ack JeniT 15:28:14 jtandy: we discussed minimal standard and maybe another 15:28:33 https://tools.ietf.org/html/rfc6892 15:28:48 jenit: see also rfc 6892 'describes' 15:29:23 “In accordance with the ATOM specification [RFC4287], the describedby relationship is a relative URI, the base of which is http://www.iana.org/assignments/relation/ - i.e. the full URI of describedby is http://www.iana.org/assignments/relation/describedby - and this is included in the ATOM registry [AREG].” 15:29:29 (and in foaf we have xmlns.com/foaf/spec/#term_isPrimaryTopicOf … for things to docs and doc fragments) 15:29:38 (dc has 'subject', schema has 'about', …) 15:30:02 jenit: to close this issue, we'll need to pick something 15:30:42 jtandy: we haven't used powder or foaf so far in output 15:30:49 gkellogg: void would make a lot of sense 15:31:01 jenit: but void describes a linked data dataset 15:31:19 jtandy: i also agree that the rel between a void dataset and the thing is weak 15:31:36 … all the triples belong to this dataset implicitly, so a little weird. 15:32:05 jenit: I suggest, dc subject isn't strong enough, foaf would be the wrong vocabulary, so it comes down to schema.org or using the iana link relation 15:32:45 (http://schema.org/Dataset (dcat inspired) is in schema, but no notion of Table or Row in there yet) 15:32:58 jenit: so I'll propose isDescribedBy to have a candidate 15:33:05 gkellogg: except url doesn't link to anything useful 15:33:09 +1 15:33:11 jtandy: which is kind of irritating 15:33:24 jenit: so schema.org/about ? … 15:33:51 also doesn't quite feel right 15:34:20 jenit: will re-propose using linked relation isDescribedBy from POWDER 15:34:32 PROPOSAL: use http://www.iana.org/assignments/relation/describedby, as defined by POWDER, as relationship between entities & the table they are described within 15:34:40 ( somewhat related - https://github.com/schemaorg/schemaorg/issues/301 ) 15:34:59 jtandy: so this is rel from resource to row? 15:35:03 +1 15:35:05 jenit: yes 15:35:08 +1 15:35:23 jtandy: how would that work in the json serialization? 15:35:33 … you'd need a row object with all the things it describes in that row? 15:35:47 jenit: you'd maybe want described by property, maybe with a prefix on it, to avoid name clashes, ... 15:36:38 (discussion of fwd/back relation direction) 15:37:05 gkellogg: even from rdf, if you are filtering on subjects or entity types, having the row refer to the subjects, … vs other way, seems like it allows similar filtering 15:37:10 …and of course they're all triples. 15:37:20 jenit: is that a proposal to use 'describes' instead? 15:37:42 gkellogg: I guess 'describes' [from that rfc] would make more sense 15:37:59 jenit: i think yes in the json you don't need the specific rel as containment can clarify this 15:38:06 status of the rfc - informational 15:38:10 RFC-6892 15:38:27 -jumbrich 15:38:28 danbri: would it be normative? 15:38:39 jenit: just saying use it as a property - not making normative statements about it? 15:38:54 jtandy: the full ref to this would be www.iana.org/assignments/relations/describes ? 15:38:59 yup 15:39:06 (presumably http or https :// ) 15:39:24 jenit: this would/could make it easier to indicate what the table contains 15:39:48 PROPOSAL: use http://www.iana.org/assignments/relations/describes, as defined by RFC6892, as the relationship between a table/row and the entities that are described wtihin it 15:39:48 +??P9 15:39:53 +1 15:39:53 zakim, ??P9 is me 15:39:53 +jumbrich; got it 15:39:54 +1 15:39:57 +1 15:39:58 +1 15:40:00 +1 15:40:04 +1 15:40:07 "The relationship A 'describes' B asserts that resource A provides a description of resource B. There are no constraints on the format or representation of either A or B, neither are there any further constraints on either resource." 15:41:24 jtandy: and the prefix would be what? 15:41:43 gkellogg: [missed] 15:41:50 .. how does rdf reference link relations? 15:42:02 not in http://www.w3.org/2011/rdfa-context/rdfa-1.1 15:42:14 powder is in there 15:43:35 it points to http://www.iana.org/assignments/link-relations/link-relations.xml 15:43:41 jenit: I would use 'rel' 15:44:23 or http://www.iana.org/go/rfc5988:foo ? 15:44:29 er rfc5988:foo 15:44:42 jenit: it does not exactly matter exxcept in examples 15:45:04 jtandy; when i use it in plain json, do i use unqualified 'describes'? no prefix? 15:45:14 jenit: you could use document containment hierarchy? 15:45:23 jtandy: you need a property to attach the array 15:45:33 jenit: yes then 'describes' without prefix 15:45:51 https://github.com/w3c/csvw/issues/245 15:46:07 "Separate RDF savy implementations? #245" 15:46:14 jenit: would be better with Ivan here 15:46:43 gkellogg: also https://github.com/w3c/csvw/issues/193 ("Documenting the processing steps for metadata #193") but also needs Ivan really. 15:47:02 … main point on that is that there is a finalization step to the metadata, pieces that are logically there are substantialized 15:47:06 e.g. name/title of a col 15:47:47 … what we have is consistent with the way we have been describing things so far. 15:48:09 … if we look at this as a consensus process, I think Ivan has a different feeling on that and I'd be wary of closing this issue without his involvement. 15:48:48 jenit: I see it as same result either way, comes down just to how it is described in the doc and whether it is clear to people. Let's revisit this once other changes are merged in, probably after we publish the next drafts. Reasonable? 15:49:02 gkellogg: I think what we should revisit is the finalization 15:49:32 … important for next set of pubs. Separate that bit about finalizing. There's text to review, … if we can include this info sooner and preserve finalization for a later round. 15:49:36 jenit: yes 15:49:50 … as long as we have covered saying what propertty values are 15:50:01 … and revisit after seeing the draft 15:50:20 (jenit to progress this issue) 15:50:29 aob? 15:50:57 jtandy: am planning a chunk of work on the conv docs this weekend, which should get through my resolved-but-not-yet-actioned issues. 15:50:59 jenit: great 15:51:13 … when done we can all review before aiming to publish as discussed end of March. 15:51:18 s/weekend/week/ 15:51:28 Adjourned. 15:51:32 rrsagent, please draft minutes 15:51:32 I have made the request to generate http://www.w3.org/2015/02/25-csvw-minutes.html danbri 15:51:35 -JeniT 15:51:51 rrsagent, please make minutes public 15:51:51 I'm logging. I don't understand 'please make minutes public', danbri. Try /msg RRSAgent help 15:52:12 -gkellogg 15:52:16 -DavideCeolin 15:52:32 -jumbrich 15:52:36 jumbrich has left #csvw 15:53:16 jtandy, https://dvcs.w3.org/hg/webschema/file/b4c3ad199322/schema.org/ext/weatherforecast.html 15:53:58 -jtandy 15:53:59 -danbri 15:53:59 DATA_CSVWG()10:00AM has ended 15:53:59 Attendees were JeniT, danbri, DavideCeolin, gkellogg, jtandy, jumbrich 16:05:47 rrsagent, make minutes public 16:05:47 I'm logging. I don't understand 'make minutes public', danbri. Try /msg RRSAgent help 16:07:27 I don't see command to make logs public 16:07:31 rrsagent, make logs public 16:07:36 ah, logs vs minutes 17:59:19 Zakim has left #csvw 18:15:44 danbri has joined #csvw 18:29:02 danbri has joined #csvw 18:30:16 danbri1 has joined #csvw