15:21:33 RRSAgent has joined #dwbp 15:21:33 logging to http://www.w3.org/2014/10/30-dwbp-irc 15:21:41 zakim, this will be dwbp 15:21:41 ok, phila; I see DATA_DWBP()11:30AM scheduled to start in 9 minutes 15:22:05 RRSAgent, this meeting spans midnight 15:22:39 Meeting: DWBP Face to face, TPAC 2014, Day 1 15:22:46 CarlosIglesias has joined #dwbp 15:24:31 DATA_DWBP()11:30AM has now started 15:24:39 +RiccardoAlbertoni 15:26:36 how remote partecipation is going to work? are we supposed to be connected only by irc or an audio connection is foreseen? 15:28:29 -RiccardoAlbertoni 15:28:30 DATA_DWBP()11:30AM has ended 15:28:30 Attendees were RiccardoAlbertoni 15:28:49 Hi RiccardoAlbertoni - you need to be on IRC as usual and the dial in number will work. We're still gathering here so I won't connect to zakim, just yet 15:29:25 My guess is a lot of people will assume we're starting at 09:00 (half an hour's time). No chairs here yet... 15:30:36 Caroline__ has joined #DWBP 15:30:42 Hello!!! 15:31:06 ok, thanks .. then I will wait the actual start for calling by skype .. 15:31:36 ericstephan has joined #dwbp 15:32:03 We will call Zakim? 15:32:51 Yes, but not yet Caroline__ 15:33:08 People are still gathering here in the room. 15:33:17 ok! Please let me know when I should call 15:33:22 It's still early morning here... 15:34:20 Good morning! :) 15:35:51 JeniT has joined #dwbp 15:38:48 raphael has joined #dwbp 15:38:59 I have made the request to generate http://www.w3.org/2014/10/30-dwbp-minutes.html raphael 15:40:32 Eric_Kauz has joined #DWBP 15:44:13 Hi all! Official declaration that we'll be starting at 9:00 (in 15 mins) 15:44:15 gatemezi has joined #dwbp 15:44:22 Ok! :) 15:44:25 :) 15:44:39 Morning, gatemezi. I just said we'll be starting in 15 mins 15:45:38 Morning Hadley.. Thanks. Any other means to follow you remotely apart from Zakim ? 15:45:39 raphael has joined #dwbp 15:47:34 zakim, call SalonA 15:47:34 ok, phila; the call is being made 15:47:35 DATA_DWBP()11:30AM has now started 15:47:37 +SalonA 15:48:16 Present+ Raphael_Troncy 15:48:24 jtandy has joined #dwbp 15:48:53 laufer has joined #dwbp 15:48:56 Ig_Bittencourt has joined #dwbp 15:49:12 zakim, SalonA has Jeremy_Tandy, Laufer, Ig, JeniT, Bart, Chunming, Eric_Kauz, raphael, hadleybeeman, phila, Olivier, Annette, Erik_Mannens 15:49:13 +Jeremy_Tandy, Laufer, Ig, JeniT, Bart, Chunming, Eric_Kauz, raphael, hadleybeeman, phila, Olivier, Annette, Erik_Mannens; got it 15:50:08 chunming has joined #dwbp 15:50:55 +RiccardoAlbertoni 15:51:39 zakim, salona has Kirby_Shabaga 15:51:40 +Kirby_Shabaga; got it 15:52:09 zakim, salona has Gary_Driscoll 15:52:09 +Gary_Driscoll; got it 15:52:50 chair: Hadley 15:52:58 Topic: Intro to the day 15:53:29 hadleybeeman: We plan to cover the big picture topic, what is the scope, what do we have the capacity to do etc. 15:53:56 ... More importantly I'm hoping we can get stuff writen down, work through issues, perhaps writing/editing as we go 15:54:15 ... we'll spend this morning reviuewing the requirements in the BP doc 15:54:24 ... ideally ending with a long list of issues in the tracker 15:54:40 ... this PM we'll split into groups and work on the BP doc and the 2 vocabs 15:54:55 ... may come back with specific questions for the group 15:55:02 ... likely to spread into tomorrow morning 15:55:14 ... ideally we want Editor's drafts by end of tomorrow 15:55:22 Vagner_Br has joined #dwbp 15:55:26 ... we need to think about the use cases that we have 15:55:40 ... there seem to be UCs in our heads that need to be in the UCR doc 15:55:59 ... and we want to make the most of having everyoine here. So we need feedback and suggestions for making the best use of the time 15:56:05 scribe: philA 15:56:09 scribeNick:philA 15:56:30 bernadette: Before we split into groups I'd like to talk about the structure of the BP doc 15:56:37 hadleybeeman: OK, but probably tomorrow afternoon 15:57:06 zakim, SalonA has Vagner_Br, Bernadette 15:57:06 +Vagner_Br, Bernadette; got it 15:57:12 zakim, who is here? 15:57:12 On the phone I see SalonA, RiccardoAlbertoni 15:57:13 SalonA has Vagner_Br, Bernadette 15:57:13 On IRC I see Vagner_Br, chunming, Ig_Bittencourt, laufer, jtandy, raphael, gatemezi, Eric_Kauz, JeniT, ericstephan, CarlosIglesias, RRSAgent, Zakim, phila, BartvanLeeuwen, 15:57:13 ... RiccardoAlbertoni, hadleybeeman, trackbot 15:57:44 q+ 15:58:50 BernadetteLoscio has joined #dwbp 15:59:03 olivier has joined #dwbp 15:59:26 AdrianoC has joined #dwbp 15:59:56 Topic: Intros 16:00:30 Tour de Table 16:00:33 Caroline has joined #DWBP 16:00:45 raphael: From EURECOM. Ghislain is one of my colleagues 16:00:56 (Only scribing guests) 16:01:24 chunming: From China Host, observing today but work on data in China 16:01:26 +Caroline_ 16:01:38 s/work on data/work on big data/ 16:01:49 annette_g has joined #dwbp 16:01:49 Gary_Driscoll: Interested in all things data 16:01:58 JeniT: From ODI, co-chair of CSVW 16:02:44 zakim, SalonA has Ken_Laskey 16:02:44 +Ken_Laskey; got it 16:03:01 zakim, salona has Reinaldo 16:03:01 +Reinaldo; got it 16:03:49 hello Carol! 16:04:11 jtandy: I'm jeremy Tandy from the UK Met Office. I'm an observer here but interested in taking down the barriers to others reusing data. Unanticiapted reuse is what we're aiming for 16:04:55 Adrian: From University of ?? where we work on data consumption. We're trying to complement and add value to what we call data enrichment 16:05:09 Adrian from University of Minas Gerais 16:05:14 Olivier: I'm from the BBC 16:05:32 annette_g: I work at the Lawrence Livermore Lab in the super computer centre 16:05:48 Kirby: I'm with Boeing in Seattle 16:06:02 reinaldo: I work in W3C Brasil office, observing today 16:06:18 ErikM: I'm observing today but my team is involved in a lot of groups 16:06:30 Ken: I'm with MITRE Corp 16:06:31 s/Lawrence LIvermore/Lawrence Berkeley 16:06:41 zakim, who is here? 16:06:41 On the phone I see SalonA, RiccardoAlbertoni, Caroline_ 16:06:42 SalonA has Reinaldo 16:06:42 On IRC I see annette_g, Caroline, AdrianoC, olivier, BernadetteLoscio, Vagner_Br, chunming, Ig_Bittencourt, laufer, jtandy, raphael, gatemezi, Eric_Kauz, JeniT, ericstephan, 16:06:42 ... CarlosIglesias, RRSAgent, Zakim, phila, BartvanLeeuwen, RiccardoAlbertoni, hadleybeeman, trackbot 16:06:56 s/Lawrence Livermore/Lawrence Berkeley 16:08:56 hadleybeeman: Explains overall aim of the WG 16:09:30 hadleybeeman: We're not a Linked data WG. We have a broad aim therefore. 2 quite specific vocabs and a general best practices doc 16:09:38 ... the use cases provide the grounding of course 16:09:39 http://www.w3.org/TR/dwbp-ucr/ 16:10:11 kirby has joined #dwbp 16:10:23 s/??/Minas Gerais, Brazil 16:10:28 q- 16:10:29 hadleybeeman: Please keep thinking about use cases that we're missisng 16:10:40 em has joined #dwbp 16:11:04 scribe: hadleybeeman 16:11:34 em has joined #DWBP 16:11:40 -> http://www.w3.org/TR/dwbp-ucr/#requirements-1 Requirements 16:11:46 phila: To make the best use of the time that we have we will skip the use cases and focus on requirements 16:11:55 scribe: ericstephan 16:12:44 phila: It would be really good for people to go thru the use cases and make sure that everything is complete. For the interest of time we will go thru the requirements together. If we are missing a requirement now is the time to add new requirements. 16:13:13 Bernadette: Will we also filter out requirements to determine scope? 16:13:38 phila: We need to bring the use cases to something manageable 16:14:18 The requirements are in different clusters and for most use cases you can follow the links that pertain to the use cases in the document. 16:14:38 fjh has joined #dwbp 16:14:46 zakim, who is here? 16:14:46 On the phone I see SalonA, RiccardoAlbertoni, Caroline_ 16:14:47 SalonA has Reinaldo 16:14:47 On IRC I see fjh, em, kirby, annette_g, Caroline, AdrianoC, olivier, BernadetteLoscio, Vagner_Br, chunming, Ig_Bittencourt, laufer, jtandy, raphael, gatemezi, Eric_Kauz, JeniT, 16:14:47 ... ericstephan, CarlosIglesias, RRSAgent, Zakim, phila, BartvanLeeuwen, RiccardoAlbertoni, hadleybeeman, trackbot 16:14:55 rrsagent, generate minutes 16:14:55 I have made the request to generate http://www.w3.org/2014/10/30-dwbp-minutes.html fjh 16:15:51 Present+ Frederick_Hirsch 16:16:03 +q 16:16:13 Reinaldo has joined #dwbp 16:16:19 KenL has joined #DWBP 16:17:10 phila: All the requirements have been derived from use cases. Some requirements are absolutely basic baby steps. 16:17:34 q+ 16:17:47 q+ to ask about the choice of a *suitable* format 16:17:48 q- 16:17:58 phila: Reviewing over section 4.1.1 requirements in UCR 16:18:04 ack jeni 16:18:04 JeniT, you wanted to ask about the choice of a *suitable* format 16:18:49 Zakim, who is speaking? 16:18:50 JeniT: Is there a requirement for a suitable format? If you are publishing for geographic data then you need a geographic format etc. 16:19:01 Caroline, listening for 10 seconds I could not identify any sounds 16:19:03 JeniT is speaking @Caroline 16:19:08 ? 16:19:11 q? 16:19:17 taisuke_ has joined #dwbp 16:19:23 yaso has joined #dwbp 16:19:26 fjh_ has joined #dwbp 16:19:57 http://www.w3.org/TR/dwbp-ucr/#requirements-1 16:20:20 phila: Yes JeniT we do need to include a requirement for a suitable format 16:20:22 Is the question a single format or unambiguously identifying the format that is being used? Format =s will change and we need to understand how to interpret. 16:21:00 @kenL: Sounds like we need to flesh that out 16:21:05 phila: R-FormatLocalize requirement, different parts of the world write in different formats the local can make a big difference when sharing data 16:21:31 q+ 16:21:48 newton has joined #dwbp 16:21:51 ken: question on locale:: Is it your local or the locale of the data? 16:21:54 ErikM has joined #DWBP 16:22:08 bernadette: is it a requirement for data format or metadata? 16:22:23 q+ 16:22:44 phila: It is a requirement for the information about the data. 16:22:49 jhund has joined #dwbp 16:23:19 q+ 16:23:31 action: phil to add a requirement for a suitable format (as per jenit's suggestion) 16:23:31 Created ACTION-106 - Add a requirement for a suitable format (as per jenit's suggestion) [on Phil Archer - due 2014-11-06]. 16:23:32 phila: The meaning about localize needs to become clearer. 16:23:54 action: phil to clarify RFormatLocalize according to questions in the F2F discussion 16:23:54 Created ACTION-107 - Clarify rformatlocalize according to questions in the f2f discussion [on Phil Archer - due 2014-11-06]. 16:23:59 phila: ....localize and format 16:24:09 ack bern 16:24:20 q- 16:24:28 ack laufer 16:24:36 bart: need to be making issues and actions as we go along. 16:25:09 antoine has joined #dwbp 16:25:28 laufer: There are layers in data and metadata information. Do we need to clarify inheritance when we discuss collections 16:26:20 phila: There is no requirement that covers inheritance, the current requirement for granularity doesn't cover it. 16:26:23 action: phil to amend/expand R-GranularityLevels to cover Laufer's question about inheritance —metadata for the data itself and for the dataset 16:26:23 Created ACTION-108 - Amend/expand r-granularitylevels to cover laufer's question about inheritance —metadata for the data itself and for the dataset [on Phil Archer - due 2014-11-06]. 16:26:31 q? 16:26:38 SumitPurohit has joined #DWBP 16:26:47 Hi Sumit! 16:26:50 q+ to ask whether vocabularies cover code lists 16:26:51 hello Sumit! 16:26:57 Hi Eric 16:26:59 q+ what is a "reference vocabularies"? 16:27:06 Hello Hedley 16:27:11 ack jeni 16:27:11 JeniT, you wanted to ask whether vocabularies cover code lists 16:27:24 q+ to ask what is a "reference vocabularies"? 16:27:30 phila: 4.1.2 discussed data vocabularies section 16:27:43 q+ 16:27:46 + +1.509.372.aaaa 16:27:53 - +1.509.372.aaaa 16:27:57 q- 16:28:10 + +1.509.372.aabb 16:28:45 +1 to JeniT's comment about separating the "vocabulary data model" requirement from the "vocabulary code list" requirement 16:28:51 q+ 16:29:08 jeniT: Is this about the format, we need to publish data that relates to code lists if they are available 16:29:11 Shouldn't any vocabulary be covered and be able to be uniquely identified? 16:29:29 jeniT: It is very much like vocabularies.. 16:30:11 bernadette: I think we had that in mind but maybe more focused on ontology specific vocabularies to supply the meaning. 16:30:32 what about using a vocabulary such as SKOS for publishing code list ? 16:31:07 phila: currently the ucr doesn't include code lists....does the use cases include code lists? This is an issue 16:31:25 laufer: A code list is a foreign key? 16:31:30 issue: phil to look at whether the UCR doc sufficiently covers code lists 16:31:30 Created ISSUE-48 - Phil to look at whether the ucr doc sufficiently covers code lists. Please complete additional details at . 16:31:33 phila: Yes it is, it has to be there. 16:31:49 bart: If you don't have it you don't have a clue what the data means 16:31:55 q? 16:32:00 ack raphael 16:32:00 raphael, you wanted to ask what is a "reference vocabularies"? 16:32:03 Now voice is clear... 16:32:19 raphael: I wonder if there is a definition of a reference vocabulary? 16:32:43 bernadette: We don't have a glossary.... 16:33:29 raphael: a glossary would be helpful. 16:33:37 q+ to ask if it matters what qualified as a vocabulary if way to identify is useful 16:33:42 q+ 16:33:56 q- 16:33:57 phila: There are w3c documents around we need to point to them or expand upon them 16:34:03 issue: Phil to Either improve on the definition of "reference vocabulary", or point to existing definitions 16:34:03 Created ISSUE-49 - Phil to either improve on the definition of "reference vocabulary", or point to existing definitions. Please complete additional details at . 16:34:25 ack ig 16:34:26 q+ 16:34:29 ack ken 16:34:29 KenL, you wanted to ask if it matters what qualified as a vocabulary if way to identify is useful 16:34:43 There are many dimensions that can be use (authority, persistency, popularity, etc.) to decide whether a vocabulary is a reference one or not. Perhaps one could at this stage provides examples of reference vocabularies and others that are not 16:35:01 Ken: What qualifies as a vocabulary? If you can have something that is well documented can't the vocabulary be more fluid? 16:35:23 hadley: Are we talking about the definition of a vocabulary or reference to vocabulary? 16:35:27 wondering whether "vocabulary" is a too semantic web/linked data biassed terms 16:35:32 Ken: I don't know if I care... 16:35:47 we may be talking more generically about "data models" 16:36:05 me Caroline, sue this link http://www.w3.org/2013/dwbp/track/issues/49/edit 16:36:13 s/sue/use 16:36:16 Ken: URIs to identify vocabularies if I think its one and you think its one, that's ok 16:36:24 q+ 16:36:35 ack jtandy 16:37:19 Jtandy: One of the things that inhibits people is knowing which vocabulary to use. Helping people use to start with would be a great outcome and take away the excuse 16:37:34 rhiaro_ has joined #dwbp 16:37:36 q+ to respond re recommending vocabs 16:37:41 Jtandy: establishing a procedure for where to look would be really useful 16:38:07 +q 16:38:11 ack ig 16:38:18 Ig: The reference vocabulary should take into account of ontological commitment. I advocate the use of ontology 16:39:09 q+ 16:39:15 hadley: I agree with Jtandys point but at the same time recommending vocabularies for an infinite number of use cases might be a very hard problem 16:39:16 q+ 16:39:32 Jtandy : There was an attempt in a previous WG on how to find/look for vocabularies. Maybe this link here http://www.w3.org/TR/ld-bp/#VOCABULARIES can be a good starter to look at 16:39:35 q+ jtandy 16:40:13 sumit: I have a suggestion, we should explicitly say what we mean by vocabulary so we will be on the same page. We should take it as an action item to define what we mean 16:40:17 ack sumit 16:40:19 q+ saying terms and definitions was meant as an example and not a firm recommendation. Also, choosing vocabulary can be matter of policy or current practice, and these may change. Identifying vocabulary is important point and is first step to mediating between vocabularies. 16:40:21 ack me 16:40:22 hadleybeeman, you wanted to respond re recommending vocabs 16:40:25 q- 16:40:34 ack laufer 16:40:58 q+ to sayi terms and definitions was meant as an example and not a firm recommendation. Also, choosing vocabulary can be matter of policy or current practice, and these may change. Identifying vocabulary is important point and is first step to mediating between vocabularies. 16:41:18 yanai has joined #dwbp 16:41:23 newton_ has joined #dwbp 16:41:28 laufer: we are talking about suggestions for vocabulary that could be useful. This would be a huge problem, we should restrict the metadata about the collection .... 16:41:51 ack j 16:42:14 q+ 16:42:21 q+ isn't this part of best practices? 16:42:27 antoine has joined #dwbp 16:42:28 q+ 16:42:35 q+ to isn't this part of best practices? 16:42:44 jtandy: responding back to hadley, I wasn't advocating a list of vocabularys that would be obsolete very quickly. I am advocating a way of registering vocabularies to find the things that may or may not be useful to them 16:42:51 q+ 16:43:15 ack ken 16:43:15 KenL, you wanted to sayi terms and definitions was meant as an example and not a firm recommendation. Also, choosing vocabulary can be matter of policy or current practice, and 16:43:18 ... these may change. Identifying vocabulary is important point and is first step to mediating between vocabularies. 16:43:43 q+ 16:43:57 ack bernadette 16:44:02 +[IPcaller] 16:44:07 kenL: I didn't want to get into definitions of vocabularies, this might change from case to case 16:44:15 zakim, IPcaller is me 16:44:15 +antoine; got it 16:44:21 ack yaso 16:44:44 yaso: We should recommend best practices on making vocabularies? 16:44:46 q+ 16:45:00 ack me 16:45:00 ack bart 16:45:01 BartvanLeeuwen, you wanted to isn't this part of best practices? 16:45:34 bart: I am wondering if this is something that goes to vocabularies about best practices to select vocabularies 16:45:34 +1 to bart 16:45:37 ack ig 16:45:57 http://lov.okfn.org/dataset/lov/ 16:46:01 ig: I agree with Ken and Raphael, not to propose a vocabulary but a place for people to find the vocabulary. 16:46:36 Ig: To Yaso are we interested in how to use a vocabulary or create a vocabulary 16:46:43 ack phil 16:46:46 scribe: hadleybeeman 16:47:22 so one piece of metadata for a vocabulary would be the documented formalism in which the vocabulary is expressed. 16:47:24 phila: Thanks to all — I don't disagree with anything I've heard. I've written about how you choose a vocab in W3C namespace and European Commission sites. 16:47:48 ... It does list some vocabularies. Schema.org, Dublin Core, etc. That could be found and incorporated/improved upon. 16:48:00 ... LOV — linked open vocabularies — is a project run by Raphael. 16:48:23 ... If you're thinking of coming up for a term for a bus, it tells you all the schemas and vocabularies that have anything to do with the term "bus". 16:48:35 ... The Research Data Alliance are tryign to build something similar 16:48:49 @hadleybeeman I can take over again 16:48:56 http://lov.okfn.org/dataset/lov/search?q=bus 16:48:56 ... On new vocabularies: It is a different subject, but we have been offered some useful text on that by the Multilingual Web group 16:49:01 scribe: ericstephan 16:49:04 https://rd-alliance.org 16:49:45 q? 16:49:53 @gatemezi ... wow, there are 73 results already for http://lov.okfn.org/dataset/lov/search?q=bus 16:49:59 ack bernadette 16:50:06 phila: To summarize yes we are talking requirements, kens point is really well taken not defining to narrowly, provide guidance and the bp document should provide this 16:50:10 +1 Phil 16:51:03 @jtandy.. yep! More details on the right column: 49 classes, 24 properties... and the domain of the vocabularies ; 16:51:23 bernadette: We are going to have requirements for vocabularys themselves and best practices for vocabularies themselves. We need more than what we currently have to help guide this. 16:51:48 bernadette: If we are going to work on vocabularies we need this tree and more 16:51:57 bernadette: pointing to 4.1.2 16:52:21 issue: Bernadette to help us find more use cases on the vocabulary itself (including creating a vocabulary) 16:52:21 Created ISSUE-50 - Bernadette to help us find more use cases on the vocabulary itself (including creating a vocabulary). Please complete additional details at . 16:52:27 q? 16:52:34 bernadette: when I look at 4.1.2 this is not for people using the vocabularies not the people creating the vocabularies 16:52:57 s/not the people/it is for the people/ 16:53:13 q+ to ask if we should talk about "discovery metadata" 16:53:17 q+ to ask whether there are particular metadata requirements 16:53:24 q+ 16:53:26 phila: 4.1.3 Are there other requirements for metadata? 16:53:28 ack jtandy 16:53:29 jtandy, you wanted to ask if we should talk about "discovery metadata" 16:53:33 q? 16:54:25 jtandy: when I see the word metadata, its so broad in its meaning, do we want to refine the metadata to define as discovery metadata usage metadata (which is much richer) 16:54:44 * +1 for usage metadata 16:54:46 q+ 16:55:02 jtandy: for scoping I recommend focusing on discovery metadata... 16:55:03 ack JeniT 16:55:03 JeniT, you wanted to ask whether there are particular metadata requirements 16:55:15 phila: I agree and I think we will need to have some usage metadata 16:55:29 ack BernadetteLoscio 16:55:41 +SumitPurohit 16:55:45 - +1.509.372.aabb 16:55:51 q+ to say distinguishing discovery metadata vs. use metadata is sometimes a slippery distinction because what I would use as a criteria for discovery could be what you want for use (and other variations) 16:55:57 bernadette: We are thinking about different kinds of metadata 16:56:14 best not to try to silo the metadata 16:56:34 +1 16:56:44 bernadette: We haven't defined this yet we should consider what laufer said about the levels. Some metadata related to collection and some related to the data itself. 16:57:20 jtandy: We've steered away from provenance metadata at this point (csv working group) 16:57:28 q+ 16:57:51 ack laufer 16:58:11 bernadette: The collection could consist of different kinds of data, the metadata can be nonspecific to each particular type of data in the data set. 16:58:19 (or at least we've steered away from making a recommendation about inclusion of provenance metadata at this point in order to keep our scope tight) 16:58:46 q? 16:59:00 q+ 16:59:15 laufer: We could classify the data by the collection or specific data schema. Someone could define a profile .... 16:59:29 q+ to talk about which metadata — and our scope 16:59:51 laufer: I think its a good hint that we don't have to focus on metadata for the schema 16:59:56 q- 16:59:59 q+ to ask how these requirements might actually be tested ... which might help determine if a given requirement should be included in the doc 17:00:08 ack ken 17:00:08 KenL, you wanted to say distinguishing discovery metadata vs. use metadata is sometimes a slippery distinction because what I would use as a criteria for discovery could be what 17:00:10 laufer: data about the data, not to clarify the items of the schema. 17:00:11 q- 17:00:11 ... you want for use (and other variations) 17:00:54 ack hadley 17:00:54 hadleybeeman, you wanted to talk about which metadata — and our scope 17:01:01 ken: If we are taking about different kinds of metadata on discover and usage. If you can keep it as flat as possible that's what I could recommend. 17:01:49 hadley: We could take on describing best practices for metadata and not be particularly useful. What is stopping other people for using my data? 17:02:20 ack jtandy 17:02:20 jtandy, you wanted to ask how these requirements might actually be tested ... which might help determine if a given requirement should be included in the doc 17:02:20 ack j 17:02:25 hadley: If we think about about people not using my metadata because its not tidy? I am interested to hear what you have to say 17:02:54 q+ 17:03:14 jtandy: I agree hadley, you have to ask specific questions, how do you actually test these requirements? How are you going to demonstrate whether they work or not if you are going to put this in an IRC document? 17:03:29 s/IRC document/recommendation/ 17:03:53 -SumitPurohit 17:04:17 phila: Being able to think about this is useful, we do have to think about how the best practices are based on the requirements. How you validate this. 17:04:42 jtandy: How do you validate this because its really hard to test? 17:04:44 Makx has joined #dwbp 17:05:00 q 17:05:06 q? 17:05:23 q+ 17:05:24 ack annette 17:05:32 phila: We've got to be able to narrow down the scope because the current scope is vast. 17:06:26 annette: In the science world usage is a pervasive problem. Unless you can say this column represents this it is meaningless to others. 17:06:35 ack bern 17:06:51 scribe: hadleybeeman 17:07:33 Bernadette: I'm not sure if we need a requirement of how to associate the metadata to the data collection? The collection will be a set of data, a set of files — the metadata will be in another file. We need a requirement to say how we are going to link these things. 17:07:53 phila: It's in the requirements. R-Citable, asking for a persistent and unique identifier. 17:08:02 Hadley: maybe we need to explain it a bit more? 17:08:14 phila: If people are raising questions, we need to clarify it. 17:08:27 jtandy: the metadata needs to cite the data, not the other way around. 17:08:30 q+ 17:08:42 ack erics 17:08:59 ericstephan: Re Validation: Are we allowed to specify technical approaches for best practices? 17:09:00 phila: yes 17:09:24 q+ 17:09:35 ericstephan: We've talked about using JSON, JSON-LD and RDF as examples for metadata. Choosing one or all of them. 17:09:55 (what I meant was that the metadata should cite the data _and_ there should be a way to find the metadata from the data - e.g. like a link header) 17:10:00 q? 17:10:05 ack bernadette 17:10:31 @jtandy : now it's clear enough.. +1 17:10:33 bernadette: for the best practices, we can have more than one implementation for a best practice. The technical approach can expressed be in different implementations 17:10:59 philA: on to Requirements for Licenses. 17:11:28 ... I'm sorry to report that an Eu project we thought might help didn't get funded. But we did say, and the ODI has made plain, that data should be associated with a license. 17:11:38 ...The ODI recommends rights rather than a license. 17:11:42 JeniT: Well, both 17:12:04 Phila: This is more of a commercial angle stuff. What liability do you have as a user, or as a publisher? 17:12:19 +1 ... and if you are publishing data under a free usage license then you should say so - not assume that people will infer that! 17:12:29 ...We don't have the legal expertise to develop this, (what licenses are, or what right statements may be) — but this is explicitly out of scope for the group. 17:12:35 ...We can just say "stick a license on it." 17:12:36 ErikM has joined #DWBP 17:12:37 q+ to say it should also say rights are explicit 17:12:52 ...If the group has the capacity to go further, then we're open to it. 17:13:19 deirdrelee has joined #dwbp 17:13:19 jenit: I think it should also say "information about rights are available", which is a separate thing. For example, the data may have some third party rights restrictions. 17:13:34 ... This should be a separate requirement. Not to specify what that could be, but that it's worth including. 17:13:53 philA: Lee Dodds wants us to do more. 17:14:01 jeniT: I'm sure there is more to do there. 17:14:15 ...Also, why pull out liability terms? There are lots of terms and conditions to put on the use of data. 17:14:36 ...Maybe better to say "Requirements for legal compliance". Info about rights, about licenses, and clear terms and conditions (which may include liability) 17:14:48 philA: I think the liability came from Steve Adler 17:14:54 Steve: I'm not sure 17:15:25 BREAK FOR COFFEE, back in 15 mins 17:16:37 -antoine 17:17:40 em has joined #DWBP 17:24:56 Sorry but I have to leave, Hope you'll continue the good discussion after the coffee.. 17:25:14 -RiccardoAlbertoni 17:28:05 jtandy has joined #dwbp 17:31:13 BernadetteLoscio has joined #dwbp 17:32:08 yaso has joined #dwbp 17:33:23 KenL has joined #DWBP 17:33:33 jtandy_ has joined #dwbp 17:34:32 q+ 17:34:37 PhilA: Provinance 17:34:38 q+ 17:34:41 q- 17:35:00 s/Provinance/Provenance/ 17:35:05 +[IPcaller] 17:35:13 zakim, IPCaller is me 17:35:13 +antoine; got it 17:35:14 Phila: Who created this data? 17:35:14 q+ 17:35:22 q+ 17:35:33 KenL: Who created it or who owns it. 17:35:53 ack fjh 17:35:55 Kenl: this gets into policy. 17:36:07 q+ 17:36:25 fjh: which provinence matters, need a bit more guidance. 17:36:33 q+ 17:36:44 ack jtandy 17:37:19 Jtandy: where did this data come from, do I trust this data, only one facet of provinence. 17:37:24 annette_g has joined #dwbp 17:37:41 jtandy: provinence means all things to all people. ambiguous. 17:37:44 q? 17:38:20 Phila: originating organisation with contact details 17:38:59 q- 17:39:08 Ack ericstephan 17:39:08 ericstephan: should not put this requirement on everybody, originator creator would be sufficient. 17:39:13 Ack laufer 17:39:20 deiu has joined #dwbp 17:40:35 laufer: give the organisation, should be sufficient 17:40:35 Q+ ericstephan 17:40:47 phila has joined #dwbp 17:41:08 q? 17:41:36 ericstephan_ has joined #dwbp 17:41:38 q+ to say suggest we accept Phil's original requirement as placeholder because we can spend days trying to resolve this. Defer until later. 17:41:41 +1 bernadette 17:41:45 BernadetteLoscio: is this just simple metadata about who created the data? 17:41:54 Ack eric 17:41:55 +1 BernadetteLoscio 17:42:01 Ack been 17:42:08 Ack BernadetteLoscio 17:42:09 ericstephan_: we need to be explicit about provinance and what it means. 17:43:13 phila: can we simplifiy this to originating organisation 17:43:16 q+ to ask if we are developing use cases in this discussion 17:43:53 BernadetteLoscio: If we define organisation, we have to define other metadata 17:44:02 q+ 17:44:13 Ack Ken 17:44:13 KenL, you wanted to say suggest we accept Phil's original requirement as placeholder because we can spend days trying to resolve this. Defer until later. 17:44:57 hadleybeeman: talking about different things, origin and creator is a specific use case, needs to be backed up by UC and evidence, 17:45:12 phila has joined #dwbp 17:45:14 q+ 17:45:21 ack hadleybeeman 17:45:21 hadleybeeman, you wanted to ask if we are developing use cases in this discussion 17:45:30 q+ 17:45:34 q+ 17:45:40 hadleybeeman: otherwise have to define for all other metadata, is there another word 17:46:02 q+ 17:46:07 Q? 17:46:20 phila: do we need to change provAvailable 17:47:05 hadleybeeman: make it an issue that word provenance is unclear and needs to be better defined. 17:47:55 q? 17:47:58 issue: Phil to clarify the use of the word "provenance" any potential confusion it causes 17:47:59 Created ISSUE-51 - Phil to clarify the use of the word "provenance" any potential confusion it causes. Please complete additional details at . 17:48:01 hadleybeeman: proposed as an issue, 17:48:15 kirby has joined #dwbp 17:48:24 ericstephan_: need to establish an minimum set of provenance 17:48:42 * +1 to Eric 17:48:50 q? 17:48:51 ericstephan_: provenance vocabulary is highly complex. Need to identify minimum requirements set 17:48:54 ack eric 17:48:56 HZ has joined #DWBP 17:49:04 ack me 17:49:25 q- 17:49:38 +1 to ericstephan_ ... agreed that the provenance requirement should start by indicating a minimal set of requirements 17:49:40 q- 17:49:41 BartvanLeeuwen: are we differing from the process. we are discussing each item over again. 17:49:46 ack laufer 17:50:33 laufer: we are discussing meaning of it, not that we have to give all information. We have an example of people wanting simple, but there are others that are more complex. 17:50:45 agree with laufer ... if people can (& want) to provide complex provenance information they should be able to do so 17:50:54 hadleybeeman: what do we do regarding confusion on terms 17:51:14 I agree jtandy, but I think we need to have a minimal set defined for validation 17:51:19 q+ 17:51:23 issue: lauter to help us think about how to address our confusion of terms. (glossary?) 17:51:23 Created ISSUE-52 - Lauter to help us think about how to address our confusion of terms. (glossary?). Please complete additional details at . 17:51:46 Phila:requirements for industry reuse, goes to motivation of work group, if we are building eco system, we need SLAs. 17:52:10 phila: data should be suitable for industry reuse is vague. 17:52:18 q+ 17:52:19 q+ 17:52:43 phila: service level agreeements are at heart of it. 17:52:44 ack ig 17:53:12 q+ 17:53:18 Ig_Bittencourt: difference regarding reuse, should be data should be available for reuse. not currently good requirement 17:53:19 q+ to suggest changing this from a "should" to a "may" 17:53:22 ack j 17:54:00 q+ to say that there’s a point of publishing data for access rather than reuse 17:54:01 jtandy_: what is criteria for suitable for reuse for an industry., revenue stream should be removed. 17:54:03 ack bart 17:54:04 q+ 17:54:14 q+ to talk about guarantees for availability separate from SLAs 17:54:47 ack laufer 17:54:48 q+ 17:54:54 BartvanLeeuwen: was breakout session on financial benefits, no one is giving out figures on monetary advantages of using open data. 17:55:10 q+ 17:55:20 q+ to say SLA should be replaced with Applicable Policies because agreement is two sided and here we are stating conditions of use by owner/provider 17:55:25 laufer: are we talking about contracts? All of them are requirements. 17:56:07 steve: if there is no service level agreement, companies will not use it. 17:56:09 q+ to say that SLAs should definitely be separate from licence 17:56:53 steve: 90 percent of open data sites do not have an SLA, it is out there but can be removed anytime. 17:57:49 steve: many license agreements have restrictions. Say they have ability to remove the data anytime, potential revenue is a misnomer 17:58:23 ack me 17:58:23 hadleybeeman, you wanted to suggest changing this from a "should" to a "may" 17:58:23 +1 to steve ... the SLA needs to be included as a separate item to indicate a data publisher's commitment to keeping data available or that it will be refreshed on a particular frequency etc. 17:58:45 erikmannens has joined #DWBP 17:59:05 hadleybeeman: there is a question of how these are put in requirements vs. how we are going to discuss it in best practices 17:59:36 ISSUE: Whether SLA is/can be thought of as part of the licence or whether it needs to be pulled out spearately? 17:59:36 Created ISSUE-53 - Whether sla is/can be thought of as part of the licence or whether it needs to be pulled out spearately?. Please complete additional details at . 17:59:44 hadleybeeman: industry is a vague term, 18:00:17 RRSAgent, draft minutes 18:00:17 I have made the request to generate http://www.w3.org/2014/10/30-dwbp-minutes.html phila 18:00:27 hadleybeeman: proposes change section from SLA should be available .. ..... do not want to stop someone from using an SLA 18:00:35 q? 18:00:41 ack JeniT 18:00:41 JeniT, you wanted to say that there’s a point of publishing data for access rather than reuse and to talk about guarantees for availability separate from SLAs and to say that 18:00:44 ... SLAs should definitely be separate from licence 18:01:19 JeniT: plenty of times people are publishing to provide access to data 18:01:53 am happy to concede to JeniT's point :-) 18:02:16 q? 18:02:17 JeniT: distinction regarding api availability, also important for users to have quaranteed availability over a long period of time not just up time. 18:02:19 q+ 18:02:39 Perhaps we need to be clear about what we mean when we say "service level agreement" 18:03:09 q+ 18:03:10 q? 18:03:14 JeniT: api will be available for example 5 years, also SLA should be different than licenses. 18:03:17 ack bernadette 18:03:18 in addition to commitment for availability, an 'SLA' might include the refresh rate for the data 18:03:22 q- 18:03:59 BernadetteLoscio: why should this be different for industry and not for someone else. 18:04:19 issue-53? 18:04:19 issue-53 -- Whether sla is/can be thought of as part of the licence or whether it needs to be pulled out spearately? -- raised 18:04:19 http://www.w3.org/2013/dwbp/track/issues/53 18:04:35 q? 18:04:42 ack me 18:04:52 yanai has joined #dwbp 18:04:53 -antoine 18:05:11 +1 18:05:14 phila: proposal is that industry reuse and potential revenue be deleted. 18:05:27 +1 18:05:34 +1 18:05:38 +1 (observer) 18:05:39 +1 18:05:42 +1 18:05:46 +1 18:05:46 +1 18:05:48 +1 18:05:48 +1 18:05:50 PROPOSED: Delete R-IndustryReuse and R-PotentialRevenue as requirements 18:05:51 +1 18:05:53 +1 18:05:53 +1 18:05:56 +1 18:05:56 i would like the 3rd party reuse 18:05:58 +1 18:06:00 +1 18:06:03 RESOLVED: Delete R-IndustryReuse and R-PotentialRevenue as requirements 18:06:11 ack chunming 18:06:52 erikmannens has joined #DWBP 18:07:13 chunming: sla, static data sets or dynamic data sets. If static, sla is related to trust of data, dynamic data sets there would be other metrics freshness, real time guarantee, 18:07:26 +1 to comment from chunming 18:07:46 chunming: maybe we can find another terminology to use instead of SLA 18:08:07 @Caroline ... Eric has minuted his comment fairly well 18:08:21 phila: static and dynamic data is coming up. Timeliness and quality are being covered. 18:08:35 q? 18:08:40 ack ken 18:08:40 KenL, you wanted to say SLA should be replaced with Applicable Policies because agreement is two sided and here we are stating conditions of use by owner/provider 18:09:35 @Caroline ... happy to help :-) 18:09:36 KenL: what is SLA is not defined in day job, we are talking about conditions of use, describing what you are getting. SLA is wrong term.