09:21:43 RRSAgent has joined #dwbp-DataUsage 09:21:43 logging to http://www.w3.org/2014/04/01-dwbp-DataUsage-irc 09:21:44 Ok 09:23:45 We should work here https://www.w3.org/2013/dwbp/wiki/Data_usage_notes 09:24:39 Zakim has joined #dwbp-DataUsage 09:26:44 Hello! 09:26:55 My mic is not working but i can listen 09:27:11 Good morning everyone 09:34:20 Data set selection might mean many things to many groups.... 09:39:50 Yes I can here a bit :-)_ 09:41:22 Knowing data is credible or trustworthy would be extremely important 09:42:04 Maybe we can have a feedback on the data to reinforce the quality of the data 09:42:40 Yes very good especially feedback from respected experts 09:43:07 A leading researcher feedback versus a non-expert 09:45:05 Would this help? https://www.force11.org/node/4770 09:47:00 So citations, scholarly value, might be useful? 09:50:32 BernadetteLoscio_ has joined #dwbp-DataUsage 09:51:09 Eric: suggests https://www.force11.org/node/4770 09:51:36 Hi Eric! 09:52:00 Hello Bernadette! 09:54:46 Isn't one simple use case of API and data selection the use mime type? 10:02:45 https://www.w3.org/2013/dwbp/wiki/Data_usage_notes 10:05:49 I’m not feeling good :/ 10:06:07 I use the msm http://iserve.kmi.open.ac.uk/wiki/IServe_vocabulary to describe services 10:09:44 really interesting 10:09:49 For some datasets (terascale and petascale) and in science it is more advantageous to move the API to the data rather than having the API operate on the data. From a data usage perspective I might want to know what API I could operate on data types I was already aware. 10:10:38 im not sure if i understand :( 10:12:20 It’s a problem of performance, Eric? 10:13:49 Yes 10:13:53 That is correct 10:14:03 It would also be true of streaming data 10:14:42 Streaming data as opposed to a fixed dataset 10:16:20 Streaming data is about real-time data, you mean? 10:18:00 Yes 10:20:20 That is also good providing information about the organization or individual using the project 10:20:28 s/project/data/ 10:21:14 Data provenance is very important, it might also be important to describe what doesn't work with the dataset 10:24:48 Who is publishing and who is using the data, how it is being used, and what APIs are capable of using the data? 10:28:35 Yes As Phil was talking about yesterday how data is associated with other data? 10:29:07 PDF file was read and a table was generated from that. How do we describe that association ? 10:32:41 rrsagent, make logs public 10:35:39 sooo...describing dataset from a Data governance, ownership, stewardship, access from a community perspective? Interesting 10:36:29 yesyou can also describe the associations of the datasets you’re using 10:36:59 Its almost like a differentiating data in the wild from "formalized" datasets? 10:37:28 yes 10:38:06 A use case might be relying on Google Maps for some data but adding my own Point of Interest mapping points to a map. You could rely on Google Maps but maybe not my POI data 10:39:48 Yes, this is a good use case. Just like about drugs: if I have a index of drugs and take a dataset of FDA (if it was open) and then I add my impressions about each drug, to share 10:39:59 (I saw something like this in Brazil) 10:40:00 Yes great example 10:41:05 Depending on your perspective the "expertise" could be relative. What the FDA says versus personal experience 10:41:20 Yes! 10:41:32 I add this 2 cases on the wiki https://www.w3.org/2013/dwbp/wiki/Data_usage_notes 10:41:42 Now I’m gonna take some coffee 10:41:44 Great! 10:41:47 Okay me too 10:59:14 Are we going back to the main group? 11:01:33 Not sure yet, eric. We should be starting up again in a minute or two 11:01:39 ^ ericstephan 11:01:57 Okay thank you Hadley 11:06:08 Eric, you left the hangout? 11:06:30 fkyanai has joined #dwbp-DataUsage 11:06:36 It left me :-) 11:06:45 Ok 11:06:48 We can start a new 11:07:49 Okay I am back on. 11:08:16 Sorry no video from my side, but it is still dark and my picture looks creepy working by the light of the monitor :-) 11:08:58 It is ok 11:09:11 Yaso, Berna and Adriano are coming 11:09:23 Great 11:11:24 yaso has joined #dwbp-datausage 11:19:51 http://lov.okfn.org/dataset/lov/details/vocabulary_voaf.html 11:27:37 visual analytics might be another example 11:44:05 scriber adrianov 11:44:30 https://www.w3.org/2013/dwbp/wiki/Data_usage_notes#Challenges_areas 11:44:35 You can edit also 11:44:40 Eric 11:45:34 ericstephan: vocab should enable privacy config 11:46:41 newton_: an important point (related to revenue) is how to value the data 11:47:36 BernadetteLoscio_: the vocab should reflect the process of charging 11:49:22 BernadetteLoscio_: discussion on wether or not SLAs are on the scope of data usage 11:51:22 Maybe the providers of data need to know the fee for serving their data 11:51:36 If it is served for instance on AWS 11:53:45 Zakim has left #dwbp-DataUsage 11:54:39 https://www.w3.org/2013/dwbp/wiki/Challenges 11:55:09 rrsagent, make logs public 12:03:58 BernadetteLoscio_: discussing about the scope: privacy, revenue, traceability and gathering feedback 13:04:04 HadleyBeeman has joined #dwbp-DataUsage 13:11:59 rrsagent, draft minutes 13:11:59 I have made the request to generate http://www.w3.org/2014/04/01-dwbp-DataUsage-minutes.html HadleyBeeman 13:18:09 fkyanai has joined #dwbp-DataUsage 13:18:58 adrianov has joined #dwbp-DataUsage 13:19:19 fkyanai has left #dwbp-DataUsage 13:19:22 fkyanai has joined #dwbp-DataUsage 13:19:31 Hi ! 13:19:37 Eric, are you online ? 13:19:45 The new link to the hangout 13:19:46 https://plus.google.com/hangouts/_/7ecpi3vvm7mu0rjl23e6mgdgls 13:21:08 newton has joined #dwbp-DataUsage 13:22:00 Hi Eric 13:23:17 ALL: scope includes traceability, gathering feedback, and other aspects, namely privacy and revenue 13:23:42 scriber: adrianov 13:24:55 BernadetteLoscio has joined #dwbp-DataUsage 13:25:00 BernadetteLoscio: other aspects also include provenance 13:25:22 scribe: adrianov 13:25:56 Hello is everyone coming back? I'll get back on line 13:26:08 BernadetteLoscio: our focus is on who is using the data 13:31:51 BernadetteLoscio: we are going to organize all items collected in the first brainstorm 13:35:11 ... and classify them into categories: Traceability, Feedback, Other aspects (including data provenance, revenue and privacy). 13:43:12 I think there are many aspects of the provenance vocabulary we could borrow or use as a basis. The difference being the PROV is describing what happened Data Usage describes what is possible. 13:46:45 This is the link of Data usage notes 13:46:46 https://www.w3.org/2013/dwbp/wiki/Data_usage_notes 13:50:15 general challenges https://www.w3.org/2013/dwbp/wiki/Challenges 13:50:26 To Me the points in Dataset selection/Processing/usability can be organized under: Who What When Why How 14:04:46 I think of provenance as just "Data Usage History" from our perspective 14:05:04 Does this make sense? 14:06:12 I have a dataset A, here is how it was used, who used it, and here is how they used it. This is the data usage history... 14:06:15 yaso has joined #dwbp-datausage 14:06:33 makes sense to me 14:08:03 Its past (PROV) and present/future (Data Usage) tense use of data. 14:08:56 It is very complementary to provenance 14:08:58 Yes 14:09:38 rrsagent, draft minutes 14:09:38 I have made the request to generate http://www.w3.org/2014/04/01-dwbp-DataUsage-minutes.html HadleyBeeman 14:09:53 I've heard it called predictive provenance, now we call it data usage 14:09:55 yes! 14:10:47 scribe: newton 14:11:32 scribe: yaso 14:14:24 I added a few points to our wiki page 14:19:20 BernadetteLoscio: the point now is "how can we, as consumer of data, give a feedback about the dataset" 14:19:50 There might be different kinds of feedback, blogging versus following a protocol? 14:20:03 yes... 14:20:33 we can describe data feedback on a machine readable format? 14:20:46 Yes! 14:20:49 :) 14:20:58 What do you suggest to do that? 14:20:59 In some cases the machines will be giving the feedback 14:21:02 do you know if there is a vocab for this? 14:21:11 Not off hand 14:21:19 Its a great question 14:21:30 It almost follows under document transclusion 14:22:24 do you think that this should include in the data usage vocabulary? 14:22:29 I wonder if something is available under BFO 14:23:12 BFO is a vocab? 14:23:32 Basic Foundation Ontology Its something that came out of the BIomedical community to manage research data 14:23:55 yeah... maybe... i dont know this 14:23:57 OBI and AIO use BFO.... 14:24:41 I know a little about it, OBI is used to describe how data is processed or used and I'm not sure if it handles feedback but I can check just a sec.... 14:25:12 we’re gonna have more coffee 14:25:13 :-) 14:25:15 sounds good 14:42:37 newton has joined #dwbp-DataUsage 14:43:33 yaso has joined #dwbp-datausage 14:45:18 hi Eric! 14:45:21 we're back! 14:47:54 Hi! 14:57:08 http://srvisis01.univ-mlv.fr/selfMed/molecules.php 15:00:29 http://www.w3.org/TR/skos-ucr/#Requirements 15:08:21 Example way of leveraging PROV as a base line Instead of prov:wasGeneratedBy we use duv:Generates 15:16:12 HadleyBeeman has joined #dwbp-DataUsage 15:20:51 Hi 15:20:59 There is someone here? 15:21:20 https://www.w3.org/2013/dwbp/wiki/Data_usage_notes 15:21:49 fkyanai has left #dwbp-DataUsage 15:22:20 rrsagent, prepare minutes 15:22:20 I'm logging. I don't understand 'prepare minutes', HadleyBeeman. Try /msg RRSAgent help 15:22:46 rrsagent, draft minutes 15:22:46 I have made the request to generate http://www.w3.org/2014/04/01-dwbp-DataUsage-minutes.html HadleyBeeman 15:23:56 yaso has joined #dwbp-datausage 15:28:18 RRSAgent, draft minutes 15:28:18 I have made the request to generate http://www.w3.org/2014/04/01-dwbp-DataUsage-minutes.html yaso 15:28:35 eric? 15:36:03 Hi is everyone back? 15:50:23 yaso has joined #dwbp-datausage 15:51:35 yaso has left #dwbp-DataUsage 16:18:08 rrsagent, draft minutes 16:18:08 I have made the request to generate http://www.w3.org/2014/04/01-dwbp-DataUsage-minutes.html HadleyBeeman 16:27:59 newton has joined #dwbp-DataUsage