See also: IRC log
<newton_> Ok
<yaso> We should work here https://www.w3.org/2013/dwbp/wiki/Data_usage_notes
<ericstephan> Hello!
<ericstephan> My mic is not working but i can listen
<ericstephan> Good morning everyone
<ericstephan> Data set selection might mean many things to many groups....
<ericstephan> Yes I can here a bit :-)_
<ericstephan> Knowing data is credible or trustworthy would be extremely important
<yaso> Maybe we can have a feedback on the data to reinforce the quality of the data
<ericstephan> Yes very good especially feedback from respected experts
<ericstephan> A leading researcher feedback versus a non-expert
<ericstephan> Would this help? https://www.force11.org/node/4770
<ericstephan> So citations, scholarly value, might be useful?
<adrianov_> Eric: suggests https://www.force11.org/node/4770
<BernadetteLoscio_> Hi Eric!
<ericstephan> Hello Bernadette!
<ericstephan> Isn't one simple use case of API and data selection the use mime type?
<yaso> https://www.w3.org/2013/dwbp/wiki/Data_usage_notes
<yaso> I’m not feeling good :/
<ericstephan> I use the msm http://iserve.kmi.open.ac.uk/wiki/IServe_vocabulary to describe services
<BernadetteLoscio_> really interesting
<ericstephan> For some datasets (terascale and petascale) and in science it is more advantageous to move the API to the data rather than having the API operate on the data. From a data usage perspective I might want to know what API I could operate on data types I was already aware.
<BernadetteLoscio_> im not sure if i understand :(
<yaso> It’s a problem of performance, Eric?
<ericstephan> Yes
<ericstephan> That is correct
<ericstephan> It would also be true of streaming data
<ericstephan> Streaming data as opposed to a fixed dataset
<newton_> Streaming data is about real-time data, you mean?
<ericstephan> Yes
<ericstephan> That is also good providing information about the organization or individual using the data
<ericstephan> Data provenance is very important, it might also be important to describe what doesn't work with the dataset
<ericstephan> Who is publishing and who is using the data, how it is being used, and what APIs are capable of using the data?
<ericstephan> Yes As Phil was talking about yesterday how data is associated with other data?
<ericstephan> PDF file was read and a table was generated from that. How do we describe that association ?
<ericstephan> sooo...describing dataset from a Data governance, ownership, stewardship, access from a community perspective? Interesting
<yaso> yesyou can also describe the associations of the datasets you’re using
<ericstephan> Its almost like a differentiating data in the wild from "formalized" datasets?
<yaso> yes
<ericstephan> A use case might be relying on Google Maps for some data but adding my own Point of Interest mapping points to a map. You could rely on Google Maps but maybe not my POI data
<yaso> Yes, this is a good use case. Just like about drugs: if I have a index of drugs and take a dataset of FDA (if it was open) and then I add my impressions about each drug, to share
<yaso> (I saw something like this in Brazil)
<ericstephan> Yes great example
<ericstephan> Depending on your perspective the "expertise" could be relative. What the FDA says versus personal experience
<yaso> Yes!
<yaso> I add this 2 cases on the wiki https://www.w3.org/2013/dwbp/wiki/Data_usage_notes
<yaso> Now I’m gonna take some coffee
<ericstephan> Great!
<ericstephan> Okay me too
<ericstephan> Are we going back to the main group?
<HadleyBeeman> Not sure yet, eric. We should be starting up again in a minute or two
<HadleyBeeman> ^ ericstephan
<ericstephan> Okay thank you Hadley
<newton_> Eric, you left the hangout?
<ericstephan> It left me :-)
<newton_> Ok
<newton_> We can start a new
<ericstephan> Okay I am back on.
<ericstephan> Sorry no video from my side, but it is still dark and my picture looks creepy working by the light of the monitor :-)
<newton_> It is ok
<newton_> Yaso, Berna and Adriano are coming
<ericstephan> Great
<ericstephan> http://lov.okfn.org/dataset/lov/details/vocabulary_voaf.html
<ericstephan> visual analytics might be another example
<adrianov_> scriber adrianov
<newton_> https://www.w3.org/2013/dwbp/wiki/Data_usage_notes#Challenges_areas
<newton_> You can edit also
<newton_> Eric
<adrianov_> ericstephan: vocab should enable privacy config
<adrianov_> newton_: an important point (related to revenue) is how to value the data
<adrianov_> BernadetteLoscio_: the vocab should reflect the process of charging
<adrianov_> BernadetteLoscio_: discussion on wether or not SLAs are on the scope of data usage
<ericstephan> Maybe the providers of data need to know the fee for serving their data
<ericstephan> If it is served for instance on AWS
<BernadetteLoscio_> https://www.w3.org/2013/dwbp/wiki/Challenges
<adrianov_> BernadetteLoscio_: discussing about the scope: privacy, revenue, traceability and gathering feedback
<fkyanai> Hi !
<fkyanai> Eric, are you online ?
<fkyanai> The new link to the hangout
<fkyanai> https://plus.google.com/hangouts/_/7ecpi3vvm7mu0rjl23e6mgdgls
<newton> Hi Eric
<adrianov> ALL: scope includes traceability, gathering feedback, and other aspects, namely privacy and revenue
<adrianov> scriber: adrianov
<adrianov> BernadetteLoscio: other aspects also include provenance
<newton> scribe: adrianov
<ericstephan> Hello is everyone coming back? I'll get back on line
BernadetteLoscio: our focus is on who is using the data
<newton> BernadetteLoscio: we are going to organize all items collected in the first brainstorm
<newton> ... and classify them into categories: Traceability, Feedback, Other aspects (including data provenance, revenue and privacy).
<ericstephan> I think there are many aspects of the provenance vocabulary we could borrow or use as a basis. The difference being the PROV is describing what happened Data Usage describes what is possible.
<newton> This is the link of Data usage notes
<newton> https://www.w3.org/2013/dwbp/wiki/Data_usage_notes
general challenges https://www.w3.org/2013/dwbp/wiki/Challenges
<ericstephan> To Me the points in Dataset selection/Processing/usability can be organized under: Who What When Why How
<ericstephan> I think of provenance as just "Data Usage History" from our perspective
<ericstephan> Does this make sense?
<ericstephan> I have a dataset A, here is how it was used, who used it, and here is how they used it. This is the data usage history...
makes sense to me
<ericstephan> Its past (PROV) and present/future (Data Usage) tense use of data.
<ericstephan> It is very complementary to provenance
<ericstephan> Yes
<ericstephan> I've heard it called predictive provenance, now we call it data usage
<ericstephan> yes!
<yaso> scribe: newton
<yaso> scribe: yaso
<ericstephan> I added a few points to our wiki page
<newton> BernadetteLoscio: the point now is "how can we, as consumer of data, give a feedback about the dataset"
<ericstephan> There might be different kinds of feedback, blogging versus following a protocol?
<BernadetteLoscio> yes...
<BernadetteLoscio> we can describe data feedback on a machine readable format?
<ericstephan> Yes!
<BernadetteLoscio> :)
<newton> What do you suggest to do that?
<ericstephan> In some cases the machines will be giving the feedback
<BernadetteLoscio> do you know if there is a vocab for this?
<ericstephan> Not off hand
<ericstephan> Its a great question
<ericstephan> It almost follows under document transclusion
<BernadetteLoscio> do you think that this should include in the data usage vocabulary?
<ericstephan> I wonder if something is available under BFO
<BernadetteLoscio> BFO is a vocab?
<ericstephan> Basic Foundation Ontology Its something that came out of the BIomedical community to manage research data
<BernadetteLoscio> yeah... maybe... i dont know this
<ericstephan> OBI and AIO use BFO....
<ericstephan> I know a little about it, OBI is used to describe how data is processed or used and I'm not sure if it handles feedback but I can check just a sec....
we’re gonna have more coffee
:-)
<ericstephan> sounds good
<BernadetteLoscio> hi Eric!
<BernadetteLoscio> we're back!
<ericstephan> Hi!
http://srvisis01.univ-mlv.fr/selfMed/molecules.php
<BernadetteLoscio> http://www.w3.org/TR/skos-ucr/#Requirements
<ericstephan> Example way of leveraging PROV as a base line Instead of prov:wasGeneratedBy we use duv:Generates
<newton> Hi
<newton> There is someone here?
<newton> https://www.w3.org/2013/dwbp/wiki/Data_usage_notes
<BernadetteLoscio> eric?
<ericstephan> Hi is everyone back?
This is scribe.perl Revision: 1.138 of Date: 2013-04-25 13:59:11 Check for newer version at http://dev.w3.org/cvsweb/~checkout~/2002/scribe/ Guessing input format: RRSAgent_Text_Format (score 1.00) Succeeded: s/project/data/ Found Scribe: adrianov Inferring ScribeNick: adrianov Found Scribe: newton Found Scribe: yaso Inferring ScribeNick: yaso Scribes: adrianov, newton, yaso ScribeNicks: adrianov, yaso WARNING: No "Topic:" lines found. WARNING: No "Present: ... " found! Possibly Present: ALL BernadetteLoscio BernadetteLoscio_ Eric HadleyBeeman adrianov adrianov_ dwbp-DataUsage ericstephan fkyanai https joined left newton newton_ scriber yaso You can indicate people for the Present list like this: <dbooth> Present: dbooth jonathan mary <dbooth> Present+ amy WARNING: No meeting title found! You should specify the meeting title like this: <dbooth> Meeting: Weekly Baking Club Meeting WARNING: No meeting chair found! You should specify the meeting chair like this: <dbooth> Chair: dbooth Got date from IRC log name: 01 Apr 2014 Guessing minutes URL: http://www.w3.org/2014/04/01-dwbp-DataUsage-minutes.html People with action items: WARNING: Input appears to use implicit continuation lines. You may need the "-implicitContinuations" option. WARNING: No "Topic: ..." lines found! Resulting HTML may have an empty (invalid) <ol>...</ol>. Explanation: "Topic: ..." lines are used to indicate the start of new discussion topics or agenda items, such as: <dbooth> Topic: Review of Amy's report[End of scribe.perl diagnostic output]