W3C

- DRAFT -

SV_MEETING_TITLE

01 Apr 2014

See also: IRC log

Attendees

Present
Regrets
Chair
SV_MEETING_CHAIR
Scribe
adrianov, newton, yaso

Contents


<newton_> Ok

<yaso> We should work here https://www.w3.org/2013/dwbp/wiki/Data_usage_notes

<ericstephan> Hello!

<ericstephan> My mic is not working but i can listen

<ericstephan> Good morning everyone

<ericstephan> Data set selection might mean many things to many groups....

<ericstephan> Yes I can here a bit :-)_

<ericstephan> Knowing data is credible or trustworthy would be extremely important

<yaso> Maybe we can have a feedback on the data to reinforce the quality of the data

<ericstephan> Yes very good especially feedback from respected experts

<ericstephan> A leading researcher feedback versus a non-expert

<ericstephan> Would this help? https://www.force11.org/node/4770

<ericstephan> So citations, scholarly value, might be useful?

<adrianov_> Eric: suggests https://www.force11.org/node/4770

<BernadetteLoscio_> Hi Eric!

<ericstephan> Hello Bernadette!

<ericstephan> Isn't one simple use case of API and data selection the use mime type?

<yaso> https://www.w3.org/2013/dwbp/wiki/Data_usage_notes

<yaso> I’m not feeling good :/

<ericstephan> I use the msm http://iserve.kmi.open.ac.uk/wiki/IServe_vocabulary to describe services

<BernadetteLoscio_> really interesting

<ericstephan> For some datasets (terascale and petascale) and in science it is more advantageous to move the API to the data rather than having the API operate on the data. From a data usage perspective I might want to know what API I could operate on data types I was already aware.

<BernadetteLoscio_> im not sure if i understand :(

<yaso> It’s a problem of performance, Eric?

<ericstephan> Yes

<ericstephan> That is correct

<ericstephan> It would also be true of streaming data

<ericstephan> Streaming data as opposed to a fixed dataset

<newton_> Streaming data is about real-time data, you mean?

<ericstephan> Yes

<ericstephan> That is also good providing information about the organization or individual using the data

<ericstephan> Data provenance is very important, it might also be important to describe what doesn't work with the dataset

<ericstephan> Who is publishing and who is using the data, how it is being used, and what APIs are capable of using the data?

<ericstephan> Yes As Phil was talking about yesterday how data is associated with other data?

<ericstephan> PDF file was read and a table was generated from that. How do we describe that association ?

<ericstephan> sooo...describing dataset from a Data governance, ownership, stewardship, access from a community perspective? Interesting

<yaso> yesyou can also describe the associations of the datasets you’re using

<ericstephan> Its almost like a differentiating data in the wild from "formalized" datasets?

<yaso> yes

<ericstephan> A use case might be relying on Google Maps for some data but adding my own Point of Interest mapping points to a map. You could rely on Google Maps but maybe not my POI data

<yaso> Yes, this is a good use case. Just like about drugs: if I have a index of drugs and take a dataset of FDA (if it was open) and then I add my impressions about each drug, to share

<yaso> (I saw something like this in Brazil)

<ericstephan> Yes great example

<ericstephan> Depending on your perspective the "expertise" could be relative. What the FDA says versus personal experience

<yaso> Yes!

<yaso> I add this 2 cases on the wiki https://www.w3.org/2013/dwbp/wiki/Data_usage_notes

<yaso> Now I’m gonna take some coffee

<ericstephan> Great!

<ericstephan> Okay me too

<ericstephan> Are we going back to the main group?

<HadleyBeeman> Not sure yet, eric. We should be starting up again in a minute or two

<HadleyBeeman> ^ ericstephan

<ericstephan> Okay thank you Hadley

<newton_> Eric, you left the hangout?

<ericstephan> It left me :-)

<newton_> Ok

<newton_> We can start a new

<ericstephan> Okay I am back on.

<ericstephan> Sorry no video from my side, but it is still dark and my picture looks creepy working by the light of the monitor :-)

<newton_> It is ok

<newton_> Yaso, Berna and Adriano are coming

<ericstephan> Great

<ericstephan> http://lov.okfn.org/dataset/lov/details/vocabulary_voaf.html

<ericstephan> visual analytics might be another example

<adrianov_> scriber adrianov

<newton_> https://www.w3.org/2013/dwbp/wiki/Data_usage_notes#Challenges_areas

<newton_> You can edit also

<newton_> Eric

<adrianov_> ericstephan: vocab should enable privacy config

<adrianov_> newton_: an important point (related to revenue) is how to value the data

<adrianov_> BernadetteLoscio_: the vocab should reflect the process of charging

<adrianov_> BernadetteLoscio_: discussion on wether or not SLAs are on the scope of data usage

<ericstephan> Maybe the providers of data need to know the fee for serving their data

<ericstephan> If it is served for instance on AWS

<BernadetteLoscio_> https://www.w3.org/2013/dwbp/wiki/Challenges

<adrianov_> BernadetteLoscio_: discussing about the scope: privacy, revenue, traceability and gathering feedback

<fkyanai> Hi !

<fkyanai> Eric, are you online ?

<fkyanai> The new link to the hangout

<fkyanai> https://plus.google.com/hangouts/_/7ecpi3vvm7mu0rjl23e6mgdgls

<newton> Hi Eric

<adrianov> ALL: scope includes traceability, gathering feedback, and other aspects, namely privacy and revenue

<adrianov> scriber: adrianov

<adrianov> BernadetteLoscio: other aspects also include provenance

<newton> scribe: adrianov

<ericstephan> Hello is everyone coming back? I'll get back on line

BernadetteLoscio: our focus is on who is using the data

<newton> BernadetteLoscio: we are going to organize all items collected in the first brainstorm

<newton> ... and classify them into categories: Traceability, Feedback, Other aspects (including data provenance, revenue and privacy).

<ericstephan> I think there are many aspects of the provenance vocabulary we could borrow or use as a basis. The difference being the PROV is describing what happened Data Usage describes what is possible.

<newton> This is the link of Data usage notes

<newton> https://www.w3.org/2013/dwbp/wiki/Data_usage_notes

general challenges https://www.w3.org/2013/dwbp/wiki/Challenges

<ericstephan> To Me the points in Dataset selection/Processing/usability can be organized under: Who What When Why How

<ericstephan> I think of provenance as just "Data Usage History" from our perspective

<ericstephan> Does this make sense?

<ericstephan> I have a dataset A, here is how it was used, who used it, and here is how they used it. This is the data usage history...

makes sense to me

<ericstephan> Its past (PROV) and present/future (Data Usage) tense use of data.

<ericstephan> It is very complementary to provenance

<ericstephan> Yes

<ericstephan> I've heard it called predictive provenance, now we call it data usage

<ericstephan> yes!

<yaso> scribe: newton

<yaso> scribe: yaso

<ericstephan> I added a few points to our wiki page

<newton> BernadetteLoscio: the point now is "how can we, as consumer of data, give a feedback about the dataset"

<ericstephan> There might be different kinds of feedback, blogging versus following a protocol?

<BernadetteLoscio> yes...

<BernadetteLoscio> we can describe data feedback on a machine readable format?

<ericstephan> Yes!

<BernadetteLoscio> :)

<newton> What do you suggest to do that?

<ericstephan> In some cases the machines will be giving the feedback

<BernadetteLoscio> do you know if there is a vocab for this?

<ericstephan> Not off hand

<ericstephan> Its a great question

<ericstephan> It almost follows under document transclusion

<BernadetteLoscio> do you think that this should include in the data usage vocabulary?

<ericstephan> I wonder if something is available under BFO

<BernadetteLoscio> BFO is a vocab?

<ericstephan> Basic Foundation Ontology Its something that came out of the BIomedical community to manage research data

<BernadetteLoscio> yeah... maybe... i dont know this

<ericstephan> OBI and AIO use BFO....

<ericstephan> I know a little about it, OBI is used to describe how data is processed or used and I'm not sure if it handles feedback but I can check just a sec....

we’re gonna have more coffee

:-)

<ericstephan> sounds good

<BernadetteLoscio> hi Eric!

<BernadetteLoscio> we're back!

<ericstephan> Hi!

http://srvisis01.univ-mlv.fr/selfMed/molecules.php

<BernadetteLoscio> http://www.w3.org/TR/skos-ucr/#Requirements

<ericstephan> Example way of leveraging PROV as a base line Instead of prov:wasGeneratedBy we use duv:Generates

<newton> Hi

<newton> There is someone here?

<newton> https://www.w3.org/2013/dwbp/wiki/Data_usage_notes

<BernadetteLoscio> eric?

<ericstephan> Hi is everyone back?

Summary of Action Items

[End of minutes]

Minutes formatted by David Booth's scribe.perl version 1.138 (CVS log)
$Date: 2014/04/01 16:18:14 $

Scribe.perl diagnostic output

[Delete this section before finalizing the minutes.]
This is scribe.perl Revision: 1.138  of Date: 2013-04-25 13:59:11  
Check for newer version at http://dev.w3.org/cvsweb/~checkout~/2002/scribe/

Guessing input format: RRSAgent_Text_Format (score 1.00)

Succeeded: s/project/data/
Found Scribe: adrianov
Inferring ScribeNick: adrianov
Found Scribe: newton
Found Scribe: yaso
Inferring ScribeNick: yaso
Scribes: adrianov, newton, yaso
ScribeNicks: adrianov, yaso

WARNING: No "Topic:" lines found.


WARNING: No "Present: ... " found!
Possibly Present: ALL BernadetteLoscio BernadetteLoscio_ Eric HadleyBeeman adrianov adrianov_ dwbp-DataUsage ericstephan fkyanai https joined left newton newton_ scriber yaso
You can indicate people for the Present list like this:
        <dbooth> Present: dbooth jonathan mary
        <dbooth> Present+ amy


WARNING: No meeting title found!
You should specify the meeting title like this:
<dbooth> Meeting: Weekly Baking Club Meeting


WARNING: No meeting chair found!
You should specify the meeting chair like this:
<dbooth> Chair: dbooth

Got date from IRC log name: 01 Apr 2014
Guessing minutes URL: http://www.w3.org/2014/04/01-dwbp-DataUsage-minutes.html
People with action items: 

WARNING: Input appears to use implicit continuation lines.
You may need the "-implicitContinuations" option.


WARNING: No "Topic: ..." lines found!  
Resulting HTML may have an empty (invalid) <ol>...</ol>.

Explanation: "Topic: ..." lines are used to indicate the start of 
new discussion topics or agenda items, such as:
<dbooth> Topic: Review of Amy's report


[End of scribe.perl diagnostic output]