W3C HCLS – Linked Life Data

04 Jun 2012

See also: IRC log


+1.206.732.aaaa, Scott_Marshall, +1.631.444.aabb, +681aacc, michel
M. Scott Marshall


<mscottm> scribenick: Janos

<mscottm> https://docs.google.com/document/d/1VrBWM133Pxxqe5jbjgg2IPiwv5JaHo98DDJdqOT1yh0/edit

<michel> discussion from a month ago

<michel> mscottm: graph metrics in an rdf graph

<mscottm> http://dl.dropbox.com/u/21690634/Quantifying%20RDF%20data%20sets.pdf

mtscottm: talk about making RDF metrics available. Aspects of data which are needed for consumption. Biohackathon. Pharma wanted to know license; last time updated; language labels ..

;faceted browsing. How do we handle 30 different drugbank datasets available at central site it seems for data market ..

choice should be encoded in SPARQL query. In talk by janos: counts, and ratios. Companies called in are interested in metrics. Graph theoritic measures; other measures.

mscottm: Graph diameter based on a particular predicate based on transitive closure. How deep hierarchy goes.
... asking about federation across linked data health sciences; Talked with EricP who is working on HL7 documents ..

Difficult to talk about all health care data would be better to scope.

<michel> I just compiled a list of dataset-related vocabularies: http://www.w3.org/wiki/HCLSIG/LLD/DatasetDescription

janos: Overview of schema.org extension HCLS talk with Aaron Brown of google for better web searching.

mscottm: Schema.org discussion could benefit from good relations ontology.

janos: we need to work together with HCLS and schema.org for feedback

mscottm: going back to graph metrics. Expensive to issue SPARQL query to count.

<michel> work that we did in the japan biohackathon: https://docs.google.com/spreadsheet/ccc?key=0AvCayBYdTclldEpBSS1wRXNEaU9OeHdWcGRwc09mSmc


<mscottm> Janos: HDT = header dictionary triple representation of RDF

<michel> hgt w3c submission - http://www.w3.org/Submission/2011/03/

<michel> here's the new vocab introduced by hdt - http://www.w3.org/Submission/2011/SUBM-HDT-RDFS-20110330/

A video of the tool http://www.youtube.com/watch?v=HMPkc725sMY

mscottm: Switiching gears to discussion of working group hcls on note. Practice notes more specific details to be added.
... our group has potential to trail blaze in how we document a published a linked open data, VOID, etc, using the graph uri
... going through use case of other information could be used to select data sets, E.g., knowing expression is about mouse versus human

<mscottm> Scott: imagines that connectivity of EHR and clinical research and scans (DICOM) is part of http://www.ctsaconnect.org/ontologies

<michael> bye all!

janos: discussing scope of CTSAconnect; multipurpose use of integrated data connecting researchers and also for administrative data

mscottm: any questions please send to me

Summary of Action Items

[End of minutes]

Minutes formatted by David Booth's scribe.perl version 1.136 (CVS log)
$Date: 2012/06/04 16:14:51 $

Scribe.perl diagnostic output

[Delete this section before finalizing the minutes.]
This is scribe.perl Revision: 1.136  of Date: 2011/05/12 12:01:43  
Check for newer version at http://dev.w3.org/cvsweb/~checkout~/2002/scribe/

Guessing input format: RRSAgent_Text_Format (score 1.00)

Found ScribeNick: Janos
Inferring Scribes: Janos

WARNING: No "Topic:" lines found.

Default Present: +1.206.732.aaaa, Scott_Marshall, +1.631.444.aabb, +681aacc, michel
Present: +1.206.732.aaaa Scott_Marshall +1.631.444.aabb +681aacc michel
Got date from IRC log name: 04 Jun 2012
Guessing minutes URL: http://www.w3.org/2012/06/04-HCLS-minutes.html
People with action items: 

WARNING: Input appears to use implicit continuation lines.
You may need the "-implicitContinuations" option.

WARNING: No "Topic: ..." lines found!  
Resulting HTML may have an empty (invalid) <ol>...</ol>.

Explanation: "Topic: ..." lines are used to indicate the start of 
new discussion topics or agenda items, such as:
<dbooth> Topic: Review of Amy's report

[End of scribe.perl diagnostic output]