HCLSIG/LODD/Meetings/2008-10-01 Conference Call
- Date of Call: Wednesday October 1, 2008
- Time of Call: 11:00am Eastern Daylight Time (EDT), 16:00 Greenwich Mean Time (GMT), 17:00 Central European Time (CET)
- Dial-In #: +1.617.761.6200 (Cambridge, MA)
- Dial-In #: +33.4.89.06.34.99 (Nice, France)
- Dial-In #: +44.117.370.6152 (Bristol, UK)
- Participant Access Code: 4257 ("HCLS").
- IRC Channel: irc.w3.org port 6665 channel #HCLS (see W3C IRC page for details, or see Web IRC)
- Duration: ~1h
- Convener: Susie
- Goals of LODD
- Potential data sets
- Attributes of data sets
<kei> susie: introduction
<kei> Oktie: u. of toronto, visiting IBM watson research, linked data community
<kei> Oktie: within IBM also working with Health Care data, PhD work, approximate string join, data cleansing, find links between different data sources
<kei> Glenn: national research council (researcher), text mining of scientific articles, knowledge extraction/discovery, recently working on semantic web, use semantic concepts to mine the literature
<kei> Chris: Univ. of Berlin (Assistant Prof), linked data project (drug related), 2 billion triples, info extraction project (extraction from wikipedia), interlinking hub, linked data publishing tool, identity resolution
<kei> Bosse: from AZ, identify use case, integrate different research data and drug over web as a focus, looking for collaboration
<kei> Anja: introduction
<kei> Scott: Univ. of Amsterdam, combine info. extraction, machine learning, SW, web services, bioinformatics, applications (killer apps), terminology/ontology plays a key role, database issues
<kei> EricN: working SW since 2001. Met Susie then. W3C health care, former chair, working in Pharmaceutical industry.
<kei> Christine: long history in semantic web in health care, IBM research, on the funding side ...
<kei> Susie: met EricN about 2001, talked about SW, Working Oracle for 18 months before joining Lilly, involved in Open Innovation Program, one of the co-chairs of HCLS
<kei> Susie: motivation -- a lot of info about drug on the web, the resources are highly distributed, one of the goals of this task force is to bring some of the heterogeneous datasets together, using open linking data principles, using HTTP URI's to publish, discover, and link RDF data
<kei> Oktie?: build on existing technologies (HTTP, URI) to deploy applications so existing tools/browsers can be used to access the linked data, a lot of data have been published based on this principle
<kei> Susie: datasets about drug made available in linked data format
<gnewton> How to Publish Linked Data on the Web" http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/
<kei> EricN: centralized or decentralized approach to querying the data over the Web (linked data)
<kei> Gnewton: decentralized approach, linked data crowd?
<kei> EricN: put everything in one triplestore may not be scalable.
<kei> Susie: can Chris give a presentation on linked data?
<kei> Chris: a workshop was held in Beijing
<kei> Susie: another possibility is to present at one of the HCLS calls
<kei> Chris: when to present?
<kei> Susie: after the ISWC meeting
<Oktie> You can find all the linked data links at http://linkeddata.org
<kei> Susie: Chris mentioned dbpedia that contains quite a lot of data about drugs
<kei> Susie: linking open data and the value it provides. One of the foci of the task is to demonstrate how to link a few datasets.
<kei> Glenn: compromise between use cases and low-hanging fruit
<kei> Susie: it could be a comprise
<kei> Susie: some of the datasets are proprietary or not completely accessible to the public, also how much work is involved
<kei> Susie: use cases -- easy for people to understand, subset of drug data, taking a number of data sources to link them together, scrape data for a theurapeutic area, class of drugs, etc
<kei> Susie: next steps -- understanding available data, list of datasets about drugs (e.g., drug bank, drugdb, etc)
<kei> Chris: scope?
<kei> Susie: second step: use case, third step: how datasets fit in the use case, how different datasets fit together, technologies/licenses, demo, host, promoting the work (demo)
<kei> Susie: people (volunteers) who would like to work on the area
<kei> Oktie: list of datasets on the wiki, link/dowload/access data, gather some information
<kei> Susie: all participants look at the datasets and compare notes
<kei> Chris: decide on the general criteria, create a table
<kei> Susie: people from the open linking community created such a table (something similar)?
<kei> Oktie: in addition to technical sides, what kinds of links are interesting, users? useful for examining the datasets
<kei> Chris: criteria: legal issue, how old/new the datasets, need a laundry list
<kei> EricN: papers -- drug information and put it on semantic web, use cases have been defined in the articles, narrow down to a few items, drug banks/diseases to do data mining, we need a goal that drives the criteria, need a drug/pharmaceutical goal, bioontologies workshop, data in the public domain, new diseases, phenotype, toxicity, look at the paper, identify practical things
<kei> Susie: agree that we should be aware of the work. Olivier sent a paper relating to this. Strong encourage people to post papers. Who else would like to join?
<kei> Glenn: can join with Susie in identifying papers
<kei> Susie: a use case is important -- a use case that requires integration of multiple sources, understanding the datasets
<kei> Susie: Can EricN and Bosse help identify the use cases?
<kei> Kei: if people are interested in neuroscience use cases, I can try to contribute
<kei> EricP: working as SW group contact for HCLS
<ericP> -> http://mor.nlm.nih.gov/download/rxnav/RxNormAPI.html RxNormAPI
<kei> EricP: use cases HCLS -- drugs/patients
<kei> EricP: simple but useful use cases
<kei> Susie: create a list of use cases for evaluation
<kei> Susie: Chris and Anya posted the info. gathered so far. Glenn extended the info as to understanding the legal issues, etc
<kei> Susie: good progress for today's call
<kei> Susie: structural and procedural things. Call everything other week?
<ericP> i just stuck some candiate evaluation criteria into http://esw.w3.org/topic/HCLSIG/LODD/Business
<kei> Susie: the group agreed
<kei> Susie: co-chair of the HCLS group so ideally others can take charge of the task force?
<kei> Susie: something that people may need some time to consider and discuss more next time
<kei> Chris?: what is leading the task?
<kei> Susie: leading the task involves coming up with agenda items, coordinating the task, and working on additional projects is an option
<kei> Susie: leading the biordf was a postive experience
<kei> Susie: good for the CV, etc
<kei> Susie: wrap up -- technical panel advisory committee meeting in France, discussion about work going on in W3C, IG's get together, there will be a HCLS F2F meeting, HL7, updates on SW group, OWL folks, time set aside for different tasks to make more progress, opportunities to do addition research ...
<kei> Susie: strongly encourage people to participate (Oct 20, 21)
<kei> Susie: further discussion of LODD at the F2F meeting
<kei> Susie: those who can't attend f2f meeting, participate by phone ...
<kei> Chris: are there other f2f meetings?
<kei> EricP: f2f meeting is rare in spring meeting ...
<kei> Bosse: must participants by W3C members?
<mscottm> gnewton, I can't find you in the HCLS IG participants list (Glenn right?)
<kei> Susie: non-members can participate as invited experts
<kei> Susie: we encourage all discussion on the HCLS list. maybe people can add a prefix to the subject header specific to a task force
<kei> Susie: next meeting will be held in two weeks.
<mscottm> gnewton, would you consider having your W3C Advisory Committee rep sign you up to our group?