SV_MEETING_TITLE -- 08 Nov 2010

<mib_hv7s1y> EBI

http://esw.w3.org/HCLSIG_BioRDF_Subgroup/Meetings/2010/11-08_Conference_Call

<ericP> hey all, i'm still on another call. i'll show up in a few

ok, see ya soon

<scribe> scribenick: mscottm

Dietrich: Interested in how to make use of content from publishers - content that you don't own
... Important to provide the information that allows consumers of the data to draw the appropriate conclusions.
... Semantic Web provides the means to do this.

Slide 1 (pdf page 2): Pistoia Background

Dietrich: Pistoia formed to enable Pharma companies to share pre-competitive IT resources

Now on page 4, slide 3

scribe: Semantic Web sets up the first 'layer' of interoperability for Pistoia. We are now in the test phase.

page 5, slide 4

scribe: Brokering service uses vocabularies and rules

page 6, slide 5

Dietrich: Do you care about licenses?

Scott: Yes, we have discussed ways to deal with access control and security, as well as describe that an RDF graph has a certain license attached to it.

page 8 (SESL user interface mock up)

Tim: How did you get a 'co-occurrence'?

Dietrich: algorithmically

page 9: (Mockup) Diabetes genes in SESL prototype

page 10: (Mockup) Diabetes genes in SESL prototype II

Dietrich: Used UniprotDB for part of it (gene/protein)

page 12: Full triple store schema

page 13: Access to document / sentence content

Tim: Are you using URI's?

Dietrich: Yes but not everywhere. LexEBI doesn't use them internally - we map to URI's.

Michael: Do you use UIMA or 'roll your own'?

Dietrich: We don't use UIMA but are collaborating with colleagues who do.
... [continuing] because we use a uniform format, we can query across many different types of data. The only drawback is that query performance begins to slow.
... The particular corpus is limited to the last several years. Literature doubles every 8 years or so.
... We have reached a starting point and would like to work with others to build on this experience.
... Questions?

Scott: Are there resources from this project that we can access such as SPARQL endpoint, Lexicon, etc.?

Dietrich: Lexicon and other resources such as SPARQL will be public. I would prefer to do this starting January after checking with administrative people. I could give a sample sooner.

<ericP> am i late?

<ericP> apologies for being so late. hoping to catch the next steps

Scott: We would be interested in a sample access sooner to start the conversation about formats for sharing.

<KeesVanBochove> @ericP the meeting started about an hour ago

Scott: I think that we could form a 'block' of consumers and eventually form consensus on best practices and RDF that could motivate RDF services from ArrayExpress.

Tim: I see some definite overlap such as the Annotation Ontology (AO).

Dietrich: We would be interested in how to harmonize annotations on a document.
... Interested in combining manual and automatic annotations?

Tim: Yes.

ack

Satya: How do you create queries? GUI?

Dietrich: Queries are built into the web services at the moment. A query builder interface would be nice.

Satya: Where are your schemas stored? How are they used?

Dietrich: We use a number of different namespaces, such as for dates, etc.

Satya: More specifically, AO for authorship of a particular annotation, Uniprot for proteins/genes, etc.

Dietrich: Haven't gotten to that level of detail yet on, for example, annotations but am interested.

Marco: These types of resources would be interesting for use in a Biobanking project, which is getting a next round of funding in the Netherlands.

<KeesVanBochove> My connection dropped and the conference system doesn't allow me to enter again. I will follow up with Scott and Marco next Wednesday. Thanks for inviting me and see you next time!

Dietrich: You would be interested in the CALBC SSC corpus III (ready by next summer), we currently have immunological resources. Working hard on normalization.
... We can always use UIMA because we work with web services but I want annotations in the *stream* of the data between services.

[answering followup question from Michael]

Scott: Thanks Dietrich and group at EBI. I hope that we can continue to explore this convergence of efforts around the themes of federated triplestores, microarray RDF, and RDF representations of assertions mined from text. Probably the best way to get started is to attempt to share data located in our respective SPARQL endpoints. Thanks to all for coming. Bye.

Scribe.perl diagnostic output

[Delete this section before finalizing the minutes.]

This is scribe.perl Revision: 1.135  of Date: 2009/03/02 03:52:20  
Check for newer version at http://dev.w3.org/cvsweb/~checkout~/2002/scribe/

Guessing input format: RRSAgent_Text_Format (score 1.00)

Found ScribeNick: mscottm
Inferring Scribes: mscottm

WARNING: No "Topic:" lines found.


WARNING: No "Present: ... " found!
Possibly Present: Dietrich KeesVanBochove Marco Michael Satya Scott Tim ericP matthias_samwald mdmiller mib_hv7s1y mroos mscottm phil rfrost scribenick ssahoo2 sudeshna
You can indicate people for the Present list like this:
        <dbooth> Present: dbooth jonathan mary
        <dbooth> Present+ amy


WARNING: No meeting title found!
You should specify the meeting title like this:
<dbooth> Meeting: Weekly Baking Club Meeting


WARNING: No meeting chair found!
You should specify the meeting chair like this:
<dbooth> Chair: dbooth

Got date from IRC log name: 08 Nov 2010
Guessing minutes URL: http://www.w3.org/2010/11/08-hcls-minutes.html
People with action items: 

WARNING: No "Topic: ..." lines found!  
Resulting HTML may have an empty (invalid) <ol>...</ol>.

Explanation: "Topic: ..." lines are used to indicate the start of 
new discussion topics or agenda items, such as:
<dbooth> Topic: Review of Amy's report

[End of scribe.perl diagnostic output]

- DRAFT -

SV_MEETING_TITLE

08 Nov 2010

Attendees

Contents

Summary of Action Items

Scribe.perl diagnostic output