HCLSIG BioRDF Subgroup/Meetings/2007-07-16 Conference Call
Informal F2F Details
- Date of Meeting: Monday July 16, 2007
- Time of Meeting: 1:00pm-4:00pm Eastern Time
Location
- Room 346, Stata Center, MIT, at 32 Vassar Street, Cambridge, MA
- Directions to Stata
- Street map showing Stata
- Building map showing room 346
Telcon Details
- Dial-In #: +1.617.761.6200 (Cambridge, MA)
- Dial-In #: +33.4.89.06.34.99 (Nice, France)
- Dial-In #: +44.117.370.6152 (Bristol, UK)
- Participant Access Code: 246733 ("BIORDF")
- IRC Channel: irc.w3.org port 6665 channel #BioRDF (see W3C IRC page for details, or see Web IRC)
Agenda
- Progress on URI work (see HCLSIG_BioRDF_Subgroup/Tasks/URI_Best_Practices/Recommendations )
The editor proposes the following:
- Review reasons for putting effort into this project
- Review objectives and target audience
- Discuss process (consensus and closure)
- JAR's proposals, see HCLSIG_BioRDF_Subgroup/Tasks/URI_Best_Practices/Recommendations/DraftTalk
- a) Conceptual framework: descriptions, documents, versions, access, etc.
- b) Bibliographic reference analogy
- c) Retrieval ontology and well-known retrieval rules
- d) Identifiers for public database records
- e) Administrative options
For each item, I'd like to see:
- participants understand the issue
- endorsement or rejection of work so far
- some progress (e.g. agreeing on requirements, elucidation of issues)
- agreement on plan for further work
- individual commitments to help out
Minutes
IRC http://www.w3.org/2007/07/16-BioRDF-minutes
Meeting Notes:
1. Review reasons for putting effort into this project
- we want to use same URI for same thing, so that joins work - URIs should have clear meaning
- Punning is definitely not OK. E.g. a gene and its descriptive
record must have different URIs, if both have URIs at all. -- No disagreement from anyone present. - reusing a URI for different purpose worse than breaking link -- No disagreement from anyone present. - recommendations document must have a story about accessing stuff
- accessing document versions
- accessing RDF descriptions of resources
(definitions, identifications, specifications, documentation...)
- RDF statements that are otherwise about something, e.g.
non-description statements such as instrument readouts
2. Review objectives and target audience
Objectives: a document meeting above goals. HCLS primarily, others if possible. No one expressed an opinion on question of whether HCLS = those active in the public-semweb-lifesci list, or HCLS = the W3 SIG members, or HCLS = the larger health care & life sciences community.
3. Discuss process (consensus and closure)
Alan: what kind of endorsement, attached to the document, would make a difference to a life sciences "consumer" such as a pharma? [e.g. endorsed by editor only, by BioRDF, by HCLS mailing list, by
HCLS IG, by TAG or SWEO, ...] -- No particular answer.
Eric P: process = editor submits draft to the IG; draft is published on W3C site iff IG gives approval. Draft must specify its approval status, e.g. "editor's opinions only", "approved by IG", etc.
4. Make sure everyone understands the conceptual framework, and gets a
chance to critique JAR's approach of descriptions, documents, versions, access, etc.
Document: group's advice to JAR: augment definition with the nonintuitive examples such as news, instrument readout. Aims of defining this term:
- LSID compatibility
- opportunity to be different (or same) from foaf:Document
or AWWW "information resource" [3. Enable abstraction away from particular location methods such as HTTP] Alan: please consult with OBI on choice of term and/or definition General advice: please coin a new term, perhaps a qualification or refinement of "document".
There are groups of pieces-of-data, and some people use various terms for different purposes... Someone noted that different people implement LSIDs differently. Someone said that the LSID notion of version is impractical since it disallows trivial changes such as spelling corrections.
The "meaning" of document may be constant, but there is a need to provide variant representations. Bill gets stuck on the terms "version" and "variant". Advice to JAR: clarify definitions and give examples. Explicitly discuss issue of variants in meaning vs. variants in representation.
EN: Clear policy expression [of range of possible versions as a function of time and HTTP headers] is very important. AR: OBO has a reasonable framework for ontology evolution.
JAR explained to Bill Bug about versioned LSID denotations: an LSID that has a version component denotes a piece-of-data; otherwise it has no data, only metadata. (According to the spec that is.) Bill wants definitions that are clearer or more constrained; not necessarily different terms. AR: avoid using "version" and "variant" altogether. E.g. restrict
to language like
"An access at time t retrieves the document at time t..." [JAR: so
what do you
call what was retrieved? Is the definition (policy) of a document
tied to a concept
of "accessing"? ...]
JAR reiterated the non-punning principle: A piece of data and the document that of which it's a version need to have different names, or else no name at all. [On the web we name documents, and do GETs of those names to retrieve pieces-of-data, which we don't name. With LSIDs we name both the document (LSID with no version component) and the piece-of-data (LSID with version component).]
Dereference: JAR was advised to change the definition to the following: a particular way of getting a piece-of-data, using web protocols, given a URI. [N.b. the URI might name the piece-of-data, or it might name a document of which the piece-of-data is a version; or there may be no relation between the piece-of-data and the identified resource, if the server is behaving foolishly.]
There was some hesitation around "meant" statements, but the phrasing was reluctantly accepted.
Editor was advised to put in a heading to divide the NLM terms from the others. JAR proposed moving them out of the definitions section altogether.
AR: Put this principle up for discussion: Names should outlive their
publishers.
Here is our first take at a strategy for a community process.
Daniel: ontology versioning.
EN: put in a link to POWDER.
5. Retrieval ontology and well-known retrieval rules
Skipped
6. Identifiers for public database records
[I had to leave while this was being discussed]
7. Administrative options [for a naming authority, if we decide we need one]
Meeting adjourned before this item was reached.