HCLS/ScientificPublishingTaskForce

From W3C Wiki
HCLS Home SciPub Home Discussions

Scientifc Publishing Task Force

Task1: Ontology for Experiment Self-Publishing

Task Objectives

To develop a general purpose ontology for self-publishing single experiment in RDF format that will facilitate data sharing, discovery and integration. Applications such as web-publishing tool and semantic search engine that are built on top of this ontology will demonstrate the emerging semantic standards and technologies can help developing more interactive scientific communities centered around user-generated scientific contents on the web .

Task Status

Rationale

Today, scientific research data and results are shared primarily through publishing paper. A paper is usually composed of many experiments. While it's an important form of knowledge representation and sharing, scientific paper is far from an ideal form for finding and sharing experiment data. One can easily imagine how many new possibilities will open up if most of research data are available on the web as units of single experiment, which can be searched by everyone and consumed by computers. Of course, the challenge is how to get the scientific community to publish experiment data on the web in an appropriate format. As one key step toward meeting this challenge, the task describe here is intended to provide a set of vocabularies for publishing experiment data as single unit.

It's worth to mention a few benefits to researchers who self-publish experiment data:

  1. Make your experiment data widely accessible, unlocking the values of your most important assets.
  2. Increase the visibility of your research as a result of this new and extensive web of experiment data.
  3. Promote research collaborations and new knowledge discovery.

Scope

Defining the right scope is as important as the work product of this task. We should realize that scale is critical in establishing a new paradigm. In order to encourage large scale participation from everyone in the scientific community, two requirements are placed on the ontology to be developed here: (1) the terms should be as general as possible so that researchers across all disciplines can use the ontology; and (2) the ontology should facilitate development of easy-to-use self-publishing tools that everyone will like to use.

Note that there are some domain-specific ontologies, such as microarray experiment ontology, already existed today. However, the scope of the current task is broader than any of these. In fact, the ontology to be developed here will not be specific to any research domain.

Participants

AJ Chen (add your name here if you are interested in joining the task)

Paola Di Maio (keen but rather dumb would be system user)

Harry Snow (wants to help develop authoring tools so we can all stop wasting time trying to mine RDF out recently published papers!)

Use Cases

  1. Any researcher can publish his or her experiment data as single unit of experiment in RDF format using this ontology. Ideally, a new easy-to-use self-publishing tool specifically designed for such purpose will encourage the use of the ontology. However, any ontology editor can be used to create RDF content. Because data in such format can be searched by semantic web search engine, it offers a new channel for data sharing and retrieval that will accelerate scientific discovery as well as increase the researcher's visibility.
  2. Anyone can search for all experiments available on the web at various property levels. This use case requires a search engine that aggregates published experiment data in RDF format. Compared to traditional literature search, this semantic web search engine will provide much more relevant information.

Deliverables

  1. 1st Deliverable: A set of defined objects and properties for describing/publishing single unit of experiment.
  2. 2nd Deliverable: An ontology for self-publishing experiment data at high level.

Related resources

These employ a range of formal syntactical implementations from XML Schemas on through OWL-based ontologies.

EDITORIAL NOTE: On hopes some effort will be invested in ensuring these efforts have commensurate, formal semantic implementations, as this will be an absolute pre-requisite for these individual efforts to realize their intended goals. This does not necessarily imply they all must use the same normative syntax or link to a single, foundational ontology but provisions must be made to ensure they are algorithmically commensurate, if they are to contribute to the overall semantically-specified, biomedical investigation-related knowledge ecosystem (BillBug - 2006-08-28).

Timeline for Task Completion

  • Stage 1 (3 month goals)
  • Stage 2 (6 months goals)