HCLSIG/SWANSIOC
Scientific Discourse Task Force
The HCLS Scientific Discourse Task Force is co-chaired by Tim Clark and Anita de Waard.
Project Description
Provide a Semantic Web platform for biomedical discourse which can be evolved over time into a more general facility for many types of scientific discourse, and which is linked to key biological categories specified by ontologies.
Discourse categories should include research questions, scientific assertions or claims, hypotheses, comments and discussion, experiments, data, publications, citations, and evidence. Biological categories should include genes, proteins, antibodies, animal models, laboratory protocols, biological processes, disease classifications, anatomical structures, user-generated taxonomies, and tools.
Our primary scientific use cases will be derived from problems in digital scientific communications and web-based research collaboratories supporting research in neurological disorders and therapies.
The scientific use cases will motivate a series of informatics use cases which can later be generalized across wider areas of biology and medicine. We propose, as a first general scientific use case, the cross-application of discoveries in stem cell, Alzheimer and Parkinson disease research via biomedical web communities in those areas.
The informatics-specific use cases will initially focus on interoperability of the SWAN Alzheimer Knowledge Base with research communities using the Science Collaboration Framework (SCF) Drupal deployment, as well as with useful tools for bibliographic annotation and online scientific discussion and collaboration.
SubTasks
Objectives are paired with tasks and activities.
A. Use Cases:
1. Enhance Drug-mechanism Knowledge in Drug Product Labels
- Status: Use case being finalized (deciding on drugs/patient characteristics, partners are joining)
- Leads: Richard Boyce, Maria Liakata, Jodi Schneider, Mike Taylor, Anita de Waard
- Partners: DERI, Elsevier; inviting ePocrates, Elsevier's Reaxys database
2. Defining core metadata for describing biomedical investigations
- Status: Description of use case being edited
- Leads: Tim Clark, Susanna-Assunta Sansone, David Shotton, Philippe Rocca-Serra
- Goal: develop a generic, anonymised, multi-EHS compliant format and an information architecture that allows access to symptom/treatment data as a Linked Data source that can be made freely available and used for drug efficacy/outcome (meta)studies
- Status: Description of use case being edited
- Leads: Joanne Luciano, Anita de Waard
- Partners: RPI, Elsevier; still need a clinical/patient forum and/or pharma partner
--Anita de Waard 14:51, 31 October 2011 (UTC)
B. Ontologies:
1. Scientific Discourse formalization: Prepare SWAN IG Note. Completed
- Subtask coordinator: Paolo Ciccarese
- IG Notes: SWAN formalization
2. Scientific Discourse in Online Communities: Integrate SWAN and SIOC ontologies. Completed
- Subtask coordinators: Alex Passant & Paolo Ciccarese
- Discussion: SWAN / SIOC Integration
- IG Notes: (a) SIOC Modifications (b) SWAN-SIOC Alignment
- Slides: Aligning Scientific Discourse and Social Semantics
3. Annotation Ontology (AO): Build provenance-aware model of mappings between sections of non-SemWeb documents, and terms in SemWeb ontologies.
- Subtask coordinator: Paolo Ciccarese
- Discussion: Document Annotation
4. Coarse-grained rhetorical structure: Ontology of Rhetorical Blocks (ORB)
- Subtask coordinator: Paolo Ciccarese
- Current Version: Ontology of rhetorical blocks version March 9
5. Discourse, Data & Experiment: Integrate SWAN, myExperiment & OBI (Ontology for Biomedical Investigations) ontologies
- Subtask coordinator: Sudeshna Das
- Discussion: Discourse, Data & Experiment
6. Bibliographic Ontologies: Integrate SWAN Citations with PRISM & CiTO to enable widely interoperable bibliographic ontologies.
- Subtask coordinators: Paolo Ciccarese and David Shotton
- Discussion: CiTO / SWAN and SWAN / PRISM Integration
7. Medium-grained structure: Integrate medium-grained discourse structure with DRO and DoCO
- Subtask coordinator: Jodi Schneider
- Discussion: Alignment Medium-grained with DRO and DOCO
C. Scientific Discourse structure:
- Subtask coordinator: Anita de Waard
- Discussion: Rhetorical Document Group
- Charter: hold weekly calls to discuss:
- New directions in scientific discourse structures
- Updates to ontologies and alignments
Project Pages
- Business Case
Semantic Integration of Biomedical Web Communities to Accelerate Research In Neurodegenerative Disorders
Online community sites in general (forums, weblogs, bulletin boards, collaboratories, open access journals, etc.) have replaced many of the traditional means of keeping a community informed and are supplementing, and to some extent replacing, print libraries and print publishing. They are a valuable source of information, and a search for information quite often ends on a community site. But there is a problem: online community sites are like islands without bridges connecting them. You may find information in one forum without knowing that related pieces of information can be found on other community sites.
SIOC (Semantically-Interlinked Online Communities) is an attempt to link online community sites and to use Semantic Web technologies to describe the information community sites have about their structure and contents. An aim of SIOC is to allow people to find related information in other online communities and to discover new connections between discussion posts. The SIOC project is a sub-initiative of the DERI Líon project (funded by SFI).
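As a rough illustration of the linking idea described above, the following sketch finds posts on other community sites that share a topic with a given post. It is not the SIOC project's implementation: the site and topic URIs are made up, the triples are held in a plain Python list rather than an RDF store, and only the sioc:topic and sioc:has_container properties from the SIOC namespace are used.

```python
from collections import defaultdict

# SIOC core namespace; the property names used below (topic, has_container)
# come from the SIOC vocabulary. Everything else is hypothetical.
SIOC = "http://rdfs.org/sioc/ns#"

# Hypothetical (subject, predicate, object) triples from two community sites.
TRIPLES = [
    ("http://forum-a.example/post/1", SIOC + "has_container",
     "http://forum-a.example/forum/ad"),
    ("http://forum-a.example/post/1", SIOC + "topic",
     "http://example.org/topic/amyloid-precursor-protein"),
    ("http://forum-b.example/post/7", SIOC + "has_container",
     "http://forum-b.example/forum/stem-cells"),
    ("http://forum-b.example/post/7", SIOC + "topic",
     "http://example.org/topic/amyloid-precursor-protein"),
    ("http://forum-b.example/post/8", SIOC + "topic",
     "http://example.org/topic/dopamine"),
]


def related_posts(triples, post):
    """Return, sorted, the posts that share at least one sioc:topic with `post`."""
    posts_by_topic = defaultdict(set)
    for subject, predicate, obj in triples:
        if predicate == SIOC + "topic":
            posts_by_topic[obj].add(subject)

    related = set()
    for members in posts_by_topic.values():
        if post in members:
            related |= members - {post}
    return sorted(related)


if __name__ == "__main__":
    for uri in related_posts(TRIPLES, "http://forum-a.example/post/1"):
        print(uri)
```

In a real deployment the triples would come from SIOC exports of each site and be queried with SPARQL; the in-memory list here only keeps the example self-contained.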
In parallel with the SIOC effort, researchers are now beginning to realise the potential of social web technologies for scientific, legislative and other domain-specific discourses. Both formal scientific works in publications and research discourse in community mechanisms can and should be interlinked with semantics. For example, efforts like bio-zen and SISC are aiming to represent data, information and knowledge from research in all facets of the life sciences on the Semantic Web. There is a need to provide structured representations of professional scientific discourse for the HCLS domain, and this fits well with a future direction of SIOC to augment the existing framework with terms and applications specific to various domains.
Alzheimer Disease (AD) and Parkinson Disease (PD) are devastating neurodegenerative disorders for which there is no cure, and whose mechanisms (etiology) are incompletely understood. There are currently some 5 million Alzheimer patients and more than 1 million Parkinson patients in the U.S. alone, with the cost of care-giving running into the hundreds of billions of dollars. These numbers are expected to double over the next several decades because of projected increases in the aged population. AD is now the third most expensive disease to treat in the U.S., costing society close to $100 billion annually.
AD and PD are characterized by the loss of function and eventual death of massive numbers of neurons, beginning in specific brain regions (entorhinal cortex in AD and substantia nigra in PD).
There are a number of ways in which closer integration of stem cell, AD and PD research could be beneficial:
- Emerging hypotheses propose that certain molecules that play a central role in AD and PD are involved in neural regeneration, and that a possible cause of cell death is the loss of this regenerative capacity. These molecules include nerve growth factor, amyloid precursor protein and dopamine.
- Maintaining stable cultures of human neurons has been difficult, and has impeded progress in developing "test tube" models to test hypotheses and screen drugs. Using embryonic stem cells to generate human neurons and other types of brain cells could lead to better test-tube models of neurodegenerative disease.
- Therapy development is exceptionally challenging for neurodegenerative diseases because in the adult central nervous system, neurons generally are not capable of regenerating to replace diseased and dying cells. Stem cells may be manipulated to develop cell lines and engineered tissue suitable for transplantation therapy.
- Stem cell biology may provide knowledge to harness the brain's innate regenerative capacity for therapeutic purposes.
Alzheimer Disease research is the focal area of the oldest and largest biomedical web community by and for AD researchers, Alzforum (www.alzforum.org).
The SWAN ontology and knowledgebase (Ciccarese et al. 2008, Journal of Biomedical Informatics, in press) is a joint project of the Alzforum and the Massachusetts General Hospital. Stem Cell technology is likewise the subject of StemBook, an online publication of the Harvard Stem Cell Institute. StemBook is implemented using the Science Collaboration Framework (SCF), a special distribution of Drupal which among other capabilities can node-proxy resources on SPARQL endpoints (lazily instantiated node data) and understands certain elements of SWAN. A third web community, PD Online Research, also based on SCF, is now under development with scheduled deployment in Spring 2009 and planned integration with SWAN.
These communities with intersecting but distinct research interests are poster children for semantic interoperability of discourse. They form a convenient and perhaps ideal driving biological project for integrating SWAN and SIOC while keeping requirements grounded in the needs of actual biomedical researchers.
Meetings
Past meetings:
- March 19, 2012: Overall check in
- February 20, 2012: The PROV Provenance data model, Paul Groth - http://www.w3.org/TR/2012/WD-prov-dm-20120202/
- January 30, 2012: Paolo Ciccarese and Tommaso Teofili: Creating, visualising, sharing, curating and discussing text mining results with the Domeo Annotation Toolkit. Integration of Domeo with Apache UIMA through Apache Clerezza
- January 23, 2012: Joint call: Use case update
- December 19, 2011: Rafal Rak, National Centre for Text Mining, U-Compare; Use cases update
- November 21 2011: Use cases overview + New Format
- October 31 2011: Use cases overview
- October 10 2011: Use cases for the new HCLS Charter
- July 25 2011: Ping Wang, Rensselaer Polytechnic Institute - A Semantically-Enabled Provenance-Aware Water Quality Portal
- July 11 2011: Adrian Walker - Application Semantics via Rules in Open Vocabulary Executable English
- June 20 2011: Anita de Waard - Executable Papers
- June 6 2011: Merce Crosas - Data Citation Principles
- May 23 2011: Joanne Luciano - SADI; Alex Garcia - RDFising Biomedical Docs
- May 2 2011: BioRDF Demonstrator - Collaboration
- April 18 2011: Discussion on medium-grained ontologies and alignment
- April 11 2011: Jodi Schneider: Medium-grained ontologies and alignment
- April 4 2011: Report on the Open Annotation Consortium Workshop
- March 28 2011: talk by Anita de Waard on a Paradigmatic/Syntagmatic analysis of scientific text
- March 21 2011: talk by Barend Mons, Erik Schultes and Scott Marshall on nanopublications
- March 14 2011: talk by Howard Burrows
- March 7 2011: David R Newman 'Research Objects for e-Laboratories'
- Feb 28 2011: Rhetorical Document Model
- Feb 14 2011: Rhetorical Document Model
- Feb 7 2011: "Beyond the PDF" Research Objects Report
- Jan 31 2011: Rhetorical Document Model, DEXI and BtPDF Research Objects
- Jan 24 2011: Report on Beyond the PDF Workshop
- Dec 20 2010: Alignment Status
- Dec 13 2010: Rhetorical Document Model
- Dec 6 2010: Data+Experiment (DEXI): Gully Burns on the KEfED Model of Experiments
- Nov 29 2010: Data+Experiment (DEXI)
- Nov 15 2010: Rhetorical Document Model: ORB & DOCO convergence progress report
- Nov 01 2010: Discourse, Data & Experiment (SWAN + myExperiment + OBI etc)
- Oct 25 2010: Annotation Framework Update & Live Demo
- Oct 18 2010: Bibliographic Records & Citations (CiTO + SWAN + PRISM etc)
- Oct 4 2010: Annotation Ontology and Text Mining
- Sep 27 2010: Coarse grained document structure: ORB & DOCO
- Sep 20 2010: Aligning the Text Mining and SemWeb Communities
- Sep 13 2010: SciDisc Autumn 2010 Status and Planning
- Other Previous Meetings
Dial-in & IRC Information
- Dial-In #: +1.617.761.6200 (Cambridge, MA)
- Dial-In #: +33.4.26.46.79.03 (Paris, France)
- Dial-In #: +44.203.318.0479 (London, UK)
- Participant Access Code: 42572 ("HCLS2")
- IRC Channel: irc.w3.org, port 6665, channel #HCLS2 (use the IRC direct link, see the W3C IRC page for details, or use Web IRC)
- Mibbit quick start: Click on mibbit for instant IRC access
- Duration: 1hr
Related Links
- http://sioc-project.org
- http://drupal.org/project/sioc
- http://www.sindice.com
- http://swse.deri.org
- http://swan.mindinformatics.org
- http://swan.mindinformatics.org/ontology/1.1/ (OWL Files)
- http://neuroscientific.net/index.php?id=43 (bio-zen)
- http://neuroscientific.net/sisc/sisc_introduction.htm (SISC)
- In the bio-zen initiative, SIOC has been included in their attempts to represent data, information and knowledge from research in all facets of the life sciences on the Semantic Web. As part of this, the Semantically-Interlinked Scientific Communities (SISC) effort aims to improve how scientific data and knowledge are currently being represented and communicated. It uses SIOC, FOAF, DC, Creative and Science Commons, OBO and HCLS ontologies and technologies as its basis. According to initiative creator Matthias Samwald, SIOC was chosen as one of the base ontologies for this effort since it provides "an excellent tool to describe scientific discourse in a practical, web-centric manner".
- http://www.stembook.org/
- http://rdf.myexperiment.org/ontologies/
- http://rdfs.org/ns/void-guide
- Functional Requirements for Bibliographic Records (IFLA 1998)
Participants
(To e-mail any DERI members, use firstname.lastname@deri.org)
- Sophia Ananiadou (U of Manchester)
- Uldis Bojars (DERI / NUI Galway)
- John Breslin (DERI / NUI Galway)
- Gully Burns (USC/ISI)
- Kei Cheung (Yale School of Medicine)
- Annamaria Carusi (University of Oxford)
- Paolo Ciccarese (Harvard Medical School)
- Tim Clark (Harvard Medical School)
- Ron Daniel (Elsevier)
- Sudeshna Das (Harvard Medical School)
- Anita de Waard (Elsevier)
- Alf Eaton (Nature Networks)
- Ronan Fox (DERI / NUI Galway)
- Matthew Gamble (University of Manchester)
- Carole Goble (University of Manchester)
- Tudor Groza (DERI / NUI Galway)
- Christoph Lange (Jacobs University)
- Joanne Luciano (Tetherless World Constellation @ Rensselaer Polytechnic Institute, Predictive Medicine, Inc.)
- Scott Marshall (Leiden University Medical Center)
- David R Newman (University of Southampton)
- Marco Ocana (Balboa Systems)
- Jack Park (Open University)
- Alexandre Passant (DERI / NUI Galway)
- Satya Sahoo (Wright State University)
- Matthias Samwald (DERI / NUI Galway)
- Susanna Sansone (University of Oxford)
- Tony Scerri (Elsevier)
- Jodi Schneider (DERI / NUI Galway)
- David Shotton (University of Oxford)
- Susie Stephens (Johnson & Johnson Pharmaceutical Research & Development)
- Holger Stenzhorn (DERI / NUI Galway)
- Karin Verspoor (University of Colorado)
- Elizabeth Wu (Alzheimer Research Forum)
- Jun Zhao (University of Oxford)
If you have any questions please contact Tim Clark (tim_clark at Harvard dot EDU)