================================================================
================================================================
Questions marked with an asterix (*) are more important.
================================================================
Section 0. Contact and confidentiality
================================================================
Contact e-mail: simon.jupp@manchester.ac.uk , robert.stevens@manchester.ac.uk
Do you mind your use case being made public on the working group website and
documents?
No
=============================================================
Section 1. Application
=============================================================
In this section we ask you to provide some information about the 
application for which the vocabulary(ies) and or vocabulary mappings are 
being used.
Please note:
-- If your use case does not involve any specific application, but 
consists rather in the description of a specific vocabulary, skip 
straight to Section 2.
-- If your application makes use of links between different 
vocabularies, do not forget to fill in Section 3!
 1.1. What is the title of the application?
Sealife - A Semantic Grid Browser for the Life Sciences Applied to the study of Infectious Diseases
 1.2. What is the general purpose of the application?
        What services does it provide to the end-user?
Develop a grid browser for the life sciences, which will link the web to the current emerging e-science infrastructure. 
The Sealife browser will allow users to automatically link a host of Web servers and Web/Grid services to the Web content he/she is visiting. 
The browser will identify terms in web pages and documents as they are being browsed using background knowledge held in ontologies and vocabularies. 
Through the use of Semantic Hyperlinks, which link identified concepts and terms in pages to servers and services, the Sealife browser will offer 
a new dimension to context-based information integration.
*1.3. Provide some examples of the functionality of the application. Try 
to illustrate all of the functionalities in which the vocabulary(ies) 
and/or vocabulary mappings are involved.
The vocabularies will be used to identify bio-medical terms and concepts on web pages as the user browses the web. Dynamic hyperlinks are 
generated around the concepts and link boxes will offer relevant and related services to the user based on terms held in the vocabularies and ontologies. 
 1.4. What is the architecture of the application?
      What are the main components?
      Are the components and/or the data distributed across a network, 
or across the Web?
The system is built on the Conceptual Open Hypermedia Service (COHSE) system. COHSE's Architecture is composed of a COHSE Distributed Links Service (DLS)
agent and two supporting services: a Knowledge Service (OS) and a Resource Manager (RM). The DLS agent is responsible for adding hyperlinks and drawing 
text boxes on the pages using AJAX technology. The knowledge Service stores ontologies (in OWL) or SKOS vocabularies, that are used for term recognition 
on web pages. The resource manager allows mapping between concepts and resources, once resources are discovered hyperlinks to the resource are generated 
on the page. Services are pluggable components e.g. google web search service.
The COHSE system can be deployed on the web server side, as an intermediary service (e.g. proxy) or in the browser (e.g Mozilla plugin)
 1.5. Briefly describe any special strategy involved in the processing 
of user actions, e.g. query expansion using the vocabulary structure.
For any term highlighted on a page, resources are provided for broader, narrower and related terms obtained from the underlying vocabulary.
 1.6. Are the functionalities associated with the controlled 
vocabulary(ies) integrated in any way with functionalities provided by 
other means? (For example, search and browse using a structured 
vocabulary might be integrated with free-text searching and/or some sort 
of social bookmarking or recommender system.)
 1.7. Any additional information, references and/or hyperlinks.
http://www.biotec.tu-dresden.de/sealife/
http://cohse.cs.manchester.ac.uk/
Demo- http://cohse.cs.manchester.ac.uk/demo/NELI-demo.avi.zip
=============================================================
Section 2. Vocabulary(ies)
=============================================================
In this section we ask you to provide some information about the 
vocabulary or vocabularies you would like to be able to represent using 
SKOS.
Please note:
-- If you have multiple vocabularies to describe, you may repeat this 
section for each one individually or you may provide a single 
description that encompasses all of your vocabularies.
-- If your use case describes a generic application of one or more 
vocabularies and/or vocabulary mappings, you may skip this section.
-- If your vocabulary case contains cross-vocabulary links (between the 
vocabularies you presented or to external vocabularies), please fill in 
section 3!
 2.1. What is the title of the vocabulary? If you're describing multiple 
vocabularies, please provide as many titles as you can.
Our potential vocabularies are any biomedical ontology-like resource. We are targeting the Open biomedical Ontology (ies) (http://obo.sf.net). Others include SNOMed, MeSH, and many more. The OBO Ontologies use their own representation that has OWL semantics. The others have their own proprietory formats. We have already written converters from OBO format to SkOS and one already exists for MeSH.
2.2. Briefly describe the general characteristics of the vocabulary, 
e.g. scope, size...
The Gene Ontology, for instance has some 20,000 terms. It is a simple taxonomy combined with part-of relationships. It also has thesaurus features of scope note (definition), synonyms, broader than, narrower than and related terms. It covers molecular function, biological process and cellular location of gene products. Other OBO ontologies cover many features of molecular biology from genotype to phenotype. Other ontologies cover disease, anatomy, healthcare, etc. they range in size from a few hundred terms to 100,000's.
 2.3. In which language(s) is the vocabulary provided?
      In the case of partial translations, how complete are these?
American English.
 *2.4. Please provide below some extracts from the vocabulary. Use the 
layout or presentation format that you would normally provide for the 
users of the vocabulary. Please ensure that the extracts you provide 
illustrate all of the features of the vocabulary.
OBO format used to create the ontologies/vocabularies.
[Term]
id: neli:000015
name: Aspergillus
description: ....
Synonym: .... ! narrower
is_a: neli:000011 ! Bacteria
[Term]
id: neli:000016
name: Bacillus
description: ...
Synonym: ... ! broader
is_a: neli:000011 ! Bacteria
 2.5. Describe the structure of the vocabulary.
      What are the main building blocks?
      What types of relationship are used? If you can, provide examples 
by referring to the extracts given in paragraph 2.4.
The vocabularies we plan to use cover various aspects of the bio-medical domain. The domain ranges from vocabularies used in the area of molecular 
biology and anatomy to infectious disease, taxonomy and medicine. These vocabularies range in size, scope and representation. Many of the biological 
ontologies are represented in the OBO format (e.g. The Gene Ontology [http://www.geneontology.org/]) for which we have mapped to SKOS, others are in OWL and some like MeSH (Medical Subject Headings) are in some native representation (e.g. XML). We are also working with the Nation Library of Infectious Disease (NeLI) to generate a new vocabulary that will be used aid user navigation around the NeLI website. This vocabulary will be represented in SKOS and used as a lexical resource for the COHSE system.
All of the vocabularies and ontologies range in their level of expression but share some common features, such as subsumption hierarchies, preferred labels, synonyms and descriptors.
 2.6. Is a machine-readable representation of the vocabulary already 
available (e.g. as an XML document)? If so, we would be grateful if you 
could provide some example data or point us to a hyperlink.
MeSH is available in MeSH XML format, some XSLT is available to do a conversion.
The OBO ontologies are available as OBO XML or in the OBO file format. OBO-Edit API is available.
 2.7. Are any software applications used to create and/or maintain the 
vocabulary?
      Are there any features which these software applications currently 
lack which are required by your use case?
We are using a combination of Protege and OBO-edit to generate and edit vocabularies, we then have a set of application that convert these various 
formats into a SKOS representation.
 2.8. If a database application is used to store and/or manage the 
vocabulary, how is the database structured? Illustration by means of 
some table sample is welcome.
N/A
 2.9. Were any published standards, textbooks or written guidelines 
followed during the design and construction of the vocabulary?
      Did you decide to diverge from their recommendations in any way, 
and if so, how and why?
In most cases we did not develop the vocabularies, they were developed with a different use case in mind, we have adapted them for use within our application.
 2.10. How are changes to the vocabulary managed?
 2.11. Any additional information, references and/or hyperlinks.
=============================================================
Section 3. Vocabulary Mappings
=============================================================
In this section we ask you to provide some information about the 
mappings or links between vocabularies you would like to be able to 
represent using SKOS.
Please note:
-- If your use case does not involve vocabulary mappings or links, you 
may skip this section!
 3.1. Which vocabularies are you linking/mapping from/to?
OBO to SKOS. 
MeSH to SKOS.
OWL to SKOS.
*3.2. Please provide below some extracts from the mappings or links 
between the vocabularies. Use the layout or presentation format that you 
would normally provide for the users of the mappings. Please ensure that 
the examples you provide illustrate all of the different types of 
mapping or link.
- "Mycobacterium Tuberculosis" -> is_a (skos:broaderThan) -> "Mycobacterium"
- "Mycobacterium Tuberculosis" -> causes (skos:relatedTo) -> "Tuberculosis"
- "Azoles" -> Treatment (skos:relatedTo) -> "Tuberculosis"
 3.3. Describe the different types of mapping used, with reference to 
the examples given in paragraph 3.2.
 3.4. Any additional information, references and/or hyperlinks.