XML/RDF Digital library

Project acronym: QUESTION-HOW
Project Full Title: Quality Engineering Solutions via Tools, Information and Outreach for the New Highly-enriched Offerings from W3C: Evolving the Web in Europe
Project/Contract No. IST-2000-28767
Workpackage 2, Deliverable D2.3

Project Manager: Daniel Dardailler <danield@w3.org>
Author of this document: Oreste Signore <oreste.signore@cnuce.cnr.it>

Created: 26 March 2003. Last updated: 26 March 2003.



Introduction

The main task has been to develop a user interface for querying complex and specialized corpora of XML documents (such as juridical documents, cultural heritage cataloguing cards, user manuals, etc.).
At the lowest level is the XCDE library, developed at the Department of Computer Science, University of Pisa. The core of the library (the indexing and compression algorithms) remains the property of its authors. The library is written in C and provides a set of efficient algorithms and data structures for indexing and searching an XML document collection.
The documents must be well-formed and may be heterogeneous, in that they may conform to different DTDs. The library supports the storage and management of these XML files in native form, operating directly at the file-system level. The main features of the library are: state-of-the-art algorithms and data structures for text indexing, compressed space occupancy, and novel succinct data structures for managing the hierarchical structure of the XML documents.
The user interface derives the document structure from the XML Schema and uses RDF facilities to broaden or narrow query terms. It implements graphical browsing of thesauri, so that semantic equivalences can support more effective searches.
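As an illustration only, broadening or narrowing a query term through a thesaurus can be sketched as follows. This is not the project's implementation: the thesaurus content, the dictionary layout and the function name expand_term are assumptions made for the example, and a plain Python dictionary stands in for the RDF description of the thesaurus.

```python
# Hypothetical thesaurus: each term lists its broader and narrower terms.
# In the actual system this information would come from RDF, not a dict.
THESAURUS = {
    "artwork": {"broader": [], "narrower": ["painting", "sculpture"]},
    "painting": {"broader": ["artwork"], "narrower": ["fresco", "watercolour"]},
    "sculpture": {"broader": ["artwork"], "narrower": []},
    "fresco": {"broader": ["painting"], "narrower": []},
    "watercolour": {"broader": ["painting"], "narrower": []},
}


def expand_term(term, relation, depth=1):
    """Return `term` plus every term reachable via `relation` in at most `depth` steps."""
    if relation not in ("broader", "narrower"):
        raise ValueError("relation must be 'broader' or 'narrower'")
    found = [term]      # keeps discovery order and avoids duplicates
    frontier = [term]   # terms whose relations are followed in the next step
    for _ in range(depth):
        next_frontier = []
        for current in frontier:
            for related in THESAURUS.get(current, {}).get(relation, []):
                if related not in found:
                    found.append(related)
                    next_frontier.append(related)
        frontier = next_frontier
    return found


# Narrowing "painting" by one level adds its more specific terms.
narrowed = expand_term("painting", "narrower")
# Broadening "fresco" by two levels climbs to "painting" and then "artwork".
broadened = expand_term("fresco", "broader", depth=2)
```

The expanded list of terms would then be submitted to the search engine in place of the single term typed by the user.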
Querying different document collections, in which semantically equivalent data elements are stored in different XML document structures, has also been considered.
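One possible way to picture this is a mapping from a logical field to the path where it is stored in each collection. The sketch below is only an assumption-laden example: the collection names, element names, sample documents and the functions find_values and collections_matching are invented for illustration, and it uses Python's standard xml.etree.ElementTree module rather than the XCDE library.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping: one logical field, one path per collection structure.
EQUIVALENT_PATHS = {
    "author": {
        "catalogue": "./card/creator",
        "manuals": "./header/writtenBy",
    },
}

# Hypothetical sample documents, one per collection.
DOCUMENTS = {
    "catalogue": "<records><card><creator>Giotto</creator></card></records>",
    "manuals": "<manual><header><writtenBy>Rossi</writtenBy></header></manual>",
}


def find_values(xml_text, collection, field):
    """Return the text of every element that stores `field` in this collection."""
    root = ET.fromstring(xml_text)
    path = EQUIVALENT_PATHS[field][collection]
    return [(element.text or "").strip() for element in root.findall(path)]


def collections_matching(field, value):
    """Return the names of the collections where `field` equals `value`."""
    return [
        name
        for name, xml_text in DOCUMENTS.items()
        if value in find_values(xml_text, name, field)
    ]


# The same logical query ("author") is resolved against both structures.
giotto_hits = collections_matching("author", "Giotto")
```

With such a mapping the user formulates the query once, on the logical field, and the interface translates it into the structure of each collection.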


Progress to date

At the present stage, the following components have been completed and are under test:

The implementation environment is:

Complete deliverable: http://www.weblab.isti.cnr.it/projects/QH/docs/deliverable.html

See the demo at http://www.weblab.isti.cnr.it/projects/QH/demo/schema

See the report (in PDF) on the XCDE library.


Deviations from plan

None.