Libraries worldwide face significant challenges in coping with the
increasing amounts of digital material that they must acquire from
external sources, either digital publishers or material produced by
their institution (e.g. by the faculty and researchers of an academic
institutions). These materials require new methods for long-term
stewardship and preservation in addition to those that have evolved
for print and other analog material.
Libraries must consider how these materials will be acquired,
either from publishers or faculty, and deposited into library
repositories, as well as their long-term storage, management, and
preservation. They must arrange for an array of managed digital
services including storage, transformation and transcoding, indexing,
search, and access. Institutions need digital repository solutions
that span these needs, allowing them to offer services for digital
resources in the same way that they have in the past offered services
for physical resources.
DSpace is an insitutional digital repository developed by
Hewlet-Packard Laboratories and the Massachusetts Institute of
Technology Libraries, and is currently in production at MIT. Faculty
and researchers can submit their research output (for example: papers,
image collections, datasets, audio, video) to relevant "communities"
in DSpace for long-term stewardship by the MIT Libraries.
Researchers at MIT and around the world can then find and access
resources in the DSpace corpus, with the assurance that MIT Libraries
is committed to keeping those resources available over time. At MIT,
DSpace runs on hardware donated by HP. The system also works on a
variety of Unix and Linux platforms. HP and MIT have made the DSpace
software available via an open-source, BSD-style license (see http://www.dspace.org).
During 2003, MIT Libraries will work with libraries at several
other academic research institutions that wish to adopt the DSpace to
provide similar service on digital resources. MIT, HP, and these
institutions hope to create a federated system in which, over time,
the collective resources of the world's leading research institutions
are made available to the world.
As the library's role in collecting, managing, and preserving
digital material produced by its organization expands, users will
expect to be able to find, access, and use these materials from
anywhere in the world, now and at any time in the future.
Delivering on the promise of this vision will require advances in
inter-operability among the digital resources under institutional
management - wherever they are physically hosted - and between those
resources and web services that consume and/or produce descriptions,
annotations, and additional resources. HP, through its ongoing
strategic alliance with MIT, has funded "Simile", a research project
extending DSpace in the direction of the semantic web.
SIMILE is a joint project conducted by the W3C, HP, MIT Libraries,
and MIT's Lab for Computer Science. SIMILE seeks to enhance
inter-operability among digital assets, schemas, metadata, and
services. A key challenge is that the collections which must
inter-operate are often distributed across individual, community, and
institutional stores. We seek to be able to provide end-user services
by drawing upon the assets, schemas, and metadata held in such
stores.
Simile will leverage and extend DSpace, enhancing its support for
arbitrary schemas and metadata, primarily though the application of
RDF and semantic web techniques. The project also aims to implement a
digital asset dissemination architecture based upon web standards. The
dissemination architecture will provide a mechanism to add useful
"views" to a particular digital artifact (i.e. asset, schema, or
metadata instance), and bind those views to consuming services.
To guide the SIMILE effort we will focus on well-defined,
real-world use cases in the libraries domain. Since parallel work is
underway to deploy DSpace at a number of leading research libraries,
we hope that such an approach will lead to a powerful deployment
channel through which the utility and readiness of semantic web tools
and techniques can be compellingly demonstrated in a visible and
global community.