Draft Vocabularies Datasets As Current Situation
From Library Linked Data
(→More has been done on reference vocabularies and element sets than on bibliographic datasets) |
(→Data availability) |
||
| (One intermediate revision not shown) | |||
| Line 1: | Line 1: | ||
==Data availability== | ==Data availability== | ||
| - | === More has been done on | + | |
| + | The success of linked library data relies on the ability of its practitioners to identify, re-use or connect to existing datasets and data models. Linked datasets and vocabularies that are essential in the library and related domains, however, have previously been unknown or unfamiliar to many. The LLD XG has thus initiated an inventory of available library-related linked data, which is presented in further detail in [http://www.w3.org/2005/Incubator/lld/wiki/DraftReportWithTransclusion-Multiple_Reports#Available_Vocabularies_and_Datasets Section @@TODO@@] and has lead to the observations below. | ||
| + | |||
| + | === More has been done on value vocabularies and element sets than on bibliographic datasets === | ||
Many metadata element sets and value vocabularies have been released as linked data over the past couple of years, including some flagship value vocabularies already used by many libraries, such as the [http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset#Library_of_Congress_Subject_Headings Library of Congress Subject Headings], or the [http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset#DDC_summaries Dewey Decimal Classification]. It is also encouraging to see that reference metadata frameworks are also provided in a linked data-compatible form, including [http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset#Dublin_Core Dublin Core] or various [http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset#FRBR_Ontologies FRBR implementations]. | Many metadata element sets and value vocabularies have been released as linked data over the past couple of years, including some flagship value vocabularies already used by many libraries, such as the [http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset#Library_of_Congress_Subject_Headings Library of Congress Subject Headings], or the [http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset#DDC_summaries Dewey Decimal Classification]. It is also encouraging to see that reference metadata frameworks are also provided in a linked data-compatible form, including [http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset#Dublin_Core Dublin Core] or various [http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset#FRBR_Ontologies FRBR implementations]. | ||
| - | However, there are not yet many bibliographic datasets available in the linked data space. | + | However, there are not yet many bibliographic datasets available in the linked data space. Examples such the release of the [http://www.bl.uk/bibliographic/datafree.html British National Bibliography] show that there are indeed considerable difficulties involved (many discussed in this report). However, this proves not deterring enough, and the number of datasets released as linked data keeps increasing at a fast pace. |
=== Quality and support for available data varies greatly === | === Quality and support for available data varies greatly === | ||
| - | The level of maturity or stability of available resources vary greatly. Many resources we found are the result of (ongoing) project work, or the result of individual initiatives, and advertise themselves as mere prototypes. The abundance of such efforts is a sign of healthy activity going on in the library linked data domain. | + | The level of maturity or stability of available resources vary greatly. Many resources we found are the result of (ongoing) project work, or the result of individual initiatives, and advertise themselves as mere prototypes. The abundance of such efforts is a sign of healthy activity going on in the library linked data domain. It should come as no surprise, when the whole linked data endeavor encourages a much more agile view on data than in any previous paradigm. Yet this somehow jeopardizes the long-term availability and support for library linked data resources. |
From this perspective, we find it encouraging that more and more established institutions are committing resources to linked data projects, from the national libraries of Sweden, Hungary, Germany, France, the Library of Congress and the British Library, to the Food and Agriculture Organization of the United Nations, not to mention OCLC. | From this perspective, we find it encouraging that more and more established institutions are committing resources to linked data projects, from the national libraries of Sweden, Hungary, Germany, France, the Library of Congress and the British Library, to the Food and Agriculture Organization of the United Nations, not to mention OCLC. | ||
| Line 14: | Line 17: | ||
=== Linking across datasets has begun but requires further effort and coordination === | === Linking across datasets has begun but requires further effort and coordination === | ||
| - | Establishing connections across | + | Establishing connections across datasets is a core aspect of linked data technology, and a key condition to its success. A quick look at available data (see [http://www.w3.org/2005/Incubator/lld/wiki/DraftReportWithTransclusion-Multiple_Reports#Available_Vocabularies_and_Datasets Section @@TODO]) shows that many semantic links are already available across published value vocabularies, which is a great achievement for the nascent library linked data community as a whole. But more can -- and should -- be done to alleviate the issue of data redundancy in the various authority resources that library and related organizations maintain. A similar statement can be made about other datasets and the metadata element sets used to structure linked data descriptions. The two main bottlenecks are the rather low level of long-term vocabulary support and communication between vocabulary developers, and the lack of mature tooling to lower the cost for data publishers to produce a massive amount of semantic links across datasets. However, efforts are being carried out, which facilitate the exchange of experience, as well as the production and sharing of relevant links. We discuss this specific issue further at [http://www.w3.org/2005/Incubator/lld/wiki/DraftReportWithTransclusion-Multiple_Reports#The_linking_issue the end of this report]. |
| - | + | ||
| - | + | ||
| - | A similar | + | |
| - | + | ||
| - | + | ||
| - | + | ||
| - | + | ||
Revision as of 22:12, 12 August 2011
Contents |
Data availability
The success of linked library data relies on the ability of its practitioners to identify, re-use or connect to existing datasets and data models. Linked datasets and vocabularies that are essential in the library and related domains, however, have previously been unknown or unfamiliar to many. The LLD XG has thus initiated an inventory of available library-related linked data, which is presented in further detail in Section @@TODO@@ and has lead to the observations below.
More has been done on value vocabularies and element sets than on bibliographic datasets
Many metadata element sets and value vocabularies have been released as linked data over the past couple of years, including some flagship value vocabularies already used by many libraries, such as the Library of Congress Subject Headings, or the Dewey Decimal Classification. It is also encouraging to see that reference metadata frameworks are also provided in a linked data-compatible form, including Dublin Core or various FRBR implementations.
However, there are not yet many bibliographic datasets available in the linked data space. Examples such the release of the British National Bibliography show that there are indeed considerable difficulties involved (many discussed in this report). However, this proves not deterring enough, and the number of datasets released as linked data keeps increasing at a fast pace.
Quality and support for available data varies greatly
The level of maturity or stability of available resources vary greatly. Many resources we found are the result of (ongoing) project work, or the result of individual initiatives, and advertise themselves as mere prototypes. The abundance of such efforts is a sign of healthy activity going on in the library linked data domain. It should come as no surprise, when the whole linked data endeavor encourages a much more agile view on data than in any previous paradigm. Yet this somehow jeopardizes the long-term availability and support for library linked data resources.
From this perspective, we find it encouraging that more and more established institutions are committing resources to linked data projects, from the national libraries of Sweden, Hungary, Germany, France, the Library of Congress and the British Library, to the Food and Agriculture Organization of the United Nations, not to mention OCLC.
Linking across datasets has begun but requires further effort and coordination
Establishing connections across datasets is a core aspect of linked data technology, and a key condition to its success. A quick look at available data (see Section @@TODO) shows that many semantic links are already available across published value vocabularies, which is a great achievement for the nascent library linked data community as a whole. But more can -- and should -- be done to alleviate the issue of data redundancy in the various authority resources that library and related organizations maintain. A similar statement can be made about other datasets and the metadata element sets used to structure linked data descriptions. The two main bottlenecks are the rather low level of long-term vocabulary support and communication between vocabulary developers, and the lack of mature tooling to lower the cost for data publishers to produce a massive amount of semantic links across datasets. However, efforts are being carried out, which facilitate the exchange of experience, as well as the production and sharing of relevant links. We discuss this specific issue further at the end of this report.
