LD4LT Group Kick-Off and Roadmap Meeting
Linked Data for Language Technology (LD4LT) 21 March, Athens, Greece co-located with the European Data Forum 2014.
Linked Data (LD) has proven beneficial in many new and unforeseen ways for Language Technology (LT) and the newly gained interoperability and availability of LT data and services is currently receiving industry adoption. The Workshop will offer several short talks around the topics of Linguistic resources available in the linked data format and their potential impact on research and industry. We will explore together the advantages of promoting free, open and interoperable language resources as Linked Open Data (LOD) to lower the entry cost to innovating in the use of such resources and technologies in research and industry. We will also aim at exploring how Linguistic Linked Licensed Data open new opportunities for market-based linguistic data exchange.
In summary, we aim at bringing together stakeholders involved in the linguistic data value chain in order to explore new use cases and business models for reusing existing and future language (open or licensed) linked data resources and platforms for the joint development and hosting of multi–language data sets and services in different types of applications such as content analytics, machine translation, sentiment analysis, term disambiguation, etc.
With the foundation of the Linked Data for Language Technologies (LD4LT) W3C community group, this event will start the discussion, analyse current trends and offer a crystallization point to coordinate the development of future LD-based LT applications. The LD4LT Group Kick-Off Meeting and Roadmap meeting is supported by the LIDER EU project, the MultilingualWeb community, the NLP2RDF project, the Working Group for Open Data in Linguistics as well as the DBpedia Project.
As input to the discussion and the work of the LD4LT group, we invite you to take a few short minutes to fill in the first LIDER survey. During the kick-off meeting, via the survey and through the LD4LT group, you will have the opportunity to offer your view on how linked data and language technology should benefit each other.
Registration and Position Statement
The LT4LD Kick-Off meeting is a free community event - there is no admission fee for participants, but registration is required via this online form (a free W3C Account or only your mail address needed).
At the workshop we want to give you room for a position statement: what are you ideas / technologies / use cases related to Linked Data and Language Technology? If you are interested in making a position statement a the workshop, please indicate this in this in your registration form by providing a short title. We assume about 10 minutes per statement but will decide about detailed timing depending on the number of statements.
Schedule (9:00 - 17:00 EET)
Session 1 Challenges
- Welcome and Introduction (LD4LT Group Chair)
- Challenges of Linked Data 4 Language Technology (A. Gomez-Perez from the LIDER Project)
- Hans Uszkoreit: Language Technologies and Linked Data: Opportunities, Challenges and Synergies (DFKI, Language Technology Lab)
- Nicoletta Calzolari: "Open Language Resources & Meta-Resources: a Treasure and a Challenge for Linked Data" (CNR)
- Phil Archer "Demanding Multilingual Schemata" (W3C)
Session 2 Discussion
- Introduction of participants
- Briefing from W3C Community Groups:
- Linked Data for Language Technology (LD4LT) - Dave Lewis
- OntoLex - Philipp Cimiano
- Best Practice in Multilingual Linked Open Data (BPMLOD) - Jorge Gracia
- Emotion and Sentiment Analysis - Gabi Vulcu
- Refine charter and goals for LD4LT and relationship to other groups
Session 3 An Industry Perspective
Presentations by companies and industrial stakeholders about current problems as well as existing solutions.
- Uroš Milošević from Institute Mihajlo Pupin: Letters to Meaning, Rozeta is a multilingual NLP and Linked Data tool wrapped around STRUTEX, a structured text knowledge representation technique used to represent natural language documents in structured form, and extract words and phrases. Rozeta provides automatic extraction of STRUTEX dictionaries in Linked Data form, semantic enrichment through link discovery services, a manual revision and authoring component, a document similarity search tool and an automatic document classifier.
- Ilan Kernermann, KDICTIONARIES
Session 4 Use Case Discussion
An online survey on areas of industrial interest and use case areas for linked data in language technology is currently underway.
The results of this survey will be used, together with input form the keynote speakers, existing community groups, and industrial position statement, to seed an open mike discussion on the priorities and concerns of industry in considering the use of linked data for language technologies.
The LD4LT meeting will be organised by the LIDER Project
Asuncion Gómez-Pérez (UPM)
David Lewis (TCD)