[Minutes] LD4LT call 2014-05-15

See
http://www.w3.org/2014/05/15-ld4lt-minutes.html
and below as text. Note: I am sending out the text version since the tracker tool monitors the mailing list; in this way the issue and action lists are automatically updated. Also, I will close issue-2
https://www.w3.org/community/ld4lt/track/issues/2
with a link to today’s discussion.

- Felix


   [1]W3C

      [1] http://www.w3.org/

                               - DRAFT -

                                LD4LT CG

15 May 2014

   [2]Agenda

      [2] http://lists.w3.org/Archives/Public/public-ld4lt/2014May/0005.html

   See also: [3]IRC log

      [3] http://www.w3.org/2014/05/15-ld4lt-irc

Attendees

   Present
          DaveLewis, Gary, Kevin, MartinBenjamin, RobertoNavigli,
          ali, arle(IRC), asun, flati, fsasaki, john, jorge,
          maria, penny, phil, roberto, tizinao

   Regrets
   Chair
          dave

   Scribe
          fsasaki

Contents

     * [4]Topics
         1. [5]madrid meeting
         2. [6]Reaching agreement on core LR metadata ontologies,
            with META-SHARE
         3. [7]Guidelines for migrating existing LR metadata into
            RDF
         4. [8]other issues + call time
         5. [9]aob
     * [10]Summary of Action Items
     __________________________________________________________

   <fsasaki_> scribe: fsasaki

   dave: reminder for all to join IRC during calls: go to
   [11]http://irc.w3.org , channel #ld4lt

     [11] http://irc.w3.org/

   agenda at
   [12]http://lists.w3.org/Archives/Public/public-ld4lt/2014May/00
   05.html

     [12] http://lists.w3.org/Archives/Public/public-ld4lt/2014May/0005.html

   dave going through agenda at
   [14]http://lists.w3.org/Archives/Public/public-ld4lt/2014May/00
   05.html

     [14] http://lists.w3.org/Archives/Public/public-ld4lt/2014May/0005.html

   dave: I'll talk about last week in madrid, we had an LD4LT
   meeting - others can give their viewpoints as well
   ... then we'll discuss ongoing work on meta-share ontology
   ... then jorge, penny who provided useful info on mailing list
   - would be good to see where that work is
   ... there are a few technical issues discussed on the list. If
   there is a thread on this we mark that up as an issue

   [15]https://www.w3.org/community/ld4lt/track/issues/

     [15] https://www.w3.org/community/ld4lt/track/issues/

   <daveL_> [16]http://www.w3.org/community/ld4lt/track/

     [16] http://www.w3.org/community/ld4lt/track/

   the above links give the issue list

   dave: the tracker tracks actions and issues
   ... the issue tracker gives us a way to track topics over
   several meetings, and can associate that with actions as well
   ... see e.g. issue-2 which is meta-share metamodel work
   approach
   ... we have use case and requirements as issue-1
   ... would not go through that today
   ... then there is an issue about dcat, issue-3
   ... we had last week discussion in madrid related to dcat
   ... there is an opportunity for cross over where

madrid meeting

   dave: there were LIDER meetings and the MLW workshop, very
   successful, hosted by UPM

   about 100 people turning up

   dave: a broader set of discussions around multilingual web
   topics
   ... linked data not the only topic

   <daveL__>
   [17]http://www.multilingualweb.eu/documents/2014-madrid-worksho
   p/

     [17] http://www.multilingualweb.eu/documents/2014-madrid-workshop/

   see slides now linked from
   [18]http://www.multilingualweb.eu/documents/2014-madrid-worksho
   p/2014-madrid-program

     [18] http://www.multilingualweb.eu/documents/2014-madrid-workshop/2014-madrid-program

   <daveL__>
   [19]https://www.w3.org/community/ld4lt/wiki/LD4LT_Group_Madrid_
   May_2014_Meeting

     [19] https://www.w3.org/community/ld4lt/wiki/LD4LT_Group_Madrid_May_2014_Meeting

   (slides from the LIDER WS are now also linked)

   dave: felix had reached out to wikipedia translation group,
   alolita sharma came with a large team of wikimedia folks
   ... wikipedia is now trying to help people to translate pages
   directly
   ... they are looking into tools, machine translation and other
   technologies
   ... they are also interested in data in other languages
   ... there is now also wikidata for creating data directly
   ... related to dbpedia which is about extracting data from
   content
   ... does anybody want to add anything about this?
   ... that was 1st day - 2nd day focused more on linked data and
   language technology
   ... we had several presentations from LIDER but also other (EU)
   projects
   ... we had discussions about data + metadata aspects of
   language resources
   ... several people from LD4LT were here, Christian Chiarcos
   from OWLG, Stelios Piperidis and Marta Villegas presenting
   about META-SHARE
   ... and many others. had good discussions
   ... had a good opportunity to talk about representations of
   language resource metadata
   ... good side discussions with EU and publications office about
   what they will do
   ... they are planning to publish parallel documents with fine
   grained identifiers - they are interested in the RDF version of
   that
   ... these are legal texts - high quality because of the domain
   ... covering many EU languages

   asun: one of the other outcomes:
   ... we need more cooperation with open knowledge foundation
   ... importance of having pure linguistic resources instead of
   having domain-dependent language resources

   dave: agree
   ... we did not have Christian Chiarcos involved here so far
   ... these discussions will also continue at LREC related events
   I assume

   <MartinBenjamin> There have been some experiments with
   Wikipedia translation to Swahili that have been less than
   successful, using the Google Translate Toolkit. The biggest
   problem comes from running English articles through Google
   Translate, which is absolutely horrible for Swahili.

Reaching agreement on core LR metadata ontologies, with META-SHARE

   related issue-2

   <daveL__>
   [20]http://lists.w3.org/Archives/Public/public-ld4lt/2014Apr/00
   11.html

     [20] http://lists.w3.org/Archives/Public/public-ld4lt/2014Apr/0011.html

   dave: had input from Jorge, thread is listed at
   [21]https://www.w3.org/community/ld4lt/track/issues/2

     [21] https://www.w3.org/community/ld4lt/track/issues/2

   <daveL__>
   [22]http://lists.w3.org/Archives/Public/public-ld4lt/2014Apr/00
   17.html

     [22] http://lists.w3.org/Archives/Public/public-ld4lt/2014Apr/0017.html

   dave: meta-share ontology is very comprehensive already

   <MartinBenjamin> Big problem with using Wiki data is that
   things are written in wiki markup - no stable reference point
   to link anything below the article level. This is a disaster
   for trying to link Wiktionary data

   dave: issue is: how will this be transformed into an ontology
   to use in the linked data area
   ... jorge has set up a related link on the website

   <jorge>
   [23]https://www.w3.org/community/ld4lt/wiki/Meta-Share_OWL_meta
   model

     [23] https://www.w3.org/community/ld4lt/wiki/Meta-Share_OWL_metamodel

   <daveL__>
   [24]https://www.w3.org/community/ld4lt/wiki/Meta-Share_OWL_meta
   model

     [24] https://www.w3.org/community/ld4lt/wiki/Meta-Share_OWL_metamodel

   jorge: in the wiki I added materials about meta-model
   description and the preliminary RDF version made by UPF
   ... we discussed - how to work on this in a collaborative
   manner
   ... I talked about several options: wiki, protege, gdocs
   ... it seems people are OK with gdocs spreadsheet
   ... I put the gdrive doc in a mode so you can share it with
   everybody

   <daveL__>
   [25]https://docs.google.com/spreadsheets/d/15SE4_qAqYFostmD52uK
   xpkCPZh1f5TrPeoXKNTlDYpQ/edit#gid=0

     [25] https://docs.google.com/spreadsheets/d/15SE4_qAqYFostmD52uKxpkCPZh1f5TrPeoXKNTlDYpQ/edit#gid=0

   jorge: wanted to check if this is suitable for everybody

   <MartinBenjamin> On the other hand, Wikipedia has a great
   hidden multilingual terms feature - interwiki links. Terms like
   "Down syndrome", or movie titles, etc, are very difficult to
   find in other languages. But if you go to the Wikipedia page
   for the topic, then follow the interwiki link to the page in
   your target language, the concept as expressed in that language
   is usually the article title or is high at the top. The problem
   again is how to exploit this a[CUT]

   jorge: if that's ok I can fill this with the current state,
   that is the ontology made at upf

   roberto: is this already populated?

   jorge: no, this is just a skeleton - if people agree I would
   add it

   dave: jorge, did you have discussions about what modules there
   might be?
   ... the current ontology e.g. has dublin core and others
   ... then there are many meta-share items
   ... did you have discussions how to reflect these in separate
   namespaces / modules?

   jorge: we just have discussed to keep the meta-share module as
   is
   ... some part can be improved, e.g. about licenses

   penny: had some audio issues - what are you discussing
   currently?
   ... I'm not an RDF expert so looking into this now
   ... we are now looking into what upf did
   ... so cutting the meta-share ontology into modules could
   improve the model a lot

   jorge: do you have the gdrive spreadsheet in front of you - is
   it fine with you to work with this?

   penny: yes, let me check

   <daveL__> for navigating the original meta-share schema there
   is a useful structure at:

   <daveL__>
   [26]http://www.meta-share.org/portal/knowledgebase/HomePage

     [26] http://www.meta-share.org/portal/knowledgebase/HomePage

   <jorge>
   [27]https://www.w3.org/community/ld4lt/wiki/Meta-Share_OWL_meta
   model

     [27] https://www.w3.org/community/ld4lt/wiki/Meta-Share_OWL_metamodel

   <daveL__>
   [28]https://docs.google.com/spreadsheets/d/15SE4_qAqYFostmD52uK
   xpkCPZh1f5TrPeoXKNTlDYpQ/edit#gid=0

     [28] https://docs.google.com/spreadsheets/d/15SE4_qAqYFostmD52uKxpkCPZh1f5TrPeoXKNTlDYpQ/edit#gid=0

   <scribe> ACTION: jorge to fill the gdocs with the current
   meta-share items [recorded in
   [29]http://www.w3.org/2014/05/15-ld4lt-minutes.html#action01]

   <trackbot> Created ACTION-2 - Fill the gdocs with the current
   meta-share items [on Jorge Gracia - due 2014-05-22].

   penny: we are already discussing some updates
   ... we should take that into account

   jorge: so you mean it would make sense to add a module for
   services

   maria: we were thinking about adding another module for
   collections
   ... to cater for loose collections of data
   ... that should be identified by themselves
   ... we have not reached a final decision on that

   jorge: so meta-share community does not have a final consensus
   on this

   <daveL__> felix: can a stable version of the schema be
   identified

   felix: would it be possible to identify a version that we use
   as the basis for conversion?

   maria: version 3.0 - marta has already worked on converting that
   to RDF
   ... there will be a minor update around LREC

   asun: a few things related to the process:
   ... and how to record some kind of extra information
   ... during the following weeks we will raise many issues
   related to models etc.
   ... at some point we should decide: which are the core
   properties
   ... based on that we can start to extend with other terminology
   that is not in the core but also important
   ... I would suggest to make the core minimal and try to extend
   with other items
   ... I would include in the gdocs spreadsheet a new column: candidate
   vocabularies that could be re-used for representing meta-share
   terms

   jorge: agree

   asun: third comment:
   ... we should record proposal names, e.g. to know
   "computational lexicon with property has been proposed by
   Jorge"
   ... so that we see who had made a proposal
   ... and final comment:
   ... how to relate this with W3C notes that we are writing
   ... at the moment there is no argumentation
   ... at some point it would be good to have rationale of
   decisions together with the term agreed
   ... otherwise in conference calls like this we could end up
   repeating the same discussions

   felix: how about having another gdocs (a word doc)

   dave: using mailing list?

   felix: sure, for discussions, but for document writing gdocs
   helps

   dave: agree

   asun: lots of mail is ok but having the rationale documented
   in one place helps
   ... starting a big discussion by mail makes it difficult to follow

   penny: really like asun's point
   ... could we have an issue tracker?

   asun: we can have a column to store the discussion

   [30]https://www.w3.org/community/ld4lt/track/issues

     [30] https://www.w3.org/community/ld4lt/track/issues

   <tcarrasco> Email consensus should be consolidated into a
   proper document - it might be appropriate to have a couple of
   editors

   felix: we could use the w3c tracker, I would volunteer to keep
   that up to date with issues that have been discussed
   ... as an output of today's discussion I'd close issue-2, the
   work approach

   penny: e.g. we said meta-share version 3.0 is stable, that is
   something to track

   jorge: in the example we could just say in a column:
   "this comes from the meta-share model"
   ... for this first version most of the stuff will be authored
   by meta-share
   ... then we could add (using the same column) new things

   dave: 2nd comment: what should go into core?
   ... you have indicated that licenses would go out of the core
   ... are there other natural groupings
   ... e.g. usages, classification of resources etc.
   ... should this stay in the core?

   maria: usages could be left out
   ... but maybe first let's have a look at the model and then
   come up with concrete suggestions

   felix: how about timeline expectations from the lider project?

   asun: we try to reach agreement quite soon
   ... during LREC we will approach CLARIN + LRE map people
   ... trying to involve them in the discussion
   ... if this community building works we should reach agreement
   by September
   ... then additional stuff could be added later

   <asungomezperez> Yes, end of July instead of September

   jorge: in the last LIDER meeting we said: we'd like to have a
   draft of the core by the end of July

   <asungomezperez> sorry .... too many dates in my head

   dave: indeed, that will encourage people to contribute to LD4LT
   ... having a bit of work done on the core will encourage people
   from industry to bring in their ideas
   ... would be good for this community group + the meta-share
   community as well
   ... would be interesting, penny, if we forward discussion to
   the meta-share community as well
   ... e.g. to see what parts in meta-share are stable / will
   evolve in the next months etc.
   ... that will influence the discussion on what should be the
   core

   penny: agree
   ... we have plenty of ideas to improve the whole thing

Guidelines for migrating existing LR metadata into RDF

   dave: from LIDER are there any technical pointers we could
   provide?
   ... we have a related LIDER work area "reference architecture"

   roberto: on the modeling aspect we could provide experience,
   e.g. about babelnet > RDF conversion
   ... at the moment there is only the slides from madrid and a
   short report

   dave: roberto, can you send that to the LD4LT list so that
   people can have a look?

   roberto: sure, will try to structure a bit more and then send
   it out

   jorge: what roberto is working on is the data conversion, but
   on the agenda there is the metadata aspect

   roberto: I could focus on the metadata aspect

   jorge: your input roberto on the data aspect we are working on
   in the bpmlod group would be great

   dave: there is the bpmlod group and the "data on the web" w3c
   best practices group people should be aware of

   tomas: that is a W3C working group

   <tcarrasco> [31]http://www.w3.org/2013/dwbp/wiki/Main_Page

     [31] http://www.w3.org/2013/dwbp/wiki/Main_Page

   <tcarrasco> Data on the Web Best Practices Working Group (DWBP
   WG)

   <scribe> ACTION: roberto to send out information on
   architecture for converting (meta)data into rdf [recorded in
   [32]http://www.w3.org/2014/05/15-ld4lt-minutes.html#action02]

   <trackbot> Created ACTION-3 - Send out information on
   architecture for converting (meta)data into rdf [on Roberto
   Navigli - due 2014-05-22].
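
   As a rough illustration of the metadata side of this topic (not
   material presented on the call), converting one language resource
   metadata record into RDF could look like the sketch below. It
   writes N-Triples by hand and assumes, purely for the example, that
   a record is typed as dcat:Dataset and described with Dublin Core
   terms. The input keys ("name", "description", "license_url") and
   the example.org URIs are hypothetical; the group had not agreed on
   any such mapping.

```python
# Namespaces of existing vocabularies (DCAT, Dublin Core terms, RDF).
DCT = "http://purl.org/dc/terms/"
DCAT = "http://www.w3.org/ns/dcat#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Assumed mapping from hypothetical record keys to properties.
LITERAL_PROPERTIES = {
    "name": DCT + "title",
    "description": DCT + "description",
}


def escape(text):
    """Escape backslashes, quotes and newlines for an N-Triples literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def record_to_ntriples(uri, record):
    """Return N-Triples lines describing one metadata record."""
    lines = [f"<{uri}> <{RDF_TYPE}> <{DCAT}Dataset> ."]
    for key, prop in LITERAL_PROPERTIES.items():
        if key in record:
            lines.append(f'<{uri}> <{prop}> "{escape(record[key])}" .')
    if "license_url" in record:
        # The license is linked as a resource, not stored as a string.
        lines.append(f"<{uri}> <{DCT}license> <{record['license_url']}> .")
    return "\n".join(lines)


if __name__ == "__main__":
    example = {"name": "Example lexicon", "license_url": "http://example.org/licence"}
    print(record_to_ntriples("http://example.org/lr/1", example))
```

   Keeping the key-to-property mapping in one table means an agreed
   core vocabulary could later replace the assumed properties without
   touching the conversion logic.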

other issues + call time

   dave: no discussion of other items today
   ... call time - what to do?

   tomas: slot in afternoon better for US people

   asun: agree, afternoon much better for this call for getting US
   people in
   ... thursday afternoon is good for me

   <asungomezperez> I cannot on tuesday afternoon because of
   teaching

   <scribe> ACTION: dave to set up doodle poll for call time
   [recorded in
   [33]http://www.w3.org/2014/05/15-ld4lt-minutes.html#action03]

   <trackbot> Error finding 'dave'. You can review and register
   nicknames at
   <[34]http://www.w3.org/community/ld4lt/track/users>.

     [34] http://www.w3.org/community/ld4lt/track/users

   <tcarrasco> Multilingual Electronic Dossier (MED) -
   [35]http://joinup.ec.europa.eu/site/med

     [35] http://joinup.ec.europa.eu/site/med

   <scribe> ACTION: david to set up doodle poll for call time
   [recorded in
   [36]http://www.w3.org/2014/05/15-ld4lt-minutes.html#action04]

   <trackbot> Created ACTION-4 - Set up doodle poll for call time
   [on David Lewis - due 2014-05-22].

   dave: reminder - after dublin there will be locworld workshop
   feisgiltt
   ... historically XML focused, this time more linked data
   focused
   ... esp. morning of 4th june
   ... terminology and linked data will be an important topic here
   too

aob

   dave: thanks a lot for participating, great participation in the
   call
   ... people please speak up to make contribution for you and
   others, that is what the group is for
   ... thanks all, bye!

Summary of Action Items

   [NEW] ACTION: david to set up doodle poll for call time
   [recorded in
   [38]http://www.w3.org/2014/05/15-ld4lt-minutes.html#action04]
   [NEW] ACTION: jorge to fill the gdocs with the current
   meta-share items [recorded in
   [39]http://www.w3.org/2014/05/15-ld4lt-minutes.html#action01]
   [NEW] ACTION: roberto to send out information on architecture
   for converting (meta)data into rdf [recorded in
   [40]http://www.w3.org/2014/05/15-ld4lt-minutes.html#action02]

   [End of minutes]
     __________________________________________________________


    Minutes formatted by David Booth's [41]scribe.perl version
    1.138 ([42]CVS log)
    $Date: 2014-05-15 09:31:33 $
     __________________________________________________________

     [41] http://dev.w3.org/cvsweb/~checkout~/2002/scribe/scribedoc.htm
     [42] http://dev.w3.org/cvsweb/2002/scribe/

Received on Thursday, 15 May 2014 09:34:42 UTC