12:50:23 RRSAgent has joined #ld4lt 12:50:24 logging to http://www.w3.org/2014/07/17-ld4lt-irc 12:50:29 meeting: LD4LT telco 12:50:32 chair: Dave 12:50:47 agenda: http://lists.w3.org/Archives/Public/public-ld4lt/2014Jul/0018.html 12:52:56 daveL has joined #ld4lt 12:55:18 Ali_H_Vahid has joined #ld4lt 12:55:50 Meeting: LD4LT community Group Call 12:56:04 chair: Dave Lewis 12:56:40 rrsagent, draft minutes 12:56:40 I have made the request to generate http://www.w3.org/2014/07/17-ld4lt-minutes.html daveL 12:57:18 Agenda: http://lists.w3.org/Archives/Public/public-ld4lt/2014Jul/0018.html 12:57:40 TizianoFlati has joined #ld4lt 12:57:56 jgracia has joined #ld4lt 12:58:43 Tcarrasco has joined #ld4lt 12:59:53 Renat has joined #ld4lt 13:00:03 present+ Renat 13:00:43 apologies, we just lost goto meeting for a minute, back now 13:00:54 I have made the request to generate http://www.w3.org/2014/07/17-ld4lt-minutes.html fsasaki 13:00:57 present+ fsasaki 13:00:59 Agenda: http://lists.w3.org/Archives/Public/public-ld4lt/2014Jul/0018.html 13:01:22 present+ Jorge, Victor 13:01:30 rnavigli has joined #ld4lt 13:01:34 present+ TizianoFlati 13:01:40 present+ DaveLewis 13:01:56 Serge has joined #ld4lt 13:01:56 Agenda: http://lists.w3.org/Archives/Public/public-ld4lt/2014Jul/0018.html 13:02:00 present+ M.T. Carrasco Benitez 13:02:02 regrets+ penny 13:02:20 topic: roll call 13:02:28 I have made the request to generate http://www.w3.org/2014/07/17-ld4lt-minutes.html fsasaki 13:02:48 s/roll call/agenda review/ 13:02:56 dave: this week want to focus on meta-share ontology 13:03:08 .. then will go through meta-share changes we did since last time 13:03:14 kevinkoidl has joined #ld4lt 13:03:36 .. and will cover suggestions by victor on licensing 13:03:53 present+ RobertoNavigli 13:03:55 .. penny is not here today, she had mailed comments on licensing, will go through those 13:04:07 topic: action items 13:04:16 action-5? 13:04:16 action-5 -- Víctor Rodríguez-Doncel to proposal for a license modue -- due 2014-06-19 -- OPEN 13:04:16 http://www.w3.org/community/ld4lt/track/actions/5 13:04:19 done 13:04:24 close action-5 13:04:24 Closed action-5. 13:04:25 present+ kevinkoidl 13:04:29 ACTION-7: Felix - Check with w3c groups if there are other approaches to represent languages as uris 13:04:29 Notes added to ACTION-7 Check with w3c groups if there are other approches to represent languages as uris. 13:04:31 action-7? 13:04:31 action-7 -- Felix Sasaki to Check with w3c groups if there are other approches to represent languages as uris -- due 2014-06-19 -- OPEN 13:04:31 http://www.w3.org/community/ld4lt/track/actions/7 13:04:40 http://lists.w3.org/Archives/Public/public-ld4lt/2014Jul/0004.html 13:04:44 close action-7 13:04:44 Closed action-7. 13:04:56 present+Ali_ H_Vahid 13:05:24 felix: ok to discuss with meta-share, hard to resolve in general 13:05:28 action-8? 13:05:28 action-8 -- David Lewis to Look into isa work related to dcat profiles and report back -- due 2014-07-10 -- OPEN 13:05:28 http://www.w3.org/community/ld4lt/track/actions/8 13:05:39 done, see mail from dave 13:05:41 victor_rodriguez has joined #ld4lt 13:05:46 close action-8 13:05:46 Closed action-8. 13:06:09 see mail at http://lists.w3.org/Archives/Public/public-ld4lt/2014Jul/0011.html 13:06:20 action-9? 13:06:20 action-9 -- Jorge Gracia to Implement changes in metashare spreadsheet -- due 2014-07-10 -- OPEN 13:06:20 http://www.w3.org/community/ld4lt/track/actions/9 13:06:30 close action-9 13:06:30 Closed action-9. 13:06:36 ACTION-10: Jorge - Identify some external vocabularies to use in ms 13:06:37 Notes added to ACTION-10 Identify some external vocabularies to use in ms. 13:06:38 dave: will discuss later in the call 13:07:04 jorge: will be covered during meta-share discussion 13:07:06 close action-10 13:07:06 Closed action-10. 13:07:21 dave: thanks to all for working on your action points :) 13:07:23 http://mlode2014.nlp2rdf.org/lider-roadmapping-workshop/ 13:07:26 topic: event announcements 13:07:38 dave: ld4lt / lider RM workshop in leipzig 13:08:11 http://www.w3.org/blog/International/2014/07/14/linked-data-meets-content-analytics-4th-lider-ld4lt-event-2nd-september-leipzig/ 13:08:21 Francesca has joined #ld4lt 13:08:29 felix: this event is looking at getting more input from the analytics use cases and needs for linked data 13:08:45 .. as there will be a lot of those companies there 13:09:17 dave: will be a good opportunity for people in this group to meet f2f and discuss content analytics and general topics 13:09:29 dave: then, we recently had a workshop in dublin at loc world 13:09:35 Fahad has joined #ld4lt 13:09:42 .. we have now an opportunity to repeat that in vancouver 13:09:48 .. will be in last week of october 13:10:00 .. so FYI, I'll send details around later 13:10:09 Topic: META-SHARE vocabulary CORE 13:10:15 I have made the request to generate http://www.w3.org/2014/07/17-ld4lt-minutes.html fsasaki 13:10:25 https://docs.google.com/spreadsheets/d/15SE4_qAqYFostmD52uKxpkCPZh1f5TrPeoXKNTlDYpQ/edit#gid=0 13:11:11 present+ serge, marta 13:11:28 jorge: modifications of the gdocs spreadsheet format: 13:11:36 .. I created new columns to put new information in 13:12:01 .. we keep track of the old information. these have been hidden as you can do in excel. Just click and the previous info appears 13:12:29 .. I added colors so that you can see what changed - this is shown in blue 13:12:51 .. in the discussion column: I, penny, dave, others have added comments for feeding the discussion 13:13:11 .. my proposal: go through the rows in the spreadsheet, re-read the discussion column, see what we can decide 13:13:23 .. I colored in red the discussions that may be more critical 13:13:34 .. propose to go through whole list of rows 13:13:37 dave: agree 13:13:47 jorge: set of classes are short 13:13:59 .. first: agent 13:14:20 .. proposed to use FOAF agent both for person and organization 13:14:32 .. see in the comment suggestion provenance agent 13:15:01 dave: using it by itself does not make sense, of course 13:16:20 jorge: for us, as a first step, I propose this, without prov ontology 13:16:22 dave: sure 13:16:36 jorge: now row six: there were some labels expressed as camel case 13:16:45 .. I changed this as separate words 13:16:56 .. also suggest to write labels in lower case 13:17:07 .. recommended to write labels as normal English 13:17:16 .. something to keep in mind when we do the clean version of this 13:17:32 .. for corpus: I removed disjointness, thought it is not useful 13:17:53 .. in row 10: corpus collection 13:18:08 .. penny explains that this value does not come from meta-share model 13:18:37 .. I say: we could introduce collection class of dublin core 13:18:47 .. need to check that with meta-share people if that fits with them 13:18:57 zotano has joined #ld4lt 13:19:04 (http://purl.org/dc/dcmitype/Collection 13:19:12 http://www.w3.org/TR/vocab-dcat/#vocabulary-overview 13:19:39 dave: in dcat there is the idea of a data set 13:19:48 .. that would be the language resource in our case 13:20:03 .. but it can also be a catalog, which can be a collection with data sets 13:20:44 .. so catalog rather than dct collection may be a better way of doing it 13:21:23 @@: for corpora you can have audio of the corpus and the transcript 13:21:24 Marta Villegas 13:21:29 .. in a sense you have two corpora 13:21:38 s/@@/marta/ 13:21:55 marta: so you need two instances of corpuse to encode both parts of the corpus 13:22:21 .. that is the idea: to build a higher node so that you can add more corpora inside 13:22:33 dave: so that is probably different than dcat catalogue 13:22:42 .. in your description a collection is a sub grouping 13:23:25 http://dublincore.org/documents/2012/06/14/dcmi-terms/?v=dcmitype#Collection 13:23:38 dave: about dublin core: it says "it is an aggregation + collection of resources" 13:23:52 .. so dct: collection maybe is more accurate 13:24:11 jorge: we have to decide: how to map dcat data sets with language resources 13:24:39 dave: so stay with collections as suggested 13:24:40 jorge: ok 13:26:16 present+ philippC, johnMcC 13:26:32 discussion on the definition of "corpus" 13:28:35 philipp: postpone discussion and decide later whether we define this as property or class 13:28:45 marta: in meta-share corpus you may have different media types 13:29:07 .. you can have audio media type or text part 13:29:33 .. penny can give us more info - it is not trivial to move from annotation schema to ontology at this pont 13:29:36 s/pont/point/ 13:30:12 john: two options: we map to other concepts, or we just represent what is in meta-share. what is the goal here? 13:30:35 jorge: the aim is closer to use what is in meta-share, and to convert that in owl ontology 13:30:45 .. meta-share is based on decades of discussion 13:31:24 john: so if in meta-share there is a corpus collection we use that, if not, we can use s.t. from a semantic web vocabulary 13:31:38 marta: corpus collection has been added, it is not in the original meta-share 13:31:59 john: if this is about alignment we should not have a new vocab that is not in meta-share? 13:32:12 dave: this is a first attempt to map existing xml format into rdf 13:32:26 .. that's slightly different to map a vocab into antother one 13:33:47 (scribe has a hard time to capture discussion, will see if there is a conclusion) 13:34:23 https://www.w3.org/community/ld4lt/wiki/Meta-Share_OWL_metamodel 13:34:45 http://www.meta-net.eu/meta-share/META-SHARE%20%20documentationUserManual.pdf 13:35:57 https://github.com/metashare/META-SHARE/tree/master/misc/schema/v3.0 13:36:36 marta: above schema is the latest version of the xml schema 13:38:04 Proposal - corpus: collection of linguistic data; it be in several media-types. Corpus can be: media-type homogenous or heterogeneos; monolingual or multilingual. 13:38:43 dave: this type of corpus are first class citizens in meta-share model? 13:38:45 marta: yes 13:38:49 s/dave/jorge/ 13:39:04 jorge: maybe good then to define this as first class entity 13:39:13 I have made the request to generate http://www.w3.org/2014/07/17-ld4lt-minutes.html fsasaki 13:46:06 Today, the relevant corpora today is n-lingual plain text 13:46:29 dave: how to wrap up the discussion on corpus definition, jorge? 13:46:35 jorge: let's move to license topic 13:46:58 .. one major issue to clarify: mapping between language resource and dcat data set and dcat distribution classes 13:47:01 .. this is still oepn 13:47:06 s/oepen/open/ 13:47:34 action: daveL to gather info on how to provide more detailed mapping from meta-share to dcat 13:47:34 Created ACTION-11 - Gather info on how to provide more detailed mapping from meta-share to dcat [on David Lewis - due 2014-07-24]. 13:47:34 Human annotation is realistic for small corpus - large corpus requires programatic processing for cleaning, annotation and other processing 13:47:47 jorge: agree, now let's move into licensese topic 13:47:59 Topic: META-SHARE vocabulary LICENSE 13:48:11 https://www.w3.org/community/ld4lt/wiki/Licensing_information 13:48:24 http://lists.w3.org/Archives/Public/public-ld4lt/2014Jul/0014.html 13:48:43 dave: wikipage from victor, penny sent above mail 13:49:09 Sound poor 13:49:21 victor: penny likes the approach and made some comments 13:49:31 .. she said: we should declare more precisely which elements we use 13:50:29 .. currently literals are plain strings. they should be replaced by URIs 13:50:39 .. penny says the license name has to be kept 13:50:43 .. use of URL is also ok 13:50:55 .. resources that have double licensing 13:51:02 .. should be supported, I agree 13:51:23 .. she discussed more information that should be there 13:51:30 .. penny has not reflected comments in wiki 13:51:46 .. I can do it for here or she can do it herself, I'll send a mail to her about that 13:51:56 .. next step will be to update spreadsheet 13:52:42 victor: we can use meta-share term, declaring odrl 13:52:51 .. connecting both via owl:sameAs 13:53:05 .. in martas translation I missed an element to aggregate license information 13:53:23 .. in martas model these properties were directly attributed to the resources 13:53:57 dave: is it necessary to have the aggregation? or can you retrieve that via sparql? 13:54:21 victor: if a resource has two licenses the properties will be related to license one or two 13:54:31 dave: ok 13:55:49 .. when I look at dcat I will take the discussion of multiple licsenses into account too 13:56:53 felix: when you have issues with dcat you may want to talk to phil archer directly, he is on top of things 13:57:07 dave: makes sense - about dcat we can make a wiki page 13:57:23 .. so that it is digestable for dcat people 13:57:27 felix: makes a lot of sense 13:57:33 +1 13:57:51 dave: so victor will lialise with penny and the wiki page 13:58:00 .. and then we can make changes to the actual spreadsheet 13:58:08 topic: moving forward 13:58:26 dave: do people want to have another call next thursday? 13:58:37 .. I won't be around but we could arrange it 13:59:12 people can do both weeks 13:59:28 dave: I could not chair next week but maybe somebody else can do that 13:59:45 .. trying to nail things down before we get to August 13:59:56 topic: process reminder 14:00:24 dave: we want to finish off spreadsheet, then a stable core part, and then handle that back to marta / penny to publish that on their own github 14:00:34 .. hope that we can get to that after the holidays 14:01:32 topic: aob 14:02:02 dave: we will arrange to have a call next week, assure that we can start the session, then another call in two weeks too 14:02:15 .. thanks to all for your efforts in the mail and here! 14:02:33 adjourned 14:02:39 I have made the request to generate http://www.w3.org/2014/07/17-ld4lt-minutes.html fsasaki 14:10:52 fsasaki has left #ld4lt