Warning:
This wiki has been archived and is now read-only.

Chatlog 2010-04-29 Data Catalog Vocabulary

From W3C eGovernment Wiki
Jump to: navigation, search

See CommonScribe Control Panel, original RRSAgent log and preview nicely formatted version.

Please justify/explain all edits to this page, in your "edit summary" text.

<sandro> Guest: Bobbin Teegarden,
<sandro> Guest: William (ww) Waites
<sandro> Guest: Paul (PaulZH) Hermans
<sandro> Present: Brand_Niemann, David_James, Ed_Summers, Fadi_Maali, George_Thomas, Jon_Phipps, Li_Ding, Luigi_Montanez, Martín_Álvarez, Michael_Hausenblas, Richard_Cyganiak, Vassilios Peristeras
15:01:16 <RRSAgent> RRSAgent has joined #egov
15:01:16 <RRSAgent> logging to http://www.w3.org/2010/04/29-egov-irc
15:01:18 <trackbot> RRSAgent, make logs public
15:01:20 <trackbot> Zakim, this will be EGOV
15:01:20 <Zakim> ok, trackbot, I see T&S_EGOV(LD TECH)10:00AM already started
15:01:21 <trackbot> Meeting: eGovernment Interest Group Teleconference
15:01:21 <trackbot> Date: 29 April 2010
15:01:30 <cygri> nice. thanks mhausenblas!
15:01:54 <cygri> agenda: http://www.w3.org/egov/wiki/Data_Catalog_Vocabulary/2010-04-29
15:02:06 <edsu> Zakim, LC is edsu
15:02:06 <Zakim> +edsu; got it
15:02:10 <mhausenblas> RRSAgent, draft minutes
15:02:10 <RRSAgent> I have made the request to generate http://www.w3.org/2010/04/29-egov-minutes.html mhausenblas
15:02:12 <Zakim> +??P7
15:02:25 <cygri> Chair: cygri
15:02:31 <cygri> Scribe: edsu
15:02:52 <cygri> zakim, who is here?
15:02:52 <Zakim> On the phone I see ??P3, +1.202.955.aaaa, martin, George, +03539149aabb (muted), edsu, ??P7
15:02:54 <Zakim> On IRC I see RRSAgent, Zakim, LuigiMontanez, PaulZH, george, jonphipps, martin, fadi, cygri, n8fr8__, mhausenblas, hughb, ww, edsu, sandro, trackbot
15:03:10 <Zakim> +jonphipps
15:03:29 <lindstream> lindstream has joined #egov
15:03:32 <Zakim> +??P8
15:03:57 <n8fr8__> n8fr8__ has left #egov
15:04:03 <Zakim> - +03539149aabb
15:04:09 <vassilios> vassilios has joined #egov
15:04:12 <DavidJames> DavidJames has joined #egov
15:04:46 <edsu> that's dc area
15:04:54 <Zakim> + +1.425.378.aacc
15:05:23 <Zakim> + +03539149aadd
15:06:00 <edsu> ww: are you on the call?
15:06:09 <PaulZH> I'm also calling IP
15:06:56 <bobbin> bobbin has joined #egov
15:06:57 <cygri> zakim, who is talking?
15:07:08 <Zakim> cygri, listening for 10 seconds I heard sound from the following: ??P3 (9%), ??P8 (9%)
15:07:18 <fadi> zakim, aabb is fadi
15:07:18 <Zakim> sorry, fadi, I do not recognize a party named 'aabb'
15:07:29 <cygri> zakim, who is on the phone?
15:07:29 <Zakim> On the phone I see ??P3, +1.202.955.aaaa, martin, George, edsu, ??P7, jonphipps, ??P8, +1.425.378.aacc (muted), +03539149aadd (muted)
15:07:46 <cygri> zakim, aadd is fadi
15:07:46 <Zakim> +fadi; got it
15:08:09 <vassilios> vassilios also on IP phone
15:09:08 <cygri> zakim, aaaa is LuigiMontanez
15:09:08 <Zakim> +LuigiMontanez; got it
15:09:16 <cygri> zakim, who is here?
15:09:16 <Zakim> On the phone I see ??P3, LuigiMontanez, martin, George, edsu, ??P7, jonphipps, ??P8, +1.425.378.aacc (muted), fadi (muted)
15:09:18 <Zakim> On IRC I see bobbin, DavidJames, vassilios, lindstream, RRSAgent, Zakim, LuigiMontanez, PaulZH, george, jonphipps, martin, fadi, cygri, mhausenblas, hughb, ww, edsu, sandro,
15:09:20 <Zakim> ... trackbot
15:09:58 <edsu> Topic: Admin
15:10:21 <edsu> cygri: agenda http://lists.w3.org/Archives/Public/public-egov-ig/2010Apr/0084.html
15:10:51 <edsu> ... we'll have a presentation from martin of ctic, and hopefully one from li ding of rpi
15:11:23 <edsu> ... time permitting i'd like to talk about the scope and deliverables of the group 
15:11:34 <edsu> ... and hopefully write those down on the wiki
15:11:39 <edsu> Topic: Demos
15:12:13 <cygri> wiki page: http://www.w3.org/egov/wiki/Data_Catalog_Vocabulary/Datos_de_Asturias
15:12:19 <edsu> martin: here at ctic we develop an open data solution in spain, a good proof of concept of the linked data solution
15:12:59 <edsu> ... here you can see some metadata based on the VoID and Dublin Core vocabularies
15:13:24 <edsu> ... you can see license and provenance information, and the available data formats
15:13:47 <edsu> ... the generation of the catalog is using sparql and xslt
15:14:29 <martin> http://www.w3.org/egov/wiki/Data_Catalog_Vocabulary/CTIC_Dataset_List
15:14:30 <edsu> ... we have a list of catalogs (a meta catalog)
15:15:16 <fadi> fadi has joined #egov
15:15:23 <edsu> ... similar to the deri dcat approach, title, description, license
15:15:28 <martin> http://datos.fundacionctic.org/sandbox/catalog/feeds/catalogs_en.rss
15:15:40 <martin> http://datos.fundacionctic.org/sandbox/catalog/map.html.en
15:16:06 <edsu> ... also have a rss feed of the catalogs, which is used to geenrate a map of the catalogs
15:17:18 <edsu> cygri: you have used void for the data catalog, are the datasets in the catalog all available as rdf? this is not the case for most catalogs (e.g. data.gov)
15:17:38 <edsu> ... do you think it makes sense to use void when the dataset is not rdf?
15:18:21 <edsu> martin: i think void is by definition only for rdf data sets ... so i think no?
15:18:52 <edsu> cygri: that's what void was made for originally, but it possibly could be applied in a broader scope ... but i take your answer as saying you'd only use it for rdf data sets?
15:19:04 <edsu> martin: i think void could be useful
15:19:23 <cygri> zakim, who is here
15:19:23 <Zakim> cygri, you need to end that query with '?'
15:19:29 <edsu> mapstraction.addOverlay('http://datos.fundacionctic.org/sandbox/catalog/feeds/catalogs_en.rss');
15:19:35 <cygri> zakim, who is here?
15:19:35 <Zakim> On the phone I see ??P3, LuigiMontanez, martin, George, edsu, ??P7, jonphipps, ??P8, +1.425.378.aacc (muted), fadi (muted)
15:19:37 <Zakim> On IRC I see fadi, bobbin, DavidJames, vassilios, lindstream, RRSAgent, Zakim, LuigiMontanez, PaulZH, george, jonphipps, martin, cygri, mhausenblas, hughb, ww, edsu, sandro,
15:19:40 <Zakim> ... trackbot
15:20:34 <Zakim> + +1.202.564.aaee
15:21:27 <edsu> cygri: do we have a new attendee from dc?
15:21:48 <edsu> cygri: ok, it's brandt from epa in washington
15:21:54 <cygri> zakim, aaee is brand
15:21:54 <Zakim> +brand; got it
15:23:04 <edsu> bobbin: i'm on the call too
15:23:15 <cygri> zakim, aacc is bobbin
15:23:15 <Zakim> +bobbin; got it
15:23:56 <cygri> zakim, who is here?
15:23:56 <Zakim> On the phone I see ??P3, LuigiMontanez, martin, George, edsu, ??P7, jonphipps, ??P8, bobbin, fadi, brand
15:23:59 <Zakim> On IRC I see fadi, bobbin, DavidJames, vassilios, lindstream, RRSAgent, Zakim, LuigiMontanez, PaulZH, george, jonphipps, martin, cygri, mhausenblas, hughb, ww, edsu, sandro,
15:24:01 <Zakim> ... trackbot
15:24:25 <DavidJames> LuigiMontanez and I are sharing a conference phone
15:24:43 <cygri> zakim, davidjames is with luigimontane
15:24:43 <Zakim> +davidjames; got it
15:24:50 <cygri> zakim, davidjames is with luigimontanez
15:24:50 <Zakim> davidjames was already listed in LuigiMontanez, cygri
15:24:51 <bobbin> Bobbin Teegarden, semantic consultant in Seattle
15:25:14 <edsu> cygri: seems like Li isn't going to make the call
15:25:21 <edsu> Topic: Deliverables
15:26:37 <edsu> cygri: there wasn't a clear consensus at the end of the last call, about defining an abstract vocabulary about the data you find in catalogs, and then we could have several different documents that explain how you could use these terms in a concrete syntax: rdf/xml, rdfa, atom, etc
15:26:54 <edsu> ... a couple of points were raised on the mailing list
15:27:13 <edsu> ... erik suggested we should be clear about the types of serices we want to enable
15:27:27 <edsu> ... so that's where i would like to pick up
15:27:40 <DavidJames> we're getting a lot of echo and delay on the line
15:27:41 <LI_DING> LI_DING has joined #egov
15:28:20 <edsu> cygri: i sent an email http://lists.w3.org/Archives/Public/public-egov-ig/2010Apr/0083.html
15:28:50 <edsu> ... one deliverable might be use cases and requirements
15:29:20 <edsu> ... a second thing could be an rdf vocabulary and reference that lists the terms, which required/optional, examples
15:29:56 <edsu> ... it would make sense to use some sort of formalism, uml, or perhaps rdf schema, since it will give us one possible way of deploying dcat data
15:30:55 <edsu> ... in the email the choice of concrete format maybe we should not decide on this yet ... if we have some clear requirements for using syndication formats for example ; or if we can do everything with rdfa -- it's hard to say before we know what requirements we have
15:31:23 <edsu> ... do you think we could delay the decision about the concrete format until we do use cases requuirements?
15:31:23 <Zakim> + +1.518.495.aaff
15:31:34 <Zakim> - +1.518.495.aaff
15:31:50 <PaulZH> I do agree to postpone the decision
15:32:38 <Zakim> + +1.518.495.aagg
15:32:47 <edsu> edsu: i agree in principle, but it will take more time
15:32:52 <LI_DING> hi 
15:32:55 <LI_DING> li is joining
15:33:25 <edsu> Zakim, +1.518.495.aagg is LI_DING 
15:33:25 <Zakim> +LI_DING; got it
15:34:04 <edsu> LI_DING: we're reviewing cygri's proposal at http://lists.w3.org/Archives/Public/public-egov-ig/2010Apr/0083.html
15:34:55 <edsu> cygri: i think by writing down use cases and requirements we'll get a better understanding about technical choices
15:35:30 <edsu> LI_DING: what is the current status, discussion the requirements?
15:36:27 <edsu> cygri: yes, basically we're discussing whether we should abstract out of the concrete syntax, and write use cases and requirements
15:37:22 <edsu> ... there doesn't seem to be a lot of disagreement or agreement with this
15:37:55 <edsu> LI_DING: i want to hear more voices first i think
15:38:11 <cygri> zakim, who is on the phone
15:38:11 <Zakim> I don't understand 'who is on the phone', cygri
15:38:23 <cygri> zakim, who is on the phone?
15:38:23 <Zakim> On the phone I see ??P3, LuigiMontanez, martin, George, edsu, ??P7, jonphipps (muted), ??P8, bobbin, fadi (muted), brand, LI_DING
15:38:25 <Zakim> LuigiMontanez has LuigiMontanez, davidjames
15:38:58 <DavidJames> Regarding http://lists.w3.org/Archives/Public/public-egov-ig/2010Apr/0083.html ... I like the list of requirements.
15:39:49 <DavidJames> The deliverables also look good to me.
15:39:55 <edsu> LI_DING: i'd need more time to review the proposal, but i think we want to keep things as minimal as possible to get something out there quick, also i think we should keep use cases simple, how much do we want to get?
15:40:19 <edsu> ... we should try to produce something actionable
15:41:37 <edsu> cygri: i don't hear any strong objection to moving forwards with the deliverables i outlined in the email ; for now we should continue working with this assumption
15:41:54 <LI_DING> one thing could be a challenge
15:42:03 <DavidJames> Can we ask everyone to vote (or something similar)?
15:42:34 <edsu> LI_DING: maybe we can have a challenge a deliverable: how many gov't data sets can you find?
15:43:16 <edsu> ... lets grow the number of government data sets, more people using the vocabulary
15:43:45 <edsu> cygri: but what does this mean for the format we want to design?
15:44:10 <vassilios> i think it is not a matter of number of datasets but number of catalogues
15:45:07 <Zakim> -brand
15:45:22 <edsu> LI_DING: it's a parallel effort: one side we grow the datasets, and on the other we develop the terminology, vocabulary that can be used
15:46:16 <edsu> cygri: DavidJames you asked whether we should do a vote, i'm not sure what point this would serve
15:46:33 <edsu> ... since there doesn't seem to be any strong disagreement
15:46:43 <cygri> ack DavidJames
15:46:48 <edsu> DavidJames: i think it would be nice to hear the agreement
15:47:22 <edsu> cygri: i think this is something i'd like to do on the next call, rather than rushing it
15:47:35 <edsu> ... to make sure there is enough time
15:47:49 <edsu> Topic: Use Cases
15:48:56 <edsu> cygri: we want to have use case or stories for what people want to be able to do: for example i want to be able to get a list of datasets from data.gov that are not supported by the current data.gov user interface
15:49:22 <edsu> ... i'd be interested to hear about what use cases people on the call have
15:49:32 <edsu> ... for standard formats for data catalogs
15:49:42 <Zakim> -LI_DING
15:50:41 <edsu> martin: i'd like to be able to query multiple catalogs, to be able to query other geographic regions
15:51:16 <vassilios> cross-catalogue querying
15:51:25 <edsu> cygri: so you want to be able to federate national catalogs into say a european union catalog?
15:51:47 <edsu> s/martin/vassilios/
15:52:00 <edsu> vassilios: yes
15:52:21 <george> (fed/state/local) gov agency independently published catalog using (some set of standard) vocab for dynamic aggregation of catalogs
15:52:55 <george> would like to see bottom up (as per Li - in use) drive basic (extensible) rdfs vocab convergence for w3c blessing
15:53:00 <edsu> DavidJames: for the nationaldatacatalog we'd like to consume data from other data catalogs, we'd like to have one uniform way of getting at catalog data
15:53:33 <edsu> ... for municipalities and state governments we'd like to do a similar thing, enabling federation across many levels
15:54:38 <edsu> george: i think i share the interest that david and luigi have ; would be great to have some sort of vocabulary at the w3c that reflects what's in use as much as possible
15:54:42 <edsu> cygri: agreed
15:55:47 <edsu> martin: i think federation is soemthing i would like to do ; i agree with vassilios with his approach
15:55:56 <LI_DING> I was dropped from the call
15:56:46 <edsu> cygri: it might be a good next step to draft this use case as a little document, to sketch how it could look in practice as a story
15:56:47 <vassilios> i could help on this
15:57:14 <edsu> ... there are a couple of other use cases on the mailing list, maybe we can get those into written form too
15:57:27 <vassilios> OK
15:57:55 <vassilios> and i thin kit is more or less the same whether we speak about EU or USA
15:58:03 <edsu> cygri: would be great if you could start it, and we could see if other use cases are covered
15:58:33 <edsu> ACTION: vassilios to document a federation use case for dcat
15:58:33 <trackbot> Could not create new action (unparseable data in server response: No child element named 'id') - please contact sysreq with the details of what happened.
15:58:48 <george> +1
15:58:48 <LuigiMontanez> yes, I have access
15:58:50 <LuigiMontanez> +1
15:58:52 <DavidJames> +1
15:58:56 <edsu> +1
15:58:56 <martin> +1
15:58:58 <fadi> +1
15:59:03 <edsu> cygri: wiki is ok?
15:59:17 <cygri> http://www.w3.org/egov/wiki/Data_Catalog_Vocabulary/Use_Cases_and_Requirements
15:59:36 <edsu> cygri: great, there is a page where i've copy pasted stuff from the email, i propose we add subsections there, use this as a live document
15:59:53 <edsu> cygri: so that's it for this week, thank you all for joining in
15:59:56 <DavidJames> thanks
16:00:00 <george> thank you cygri
16:00:01 <cygri> thanks all!
16:00:06 <LuigiMontanez> thank you cygri, edsu for transcribing
16:00:06 <vassilios> thank you!
16:00:13 <Zakim> -George
16:00:17 <Zakim> -??P3
16:00:18 <george> thanks edsu
16:00:21 <Zakim> -LuigiMontanez
16:00:23 <Zakim> -fadi
16:00:25 <Zakim> -jonphipps
16:00:25 <edsu> np :-)
16:00:29 <Zakim> -edsu
16:00:31 <Zakim> -??P8
16:00:37 <Zakim> -martin
16:00:39 <edsu> RRSAgent, generate minutes
16:00:39 <RRSAgent> I have made the request to generate http://www.w3.org/2010/04/29-egov-minutes.html edsu
16:00:51 <Zakim> -bobbin
16:01:05 <cygri> ok, chairing a teleconference from a conference hallway is a bad idea. i won't try that again :-)
16:01:19 <edsu> cygri: heheh, any news from www2010?
16:01:21 <george> wish i was there!
16:01:29 <LuigiMontanez> LuigiMontanez has joined #egov
16:01:36 <bobbin> zakim?
16:02:08 <cygri> edsu, yes, many interesting things going on. great conference so far
16:02:10 <edsu> shoot, the minutes don't look to good
16:02:17 <edsu> http://www.w3.org/2010/04/29-egov-minutes.html
16:02:31 <cygri> bobbin, what do you want zakim to do?
16:02:51 <bobbin> +1
16:03:19 <edsu> cygri: it does look good
16:03:26 <edsu> (the conference)
16:03:40 <edsu> cygri: have you and Li had a chance to chat about stuff informally?
16:03:40 <cygri> edsu, the minutes look good to me?
16:03:51 <edsu> oh, they do look better now when i reloaded
16:04:02 <cygri> edsu, yes we had dinner yesterday and are planning a few things
16:04:08 <edsu> cygri: nice
16:04:26 <edsu> cygri: i think having you and Li on the same page will help a great deal
16:04:30 <Zakim> -??P7
16:04:31 <Zakim> T&S_EGOV(LD TECH)10:00AM has ended
16:04:32 <Zakim> Attendees were +1.202.955.aaaa, martin, George, +03539149aabb, edsu, jonphipps, +1.425.378.aacc, +03539149aadd, fadi, LuigiMontanez, +1.202.564.aaee, brand, bobbin, davidjames,
16:04:35 <Zakim> ... +1.518.495.aaff, LI_DING
16:05:19 <DavidJames> DavidJames has left #egov
16:06:27 <cygri> edsu, agreed
16:07:03 <cygri> will go now and see if they left any lunch for me
16:07:10 <cygri> thanks for scribing and the comments ed
16:18:33 <lindstream> lindstream has left #egov
17:16:50 <cygri> cygri has joined #egov
17:40:23 <ww> edsu: sorry wasn't able to make it today
17:41:05 <ww> would have liked to bring up the spatial/temporal ambiguity of dcat:granularity though
17:50:49 <DavidJames> DavidJames has joined #egov
17:56:42 <ww> cygri: do you think it would be reasonable to distinguish between spatial and temporal granularity/resolution in dcat?
17:58:11 <ww> (nevermind datasets that have different granularity on their different axes, which happens with some envionmental satellite datasets for example)
17:58:12 <cygri> ww, i'll have to talk to fadi to remember why we didn't distinguish them. can i get back to you about this next week?
17:58:27 <ww> sure no hurry
18:01:24 <Zakim> Zakim has left #egov
18:04:36 <ww> any thoughts on rdf:graph, rdf:Graph?
18:09:50 <cygri> ww, what are those?
18:10:21 <ww> well, they don't exist, which I think is problematic
18:10:51 <ww> using the talis changeset, vocabulary, for example, changes are expressed as additions and removals of reified triples
18:11:10 <ww> i need a way to express the graph that those triples are supposed to live in
18:11:19 <ww> (not strictly related to egov as such, of couse)
18:11:34 <ww> we have rdf:Statement, rdf:subject, rdf:predicate, rdf:object
18:11:40 <ww> but no rdf:graph
18:12:03 <ww> I suppose because I believe that named graphs were added after the rdf core vocabulary was etched in stone
18:13:19 <ww> for my immediate purposes I've invented a namespace for this, but...
19:33:35 <DavidJames> DavidJames has joined #egov
19:44:52 <cygri> cygri has joined #egov
20:54:51 <cygri> cygri has joined #egov
21:00:21 <cygri> cygri has joined #egov
# SPECIAL MARKER FOR CHATSYNC.  DO NOT EDIT THIS LINE OR BELOW.  SRCLINESUSED=00000253