11:51:33 RRSAgent has joined #gld 11:51:33 logging to http://www.w3.org/2012/01/25-gld-irc 11:51:36 Zakim has joined #gld 11:52:15 zakim, this will be gld1 11:52:15 ok, sandro; I see SW_e-Gov( GLD)6:30AM scheduled to start 22 minutes ago 11:53:02 mhausenblas has joined #gld 11:53:02 sandro has changed the topic to: Government Linked Data (GLD) WG -- F2F2 -- Code GLD1 -- http://www.w3.org/2011/gld/wiki/F2F2 11:53:16 ok, I guess we're set and ready! 11:54:03 SW_e-Gov( GLD)6:30AM has now started 11:54:10 +??P0 11:54:18 +sandro 11:55:42 GeraldSteeman has joined #GLD 11:57:09 +GeraldSteeman 11:57:41 fadi has joined #gld 11:58:38 + +3539149aaaa 11:58:48 Zakim, aaaa is me 11:58:48 +mhausenblas; got it 11:58:56 zakim, aaaa is galway 11:58:56 sorry, cygri, I do not recognize a party named 'aaaa' 11:58:57 Zakim, fadi is with me 11:58:58 +fadi; got it 11:59:13 BenediktKaempgen has joined #gld 11:59:22 Zakim, I am really galway 11:59:22 I don't understand 'I am really galway', mhausenblas 11:59:25 boris_ has joined #gld 11:59:26 zakim, mhausenblas is really galway 11:59:26 +galway; got it 11:59:26 dvilasuero has joined #gld 11:59:36 zakim, mhausenblas is with galway 11:59:36 +mhausenblas; got it 11:59:45 zakim, i'm with galway 11:59:45 +cygri; got it 12:00:01 zakim, who is on the phone 12:00:01 I don't understand 'who is on the phone', cygri 12:00:09 Deirdre has joined #gld 12:00:10 member:zakim, who is on the phone? 12:00:21 member:zakim who is on the phone? 12:00:29 zakim, who is on the phone? 12:00:29 On the phone I see simonWall, sandro, GeraldSteeman, galway 12:00:30 galway has galway, mhausenblas, cygri 12:01:04 zakim, BenediktKaempgen is with galway 12:01:04 +BenediktKaempgen; got it 12:01:07 csarven has joined #gld 12:01:17 zakim, csarven is with galway 12:01:17 +csarven; got it 12:01:21 zakim, dvilasuero is galway 12:01:21 sorry, dvilasuero, I do not recognize a party named 'dvilasuero' 12:01:24 PhilA has joined #gld 12:01:29 zakim, boris is with galway 12:01:29 +boris; got it 12:01:30 zakim, Deirdre is with galway 12:01:30 Gofran has joined #gld 12:01:30 +Deirdre; got it 12:01:38 zakim, fadi is with galway 12:01:38 +fadi; got it 12:01:44 zakim, Gofran is with galway 12:01:44 +Gofran; got it 12:01:49 zakim, PhilA is with galway 12:01:49 +PhilA; got it 12:02:15 BartvanLeeuwen has joined #gld 12:02:17 zakim, dvilasuero is with galway 12:02:17 +dvilasuero; got it 12:02:25 zakim BartvanLeeuwen is with Galway 12:02:33 zakim, BartvanLeeuwen is with Galway 12:02:33 +BartvanLeeuwen; got it 12:02:39 zakim, who is on the phone? 12:02:39 On the phone I see simonWall (muted), sandro, GeraldSteeman, galway 12:02:40 galway has galway, mhausenblas, cygri, BenediktKaempgen, csarven, boris, Deirdre, fadi, Gofran, PhilA, dvilasuero, BartvanLeeuwen 12:03:09 RRSAgent, pointer? 12:03:09 See http://www.w3.org/2012/01/25-gld-irc#T12-03-09 12:03:44 RRSAgent, make logs public 12:04:07 I'm thinking skype is the best bet. 12:04:15 (with sound turned off) 12:06:32 DeirdreLee has joined #gld 12:06:42 + +1.802.371.aabb 12:07:23 - +1.802.371.aabb 12:09:55 the only thing we now need (for both Zakim and skype) is .... 12:09:59 Washington! :) 12:10:07 + +1.802.371.aacc 12:10:27 ( who is dvilasuero ? ) 12:10:36 + +1.202.691.aadd 12:10:51 zakim, aadd is Washington 12:10:51 +Washington; got it 12:14:03 tighten has joined #gld 12:14:09 - +1.802.371.aacc 12:14:19 Zakim, mute me 12:14:22 sorry, mhausenblas, I do not know which phone connection belongs to you 12:14:29 Zakim, mute galway 12:14:29 galway should now be muted 12:15:17 Zakim, unmute galway 12:15:17 galway should no longer be muted 12:15:39 sandro, dvialasuero is daniel vila from madrid 12:16:24 it's richard.cyganiak 12:19:28 trackbot, start telecon 12:19:31 RRSAgent, make logs world 12:19:33 Zakim, this will be GLD 12:19:33 ok, trackbot, I see SW_e-Gov( GLD)6:30AM already started 12:19:34 Meeting: Government Linked Data Working Group Teleconference 12:19:34 Date: 25 January 2012 12:19:53 Agenda: http://www.w3.org/2011/gld/wiki/F2F2#Wednesday.2C_25-Jan-2012 12:20:04 Zakim, who's here? 12:20:04 On the phone I see simonWall (muted), sandro, GeraldSteeman, galway, Washington 12:20:06 galway has galway, mhausenblas, cygri, BenediktKaempgen, csarven, boris, Deirdre, fadi, Gofran, PhilA, dvilasuero, BartvanLeeuwen 12:20:09 On IRC I see tighten, DeirdreLee, BartvanLeeuwen, PhilA, csarven, dvilasuero, boris, BenediktKaempgen, fadi, GeraldSteeman, mhausenblas, Zakim, RRSAgent, simonWall, cygri, rreck, 12:20:11 ... danbri, trackbot, sandro 12:20:20 GofranS has joined #gld 12:20:51 scribenick: mhausenblas 12:22:46 bhyland has joined #gld 12:22:48 t_gheen has joined #gld 12:22:52 ping 12:22:59 pong bhyland 12:24:09 George has joined #gld 12:24:29 GUEST: Gofran Shakair 12:24:56 GUEST: Deirdre Lee 12:24:58 Topic: Introduction and welcome. Agenda review 12:25:18 PhilA: I'm in Galway, W3C staff 12:25:38 + +1.802.371.aaee 12:26:05 PhilA: I've been working on vocabularies, ADMS, DCAT, organisation ontology 12:26:49 RRSAgent, pointer? 12:26:49 See http://www.w3.org/2012/01/25-gld-irc#T12-26-49 12:26:50 dvilasuero: In Galway, Master student with Boris (UPM) from Spain 12:27:21 fadi: In Galway, finished my MSc on Publishing Linked Gov Data, Google Refine, DCAT 12:28:11 boris: In Galway, UPM, we do Linked Government Data in Spain, will help facility vocab 12:28:42 mhausenblas: I'm co-hosting here. Head Linked Data section here at DERI 12:29:36 cygri: In Galway, LiDRC at DERI as well, I'm focusing on vocabs (DCAT, DataCube) and also other WG (RDF, RDB2RDF) 12:30:11 ... I'd like to learn about requirement for DataCube vocab, also DCAT 12:30:33 BartvanLeeuwen: In Galway - I'm a Semantic Fire Fighter from A'dam 12:30:58 ... doing Linked Open Data, looking for advise for best practices and share again 12:31:25 Note to self, need to talk to fadi about Dan Smith's work on Refine Extensions http://wiki.linkedgov.org/index.php/Extension 12:31:28 csarven: In Galway, MSc in LiDRC, with Michael and Richard, working on data-gov.ie, tooling around this 12:31:45 DaveReynolds has joined #gld 12:32:14 GofranS: In Galway, MSc students in eGov unit at DERI, focusing on metadata i18y, ADMS 12:32:26 -GeraldSteeman 12:33:04 DeirdreLee: In Galway, heading the eGov unit in DERI, working with Vassilios of the EC 12:33:18 +GeraldSteeman 12:33:28 ... we're doing Open Data, policies, etc. 12:33:41 ... for example, DCAT is of interest 12:34:08 Spyros has joined #gld 12:34:16 zakim, spyros is with galway 12:34:16 +spyros; got it 12:35:05 BenediktKaempgen: In Galway, from FZI in Karlsruhe, Germany 12:35:32 ... into business intelligence, interested to provide feedback for DataCube and other related efforts 12:35:44 ... such as SKOS extension for hieratchy 12:35:53 s/hieratchy/hierarchy 12:36:04 ... as well as versioning input 12:36:05 +[IPcaller] 12:36:17 zakim, IPcaller is me 12:36:17 +DaveReynolds; got it 12:36:24 ... we've published Eurostat and XBRL data 12:37:01 Spyros: In Galway, IBM SCTC in Dublin, we are into Linked Open Data publishing (dublinked.ie) 12:37:22 ... we do data management for Smart Cities using Linked Data 12:38:00 first/last name spelling? 12:38:28 I think I have everyone by Spyros on http://www.w3.org/2011/gld/meeting/2012-01-25 12:38:33 s/by/but/ 12:38:35 SpyrosKotoulas has joined #gld 12:38:38 +rreck 12:38:39 got it. 12:38:44 -rreck 12:39:09 sheesh, it took 4 tries and then dropped 12:39:57 cmusialek has joined #gld 12:40:01 zakim, mute me 12:40:03 DaveReynolds should now be muted 12:40:06 GofranShukair has joined #gld 12:40:21 cmusialek: Chris Musialek, working on data.gov 12:40:23 cmusialek: GSA Data.gov lead, working on vocab.data.gov and other related GLD for Data.gov 12:40:36 t_gheen: One World Law Library 12:40:53 +rreck 12:41:07 whew, i called in 9 times 12:41:35 ping 12:41:40 pong 12:41:48 t_gheen: Library of Congress 12:42:20 danbri has joined #gld 12:42:27 Introducing George Thomas from US HEaltha & Human Services 12:43:19 bhyland: 3RoundStones, GLD co-chair, US Gov LD initiatives (EPA), strong open source product orientation, more on Web Arch, Data Mgmt 12:43:37 ... better tooling for Web2.0 app dev's for using RDF stack tech 12:43:38 ... 12:43:44 danbri has joined #gld 12:44:11 objectives for F2F2 - focus on enabling aspects for GLD publishers, how to roll out LD projects 12:45:13 ... value add to augment tech chops with mgmt understanding 12:45:16 me sandro, it's Gofran Shukair and Benedikt Kämpgen 12:46:03 Yigal (not in IRC) - working on Gov Grant vocab - 12:47:20 ... been working with Gov Data for a long time, worked with Dan Gillman (BLS) 12:48:03 Mike Pendleton (not on IRC) - EPA - doing LD projects, new approaches to data warehousing and publishing using LD 12:48:20 ... interest and contribution in Procurement 12:48:59 cgueret_work has joined #gld 12:49:02 Anne Washington (not on IRC) from George Mason University, Professor Public Policy, bkgrnd CS and IS 12:49:21 ... interest and bckgrnd in Dig Archives, preservation incl metadata 12:49:21 Okay, http://www.w3.org/2011/gld/meeting/2012-01-25 has everyone correctly listed (I think). 12:49:40 ... need for external 'non-branded' info in determining scope and direction of GLD projects 12:49:52 ... part of the W3C eGov IG 12:49:52 q+ 12:50:59 + +1.518.276.aaff 12:51:02 - +1.802.371.aaee 12:51:25 +??P4 12:51:35 q? 12:51:46 http://www.w3.org/2011/gld/meeting/2012-01-25 12:51:49 q- 12:51:52 Yigal has joined #gld 12:52:20 olyerickson has joined #GLD 12:52:29 Dan Gillman (not yet on IRC) 12:52:37 BLS, DC F2F2 host 12:52:45 where is the video ? 12:52:49 @bhyland I have been on in car, just arrived at TWCRPI 12:53:01 zakim, I am aaff 12:53:01 +olyerickson; got it 12:53:06 ... involved with metadata standards and requirements for access to statistical data (for 'quite some time' :) 12:53:37 ... got involved with GLD through chair role of Open Gov Vocab WG (part of Fed CIO Council Data Arch Subcmt) 12:53:59 ... interest in synergy and application of W3C/GLD to BLS data 12:54:15 w? 12:54:18 q? 12:54:19 q+ 12:54:20 q/ 12:54:26 q? 12:54:47 olyerickson: Dir of Web Science Ops at TWC RPI 12:55:03 ack olyerickson 12:55:11 ... project lead for logd.twc.rpi.edu - int gov cat search demo, govpedia.org project, others 12:56:28 regrets+ Hadley Beeman 12:56:40 ... interest in firming up international BP guidance for GLD, co-leading URI construction session later today, also vocab rec's esp DCAT, (other good collab mojo) 12:57:48 sandro: W3C primary staff contact with PhilA, key interest is making SemWeb work, GLD all ++, QB??, more :) 12:58:59 GeraldSteeman: NASA S&T Info Prg Office, deliverable reviewer from lay-person persp, general interest in GLD 12:59:00 q+ to update on Gishlain status 12:59:36 s/Gishlain/Ghislain/ 12:59:44 ... bhyland adds contibutions from Gerald incl outreach at high levels 12:59:55 zakim, unmute me 12:59:55 DaveReynolds should no longer be muted 13:01:27 DaveReynolds: SW/LD long timer, CTO Epimorphics, UK Pub Sector - data.gov.uk (variety of offices/agencies), vocab work - Org Ont (UK Organogram with cygri and JT), QB, LDA co-developer (great stuff!), variety of edu/env publishing 13:01:37 ... Linked Data API see http://code.google.com/p/linked-data-api/ 13:01:46 ... interests - mostly vocab with cygri etal 13:01:52 t_gheen_ has joined #gld 13:01:54 q? 13:02:00 zakim, mute me 13:02:00 DaveReynolds should now be muted 13:02:19 simonWall? 13:02:24 Picking up on DaveReynolds comments about the org ontology being used for organograms - here's an example http://data.gov.uk/organogram/department-for-business-innovation-and-skills 13:02:27 zakim, unmute simonWall 13:02:27 simonWall should no longer be muted 13:03:29 simonWall: morning! Dir of Data Mgmt Australian Bu of Stats - working on standardizing statistical data/metadata, statistics/statistics/statistics 13:03:32 simonWall: I lead the Data Management Section at the Australian Bureau of Statistics (http://abs.gov.au). (in Canberra) 13:04:17 ... unlike (sandro ;), most interested in QB vocab, role as influencer of international stat community, interest in LD, and W3 membership 13:04:48 ... is alive and well :) 13:05:10 t_gheen has joined #gld 13:07:24 rreck: consultant in Wash DC, masters in comp linguistics, textual data & RDF thesis, published, working in law enforcement, working with vocabs, 3rd W3C (GRDDL, other?) group 13:07:55 ... review props that influence stability of GLD, collab with AnneW 13:08:27 I'm here but you don't hear me 13:08:46 it's christophe gueret 13:08:49 christophe gueret 13:08:51 from the VU 13:08:52 yep 13:08:56 should be 13:09:10 zakim, ??P4 is cgueret_work 13:09:10 +cgueret_work; got it 13:09:19 http://www.few.vu.nl/~cgueret 13:09:22 :/ 13:09:25 that's me :) 13:09:41 zakim, cgueret_work is really Christophe Gueret 13:09:41 I don't understand you, PhilA 13:09:41 thx :) 13:10:02 zakim, cgueret_work is ChristopheGueret 13:10:02 +ChristopheGueret; got it 13:10:27 topic: Agenda Review 13:10:32 Please look at http://www.w3.org/2011/gld/wiki/F2F2 13:11:27 Topic: DataCube vocab discussion update 13:11:40 http://publishing-statistical-data.googlecode.com/svn/trunk/specs/src/main/html/cube.html 13:11:42 cygri: we have a draft spec 13:12:00 Wiki page: http://www.w3.org/2011/gld/wiki/Statistical_Cube_Data 13:12:03 cygri: started in 2010, that's the current status : http://publishing-statistical-data.googlecode.com/svn/trunk/specs/src/main/html/cube.html 13:12:29 cygri: recently not that much activity re consumption 13:12:46 ... working on a generic client for any kind of DataCube data 13:13:12 ... we have quite some issues in the queue raised by people that have been using DataCube 13:13:29 ... suggestions for improvements and extensions (incl. from BenediktKaempgen) 13:13:34 cygri: next steps are 13:13:57 ... transferring the issues to GLD tracker 13:13:58 http://code.google.com/p/publishing-statistical-data/issues/list 13:14:11 cygri: as well as discuss extensions in GLD 13:14:44 zakim, unmute me 13:14:44 DaveReynolds should no longer be muted 13:14:48 q? 13:14:50 ack me 13:14:50 mhausenblas, you wanted to update on Gishlain status 13:15:05 Mike_Pendleton has joined #gld 13:15:06 q+ to still update on G.'s status ;) 13:15:32 cygri: additionally we want to publish the current spec as a FPWD in the GLD 13:15:45 .. I do have an action on it anyways 13:16:46 cygri: in order to improve DataCube we should take into consideration all the valuable feedback 13:17:00 q+ to agree with Richard :) 13:17:07 q+ 13:17:13 ... need to find a balance between quickly getting out a FPWD on it vs. incorporating the feedback 13:17:14 q? 13:17:27 +q 13:17:36 +q 13:17:50 George_ has joined #gld 13:19:02 ack mich 13:19:06 ack me 13:19:06 mhausenblas, you wanted to still update on G.'s status ;) 13:19:11 ack mhausenblas 13:19:29 ack DaveReynolds 13:19:29 DaveReynolds, you wanted to agree with Richard :) 13:19:33 Michael: Seems Gihslain will join G'way later today 13:19:35 ack me 13:19:52 DaveReynolds: Agree with what cygri said 13:20:08 ... need a canonical issues list and a FPWD of DataCube spec 13:20:35 ... need to remove ambiguities in the spec 13:20:48 ... folding in the experience from practice 13:20:49 q+ to talk about use cases 13:21:19 DaveReynolds: DataCube has been used by a number of groups now, already 13:21:32 ... some co-ordination is needed 13:21:54 ... esp. re aggregation there has been quite some development in the SDMX world 13:22:01 q? 13:22:18 DaveReynolds: coordination with standards from the Observations&Measurements area 13:23:38 q? 13:24:00 bhyland: Are people happy enough to move to the W3C space? 13:24:40 cygri: I think GLD is the appropriate space for this, yes 13:25:07 ... so far the work has happened in an informal space (cf. Google code repo) 13:26:15 bhyland: what do we need to do? are the documents in good shape? How do we move forward and raise awareness? 13:26:23 DruidSmith has joined #gld 13:26:24 ack me 13:26:56 PhilA: From a process point of view we need to create a product for DataCube in the GLD tracker 13:27:16 ... and also for future products (DCAT, etc.) as currently there is only one product 13:27:35 cygri: Positiv 13:27:51 ACTION: PhilA to add products on issue tracker 13:27:51 Created ACTION-29 - Add products on issue tracker [on Phil Archer - due 2012-02-01]. 13:27:51 DanG has joined #gld 13:28:21 ACTION: cygri to produce editor's draft of Data Cube spec 13:28:22 Created ACTION-30 - Produce editor's draft of Data Cube spec [on Richard Cyganiak - due 2012-02-01]. 13:28:32 q? 13:29:21 Michael: I assume FPWD of DataCube will be available together with the other FPWD on BP, etc.? 13:29:32 ... which would mean: soon 13:29:33 q? 13:29:43 Agree with cygri that once have first working draft is a good time to see feedback. 13:29:47 ack BenediktKaempgen 13:29:52 s/see/seek/ 13:30:38 BenediktKaempgen: Question re issues 13:30:52 ... how will they be grouped 13:31:45 BenediktKaempgen: Also, questions regarding consumption side 13:32:34 We do see groups consuming live Data Cube data, including iPhone apps. So there is some active use to learn from. 13:32:50 cygri: My take on this is that the scope is backed up by the charter 13:33:15 ... so we need to be careful regarding how far we go, can't stray too far from this 13:33:36 ... but it's important that we're compatible with other related works such as DDI 13:34:08 ... agree with co-ordination with others, yes 13:34:29 Agree with Richard, keep scope narrow as currently defined, but do a good job of co-ordination. 13:34:30 cygri: regarding consumption tools - we're producing a vocab, not a processor 13:34:31 the charter says: "Statistical "Cube" Data. The group will produce a vocabulary, compatible with SDMX, for expressing some kinds of statistical data. This need not be as expressive as all of SDMX, but may provide a subset as in the RDF Data Cube vocabulary. It may also include ways to annotate data to indicate its assumptions and comparability." 13:34:49 precended by: "The group will also produce documentation, examples, and, optionally, test cases and OWL ontologies for these vocabularies." 13:34:59 cygri: though, feedback from DataCube consumers would be beneficial 13:35:07 bhyland has joined #gld 13:35:12 -rreck 13:35:16 q? 13:35:20 ack cygri 13:35:20 cygri, you wanted to talk about use cases 13:35:32 on http://www.w3.org/2011/gld/charter 13:35:38 mhausenblas, it's optional. 13:36:11 Question: Is this correct wiki page for updating the WG on progress, http://www.w3.org/2011/gld/wiki/Statistical_Cube_Data 13:36:13 mhausenblas: it currently is OWL (for values of "OWL" that are basically RDFS :)) 13:36:19 ack simonWall 13:36:20 q? 13:36:44 simonWall: we're very active in the SDMX and DataCube space 13:37:29 q? 13:37:31 +rreck 13:37:41 q+ possibly use real data as an example? 13:37:55 cygri: Would like to raise one more issue - does it make sense to also document use cases? 13:37:57 q+, real examples? 13:38:02 Michael: +1 to use cases 13:38:10 cygri: Does it make sense to document use cases for vocabularies? 13:38:15 use cases are always good... 13:38:25 Michael: Yes, we're backed up by charter (cf. 'examples') 13:38:33 cygri: I think it is a good reality check to resolve design criteria issues. Helps with clarity 13:38:45 Although UCS are probably best recorded in a separate document 13:39:13 cygri: Matter of resources in the working group 13:39:22 cygri: Do we have the resources in the group to document use cases. 13:39:25 q+ 13:39:34 +1 to use cases 13:39:42 +1 for use cases 13:39:54 q? 13:40:02 q- real 13:40:11 q- examples? 13:40:17 +1 for use cases 13:40:21 bhyland: we are writing docs for real working people, so mapping to real world is important 13:40:24 ack bhyland 13:40:38 +1 to bhyland 13:40:48 q? 13:40:51 bhland: where is the wiki page to update progress for this? 13:41:04 eg., http://www.w3.org/2011/gld/wiki/Statistical_Cube_Data 13:41:05 q+ to talk about examples 13:41:05 s/bhland/bhyland 13:41:11 Is this correct wiki page for updating the WG on progress, http://www.w3.org/2011/gld/wiki/Statistical_Cube_Data 13:41:33 cygri: AFAIK there is no single page that captures the current status 13:41:42 ... not really updated, also 13:41:52 I updated it recently a bit. 13:42:11 bhyland: recommend creating a high level page to organize the information 13:42:18 q? 13:42:36 ... can people devote time to this effort over the next few months? 13:42:37 ack me 13:42:37 DaveReynolds, you wanted to talk about examples 13:43:05 DaveReynolds: Yes, I can commit some time in the next 5 month, rather at the end 13:43:20 DaveReynolds: re UC, there are different needs 13:43:28 ... real data samples 13:43:41 +1 DaveReynolds 13:43:45 +1 as well 13:44:13 +1 too 13:44:19 oh too bad we didnt do it in google+ so others could have joined 13:44:54 DaveReynolds: UC in the sense cygri was talking about vs. real world samples 13:44:56 rreck, maybe during a break we can experiment with other vid tech. 13:45:09 Michael: both would be good! 13:45:22 rreck, also, I gather this is a commercial skype account that can do multi-way. 13:45:38 oh? 13:45:44 DaveReynolds: also valuable to evangelise to document the usage 13:45:58 ... but not in the spec but a separate doc 13:45:59 q+ 13:46:02 q? 13:46:10 DaveReynolds: additional note on how case-study/examples realize use cases 13:46:34 Michael: Agree with DaveReynolds to have a separate non-REC-Track doc on UC 13:46:36 1- 13:46:38 q- 13:46:44 Sounds to me as if bhyland is talking about usage guidelines? 13:46:55 or tutorials? not sure. 13:47:26 bhyland: who can work on this? 13:47:47 Committment for DataCube/SDMX work offered by DaveReynolds, Richard, others? 13:47:47 +q 13:47:59 Add SimonWall to the list. 13:47:59 simonWall: count me in re UC 13:48:00 So the people working on the qb data are DaveReynolds, cygri, simonWall 13:48:11 You can count me in also. 13:48:15 bhyland: simonWall, DaveReynolds, cygri 13:48:32 q+ 13:48:36 ack simonWall 13:49:09 zakim, mute me 13:49:09 DaveReynolds should now be muted 13:49:19 Topic: Vocabulary Selection discussion 13:49:41 is there a Skype ccall that one could be included in for video? 13:49:54 Yigal has joined #Gld 13:49:57 PhilA, can you add me, too? 13:50:04 boris: Please look at the slides at http://www.w3.org/2011/gld/wiki/images/6/65/VocabularySelection.pdf 13:50:07 ...or is it only DC/Galway>? 13:50:52 So the people working on the qb data are DaveReynolds, cygri, simonWall, mhausenblas, BenediktKaempgen 13:52:38 http://www.w3.org/2011/gld/wiki/images/6/65/VocabularySelection.pdf 13:53:43 boris: starting presentation on vocabulary selection 13:54:12 ... charter says we need to provide guidelines to governments 13:54:44 ... RDF requires specific domain terms in order to provide a certain domain 13:55:32 ... modelling is important phase in data lifecycle 13:55:33 ping 13:55:57 pong 13:55:58 ... (showing different data lifecycle models, they all have a modelling phase) 13:56:42 ... big picture: 1. search for existing vocabularies in various search engines/repositories 13:56:53 q+ to note re suitability 13:57:04 ... 2. if suitable is found, re-use it 13:57:16 ... 3. otherwise, search for suitable thesauri etc 13:57:39 ... 4. if those exist, build a vocabulary by transforming these resources into RDFS 13:58:03 ... 5. otherwise, build from scratch. this happens if the domain is very new or complex. but doesn't happen so often 13:58:15 Encourage interactivity IMO 13:58:29 lol, co-chairs differ ;-) 13:58:36 :) 13:59:13 boris: there are multiple repositories for searching vocabularies, but no one definitive 13:59:18 No one central place to find a vocab is a feature, not a bug :-) 13:59:39 ... (summary table of available repositories) 13:59:52 Michael: ontologi.es is in fact Melvin C ;) 14:00:41 s/Melvin C/Toby Inkster 14:01:11 boris: there are no guidelines to help developers to decide which engine/repo to use 14:01:43 Note that DataFAQs will soon provide a statistical vocabulary ranking service based on use in the LOD cloud. I've asked them for a statement as to how this will work. Note also that they will provide a commmunity vocab ranking service soon as well 14:01:47 Michael: re relevant vocabs - ORG seems to be missing? 14:02:14 boris: (summary list of gov-relevant vocabs) 14:02:15 @Michael, noted. I'll add. 14:02:23 This is a partial list 14:02:35 ... probably need to include a few more 14:03:00 Michael: re vocabulary prefixes - my advise is simple - use prefix.cc 14:03:07 ... there is a list of popular prefixes from the RDFa group 14:03:11 Vocabulary list: would like to see org on there :) 14:03:24 also DOAP is missing 14:03:44 Mike_Pendleton has joined #gld 14:03:47 DOAP is on the previous list 14:04:19 yeah sorry now i see it :( 14:04:42 But wasn't on the prefix list I don't think. 14:04:58 boris: (demo of LOV - http://labs.mondeca.com/dataset/lov/suggest/ ) 14:05:01 Not sure about BIBO as the one and only vocab in that area to single out, but not my field. 14:06:05 boris: criteria for selecting a particular vocab/ontology 14:06:11 definately, if we have BIBO there we should have some other important library vocabs 14:06:21 q? 14:06:33 LOV search - nice 14:06:36 ... usage, maintenance, coverage, etc etc 14:06:58 ... tools for building vocabularies: neologism, protege, ... 14:07:31 q? 14:07:37 (LOV people are: Bertrand Vatant and Pierre-Yves Vandenbussche) 14:07:49 +q 14:07:53 ack me 14:07:53 mhausenblas, you wanted to note re suitability 14:07:58 ack mhausenblas 14:08:05 q+ 14:08:17 mhausenblas: what is “suitable”? how do you define this? 14:08:19 mhausenblas: what is 'suitable'? 14:08:46 ... give concrete advice how to figure out which competing vocabulary to use 14:08:56 ... and advice when it makes sense to build your own 14:09:09 ime suitability is often a vocab combo - ie org + vcard 14:09:16 ... also important for suitability: does my data sparql well if expressed in this vocab? 14:09:20 q+ to agree with mhausenblas and add a bit more 14:09:37 ... existence of multiple repos/engines not a problem. they do different things 14:09:45 ... some crawl, some are curated 14:09:56 I've asked DataFAQs people to compare/contrast their vocab ranking capability with LOV vocab ranking. 14:09:59 ... if we had the resources: meta search engine? 14:10:12 +1 vocab metacrawler at w3 14:10:14 ... would have value if run at W3C 14:10:50 ... our best practice document will be frozen in time, so static lists will go out of date 14:10:59 +1 to more than vocab search; need ranking "vocabRank" or "schemaRank" 14:11:01 ... perishable info should maybe not go in there 14:11:11 q- 14:11:22 q? 14:11:25 ack cygri 14:11:33 The Best Practices Recommendation document will be almost "frozen" as of the publication data. The way we'll add flexibility to the Vocabs is through the community driven LOD Cookbook. 14:11:35 cygri: Agree with Michael 14:11:55 +1 to Michael 14:12:12 GLD recommendation for "high quality" linked data is to use widely-used, relevant vocabularies *correctly* 14:12:22 cygri: we should avoid to create concrete suggestions what are suitable vocabs - it's arbitrary 14:12:38 q+ 14:12:39 these are two diff gld deliverables tho - selection, and recommended 14:12:39 The question is, how to find vocabs (a) in wide use (b) whether they are relevant 14:12:48 +1 to Michael and Richard, W3C shouldn't be maintainer of such lists, especially not if that has implications on procurement 14:12:52 q+ 14:12:57 cygri: I'd like to see guidance on how to use the tools (check lists) to determine what is relevant, quality, etc, 14:13:05 q? 14:13:11 Cygri: For this WG, suggest that we have a basic sets of questions we ask the maintainer. We don't want to arbitrarily add vocabs. 14:13:12 +1 to DaveReynolds ( by default ;) ) 14:13:13 q+ 14:13:32 boris: I agree with both Richard and Michael said 14:13:54 I think surveys/questionaires/etc don't scale 14:13:55 recommendation for domain agnostic - cross cutting vocabs for all GLD publishers... 14:13:55 q? 14:14:37 bhyland: what do you mean by implications on procurement? 14:14:46 snapshot problem regarding procurement and inclusion on some 'list' like a gld deliverable 14:15:05 I think we should leverage the presence of vocabs "in the wild" (ie LOD Cloud) to assist selection 14:15:50 q? 14:16:17 ack PhilA 14:16:17 PhilA, you wanted to agree with mhausenblas and add a bit more 14:16:24 mhausenblas: it's a competitive advantage if my vocab is w3c-listed and yours isn't. best practice document will be frozen in time 14:16:39 PhilA: here are some criteria: 14:16:46 ... 1. permanence of domain name 14:17:04 ... for example, LOV service URL looks not permanent. that's bad. 14:17:17 gov consortium mandates are nice ... 14:17:37 I think we should call URLs URLs not URIs 14:17:41 ... 2. change control. who's in charge of changing it? 14:17:50 q+ re change control and vocab ownership 14:17:55 ... dublin core has a large committee in charge, so changing it is hard. that's good 14:18:03 ... 3. is it actually used in the wild? 14:18:17 q? 14:18:21 ... we should point out these criteria even if it may be very hard to evaluate in practice 14:18:26 +1 to all PhilA said 14:18:52 BartvanLeeuwen: should also point out that local language documentation is important 14:19:00 +1 BartvanLeeuwen -- another criterion is support for multiple languages, eg in the documentation for the vocabulary 14:19:06 +1 too 14:19:14 +1 too 14:19:17 ... ideally, vocabularies should have documentation in multiple languages 14:19:19 vocabs should be properly described in several languages 14:19:51 q? 14:20:06 ack BartvanLeeuwen 14:20:27 Michael: We need to distinguish between vocab discovery and vocab creation guidelines, I believe 14:20:49 boris: most vocabs are english but governments speak all sorts of languages. we have work in progress on how to express multilingual vocabs on the web of data 14:21:05 GofranShukair: ADMS 14:21:12 GofranShukair: ADMS describes semantic assets. that includes vocabularies 14:21:19 http://joinup.ec.europa.eu/asset/adms/home 14:21:37 ack GofranShukair 14:21:45 ... we describe metadata, incl language 14:21:50 ... ready for review 14:22:06 q? 14:22:18 ack bhyland 14:22:20 ack bhyland 14:22:48 I can take the action 14:23:08 will be pleased to contribute with French concerns 14:23:19 ACTION: boris to create a Wiki page on multi-lingualism of vocabs 14:23:19 Created ACTION-31 - Create a Wiki page on multi-lingualism of vocabs [on Boris Villazón-Terrazas - due 2012-02-01]. 14:23:20 bhyland: multilingual issues are important. awareness should be raised. please, write a blurb on this 14:23:20 http://joinup.ec.europa.eu/asset/adms/home 14:23:55 q+ 14:24:18 In addition to the multilingual vocab issue, there is the multilingual instance data issue --- english predicates but literals in other languages. 14:24:20 bhyland: having criteria for inclusion of vocabularies is important. let us draft a list of vocabularies. 14:24:45 q? 14:24:53 ... where is it hosted? university? production system? what's the institution's commitment to maintenance? 14:25:08 MacTed has joined #gld 14:25:43 ... we should work on such a checklist over the next two days 14:25:44 +1000 14:25:45 +1 14:25:47 Michael: we should maybe also talk about vocab management (what is the process to add new terms? who owns the namespace? hit-by-truck scenario) 14:25:53 ack mhausenblas 14:25:53 mhausenblas, you wanted to discuss change control and vocab ownership 14:26:17 +1 to bernadette's suggestions for capturing criteria for vocab selection 14:26:23 mhausenblas: there can be issues around ownership of namespace, hit by bus risk etc 14:26:37 +1 namespace ownership problems 14:26:42 mhausenblas: namespace ownership, distinguish btw discovery, management, creation advice - more will discover than create - 14:26:46 ... need to distinguish between vocabulary search and vocabulary creation. different issues 14:26:49 +1 bhyland: during these two days let's start the checklist of things people need to look for in deciding whether a vocab is good enough, such as stability, domain name, point of contact, etc. 14:27:13 I have had commercial clients unwilling to use existing namespaces because of copyright exposure 14:27:25 ... experience shows that something can start informally and move to something more formal, e.g. story of VoID 14:27:26 PhilA: notes that danbri has solved the "what happens if I go under a bus" issue through an agreement with DCMI (so FOAF is as stable as DC) 14:27:29 ( I don't think bhland said we should produce a list of vocabs. ) 14:27:57 q? 14:28:02 ... so we can say there's a process that can take you from informal work to something permanent and fit for purpose 14:28:28 q+ to suggest the document include fears/nightmare-scenarios 14:28:33 q? 14:28:57 mhausenblas: i like checklists 14:29:02 +1 14:29:12 +1 to sandro's ' fears/nightmare-scenarios' 14:30:01 charter quote: "Vocabulary Selection. The group will provide advice on how governments should select RDF vocabulary terms (URIs), including advice as to when they should mint their own. This advice will take into account issues of stability, security, and long-term maintenance commitment, as well as other factors that may arise during the group's work." 14:30:52 +1 cygri: don't list vocabs, just list how to evaluate vocabs 14:30:55 Michael: Does the WG interpret this in the sense of 'we provide checklist how to' or rather 'list concrete vocabs'? 14:31:06 Michael: I'd very much prefer the former 14:31:18 t_gheen has joined #gld 14:31:20 cygri: lists of recommended vocabs in bp vocab selection? instead, criteria list for selection - then there's std vocabs for cross cutting GLD publisher concerns - nice delineation 14:31:32 +1 14:31:47 +1 14:31:50 q? 14:31:52 ack cygri 14:31:57 q+ to make 2 suggestions for vocab selection if you want me to, or I'll park it if time is short 14:32:06 sandro: i agree. arbitrary lists would be a problem 14:32:07 +1 to cygri, criteria not lists 14:32:26 sandro: we might explain that criteria list in terms of "nightmare scenarios" 14:32:29 sandro: how to write this 'checklist' - nice to explain in terms of issues/challenges (fears/nightmare-scenarios) 14:32:38 +1 cygri 14:32:49 ... "here are possible things that could go wrong. check how the vocabulary or its maintainers deals with that" 14:33:03 q? 14:33:04 ... this would bring it to life 14:33:21 bhyland: i agree but can we put a positive spin on it? 14:33:23 ack sandro 14:33:23 sandro, you wanted to suggest the document include fears/nightmare-scenarios 14:33:35 PhilA: for the record: it would be horrible if danbri was hit by a bus. 14:33:36 ack PhilA 14:33:36 PhilA, you wanted to make 2 suggestions for vocab selection if you want me to, or I'll park it if time is short 14:33:37 q? 14:33:59 PhilA: national part of domains matter 14:34:09 ... but you can use .us in .ie 14:34:15 q+ 14:34:21 +1 to PhilA 14:34:32 Yigal has joined #gld 14:34:38 ... multilingual: want to use dublin core in finnish? don't reinvent it. provide a translation with finnish labels 14:34:40 +1 to PhilA 14:35:24 ack cygri 14:35:30 cygri: +1 provide labels for existing vocab/namespace 14:35:46 we should mention Z39.19? 14:36:02 ... common issue/problem/mistake 14:36:03 The Finnish National Library maintains the Finnish version of Dublin Core... 14:36:03 skos 14:36:27 mhausenblas: label as 'quality requirement' 14:36:29 mhausenblas: some quality criteria can be expressed as sparql queries 14:36:33 ... for example presence of labels 14:36:38 q? 14:36:51 PhilA: chose Finnish at random - but good to see that my entirely random choice is ahead of the game, simonWall 14:37:30 ACTION: mhausenblas to compile first version of vocabulary selection quality checklist 14:37:31 Created ACTION-32 - Compile first version of vocabulary selection quality checklist [on Michael Hausenblas - due 2012-02-01]. 14:37:47 Having the label in the URI for vocab terms is a multi-language issue for some folks. There is genuine argument on both sides whether opaque URIs + labels in all languages is better than having one preferred language reflected in the URIs. 14:37:59 Michael: Z39.19 sounds interesting indeed, thanks rreck! 14:38:05 Point taken (I googled that one; I do know that the New Zealand National Library maintains the Maori version of DC though.) 14:38:19 ACTION-31? 14:38:19 ACTION-31 -- Boris Villazón-Terrazas to create a Wiki page on multi-lingualism of vocabs -- due 2012-02-01 -- OPEN 14:38:19 http://www.w3.org/2011/gld/track/actions/31 14:38:20 *mhausenblas: i could help with that action 14:38:56 +sandro.a 14:39:05 i have done alot of work with z39.19 and multi-lingual representation 14:39:07 -sandro 14:39:32 DaveReynolds, do you have some pointers re multilingual URIs? would be good to include the debate in that wiki page 14:40:32 topic: Legacy Data 14:40:41 http://www.w3.org/2011/gld/wiki/File:LegacyData.pdf 14:41:07 scribenick: BenediktKaempgen 14:41:34 cygri: would have to dig, the OBO world has best practice advice on using opaque URIs which might be relevant. Also I have annedotal evidence though would need to be circumspect about to phrase that in public :) 14:41:34 Spyros: On Dublin data rdfized to RDF 14:41:49 s/about/about how/ 14:42:24 q+ on the term 'legacy data' 14:42:28 ... what is legacy data? Is gov supposed to transform all data (e..g., pdfs, scan, xsl)? 14:42:45 ... most data from relational db 14:42:50 cygri: we also have a paper for las dc conf on multilingual URIs 14:43:01 stasinos has joined #gld 14:43:07 where we review obo and others 14:43:30 ... often also: geo data, temporal data (statistics), record oriented relational data (e.g., about citizens) 14:43:46 Spyros' slides are now linked from the agenda 14:44:24 scribe: BenediktKaempgen 14:45:02 q+ re prioritisation of data sources - demand driven 14:45:22 ... concerns: privacy issues (who can assess whether something is privacy sensitive?), how much to publish (efficiently, considering the costs), ... 14:46:23 bhyland has joined #gld 14:46:26 ping 14:46:27 ... considering risks with opening up data; how about institutions that are not quite government 14:47:00 Sorry is this is a repeat, per the charter on legacy data: "Legacy Data. The group will produce specific advice concerning how to expose legacy data, data which is being maintained in pre-existing (non-linked-data) systems. 14:47:02 olyerickson has joined #GLD 14:47:25 ... also technical issues: architecture, what visualizations (applications consuming data), how to facilitate use by non-experts 14:47:46 q? 14:48:21 ... how to automate such processes 14:48:58 ... how to provide guidance/template/references/cookbook for processes 14:49:16 q+ to ask what makes this 'legacy' 14:49:30 ... transforming data into RDF often possible but might be awkward 14:49:39 q? 14:49:48 ack me 14:49:48 mhausenblas, you wanted to comment on the term 'legacy data' and to discuss prioritisation of data sources - demand driven 14:50:33 I think we should consider referring to "data life cycle" ala http://www.ddialliance.org/what (DDI Alliance) 14:50:40 mhausenblas: two reactions: term legacy data, maybe we should use a different term (e.g., raw data) 14:50:48 davidwood_ has joined #gld 14:51:20 zakim, davidwood is me 14:51:20 sorry, bhyland, I do not recognize a party named 'davidwood' 14:51:43 bhyland, please check your phone for immediate text message requiring your action. Sorry to interrupt. 14:51:44 I think the core question is, what best practices for data life cycle management should this group make that pertain to GLD? 14:51:50 davidwood_ has left #gld 14:51:52 ... Secondly, question always: where to start publishing data? Uptake then further drives publishing process. User-pull rather than publisher-push. 14:52:36 q? 14:52:43 ack DaveReynolds 14:52:43 ack me 14:52:44 DaveReynolds, you wanted to ask what makes this 'legacy' 14:52:51 me thinks we're talking about exposing RDB's ergo R2RML 14:53:02 q+ 14:53:20 Michael: re multimedia interlinking see http://events.linkeddata.org/ldow2009/papers/ldow2009_paper17.pdf 14:53:28 DaveReynolds: RDF can walk along "legacy"/raw data 14:53:32 q? 14:53:54 ... Representing key parts in raw data/legacy is difficult. 14:54:42 zakim, mute me 14:54:42 DaveReynolds should now be muted 14:54:46 +1 - exposing these existing/emerging W3 works for this 14:54:50 q? 14:54:51 q? 14:54:56 Richard: We should list related work (R2RML, M, Griddle, xslt...) 14:54:56 q? 14:55:01 cygri: There are a number of existing W3C standards that already address the transformation part (R2RML, GRDDL, etc.) 14:55:08 ack me 14:55:18 I think the real issue is how to integrate LD best practices with your existing data life cycle management infrastructure 14:55:37 +1 to what olyerickson 14:55:38 bhyland: Spyros points out how broad the description of legacy data is in the charter 14:55:47 ... we should set some boundaries 14:55:48 +1 olyericksson 14:55:50 Bhyland: How to bound this topic? 14:55:52 s/what olyerickson/what olyerickson said 14:55:52 +1 to byhland. bounding is important 14:55:53 1+ 14:55:56 q+ 14:56:13 Michael: Scope should be on W3C standards and then expand 14:56:31 q? 14:56:38 ack me 14:56:41 @bhyland please re-state what to take a stab at... 14:56:46 Topic: Legacy Data discussion 14:57:22 bhyland: the charter is very broad in the description of what is to be included in the "Legacy" section of the BP Recommendation. 14:57:28 mhausenblas: What resources are available? 14:57:42 q+ 14:57:55 We need to bound it. Suggest we put some lines in the sand as to what is "in" and we'll be able to reasonably do within the next 6 mos in this WG. 14:58:09 mhausenblas: IBM Biplav/Spiros resource committment to drive expeccted 'legacy' contribution 14:58:35 q+ 14:58:37 Spyros is here on behalf of IBM and is an invited guest of the F2F. Thus, he cannot make make committments for IBM to this WG. 14:59:33 q? 14:59:35 mhausenblas: if we go for a broad interpretation of this topic, then we need people and volunteers 14:59:50 ack cygri 15:00:14 cygri: Agree with boundaries. Good starting point would be W3C standards. 15:00:24 q- 15:00:36 danbri has joined #gld 15:00:41 t_gheen has joined #gld 15:00:53 better arbitrary than nothing? 15:00:55 cygri: E.g., it would be helpful to describe tools. Risk to be arbitrary with inclusion. 15:01:09 q+ re tools 15:01:12 bhyland_ has joined #gld 15:01:17 cygri: standards, tools, approaches 15:01:39 cygri: Also useful to describe approaches, e.g., for modelling. 15:01:47 Hmmm...this is the first time I realized we were talking about CONVERSION 15:02:07 q? 15:02:11 ... There should be experiences in WG to give recommendations on such processes. 15:02:13 Michael: Against listing tools explicit, but rather provide examples of tool catalogs such as found http://www4.wiwiss.fu-berlin.de/latc/toollibrary/ and a http://www.planet-data.eu/results/datasets-and-tools 15:02:16 ack mhausenblas 15:02:16 mhausenblas, you wanted to discuss tools 15:02:18 q+ 15:02:35 We have some of the content Cygri is describing in the current LOD cookbook, especially as it relates to the auto conversion vs. human-involved modeling. 15:02:45 q? 15:02:49 +1 point at the wiki makes good sense 15:03:02 mhausenblas: Problem with tools is that they can get outdated. 15:03:04 olyerickson: i converted a price from dollars to pounds recently 15:03:21 ... Similar to Vocabulary case, have a checklist. 15:03:37 MichaelH: His bias is on describing checklist approach rather than a specific list of tools which will become dated over time. 15:03:43 @cygri that's the "right" direction, isn't it? 15:03:45 q+ 15:03:48 q? 15:03:54 ack DeirdreLee 15:03:56 ack DeidreLee 15:04:05 Yigal has joined #gld 15:04:27 DeirdreLee: Agrees with not describing tools. But in case of vocabularies makes sense. 15:04:54 q+ 15:05:24 ... Users demands would help with legacy issues. 15:05:30 ack cygri 15:05:40 stasinos has joined #gld 15:06:47 cygri: Agrees with seeing transforming legacy data as a process that needs to be a compromise of effort and benefit. Start with metadata, concept schemes, and later go on with the acutal raw data. Looking at users will really be useful. 15:07:02 q? 15:07:04 cf. http://users.iit.demokritos.gr/~konstant/ 15:07:06 q+ 15:07:43 cygri: Handling legacy AKA "raw data" has some logical starting points and (could go on infinitely). Address misconceptions about converting to RDF as an "augmentation" to existing system. Others convert to RDF and that is it. 15:07:46 +1 to cygri 15:07:49 ack olyerickson 15:07:56 ... Important w.r.t. legacy data: What does it actually mean? What does it implicate? Regarding on the situation, specific approaches may make more sense than others (e.g., transformaing most data into RDF). 15:08:02 DanG has joined #gld 15:08:35 +??P10 15:08:44 q+ to ask if we can agree on a term now, please? should we use original data? source data? 15:08:50 Zakim, ??P10 is stasinos 15:08:50 +stasinos; got it 15:09:23 Michael: Suggest to think along TimBL's 5 star scheme http://5stardata.info/ 15:09:28 q? 15:09:57 olyerickson: Discussion about legacy is not usefull if not seen from perspective of a certain scenario. Best-practice they need is to continuously manage their data. 15:10:11 Refine+DERI_extensions and R2RML covers 80% of the GLD publisher waterfront afaic - i'd love to see standards, tools, approaches covering spreadsheets and RDB's 15:10:21 olyerickson, i didn't see the link you mentioned? 15:10:33 +1 concrete examples are essential 15:10:34 +1 to olyerikson - focus on Linked Data as an access approach and how it ties in to existing data management practice, avoid terms like "legacy" 15:10:38 ... Life examples of tools of how to get specific issues done, might be useful. 15:10:40 q? 15:10:47 ack DeirdreLee 15:10:58 link to DDI Alliance http://www.ddialliance.org/what 15:11:03 q+ to suggest talking about standards instead of tools when possible 15:11:08 Olyerickson: Feels we walk a line between decribing checklists to evaluate vs. associating specific tools to "get the job done." 15:11:21 Link to ANDS recommendations http://ands.org.au/guides/index.html 15:11:46 q+ 15:12:22 olyerickson, are you aware of https://github.com/FranckCo/DDIOnto ? 15:12:23 ack me 15:12:23 mhausenblas, you wanted to ask if we can agree on a term now, please? should we use original data? source data? 15:12:25 Time check: 3 minutes until tea break 15:12:42 q- 15:12:46 ack me 15:12:46 cygri, you wanted to suggest talking about standards instead of tools when possible 15:12:52 DeirdreLee: Concrete example: EU Inspires (?) data publishing very cumbersome. To sell the approaches to government may be very difficult. 15:12:53 Aside: INSPIRE can be met via linked data, e.g. UK has proposed URI guidelines for naming INSIPRE spatial objects. 15:12:54 +1 source data (although I don't have any 'legacy' heartburn...) 15:12:54 Agreed: Legacy data to be recast as "raw data" 15:13:06 @cygri No I wasn't, thanks! 15:13:17 Legacy? How about "metadata-challenged" 15:13:30 mhausenblas: Shall we use a different term than legacy. Suggestion: See it in terms of TimBL star schema. 15:13:50 legacy/raw data is 'existing' data. Linked Data is simply an extra way to represent 'existing' data 15:13:53 +1 to hopping TBL's 'raw data' bandwagon, however 'raw data' is a misnomer in my gov experience 15:13:59 ... rename legacy to raw data. 15:14:23 -1 to "raw data" that caused problems when TBL used it 15:14:36 PROPOSAL: To use 'raw data' rather then 'legcay data' along TimBL's 5 star scheme 15:14:37 kind of -1 to "raw data". statisticians hate that 15:14:44 Proposal: To use the term 'Raw Data' to refer to existing data 15:14:48 +1 to "source data" over "raw data"... 15:14:59 "non-RDF data"? 15:15:00 +1 to source data 15:15:08 Proposal: Not carried 15:15:12 "spreadsheets" 15:15:29 PROPOSAL: To use 'source data' rather then 'legacy data' along TimBL's 5 star scheme 15:15:33 ???: Mainly about spreadheets and relational data. 15:15:37 q? 15:15:42 ack sandro 15:16:02 +2 to sandro 15:16:28 "pre-formal"? 15:16:40 danbri has joined #gld 15:16:56 +1 to exposing...what? ;) 15:17:26 OK, chairs have conferred and we agree ... "Source Data" 15:17:35 mhausenblas' proposal seconded... 15:17:35 PROPOSAL: To use 'non-RDF data' rather then 'legacy data' along TimBL's 5 star scheme 15:17:39 mhausenblas: how to call that first publing working draft 15:17:52 unlinked data! 15:17:55 -1 to non RDF 15:17:58 non-RDF is stilted 15:18:12 -1 to non-RDF 15:18:13 +1 bio 15:18:17 @simonWall some RDF is also unlinked 15:18:27 no open data? 15:18:27 +1 to bladder relief... 15:18:28 +1 to source data 15:18:34 +0 on "source data", it means some different but in a less harmful way than "raw data" 15:18:35 +1 to source data 15:18:42 +1 source data 15:18:51 +0.5 to source data. not my favourite but could work well enough. 15:19:09 +1 source data 15:19:50 bhyland_: we resume at 10:30am/3:30pm 15:19:56 -galway 15:20:01 -sandro 15:20:02 -rreck 15:20:06 -stasinos 15:20:18 are we hanging up? 15:20:31 cmusialek has joined #gld 15:21:01 -simonWall 15:30:00 q? 15:30:42 +sandro 15:31:19 olyerickson has joined #GLD 15:31:27 olyerickson has left #GLD 15:31:30 olyerickson has joined #GLD 15:31:39 do we have to dial in again? 15:31:51 ...or is everyone on mute? 15:33:49 t_gheen has joined #gld 15:33:51 George has joined #gld 15:33:57 ping, is the Galway team read to resume? 15:34:11 yes 15:34:13 sorry 15:34:32 "Interfacing to Existing Data System" 15:34:38 "Providing an RDF Interface" 15:34:39 Galway is coming... 15:34:40 "RDF Interfaces" 15:34:44 galway ping 15:34:46 cygri, mhausenblas ... 15:34:59 +galway 15:35:16 +??P9 15:35:27 Zakim, ??P9 is stasinos 15:35:28 +stasinos; got it 15:35:29 zakim, i'm with galway 15:35:29 +cygri; got it 15:35:40 "Providing RDF Interfaces" 15:35:59 -1 to non-RDF. I prefer "Source Data" 15:36:15 PROPOSAL: To use 'source data' rather then 'legacy data' along TimBL's 5 star scheme 15:36:15 +1 PhilA 15:36:21 +0.5 15:36:27 +1 to "source" too 15:36:30 +0.5 to source data 15:36:54 +0 on "source data", it means some different but in a less harmful way than "raw data" 15:36:55 zakim, SUM(sourceData) 15:36:55 I don't understand 'SUM(sourceData)', olyerickson 15:37:12 +1 source data (although domain specific/original data would be more clear) 15:37:25 and what about "genuine data" ? :) 15:37:28 Proposal for replacement name, it has a "use by date" of at least the FPWD 15:37:31 sandro: the default is we dont revisit decisions. 15:37:35 +0.98 to "source data" 15:37:37 Agreed: "Source Data" 15:37:47 cool 15:37:50 RESOLUTION: To use 'source data' rather then 'legacy data' along TimBL's 5 star scheme 15:37:54 +??P11 15:37:57 RRSAgent, draft minutes 15:37:57 I have made the request to generate http://www.w3.org/2012/01/25-gld-minutes.html mhausenblas 15:38:06 +0 source data okay as long as it's open to revisiting before LC. ( -1 to this term forever) 15:38:16 q? 15:38:43 http://logd.tw.rpi.edu/sites/default/files/w3c_gld_uri_construction_25jan12.pdf 15:38:46 Topic: URI Construction discussion 15:40:22 olyerrickson: we have general URI recommendations, e.g., data patterns. 15:40:56 dvilasuero: Agrees. 15:41:26 What do you mean missing? 15:42:07 olyerrickson: instance-hub-uri-design makes it possible to re-host uris 15:42:31 ... re-host, i.e. move to a different architecture after testing 15:42:37 DanG has joined #gld 15:43:21 ... requirements to uri creation approach: no need to make URI self-describing, non-domain-specific 15:43:22 yes, Michael, it must be. I see legacy discussion in two parts in fact. 15:44:13 DanG has joined #gld 15:44:32 q+ 15:44:53 q+ to caution against using the org component, slide 3 15:45:24 q+ 15:45:52 ... major parts: id, org, category/token 15:47:39 What about using subject matter categories rather than agency based ones? They won't die if the agency does. 15:47:46 ... explanations of examples are linked from the wiki, e.g. in best practices document 15:48:33 DanG - UK recommendation is def to use subject matter and avoid agencies 15:48:36 ... room for discussion. 15:48:36 q? 15:48:43 q+ 15:48:46 ack cygri 15:49:29 This is NOT a recommendation; it's simply what we are ucing 15:49:35 s/ucing/using/ 15:49:46 @Michael - we're planning to break in 15-20 minutes, when we've completed or at least come to natural break point in URI discussion. We have to walk to get our lunch. 15:50:28 I was planning to be gone by now, good night all. 15:50:29 Richard: Good handle of what the section should say. Small concern: some guidelines are applicable everywhere,e.g., slashes, stability; other aspects that apply only to specific use cases. UK gov guidelines mostly only apply to specific environments. 15:50:53 ... E.g., re-hosting is something quite specific. 15:51:04 @cygri good point; that was "merely" RPI's requirement ;) 15:51:05 cygri: The main focus should be on stuff that is "true everywhere". 15:51:33 What always applies vs. more specific example that could be better described as use cases. 15:51:34 ... Recommendations should be more generic. Needed: To abstract from the use cases of TWC or UK gov to have a less complicated design. 15:52:11 Example from DERI, re: data.gov.ie project ... 15:52:31 s/data.gov.ie/http://data-gov.ie 15:52:32 -simonWall 15:52:42 ... approach was to complicated, but this was realized only afterwards. 15:52:49 q? 15:52:50 +1 cygri 15:52:53 ack me 15:52:53 PhilA, you wanted to caution against using the org component, slide 3 15:53:42 PhilA: Concern: Names of governments departements change very often, should not be included in URI. Similar goes for locations. 15:53:53 How about if we provide 1) background on the imporance of URI strategy; 2) the value of persistence strategy; 3) detail the issues involved to evaluate a URI scheme 15:53:55 q? 15:56:00 olyerickson: Good point. But there is always the question whether create URIs from the concepts (from the actual data). If modelled from the data, then even if concepts changed, at that time of modelling the data was valid and as such the URIs are valid still, also. 15:56:52 Yigal has joined #gld 15:56:53 q+ 15:56:55 _dammit or /dammit ?;) 15:56:55 Michael: I don't see much of a point in criticising RPI's work now - he made it clear it's an example, not the recommendation 15:57:21 +1 to sandro's point 15:57:32 sandro: important to have a plan in case a name changes. 15:57:37 ack me 15:57:45 NB: We aren't criticizing RPIs URI draft ... it gave us something in black & white to discuss. Therefore it is good & useful IMO. 15:58:03 {sector}.data.gov.*/id/{thing-type}/{instance}/natural/instance/hierarchy 15:59:00 really good point 15:59:11 DeirdreLee has joined #gld 15:59:13 +1 to Dave's wise words re scalability of URI spaces via sub-domains 15:59:33 DaveReynolds: Describe constants: 1) the constants (e.g., sectors for the UK). 2) use of sub-domains to allow for autonomy within gov't authorities. 3) explain scalability implications involved depending on URI structure. Explain URI construction and allude to performance issues ... 16:00:04 +1 16:00:06 q? 16:00:07 ... Separating the advice of what to do vs. if you don't do it, you'll get bitten in the bum 16:00:08 +1 16:00:09 +1 16:00:10 q? 16:00:11 Stale URIs (from non-existant depts) will make the data look stale, even if it's brand new.... 16:00:19 DaveReynolds: Depends on the use of URIs: stabilized, architecture-dependent. Separation of tools that allow to create uris and the methods of how to deal with issues afterwards. 16:00:20 ack Yigal 16:01:45 Michael: We're implicitly assuming transparent URIs now 16:01:49 Yigal: also responsibilities change. We need to think about temporal issues such as at what time did uri represent something. 16:02:02 q+ 16:02:09 Michael: also known as hackable URIs 16:02:24 ack sandro 16:02:37 Yigal: temporal aspect in URI? which HHS? responsibilities change even if/when orgs don't 16:02:41 t_gheen has joined #gld 16:03:14 For those who may not be aware ... as well as the original UK recommendations http://www.cabinetoffice.gov.uk/media/308995/public_sector_uri.pdf there are recommendations about spatial objects (as relates to the EU INSPIRE directive) http://location.defra.gov.uk/wp-content/uploads/2011/09/Designing_URI_Sets_for_Location-V1.0.pdf useful example of patterns of things beyond "id" and "def" 16:03:32 -sandro 16:03:35 +1 to point out ways in which things can/will break 16:03:52 q? 16:03:52 +sandro 16:04:00 bhyland: We need to bound URI construction topic. 16:04:42 q- 16:05:11 ... On the one hand best practices should be valid as long as possible. On the other hand it should also include more specific issues. 16:06:16 q? 16:06:21 ... cannot tell Google, Yahoo which vocabularies to use. 16:06:22 q? 16:06:34 q? 16:06:49 q? 16:06:50 :-) thanks PhilA. Can someone scribe? 16:07:00 bhyland has joined #gld 16:07:13 scribe: PhilA 16:07:14 Thanks. 16:07:22 q+ 16:07:23 ping 16:07:27 sorry about that! 16:07:46 olyerickson: I'd like to propose that the guidance we're getting -> we should transform what we have so far into a check list or decision tree 16:08:02 thanks! 16:08:08 DruidSmith has joined #gld 16:08:15 sorry from all of us in DC .. we seem to get aperiodically dropped from our guest network and there is no explicit notification ... 16:08:19 olyerickson: highlight the issues. What we're saying is what we did, what we thought about and why we did it 16:08:29 and worst, we loose the IRC history :-( 16:08:39 s/worst/worse of all/ 16:09:32 q? 16:09:40 q? 16:09:44 ack olyerickson 16:09:53 DanG has joined #gld 16:10:16 Yigal has joined #gld 16:10:39 UK guidance also talks about /def (controversy!) and /dataset among other topics 16:10:47 bhyland: I would say, not having read the UK guidance in 8 months or so - that's more comprehensive and thought out. We should consider others (and strip out the UK-specific stuff) 16:10:49 q? 16:10:57 sandro: I like the decision tree idea a lot 16:11:01 +1 to "decision tree" idea 16:11:33 sandro: "Don't do this" or it will cause problems later and "you probably don't want to do this but you may have reasons not to" and so on 16:12:21 bhyland: cmusialekhas a mission to do today. I don't think the guidance is ready for him. The RPI draft is a good input - needs to be discussed further 16:12:26 "RPI thing" is not a draft...it's what we did and why ;) 16:12:59 q? 16:13:08 Wait a minute...I think ChrisM is a test subject and should actually 16:13:13 cmusialek: I'm less familiar with the intricacies of URI design. But I'm hearing that it's time to act from the US gov and maybe get 80% right 16:13:17 s/cmusialekhas/cmusialek/ 16:13:22 olyerickson: I'm going to disagree with you, bhyland 16:13:54 olyerickson: We're not saying to Chris, go ahead and use this. I'm saying "try it, see what breaks and let us know" 16:14:08 Olyerickson is saying the draft RPI URI guidance is a proposal ... try it and give us feedback. 16:14:11 olyerickson: I'd also say take a look at the UK advice and tell us what the problem is 16:14:22 +1 Olyerickson 16:14:25 RPI has used the RPI version for a very specific case. 16:14:27 olyerickson: We've used ours for a very specific case 16:14:35 community drives standards or standards drive community? 16:14:53 q? 16:14:55 The former DeirdreLee (if it's to be used) 16:14:57 q+ 16:15:18 ack cygri 16:15:29 cygri: I wanted to say that in terms of structuring these BP Recommendations, I agree with sandro and mhausenblas to structure these as a list 16:15:49 cygri: Seems a good way to teach/inform 16:15:54 +1 to cautioning about what might go wrong...BUT it needs to be informed advice 16:16:19 q? 16:16:22 cygri: Have you thought about future change? Is there 'cruft' in there (scribe doesn't recognise the term cruft but that's life) 16:16:25 +1 to richard 16:16:26 ack bhyland 16:16:30 +10000 to cygri 16:16:35 ack bhyland 16:17:25 bhyland: I appreciate John's request for data.gov to take the RPI advice and see how it works. That might be the RPI state, but I'm not sure it's the W3C position as it hasn't been sanctioned by the WG 16:17:31 olyerickson, who suggested giving uninformed advice? ;-) 16:17:32 +1 to keeping things separate 16:18:04 q+ then break 16:18:32 ack then, 16:18:38 ack break 16:18:38 @cygri I didn't mean...hmmm...what did I mean ;) 16:18:45 ack then 16:18:54 AnneW: How do we iterate through a suggested set of guidelines & recommendations? 16:19:29 PROPOSAL: URI sub-team work on a check-list for URI construction 16:19:29 cmusialek has joined #gld 16:19:31 sandro: The WG is supposed to iterate on the doc until everyone agrees with it 16:19:38 ... then it goes to the outside world 16:19:43 ... etc. 16:19:45 +1 to olyerickson proposal 16:19:58 +1 to oleyrickson 16:20:03 sandro: The normal W3C process is that the group reviews and once they don't have any problems with it, then goes to last candidate review for feedback. 16:20:09 +1 to oleyrickson 16:20:46 bhyland: What RPI has provided is a draft. But let's encourage cmusialek to be part of the discussion as it continues to evolve 16:20:54 +1 to cmusialek et.al. be part of the conversation 16:20:56 +rreck 16:20:59 q? 16:21:07 bhyland: Are we at a natural breaking point? 16:21:08 In reference to using Congressional Districts as example: Is everyone aware that these are redrawn every 10 years? 16:21:11 GalwayL YES 16:21:35 -stasinos 16:21:44 are we hanging up? 16:21:50 reconvene at 12:25 and 5:15pm 16:22:01 I can also stay on bridge... 16:22:02 Zakim, mute galway 16:22:02 galway should now be muted 16:22:10 reconvene at 12:15 and 5:15pm 16:22:26 zakim, mute olyerickson. 16:22:26 olyerickson should now be muted 16:22:31 Zakim, unmute galway 16:22:31 galway should no longer be muted 16:24:13 Zakim, mute galway 16:24:13 galway should now be muted 16:24:18 -sandro 16:24:24 -rreck 16:24:31 -GeraldSteeman 16:27:06 -DaveReynolds 16:28:14 Wiki record of these minutes is up to date at this point 16:36:18 Note for the "Vocabulary Selection" team: check out the recent addition to DataFAQs re: the role of vocabulary selection in Linked data quality https://github.com/timrdf/DataFAQs/wiki/Assisting-vocabulary-selection 16:43:26 I have to go now, unfortunately, Richard is taking over Galway. Literally. :) 16:50:17 boris has joined #gld 17:09:18 +sandro 17:10:24 sandro: the Washington room is still empty (the video link is showing us that). I can ping you when we're about to reconvene if you like? 17:22:09 mhausenblas has joined #gld 17:22:19 DeirdreLee has joined #gld 17:25:06 -galway 17:25:19 dvilasuero has joined #gld 17:25:48 zkim, code 17:25:52 zakim, code? 17:25:52 the conference code is 4531 (tel:+1.617.761.6200 sip:zakim@voip.w3.org), cygri 17:26:01 +galway 17:27:24 +??P1 17:27:29 zakim, dvilasuero is with galway 17:27:29 +dvilasuero; got it 17:27:34 zakim, ??P1 is me 17:27:34 +DaveReynolds; got it 17:27:42 zakim, boris is with galway 17:27:42 +boris; got it 17:27:59 zakim, who is one the phone? 17:27:59 I don't understand your question, olyerickson. 17:28:09 zakim, who is on the phone? 17:28:09 On the phone I see Washington, olyerickson (muted), ChristopheGueret, sandro, galway, DaveReynolds 17:28:12 galway has galway, dvilasuero, boris 17:28:37 zakim, galway has BartvanLeeuwen, BenediktKaempgen, boris, cygri, DeirdreLee, dvilasuero, fadi, GofranShukair, PhilA, SpyrosKotoulas 17:28:37 boris was already listed in galway, PhilA 17:28:38 dvilasuero was already listed in galway, PhilA 17:28:39 zakim, i'm with galway 17:28:40 +BartvanLeeuwen, BenediktKaempgen, cygri, DeirdreLee, fadi, GofranShukair, PhilA, SpyrosKotoulas; got it 17:28:43 zakim, csarven is with galway 17:28:44 cygri was already listed in galway, cygri 17:28:47 +csarven; got it 17:29:56 t_gheen has joined #gld 17:31:00 Topic: Discussion on Best Practices for Publishing Government Linked Data (FPWD) 17:31:27 bern: Did a big restructuring of the wiki page yesterday 17:33:07 http://www.w3.org/2011/gld/charter 17:33:16 George_ has joined #gld 17:33:22 BernHyland: Two questions - how do we move from a wiki to a FPWD, and how do we reflect fture changes 17:33:32 s/fture/future/ 17:33:50 sandro: We can publish directly from the wiki using a transformation script we have 17:34:09 sandro: It's called RevDoc. It's only my WGs that have used it 17:34:13 ... so far 17:34:17 ... code is not polished 17:34:32 ... alternative is to convert to respec which a lot of folk prefer 17:34:36 +GeraldSteeman 17:35:03 bernHyland: does it require your help to use RevDoc? 17:35:14 sandro: yes - incantations and bones are involved 17:35:22 sandro: it could be useful but there are alternatives 17:36:00 bernHyland: Respec is the alternative 17:36:13 q+ to mention ReSpec 2 17:36:21 bh: I'm familiar with Respec so I'd rather use that 17:36:34 Yigal has joined #gld 17:37:00 bh: I'll need help from people to make sure that they remember to record who changes what and when 17:37:30 bhyland has joined #gld 17:37:32 ping 17:37:38 http://dev.w3.org/2009/respec2/ 17:38:01 ack me 17:38:01 cygri, you wanted to mention ReSpec 2 17:38:28 bh: There seemed to be a lot of activity last September in terms for formatting that we can look at 17:39:06 q+ to ask what documents the group is going to publish 17:39:51 sandro: One month off is OK. But we can put changes on the front page of the wiki 17:39:54 ack cygri 17:39:58 cygri, you wanted to ask what documents the group is going to publish 17:40:02 q? 17:40:17 cygri: Do we have something like a complete list of the documents that the WG is going to produce (Rec and non-Rec) 17:40:41 +1 a clear list of docs and intended status would be helpful 17:40:55 bhyland: The community directory is published, BPs will be a Rec, 17:41:13 +1 to list of documents that will be produced 17:41:26 sandro: The Wg should do what it thinks is best, there are no rules as such 17:41:58 bhyland: We should put things that logically go together in a single doc. For e.g. we might have a lot of stuff about URI consutruction that could be separate 17:42:18 bhyland: The Cookbook isn't a Rec - it could become part of the directory 17:43:09 George_: The milestones section of the charter says that the directory and cookbook are separate 17:43:23 sandro: I think we should remain open to splitting docs as we see fit 17:43:24 Michael: Regarding publishing the BP FPWD, I think boris and I already had a chat, no? Boris, can you share our proposal on the call, please? 17:43:44 q+ 17:43:50 Mike_Pendleton has joined #gld 17:44:03 q+ 17:44:12 Michael: Essentially, the idea was to manually transfer the content from the Wiki - we're three Editors, so workload-wise this should work 17:44:47 +sandro.a 17:44:50 cygri: I thought the Recommended vocabs were going to be in separate docs (DCAT and Data Cube) but I don't know about the otehr areas 17:44:50 -sandro 17:45:04 q- 17:45:04 cygri: suggested re: recommended vocabs, have one doc for DCAT, another for DataCube 17:45:13 Michael: Just to make it clear - I'm against the script-based version from the Wiki as we have a rather messy structure there and I don't wanna play guinea pig. sorry sandro, no offence meant 17:45:34 sandro: If we're just going to endorse someone else's vocab we don't need a big doc for that 17:45:45 bhyland: How much of the data cube spec is already written? 17:46:17 cygri: We have a spec that is pretty much ready. We might want to add things and improve things but in principle there is an existing spec that covers what you woujld expect it to 17:46:35 ... we will need to write more if we decide that there are issues that need to be addressed? 17:47:12 bhyland: Is there any benefit for having this as a separate doc? 17:47:15 +1 that's what I thought 17:47:29 cygri: Yes, that makes sense and I already have an action item to create it 17:47:37 ... with help from DaveReynolds et al 17:47:50 +1 QB should be an own spec 17:48:00 +1 17:48:01 +1 separate spec for qb 17:48:15 q? 17:48:17 ack boris 17:48:58 boris: wrt the draft of the BP spec - we (Michael, Bern and I will create the doc, people only need to update the wiki) 17:49:03 bhyland: Agreed 17:49:08 olyerickson has joined #GLD 17:49:12 Topic: Community Directory 17:49:44 Slides are at http://www.w3.org/2011/gld/wiki/images/c/c6/BHyland_W3C_GLD_WG_F2F2_Directory.pdf 17:50:32 bhyland: The idea is the the CD (Community Directory) is a place where people not necessarily familiar with LD can get some guidance 17:50:53 bhyland: The initial CD was put together with some loose requirements from the June f2f 17:53:03 DanG has joined #gld 17:53:12 bhyland: Talks through her slides 17:53:59 bhyland: Haven't had a lot of feedback - need and would like more 17:54:14 zakim, mute me 17:54:14 DaveReynolds should now be muted 17:54:18 bhyland: So where do we go? semanticweb.org? SWEO? 17:54:39 bhyland: Now that we have a working site, we can seek feedback, maybe open it up 17:55:10 q? 17:55:13 q? 17:55:14 q? 17:55:25 q? 17:56:33 bhyland: I think the first thing is to make sure that people think it's a good idea 17:56:48 RE UI, actually simple is good --- priority should be *useful* 17:57:28 bhyland: Biplav asked what a company like IBM should put in? What's the (relevant) address for IBM? 17:57:41 bhyland: addresses for global/multi-national concern is good topic for vocab rec tomorrow 17:57:48 cygri: DERI is listed in there. We did that because we were asked to do it 17:58:05 cygri: But I was thinking about why I should want to return to it to make sure our data is up to date? 17:58:16 q+ 17:58:28 cygri: what's the incentive for data freshness on the CD? 17:58:31 cygri: If people come here to find info about expertise then obviously we'd want to be properly represented 17:58:56 cygri: We have an interest in being found if people are looking for LD expertise in Ireland 17:59:32 we need a vocabulary to describe Linked Data domain :) 17:59:35 cygri: I'd be interested to know who else is working on the kind of thing we do 17:59:46 cygri: LD Communities of Interest/Practice query where? 18:00:42 q? 18:01:20 bhyland: We used the W3C CSS and then made changes - we'd like to make the side panel batter 18:01:23 q+ 18:01:24 q? 18:01:24 q? 18:01:38 zakim, unmute me. 18:01:38 olyerickson should no longer be muted 18:02:34 olyerickson: Don't be too hard on yourself. It looks good and it's hard to do faceted browsing 18:03:25 olyerickson: There seems to be some interlinking that is not linking up. If you choose a company, then look at the topics, then try and click on those, what you expect to see is a re-listing of relevant companies 18:04:29 bhyland: Agree it would be useful to have tool tips around different terms 18:04:37 q+ 18:04:38 ... such as adding tool tips 18:04:41 ack olyerickson 18:04:47 q- 18:05:35 bhyland: I agree with cygri that if you know people are finding you through the DIR then you'll be more careful about keeping it up to date 18:05:41 ]ack sandro 18:05:43 q? 18:05:45 ack sandro 18:06:26 q+ 18:06:47 sandro: I'm super picky about sites as a user. But I do have to wonder about a bit of usability testing wouldn't be a bad thing. Unless it delivers a good experience on attempt 1 you might lose people 18:07:07 sandro: Are there way that other people could contribute improvements? Fork? 18:07:13 cmusialek has joined #gld 18:07:24 -DaveReynolds 18:07:30 sandro: I'm not sure how Callimachus puts things together. Are there grad students that could do stuff with it? 18:08:01 bhyland: They're welcome to download the code and work on it. This is built on v.12 - we're now on v.16 which now includes import/export of apps 18:08:19 bhyland: updating the instance doesn't take a lot of work 18:08:40 DaveReynolds has left #gld 18:08:50 bhyland: There's not a large technical hurdle to overcome. Just a bit of CSS and JS 18:09:09 ... what I'd like is a list of features that we can fix 18:09:17 ... expecially if they're trivial! 18:09:39 s/expecially/especially/ 18:10:07 bhyland: We went to a lot of trouble to get it on a w3 domain for reasons of permanence etc. Got to be easy to use 18:10:15 sandro: Do you have an issues list? 18:10:24 @sandro VERY good point!! 18:10:30 +1 to sandro 18:10:38 is there a github wiki? 18:10:43 bhyland: I'll ask James how he wants to queue up issues 18:11:14 bhyland: Things like needing a login is surprising 18:11:23 q? 18:11:24 What is the code host? github? Google Code? each have built-in issues trackers 18:11:25 .... but maybe that's a good thing to prevent the spam 18:11:45 action: bhyland to set up an issues list for dir.w3.org 18:11:46 Created ACTION-33 - Set up an issues list for dir.w3.org [on Bernadette Hyland - due 2012-02-01]. 18:12:10 Problem solved: http://code.google.com/p/callimachus/issues/list 18:12:19 bhyland: Then we can see what is easy and what needs more work to implement, prioritise etc. 18:12:29 q? 18:12:32 q? 18:12:36 Ah okay 18:12:48 sandro: There's a Callimachus issues list, what we need is a dir.w3.org issue list 18:12:58 @sandro thanks for the clarification 18:13:05 q? 18:13:07 bhyland: Obviously James and I are best places to decide if it's a Callimachus or dir.w3.org issue 18:13:12 ack cygri 18:13:35 cygri: I wanted to give an armchair view of usability but not sure if that' the bets use of our time? 18:13:36 now you can submit issues. :-) 18:14:25 q+ 18:14:29 bhyland: I have an bias towards action - I was expecting some philosophical issues to deal with 18:14:38 ack Mike_Pendleton 18:14:44 on the philosophical side, i just want to know whether it's httpRange-14 compliant 18:15:05 I'm going to need to ask other people in the government space and see what they expect and compare it with what there is 18:15:47 Mike_Pendleton: bugs in Firefox display 18:15:49 Mike_Pendleton: The left had side has a list of things that may or may not mean anything to people. ... Conversation then found a bug 18:16:32 Mike_Pendleton: continues to give thoughts to bhyland who takes notes... 18:17:38 q+ 18:17:55 bhyland: how about visualizations? 18:17:58 http://dir.w3.org/page/number-of-organizations-by-country.xhtml?view 18:18:04 ... names queries? 18:18:11 s/names/named 18:18:32 Hmmm. When I'm looking at an "area of expertise", like http://dir.w3.org/scheme/organizational+categories/rdf+store?view ... I don't see who has that expertise. 18:18:33 ... are there other ways to view the information that are more meaninful? 18:18:34 Not sure what you're looking at... 18:18:42 q? 18:19:19 Okay, bhyland was referring to visualizations on http://dir.w3.org/page/number-of-organizations-by-country.xhtml?view 18:19:40 bhyland: any suggestions for linking up with egov interest group? 18:19:50 ... open knowledge foundation 18:20:42 q? 18:21:20 q? 18:22:02 PhilA: That's best achieved by a personal conversation 18:22:20 cygri: We work with OKFN and can tell them about it. The DIR isn't quite there yet though 18:23:35 q? 18:23:41 q? 18:23:52 ack BartvanLeeuwen 18:23:59 ACTION: bhyland convene a meeting on armchair usability for community directory 18:24:00 Created ACTION-34 - Convene a meeting on armchair usability for community directory [on Bernadette Hyland - due 2012-02-01]. 18:24:31 BartvanLeeuwen: Backing up a bit... if we think about being able to pull some data directly from the Web, perhaps through gr: data? 18:25:04 BartvanLeeuwen: GR for company products services (Deirdre - vocab for LD domain?) 18:25:26 bhyland: how does GR pull/update company info? 18:25:38 bhyland: I think it's a really good suggestion. 18:25:52 ... and then pull that from where ever into the CD 18:25:56 Lots of red faces around the table looking at the large pile of uneaten dog food 18:26:04 q+ 18:26:38 bhyland: We could offer guidance on what RDFa to include on your site, then we could accept a URL of a page to parse and then that could be added to the directory 18:26:39 bhyland: if there was basic RDFa on someone's site, how can we automatically update their info in the directory? 18:26:51 bhyland: It's tiresome to have to enter that by hand in 2012 18:27:14 sandro: I think we'd want to support the system being able to import data from a given location 18:27:23 BartvanLeeuwen: and preferably auto-updating too 18:27:35 sandro: auto slurping high on list of to do's in general 18:27:36 washington seems to have dropped off skype? 18:27:43 q+ 18:27:43 Core business vocabulary? 18:27:51 bhyland: what is the state of the art for scraping a page, RDFa? 18:27:58 danbri has joined #gld 18:28:01 bhyland: What's the state of the art for being able to scrape a site for RDFa, 18:28:14 sandro: It doesn't have to be RDFa, it can be any RDF format 18:28:15 https://joinup.ec.europa.eu/asset/core_business/home 18:28:22 BartvanLeeuwen: I'm willing to take a look at it 18:29:04 bhyland: take our site - say we had a book that we'd published. And we marked up the page with data. How to do we say look at this and this but not that 18:29:05 q? 18:29:09 q? 18:29:16 q+ to say this is not on the critical path for the community directory 18:29:36 BartvanLeeuwen: If you look at GR you can say what your service offerings are 18:30:28 action: BartvanLeeuwen to investigate GR ingest from CD provided page 18:30:29 Sorry, couldn't find user - BartvanLeeuwen 18:30:32 q? 18:30:57 action: BartvanLeeuwen to investigate how Good Relations etc could assist with automatically filling up the directory 18:30:57 Sorry, couldn't find user - BartvanLeeuwen 18:31:05 can we please not make this more complicated than necessary 18:31:10 action: Leeuwen to investigate how Good Relations etc could assist with automatically filling up the directory 18:31:11 Sorry, couldn't find user - Leeuwen 18:31:18 action: van Leeuwen to investigate how Good Relations etc could assist with automatically filling up the directory 18:31:18 Created ACTION-35 - Leeuwen to investigate how Good Relations etc could assist with automatically filling up the directory [on Bart van Leeuwen - due 2012-02-01]. 18:31:25 ack cygri 18:31:25 cygri, you wanted to say this is not on the critical path for the community directory 18:31:27 q? 18:31:28 q? 18:32:04 cygri: I'm all for eating our own dog food. At the same time, to make the CD a success, the question of whetehr it can slup in RDFa is not necessarily the most important 18:32:15 but it does speak to the freshness and updating issue :) 18:32:20 cygri: I don't want to discourage people looking at it, but it's not priority number 1 18:32:31 mhausenblas has joined #gld 18:32:38 cygri: So this is a vendor directory for LD organisations etc, yes? 18:32:57 cygri: Are there similar examples of sites that do the same for other areas? 18:32:57 It is broader than a vendor directory. 18:33:17 Cygri: is there an analogous site to this one? 18:33:20 cygri: Can we find an example of something that achieves what we want to do in a differnt domain? 18:33:26 DanG has joined #gld 18:33:34 bhyland: The library community likes directories 18:33:43 bhyland: It's not just about vendors 18:33:56 agree with Mike_Pendleton wrt being aligned with Procurement 18:33:57 bhyland: It's about finding expertise, whether commercial, academic or whatever 18:34:38 bhyland: there are many examples of these kinds of directories - ex. travel sites 18:34:48 biomedical directories 18:35:32 ack olyerickson 18:35:44 Mike_Pendleton has joined #gld 18:35:45 olyerickson: I'll reinforce what others have said about KISS 18:36:09 olyerickson: It's hard to get people to add their data, even harder to get them to recode their websites 18:36:22 q+ 18:36:46 ... if we want to be able to slurp in pre-cooked RDF then great, but maybe that should be a separate file 18:37:03 olyerickson: an option for GR is having a separate location of the RDF info 18:37:14 q- 18:37:40 olyerickson: If I add my company info into the CD then it would be nice if the CD made an RDF file available that I could then add to my site 18:37:49 bhyland: Love that suggestion 18:38:08 bhyland: It's a Foafomatic tool - great 18:38:26 me thinks that's what BartvanLeeuwen meant in the first place, + the idea that callimachus could/should also serve as a RDFa template for those that can/will publish that 18:38:35 bhyland: You get something back for your effort 18:38:41 George_, ack 18:38:45 ack DeirdreLee 18:38:46 ;) 18:39:17 DeirdreLee: It seems the CD seems to be taking a centralised approach. We want people to put theire data out there and then third party tools can use it 18:39:29 DeirdreLee: And the CD is a third party tool in this context 18:39:46 q+ to rebut that 18:39:59 otherwise we'll pull it from dbpedia :) 18:40:14 q- 18:40:16 bhyland: Yep, think distributed, think linked data 18:40:28 q+ 18:40:55 q- 18:40:57 bhyland: summarises what she's taken down so far (and Sandro reminds her he's on the q) 18:41:19 cmusialek has joined #gld 18:41:52 Topic: Linked Data Cookbook 18:41:53 topic: Linked Data Cookbook 18:42:42 http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook 18:43:24 bhyland: It uses a linking gov data chapter I wrote from last November 18:43:36 ... we got permission to keep the copyright 18:43:45 ... some of it prob belongs in the best practices 18:44:01 ... useful if you've had a chance to review it of course 18:44:30 boris: The content looks the same as the BP working draft - is it not the same? 18:45:00 cygri: Refers to the charter... 18:45:14 The group will produce a collection of advice on smaller, more specific issues, where known solutions exist to problems collected for the Community Directory. This document is to be published as a Working Group Note, or website, rather than a Recommendation. It may, instead, become part of the Community Directory site. 18:46:22 +q 18:46:57 BenediktKaempgen: We have been talking about the BP as a static document and it shouldn't be too specific as it will go out of date. The cookbook is more of a live document/resource 18:47:15 BartvanLeeuwen: I see it as a more specific document and yes, a living one 18:47:24 ack BenediktKaempgen 18:47:38 q+ 18:47:52 BenediktKaempgen: For example, a list of the current, most important vocabularies - that's a useful start for individuals 18:48:06 BenediktKaempgen: posits list of vocabs as example of 'smaller more specific' 18:48:07 how about: when to use RDF/XML vs Turtle vs RDFa vs SPARQL ? 18:48:11 ... so the criteria go in the BP doc, ones that meeti the crierta go in the cookbook 18:48:16 ack DeirdreLee 18:48:28 DeirdreLee: What's the government element of the cookbook? 18:48:30 q+ 18:48:32 DeirdreLee: what's the Gov angle? 18:48:52 DeirdreLee: It seems as if it could cover life sciences etc. ... 18:49:10 bhyland: There is a lot of overlap and may overlap the Linked Data Platform WG too 18:49:48 bh: I write the various entries with gov in mind even though things can be used elsewhere too 18:50:20 bh: 80%+ can apply to any LD project, yes - but people from gov will gravitate to it on w3.org 18:50:52 q? 18:51:20 Sorry ..I have to go ..bye everyone see you tomorrow 18:51:30 http://www.w3.org/TR/swbp-vocab-pub/ 18:51:46 ack sandro 18:52:14 http://answers.semanticweb.com/ 18:52:25 sandro: I picture the cookbook as an FAQ, stak overflow type thing 18:52:58 sandro: There are 30-40 questions that gov people will ask when asked to consider implementing LD 18:53:27 bhyland: Mike_Pendleton gave me a bunch of questions when we began working with the EPA - yes, that makes sense 18:53:28 +1 to sandro. that made sense to me. 18:53:42 s/stak/stack/ 18:54:06 bh: Thanks for the feedback - that helps me see what needs to be done 18:54:17 q? 18:54:22 q+ 18:54:24 +1 to stack overflow-like functionality (but that's not free anymore) 18:54:55 cygri: So how can we collect those questions? 18:54:57 @bhyland I have to sign off now...apologies. Have a great day, everyone! 18:55:05 -olyerickson 18:55:09 ack cygri 18:55:11 olyerickson has left #GLD 18:55:38 ACTION: bhyland gather top 30-40 questions for the FAQ 18:55:38 Created ACTION-36 - Gather top 30-40 questions for the FAQ [on Bernadette Hyland - due 2012-02-01]. 18:55:39 byland: dare I suggest an action item to collect the questions 18:56:28 mhausenblas, i'm not in charge of the agenda, but it says we stop at 8 18:56:54 +rreck 18:57:12 GeraldSteeman has joined #GLD 18:57:39 list of stackoverflow clones. we could install an instance of one of these.... http://meta.stackoverflow.com/questions/2267/stack-overflow-clones 18:58:09 sandro, why? answers.semanticweb.com is already there. don't fragment 18:58:30 cygri, +1 18:58:35 bhyland: Considers the day, whether we have achieved our targets 18:59:05 bhyland: reviews tomorrow's agenda 18:59:19 cygri, not sure, just brainstorming. are there tags there we can use to help get GLD folks started in the right direction there? 18:59:48 bhyland: anyone not here tomorrow? t_gheen has to meet someone very senior in the West Wing 18:59:55 sandro, not really. it's for asking questions and getting them answered, not really for reading old answers 18:59:57 yes. i have posted the slides 19:00:15 bhyland: We'll talk about stability tomorrow 19:01:06 PhilA: Anne W might want to look at the outcome from the workshop on stability held last month http://www.w3.org/2001/tag/2011/12/dnap-workshop/notes.html 19:01:16 cygri: more DCAT tomorrow 19:01:17 cygri: Would like to talk about DCAT 19:01:21 +1 cygri 19:01:27 +1 on DCAT as the hope is to resolve to go to FPWD 19:01:53 http://www.w3.org/2011/gld/wiki/F2F2#Agenda 19:02:00 q? 19:02:36 +1 more ADMS tomorrow morning too 19:02:52 Current static version of DCAT is at https://www.w3.org/2011/gld/group/WD-DCAT-20120106.html 19:03:42 agreed 19:03:51 with a mandate! 19:05:49 Interoperability Solutions for European Public Administrations http://ec.europa.eu/isa/ 19:06:00 bhyland has joined #gld 19:06:15 Join up https://joinup.ec.europa.eu/ 19:07:00 Thanks all round 19:07:06 Meeting adjourned 19:07:16 -sandro.a 19:07:21 -galway 19:07:28 -rreck 19:07:34 SpyrosKotoulas has left #gld 19:07:36 -Washington 19:07:39 -GeraldSteeman 19:08:16 ping 19:08:41 is someone in Galway publishing the minutes for today?? 19:08:58 RRSAgent, set logs world-visible 19:09:18 RRSAgent, generate minutes 19:09:18 I have made the request to generate http://www.w3.org/2012/01/25-gld-minutes.html George_ 19:10:08 Wiki is up to date, sandro 19:10:19 aww, I was JUST doing that. 19:10:35 Do please check I've done it right - I think I havea 19:11:22 @Sandro, you have your work cut out for you to fix Gofran's name that the bot chocked on all over the place 19:11:22 cygri has joined #gld 19:11:33 Mike_Pendeton has joined #gld 19:11:49 ah, Shakair vs Shukair 19:11:49 to Bernadette http://www4.wiwiss.fu-berlin.de/latc/toollibrary/categories.php#application 19:14:09 hm. who is 'stasinos' 19:17:28 sandro: stasinos konstantopoulos, from NCSR demokritos. Looking for URis for you - but he's a WG member 19:17:46 Then weird the system didnt recognize him. 19:17:46 cygri_ has joined #gld 19:18:15 love his email address on http://users.iit.demokritos.gr/~konstant/ 19:18:31 Member is http://www.iit.demokritos.gr/ 19:18:56 HadleyBeeman has joined #gld 19:19:13 Oh, I see. 19:19:28 was he in at DERI, but late? 19:19:36 No, he just dialled in 19:19:57 I've worked with Demokritos in different ways for some time 19:20:11 My affiliation at the end fot he POWDER stuff was with them 19:21:03 +HadleyBeeman 19:21:03 *nod* 19:21:32 Day 1 minutes done. http://www.w3.org/2011/gld/meeting/2012-01-25 19:21:43 You're all done for today then? 19:21:54 Yes, HadleyBeeman, we finished a bit early. 19:21:56 Yesm HadleyBeeman - all done for today 19:21:59 I just thought I'd pop in for a bit… I'm at home with a cold, and getting a bit restless. 19:22:10 sorry to hear about the cold. 19:22:11 Well, congrats on that. :) Sorry I missed you all. 19:22:14 Hope you get better soon 19:22:15 coming back tomorrow? :-) 19:22:21 Thanks. These thinks happen. 19:22:29 People leaving for the pub now. Bye 19:22:31 Yep, I should be online for most of tomorrow. 19:22:36 Have fun at the pub, PHilA 19:22:51 Hope today was productive! 19:23:01 -HadleyBeeman 19:26:25 PhilA has left #gld 19:53:17 danbri has joined #gld 20:11:33 danbri has joined #gld 20:22:38 danbri has joined #gld 20:26:15 csarven has joined #gld 20:46:51 danbri has joined #gld 20:50:33 danbri has joined #gld 21:18:02 danbri_ has joined #gld 22:05:00 disconnecting the lone participant, ChristopheGueret, in SW_e-Gov( GLD)6:30AM 22:05:05 SW_e-Gov( GLD)6:30AM has ended 22:05:07 Attendees were sandro, simonWall, GeraldSteeman, +3539149aaaa, fadi, galway, mhausenblas, cygri, BenediktKaempgen, csarven, boris, Deirdre, Gofran, PhilA, dvilasuero, 22:05:11 ... BartvanLeeuwen, +1.802.371.aabb, +1.802.371.aacc, +1.202.691.aadd, Washington, +1.802.371.aaee, spyros, DaveReynolds, rreck, +1.518.276.aaff, olyerickson, ChristopheGueret, 22:05:13 ... stasinos, DeirdreLee, GofranShukair, SpyrosKotoulas, HadleyBeeman 23:53:36 olyerickson has joined #GLD 23:53:39 olyerickson has left #GLD 23:59:25 BartvanLeeuwen has joined #gld