From Government Linked Data (GLD) Working Group Wiki
Please justify/explain all edits to this page, in your "edit summary" text.
13:54:29 <RRSAgent> RRSAgent has joined #gld 13:54:29 <RRSAgent> logging to http://www.w3.org/2012/05/31-gld-irc 13:54:31 <trackbot> RRSAgent, make logs world 13:54:31 <Zakim> Zakim has joined #gld 13:54:33 <trackbot> Zakim, this will be GLD 13:54:33 <Zakim> ok, trackbot, I see T&S_GLDWG()10:00AM already started 13:54:34 <trackbot> Meeting: Government Linked Data Working Group Teleconference 13:54:34 <trackbot> Date: 31 May 2012 13:55:48 <olyerickson> olyerickson has joined #gld 13:55:58 <olyerickson> Joining shortly... 13:56:04 <George> George has joined #gld 13:57:23 <bhyland> bhyland has joined #gld 13:57:58 <bhyland> Today's agenda: http://www.w3.org/2011/gld/wiki/Meetings:Telecon20120531 13:58:37 <boris> boris has joined #gld 13:59:18 <Zakim> + +1.847.668.aaaa 13:59:26 <Zakim> +davidwood 13:59:37 <bhyland> zakim, davidwood is me 13:59:37 <Zakim> +bhyland; got it 13:59:48 <Zakim> +??P20 13:59:56 <boris> zakim, ??P20 is me 13:59:56 <Zakim> +boris; got it 13:59:58 <DaveReynolds> DaveReynolds has joined #GLD 14:00:01 <Zakim> +??P24 14:00:03 <Zakim> +mhausenblas 14:00:21 <bhyland> Guest speaker today is John Barker, see bio http://solutions.wolterskluwer.com/blog/author/john/ 14:00:30 <cygri> cygri has joined #gld 14:00:38 <olyerickson> zakim, who is on the phone? 14:00:38 <Zakim> On the phone I see mikependleton, +1.847.668.aaaa, bhyland, boris, ??P24, mhausenblas 14:00:46 <cygri> zakim, i'm with mhausenblas 14:00:46 <Zakim> +cygri; got it 14:00:54 <olyerickson> zakim, I am ??p24. 14:00:55 <Zakim> +olyerickson; got it 14:01:02 <olyerickson> mute me. 14:01:19 <Zakim> +George_Thomas 14:01:35 <Zakim> +gatemezi 14:01:35 <olyerickson> mute me 14:02:03 <boris> http://www.kit.edu/index.php 14:02:04 <Zakim> + +3539149aabb 14:02:17 <gatemezi> gatemezi has joined #gld 14:02:17 <Deirdre> Deirdre has joined #gld 14:02:23 <Zakim> +??P35 14:02:39 <olyerickson> @bhyland: Wie gehts? 14:02:53 <DeirdreLee> DeirdreLee has joined #gld 14:02:56 <Zakim> +BrendanIAB 14:03:02 <gatemezi> Zakim, who is here? 14:03:02 <Zakim> On the phone I see mikependleton, +1.847.668.aaaa, bhyland, boris, olyerickson, mhausenblas, George_Thomas, gatemezi, +3539149aabb, DaveReynolds, BrendanIAB 14:03:06 <Zakim> mhausenblas has cygri 14:03:06 <Zakim> On IRC I see DeirdreLee, gatemezi, cygri, DaveReynolds, boris, bhyland, George, olyerickson, Zakim, RRSAgent, mhausenblas, Mike_Pendleton, MacTed, trackbot, sandro 14:03:07 <MartinKaltenb> MartinKaltenb has joined #gld 14:03:31 <bhyland> zakim, aaaa is JohnBarker 14:03:31 <Zakim> +JohnBarker; got it 14:03:52 <bhyland> zakim, aabb is DeirdreLee 14:03:52 <Zakim> +DeirdreLee; got it 14:04:24 <tinagheen> tinagheen has joined #gld 14:04:38 <olyerickson> 12 people and a robot named Zakim... 14:04:44 <Zakim> +[LC] 14:04:55 <tinagheen> Zakim, [LC] is me 14:04:55 <Zakim> +tinagheen; got it 14:04:59 <Mike_Pendleton> present: Bernadette, Sandro, Michael H., John E, George, Mike P., Boris, John Barker (Wolters Kluwer), Dave, Richard C, Deirdre, Martin, gatemezi, MacTed, Tina G. 14:05:00 <sandro> [ sorry -- last-minute regrets from me. ] 14:05:14 <sandro> [ I think. ] 14:05:36 <bhyland> Propose to accept: http://www.w3.org/2011/gld/wiki/Meetings:Telecon20120524 14:05:43 <mhausenblas> +1 14:05:53 <Mike_Pendleton> Propose to accept minutes? 14:05:55 <George> +1 14:05:55 <boris> +1 14:05:58 <gatemezi> +1 14:06:01 <Mike_Pendleton> Minutes accepted 14:06:04 <MartinKaltenb> +1 14:06:26 <Mike_Pendleton> Scribe: Mike_Pendleton 14:06:40 <bhyland> chair: bhyland 14:06:57 <olyerickson> No. the student we have doing the PROV work is too busy doing the PROV work to talk to me :( 14:07:26 <olyerickson> zakim, mute me. 14:07:26 <Zakim> olyerickson should now be muted 14:07:34 <olyerickson> zakim, you're a peach! 14:07:34 <Zakim> I don't understand 'you're a peach!', olyerickson 14:07:49 <Mike_Pendleton> Bernadette intoduced John Barker, Wolters Kluwer 14:09:35 <gatemezi> URL for Wolter Kluwer: http://solutions.wolterskluwer.com/blog/author/john/ 14:09:43 <olyerickson> John Barker's related blog post: http://bit.ly/JPaUDW 14:10:59 <Mike_Pendleton> John leading interactive conversation; open to separate dialog later 14:11:03 <bhyland> q? 14:12:35 <olyerickson> Wolters Kluwer intro 14:12:51 <Mike_Pendleton> John B: WK is a Dutch company, operations in 35 countries; provide content, now software combined with content; take public domain data, classify, enrich and publish it for customers. 14:13:24 <Mike_Pendleton> ...supplier of data to government and cosumer of government data 14:13:58 <Mike_Pendleton> ...helped US publish IRS code 100 years ago 14:14:27 <Mike_Pendleton> ...topical classification scheme used by multiple governments 14:16:28 <olyerickson> Q: John is saying these silos would be "ideally" linked, but are not necessarily linked (today) 14:17:10 <Zakim> +Sandro 14:17:35 <olyerickson> Q: Does Wolters Kluwer have it's own identification scheme? Does it leverage existing LEIs? 14:18:30 <Mike_Pendleton> ...legislation is comprehensively linked to exec branch regulations; WC collects data from government; glad US is offering free content and investing in more metadata - leads to better outcomes for WC customers. Would like government branches to agree on standards on how different source of law are referenced. 14:18:58 <bhyland> s/WC/WK 14:19:09 <bhyland> WK = Wolters Kluwer 14:20:38 <Mike_Pendleton> ...WC desire for standards is beyond U.S., and includes China. WC wants a time based element to metadata. Research is spent on determining what version of a statute is being used. 14:20:46 <bhyland> s/WC/WK 14:21:43 <MartinKaltenb> the mentioned project with WKD (Wolters Kluwer Germany) is LOD2: http://lod2.eu 14:21:44 <olyerickson> Q: What does "working with" mean, esp what additional metadata is being suggested? 14:21:46 <mhausenblas> Michael: FYI - we're working with Christian Dirschl in the LOD2 project http://lod2.eu/ 14:22:06 <MartinKaltenb> Hi Michael - parallel thinking... 14:22:09 <olyerickson> Q: How is the described work different than e.g. DataCube? 14:22:32 <Mike_Pendleton> ...Working with Open Knowledge Foundation; trying to help governments ID metadata that should be included with public domain data; how best to semantically enable data for consumption; moving increasingly to making knowlege structures available in the clood 14:22:45 <bhyland> … Legisltation, e.g., HIPPA, Dodd Frank get amended. The provisions retain the same citation however we don't know if the agency or court in what *timeslice* the agency or court was referring to. 14:22:54 <bhyland> … WK can provide more value to customers. WK can help define what metadata on public domain data to semantic model for consumption, in decision trees, intelligent charts, etc. 14:23:04 <bhyland> … WK is moving to cloud. Looking to make available topical indices, thesauri, and other classification resources. 14:23:30 <gatemezi> Q: which ontologies have they developed in WK? Are they public? 14:23:57 <tinagheen> +1 to question from gatemezi 14:24:12 <bhyland> q? 14:24:22 <olyerickson> unmute me. 14:24:31 <olyerickson> zakim, unmute me 14:24:31 <Zakim> olyerickson should no longer be muted 14:24:45 <bhyland> JohnBarker: "We would like to see gov'ts globally standardize metadata. Make everything comprehensively linked. WK would then rededicate their editors to higher value activities. 14:24:48 <Mike_Pendleton> ...Question - are your ontologies public? 14:25:16 <bhyland> JohnBarker: "If that doesn't happen, we'll have Gov't agencies continue to purchase content from WK because it is more searchable. 14:26:03 <Mike_Pendleton> WK Germany and WK Netherlands makes some available; Christian Dershal (sp?) is the person to talk to. U.S. domain ontologies are not available, need partners for that. 14:26:07 <MartinKaltenb> WKD will publish 2 SKOS thesauri on places of trial and employment law as open data in about 2-4 weeks as far as I know 14:26:16 <MartinKaltenb> Christian Dirschl 14:26:33 <bhyland> Christian Dershal (WK) is point of contact working with Open Knowledge Foundation on publishing ontologies as LOD. 14:27:25 <bhyland> Christian Dirschl 14:27:25 <bhyland> Content Architect, Head of Content Strategy 14:27:28 <bhyland> Wolters Kluwer Germany 14:27:40 <olyerickson> http://solutions.wolterskluwer.com/blog/author/christian-dirschl/ 14:28:13 <Zakim> -DaveReynolds 14:28:15 <Mike_Pendleton> Bernadette question - how to motivate gov't; how do you make your data available to gov't agencies? 14:29:21 <olyerickson> q+ 14:29:57 <bhyland> JohnBarker on why gov't agencies use WK content? 14:29:57 <bhyland> * Gov has discarded their content, they haven't archived. 14:29:57 <bhyland> * Gov agency may request content via email or facsimile. Say an agency lawyer. 14:30:07 <Mike_Pendleton> John: Gov't agencies in some cases have discarded archives, lawyers and regulatory staff sometimes need it; request it from WK 14:30:53 <Mike_Pendleton> ...Gov't agencies sometimes aggregate statutes and make them available on their website (e.g., SEC) ; others don't 14:33:37 <Mike_Pendleton> ....there is no comprehensive aggregation of cases by gov't; for example, courts don't always follow a standard when referencing regulations. WK invests effort to address this. We arrange sources of law by relative authority by topic and explain how they inter-relate. 14:34:41 <Mike_Pendleton> ...If there were links created between content by gov't, that would help; this is why gov't's rely on WK; 14:35:35 <bhyland> … If gov't would LINK them, then WK could just focus on commentary & explanations. 14:35:35 <bhyland> WK has lawyers & CPAs spending their time FINDING content rather than using their expertise to write commentary and explanations. 14:36:39 <Mike_Pendleton> ...for example, the fed might want to know about legal/regulatory innovations in California or another state. Gov'ts want to know about what other gov'ts are doing in how things are regulated. WK aggregates a cross jurisdictional view - 'smart charts' that help answer questions 14:38:01 <Mike_Pendleton> ...the answers to specific questions are organized by jurisdiction, which give instant answers to questions like 'what are the privacy regulations' - they differ by jurisdictions. 14:38:44 <Mike_Pendleton> ...if governments would put more effort in the metadata, many others would save $ and greater value would be derived. 14:38:47 <bhyland> q? 14:41:06 <Mike_Pendleton> John E Question: I work at RPI on LOD gov't work, several things you touched on are of interest - linking together datasets - legal entity IDs for different datasets; and problem of regs that specify regulator requirments; and access to other data silos 14:41:46 <bhyland> zakim, who is speaking? 14:41:57 <Zakim> bhyland, listening for 10 seconds I could not identify any sounds 14:42:16 <olyerickson> olyerickson also mentioned the OrgPedia project (Beth Noveck, Jim Hendler, et.al.) 14:42:29 <bhyland> zakim, mute olyerickson 14:42:29 <Zakim> olyerickson should now be muted 14:42:30 <olyerickson> mute me 14:43:13 <bhyland> JohnBarker: "What is the law on privacy, e.g., health records, information on a customer? They differ in every juridiction. THere is a lot of open government data available, however it has to be classified, linked, annotated and distributed in a reusable manner. 14:43:55 <Mike_Pendleton> John: There are business rules that go with metadata entry; if I am a piece of content labeled on regulation; and there is a certain context with a regulation; a common vocab across all data, jurisdictions and governments and business rules so that links how knowledge of the relative authority of each regulation. 14:44:03 <bhyland> … Everyone benefits if the gov't spends tax dollars to make data available using standard vocabularies and best practices, especially around provenance, URI policy & services, standard vocabularies for tax, legal & regulation. Ideally, a common vocabulary across all juridictions ... Also, business rules that are linked expressing relative authority of a piece of data (e.g., case law, regulation, etc) 14:44:27 <olyerickson> unmute me 14:44:35 <Mike_Pendleton> ...some users are bound by a case, others not, based on relative authority 14:44:38 <olyerickson> unmute me 14:44:47 <olyerickson> never mind 14:44:47 <bhyland> zakim, unmute olyerickson 14:44:47 <Zakim> olyerickson should no longer be muted 14:46:26 <Mike_Pendleton> John E Question: Orgpedia is operating to help aggregate open source corporate data to help gov't to get access to data which is not normally used by their agency but is by others. Hinges on exposing identifiers and linking them. 14:46:58 <bhyland> olyerickson: "First use cases are directed at government, people in agencies needing to use inter-government agency content. Requires exposing identifiers and linking them ... none is commonly available. You have to troll to find out what is available. 14:47:54 <Mike_Pendleton> ...none of this is commonly available; in the open space, it is arduous task. There are a lot of inconsistencies between gov't agencies as how companies are identified; disambiguation is hard, and is what SW was created for. 14:48:37 <olyerickson> mute me 14:49:06 <olyerickson> bhyland: No more questions for this witness ;) 14:50:40 <Mike_Pendleton> John: Standards are important; USC is inconsistent; some titles change the order in which parts appear. WK would like standardization; but challenge is how standardization is maintained and persistence assured. WK used query transformation to address this. Makes the query match the data that exists. WK also adds metadata, markup. 14:51:19 <Mike_Pendleton> ...a third thing is when they are unable to change content in a meaningful way, we have human editors create connections. 14:52:10 <olyerickson> JohnBarker: creating "hybrid content business solutions..." (did I hear that right?) 14:52:37 <Mike_Pendleton> ...WK creates hybrid content software solutions - drive workflow steps in a software application. Requires accurate info - human layer needed; can't rely on content logic yet 14:52:43 <bhyland> … WK addresses content as follows: 14:52:43 <bhyland> 1) Query transformation, synonyms, root extensions, to make the query match the data that exists. 14:52:43 <bhyland> 2) Enrich content with metadata, fix errors, add markup 14:52:43 <bhyland> 3) If we cannot delegate to the search engine, we have human editors manually create a connection. There are examples where administrative rulings don't reference Code. 14:52:50 <bhyland> … Hybrid content software solutions, where content generates business rules. That drives workflow steps in a software application. We have to have highly accurate content to codify it. A human today must interpret the content that in turn a s/w developer has to put into code. We cannot semantically model the content yet. 14:53:13 <bhyland> q? 14:53:19 <bhyland> ack olyerickson 14:53:19 <olyerickson> q- 14:54:35 <olyerickson> Very concerned about the proprietary nature of e.g. their ontologies (the schemae that make it all sensible) 14:55:45 <olyerickson> we need not only linked open govt data, but linked open govt ontologies...if the vendors want $$$ to understand the data, then it's not open 14:57:02 <Mike_Pendleton> Bernadette: Clear need for coordination! Web Masters need clear markup instructions. WK and others have a lot to gain with government doing this. What about WK sharing ontologies and helping to drive this. 14:58:36 <bhyland> q? 14:59:00 <gatemezi> Thanks for the presentation, John! 14:59:11 <olyerickson> Yes, thanks JohnBarker 14:59:35 <mhausenblas> Indeed, thanks a lot for the presentation, John 14:59:51 <MartinKaltenb> many thanks John also from my side! 15:01:24 <bhyland> Agreed: To continue the dialog between the international professional publishing sector & gov't & int'l standards group. 15:01:27 <olyerickson> olyerickson has left #gld 15:01:57 <Zakim> -olyerickson 15:02:23 <MartinKaltenb> thanks - bye 15:02:25 <Zakim> -Sandro 15:02:43 <Zakim> -JohnBarker 15:02:48 <Zakim> -George_Thomas 15:02:50 <Zakim> -DeirdreLee 15:02:52 <Zakim> -gatemezi 15:02:54 <Zakim> -mhausenblas 15:03:01 <Zakim> -tinagheen 15:03:36 <Zakim> -boris 15:03:41 <Zakim> -BrendanIAB 15:03:56 <bhyland> RRSAgent, set logs world-visible 15:04:05 <bhyland> RRSAgent, generate minutes 15:04:05 <RRSAgent> I have made the request to generate http://www.w3.org/2012/05/31-gld-minutes.html bhyland 15:05:00 <bhyland> https://www.w3.org/2009/CommonScribe/panel/?group=gld&go=Use+This+Group 15:06:39 <bhyland> Minutes from today's meeting published here: http://www.w3.org/2011/gld/meeting/2012-05-31 15:14:31 <Zakim> -mikependleton 15:14:33 <Zakim> -bhyland 15:14:33 <Zakim> T&S_GLDWG()10:00AM has ended 15:14:33 <Zakim> Attendees were mikependleton, +1.847.668.aaaa, bhyland, boris, mhausenblas, cygri, olyerickson, George_Thomas, gatemezi, +3539149aabb, DaveReynolds, BrendanIAB, JohnBarker, 15:14:33 <Zakim> ... DeirdreLee, tinagheen, Sandro 15:21:21 <DaveReynolds> DaveReynolds has left #GLD 17:06:05 <Zakim> Zakim has left #gld 17:08:05 <bhyland> bhyland has joined #gld 17:31:14 <danbri> danbri has joined #gld # SPECIAL MARKER FOR CHATSYNC. DO NOT EDIT THIS LINE OR BELOW. SRCLINESUSED=00000204