07:59:06 RRSAgent has joined #odw 07:59:06 logging to http://www.w3.org/2013/04/24-odw-irc 07:59:20 meeting: ODW Day 2 07:59:24 PhilA has changed the topic to: ODW13 Day 2 07:59:36 Has anyone seen hadley 08:00:02 floppy has joined #odw 08:00:52 lottebelice has joined #odw 08:00:55 SimpsonTP_ has joined #odw 08:01:30 scribenick: BartvanLeeuwen_ 08:01:33 StevenPemberton has joined #odw 08:02:03 Topic: Perspectives & Experience 08:02:36 scribenick: BartvanLeeuwen 08:03:48 floppy has joined #odw 08:04:02 markbirbeck has joined #odw 08:04:19 Topic: Perspectives & Experience 08:04:45 rtroncy has joined #odw 08:05:02 DNA Digest by Fionna Nielsen 08:05:38 s/Fionna/Fiona/ 08:06:03 DNADigest non for profit org. with purpose of opening up genome sequences 08:06:44 albertm has joined #odw 08:06:53 DNA Sequencing, purpose : cancer research/ heritable traits and illness and rare diseases 08:07:19 rrsagent, make minutes 08:07:19 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html StevenPemberton 08:07:20 yoshiaki has joined #odw 08:08:00 rrsagent, make log public 08:08:02 rrsagent, make minutes 08:08:02 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html StevenPemberton 08:08:11 gaztop79 has joined #odw 08:08:41 Agenda: http://www.w3.org/2013/04/odw/agenda#day2 08:09:23 Chair: Phil Archer 08:09:44 Apologies to Fiona who is saying seriously interesting stuff but fighting technology problems. Facilities are on the case but so far without success 08:09:49 rrsagent, make minutes 08:09:49 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html StevenPemberton 08:10:29 scribenick: StevenPemberton 08:10:31 AndreaP has joined #odw 08:10:37 cjg has joined #odw 08:10:51 Fiona: researcher needs databases of genomic variation 08:11:02 cgueret has joined #odw 08:11:06 ... more data is needed to validate results 08:11:21 AndyS has joined #odw 08:11:24 ... they want to learn from other data 08:11:29 ... but are not sharing their own 08:11:40 ... why not? 08:11:47 ... - confidential data 08:12:07 ... - easy to identify individuals from genome data 08:12:16 BartvanLeeuwen has joined #odw 08:12:17 ... - BULKY 08:12:38 ... so data has to be de-identified and aggregated 08:12:46 scribenick: BartvanLeeuwen 08:12:50 no open sharing of raw data 08:13:22 only defects for described diseases is shared 08:13:45 Solution of DNA Digest, not all research needs access to full data 08:14:10 rrsagent, make minutes 08:14:10 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html StevenPemberton 08:14:28 e.g. Does this mutation occur at higher frequency 08:14:48 stressindikator has joined #odw 08:15:01 this result is already de-identified 08:15:02 albertm has joined #odw 08:16:18 challenges: connecting existing datasets, incentives for sharing and commercial model 08:16:34 http://dnadigest.org 08:17:23 yoshiaki has joined #odw 08:17:27 Next speaker: Florian Bauer http://www.reeep.org 08:17:58 reeep helps with renewable energy in developing countries 08:18:53 hideaki has joined #odw 08:18:57 website after website is launched, "portal proliferation syndrome", that needs to be fixed 08:19:31 in current projects, data silos are created 08:22:09 scribenick: StevenPemberton 08:22:09 floppy1 has joined #odw 08:22:36 Florian: the ystem has to deal with disambiguation 08:22:41 s/ystem/system/ 08:22:45 its seems very hard to stop this 08:22:46 but its because project deliverables state information dissemination , not how 08:22:46 We need to break silos and link information to help them do it differently 08:22:47 Can we support the linking of information with and automated system 08:22:47 we have to ! 08:22:52 ... and use linked open data 08:23:16 ... we supply thesauri 08:23:30 ... our system helps tag unstructured documents 08:23:53 ... a file is sent, in whatever format, we analyse it, extract common terms 08:24:05 ... based on thesaurus and vocab 08:24:16 ... and deliver lots of metadata 08:24:25 ... with definitions etc. 08:24:44 ... and we can store the location of the document with tags (if the user wants) 08:25:05 ... [gives an example] 08:25:42 ... please look at http://api.reegle.info 08:25:57 ... we are a non-profit 08:26:02 ... so no charge 08:26:24 ... we consider this a first step 08:27:13 Next speaker - schema.org, Dan Brickley 08:27:21 next speaker: Dan Brickely Schema.org 08:27:28 JeniT has joined #odw 08:27:37 scribenick: BartvanLeeuwen 08:28:10 rrsagent, make minutes 08:28:10 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html StevenPemberton 08:28:13 introducing schema.org 08:28:38 search engines decided to collaborate, even though they are competitors 08:29:39 putting triples on the web was only done by a limited number of people 08:30:00 schema.org is not SEO, its semantic technology gone mainstream 08:30:44 schema.org is html with markup, to generate triples 08:31:12 creates rich snippets for enhanced search results 08:31:31 allows custom search engine 08:32:22 working on integration and discovery of datasets 08:32:45 we are driven by being able to answer questions, thats why datasets are interesting 08:32:54 johnlsheridan has joined #odw 08:33:05 [ shows examples ] 08:34:02 Florian has joined #odw 08:34:39 vocabulary is not designed by the schema.org people, its community driven 08:35:06 the team only integrates community efforts 08:35:40 this community is driven by w3c community group 08:36:05 albertm has joined #odw 08:36:08 dataset are added based on work by a.o. W3C GLD 08:37:13 where do graphs stop and tables start ? 08:37:49 schema.org is about finding dataset not making all data rdf 08:38:19 If I get a chance I'll talk about what we're doing at W3C on vocabs - the schema.org management model is instructive 08:38:25 Next speaker: Licensing Library and Authority Data Under CC0: The DNB Experience, Lars Svensson 08:38:28 rrsagent, make logs public 08:38:42 rrsagent, draft minutes 08:38:42 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html PhilA 08:38:53 jpcs1 has joined #odw 08:39:53 cerealtom has joined #odw 08:40:09 today I talk about the change business model 08:40:55 albertm_ has joined #odw 08:41:02 cgueret has joined #odw 08:41:35 mixed business model right now: which dataset / which format / access type 08:42:22 floppy has joined #odw 08:42:31 3 methods for harvesting meta data: SRU, OAI and HTTP 08:43:02 SRU and OAI are free services after registration to keep customers up to date 08:43:25 HTTP is free, no registration or tracking 08:43:42 formats, RDF, CSV and MARC 08:43:49 MARC is a custom library format 08:44:16 http://en.wikipedia.org/wiki/MARC_standards 08:44:52 by end of this Quarter also as PDF ( nice PDF ) 08:45:17 Authority data is available under CC0 and metadata as well 08:45:56 metadata about books is mostly CC0, RDF / CSV always CC0 08:46:32 MARC data of last 2 years is available under a fee older under CC0 08:46:48 process of transition 08:47:05 discussion about lost of revenue and spillover 08:47:50 scared of people running away with our data and earning lots of money with it. 08:48:04 no know examples of this actually happening 08:48:32 lessons learned: Finding right licenses can be tricky 08:48:49 CC-by license was to complicated 08:49:22 make it easy for people to use your data ! 08:49:29 we ended up with CC0 08:49:58 Next speaker: Open Government Data Projects in Japan, Shuichi Tashiro, IPA 08:50:06 rjw has joined #odw 08:50:24 IPA is the government IT standards body. 08:51:23 brief intro of opendata in Japan 08:52:09 Oktober 2008 CIO Forum started discussion about opendata in japan 08:52:20 s/Okt/Oct/ 08:52:24 in 2001 there was already a e-gov portal site 08:52:40 johnlsheridan has joined #odw 08:53:06 multiple administrative procedures available in one portal, but with non standard UI 08:53:30 public private collaboration with a realtime datalink 08:53:48 between train company and seismic center 08:54:07 in 2011 all trains stopped 10 seconds before earthquake wave reached main land 08:54:52 October 2010 a Open gov Lab was started with various items 08:55:07 before 2012 open gov strategy 08:55:28 lot of data put out had low quality 08:55:40 Earthquake of 2011 was a turning point 08:55:54 demand for data: 08:55:57 -- who is where 08:56:01 floppy1 has joined #odw 08:56:06 -- who need what / who can provide what 08:56:20 -- availabily of general infrastructure 08:56:27 -- polution level 08:56:35 Multi tier collaboration is needed 08:56:49 s/availabily/availability 08:57:07 s/polution/pollution 08:57:17 [ shows examples ] 08:57:59 albertm_ has joined #odw 08:58:00 albertm` has joined #odw 08:58:29 Recovery and reconstruction support program database, one stop shop for support programs 08:59:11 this resulted in a vocabulary problem, all local govs used there own terminology 08:59:21 JeniT_ has joined #odw 09:00:53 Japan has problem with its character code 09:01:02 very local, and very specific to identity 09:02:25 [ shows architectural diagram ] 09:02:51 o.a. contains a vocabulary database to solve problems with local terminology 09:02:55 JJ has joined #odw 09:04:02 also a open license character databases, with RDF support to find relationship between characters and legislation about character sets ? 09:04:33 Panel: Previous speakers 09:05:29 daveL has joined #odw 09:06:18 markbirbeck1 has joined #odw 09:06:48 hadley: calls out to panel: if you had 10billion budget where would you be in 10years 09:07:30 Danbri: we should finaly start integrating 09:07:32 fumi has joined #odw 09:07:37 Lars: Digetizing 09:08:06 solve rights issues, who own rights on publications is somewhat hard 09:08:11 albertm` has joined #odw 09:08:49 floppy has joined #odw 09:09:01 Florian: governments need give us the data we need to answer the important questions. Smartgrids are important 09:09:05 in relation to IPA presentation on linked data in Japan, there is a new W3C community that aims to develop best practice in multilingual linked open data: http://www.w3.org/community/bpmlod/ 09:09:47 Fiona: Money is not a issue, not technology , standardization is the current issue 09:10:06 Questions from room 09:10:33 ivan, remark: we should generalize a bit not just government but also science data 09:11:02 they produce data we should be able to access that as well. 09:12:09 Dave Lewis: remark, focus on LOD in countries where main language is english, there is multilingual Comm group [ please insert uri ] 09:13:03 http://www.w3.org/community/bpmlod/ 09:13:50 Hans Overbeek: questions about standards, what is the best advice? should we use schema.org or DCAT profile for integrating datasets 09:13:59 DCAT Application profile work through EC's ISA Programme. Details at http://joinup.ec.europa.eu/asset/dcat_application_profile/description 09:14:27 danbri: Schema .org is a agile approach, which could spin of into a standardization body 09:14:46 rrsagent, make minutes 09:14:46 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html ldodds 09:14:52 scribenick: ldodds 09:15:05 Topic: Product Data 09:15:28 Chair: PhilArcher 09:16:59 HadleyBeeman has joined #odw 09:17:12 PhilArcher: I think product data is going to be very important 09:17:32 first speaker is John Walker, paper: http://www.w3.org/2013/04/odw/odw13_submission_40.pdf 09:17:58 John is talking about Open Data in the electronics industry 09:18:25 rrsagent, pointer? 09:18:25 See http://www.w3.org/2013/04/24-odw-irc#T09-18-25 09:18:27 John: @NXPData is our twitter, please share what we're doing 09:18:57 ... NXP is semiconductor company in netherlands, lots of large customers and large portfolio of products 09:19:14 ... Content is the product, Product data is part of the content => data is the product 09:19:29 ... Why make data Open? 09:19:53 ... In bad old days, very doc centred information, lots of content silos, content reuse was copy and paste or (worse) re-keying 09:20:07 ... Consequences: inconsistency, cost, errors, complex to manage 09:20:35 ... Consequences (for customers): confusing, inconsistent, manual effort to gather/re-use information, difficult to find new products 09:20:50 ... Vision: unified content strategy. Create once, approve once, re-use many times 09:21:04 ... ISO 13584 -- data model for describing products 09:21:14 ... DITA -- for natural language content 09:21:36 ... variety of outputs, including flyers, data sheets, online, etc 09:21:47 ... data sheets are mixture of text and data 09:22:07 ... parametric search interface for finding products 09:22:34 c_riffle has joined #odw 09:22:35 ... dictionary of properties of products 09:23:33 ... NXP want to be canonical source of information which is re-published by others, e.g. distributors, aggregators, etc 09:23:43 ... want Web of Data to help collaboration 09:23:58 ... http://data.nxp.com and http://qa.data.nxp.com 09:24:19 ... that is a work in progress 09:24:38 ... give us feedback on what is there 09:24:46 ... Open Data challenges 09:24:55 ... how do we convince others to use (linked) open data? 09:25:10 ... how do we justify business case (ROI) -- argue that its simpler 09:25:26 ... what formats should we use? (rate of adoption of RDF/Linked Data) 09:25:39 ... How do we ensure quality, security, enable access 09:25:51 ... How do we combine semi- and unstructed content in publications? 09:25:56 ... Are we giving away a key asset? 09:26:07 ... How do we standardise in industry? 09:27:08 Next speakers are Andy Hedges & Richard McKeating, Tesco 09:27:16 s/unstructed/unstructured 09:28:06 Andy: big numbers about tesco, but what is interesting is how our data connects us 09:28:19 bschloss has joined #odw 09:28:23 ... some of that data is ours, some belongs to others: customers, suppliers, manufacturers 09:28:36 ... may not be able to share all of it: rights, commercial sensitivity 09:28:46 JeniT has joined #odw 09:28:57 ... need to understand where information comes from, and how it can be best combined for customer benefit 09:29:09 ... offer customers best service/prices and compare data 09:29:36 ... perhaps not intuitive to allow cross-supermarket price comparison, but want to be a good brand have good contract with community 09:29:50 bhyland has joined #odw 09:30:06 Richard: I'm passionate about how to use open data, microformats, etc to help customers 09:30:34 ... Tesco Open Data is about our customers 09:30:44 ... Places where you can buy things, incl. online 09:30:52 ... Products that we offer (across range of brands) 09:31:03 ... Orders: what's in your basket, or buy on the web 09:31:17 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html JeniT 09:31:21 ... Journeys: we make to fulfill orders, e.g. deliveries, logistics 09:31:36 ... Rewards: what do you get as a customer, club card pointers, offers, price promise 09:32:07 ... at Tesco we are on journey towards exposing this data, to allow app providers to access it 09:32:16 ... would be great to link to open data sources 09:32:26 ... really key area is how we share data with trading partners 09:32:41 ... important for brands to share product information, esp. accurate data 09:33:01 ... customer access to data, e.g. purchase history, allergen information, etc 09:33:11 ... Tesco sell NXP products :) 09:33:16 st has joined #odw 09:33:42 ... but they have little information on it (opportunity for sharing) 09:34:06 Andy: customers don't just shop at tesco 09:34:24 albertm has joined #odw 09:34:24 ... standards make our life easier 09:34:49 ... expose price promise, cross retailer product comparison, delivery choices 09:35:08 ... can compare products using GTIN (same EAN) 09:35:24 ... but suppliers might have different sizes, etc. Some products can vary 09:35:45 ... Keen to start dialog with standards makers 09:36:12 Next speaker is Mark Harrison, GS1 09:36:16 Slides: http://www.w3.org/2013/04/odw/GS1-LinkedDataPresentation-ODI-April2013.pdf 09:36:23 GS1 is a standards body, assigns GTINs 09:36:53 Mark: GS1 is global stds org; 1m companies working on standards, e.g. for barcodes, logistics, rfid, etc 09:37:11 ... new initiative started in Feb, GS1 digital: putting identification into the web 09:37:49 ... B2C example: map human-readable keywords ("milk") to Product category identifier (GPC), search has user constraints (price, distance, urgency) 09:38:08 ... contextual filters for product category, e.g. organic, skimmed milk 09:38:37 ... refine search to find products and services that match needs, e.g local store offering the product 09:38:53 ... Achieve this using data linkages 09:39:17 ... start with keyword which is mapped to category; category has attributes/criteria 09:39:54 ... imagine using contextual searching across all suppliers across the web, not just facet search within single website 09:40:04 ... Schema.org and Good Relations are key vocabularies 09:40:26 ... + GS1 : Global Product Classification (GPC) 09:40:45 ... easy for them to open that, already available as XML, in process of creating multi-lingual RDF 09:40:57 naomi has joined #odw 09:41:18 ... GSN: Global Data Synchronisation Network has additional data about manufacturers, etc 09:41:29 ... GPC as Linked Open Data 09:42:05 Slide shows simple example graph of GTIN and facets 09:42:19 Mark: please collaborate with us: email gs1digital@gs1.org 09:42:28 BartvanLeeuwen has joined #odw 09:42:38 ... I'm working on project as researcher, others handling industry engagement 09:43:12 Next speaker is Phillipe Plagnol 09:43:16 Product Open Data 09:43:18 Slides: http://www.w3.org/2013/04/odw/POD-2013.04.22_01.pdf 09:43:46 Philippe: Product data is critical for open data movement 09:44:10 ... everything around us is a product, has one GTIN code which is unique identifier 09:44:27 ... products used by everybody, every day 09:44:39 ... lots of contextual information, e.g. product packaging, nutrietion 09:44:58 ... products are fundamental for trade, economics 09:45:15 ... objective is to create big repository of data about products, based on barcode 09:45:31 ... include ecological impacts, sources, support responsible consumers 09:46:02 ... lots of apps to support product barcode scanning, then can use that to access data 09:46:12 ... BUT: currently no public database containing this database 09:46:54 ... manufacturers have all this information in databases to support printing, but don't share it 09:47:04 ... largely question of access 09:47:30 ... give us what is already printed on the packaging, using GTIN as a key 09:48:11 ... need to have a product schema for manufacturers to support their publishing 09:48:39 ... asking manufacturers for nutritional data 09:48:53 ... working in France, hoping to get traction in other countries 09:49:19 ... incredible possibilities of using this product data 09:49:48 ... GTIN is a new communication channel. More easily support product annotation, product mapping, etc 09:50:23 ... apps can support product reviews, consumer recommendations/decision support 09:50:39 ... only thing stopping us is an open catalog of the data 09:50:56 ... imagine a "google maps for products" 09:52:01 Example of consumer buying decisions based on using a third-party app that uses ecological data 09:52:31 Now moving to discussion with all speakers 09:52:59 PhilArcher: (to Andy) does Philippe's talk scare/please you? 09:53:16 yoshiaki_ has joined #odw 09:53:18 Andy: pleases us, we want to see this data opened too, helps us be better retailer 09:53:32 ... an awesome challenge 09:53:59 Richard: legislation is important too, "contains nuts" is a life/death decision 09:54:32 John: v. interesting. In semi-conductor industry, people will take the data if its available 09:54:39 rjw has joined #odw 09:54:50 ... want to make sure products are accurately described, whether its strawberries or microchips 09:55:00 ... help consumers find what they need 09:55:18 PhilArcher: Mark, what are members saying? 09:55:42 Mark: enthusiasm from many members, great opportunity for all of us to be more responsible consumers 09:55:50 stressindikator has joined #odw 09:55:55 ... how do we spend out money, what choices do we make? 09:56:37 Jim King (Adobe): isn't there a large product db of RFID data? 09:56:58 Mark: not quite the same thing, that is likely EPC data, movement of stock through supply chain 09:57:04 ... we're discussing the product master data 09:57:19 ... opening EPC data is more commercially sensitive 09:57:41 TomHeath: this is great session, what are the concrete steps towards openly licensed data? 09:58:06 Mark: different kinds of product data; product categories/values can be openly licensed, largely format shift 09:58:22 ...data from manufacturers, requires discussions with them, v. early stage 09:58:34 ... they need confidence in benefits for themselves and others and licensing discussions 09:58:40 ... lots of good will, enthusiasm 09:58:59 Richard: Tesco are translating desire to action by working with GS1 and brands 09:59:11 ... make it easy and not disadvantage suppliers 09:59:48 albertm` has joined #odw 09:59:58 ZachBeauvais: heard lots of positive noises, but what are the business cases on building on available open data? Any concrete examples? 10:00:15 Richard: one bus. case is legislative changes and ensuring that products are accurately described 10:00:28 ... opportunities within our enterprise, we have silos 10:00:40 ... really only just starting to understand wider bus. case 10:00:56 Martin ? (IBM): are competitors trying to catch up? 10:01:18 Richard: plenty of retailers working in this space, but not co-ordination yet 10:01:47 John: companies are already scraping and reselling data outside of our control 10:01:58 ... if we can clearly license it, then can make it easier 10:02:06 Closing statements 10:02:39 Philippe: originally saw GS1 as "enemy", but can now see they're embracing open data 10:02:49 gaztop79 has joined #odw 10:02:50 LarsG has joined #odw 10:02:54 JeniT has joined #odw 10:03:04 ... GTIN code is often hidden by lots of ecommerce web sites, needs to be published and clearly available 10:03:27 ... come and download dump our own data to see how it is constructed 10:03:38 ... I'm following GS1 stds work, give me feedback 10:03:57 Mark: huge opportunity to make a difference, make world better place 10:04:15 John: in B2B industry, focus needs to be on streamlining business integration 10:04:23 Richard: keep focus on benefits for customer 10:04:37 Andy: echo that, need to look at customer needs. 10:04:51 That's the end of the session! 10:05:17 rrsagent, make minutes 10:05:17 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html ldodds 10:06:31 PhilA has joined #odw 10:06:42 rrsagent, draft minutes 10:06:42 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html PhilA 10:08:45 markbirbeck has joined #odw 10:14:30 cjg_ has joined #odw 10:14:34 AndyS1 has joined #odw 10:14:58 ivan_ has joined #odw 10:15:03 naomi_ has joined #odw 10:19:03 floppy has joined #odw 10:19:53 markbirbeck has joined #odw 10:23:07 Topic: Dumb Strings That Mean So Much 10:23:14 Chair: Hideaki Takeda 10:23:26 scribenick: naomi 10:25:47 ldodds has joined #odw 10:25:55 jpcs1 has joined #odw 10:27:27 stressindikator has joined #odw 10:27:53 BartvanLeeuwen has joined #odw 10:28:14 cgueret has joined #odw 10:28:16 LarsG has joined #odw 10:28:17 ldodds has joined #odw 10:28:17 Next speaker: Ministry of the Interior of the Netherlands, Geonovum, Hans Overbeek 10:28:38 JeniT has joined #odw 10:28:39 Hans: Concept URI Strategy for the NL Public Sector 10:28:58 rtroncy has joined #odw 10:29:00 rjw has joined #odw 10:29:09 ... Thijs gives you geographic informaion 10:29:10 albertm has joined #odw 10:29:20 s/infomaion/information/ 10:29:48 bhyland has joined #odw 10:30:09 yoshiaki has joined #odw 10:30:31 StevenPemberton has joined #odw 10:30:43 rrsagent, here? 10:30:43 See http://www.w3.org/2013/04/24-odw-irc#T10-30-43 10:31:07 PhilA has joined #odw 10:31:19 scribe: PhilA 10:31:26 scribeNick: PhilA 10:31:29 HadleyBeeman has joined #odw 10:31:46 topic: Draft URI Strategy for the NL Public Sector, Hans Overbeek 10:31:57 paper http://www.w3.org/2013/04/odw/odw13_submission_14.pdf 10:32:29 Hans: Talking about Designing URI Sets for Public Sector, ISA programme study on URI Persistence etc 10:32:48 Hans: Why do we need a URI strategy - it's about trust, provenance 10:32:57 ... hard to do in the LOD Cloud 10:33:18 Hans: We want our URIs to be recognisable and trustworthy 10:34:17 Hans: We have kept registers - buildings, railways etc. for hundreds of years, mostly to define identifiers 10:34:23 ... joint points for linked data 10:35:13 Hans: we develop a model and a vocabulary for it 10:35:22 ... a register is a list of things that you want to reference 10:35:58 Hans: then there's all the sensor data etc. What we think of as the big data 10:36:38 ... re-use of things like reference objects is what we want to re-use when we write our URI strategy 10:37:02 ... we struggled a little as we have to mint URIs, but we have a lot of identifiers that we can re-use 10:37:08 ... but we don't have a register for everything 10:37:20 ... there was no register for all our municialites 10:37:33 ... so we had to mint URIs for them... which means making a register 10:37:41 ... you can only have URIs if you have a register 10:37:53 ... No iregister? No identifier 10:38:02 ... so we were convinced that we needed a URI strategy 10:38:32 ... the pattern that we used was, not surprisingly the one developed in the UK/backed by ISA 10:39:23 Hans: The domain should identify the regsiter in a persistent way, {register}.data.gov.nl 10:39:30 gaztop79 has joined #odw 10:40:24 Hans: The UK pattern has a {sector} in the pattern which sounds nice but its' hard to find someone to govern the sectors. Some will overlap etc. 10:40:38 ... so we thought we might not deed {sector} and left it out 10:40:51 s/deed/need/ 10:41:06 s/its'/it's/ 10:41:07 ... with no strategy, you can use any URI, but it's less recognisable and less trustworthy 10:41:07 sam has joined #odw 10:41:20 hans: That means we end up having to have a register of registers 10:41:38 s/municialites/municipalities 10:41:40 Hans: What infratsructure is needed? 10:41:55 s/iregister/register 10:42:04 Hans: which apps use the resvolvers and how frequently 10:42:13 Hans: There's more in the presentation and the paper 10:42:19 s/regsiter/register 10:42:27 ... are we heading the right way? 10:42:48 s/infratsructure/infrastructure 10:42:57 Topic: Shared understanding = shared foreign keys (and more), Richard Light 10:43:03 s/resvolvers/resolvers 10:43:05 paper http://www.w3.org/2013/04/odw/odw13_submission_17.pdf 10:43:15 slides http://www.w3.org/2013/04/odw/odw13_slides_17.pdf 10:44:03 rrsagent, make minutes 10:44:03 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html StevenPemberton 10:44:11 Richard: Want to talk about a pragmatic approach to getting URIs into the cultural heritage sector 10:45:00 floppy has joined #odw 10:45:00 i/Chair: Hideaki Takeda/scribenick: naomi_ 10:45:03 rrsagent, make minutes 10:45:03 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html StevenPemberton 10:45:14 Richard: gives brief history of museum identifiers 10:45:41 ... work was done on vocabularies, controlled vocabularies 10:45:55 scribenick: PhilA 10:46:56 is this the first RDF/XML of the workshop? 10:47:17 Richard: shows some examples of collections described in RDF 10:49:23 sam has joined #odw 10:49:49 Richard: There are good discussions in progress across the sectors... 10:49:49 ... but although there is more RDF coming out, when you look in detail, a lot of values are given as strings 10:50:21 Richard: Modes is software used in most UK museums, Not free but you become a share holder 10:50:51 Richard: Modes includes standard term lists etc., that become standards across users 10:51:02 ... now startung to use Web to get the terms 10:52:00 Richard: Modes includes a live search of geonames as source of URLs for geographic places. Conversion happens in the software 10:52:26 Richard: Can we use SPARQL endpoints as a a term list? Yes... 10:52:44 bschloss has joined #odw 10:53:17 Richard: Curators won't do any LD publishing themselves. All done in Modes 10:53:47 AndreaP has joined #odw 10:53:59 Richard: uses XSLT to transform data from original XML data. handles the conneg etc 10:54:19 s/startung/starting 10:54:37 trc has joined #odw 10:54:52 Richard: Shows work that gave a URI to every word Shakespeare ever wrote 10:55:20 s/scribenick: naomi/scribenick: naomi_ 10:55:25 rrsagent, make minutes 10:55:25 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html StevenPemberton 10:55:26 Richard: Adlib and CALM so looking at generating/using linked data 10:55:56 Richard: gives example of dog food eacting 10:56:04 s/eacting/eating/ 10:56:20 Topic: Aggregating media fragments into collaborative mashups: standards and a prototype, Philippe Duchesne 10:56:30 paper http://www.w3.org/2013/04/odw/odw13_submission_28.pdf 10:56:53 pd: on Media Fragments 10:57:17 pd: lists issues faced 10:58:31 scribe note - slides are expressive/detailed 10:59:30 pd: points to earlier work that is all media specific 11:00:02 pd: No harmonised definition of the fragments 11:00:17 pd: Wanted to decouple fragment from media 11:00:45 pd: geospatial and tree paths not part of any previous work AFAIK 11:02:07 pd: project done mostly using HTML/JSON developers 11:02:18 ... but we have a SPARQL endpoint as well 11:02:35 JSON on the screen a few minutes after RDF/XML... 11:04:18 Sorry folks, no time for questions on this session 11:05:32 pd: gives a quick demo 11:07:16 floppy has joined #odw 11:07:28 topic: Digital Archiving 3.0, Christophe Guret 11:07:36 paper http://www.w3.org/2013/04/odw/odw13_submission_34.pdf 11:07:44 slides http://www.slideshare.net/cgueret/digital-archiving-30-odw 11:08:23 scribe note - slides are expressive 11:10:02 cgueret: we need to treat the data and metadata differently 11:10:09 ... we find LD the best format for thias 11:10:16 s/thias/this/ 11:10:40 cgueret: Many formats for data itself 11:10:46 timbl has joined #odw 11:11:49 cgueret: rather than force people to transform their data, they should just get the data in the repository - it's up to the latter to sort out formats 11:12:07 cgueret: Forget about URIs as data 11:12:11 PhilA: Grrrr 11:12:48 cgueret: we have new formats every 5 years. Use conneg to handle format evolution 11:14:01 -> a related pointer: how to cite data for scholarly purposes, the Amsterdam Manifesto, http://www.force11.org/AmsterdamManifesto 11:15:21 topic: Discovery panel 11:15:29 scribenick: CaptSolo 11:15:40 chair: danbri 11:16:00 danbri: we will be discovering discovery, as everything brings back to this topic 11:16:12 next speaker: Richard Wallis, OCLC 11:16:32 cgueret has joined #odw 11:16:34 rjw: working for OCLC 11:16:55 ... WorldCat (stats about number of lbiraries, books) 11:17:10 ... integrating linked data, schema.org 11:17:38 ... the other hat: chaired ... [could you add detail here, missed it?] 11:18:13 ... need to publicize links to resources 11:18:31 ... generic vocabs = generic "glue" that helps link resources 11:18:58 Richard chairs the Schema Bib Extend Community Group http://www.w3.org/community/schemabibex/ 11:19:11 ... you have to demonstrate the benefits -- use the data to drive the services 11:19:30 next speaker: Chris Metcalf, Socrata 11:19:36 [http://www.w3.org/2013/04/odw/agenda#al59 abstract, paper] 11:19:50 Chris: ~60 customers who want to use open data 11:20:10 ... how can we use schema.org, etc. to help solve the discovery problem 11:20:23 ... catalogs that need to speak to each other (cities.data.gov, ...) 11:20:39 ... how can we encourage people and industry 11:20:54 note: if i miss things scribing, please add them :) 11:21:47 (technical pause) 11:22:01 next speaker: Steven Pemberton, CWI 11:22:01 StevenPemberton 11:22:02 [http://www.w3.org/2013/04/odw/agenda#al9 abstract, paper] 11:22:13 Steve: don't have slides, can speak now :) 11:22:24 Steve: involved with W3C from day 1 11:22:36 ... name on number of standards, incl. RDFa 11:22:51 ... point of research: make computers easier for people to use 11:22:59 ... small data is important 11:23:07 ... e.g., website for this conference 11:23:29 ... you look for data on aiports, lodging, agenda, ... (and you enter same info again and again) 11:23:53 ... if the info were in RDFa you could automatically add this info to your calendar, find best flights, ... 11:24:08 ... if your browser helped you here, people's lives 'd better 11:24:23 ... and browsers would would by providing services that help use this data 11:24:26 ... win-win for all 11:24:30 ... use RDFa 11:25:02 next speaker: Pascal Romain and Elie Sloïm, Conseil général de la GIronde/Temesis 11:25:12 ... Pascal from local council 11:25:20 ... Elie from a company, W3C member 11:25:26 [http://www.w3.org/2013/04/odw/agenda#al38 abstract] 11:26:01 Elie: checklist of 72 good practices (fr, en) 11:26:20 ... every good practice has to be available online, international, usable, realistic 11:26:35 ... OPQuast - Open quality standards 11:26:47 ... if you are open data producer, go and check the guidelines 11:27:05 s/Steve: don't have slides/Steven: don't have slides/ 11:27:12 ;) 11:27:34 Pascal: open data checklist : a tool for LOD? 11:27:37 s/Steve: involved with W3C from day 1/Steven: involved with W3C from day 1/ 11:27:50 markbirbeck: thanks 11:28:07 next: Madi Solomon, Pearson 11:28:13 [http://www.w3.org/2013/04/odw/agenda#al48 abstract, paper, slides] 11:28:26 Madi: need to find new ways of doing business 11:28:42 ... provided a solution for publishing open linked data (?) 11:28:54 ... but never used the words "open" or "linked" 11:29:07 ... termed it resource enrichment (?) 11:29:11 jpcs1 has joined #odw 11:29:27 ... textbook can be broken down into a large number of assets 11:29:36 ... each requires its metadata, ... 11:29:51 ... concept extraction, keywords, faceted exploration 11:30:15 ... built a rule to match with wikipedia, automated metadata generation as quickly as possible 11:30:48 ... astrophysics textbook -- had to do filtering, though, to keep out sci-fi topics :) 11:31:02 ... found out we can make taxonomies on-the-fly 11:31:26 ... creates a baseline, curated taxonomy 11:31:41 ... virtuos circle -- put it back into community, maintain, update, ... 11:31:53 Dan: question time 11:32:03 ... back to W3C aspects 11:32:27 ... (question to the whole panel) 11:32:37 ... if not making standars, what should be doing instead 11:32:40 ... ? 11:32:49 Steven: small data is important. 11:33:15 ... see benets from including data on websites 11:33:53 Pascal?: standardisation is good efford 11:33:55 DeirdreLee has joined #odw 11:34:11 rjw: not standards work, it is nurturing communities that have emerged 11:34:20 ... sharing experiences, ... 11:34:50 Madi: as new coach of digital Publishing IG - questions is what can i do for you? 11:35:01 hideaki has joined #odw 11:35:10 Chris: i love simple tools that do powerful things 11:35:34 ... w3c working on these things, but lot of people don't know re them 11:35:39 ... need to reach out, inform 11:35:59 albertm has joined #odw 11:36:00 Elie: need to produce standards + guidelines for implementing those standards 11:36:20 ... have nice specs but not always simple to implement 11:36:25 questions from the audience 11:37:20 Hadley: back to geocities era - we made lists 11:37:32 ... feel we are doing that now = making data catalogs, lists of resources 11:37:56 ... what we need to do to make it worthwhile to index the metadata by search engines 11:38:03 st has joined #odw 11:38:07 ... so average person can participate 11:38:10 (applause) 11:38:24 Chris: that's schema.org RDFa stuff 11:38:34 ... build it into catalogs, so data gets crawled 11:38:52 ... not just list datasets, also make metadata schemas more practical for people 11:39:04 ... want to type in my ZIP-code and find what's relevant to me 11:39:22 rjw: we (non-search-engines) we need to talk their (search engine) language 11:39:45 ... need to put up resources in front of people 11:39:59 Steven: i marked my homepage w RDFa 11:40:06 BartvanLeeuwen has joined #odw 11:40:13 ... involved me ending up on Google Maps w the location where I live 11:40:21 danbri: getting more vistors? :) 11:40:39 Bob Schloss (IBM): by analogy to SEO - let's implement 11:41:00 ... fabulous implementation -- go to cloud's website, enter a URL from where the dataset can be fetched 11:41:15 ... they 'd "suck in" the dataset and give it buck w enriched metadata 11:41:22 s/buck/back/ 11:41:37 ... let's externalize it. if nobody starts, we won't have it 11:41:53 Chris: we are doing cataloging well 11:42:01 ... re dscovery by search engines 11:42:13 ... data by itself not so useful to everyday people 11:42:22 ... people need to *use* those datasets 11:42:33 ... we need to allow the data to be more useful 11:43:11 Bernadette: as someone who has spent years on vocabs for linked data 11:43:21 ... need communication, mentoring 11:43:31 ... info that simply and quickly explains what it is about 11:43:46 ... to those who make decisions 11:44:02 ... people in this room should write books, make videos, organize seminars with the stakeholders 11:44:20 ... there's so many standards, implementations. can be overwhelming 11:45:02 (miss this one re google glass, RDFa, ...) 11:45:20 Martin, CTIC: Spanish experience re economy, corruption 11:45:39 ... Spanish government launched a technical standard for all public bodies 11:45:54 ... all have to use these guidelines when exposing open data 11:46:03 ... have to use linked data 11:46:20 ... URI scheme for catalogs, datasets, ... 11:46:50 bhyland has joined #odw 11:47:01 now the last round of comments from the panel 11:47:08 danbri: 10words of 30 syllables now 11:47:24 Steven: if you got information, it should be on the web + machine readable 11:47:50 ?: as data producers think of objects and entities -- instead of datasets 11:48:21 rjw: you're only people in the domain who know the benefits -- demonstrate them to everyone ! 11:48:38 Madi: middle place b/w where data is release and ppl access it. data-driven businesses 11:48:55 Chris: we as LOD advocates need to reach out to ppl outside the LOD community 11:49:06 ... many as tired of SemWeb cause they don't see the benefits 11:49:14 ... schema.org,RDFa, simple tools 11:49:26 Elie: need more metadata for search engings, end-users 11:49:36 ... metadata quality -- checks needed 11:49:42 ... make website for end-users 11:49:47 ... websites 11:49:59 ... need to work on quality 11:50:08 danbri: thanks to the panel 11:50:20 rrsagent, here? 11:50:20 See http://www.w3.org/2013/04/24-odw-irc#T11-50-20 11:50:31 panel discussion finished 11:50:41 StevenPemberton has joined #odw 11:50:54 trc has left #odw 11:51:17 rrsagent, draft minutes 11:51:17 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html PhilA 11:51:28 c_riffle has joined #odw 11:51:32 jpcs1 has joined #odw 11:52:31 yoshiaki has joined #odw 11:56:09 naomi has joined #odw 12:16:18 st has joined #odw 12:16:32 st has left #odw 12:22:05 st has joined #odw 12:22:19 jpcs1 has joined #odw 12:27:10 stressindikator has joined #odw 12:34:26 LarsG has joined #odw 12:38:55 timbl has joined #odw 12:40:04 rtroncy has joined #odw 12:43:20 stressindikator has joined #odw 12:43:44 yoshiaki has joined #odw 12:45:19 StevenPemberton has joined #odw 12:46:32 markbirbeck has joined #odw 12:47:48 AndyS has joined #odw 12:49:38 sam has joined #odw 12:51:09 fumi has joined #odw 12:52:04 AndreaP has joined #odw 12:52:04 yoshiaki has joined #odw 12:52:08 timbl has joined #odw 12:52:11 yaso has joined #odw 12:52:12 floppy has joined #odw 12:52:32 cjg has joined #odw 12:52:33 JeniT has joined #odw 12:52:58 bhyland has joined #odw 12:53:52 st has joined #odw 12:55:03 rjw has joined #odw 12:55:41 anyone scribing? 12:57:28 albertm has joined #odw 12:57:46 could someone take over if Agis can not do it now? 12:58:09 ScribeNick: JeniT 12:58:17 bhyland has joined #odw 12:58:33 Michalis speaking now re publicspending.gr 12:59:01 Topic: “Storytelling” in the economic LOD: the case of publicspending.gr 12:59:38 Michalis: visualising open spending data 12:59:49 naomi has joined #odw 13:00:17 ... linked open data enables these kinds of analysis 13:00:30 ... this has prompted new projects on LOD 13:00:41 ... network analysis is a useful tool suited to linked data & semantic web 13:00:46 ... particular interest in real-time open data 13:01:14 question: what tools did you use to generate network analysis graphs? 13:01:28 Michalis: visualisations were done by Gephi, processed by R & other mathematical tools 13:01:48 Coupling a data exploration, visualization, story assembly website such as IBM's Many Eyes 2.0 on top of LOD seems like something worth pursuing. See http://www-958.ibm.com/software/analytics/labs/manyeyes/#home 13:01:48 ... all done with open source tools 13:02:15 Topic: Bottom up Activities for linked open data, open government in Japan 13:02:36 Takumi from OKF Japan 13:02:38 Reminder to PhilA -- put RDA lightening talk slides on ODW Workshop website, please! 13:03:11 ... and from Keio University 13:03:33 Takumi: International Open Data in Japan Feb 23rd 2013, 300 participants in 8 cities 13:04:06 ... botom-up activities from stakeholders are driving LOD in Japan 13:04:16 ... academic institutions try to engage with local government & communities 13:04:25 s/botom/bottom/ 13:04:34 ... community members have same goals for LOD, which promotes collaboration 13:04:49 ... neutral intermediary coordinates activities & shares best practices 13:05:05 ... LODAC (LOD for ACademia) http://lod.ac/ 13:05:30 ... develops lots of datasets & builds dictionaries 13:05:36 ... engages with local communities 13:05:52 ... Yokohama city, one of the biggest cities in Japan, large LOD community 13:06:09 ... Sabae city has first local government in Japan publishing LOD on its website 13:06:24 ... LODAC & Yokohama community collaborate 13:06:45 ... create mashup around museum & event information, demonstrating value of combining datasets from different communities 13:06:59 ... private companies have question/answer datasets for Yokohama city 13:07:07 I will e-mail you slides, for now, add links please to http://rd-alliance.org and to http://static.squarespace.com/static/50ad9169e4b00ca12a884beb/t/50b34139e4b033e6125eec16/1353924921864/rda-flyer.pdf (their 1-page flyer) 13:07:27 ... community generated new consortium, including government, citizen, academic members 13:07:40 BartvanLeeuwen has joined #odw 13:07:43 ... this led to Yokohama city becoming big LOD community 13:07:55 ... Sabae city first publisher of LOD on its website 13:08:03 ... 2011 published XML datasets with CC licences 13:08:09 ... 2012 published RDF 13:08:23 ... ATR Creative (private company) used LOD for their own product 13:08:26 Yokohama Art Spot: http://lod.ac/apps/yas/ 13:08:34 ... iPhone application 13:09:01 ... Sabae city became most advanced open data city in Japan, because of collaboration between stakeholders 13:09:23 ... government publishes datasets, but other organisations gather, aggregate, make available LOD 13:09:31 floppy has joined #odw 13:09:38 Sabae Burari, an application of POI and maps mush-up for local sightseeing spot: https://itunes.apple.com/jp/app/sabaeburari/id595859507?mt=8 13:09:45 ... OKF Japan organises bottom-up activities in Japan 13:10:11 ... organised 300 participants in open data day, some doing hackathons, some editing Wikipedia 13:10:18 International Open Data Day in Japan: http://odhd13.okfn.jp/ 13:10:19 ... over 90% participants were satisfied 13:10:40 ... biggest benefits around networking, sharing ideas, and learning about open data & improving engineering skills 13:10:53 ... those involved from different sectors 13:11:17 ... OKF Japan helps to share best practices with each area, by providing toolkits & tutorials 13:11:33 ... eg Where Does My Money Go? originally developed in UK, localised for Japanese usage 13:11:44 ... used in Yokohama city, with tutorials for other cities 13:12:02 ... Conclusion: 13:12:14 ... bottom-up activities has driven engagement 13:12:23 ... collaboration by academia is key 13:12:36 ... neutral intermediary (OKF Japan) coordinates & helps share activities 13:12:56 Topic: Utilising Linked Social Media Data for Tracking Public Policy and Services 13:13:04 cgueret has joined #odw 13:13:05 Deirdre from DERI 13:14:00 Deirdre: relationship between open data & public policy 13:14:17 ... can be used to influence public policy & services, to lobby, influence policy makers 13:14:29 ... eg use statistics to guide new schools 13:14:40 ... also used to justify policy decisions 13:14:47 ... and to evaluate policies 13:15:17 ... eg environmental measurements to see whether regulations have had an effect 13:15:28 ... still a lot of research to do about how this all works 13:15:43 ... what about combining open data with social media data? 13:16:06 ... could give evidence-based policy evaluation 13:16:27 ... social media data is already being used for business intelligence, trend analysis, opinions on brand etc 13:16:34 ... lots of activity from industry 13:16:50 ... government is coming around to this, but using them in limited ways 13:17:11 ... social media used for dissemination & limited engagement, but not to full potential 13:17:12 bhyland has joined #odw 13:17:17 ... not being used to get information from social media 13:17:28 ... government is only a publisher, not a consumer of social media 13:18:09 ... government should be harnessing information from social media 13:18:16 ... proposed to do this using linked data 13:18:36 ... extract data from social media, express as linked data, analysis on it 13:18:43 ... challenges: 13:18:58 ... wide variety of sources, each with its own API 13:19:03 ... wide variety of formats 13:19:05 ... privacy concerns 13:19:14 ... can be noisy, difficult to process 13:19:38 ... this is all based on solid research funded under EU FP7 13:19:56 BartvanLeeuwen has joined #odw 13:19:57 www.linked2media.eu 13:19:59 ... Linked2media to provide SMEs with tooling 13:20:01 ldodds_ has joined #odw 13:20:13 ... DERI developed Social Media Linked Data Space 13:20:31 ... now trying to apply this (designed for SMEs) to government 13:20:55 ldodds__ has joined #odw 13:20:56 ... has a triplestore, crawlers, integrating 25 different sites including review sites 13:21:25 ... there are restrictions on different social media APIs which limit which data you can access 13:21:50 ... once we have data, we model in common linked data format, reusing existing vocabularies 13:21:56 ... using SIOC for review data 13:21:58 PhilA: can I add a barcamp discussion proposal? 13:22:06 ... schema.org, rev, Marl etc 13:22:39 PhilA: How should we attribute open datasets? 13:22:39 ... next steps are to look into integration of social media data with other linked data 13:22:58 ... also using data to influence, justify & evaluate public policy 13:23:16 ... not just technical aspects to this research, also social, political etc 13:23:26 Topic: Panel discussion 13:23:47 Christian: run a small web design company in London 13:24:06 ... running tool to monitor corruption 13:24:38 Uldis: if you had one lesson learnt, what would it be? 13:24:53 Christian: we've had a lack of collaboration, despite open source tools 13:25:16 Deirdre: use of social media data is limited by the restrictions that they place on it 13:25:52 Michalis: big lesson is that critical mass for open data is lower than the web itself 13:26:01 ... you build on the existing web 13:26:19 ... make an application work, and everything else will fall into place 13:26:36 Takumi: engage with local community and local government 13:26:49 ... local community has a diversity of needs 13:27:02 ... need cross-relationships to tackle the real problems 13:27:54 Bart: for Deirdre: did you investigate engaging with the public rather than just reading twitter? asking specific questions rather than just listening? 13:28:16 Deridre: that's something we are looking at 13:28:27 ... a lot of citizen engagement platforms ask on specific topics 13:28:41 ... but it's hard to find participants that care enough to give that feedback 13:28:48 ... when you just listen you can see the trends 13:29:02 ... see what they *do* care about: maybe they just care about environment, not transport 13:29:18 ... these are different approaches for different goals 13:29:54 Uldis: what do you expect to get from crowd-sourcing? 13:30:08 Christian: we kept hearing stories about corruption, but we didn't write them down or map them 13:30:13 cerealtom has joined #odw 13:30:16 ... wanted to build something 13:30:24 ... doesn't work for people who don't have internet access 13:30:33 yvesr: surely its time for a game of crack attack! 13:30:33 ... they do have mobile phones, we have SMS number 13:30:49 ... we want to tell people they have something wrong in their country 13:31:27 Kal: are you concerned about the demographic about people who contribute open data & participate on social media, and how that's different from demographic of general population? 13:31:45 Takumi: in Japan, we don't have much difference 13:32:06 ... tends to be young people, lots of men, but otherwise not so much difference 13:32:40 Takumi's coauthor is speaking 13:32:54 panelmember: in both Yokohama & Sabae, there are lots of knowledge workers 13:33:21 ... lots of data providers 13:33:42 ... many students took part in open data day, to compose articles on Wikipedia 13:34:10 Michalis: researchers & journalists are different 13:34:17 ... because of how they're funded 13:34:27 ... there are both types of users in each demographic 13:34:46 floppy has joined #odw 13:34:52 ... we're seeing a spread in access around Greece, among users who just want to find relevant information 13:35:04 Deirdre: I'm not worried about the demographic as long as we're aware of it 13:35:10 ... we're not claiming that it's representative 13:35:18 ... you can build in SMSs or having real workshops 13:35:20 s/panelmember: in/Yoshiaki Fukami: in/ 13:35:31 ... if you need something that's representative, build in other demographics 13:35:44 Christian: new technology goes hand-in-hand with old technology, don't forget radio 13:36:02 BobSchloss: political parties are getting clever at extracting features from social media 13:36:03 SimpsonTP_ has joined #odw 13:36:24 ... some politicians have dashboards in which the weight of each statement is modified by Facebook friends or twitter followers 13:36:48 ... the rumour is that they can identify whether people are influential in their communities 13:37:08 ... will we see comments being weighted? 13:37:29 Christian: like A/B testing politics: it's no longer politics just the most popular person wins 13:37:39 Deirdre: is that something that should or can be stopped? probably not 13:37:51 ... if we just have opinions then it's biased & subjective 13:38:00 ... if we just have open data, it's not tied into human aspect 13:38:09 ... we need to combine the two to get the balance 13:38:22 Michalis: if you can make objective information more attractive 13:38:36 ... can you relate election area to spending, for example 13:38:50 ... tagging the spatial location of the payment 13:39:00 ... you can find objective information in a subjective way 13:39:36 Uldis: Concluding comments? 13:40:11 Michalis: we believe economic LOD should be nucleus of LOD 13:40:16 ... need money to go around 13:40:36 ... need to say clearly to policy makers which data is data infrastructure 13:40:38 floppy has joined #odw 13:40:47 ... eg in economics, all public spending, prices 13:40:47 sam has joined #odw 13:40:53 beauvais has joined #odw 13:40:59 ... theory and application together 13:41:24 Takumi: relationship between local community & other communities very important 13:41:59 Deirdre: as a community, it's great to see our progress, but I'd love to see more interdisciplinary talks & sessions at these events 13:42:12 ... bringing real use cases to complement the technical skills we bring 13:42:27 Christian: we mustn't forget that there are parts of the world where things aren't moving at this speed 13:42:41 ScribeNick: AndreaP 13:42:51 Topic: Lightning Talks - eGovernment and multilingualism (chair: Yaso Córdova) 13:42:57 HadleyBeeman has joined #odw 13:43:17 yaso has joined #odw 13:43:37 yoshiaki has joined #odw 13:43:57 Topic: A Brief Report on the Research Data Alliance Plenary in March 2013 (Bob Schloss, IBM) 13:44:25 Bob: Huge amount of data from scientists in the next years 13:44:49 ... How such data will be accessible? 13:45:39 cnolle has joined #odw 13:45:40 ... RDA wants to move as IETF 13:46:09 ... to accelerate and facilitate research data exchange. 13:46:24 ... A number of WGs are being organised 13:46:27 floppy has joined #odw 13:46:39 DeirdreLee has joined #odw 13:46:52 ... e.g., on Persistent Identifiers, Metadata 13:47:05 ... They want to look at existing standards. 13:47:23 ... Provenance and quality are other key issues for RDA. 13:47:52 ... They want metadata to be searchable, in a cross-disciplinary way. 13:48:23 ... Another issue: how to handle big datasets. 13:48:58 ... Again: datasets for peer review. 13:49:21 .. Real work starts in September. You are all encouraged to join. 13:50:03 ... About how to join: http://rd-alliance.org/ 13:50:13 Topic: Open Data in Data Journalists' Workflow (Uldis Bojārs, University of Latvia) 13:51:17 Uldis: National Library of Latvia opening up data. 13:51:40 ... [interrupted] 13:52:48 ... [technical issues] 13:53:27 Topic: Empowering the E-government data life cycle (Edoardo Colombo, Politecnico di Milano) 13:53:31 yaso has joined #odw 13:53:56 Edoardo: The project is multidisciplinary. 13:54:14 ... computer scientists, engineers, ... 13:54:16 yaso has joined #odw 13:54:47 ... Goal to set up a publishing protocol for open data that can be used by PAs. 13:56:11 ... General goal is to set up an eGov system, enabling PAs to publish data and citizen to discover them. 13:56:45 ... Why? To have facts, not opinions - open data are facts. 13:56:52 See rd-alliance.org , consider coming to their plenary in Washington DC in September, think what W3C standards and Open Data best practices (such as DCAT) can be extended for their needs. 13:57:12 ... Presenting the system "search computing architecture". 13:57:14 BartvanLeeuwen has joined #odw 13:57:29 ... Use case: money given to hospitals. 13:57:46 ... The high level query is translated into low level ones. 13:58:06 ... Result is presented to the user. 13:59:09 Topic: Open Data in Data Journalists' Workflow (Uldis Bojārs, University of Latvia) 13:59:18 Uldis: I'm back. 13:59:43 ... Interested in the area of open data. 13:59:57 albertm has joined #odw 14:00:05 ... We need to make it easier to work with the data, to make them more re-usable. 14:00:34 ... Work with data frictionless from the start. 14:00:55 ... We should be able to use it for building stories. 14:01:13 ... Presenting workflow on data-driven journalism process. 14:02:11 ... The idea is to have a set of tools able to cover the whole process. 14:02:25 ... Data journalism is just one of t he use cases. 14:03:03 ... Stressing the need to get stories from data. 14:03:15 s/t he/the/ 14:03:55 ... Most important part is data discovery and publishing. 14:04:48 .. Journalist must have information useful to assess the quality of the data they are going to use, first of all information on data provenance. 14:05:08 trc has joined #odw 14:05:37 PhilA has joined #odw 14:05:37 Topic: Lessons learned (and questions raised) from an interdisciplinary Machine Translation approach (Timm Heuss, University of Plymouth) 14:05:42 yaso has joined #odw 14:06:14 Timm: Motivation is that ambiguity is an issue in Natural Language Processing. 14:06:22 rrsagent, draft minutes 14:06:22 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html PhilA 14:06:29 ... Ambiguity may result in incorrect translations. 14:06:42 cnolle has joined #odw 14:06:50 ... Disambiguation is carried out based on dictionaries. 14:07:10 ... Not like the approach. 14:07:25 ... Rather: use what is in the LOD cloud. 14:08:27 ... Statistics for LOD is key for NLP. 14:09:11 ... Number of issues. 14:09:23 ... Can LOD really model natural language? 14:09:49 ... How can simply access LOD datasets? Some of the relevant ones are not easily accessible. 14:10:40 Topic: Interoperability Challenges for Linguistic Linked Data (David Lewis, Trinity College Dublin) 14:11:05 yaso has joined #odw 14:11:28 David: Would like to talk on a number of issues: content management, NLP technologies, localisation. 14:11:58 ... Presenting localisation's value chain. 14:12:30 ... A lot of work is outsourced. 14:12:39 ... Re-use is also a big market. 14:12:52 ... Statistical machine translation is also used. 14:13:11 ... The value chain is quite long. 14:13:39 ... Support for interoperability is there (XML-based), but interoperability is expensive. 14:14:57 ... W3C ITS IG (http://www.w3.org/International/its/ig/) is trying to address some of these issues. 14:15:19 yaso has joined #odw 14:15:25 ... The idea is to address all the localisation process workflow. 14:15:52 ... This is done by using existing formats. 14:16:34 .. Interest in using Linked Data to disambiguate terms and to introduce confidence. 14:16:51 ... Also: how can we use Linked Open Data in the process? 14:17:08 s/.. I/... I/ 14:17:46 ... Provenance ontology (http://www.w3.org/TR/prov-o/) very relevant here. 14:18:24 ... RDF used also for process monitoring. 14:19:15 jpcs1 has joined #odw 14:19:26 ... Another opportunity is to use multilingual LOD datasets to train machine translation. 14:20:47 Topic: Bar Camp Pitches 14:21:06 yaso has joined #odw 14:22:49 Eric: Project on reference implementation for LOD supporting data quality and provenance 14:23:23 Bart: How can we actually say that OD is successful from a business perspective. 14:24:23 James: Which are the barriers to using OD? 14:24:39 sam has joined #odw 14:25:10 anyone ready to scribe? (as AndreaP is leaving soon) 14:25:25 Leigh: Attribution and OD - can we have best practices on this? 14:25:46 Bernadette: Best practices for Persistent URIs? 14:27:15 johnlsheridan has joined #odw 14:27:20 Christopher: Want to show what I did by aggregating data from different Universities 14:27:26 Wolfgang Orthuber 14:27:50 numeric feature spaces 14:28:43 JeniT, Omar: linked CSV 14:28:51 Mark Harrison: ? 14:29:13 AndreaP: can't manage to scribe everything, but i can add some detail 14:29:28 Michael Lutz: ? 14:29:43 ok, barcamp pitches finished 14:30:00 if you pitched barcamp ideas, add more detail here 14:31:44 Michael: What you would like to have as a contribution from the European Commission on open data? e.g., concerning legislation, regulation, reference data and services 14:35:18 rssagent, make minutes 14:36:04 rssagent, make draft minutes 14:36:28 rrsagent, draft minutes 14:36:28 I have made the request to generate http://www.w3.org/2013/04/24-odw-minutes.html AndreaP 14:39:05 cjg has joined #odw 14:40:06 HadleyBeeman has joined #odw 14:40:56 bhyland has joined #odw 14:41:30 fumi has joined #odw 14:44:09 beauvais has joined #odw 14:48:19 StevenPemberton has joined #odw