See also: IRC log
See attendee list
Scribes: Jeanne, Boris, PhilA, cgueret, olyerickson
<Martin> Shows basic structure with no feedback loop
... adds in the feedback loop
... how can we measure transparency by measuring data?
... journal paper coming out soon
... we can measure info straight from the government
... we can measure powerful voices and citizen's groups
... lots of measurement of perception of transparency. Can we measure it more objectively
... describes different types of transparency (agent controlled or not)
... publicity and accountability are both necessary
... e.g. Nigeria is very transparent but accountability has been low and therefore transparency has been less effective
... 8 metrics for institutional and eco transparency
... applied the 8 metrics to many South American countries
... results of applying the 8 xparency metrics to latin american countries
... Venezuela not doing so well, Brazil and Chile dong well in these terms
... describes two different values that come out of the metrics
<olyerickson> is interested in how good statistical correlation shows lack of corruption (was that the speaker's point?)
<Martin>Low correlation between the online srevice inde
... low correlation between Government Data Openness Index and UN Online Service Index
... so research shows there must be accountability and that info can be received (however - doesn't matter)
... mechanisms for accountability necessary, not just lections
<olyerickson>not sure I understand the "publicity condition..."
<Martin>Shows video generated from text - as an e.g. of an appropriate way to show data
<Federico>Surprised to see no reference to social media
<Martin>Not seen evidence / research into impact of social media in this space
<Richard>Your research was focussed on South America?
<Martin>yes, because they are all democtratic countries but with varying definitions of democracy
<AndrewL>I see Argentina second from bottom on your list. Being used to beat Argentina with... If you've travelled around these countries, does the transparency index 'feel right'?
<Martin>I think this is embryonic and we haven't yet reached a steady state
... You're right that Argentina has always had problems with corruption and used to be a world power 50 years ago... lots of problems
<Sharon>First career was as a manager in government
<olyerickson>Sharon is with CTG Albany http://www.ctg.albany.edu/
<Sharon>this morning is a summary of 20 years experiendce in this spacxe
... "then a miracle occurs" cartoon
... We don't really understand where all the data around us comes from
... it appears like the "then a miracle occurs" situation
... Data problems come from a variety of sources. And problems arise along the way. There are assumptions about data and its provenance
... data comes from the operation of public programmes
... actions may collect data in different ways etc. About people, places, money etc and we treat them differently
... We don't use data properly. We mis-use it because of false assumptions, or we don't use it because we don't understand it
... the reasons why we don't use the data for policy making is that we don't really understand whewre it comes from
... and we shift responsibility for data quality depending where we are in the chain
... e.g. census bureau is a gold standard for getting data right at the source. But the less we spend at the source the less likely the data is to be accurate
... Where does data come from?
... administrative systems
... governed in a particular way and gathered in a specific context for a specific need
... about 40K homeless people in NY state on any given night. Wide variety of support programmes. Some very large, some very small... different capabilities in organising and collecting data
... differing links to welfare system etc.
... ability to pull data together varies -> variation in quality and detail of that data
... mother arriving with 2 children doesn't know SS number, have birth certificates etc. -> gaps in the record that may or may not be filled in during her stay
... are the dates registered first entry to the shelter or most recent
... case 2 parcel data/cadastral
<olyerickson>Cadastral records => http://en.wikipedia.org/wiki/Cadastre
<Sharon>data collected for purporse of tax evaluation etc. Used for all sorts, transport planning, city planning etc.
<PhilA> Sharon: Don't have to allow entry to assessors and so they may not be able to get in the building
<PhilA> Sharon: as the data is passed up the chain and is aggregated, less and less of the data is passed on. If you want it all you'll have to contact thousands of local agencies
<PhilA> ... they all do it differently
<PhilA> Sharon: Case 3 - where does the money go?
<PhilA> ... 1bn of the 8bn went to the NY state transport system
<PhilA> ... there's a requirement for grantees of the money to report quarterly. States not set up to do this and so systems had to be put in place very quickly
<PhilA> ... 13 different information systems used by 400 projects
<PhilA> ... implementation quality varied greatly
<PhilA> ... and it's hard to measure. What is a job created? What is a job saved?
<PhilA> Sharon: we have data, we have technologies, we have management/organisations - how do we collect it? define it etc., and we have policies about the services and the data
<PhilA> ... context is real people in real places trying to do complicated things
<PhilA> ... not surprising that a lot of data is not used in policy making
<PhilA> ... tend to use single data sources. We don't have data about whether services for homeless people and it's harder to get
<PhilA> ... and elephant in the room is data quality
<PhilA> ... for most people data quality means accuracy but it means more
<PhilA> ... data quality = fitness for use
<PhilA> ... data quality usually involves trade offs between, say, timeliness which may mean sacrificing some quality
<PhilA> ... many dimensions to data quality
<PhilA> ... Describes US census data - very detailed. The gold standard. cf. the metadata about a tweet
<olyerickson> ...Link to "Data Quality Assessment" (Pipino, et.al. 2002) http://dwquality.com/DQAssessment.pdf
<PhilA> ... these are prganisations that understand the value of data. The amount of metadata and quality and attention on it should be commensurate with the value we think the data has
<PhilA> ... some data quality tools
<PhilA> ... metadata should support users you don't know
<olyerickson> ...non-technical data quality tools for providers and users...
<PhilA> sharon: data is an artificial representation of the real world
<olyerickson> ..."the more structured the (data), the further it is from the real world"
<PhilA> ... we have talked a little about integrating data sets and we recognise that this is rarely done. That might not be a bad thing as combinations multiple the errors substantially
<PhilA> paper -> http://www.w3.org/2012/06/pmod/pmod2012_submission_39.pdf
<PhilA> HelenD: Take Sharon's last slide as a starting point... we're in the tsunami of data
<PhilA> ... but do we get the information we want?
<PhilA> ... not always cost free
<olyerickson> ..."right" to data linked to "right" to access to information
<PhilA> HelenD: Parallel to publishing of data has been improvement in rights to access to data
<PhilA> ... ECHR sees access to data as part of freedom of expression 3 years ago, confirmed by UN less than a year ago
<PhilA> ... not sure that there's a necessity to talk about a right to data when we have a human right linked to freedom of expression
<PhilA> ... need data in a proper format
<PhilA> ... machine readable etc.
<PhilA> ... and should be free except perhaps printing costs if you want it that way
<PhilA> ... highlights http://asktheeu.org
<PhilA> ... used by OKFN to get access to entire EU budget in machine-readable format for the first time
<olyerickson> ..."New UN General Comment on Freedom of Expression" (2011) <http://bit.ly/L3B0xk>
<PhilA> ... other aspect to access to info being a right puts oblgation on govs to publish proactively, not just in response to FOI
<PhilA> ... Sharon made the comment that the data that's available may not be what's needed in policy making
<olyerickson> ..."When people outside govt...insist on data, can force policy makers to take into consideration data they may have overlooked"
<PhilA> ... but when people outside government insist on access to data can force policy makers to consider data they haven't considered and then maybe use them - recognise the feedback loops
<PhilA> ... we need relevant feedback loops
<PhilA> HelenD: what do we need to know to hold govs to account?
<PhilA> ... open budget spending data is very important
<PhilA> ... what is the impact on financial crisis on health services is very big
<olyerickson> ..."Data for Democracy": budget spending, health services, education data, companies registers, tax delinquency
<PhilA> ... Chris Taggart gives most countries a score of 23/100 on company transparency
<olyerickson> ..."Transparency International measures corruption, not transparency..."
<PhilA> Katleen: So Helen's spoken about gov obligation to provide what citizens need, not what they want to give
<olyerickson> ..."relevant information is not for the government to choose"
<PhilA> Katleen: the Open data community is part of the supply side, not the demand side which is the end users, the public. That's teh goal we need to aim for
<olyerickson> ...relevancy based on what citizens demand/need
<PhilA> ... so what is our role?
<PhilA> ... sitting here won't help
<PhilA> ... it's about reaching out to wider communities
<PhilA> ... open data activitists have a responsibility to reach out to local activists
<PhilA> ... ask not what open data can do for us but what it can do for citizens (paraphrase)
<PhilA> ... most peoiple don't know what to do with data files
<PhilA> .. need to find a way to translate what we;re doing to a bigger group
<PhilA> ... we need to realise how we're creating new dependencies
<PhilA> ... the open data community ends up being the gate keepers
<PhilA> Richard: Questions?
<olyerickson> ...reference to "School of Data" (Ckan)
<PhilA> Katleen: It takes a lot of work and it's a very good start but we need to do more (scribe may have that wrong)
<olyerickson> ...School of Data link <http://schoolofdata.org/>
<PhilA> HelenD: Is the data reaching the campaigning groups and civil society folks
<PhilA> ... does civil society know how to get the data to relevant people. Not just raw data but visualisation and so on. Need complemenatry measures
<olyerickson> ...dataviz work shouldn't stop on the web site
<PhilA> ... we do a lot of work helping organisations use the data
<olyerickson> ...viz work should reach out to "user community" (citizens) to ensure it meets demands and is effective
<PhilA> Tim: What is the message to those on teh supply side - should we stop and consider the capacity to use data before we release it
<PhilA> Federico: Do you think opening budget data is 'not enough' do you think teaching people about the budget could be better than just opening the data set
<Jeanne> Great points on having to communicate more about open data ideas and projects. We can applaud the good ones here, but if the developers, citizens, or others don't know about what we are doing to help it doesn't make a difference.
<olyerickson> ...Q: transparency in visualization...e,g,, how people know how data has been simplified
<PhilA> Christian: We also need some transparency in how to visualise the data. It gets simplified and so we need to understand that. DataViz closes data
<PhilA> Vagner: Back to Sharon - the more structured the data, the less they represent reality - seems counter to everything we're here to talk about?
<PhilA> ... how can we comibine this data with the mess of reality
<PhilA> ... in Brazil, people aren't concerned with data so how do we make it more interesting and more powerful
<PhilA> Katleen: I agree with pretty much all remarks made...
<PhilA> ... in response to Tim - maybe we should go out and ask people
<PhilA> ... developers get asked, but citizen's groups, environmental activists etc. need to be asked too
<PhilA> ... should government have a role in giving information, not just publishing data. When does the repsonsibility come back to gov?
<PhilA> HelenD: On data accuracy - reality is messy. Don't expect true accurate data from government - ask what data they're using
<PhilA> ... dicussions about whether Ecuador or Argentina is more transparent doesn't invaliudate the data
<PhilA> HelenD: Shows slide ranking availability of data in different countries - Serbia at the top, Austria at the bottom
<olyerickson> Info on FOI legislation: <http://bit.ly/L3D7Ru>
<PhilA> Katleen: We have to do both - keep demanding data sets, and we need to link to the users.
<olyerickson> Global RTI Rating Site <http://www.rti-rating.org/>
<PhilA> ... we need to be very aware of what people want. Portals report that some data sets are never downloaded
<PhilA> ... in Brazil of course I met were the OD community. But you have a long history of participatory budget and those people know about the importance of access to info to engage in the debate.
<olyerickson> Q: "where is the stop in open data?"
<PhilA> Yannis: Where is the stop in Open Data - how much info do we really want to publish? is there a barrier somewhere? We're working on the assumption that if everything is open... I;m not happy to have my income statement online but if *everyione's* is, maybe I don't mind so much
<PhilA> ... if we live in a society where everything is open, then is that a viable society?
<PhilA> HelenD: In theory it's possible to have a transparent dictatorship (ref. Martin's talk)
<PhilA> ... it wouldn't be possible for gos to publish *all* the data so let's start with what people are asking for
<Girts> can i get the transcript of todays #egov activity?
<PhilA> HelenD: The only way you know you've changed country when you drive around Europe is that the speed limit changes
<olyerickson> @girts Just hit <cntl>P...
<PhilA> helenD: Several countries put all tax returns online
<Girts> ow it asks for username/pass today
<PhilA> HelenD: We're saying let's ensure that the data is relevant. The Spanish portal has loads of PDFs of data that was already available
<Girts> and thanks again :)
<olyerickson> Re-iterating point that we must ensure data that is published is relevant to society...
<PhilA> Katleen: reports on discussions with privacy people who of course don't share default of open is good
<PhilA> ... we can't advocate taxes online in Belgium in the way Norway does
<PhilA> ... so we need to accept that there are differences between countries
<PhilA> Richard: closes session
<olyerickson> phila: Reminder that lunch is UPSTAIRS (floor 5) not downstairs
<Jeanne> Irina: In the business of making data accessible and discoverable.
<olyerickson> ...2nd part of the battle: using the data
<olyerickson> ...this session is about using the data
<olyerickson> Subtopic: Open Culture Data
<Jeanne> Irina: We have three speakers today. First is Lotta on A Dynamic Faceted Browser for Data Cube Statistical Data at http://www.w3.org/2012/06/pmod/pmod2012_submission_12.pdf
<PhilA> scribe: Jeanne
Lotta: Started experiment in
September 2011 on open culture data, 8 datasets in Netherlands
and 1 hack-a-thon on cultural events.
... Recommendations: CC0 (Creative Commons) for metadata; PDM (public domain mark), CC-BY of CC-BY-SA for content; open standards; and open documentation and communication
... Hack-a-thon was on governmental data gathered people to talk about what we do. Started to build apps on the day of the hack-a-thon. Had one month. 13 apps were made and 8 submissions on cultural data. Gold prize was Vistory based on open images, as well as education prize.
Faces of the Rijksmuseum app based on 100,000 images made available by the Rijkmuseum. Ran image detection software on the content and identified faces.
<olyerickson> ...Link to OCD site http://www.opencultuurdata.nl/
Lotta: Ruks-Quiz was applauded by Rijkmuseum was a quiz based on the content from the museum. Another app made the content available through the conclass browser to see their collections in-house in a new way. Another was Connected Collection that recommends related content from other cultural heritage museums based on your search.
<Vagner_br> the correct Lotte`s presentation is "Open Culture Data" at http://www.w3.org/2012/06/pmod/pmod2012_submission_22.pdf
Lotta: The winning app connected
videos with the collection.
... Next step was that in January 2012 worked with Images for the Future and Creative Commons Nederland, started master classes in April, competition and hack-a-thon in June and another in October. In January 2013 will be awards.
... Master class topics: copyright, technology and tools and licensing, benefits and risks of resuse, hackathons and evaluation.
... 7 new datasets including 140,000 new images from a press company. Future? Release more open culture data sets, gather and share more knowledge and case-based evidence about open culture data, organize networking events, position open culture data in national data catalog, evolve
... Benefits: public mission, data enrichment, increasing relevance, increasing channels to end users, brand value, specific funding, discoverability, new customers, building experise, spill over effects
Question: You recommend CC0 for metadata, we also have ISO open metadata license.
<olyerickson> ISA Open Metadata License mentioned <http://bit.ly/LeHFem>
Lotta: We recommend CC0 to be compatible with Europeana and Europe heritage institutions.
<olyerickson> @jeanne note it's "ISA" not "ISO..."
Question: Are the resources available via Europeana?
Lotta: The intent is to have them accessible.
<olyerickson> Subtopic: Linked Open Data in Use
Florin Bauer and Martin Kaltenbock: http://www.w3.org/2012/06/pmod/pmod2012_submission_2.pdf
Florin: Is open data empowering people? Yes--focus on energy for people in developing countries.
<PhilA> Hmm.... France showed as doing a lot of renewable. Wonder if that includes nuclear (of which FR has a lot)
Florin: We need to provide tools that let people understand the data. We do the visualizations and capacity building. We also have to help decision and policy makers to understand why open data and linked open data is so important. It is difficult to explain.
<olyerickson> ...link to reegle project <http://www.reegle.info/>
<olyerickson> ...Florian: "Martin and I wrote the (Linked Open Data) book..."
Florin: In developing countries talk about the economic benefits the government will have when they release open data. See how datasets have connections with other datasets and organizations. "Linked Open Data: The Essentials" and available at http://www.google.be/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&ved=0CFAQFjAA&url=http%3A%2F%2Fwww.semantic-web.at%2FLOD-TheEssentials.pdf&ei=N5PhT4CKIc6M-waq3ZCYAw&usg=AFQjCNFGkIawiz0lY1FCSi2uwHMgEF-gxg&sig2=YSmXh94SJ4jT
Florin: With linked open data we
share datasets behind the scene, import data updates
automatically, and let other web sites consume the data easily.
Should we transform our own platforms and tools, yes! Reegle is
the gateway we created.
... 220,000 users per month and 1700 stakeholders worldwide. http://reegle.info Information from DBpedia on the country, UN, Eurostat, World Bank, UK, and our company on events and databases we provide.
... Includes data from US OpenEI portal. Without open data we would never be able to maintain this. This is how open linked data helps us.
Martin: Model behind the country
codes shows how we interlink via semantic web technologies.
Controlled vocabularies are essential, and studying SKOS
vocabularies. We have the European data forum 2 weeks ago in
Copenhagen--there is a strong need for standardization,
interoperability, cross-catalog searches, and multi-lingual.
These can be achieved by linked open vocabularies.
... Open Eurobot in the future. Some of the linked vocabularies: reegle, Kluwer, Geological survey of Austria...
... API to extract tags from freetext based on term frequency and our thesaurus. Renewable or clean or sustainable energy--send information to API and get all terms extracted from the text.
Florin: Currently working on drafting a study to prove the benefit of open linked data in energy focused on developing countries. Please let me know if you want to contribute to this effort.
Farida Vis: Cultural specificity--when we collect data in Western cultures, we expect people to honestly answer and complete surveys. Where does this expectation come from? We need to think of global data cultures.
Julian Tait:The role of communication in building an open data ecosystem at http://www.w3.org/2012/06/pmod/pmod2012_submission_37.pdf
Julian: Future Everything is a
festival around Manchester. How would Manchester have developed
if all data was made open? Who would benefit, who would be at
... How can we create an ecology around open data in Manchester. Metropolitan area is 2.6M people, 10 local authorities, complex entity. People cross borders and want to access services anywhere, but administratively cities don't work like that. It also has to be sustainable solutions.
... Best to have a large potential market for services. What is the structure of government in Manchester? Identifying communities by needs and authority. All are set up differently. How would open data benefit you and release or sharing of information benefit you?
... Develop and engage with communities. Why should we do this, people asked. No logic of open data. Who is it for? We created an open data community of users--lots of developers around "OpenData Manchester".
... Users are drawn from different communities--politicians, developers, artists. We created themed hackdays around grand challenges, "Lovely Data"--health, environment, etc.
... Apps were created--bus timetables, mapping visualizations of transport. People find these things useful. Got lots of proof of concepts with no finished products. From these communities we created the data store for all 10 local authorities. Had to have representation and relevance to the users and the communities. Creation of the data and how it is expressed would be relevant to the needs of the community. Open data ecology--community of open data
authorities. Government needed to get data from other governments--but no real ownership of the ecology. Creation of a mutual space--artists, journalists, activists. Want to sustain it now--authorities asking how they should release data and what people will do with it. Business cases are manifesting.
Julian: Transport is a good
case--how they get information to people, 18K real-time
displays vs. provide open data for app creation. GM Local
Authorities--8.5M pounds potentially saved. Planning
Data--invited people to the event to help discover new
... "If we release open data, then we won't need to do this." SWIRRL won the European open data challege is developing businesses based on open data on CIP project and tourism.
PhilA: A few years ago at a W3C event Sir Tim Berners-Lee had a conversation with Gordon Brown and was asked what should happen and he said make your data open. Andrew Stott then got a call to make data.gov.UK. Who else should Tim have dinner with? Are you looking at getting open data about products on shelves?
Julian: No, we haven't focused on that although anyone can make data open. An idea to make food data more open allows transparency to see sourcing of the food.
Michelle: Our project collects
cultural information in Taiwan for open data development. We
had an open data day and provided awards. Want to create apps
like a car GPS service. With government data on GIS data, then
can people can get more benefit from the products and
corporation can provide data back to the government, too.
Irina: How much reuse has happened and what value has happened?
Florin: We have seen a huge increase in the number of users based on taking the data from other portals and mashing it up. Big impact on the front end side.
Lotta: Made 2000 videos available on Wikipedia. Current stats from April--people can add these to Wikipedia articles, and viewed in 50 language versions 3M times per month. By contributing open data, we can a big uptake in usage.
Julian: Transportation data in Manchester has made a difference for our citizens.
Helen: Many people are trying to get FOIA implemented and adopted, but places like Kenya are not making the connection between open data and transparency. There can be ways the open data community can help the access to information community--why opening data can increase internal government efficiency.
Florin: Transparency is less valued in some cases then economic benefits, but open data can result in both.
TimDavies: We have data for co-production working like in Manchester and open public services--does this sidestep the policy space? Are those compatible or are there tensions there?
Lotta: There's the PSI directive that notes cultural heritage institutions are part of the open data efforts--just metadata or open data? Well, it may be only where you have the rights, which is a small portion of it. What is the business model? What do you mean by open data?
Julian: In Manchester, we can do things together that we could never do before. There are set in stone structures, everyone is trying to do the best, but sometimes they fight the structures they are within. By embracing civil society, it shows how they can become more relevant, efficient, etc.
Yannis: Inspired by slides to simulate a situation in Manchester--but we also need to experiment in new models for politicians. From Greece, we discuss how to do life experiment--to take a small society like 30K in an island and have these questions outside the lab. Convince the community to open everything and to convince the politicians (mayor) to do this?
Julian: It depends on the level of concerns because you can't impose the wall of openness.
Yannis: Sooner or later we need to do experiments outside the lab.
Richard: New discussions deal with the content held by museums, archives, and libraries under the directive. But content where third parties hold rights, they are not affected. There is a large amount of content in the public domain.
Lotta: What is in the public domain should remain in the public domain. Rights need to be clear.
Martin: PSI directive is good start, but seems to be watered down.
Richard: Not my impression. Working groups discussing it--we expected opposition, but have had less than expected. Final text adopted by Parliament and Council deals with concerns raised by the institutions about the cost. Austrian federal government reacts in this area, but city of Vienna is a torchbearer.
<boris> scribe: Boris
<Jeanne> Boris: Thanks
John: Introducing the
...: Making sense the government data
... try to understand, analyze the data itself
... relevance on the data
... Introducing the speakers
... Introducing the talks
... First talk from Grece
...: an introduction of LOD in Grece and its status
... idea to enable participation, promote competition
... no apps yet, 2-3 stars
... it's important in Grece
... Issues, how to demonstrate that LOD has added value?
... apps, apps, apps, and more apps
... how to push an elephant in Grece
<josema> coming late to the
log today but comment made by Julian on boundaries resonates
with me; hope you could discuss further about it and
...: updates, visualizations
... Input: "diavegia" every public institution has to provide data
... pointing some vocabularies ....
... original decision text in pdf
... there is a Tax Information System
<PhilA> scribe: boris
...: checklist, they have an ontology by reusing available vocabs
... 2. basic visualizations
... register on thedatahub
... provide links
... links to EU, US, etc
... also links to geodata, dbpedia
... 5. demos and services
... 6. data awareness, organizing hackathons
... showing an example of a basic visualization
... showing where my money goes data
... showing "bubble" visualization about banks
... money goes to help bank
...: now showing other visualization example
... showing a demo about Litchfield district, where my money goes
... extracting information from dbpedia
... comparative statistics, about the money in Lichfield and Kalithea
... SPARQL QUERY about Kalithea for product and services
... they need to do some ontology alignment
... they get the results from that query
... comparing budget of Litchfield and Kalithea
... now build a Greek business repository
... it's an ongoing process
... they can exchange information in both ways
... example Vodafone and Siemens
... information about Greek companies
... they presented first app on the Greek LOD
... try to initiative the benefits of LOD lifecycle
... try to get people involved
... it was hard to get people involved, data was rich, but not in a good shape
... making meaningful data was so, so hard
... people involved, newspaper portal
... preparing visualization for them
...: quick question: how this is a priority giving the current situation in Greece?
michail: In Greece, they have a
special meeting for defining the values of these data
...: now they want to show simple cases
... apparently nothing is happening because of this ....
... no consumption of data so far
... they will inform what happens
someone: it is possible to do the same at EU level
Michail: that's the goal
...: there's a eu directive ... it's a hidden business information .... others can use the data to comepite
... there is not technical barrier
Uldis: introducing his
presentation "Exploring the networks in open data"
...: he is coming from University of Latvia
... his co-author, Valdis is a social network analyst
... network visualization and analysis is useful, showing its benefits
... there are several types of networks
... also a open data hackathon in Riga
... they need the data first, the pre-process the data, then visualize data
... we need data to be open
... it's a problem in Latvia
... no so much open documents
... document in html form from the parlamient
... still they got the information .... political parties, etc
... of course you can publish in a newspaper
... next step is to pre-process the data
... no much cleaning was needed
... output was information about objects and relations among those objects
... next, defining graph connections
... they also need to filter the data
... for example opinion differences ... what to filter out ... filter out trivial decisions
... for visualization they define legends for each one of the parties
... talking about the parties
... about of the level of tolerance, how to consider a connection
... who always agree or disagree ... we can see that some parties have a strong voting discipline
... show another visualization taking into account the 35% disagree
... next, to refine the visualization
... define the right cut-off values ....
... what domain experts think about that? ... their opinion is important to improve visualizations
... documents and visualizations are useful for this task
... showing another example of middle values, opposites parties are voting the same
... we can identify some patterns on the visualizations
<scribe> ...: new diagram
with more number of connections
...: there are connections between opposites parties
... what can we do next?
... understand the visualization
... one expert from former members of the parliament
... improve visualizations, create multiple multiple visualizations
... this involves some new things to do ...
... also we can bring more data ...
... from outside of the graph
... click on a edge and include additional information
... connect to LOD data
... more examples
... donations to political parties
... another use case showing company communication patterns, it has clusters of nodes, clusters are based on where people live
... the connectors over the company offices
... concluding: we need more open data
... discovering patterns, help to make sense, what is the final purpose of the visualization
... interested in to collaboration to other groups
... more contact information
... they have a smart network analyzer tool
... now he's showing the tool ... providing a particular visualization
... showing bridging/disconnecting graphs from opposite parties
Girts: taking data from website,
and apply some conversions .... doing more scrapping help with
no modification of the data? ....
...: the nice graphs, but they can be used for opposite parties
<Jeanne> As a reminder, if you are interested in the W3C eGovernment Interest group, you can see the group at http://www.w3.org/egov/wiki/Main_Page To join go to http://www.w3.org/egov/IG/participation It's a great place to continue these discussions.
Uldis: scrapping is open
Daniel: presenting himself
...: how many of you know w3c?
... the same question in other conference, only 3 hands
... how Russia see the rest of the world, and how us see Russia... in general
... convincing Russian people was hard
... presenting the talk outline
... the project is funding by the ministry of education
... partners Leipzig University, some companies from Russia, etc
... 600 man days - 4 stages
... no code so far
... they got some ontologies, math ontology, and nano ontology
... they want to build a framework that converts structure data and text into LOD
... also make linkable at the end
... most of the work is doing manually
... research challenges, rdb2rdf, text2rdf (similar to poolparty)
... creating Identifiers, and reusing these identifiers
... they are not using any other triplestore
... Russians are building their own triplestore
... it will be ready soon
... Leipzig is helping them in some tasks
... some prototype examples
... data are accessible via web services
... they have to improve the visualizations
... stakeholders have different needs
... now showing a simple browser
... Nano Browsing
... on the left side there's the ontologies
... the prototype is based on media wiki
... the second part is for identifying Experts on the Nano field
... they extract the name of the person ...
... another example: who and where
... visualization of the links
... among places and people
... also in a map
... another example for decision making
... analyzing lod dataset, who writes about something
... should I invest into a research project? (example of question)
... concluding: to attract more Russian people
... from OD -> LOD
... try to stablish work between Russian and UK on the upcoming event next week
... last but not least, try to put an stack all the "know-how"
... at the bottom "Ryde the Wave" from the W3C
... describing the interaction with the government
... it will finish by july next year
... finally, what the economic benefits ?
someone: there is a LOD stack for
free from LOD2
...: you can use it
Daniel: thanks!, but Russian people want to do it everything themselvs
someone: possibility to collaborate with Russian project?
Daniel: not yet
John: presenting Yannis
Yannis: introducing himself
...: talking about LOD2
... but Engage is a new project, not under FP7, it's supposed to be an infrastructure ....
... it's about open data, CKAN, LOD2 platform, but there is difference is that the domain is Science
... science is different
... they are developing the version of the platform by July
... they need more metadata, semantics, vocabularies
... for scientists datasets are different ...
... related to what do we need to prove
... some of the partners are microsoft, ibm, ...
... now they are dealing with experts
... they want to make it as open as possible, also the code, technologies
<PhilA> boris: regarding the vocabularies you need - you're not starting from scratch - which ones are you using?
Yannis: egov vocabs are out
there, but in science is different .... evolving is an
...: we are still searching more vocabs
someone: regarding the
...: and vocabs is the beginning
Yannis: yes, there will be a
...: engage is an attempt on this unit (eu one) ...
... the question is not comparing to others like data.gov.uk, the question is if we can help sciensts
someone: not so much of infrastructure, it's about data accesible
Yannis: the whole thing is there
John: please put the links on the
...: brilliant set of slides
John: to Daniel ..... future steps, expand more horizontally ... cover more domains, what are your next steps?
Daniel: several approaches, use
W3C as evangelist, Russian 2020, EU it's a bit slow, it takes
some time ... Russian has already money for this ....
...: to convert to LOD and not pdf
... ministry of financies is the next player
... an initiative in Russian, kind of Sillicon Valley in Russian, smart city 2014
someone: To Daniel
...: politics about the stuff in Russian, not great reputation in transparency
Daniel: anyone from Russian, yes
....: I know what you mean .... probably they misunderstand the five star scheme
...: they don't have any problem with that
someone: feeling of LOD as static
...: services implies open data feeds
... it is your aim to provide APIs?
... question was for yannis
Yannis: the idea is not to store,
provide and give data feeds
...: 2012 is the last year when LOD is static data,
... makes no sense not to have updating mechanisms
... explaining the issues of scientific data
... everything should be online
... also the linking
someone: yesterday was pointing
out as well, we need big infrastructure
...: his company is working on that in Netherlands
... also geo layers
... statistical data
Yannis: yes, in our Project we
have microsoft, ibm, ....
...: being infrastructure is hard ... the Greek node can give a lot of infrastructure ...
Phil: Welcome Chris Tagger
John: close session
<PhilA> Afternoon session
<PhilA> scribe: PhilA
chair for this session is Vassilios Peristeras from the ISA Programme/DG DIGIT
Business Models for PSI Re-Use: A Multidimensional Framework, Michele Osella, Istituto Superiore Mario Boella
Michele: introduces himself
... showing results of a survey
... focussing on conceptual model
... talks around the slide...
... elaborates on prominent gaps in the literature slide
... focussed on the economics of downstream from the data
... shows different perspectives of PSI re-use
... Research Questions (slide)
... PSI isn't a source of competitive advantage... so what is?
... the Framework 'canvas view'
... PSI is bundled within services that are offered to customers - so the service is the product that is sold
... pyramid view (slide)
<olyerickson> ...Link: European Public Sector Information Platform <http://epsiplatform.eu/>
Michele: each layer in the
triangle depends on the one below's existence
... then dashboard view (slide)
... templates adopted for each check box. Uses sophisticated business logic
... conclusive remarks (slide)
... we believe the frameowrk offers a promising stepping stone for research into commercial exploitation of PSI
... proposal to use it to analyse the many hundreds of apps created in hackathons for public data
... we plan to use the framework ourselves to evaluate social value of PSI within smart cities
... linked to more material etc.
AndrewL: Do you have examples of the kind of business that can set up using this?
Michele: Shows slide from
... identifies 8 archetypal business models
... shows example of an 'open source like' business model (and refers to it as poss applying to Open Corporates)
<olyerickson> ...Another recent OpenCorporates preso by Chris is here <http://slidesha.re/MqxmSi>
Chris: Setting up a company is
like giving birth - you wonder why you did it but later realise
that it was worth it
... almost everyone works for a company, buys stuff from them etc.
<olyerickson> ...much of the "corporate world" we live is hidden from us
Chris: businesses have changed completely in terms of the way the are set up and operate
<olyerickson> ...normal people can't see the interconnections of modern corporates
<olyerickson> ...the corporate world is increasingly complex
Chris: some corporations comrpise thousands of legal entities
<olyerickson> ...new complexities, such as instant semi-automatic company generation
Chris: companies can be formed for a specific purpose. They can be set up for a short time. Expect to see them existing for a matter of weeks or even hours in near future
<olyerickson> ...opacity introduced by "secrecy jurisdictions"
Chris: very complex networks, come circular
<olyerickson> ...systematic problems we see today are in part due to inability to understand it all
<olyerickson> ...ref to "World Bank Puppet Masters" report
Chris: competitors want to pay
less tax, lower wages etc.
... world bank report highlights legal entities used to launder money
... companies are artificial
<olyerickson> ...Link: WB Puppet Masters report <http://bit.ly/MqyOnq>
Chris: did a survey and published
today for tomorrow's DAA. Could anyone do a search and see if a
... was there a licence on the data? Can you get hte info as data or through an API for free? Info on directors?
<olyerickson> ...report being released today, how open is company data in EU
<JeanneHolm> The Puppetmasters report is very interesting: https://openknowledge.worldbank.org/handle/10986/2363
Chris: no one got 100%. UK and
Norway did well
... in Spain, Austria, Greece, Romania etc. you cannot get any info
... so this info isn't available to citizens, journalists etc.
<olyerickson> ...The Real Problem: Have an entry for every single legal entity in the world
Chris: one problem was we needed
to have a list of every legal entity in the world
... legal entities matter as they are the things that have liability
<olyerickson> ..."real data, running code..."
<olyerickson> ...O(43M) companies listed currently
cf. IETF slogan - rough consensus, running code
Chris Open Corporates data is all available under an open licence. Increasingly cooperating with company registers directly
<scribe> ... New Zealand, Norway etc.
UNKNOWN_SPEAKER: working with
World Bank, W3C, EC, Financial Stability Board/financial
... after 12 months' operation, getting most data by screen scraping data we get invited to FSB etc.
... breathing same air as GS1, ISO etc.
<olyerickson> ...has obtained most of their data to-date via scraping, but are being invited to consult with financial boards etc now
<olyerickson> ...most data has come from the community
UNKNOWN_SPEAKER: most of the data
came from the open community, e.g. using scraper wiki
... getting offers of help from all around the world
... paid for on projected income
... for entity matching, data cleansing
<olyerickson> ...huge # of requests for API every day
<olyerickson> ...dual license model
UNKNOWN_SPEAKER: you can use our
API for free but it might be cheaper to pay us to do it
... you can use the data but under share alike. So folk like Dunn & Bradsteet who don't want to share their data aren't going to do this
<olyerickson> ...not *trying* to get customers, but are getting some because share-alike is poison pill to some users
UNKNOWN_SPEAKER: some customers
that want the data without the share alike
... adding more data, enriching everything
Antonis: I'm Antonis Ramfos from
... IT company does most of its wor for public sector
Antonis: interest for us is the
... we see that public sector opens its data increasingly
... we have to realign our business model
... you may have heard of Cordis which we've been running for 10 years. We've done major development on that for the future
... will become a content provider. We have to use the open data to continue to run the service
... the future is here
... whole public service provision is changing and we need to think of new business models
Jens: I have a short talk
yesterday about the Dutch version of FixMyStreet which we found
about after we'd begin work on our project.
... do we make money? not yet
... we deliver services to the mobile government workforce
... how do we make money on bad data?
... a bit like OC. We do open data collection and create data. We can do it better than most government depts
... it helps that we know how governments work
... we offer APIS with 2 way communication
... and we can charge annual fee for premium services
... like guarantees on up time etc.
... 2 new cities signed up today :-)
... Sharon's cartoon this morning was interesting: info $1, info you need $500
... we give the cheap data away and charge for the expensive
... costs of making decisions is very high. We can help to reduce that
David: 10 years ago, 2 10 year old girls were murdered in the UK. Single reason perpetrator wasn't spotted earlier was lack of interoperable data
PhilA: (Google Soham murders)
David: Person I works with all
data standards rather than making a choice
... Police only allowed to spend money on Policing. So wanted the products to be available to governments and everyone else
... in terms of business model. very simple
... we work with suppliers at the beginning of a procurement process
... lowers the cost because you don't have agree on a standard before you start
<scribe> ... ongoing data maintenance is easier
UNKNOWN_SPEAKER: work with suppliers to lower the cost of supply
<olyerickson> ..."Makes money by saving money..."
<olyerickson> Subtopic: Discussion of Business Models for Public Sector Open Data
Chris: Companies like mine will never be the sort of size that generates the sort of size of those legal entities
<olyerickson> ...ChrisT: Trying to srink the business intelligence market down to 1/100th its size
<olyerickson> ...every person's life is effected by this
Chris: competitive info is a significant barrier to entry for many new businesses
<olyerickson> ...access to competitor information is a barrier to innovation
Chris: the extra money comes from the ability to do things more quickly nad easily
Vassilios: Many public admins don't like the cost involved and the fact that others generate the profits
<olyerickson> Gov't needs to guarantee that govt LOD will be sustained
Michalis: We keep asking for
applications. Governments are part of this. They need to be
involved. Which kinds of data are going to be open and
maintained in order to stimulate ecomonmic activity
... should define minimum level of support
olyerickson: +1 to Micahlis
<JeanneHolm> Very important to be sure that companies can trust that the government data they build a business on will be available, accessible, and in a usable format. This can be the biggest barrier to true innovation and economic benefits.
olyerickson: What role should
govs play in fostering the ecosystem. Healthy Human Services
has for several years hosted an event that brings together
people from gov and private sector to foster an ecosystem
... size of the event is huge now. But a numnber of challenges where gov has offered prizes in 10K - 50K for specific challenges for apps based on the data
<JeanneHolm> Some challenges are up to $1,000,000 for particularly complex issues.
<JeanneHolm> See more at http://www.challenge.gov
olyerickson: So my question is - is the EU doing anything similar - giving prizes etc.
Vassilios: I don't think there's
an EU answer, but there might be from member states
... building a community catalogue
<olyerickson> PhilA: Thanks for making sense of my utterances ;)
<scribe> scribe: cgueret
Rethinking Kelly and Etling's Map of the Iranian Blogosphere Gwyneth Sutherlin, University of Bradford
ICT for conflict resolution
scribe: people seems to start
with false assumption that data is diffused as neutral thing,
... limitations of open data for IR policy?
<JeanneHolm> See Gwyneth's paper at http://www.w3.org/2012/06/pmod/pmod2012_submission_31.pdf
how can culture be incorporated?
scribe: physics model of iranian
... show clusters in the network
... poetry / mixed / secular / twelver / ...
... => not useful for policy makers
... what is lost?
... social media are always used in the same way
... online behaviour are supposed to give insight on culture
... need to determine what makes this online space unique
<CaptSolo> slides of the "Exploring the Networks in Open Public Data" talk: http://www.slideshare.net/CaptSolo/exploring-the-networks-in-open-public-data-13391338
scribe: in Iran, need to take in
account the source of the information people put online (added
... need to do a semantic filter first, prior to any visualisation
... enable to distinguish in the information comes, from example, from a diaspora or from elsewhere
... strategies around surroinding online personas
... maybe a way to explain poetry: use it to do metaphoras and escape persecusion off-line
Q: english tweeter user use a lot of sentiments to say things without saying them
A: research has been done on offline content, need to extend it to online content now
The Open Data Workflow: Towards an Open Data Middleware Stefan Scheglmann, Institute for Web Science and Technologies (WeST), University of Koblenz-Landau
WeST associated to gesis (like franhauferh)
developed LISA (local information search and aggregation) for first open data contest in Germany
scribe: showing data about
... heatmap visualisation of neighbourhood
... use point of interests extracted from OSM and other data sets
... took a week for two persons to realise
<olyerickson> ...LISA Munich Edition <http://lisa.west.uni-koblenz.de/lisa-demo/>
scribe: won the first prize
<JeanneHolm> http://lisa.west.uni-koblenz.de/?lang=en for general information about LISA
scribe: got attraction from
business that want to generate sponsor based revenues
... want open data workflow
... explore / choose / import / normalize / present
... use things like Yahoo Pipe
... this and other services can ensure some parts of the chain
... there is a need to define a middleware based on different services
... will be used by policy makers and citizen
... facilitate data consumption
... Open doesn't imply accessible
... people that are targeted are not always able to use the data they receive
... such a middleware will fix that and allows new services to be easily plugged in
... there are different LISA for different cities
Q: wrt middelware, is this just ideas or do you have something already?
A: partially done for LISA, now thinking of making this more robust and figure out more building blocks
scribe: currently more of a strategic discussion
Q: open news project could use that kind of middleware
A: that's exactly the goal, bring non technician to use open data
A means to embrace and map together data standards David Mitton, Listpoint
code list management service
scribe: IATI standard
... make aid spending easier to access
... want to re-use existing information and no re-create code lists
... challenges: mapping / delegation / versioning / changes
... need to be able to access and rely on country codes (cf ISO)
... need to be able to inform data consumers when the data is updated
scribe: did mapping for IATI
... people create new standards every single day
... mappings had to be done manually
... found errors in conding conventions too
... created new code lists there
... community need to have faith in the data they are looking at
... developers can extract the mapping layer in XML
... whenever there is a change in code, all the consumers of the data are alerted
... it's an open platform
... 4000 code lists so far
Additional flash lightning by Uldis BojÄrs
(shows europeana.eu site)
there was Hack4Europe event
scribe: created europe.in to explore europeana content
(shows example with looking for "Da Vinci")
scribe: list of items related to Da Vinci, can click for more informations
<CaptSolo> links from the Europeana visual expl #pmod lightning talk: http://europ.in/about - http://europ.in/search?q=da%20Vinci http://blog.europeana.eu/2012/06/hack4europe-2012-preview-of-prototypes/ + europeana page
<CaptSolo> individual image "pages": http://www.europeana.eu/portal/record/02302/0457E5166CB382AB844BCAB45A7A84AB3FF648D4.html
<CaptSolo> - some html5 history magic involved to dynamically generate item pages
<CaptSolo> - feedback welcome
<CaptSolo> - and let us know of interesting search keywords you find
<PhilA> stijn: I'm Stijn Goodertier from PwC
<PhilA> stijn: Keynotes from Antony Huang from NIEM and Jeanne Holm
<PhilA> stijn: talks on need for code lisst being open and open data middleware
<PhilA> ... need to make data open and accessibkle
<PhilA> ... invitation to join initiatives on Joinup and the ISA Programme
<PhilA> ... we can talk more about it in the networking event afterwards
<PhilA> Session chaired by Luigi Reggi
<olyerickson> Subtopic: Dynamic Faceted Browser for Data Cube
..Speaker: Nikolaos Loutas (DERI)
... statistics data important
... most previously published as excel(csv) but increasingly RDF
... When pub'd in RDF, usually Data Cube Vocabulary
... concepts and attributes for describing data
... measure, attributes of data, dimension
... W3C recommendation
...Example: Ireland data in Data Cube
... average policy maker can't write SPARQL query ;)
... to help: faceted browser over DataCube data
... Working on the UI
...Figure: Overview of DataCube Viewer Architecture
... powered by sparql endpoint (rather than federated query)
...features: no pre-processing, works against any RDF but must be DataCube vocab, pulls extra info via URIs, data can be exported
... contact the authors: Fadi, Gofran or Nikos...
...features: <live demo>
..Q: Statistical data is multi-dimensional...there is prior art to working with this...could this be supported?
...Q: What about MathML? Have they thought about this
... Q; Several nerdish questions about scalability, portability to other vocabs, etc
subtopic: Faceted Browser for
...speaker: Boris Villazon-Terrazas
... We have the data, now need to deliver relevant applications "for my mother, brother, etc"
... NLP, rich client, etc
... Within LOC Cloud, significant portion with geospatial data
... Developed a tool for exploring/visualizing RDf datasets enhanced with geospatial info
... <arch diagram>
... What do we mean by geospatial resource
... use dpbedia geo:lat geo:long model
... but also, geoLinkedData, NeoGeo models
... Model complicated models possible...eg rivers (lineString) etc
... Recall yesterday, many apps based on Google Maps; they do too ;)
... <screen shot> Facets on upper left, overlay options lower left
... OpenStreetMaps implemented now too; dynamic switching between the tw
... OpenLayers API supported
... Catalogue Service Web
... Any other Inspire-based services are supported
... Complex geometries: example, lake equivalents
... allowing user input of corrections ("editions")
... (user clicks on "edition" link, inputs changes, editors can review)
... uses GeoLinkedData
... Partner with GIN (Geo Inst of spain)
...speaker: Possible to integrate statistical data from Statistical Office of Spain
<PhilA> RDF data cube is on
the standards track, not there yet though (but it will
...speaker: Meteorological Data via AEMET
...example: weather stations
... click on weather station, see the weather data...
... see a graph of the data
... modeling sta in sensor network vocab
Subtopic: Research Challenge on
Visualization (Francesco Mureddu)
... visual analytics for policy-making
... "challenges in governance"
... dig into data to extrapolate patterns and models for more evidence-based policy debate
... need people to be more engaged in policy making
... take on complex, unpredictable patterns and make sensible
... Possible solutions: economics, health data, demographics, legal arguments
...also: discussion arguments, geovisualization, security and national defense
... corporate and military angles
... "Inspiring Cases"
... US Labor Force Visualization
... State Cancer Profiles
... Instant Atlas
...Link: stata cancer Profiles <http://statecancerprofiles.cancer.gov/>
... Labor Force Visualizer <http://flare.prefuse.org/launch/apps/job_voyager>
... GapMinder <http://www.gapminder.org/>
Link: InstantAtlas <http://www.instantatlas.com/>
... "Corwall is quite boring..."
<PhilA> Sorry but I have to say, Cornwall is not boring. It's beautiful and a favourite place for holidays!#
scribe: Key Challenges:
... User acceptability: still not intuitive
... dataviz still largely demand-driven
... better integration between tools/packages
... Comments about GoogleViz
... see also GeoDaCenter --- special prohject to enhance criminal justice
<PhilA> olyerickson: The idea of data-enhanced policy making is incredibly optimistic
<PhilA> ... as a US alien... is there a trend within the EU for data-enhanced policy making?
<PhilA> Francesco: We'd liek to create the trend by the Crossover project
<PhilA> DavidO: One of the flagship initiatives of the EC - Future ICT - is a kind of real world data simulator applying particularly to policy making. Simulation, aggregation etc. Big research project on this issue
<PhilA> Gianluca: Crossover is just one project funded by the EC
<PhilA> ... this is a support action to build a community and define a roadmap. There are bigger projects
<PhilA> ... based on this assumption that evidence-based policy making is possible
<PhilA> ... a new call for proposals is coming soon
<PhilA> ... we're awaiting the results of some project to see if there is a trend
<PhilA> DavidO: We had a fear that participants that wouldn't understand what we meant by OD for Policy Modelling
<PhilA> ... obviously unfounded
<PhilA> ... we have seen a number of key tools being discussed, these are the ones being used in policy making
<PhilA> ... so it's happening
<PhilA> ... we had an extremely useful practitioners' panel yesterday
<PhilA> ... also very important for us is the global nature of the community
<PhilA> ... in terms of the roadmap we have had a lot fo contributions focussed on the challenges that OD needs to address to improve policies
<PhilA> ... some challenges lie in accessibility nad usability. Similar to digital divide debate
<PhilA> ... not just hardware but skills/media literacy too
<PhilA> ... these are some of the key issues we have got from this. We'll work on this through the commentable document on the crossover site
<PhilA> ... do you agree with the research challenges - and do you work on this
<PhilA> .. The unstructured part is the LinkedIn group for discussion of issues
<PhilA> ... also we have a collaborative bookmarking method on diigo.com
UNKNOWN_SPEAKER: Humbled by level
... special thanks to scribes
... notes available online soon
... get your slides to PhilA!
... Learned much
... jeanneHolm has gotten many ideas to pick up in W3C eGov IG
... Note also W3C GLD WG
... Finishing early (remarkable)
... Heading over to Sofitel shortly
... the way to get attention of policy makers is free alcohol
... bar opens at 6p (53 mins)
<CaptSolo> thanks for a great event!
<Girts> excellent, ty!
Follow-up link: FuturICT Project http://futurict.eu
<CaptSolo> great work scribing, olyerickson and everyone else!