This wiki has been archived and is now read-only.

Best practice/dataset criteria

From Share-PSI EC Project
Jump to: navigation, search


Outline of the best practice

Best practices for identifying high value datasets.

Management summary


To develop the criteria for ‘high-value datasets’ taking into consideration the likely re-use of open data to help governments understand which datasets to prioritise for publication.


To follow this guidance on dataset criteria which has been developed through engaging with both users and re-users of the data. The characteristics of ‘high-value datasets’ from three perspectives: reusabiity, value for data owners, value for re-users.

  • high-value data should reach 3-star data on the 5-star schema of Tim Berners-Lee (making it available on the web under an open license in a non-proprietary structured format)
Value for data owner

A dataset may be considered of high-value when one or more of the following criteria are met:

  • it contributes to transparency
  • the publication is subject to a legal obligation
  • it directly or indirectly relates to their public task
  • it helps with cost reduction
Value for re-users
  • the value of a dataset primarily depends on its use and re-use potential, which can lead to the generation of business activity. The potential of the dataset is defined by:
  • the size and dynamics of the target audience
  • the number of systems or services that could use the dataset

Datasets contributing to transparency have a strong social impact and re-user’s interest in these datasets is high.

It is important to engage directly with re-users to understand the value of your dataset.

Engaging with Reusers


  • establish a communication channel, for example, with a mailing list or a community on Joinup or on the Open Data portal that could be used to make announcements to re-users and to gather feedback
  • use collaborative tools. This encourages collaboration between a community or re-users and the cross-fertilisation of ideas and business opportunities.

Best Practice Identification

Why is this a Best Practice? What's the impact of the Best Practice?

It’s important to have a shared understanding of what can be considered to be high-value datasets so that publication of these datasets can be prioritised.

Link to the PSI Directive

'This relates to dataset criteria and priorities.

Why is there a need for this Best Practice?

Understanding which datasets should be published, under what criteria and priority will help the publication of more high quality datasets.

What do you need for this Best Practice?

An understanding of high-value data, and communication channels with data users and data reusers.

Applicability by other member states?

  • The approach is applicable to any Member State.

Contact info - record of the person to be contacted for additional information or advice.

Nicolas Loozen Retrieved from: [1]

Related Best Practices