Outline
This best practice sets out a number of criteria that can be used to prioritise the publication of some datasets ahead of others.
Links to the Revised PSI Directive
Challenge
To develop the criteria for ‘high-value datasets’ taking into consideration the likely re-use of open data and to help governments understand which datasets to prioritise for publication.
Solution
To follow this guidance on dataset criteria which has been developed through engaging with both users and re-users of the data. The characteristics of ‘high-value datasets’ are seen from three perspectives: re-usability, value for data owners, value for re-users.
Reusability
- High-value data should reach at least 3-stars on Tim Berners-Lee's 5 star schema (making it available on the Web under an open licence in a non-proprietary, structured format).
Value for data owner
A dataset may be considered of high-value when one or more of the following criteria are met:
- sharing it contributes to transparency;
- the publication is subject to a legal obligation;
- the data directly or indirectly relates to their public task;
- sharing it helps with cost reduction.
Value for reusers
The value of a dataset primarily depends on its use and reuse potential, which can lead to the generation of business activity. The potential of the dataset is defined by:
- the size and dynamics of the target audience;
- the number of systems or services that could use the dataset.
Datasets contributing to transparency have a strong social impact and re-user’s interest in these datasets is high.
Engaging with Reusers
It is important to engage directly with reusers to understand the value of your dataset. Recommendations:
- establish a communication channel, for example, with a mailing list or a community on Joinup or on the Open Data Portal that could be used to make announcements to re-users and to gather feedback;
- use collaborative tools. This encourages collaboration between a community or re-users and the cross-fertilisation of ideas and business opportunities.
Why is this a Best Practice?
It’s important to have a shared understanding of what can be considered to be high-value datasets so that publication of these datasets can be prioritised.
Why is there a need for this Best Practice?
Understanding which datasets should be published, under what criteria and priority, will help public authorities to see the benefits of publishing more high quality datasets.
How do I implement this Best Practice?
In order to implement this BP you need:
- an understanding of high-value data;
- communication channels with data users and data reusers.
Where has this best practice been implemented?
Country | Implementation | Contact Point |
---|---|---|
References
- Timisoara Workshop Talk: Good practices for identifying high value datasets and engaging with reusers: the case of public tendering data. Nicolas Loozen.
- Timisoara Session: How good is good enough? Makx Dekkers
- Krems Workshop Session: Specific Critical Success Factors for Open Data Publication and Use. Anneke Zuiderwijk, Iryna Susha, Yannis Charalabidis, Peter Parycek, Marijn Janssen
Contact Info
Nicolas Loozen, PwC