Data quality dimensions metrics
From Data on the Web Best Practices
This page gathers relevant data quality dimensions and ideas for corresponding metrics.
Sources:
- http://lists.w3.org/Archives/Public/public-dwbp-wg/2015Apr/0023.html
- http://www.slideshare.net/OpenDataSupport/open-data-quality-29248578
Availability
- Yes/no, possibly with an explanation of why the data is not available (privacy, security, archived, lost, not yet captured, etc.)
- Open/restricted/registration, again possibly with explanation
- For access/re-use
- Indication of persistence and longevity
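As a rough illustration, the availability items listed above could be captured in a small record. This is only a sketch: the names `Availability`, `AccessLevel`, and all field names are hypothetical and do not come from any agreed vocabulary.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class AccessLevel(Enum):
    """Access categories taken from the list above (names are hypothetical)."""
    OPEN = "open"
    RESTRICTED = "restricted"
    REGISTRATION = "registration"


@dataclass
class Availability:
    available: bool                           # the yes/no metric
    unavailable_reason: Optional[str] = None  # e.g. "privacy", "archived", "lost"
    access: Optional[AccessLevel] = None      # open / restricted / registration
    access_note: Optional[str] = None         # optional explanation of the access level
    persistence_note: Optional[str] = None    # free-text indication of persistence and longevity

    def summary(self) -> str:
        """Return a short human-readable description of the availability."""
        if not self.available:
            reason = self.unavailable_reason or "no reason given"
            return f"not available ({reason})"
        level = self.access.value if self.access else "unspecified access"
        return f"available ({level})"


print(Availability(False, unavailable_reason="privacy").summary())
print(Availability(True, access=AccessLevel.REGISTRATION).summary())
```

Keeping the explanations as free text mirrors the "possibly with explanation" wording; a controlled list of reasons would be an alternative design.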
Processability
- Level on the 5-star scale (although some felt it is dangerous to attach value to linking, because the data might be good but link to ‘bad’ data)
- Links to metadata standards used and data model/schema to enable automatic processing
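The two processability items above could be recorded along these lines. This is a minimal sketch under stated assumptions: `Processability`, its field names, and the `example.org` URLs are hypothetical, and treating a schema link as the sign that automatic processing is possible is a simplification made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Processability:
    star_level: int                                              # level on the 5-star scale (1 to 5)
    metadata_standards: List[str] = field(default_factory=list)  # links to metadata standards used
    schema_url: Optional[str] = None                             # link to the data model/schema

    def __post_init__(self) -> None:
        # Reject values outside the 5-star scale.
        if not 1 <= self.star_level <= 5:
            raise ValueError("star_level must be between 1 and 5")

    def has_schema(self) -> bool:
        """Assumption: a schema link is what enables automatic processing."""
        return self.schema_url is not None


record = Processability(
    star_level=4,
    metadata_standards=["http://example.org/metadata-standard"],  # placeholder URL
    schema_url="http://example.org/schema",                       # placeholder URL
)
print(record.star_level, record.has_schema())
```

The star level is stored as a plain fact rather than folded into a score, in line with the concern that attaching value to linking may be misleading.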
Cluster accuracy/consistency/relevance
It might be useful to include some information about the context (e.g. why the data was created and what purpose it is supposed to serve).