A2.

Annex 2 – Public Data Principles

Working definition of “Public Data”

"Public Data" is the objective, factual, non-personal data on which public services run and are assessed, and on which policy decisions are based, or which is collected or generated in the course of public service delivery.

Draft Public Data Principles

Public data policy and practice will be clearly driven by the public and businesses who want and use the data, including what data is released when and in what form – and in addition to the legal Right to Data itself this overriding principle should apply to the implementation of all the other principles.

Public data will be published in reusable, machine-readable form – publication alone is only part of transparency – the data needs to be reusable, and to make it reusable it needs to be machine-readable. At the moment a lot of government information is locked into PDFs or other unprocessable formats.

Public data will be released under the same open licence which enables free re-use, including commercial re-use – all data should be under the same easy to understand licence. Data released under the Freedom of Information Act or the new Right to Data should be automatically released under that licence.

Public data will be available and easy to find through a single easy to use online access point (data.gov.uk) – the public sector has a myriad of different websites, and search does not work well across them. It’s important to have a well-known single point where people can find the data.

Public data will be published using open standards, and following relevant recommendations of the World Wide Web Consortium. Open, standardised formats are essential. However to increase reusability and the ability to compare data it also means openness and standardisation of the content as well as the format.

Public data underlying the Government’s own websites will be published in reusable form for others to use – anything published on government websites should be available as data for others to re-use. Public bodies should not require people to come to their websites to obtain information.

Public data will be timely and fine grained – Data will be released as quickly as possible after its collection and in as fine a detail as is possible. Speed may mean that the first release may have inaccuracies; more accurate versions will be released when available.

Release data quickly, and then re-publish it in linked data form – Linked data standards allow the most powerful and easiest re-use of data. However most existing internal public sector data is not in linked data form. Rather than delay any release of the data, our recommendation is to release it ‘as is’ as soon as possible, and then work to convert it to a better format.

Public data will be freely available to use in any lawful way – raw public data should be available without registration, although for API-based services a developer key may be needed. Applications should be able to use the data in any lawful way without having to inform or obtain the permission of the public body concerned.

Public bodies should actively encourage the re-use of their public data – in addition to publishing the data itself, public bodies should provide information and support to enable it to be re-used easily and effectively. The Government should also encourage and assist those using public data to share knowledge and applications, and should work with business to help grow new, innovative uses of data and to generate economic benefit.

Public bodies should maintain and publish inventories of their data holdings – accurate and up-to-date records of data collected and held, including their format, accuracy and availability.

Available at http://data.gov.uk/wiki/Public_Data_Principles

Comments (2)

Public Data Principles

Fully in favour of Annex 2, which should become a summary of the whole policy rather than an annex .  Safeguards should only include anonymisation of records , and any spatial aggregation to avoid disclosure should be at no higher than postcode level if absolutely necessary , but none if the data is not sensitive.  FOI should be repealed and replaced by a fully open data approach.  

Local authorities and other public bodies need to be pushed and persuaded to become more open, and to improve their systems so that their data is accessible, consistent, and timely, -   plus their IT staff need to become less paranoid about internal IT use, as it blocks staff from effectively using and understanding their own data.

Public data feedback

  1. Public data should be owned by the British tax payer, and not the Government, and hence freely available to the public and business to do with as it sees fit.
  2. Census records should be private for not more than 70 years, by which time, any adult appearing in the data would be a minimum of 88 years old. I know of no-one who would object.
  3. Birth, death and marriage data should be freely available as they are public records. We should not have to pay £8 for a certificate when the data is often all that is required.