An interesting article was released in the Yale Journal of Law & Technology earlier this year discussing a view that government should focus on providing usable data online rather than full-blown websites.
Titled Government Data and the Invisible Hand , the premise was quite simply explained in the abstract:Rather than struggling, as it currently does, to design [web ]sites that meet each end-user need, we argue that the executive branch should focus on creating a simple, reliable and publicly accessible infrastructure that exposes the underlying data. Private actors, either nonprofit or commercial, are better suited to deliver government information to citizens and can constantly create and reshape the tools individuals use to find and leverage public data.
This approach is very much at odds with the current approach both in the US and Australia, where in most cases the respective governments provide both the data and all the interpretation designed to meet the needs of specific audiences.
Via the current approach, data can becomes difficult to extract, or is presented in a way that is not useful. On that basis these websites are difficult to use. They are also expensive to develop and maintain and difficult to keep current.
The approach in practice
I've encountered both approaches in Australian government websites.
In a past role, managing the website for a private sector water and energy utility, one of the consistently most trafficked areas of our website was local weather. This section had only a peripheral involvement with the main focus of the site, however the level of usage made it important to retain.
We did not run this weather service ourselves. Instead we used a raw data feed provided by the Bureau of Meteorology (BOM) for free. The data was simply customised and represented in an attractive way in our website.
Ours was not the only organisation using this data - a number of other organisations had built businesses though providing weather information - sometimes combined with video, maps, commentary or other feeds. These sites collectively attracted more traffic than the BOM itself.
To my recollection, provided this data was not packaged and directly resold commercially, the BOM had a policy of giving away the data freely.
This approach helped ensure that the public were able to access accurate information, to the public good. It is important to note that BOM data was collected and processed by people and equipment already paid for out of the public purse.
On the other hand, the Australian Bureau of Statistics (ABS) provided a great deal of the data I used in my day-to-day role.
This data was preprocessed by the ABS into tables or excel documents. These were often chunks of information that were not much use to my audience.
My team spent many hours manually deconstruct and reconstruct the ABS data into different forms to make it useful for our corporate needs.
The ABS did not provide data as a raw feed. While the ABS did gave away its data for free online - and this was fantastically useful - the overhead that went into its website inevitably made it less timely, therefore reducing its value in a commercial sense.
So, in comparison:
BOM
- Gave its data away for free online (public access to public data)
- No data analysis required (lower cost to the agency, faster to market)
- Referrals from everyone reusing the data (reach)
- Enormous innovation in how the data was 'mashed' with other sources / analysed and presented (lower cost to agency, transference of risk of misinterpretation to private sector)
ABS
- Gave its reports away for free online - but not the raw data (public value but less timely)
- Provided intensive data processing (quality assurance but higher cost to the agency, slower to market)
- Limited online reuse, therefore fewer referrals (lower reach)
- No innovation in data analysis and presentation (higher analysis/presentation cost to agency, any risk of misinterpretation stays with the agency)
Bt the way, it's worth pointing out that BOM is the most trafficked government website in Australia. ABS, despite a wider range of statistics, is much further down the list.
Can the data approach be used across other agencies?
I believe it can. Even in my agency we release numbers and resources which could be indexed and provided in a raw data form for reuse.
We also have a website estimator for calculation purposes. There are around a dozen similar estimators that do a similar job - several providing virtually the same result as the official estimator. However those 'fan' estimators cost nothing to the public purse to create or maintain.
So if members of the public are prepared to create these tools, why should the agency?
Granted this last example is a little more tricky than that - the estimation process is time-consuming and maximising accuracy is a key goal.
However there are other government websites and tools which could and would be delivered by private organisations and individuals, if only the government allowed access to the data stream.
Level the playing field
Note that the article does not suggest that government should stop analysing data and presenting this analysis in websites.
What it suggests is to provide the raw data on a level playing field, thereby allowing private and public organisations the same capacity to use it.
The best way to ensure that the government allows private parties to compete on equal terms in the provision of government data is to require that federal websites themselves use the same open systems for accessing the underlying data as they make available to the public at large.This means that government agencies such as the ABS can continue to provide reports for people who choose not to do their own analysis.
However it opens the field to innovation and the use of various data sources to make connections that government, in a siloed form, is not as able to do.
This levelling is critical - if the government wants to see innovation it should not hold back the 'secret sauce'. The data needs to be available in a way that allows private and other public enterprises to use it in an equal way.
Open systems are available today via standards such as XML and RSS - look at how Google syndicates maps and ads or how Facebook allows the creation and dissemination of applications.
In conclusion
Government has a crucial role to play in the collection of data across the country. This is a task well suited to the public sector as it is in the public interest that this be available.
However government doesn't have the systems or culture to be best suited to interpret and combine this data or make it useful for individuals and organisations.
Government should provide interpretations - however it should not hold an artificial monopoly over this.
By allow other organisations to access the raw information innovation in its presentation can occur more rapidly, providing deeper insights for the public good.
Make government data freely available.
Does anyone have other examples of where government collected data has become freely available? I'd love to blog about the successes.