This post was inspired by a comment by John Sheridan on Twitter,
craigthomler what I'd like for ? egs of use of for service delivery innovation, value creation etc, not just curiosity
It's a good New Years wish and highlights two questions that I have been pondering for a long time.
1. Why aren't people making more use of publicly release government data?
2. Does making government data publicly available have any value if people aren't using it to value add?
Let's take them in order...
1. Why aren't people making more use of publicly release government data?
Across these six countries (I am excluding the two states), that is in excess of 670,000 datasets released publicly. However if you search around there's not that many apps listed using the data. The
US site lists around 1,150 and
Australia's site lists 16 - however that's not many compared to the number of datasets.
OK, let's work through a few possibilities.
Firstly it could be that these datasets are being widely used, but people simply aren't telling the catalogues. Data may be embedded in websites and apps without any communication back to the central catalogue, or it may be downloaded and used in internal spreadsheets and intranets. In this case there's no actual issue, just a perceived one due to lack of evidence.
Secondly, to face facts, the majority of people probably are still not aware of these data catalogues - they haven't really been widely promoted and aren't of much interest to the popular media. Therefore there may be hundreds of thousands of people wishing to access certain government information but unaware that it is readily available.
Thirdly, those people aware of these datasets may be daunted by the number released, unable to find the data they specifically want to use or simply aren't interested.
Finally, perhaps simply releasing a dataset isn't enough. Few people are data experts or know what to do with a list of values. Could it be that we need simple and free analysis tools as well as raw data?
There's steps governments can take to address all of these possibilities.
If people aren't telling the government about their apps, why not establish light 'registration' processes to use them which capture information on why they are being used? Or if this is too invasive, offer people appropriate incentives to tell the central catalogue about their uses of the data.
Secondly, there may be a need to promote these data catalogues more actively - to build awareness via appropriate promotion.
Thirdly, perhaps we need to do more user-testing of our data catalogues to better understand if they meet the audience's needs. Combined with excellent mechanisms for suggesting and rating datasets, this could greatly inform the future development and success of these catalogues.
And finally, governments need to consider the next step. Provide the raw data, but also provide sites and tools that can analyse them. Sure governments are hoping that the public will create these, and maybe they will, however that doesn't mean that agencies can't do so as well. There's also pre-existing tools, such as Yahoo Pipes, IBM's Manyeyes and analytics tools from Google which could be pre-populated with the government datasets, ready for users to play with.
Alongside all these specific solutions, maybe governments need to start using some of the tools at their disposal to ask why people aren't using their data. Is it the wrong data? Presented in the wrong way? Too hard to use? Market research might help answer these questions.
2. Does making government data publicly available have any value if people aren't using it to value add?
Now to take the second question - does it really matter whether people are using open government data anyway?
Are there other goals that releasing data addresses, such as transparency and accountability, intra-government sharing and culture change?
If the mandate to release data leads to government culture change it may drive other benefits - improved collaboration, improved policy outcomes, cost-savings through code and knowledge sharing.
Of course it is harder to provide a direct quantitative link between releasing data, changing culture and the benefits above. However maybe this is what we need to learn how to measure, rather than simply the direct correlation between 844 datasets released, 16 apps created.