Report (3rd Day)

Part of Protection of Freedoms Bill – in the House of Lords at 5:45 pm on 15 February 2012.

Alert me about debates like this

Photo of Lord Lucas Lord Lucas Conservative 5:45, 15 February 2012

I understand and I remember from reading biographies that that was the case. None the less, the data were shared. To take an example from my time on the Front Bench as spokesman for agriculture during the problem of BSE, the Ministry of Agriculture, Fisheries and Food had been meticulously researching what was happening with this plague and had years of data. We had good people internally who were researching it. We did not know what was happening and we kept the data to ourselves. Three weeks after we released the data, we were told what was happening, which was transmission by food. That was right. Making data public, beyond a research group, is a very important thing to do at the right time. I should not like to see something in legislation that prevents that and allows people to hog data that should be public so that they can be properly criticised and understood.

My amendment is not on the same subject; it concerns technical bits of drafting in the same clause. I very much welcome the determination to provide greater access to data sets. It is something that I have struggled with, particularly with universities. All the universities that send me data stick copyright notices on them, which I studiously ignore. They have yet to sue me for it, probably because they have better things to do. Alternatively, I proposed as a remedy to one university that, if it insisted on its copyright notice, I would automatically generate an FOI exemption for every one of my users who wanted to access the data. The university thought that a number of 10,000 users a day was getting a bit large.

It is important that we understand that, when information is released under FOI, it can be passed on and made public; and that the generating institution does not retain some sort of control over it merely on a whim. I can understand why that might be the case if the material comprises something done under a publication scheme and is paid for, but otherwise it is very important that the information can be circulated whether in news media or in publications such as mine -the Good Schools Guide-or in many other applications.

I do not see why the proposals in the Bill do not go further and why they are restricted to data sets. It is common for all kinds of information released under FOI to be accompanied by a copyright. However, it is often obvious from the information that it has no conceivable commercial value to the public authority. A requester may have obtained a policy that he or she wants to publish on a website which demonstrates alleged shortcomings in an institution; for example, it may show that a decision has been taken without proper consideration of the consequences. The requester may want to write to Members of this House about the information that has been disclosed. Why should they be prevented from doing so by a copyright notice? It seems to me that the principles we are setting out in this clause should go further.

My second concern is about the definition of "data set", which I believe is unjustifiably narrow. The Cabinet Office carried out an open data consultation which sets out admirably ambitious objectives for the greater use of data sets. Many collections of data currently gathered by public bodies which may be essential to revealing the inner workings of government do not seem to fall within the legal definition of a data set as set out in Clause 102(2). Any electronic collection of data which is the result of analysis or interpretation cannot be a data set because of new Section (5)(b)(i) of the definition. The obligation to release it in reusable form will not apply to it, nor will the requirements to release it subject to the minimal restrictions embodied in a specific licence. The Information Commissioner will not be able to require that this collection of data must be published under an authority's publication scheme. It seems that only raw data untouched by human hand are to be affected by this clause. That may suit people like me who spend their day with programs interpreting data, but most people want access to something which has been prepared for human consumption and has been set out in a way that members of a local authority are intended to understand rather than the geeks in their data department. I do not understand why the Government are seeking to exclude from this clause data which have been made human readable, as it were.

Paragraph (c) of the definition states that a data set remains one only so long as all or most of the information in it,

"remains presented in a way that (except for the purpose of forming part of the collection) has not been organised, adapted or otherwise materially altered since it was obtained".

I am puzzled by that definition. A publication of data from a database may start off in the form of a spreadsheet consisting of three columns but is reorganised to consist of seven columns. Has enough been done to prevent the data set being published? How is anyone to know that? How is anyone to interpret that? How is any user to know that something is publishable? It seems to me that we are setting ourselves up for endless arguments. I am particularly concerned that authorities may find that, by reorganising data, they are able to conceal it from publication. It does not seem to me that that is the Government's intention. I very much hope that they will explain to me their understanding of how this subsection may not be used in that way.