My Lords, I rise to support the amendments tabled in the name of the noble Baroness, Lady O’Neill. There is some confusion outside this House about the definitions of ““dataset”” and ““re-use”” in this Bill. Although the provisions in Clause 100 do not actually change any of the obligations under FOI, other than so far as they relate to the format of the information provided, this clause has stimulated some debate about what it is that might need to be provided, as the Bill says, in a form which is, ““capable of re-use””. In particular, there is a question about whether we are talking about raw data, which, as the noble Baroness, Lady O’Neill, has argued, may be pretty meaningless in themselves, or the associated metadata—information which would allow someone to interpret the data. This might include the information identifying the individual records to which the data relate, or, say, the geographic location of a sample. In some cases this might be relatively straightforward. Indeed, I think that the Bill envisages data sets as something relatively contained and manageable—say, an Excel spreadsheet.
In the context of university research, however, data sets might be very much more complicated. Universities UK has given a very helpful example. The European Bioinformatics Institute’s 1,000 genomes data set comprises approximately 200 terabytes of data equivalent to the capacity of 3,200 iPods or 40,000 DVDs. The metadata are stored separately from the data themselves and accessing the data requires specialist software. As it happens, this data set is already in the public domain, which brings me to my next point.
There is already a move towards making data more widely available. This is a requirement of several major funders—the Wellcome Trust and research councils, for example. I believe that we should encourage this as the best way to ensure that access to data can be provided in such a way as to increase the transparency of research. We know that there is already important work under way on this issue, not least the work by the Royal Society to which the noble Baroness, Lady O’Neill, has referred. In addition, the Government’s recently published Innovation and Research Strategy included a commitment to look at the potential to increase access to data assets, including through providing funding for the world’s first open data institute. The Government will publish an open data White Paper this spring. This will be informed by the work of a group chaired by Dame Janet Finch on improving access to research publications, which will also report shortly.
In addition, the research councils are setting up a UK gateway to research, which will allow ready access to research council funding, research information and related data. For this reason I particularly support Amendment 147B in the name of the noble Baroness, Lady O’Neill, which states that public authorities should, "““provide … or undertake to provide””"
access to their data, which I take to mean that an adequate data-sharing plan should be sufficient to satisfy their obligations under the Act.
Will the Minister in his response comment on some specific questions? If the research team uses specialist software to organise their data, is there an expectation that it should have to alter the format to make it accessible to a non-specialist user? Would the cost of making data available in an alternative format be chargeable under FOIA? Does the Act require researchers to provide metadata to aid interpretation of the data set? Would the costs of providing metadata be chargeable under FOIA? If the Minister is unable to concede to an amendment exempting research information at the pre-publication stage, could he reassure the research community that robust data-sharing plans will be favourably considered by the ICO as evidence of intention to publish, and thus consistent with the appropriate use of the existing Section 22 exemption?
Protection of Freedoms Bill
Proceeding contribution from
Baroness Warwick of Undercliffe
(Labour)
in the House of Lords on Thursday, 12 January 2012.
It occurred during Debate on bills
and
Committee proceeding on Protection of Freedoms Bill.
About this proceeding contribution
Reference
734 c6-7GC Session
2010-12Chamber / Committee
House of Lords Grand CommitteeSubjects
Librarians' tools
Timestamp
2023-12-15 21:20:10 +0000
URI
http://data.parliament.uk/pimsdata/hansard/CONTRIBUTION_799837
In Indexing
http://indexing.parliament.uk/Content/Edit/1?uri=http://data.parliament.uk/pimsdata/hansard/CONTRIBUTION_799837
In Solr
https://search.parliament.uk/claw/solr/?id=http://data.parliament.uk/pimsdata/hansard/CONTRIBUTION_799837