{"id":464,"date":"2013-10-28T14:17:14","date_gmt":"2013-10-28T14:17:14","guid":{"rendered":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/?p=464"},"modified":"2013-10-28T14:17:14","modified_gmt":"2013-10-28T14:17:14","slug":"science-as-an-open-enterprise-prof-geoffrey-boulton","status":"publish","type":"post","link":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/2013\/10\/28\/science-as-an-open-enterprise-prof-geoffrey-boulton\/","title":{"rendered":"Science as an open enterprise \u2013 Prof. Geoffrey Boulton"},"content":{"rendered":"<p><a href=\"http:\/\/www.openaccessweek.org\/\"><img decoding=\"async\" loading=\"lazy\" class=\"alignright size-medium wp-image-466\" src=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/OpenAccessWeek-300x101.png\" alt=\"\" width=\"300\" height=\"101\" srcset=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/OpenAccessWeek-300x101.png 300w, https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/OpenAccessWeek.png 311w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a>As part of <a title=\"OpenAccessWeek\" href=\"http:\/\/www.openaccessweek.org\/\" target=\"_blank\" rel=\"noopener\">Open Access Week<\/a>, the <a title=\"DL\" href=\"http:\/\/www.ed.ac.uk\/schools-departments\/information-services\/research-support\/data-library\">Data Library<\/a> and <a href=\"http:\/\/www.ed.ac.uk\/schools-departments\/information-services\/research-support\/publish-research\/scholarly-communications\">Scholarly Communications<\/a> teams in IS hosted a lecture by emeritus <a title=\"Boulton\" href=\"http:\/\/www.ed.ac.uk\/schools-departments\/geosciences\/people?cw_xml=person.html&amp;indv=437\" target=\"_blank\" rel=\"noopener\">Professor Geoffrey Boulton<\/a>\u00a0drawing upon his study for the Royal Society: <a href=\"http:\/\/royalsociety.org\/policy\/projects\/science-public-enterprise\/report\/\" target=\"_blank\" rel=\"noopener\">Science as an Open Enterprise (Boulton, et al 2012)<\/a>. 
The session was introduced by <a href=\"http:\/\/uk.linkedin.com\/in\/robinrice\/\">Robin Rice<\/a>, who is the University of Edinburgh Data Librarian.\u00a0 Robin pointed out that the <a title=\"UoE\" href=\"http:\/\/www.ed.ac.uk\/\" target=\"_blank\" rel=\"noopener\">University of Edinburgh<\/a> was not just active, but was a leader in research data management, having been the first UK institution to have a formal <a title=\"Policy\" href=\"http:\/\/www.ed.ac.uk\/schools-departments\/information-services\/about\/policies-and-regulations\/research-data-policy\" target=\"_blank\" rel=\"noopener\">research data management policy.<\/a>\u00a0 Looking at who attended the event, perhaps unsurprisingly the majority were from the University of Edinburgh.\u00a0 Encouragingly, there was roughly a 50:50 split between those actively involved in research and those in support roles.\u00a0 I say encouragingly as it was later stated that policies often get high-level buy-in from institutions but have little impact on those actually doing the research. Perhaps more on that later.<\/p>\n<p>For those who don\u2019t know Prof. Boulton, he is a geologist and glaciologist and has been actively involved in scientific research for over 40 years.\u00a0 He is used to working with big things (mountains, ice sheets) over timescales measured in millions of years rather than seconds and notes that, while humanity is interesting, it will probably be short-lived!<\/p>\n<p>Arguably the way we have done science over the last three hundred years has been effective. Science furthers knowledge. \u00a0Boulton&#8217;s introduction made it clear that he wanted to talk about the processes of science and how they are affected by the gathering, manipulation and analysis of huge amounts of data: the implications, the changes in processes, and why openness matters in the process of science. 
This was going to involve a bit of a history lesson, so let\u2019s go back to the start.<\/p>\n<p><strong>Open is not a new concept<\/strong><\/p>\n<div id=\"attachment_469\" style=\"width: 310px\" class=\"wp-caption alignright\"><a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5615-001.png\"><img aria-describedby=\"caption-attachment-469\" decoding=\"async\" loading=\"lazy\" class=\"size-medium wp-image-469 \" src=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5615-001-300x200.png\" alt=\"Geoffrey Boulton talking about the origins of peer review\" width=\"300\" height=\"200\" srcset=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5615-001-300x200.png 300w, https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5615-001-1024x684.png 1024w, https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5615-001-768x513.png 768w, https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5615-001-1536x1026.png 1536w, https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5615-001-1568x1048.png 1568w, https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5615-001.png 1600w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><p id=\"caption-attachment-469\" class=\"wp-caption-text\"><em>&#8220;Open is not a new concept&#8221;<\/em><\/p><\/div>\n<p>Open has been a buzzword for a few years now.\u00a0 <a href=\"http:\/\/www.theodi.org\/\">Sir Tim Berners-Lee and Prof. Nigel Shadbolt <\/a>have made great progress in opening up core datasets to the public.<span style=\"color: #000000\">\u00a0 But for science, is open a new concept? <\/span>Boulton thinks not. 
Instead he reckons that openness is at the foundations of science but has somehow got a bit lost recently.\u00a0 Journals originated as a vehicle to disseminate knowledge and trigger discussion of theories.\u00a0 Boulton \u00a0gave a brief history of the origins of journals pointing out that <a href=\"http:\/\/en.wikipedia.org\/wiki\/Henry_Oldenburg\">Henry Oldenburg<\/a> is credited with founding the peer review process with <a href=\"http:\/\/rstl.royalsocietypublishing.org\/\">the Philosophical Transactions of the Royal Society<\/a>.\u00a0 The journal allowed scientists to share their thoughts and promote discussion.\u00a0 Oldenburg\u2019s insistence that the Transactions be published in the vernacular rather than Latin was significant as it made science more accessible.\u00a0 Sound familiar?<\/p>\n<p><strong>Digital data &#8211; threat or opportunity?\u00a0<\/strong><\/p>\n<p>We are having the same discussions today, but they are based around technology and, perhaps in some cases, driven by money. The journal publishing model has changed considerably since Oldenburg and it was not the focus of the talk so let us concentrate on the data.\u00a0 Data are now largely digital.\u00a0 Journals themselves are also generally digital. \u00a0The sheer volume of data we now collect makes it difficult to include the data with a publication. 
So should data go into a repository?\u00a0 Yes, and some journals encourage this but few mandate it.\u00a0 Indeed, many of the funding councils state clearly that research output should be deposited in a repository but don\u2019t seem to enforce this.<\/p>\n<p><strong>Replicability \u2013 the cornerstone of the scientific method<\/strong><\/p>\n<div id=\"attachment_470\" style=\"width: 209px\" class=\"wp-caption alignright\"><a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5619_2.png\"><img aria-describedby=\"caption-attachment-470\" decoding=\"async\" loading=\"lazy\" class=\"size-medium wp-image-470\" src=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/IMG_5619_2-199x300.png\" alt=\"Image of Geoffrey Boulton during his talk\" width=\"199\" height=\"300\" \/><\/a><p id=\"caption-attachment-470\" class=\"wp-caption-text\">Geoffrey Boulton, mid-talk.<\/p><\/div>\n<p>Having other independent scientists replicate and validate your findings adds credence to them. Why would you as a professional scientist not want others to confirm that you are correct?\u00a0 It seems quite simple but it is not the norm.\u00a0 Boulton pointed us to a recent paper in Nature (<a href=\"http:\/\/www.nature.com\/nature\/journal\/v483\/n7391\/full\/483531a.html\">Nature v483 n7391<\/a>) which attempted to replicate the results of a number of studies in cancer research. The team found that they could only replicate 6, around 11%, of the studies.\u00a0 So the other 89% were fabricating their results?\u00a0 No, there are a number of reasons why the team could not replicate all the studies.\u00a0 The methodology may not have been adequately explained, leading to slightly different techniques being used, the base data may have been unobtainable, and so on, but the effect is the same. 
Most of the previous work that the team looked at is uncorroborated science.\u00a0 Are we to trust their findings?\u00a0 Science is supposed to be self-correcting.\u00a0 You find something, publish, others read it, replicate and corroborate or pose an alternative, old theories are discounted (Science 101 time: \u201c<a href=\"http:\/\/en.wikipedia.org\/wiki\/Null_hypothesis\" target=\"_blank\" rel=\"noopener\">Null Hypothesis<\/a>&#8221;) and our collective knowledge is furthered.\u00a0 Boulton suggests that, to a large degree, this is not happening. Science is not being corroborated. We have forgotten the process on which our profession is based. Quoting Jim Gray:<\/p>\n<blockquote><p><em>&#8220;when you go and look at what scientists are doing, day in and day out, in terms of data analysis, it is truly dreadful. We are embarrassed by our data.&#8221;<\/em><\/p><\/blockquote>\n<p><strong>Moving forward (or backwards) towards open science<\/strong><\/p>\n<p>What do we need to do to support and advise our students and researchers, and to ensure materials are available to them, so that they can be confident about sharing their data? \u00a0The University of Edinburgh does reasonably well but we still, like most institutions, have things to do.<\/p>\n<p>Geoffrey looked at some of the benefits of open science and, while I am sure we all already know what these are, it is useful to have some high-profile examples that we can all aspire to follow.<\/p>\n<ol>\n<li>Rapid response \u2013 some scientific research is reactive. This is especially true in research into epidemiology and infectious diseases.\u00a0 An outbreak occurs, it is unfamiliar and we need to understand it as quickly as possible to limit its effects. During an E. coli outbreak in Hamburg, local scientists were struggling to identify the source. They analysed the strain and released the genome under an open licence. Within a week they had a dozen reports from 4 continents. 
This helped to identify the source of the outbreak and ultimately saved lives. <em>(<a href=\"http:\/\/www.nejm.org\/doi\/full\/10.1056\/NEJMoa1107643#t=article\" target=\"_blank\" rel=\"noopener\">Rohde et al 2011<\/a>)<\/em><\/li>\n<li>Crowd-sourcing \u2013 mathematical research is unfathomable to many.\u00a0 Mathematicians are looking for solutions to problems. Working in isolation or small research clusters is the norm, but is it effective?\u00a0 <a href=\"https:\/\/www.dpmms.cam.ac.uk\/~wtg10\/\">Tim Gowers<\/a> (University of Cambridge) decided to break with convention and post the \u201cproblems\u201d he was working on to <a href=\"http:\/\/gowers.wordpress.com\/\" target=\"_blank\" rel=\"noopener\">his blog<\/a>.\u00a0 The result: 32 days \u2013 27 people \u2013 800 substantive contributions. 800 substantive contributions!\u00a0 I am sure that Tim also fostered some new research collaborations from his 27 respondents.<\/li>\n<li>Change the social dynamic of science &#8211; &#8220;We are scientists, you wouldn&#8217;t understand&#8221; is not exactly a helpful stance to adopt. \u00a0&#8220;We are scientists and we need your help,&#8221; now that\u2019s much better! \u00a0The rise of the app has seen a new arm of science emerge, &#8220;citizen science&#8221;. The crowd, or sometimes the informed crowd, is a powerful thing. With a carefully designed app you can collect a lot of data from a lot of places over a short period. 
Projects such as <a href=\"https:\/\/www.ashtag.org\/\">ASHtag<\/a> and <a href=\"http:\/\/leafwatch.naturelocator.org\/\">LeafWatch<\/a> are just two examples where the crowd has been usefully deployed to help collect data for scientists.\u00a0 Actually, this has been going on for some time in different forms. Do you remember the <a href=\"http:\/\/setiathome.berkeley.edu\/\">SETI@Home<\/a> screensaver?\u00a0 It\u2019s still going, with 3 million users worldwide processing data for scientists since 1999.<\/li>\n<li>Openness and transparency \u2013 no one wants another &#8220;<a href=\"http:\/\/www.theguardian.com\/environment\/2010\/jul\/07\/climate-emails-question-answer\" target=\"_blank\" rel=\"noopener\">Climategate<\/a>&#8221;.\u00a0 In fact Climategate need not have happened at all. Much of the data was already publicly available and the scientists had done nothing wrong. Their lack of openness was seen as an admission that they had something to hide and this was used to damaging effect by the climate sceptics.<\/li>\n<li>Fraud \u2013 open data is crucial as it shines a light on science and the scientific technique and helps prevent fraud.<\/li>\n<\/ol>\n<p><strong>What value if not intelligent?<\/strong><\/p>\n<p>However, Boulton&#8217;s closing comments made the point that openness has little value if it is not &#8220;intelligent&#8221;, which means it is:<\/p>\n<ul>\n<li>accessible (can it be found?)<\/li>\n<li>intelligible (can you make sense of it?)<\/li>\n<li>assessable (can you rationally look at the data objectively?)<\/li>\n<li>re-usable (has sufficient metadata to describe how it was created?)<\/li>\n<\/ul>\n<p><span style=\"color: #000000\">I would agree with Boulton&#8217;s criteria but would personally modify the accessible entry. In my opinion data is not open if it is buried in a PDF document. 
OK, I may be able to find it, but getting the data into a usable format still takes considerable effort, and in some cases, skill.\u00a0 The data should be ready to use.<\/span><\/p>\n<p><span style=\"color: #000000\">Of course, not every dataset can be made open.\u00a0 Many contain sensitive data that needs to be guarded as it could perhaps identify an individual. \u00a0There are also considerations to do with safety and security that may prevent data becoming open.\u00a0 In such cases, perhaps the metadata could be open and identify the data custodian.<\/span><\/p>\n<p><strong>Questions and Discussion<\/strong><\/p>\n<p>One of the first questions from the floor focused on the fuzzy boundaries of openness. The questioner was worried that scientists could, and would, hide behind \u201clegitimate commercial interest\u201d since all data had value and research was important within a university&#8217;s business model.\u00a0 Boulton agreed but suggested that the publishers could do more and force authors to make their data open. Since we are, in part, judged by our publication record, we would have to comply and publish our data.\u00a0 Monetising the data would then have to be a separate thing. He alluded to the pharmaceutical industry, long perceived to be driven by money but which has recently moved to be more open.<\/p>\n<p>The second question followed on from this, asking if anything could be learned from the licences used for software such as the <a href=\"http:\/\/www.gnu.org\/copyleft\/gpl.html\">GNU GPL<\/a> and the <a href=\"http:\/\/www.apache.org\/licenses\/\">Apache<\/a> Licence.\u00a0 Boulton stated that the government is currently looking at how to license publicly-funded research. \u00a0What is being considered at the EU level may be slightly regressive and based on EU lobbying from commercial organisations. 
There is a lot going on in this area at the moment so keep your eyes and ears open.<\/p>\n<p>The final point from the session sought clarification of <a href=\"http:\/\/www.ed.ac.uk\/schools-departments\/information-services\/about\/policies-and-regulations\/research-data-policy\/\" target=\"_blank\" rel=\"noopener\">The University of Edinburgh research data management policy<\/a>.\u00a0 Item nine states<\/p>\n<blockquote><p><em>&#8220;Research data of future historical interest, and all research data that represent records of the University, including data that substantiate research findings, will be offered and assessed for deposit and retention in an appropriate national or international data service or domain repository, or a University repository.\u201d<\/em><\/p><\/blockquote>\n<p>But how do we know what is important, or what will be deemed significant in the future? Boulton agreed that this was almost impossible.\u00a0 We cannot archive all data and inevitably some important \u201cstuff\u201d will be lost &#8211; but that has always been the case.<\/p>\n<div id=\"attachment_471\" style=\"width: 477px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/2013_10_24_GeoffreyBoulton_out.png\"><img aria-describedby=\"caption-attachment-471\" decoding=\"async\" loading=\"lazy\" class=\" wp-image-471  \" src=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/2013_10_24_GeoffreyBoulton_out-1024x768.png\" alt=\"View of the audience for Geoffrey Boulton's talk as part of Open Access Week at UoE\" width=\"467\" height=\"350\" srcset=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/2013_10_24_GeoffreyBoulton_out-1024x768.png 1024w, https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/2013_10_24_GeoffreyBoulton_out-300x225.png 300w, https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/2013_10_24_GeoffreyBoulton_out-768x576.png 768w, 
https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/2013_10_24_GeoffreyBoulton_out.png 1200w\" sizes=\"(max-width: 467px) 100vw, 467px\" \/><\/a><p id=\"caption-attachment-471\" class=\"wp-caption-text\">The audience for Geoffrey Boulton&#8217;s talk as part of Open Access Week at UoE<\/p><\/div>\n<p><strong>My Final Thoughts on Geoffrey&#8217;s Talk<\/strong><\/p>\n<p>An interesting talk.\u00a0 There was nothing earth-shattering or new in it, but a good review of the argument for openness in science from someone who actually has the attention of those who need to recognise the importance of the issue and take action on it.\u00a0 But instead of just being a top down talk, there was certainly a bottom up message.\u00a0 Why wait for a mandate from a research council or a university? There are advantages to be had from being open with your data and these benefits are potentially bigger for the early adopters.<\/p>\n<p>I will leave you with an aside from Boulton on libraries&#8230;<\/p>\n<blockquote><p>&#8220;Libraries do the wrong thing, employ the wrong people.\u201d<\/p><\/blockquote>\n<p><em>For good reasons we&#8217;ve been centralising libraries. But perhaps we have to reverse that. 
Publications are increasingly online but soon it will be the data that we seek and tomorrow&#8217;s librarians should be skilled data analysts who understand data and data manipulation.\u00a0<\/em> Discuss.<\/p>\n<p><strong>Some links and further reading:<\/strong><\/p>\n<ul>\n<li><a title=\"Slides\" href=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/files\/2013\/10\/OpenAcecessWeek_Boulton.pdf\" target=\"_blank\" rel=\"noopener\">Slides from the presentation<\/a><\/li>\n<li>Royal Society Report &#8211; <a href=\"http:\/\/royalsociety.org\/policy\/projects\/science-public-enterprise\/report\/\">Science as an open enterprise<\/a><\/li>\n<li>Nature Paper \u2013 <a href=\"http:\/\/www.nature.com\/nature\/journal\/v483\/n7391\/full\/483531a.html\">Nature Volume 483 Issue 7391<\/a><\/li>\n<li>Climategate \u2013 <a href=\"http:\/\/www.theguardian.com\/environment\/2010\/jul\/07\/climate-emails-question-answer\">an overview from the Guardian<\/a><\/li>\n<li>MANTRA \u2013 <a href=\"http:\/\/datalib.edina.ac.uk\/mantra\/\">Research data management best practice<\/a><\/li>\n<li><a href=\"http:\/\/www.ed.ac.uk\/schools-departments\/information-services\/about\/policies-and-regulations\/research-data-policy\">UoE research data management policy<\/a><\/li>\n<li>Video: Geoffrey Boulton on Science as an Open Enterprise &#8211; from <a title=\"Open Access Seminar\" href=\"http:\/\/vimeo.com\/59398605\" target=\"_blank\" rel=\"noopener\">University of Minho Open Access Seminar &amp; OpenAIRE Interoperability Workshop<\/a>, February 2013<\/li>\n<\/ul>\n<p><em>Addy Pope<\/em><\/p>\n<p><em>Research and Geodata team, EDINA<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>As part of Open Access Week, the Data Library and Scholarly Communications teams in IS hosted a lecture by emeritus Professor Geoffrey Boulton\u00a0drawing upon his study for the Royal Society: Science as an Open Enterprise (Boulton, et al 2012). 
The &hellip; <a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/2013\/10\/28\/science-as-an-open-enterprise-prof-geoffrey-boulton\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"advanced_seo_description":"","jetpack_seo_html_title":"","jetpack_seo_noindex":false,"jetpack_post_was_ever_published":false},"categories":[5,9,14,20],"tags":[48,85,87,88],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/posts\/464"}],"collection":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/comments?post=464"}],"version-history":[{"count":0,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/posts\/464\/revisions"}],"wp:attachment":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/media?parent=464"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/categories?post=464"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/tags?post=464"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}