Quicker, easier interface for DataVault

We are delighted to report that Edinburgh DataVault now has a quicker and easier process for users. The team have been working hard to overhaul the form for creating a new ‘vault’ to improve the user experience. The changes allow us to gather all the information we need from the user directly through a DataVault web page. The users no longer need to first login to Pure and create a separate dataset record there. Instead, that bit will be automated for them.

I have explained the new process and walked users through the new form in a new and updated how-to video, now combining the getting started information with the demo of how to create a vault:

Get started and create your vault! (8 mins)

The new streamlined process for users is represented and compared to our open research data repository DataShare in this workflow diagram. DataVault is designed for restricted access, but can also handle far larger datasets than DataShare.

The diagram shows the steps users go through in DataShare and DataVault. Common steps are deposit and approval.

The DataVault process includes the gathering of funding information, and review and deletion none of which are present in the DataShare workflow since they would not be relevant to that open research data repository.

The arrow showing DataVault metadata going to the internet represents the copying of selected metadata fields into Pure, where they are accessible as dataset records in the university’s Edinburgh Research Explorer online portal.

Our new course “Archiving Your Research Data”, featuring Sara Thomson, Digital Archivist, provides an introduction to digital preservation for researchers, combined with practical support on how to put digital preservation into practice using the support and systems available here at University of Edinburgh such as the DataVault. For future dates and registration information please see our Workshops page.

A recording of an earlier workshop (before the new interface was released) is also available: Archiving Your Research Data Part 1: Long-term Preservation.

If you are a University of Edinburgh principal investigator, academic, or support professional interested in using the Edinburgh DataVault, please get in touch by emailing data-support@ed.ac.uk.

Pauline Ward
Research Data Support Assistant
Library and University Collections
University of Edinburgh

It Is Our Mantra

Last week I was honoured to accept an invitation to speak at the Library Technology Conclave at Somaiya Vidyavihar University in Mumbai, India, organised by Informatics Limited and the University. A prelude the day before the event included a half-day “Research Data Management (RDM) Basics” tutorial for about 50 attending librarians, which I delivered based on adaptations of our Research Data Support team’s training materials for PhD students and staff. The training exercises, developed from a few other external librarian training sessions I’ve done, focused on building librarians’ confidence in supporting researchers with data management planning and data sharing. Doing the training in person helped me to overcome communication barriers and foster deeper engagement than could have happened online only.

Lighting the flame of the conference

Lighting the flame of the hybrid conference

The conference was on the theme of “Research Data Management and Stewardship: Building Blocks for Open Science,” with a number of eminent librarians, scientists, and educators speaking in keynotes and on panels in six thematic sessions, in-person and remotely. There was a palpable sense of urgency to the proceedings, as those in the room were concerned that India’s scientific institutions, without funder mandates, national open infrastructure, nor observable changes in cultural norms for RDM and Open Science, might be left behind, given this emerging new, more transparent way of conducting research. Questions focused not on the What or Why of Open Science, but how to instigate behavioural change of scientists and researchers, and how librarians could create demand for new services such as data repositories and quickly skill themselves up.

I have some empathy for their position. A decade or so ago I attended conferences which felt more like hand-wringing than change-making, with endless talk of carrots and sticks (and carrot-stick jokes), with researchers explaining over and again their reluctance to be ‘scooped’ by giving access to their data. I am not sure what caused the tipping point to talking about the potential of data sharing and open science to the exciting reality of it happening, but it seems to have come round (more or less). I do still harbour concerns that our own researchers will be left out of participation in the shared infrastructure that is the European Open Science Cloud because of Brexit-related barriers here.

Robin with attendeesOne talk that piqued my interest involved a survey of librarians in Gujarat about RDM and their capacity to deliver new types of service, by Dr. Bhakti Gala. As the Indian LIS (library and information science) curriculum was apparently seen to not be delivering RDM training to any great extent yet, the researcher had asked how the librarians had acquired knowledge of RDM. She said that about half the librarians who had pursued self-training had learned from the free, online MANTRA course (which stands for Research Data Management Training), offered by the University of Edinburgh.

The Chair of the panel, Prof Shalini Urs, with whom I had had a conversation over dinner with about the name of the course, said [naming me, as I sat in the audience] that I would be happy to hear that was the case, to which I of course smiled and nodded. Alluding to our prior conversation about whether the name was a cultural [mis-]appropriation or not, she looked me in the eye and said, “It is Our MANTRA, now.” Which is, of course, the great thing about Openness.

University of Edinburgh’s new Research Data Management Policy

Following a year-long consultation with research committees and other stakeholders, a new RDM Policy (www.ed.ac.uk/is/research-data-policy) has replaced the landmark 2011 policy, authored by former Digital Curation Centre Director, Chris Rusbridge, which seemed to mark a first for UK universities at the time. The original policy (doi: 10.7488/era/1524) was so novel it was labeled ‘aspirational’ by those who passed it.

"Policy"

CC-BY-SA-2.0, Sustainable Economies Law Centre, flickr

RDM has come a long way since then, as has the University Research Data Service which supports the policy and the research community. Expectation of a data management plan to accompany a research proposal has become much more ordinary, and the importance of data sharing has also become more accepted in that time, with funders’ policies becoming more harmonised (witness UKRI’s 2016 Concordat on Open Research Data).

What has changed?

Although a bit longer (the first policy was ten bullet points and could fit on a single page!), the new policy adds clarity about the University’s expectations of researchers (both staff and students), adds important concepts such as making data FAIR (explanation below) and grounding concepts in other key University commitments and policies such as research integrity, data protection, and information security (with references included at the end). Software code, so important for research reproducibility, is included explicitly.

CC BY 2.0, Big Data Prob, KamiPhuc on flickr

Definitions of research data and research data management are included, as well as specific references to some of the service components that can help – DMPOnline, DataShare, etc. A commitment to review the policy every 5 years, or sooner if needed, is stated, so another ten years doesn’t fly by unnoticed. Important policy references are provided with links. The policy has graduated from aspirational – the word “must” occurs twelve times, and “should” fifteen times. Yet academic freedom and researcher choice remains a basic principle.

Key messages

In terms of responsibilities, there are 3 named entities:

  • The Principle Investigator retains accountability, and is responsible as data owner (and data controller when personal data are collected) on behalf of the University. Responsibility may be delegated to a member of a project team.
  • Students should adhere to the policy/good practice in collecting their own data. When not working with data on behalf of a PI, individual students are the data owner and data controller of their work.
  • The University is responsible for raising awareness of good practice, provision of useful platforms, guidance, and services in support of current and future access.

Data management plans are required:

  • Researchers must create a data management plan (DMP) if any research data are to be collected or used.
  • Plans should cover data types and volume, capture, storage, integrity, confidentiality, retention and destruction, sharing and deposit.
  • Research data management plans must specify how and when research data will be made available for access and reuse.
  • Additionally, a Data Protection Impact Assessment is required whenever data pertaining to individuals is used.
  • Costs such as extra storage, long-term retention, or data management effort must be addressed in research proposals (so as to be recovered from funders where eligible).
  • A University subscription to the DMPOnline tool guides researchers in creating plans, with funder and University templates and guidance; users may request assistance in writing or reviewing a plan from the Research Data Service.

FAIR data sharing is more nuanced than ‘open data’:

  • Publicly funded research data should be made openly available as soon as possible with as few restrictions as necessary.
  • Principal Investigators and research students should consider how they can best make their data FAIR in their Data Management Plans (findable, accessible, interoperable, reusable).
  • Links to relevant publications, people, projects, and other research products such as software or source code should be provided in metadata records, with persistent identifiers when available.
  • Discoverability and access by machines is considered as important as access by humans. Standard open licences should be applied to data and code deposits.

Use data repositories to achieve FAIR data:

  • Research data must be offered for deposit and retention in a national or international data service or domain repository, or a University repository (see next bullet).
  • PIs may deposit their data for open access for all (with or without a time-limited embargo) in Edinburgh DataShare, a University data repository; or DataVault, a restricted access long-term retention solution.
  • Research students may deposit a copy of their (anonymised) data in Edinburgh DataShare while retaining ownership.
  • Researchers should add a dataset metadata record in Pure to data archived elsewhere, and link it to other research outputs.
  • Software code relevant to research findings may be deposited in code repositories such as Gitlab or Github (cloud).

Consider rights in research data:

  • Researchers should consider the rights of human subjects, as well as citizen scientists and the public to have access to their data, as well as external collaborators.
  • When open access to datasets is not legal or ethical (e.g. sensitive data), information governance and restrictions on access and use must be applied as necessary.
  • The University’s Research Office can assist with providing templates for both incoming and outgoing research data and the drafting and negotiation of data sharing agreements.
  • Exclusive rights to reuse or publish research data must not be passed to commercial publishers.

Robin Rice
Data Librarian and Head, Research Data Support
Library & University Collections

Data Mindfulness training integrated in new resources

The Research Data Service is pleased to announce an update to our ‘Data Mindfulness: Making the Most of Your Dissertation Data’ training materials.

Originally developed to provide face to face research data management (RDM) training for undergraduate students undertaking a dissertation project, the newly revised course is now available as one of ten units within the Library’s new LibSmart II training course.

‘Data Mindfulness: Your Dissertation Data‘ combines videos, reading material, and short interactive exercises to help students think about data management issues as they prepare to undertake a research project, potentially for the first time.

The course is designed to follow the research journey from beginning to end, from developing a research question and conducting a literature search, through to generating and managing project data and files during the life of the project and beyond.

The ‘Data Mindfulness’ unit provides an approachable introduction to the subject of RDM, with up-to-date and relevant information and guidance for undergraduate and masters students. The updated content also includes expanded material on finding and accessing secondary data sources, as well as links to wider training and resources provided by the Library.

You can find more information about the new LibSmart II course, and how to enrol here: https://www.ed.ac.uk/information-services/help-consultancy/rm-and-consultancy/academic-support-librarians/libsmart.

In addition to LibSmart II, we are also pleased to be working in conjunction with the Research Training Centre, based in the School of Social and Political Science, to deliver an updated version of the ‘Data Mindfulness’ course as part of the Micro-Methods Workshop series. You can find details of the Micro-Methods Workshops series here: https://research-training-centre.sps.ed.ac.uk/micro-methods.

Finally, we have made the ‘Data Mindfulness’ training materials available for re-use under an open CC-BY license, and you can find links to the videos and download a PDF of the revised ‘Data Mindfulness’ course handbook from the Research Data Service site (a Word version of the handbook is available on request): https://www.ed.ac.uk/information-services/research-support/research-data-service/training.

We hope these ‘Data Mindfulness’ materials are useful and relevant and appreciate any comments or feedback that you may have at data-support@ed.ac.uk.

Bob Sanders
Research Data Support Assistant