New MOOC! Research Data Management and Sharing

[Guest post from Dr. Helen Tibbo, University of North Carolina-Chapel Hill]

The School of Information and Library Science and the Odum Institute at the University of North Carolina-Chapel Hill and EDINA at the University of Edinburgh are pleased to announce the forthcoming Coursera MOOC (Massive Open Online Course), Research Data Management and Sharing.

CaptureThis is a collaboration of the UNC-CH CRADLE team (Curating Research Assets and Data Using Lifecycle Education) and MANTRA. CRADLE has been funded in part by the Institute of Museum and Library Services to develop training for both researchers and library professionals. MANTRA was designed as a prime resource for postgraduate training in research data management skills and is used by learners worldwide.

The MOOC uses the Coursera on-demand format to provide short, video-based lessons and assessments across a five-week period, but learners can proceed at their own pace. Although no formal credit is assigned for the MOOC, Statements of Accomplishment will be available to any learner who completes a course for a small fee.

The Research Data Management and Sharing MOOC will launch 1st March, 2016, and enrolment is open now. Subjects covered in the 5-week course follow the stages of any research project. They are:

  • Understanding Research Data
  • Data Management Planning
  • Working with Data
  • Sharing Data
  • Archiving Data

Dr. Helen Tibbo from the School of Information and Library Science (SILS) at the University of North Carolina at Chapel Hill delivers four of the five sets of lessons, and Sarah Jones, Digital Curation Centre, delivers the University of Edinburgh-developed content in Week 3 (Working with Data). Quizzes and supplementary videos add to the learning experience, and assignments are peer reviewed by fellow learners, with questions and answers handled by peers and team teachers in the forum.

Staff from both organizations will monitor the learning forums and the peer-reviewed assignments to make sure learners are on the right track, and to watch for adjustments needed in course content.

The course is open to enrolment now, and will ‘go live’ on 1st March.
https://www.coursera.org/learn/research-data-management-and-sharing

Hashtag: #RDMSmooc

A preview of one of the supplementary videos is now available on Youtube:
www.youtube.com/watch?v=yhVqImna7cU

Please join us in this data adventure.
-Helen

Dr. Helen R. Tibbo, Alumni Distinguished Professor
President, 2010-2011 & Fellow, Society of American Archivists
School of Information and Library Science
201 Manning Hall, CB#3360
University of North Carolina at Chapel Hill
Chapel Hill, NC 27599-3360
Tel: 919-962-8063
Fax: 919-962-8071
tibbo@ils.unc.edu

Highlights from the RDM Programme Progress Report: August – October 2015

The RDM Roadmap 2.0 has been completed, approved, and published online and work has started on achieving the deliverables. A copy of the Roadmap is publicly available on the RDM webpages and can be downloaded from http://www.ed.ac.uk/files/atoms/files//uoe-rdm-roadmap_-_v2_0.pdf.

The RDM Services brochure has now been published in both paper and electronic form and is proving very popular with researchers. The electronic version can be downloaded from http://www.ed.ac.uk/files/atoms/files/rdm_service_a5_booklet_0.pdf

Work on DataVault is progressing well and an interim DataVault service is now nearly complete. The Software Sustainability Institute has worked with the DataVault team to road test the interim solution, as a result some optimisations to the process were identified and are being coded up. DataVault user events have been held in both Manchester and Edinburgh, both events were well attended and the general impression of the current DataVault functionality was positive. Further, round three, funding is being sought from Jisc in December to continue this joint development effort.

Jisc has provided funding for up to nine PhD students to be employed one day per week for four months within their school. Their role will be to help researchers within their school record their research data as Datasets in the PURE system, and to direct any RDM or DMP queries to the RDM team for further support. The Dataset records in PURE will provide the Edinburgh University contribution to the national Research Data Discovery Service, this will increase the discoverability of Edinburgh data and ensure that more researchers are meeting the requirements of their research funders to make their data discoverable and reusable. Applications for the first set of three PhD student interns have been received and are currently being shortlisted, the successful applicants should be able to begin work before the end of 2015.

In October some minor questions were received about the DataShare application for Data Seal of Approval (DSA), these were responded to and DataShare has now been approved for the DSA. This is a major achievement for the entire DataShare team who have worked hard to make DataShare a Trusted Digital Repository.

Over the three month period a total of 173 staff and PGR’s have attended a RDM course or workshop, an additional 20-25 staff have attended research committee meetings or small group presentations where RDM has been on the agenda. Both regular and on demand RDM sessions (courses, workshops, & presentations) will continue to be offered and we are currently in the process of scheduling 30 courses, workshops for January to June 2016 as well as a number of presentations.

The “Data Management and Sharing” Coursera MOOC is well under way with a December launch anticipated. Sarah Jones, DCC, is our video instructor, using scripts adapted from MANTRA.

National and International Engagement Activities

10th August meeting in London with other Alan Turing Institute members to discuss RDM requirements to be provided by member institutions.

17th of August a one day RDM event was organised for Danish visitors from the University of Copenhagen to present UoE RDM services, outreach activities and ELNs.

31st August Dealing with Data conference.

7th/8th September meeting with Gottingen University to talk about digital scholarship, including RDM.

7th October DataVault engagement event at Manchester University.

29 October, Educause conference, Indianapolis. Robin Rice was on a panel with Jan Cheetham & Brianna Marshall, University of Wisconsin and Rory Macneil, RSpace: “Drivers and responses toward research data management maturity: transatlantic perspectives.

Kerry Miller

RDM Service Co-Ordinator

Analytics platform trial

Information Services is evaluating a new collaborative platform for data-science and analytics as part of its expanding portfolio of services for researchers. We are looking for researchers with suitable problems who expect to achieve results in the one-year trial. We will be able to work closely with a small number of projects to help them get the most out of the platform, and training will be available. In addition, we encourage further researchers to use the platform with less formal support.

The Aridhia AnalytiXagility Platform

AnalytiXagility is a purpose-built, user-friendly, collaborative platform for data science and analytics. It allows your team to easily create, discuss, modify and share analyses in a single, secure system accessed conveniently through a web browser.
The platform handles routine data management tasks such as confidentiality, availability, integrity and audit, reducing time to insight and discovery. In particular, it is ideally suited for:

  • Exploring, comparing and linking structured datasets including data quality profiling
  • Supporting data management, accountability and provenance
  • Processing large datasets that do not fit in memory

Bring your team

Project members collaborate through a private workspace configured with compute, storage and analytical tools. Embedded social media tools allow teams to post and share questions, updates, comments and insights, building an active record of the research undertaken.

Bring your data

Users import their datasets using the secure and reliable file transfer mechanism, SFTP. Working files (documents, images, analysis scripts) can be uploaded directly through the web interface, and tagged for easy management and retrieval by the team.

Bring your analysis

AnalytiXagility provides an analysis platform, based on R, which can be accessed through a web browser. Combining R with an SQL database and an associated access library allows researchers to analyse their data in a faster and more scalable way than with R alone.

Generate your output

The platform supports generation of PDF reports for communication and publication using LaTeX templates, such as those provided by many leading journals, in which users can embed active analytical scripts to auto-generate images and tabular data within the report at runtime.

More information

If you are interested in participating in the trial, please email IS.Helpline@ed.ac.uk with the subject “XAP Trial”.

Further information can be found at:

Steve Thorn
Research Services
IT Infrastructure

New data analysis and visualisation service

Statistical Analysis without Statistical Software

The Data Library now has an SDA server (Survey Documentation and Analysis), and is ready to load numeric data files for access by either University of Edinburgh users only, or ‘the world’. The University of Edinburgh SDA server is available at: http://stats.datalib.edina.ac.uk/sda/

SDA provides an interactive interface, allowing extensive data analysis with significance tests. It also offers the ability to download user-defined subsets with syntax files for further analysis on your platform of choice.

SDA can be used to teach statistics, in the classroom or via distance-learning, without having to teach syntax. It will support most statistical techniques taught in the first year or two of applied statistics. There is no need for expensive statistical packages, or long learning curves. SDA has been awarded the American Political Science Association Best Instructional Software.

For data producers concerned about disclosure control, SDA provides the capability of defining usage restrictions on a variable-by-variable basis. For example, restrictions on minimum cell sizes (weighted or unweighted), use of particular variables without being collapsed (recoded), or restrictions on particular bi- or multivariate combinations.

For data managers and those concerned about data preservation, SDA can be used to store data files in a generic, non-software dependant format (fixed-field format ASCII), and includes capability of producing the accompanying metadata in the emerging DDI-standard XML format.

Data Library staff can mount data files very quickly if they are well documented with appropriate metadata formats (eg SAS or SPSS), depending on access restrictions appertaining to the datafile. To request a datafile be made available in SDA, contact datalib@ed.ac.uk.

Laine Ruus
EDINA and Data Library