<br />
<b>Warning</b>:  Undefined array key "file" in <b>/apps/www/wordpress/blogs/wp-includes/media.php</b> on line <b>1686</b><br />
<br />
<b>Warning</b>:  Undefined array key "file" in <b>/apps/www/wordpress/blogs/wp-includes/media.php</b> on line <b>1686</b><br />
{"id":782,"date":"2013-12-12T17:52:08","date_gmt":"2013-12-12T17:52:08","guid":{"rendered":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/?p=782"},"modified":"2013-12-12T17:52:08","modified_gmt":"2013-12-12T17:52:08","slug":"thinking-about-research-data-asset-registers","status":"publish","type":"post","link":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/2013\/12\/12\/thinking-about-research-data-asset-registers\/","title":{"rendered":"Thinking about Research Data Asset Registers"},"content":{"rendered":"<p>[Reposted from\u00a0<a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/blog\/2013\/12\/12\/thinking-about-research-data-asset-registers\/\">https:\/\/libraryblogs.is.ed.ac.uk\/blog\/2013\/12\/12\/thinking-about-research-data-asset-registers\/<\/a>]<\/p>\n<p>In my last blog post, I looked at the <a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/2013\/12\/06\/the-four-quadrants-of-research-data-curation-systems\/\">four quadrants of research data curation systems<\/a>.\u00a0 This categorised systems that manage or describe research data assets by whether their primary role is to store metadata or data, and whether the information is for private or public use.\u00a0 Four systems were then put into these quadrants.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-1853\" alt=\"quadrants3\" src=\"https:\/\/libraryblogs.is.ed.ac.uk\/wp-content\/uploads\/2013\/12\/quadrants3.png\" width=\"480\" height=\"360\" \/><\/p>\n<p>The University of Edinburgh already has two active services from this diagram: <a href=\"http:\/\/www.ed.ac.uk\/schools-departments\/governance-strategic-planning\/research\/pure\/home\">PURE<\/a>, our Current Research Information System and <a href=\"http:\/\/datashare.is.ed.ac.uk\/\">DataShare<\/a>, our open data repository.<\/p>\n<p>This blog post will start to unpack some of the requirements for a Data Asset Register.<\/p>\n<p>The first aspect to cover is its name.\u00a0 What should it be called?\u00a0 Traditionally systems like this, which only hold metadata records that either just describe, or describe and point to other resources, are known as registers, catalogues, directories, indexes, or inventories.<\/p>\n<p>The University already has a \u2018<a href=\"http:\/\/www.ed.ac.uk\/schools-departments\/information-services\/research-support\/data-library\/finding-data\">Data Catalogue<\/a>\u2019, maintained by the Data Library.\u00a0 However this list has a different purpose, to hold details of external data.\u00a0 Oxford University, instead of opting for a name such as this, have instead opted to call their service by the verb \u2018find\u2019 \u2013 <a href=\"https:\/\/github.com\/bhavanaananda\/datafinder\">DataFinder<\/a>.\u00a0 Whilst there may be some brand or service name applied to the system we create at the University of Edinburgh, for now its working title is \u2018Data Asset Register\u2019 as one of its main functions will be to allow data creators to \u2018register\u2019 their data assets by describing them, and if the data is published online to link to the data.<\/p>\n<p>But what should the Data Asset Register provide?\u00a0 The following diagram shows some early thoughts:<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-1947\" alt=\"dar-thoughts\" src=\"https:\/\/libraryblogs.is.ed.ac.uk\/wp-content\/uploads\/2013\/12\/dar-thoughts.png\" width=\"480\" height=\"360\" \/><\/p>\n<p>The diagram splits this up into three broad areas:<\/p>\n<ul>\n<li>Description \u2013 what the asset register should describe<\/li>\n<li>Functions \u2013 the functions needed to allow data asset description<\/li>\n<li>Services \u2013 the value-added services that will add benefit to people who register their data<\/li>\n<\/ul>\n<h2>Description<\/h2>\n<p>The core purpose of the system is to describe data.\u00a0 This is split into two categories: being able to describe single items or data assets, and describing collections of data assets.\u00a0 Many data assets are created on their own, for example a population health longitudinal study.\u00a0 As such, this should be described on its own.\u00a0 In contrast, some data are created in large sets, where it isn\u2019t necessarily useful to describe every part of that set on its own.\u00a0 In this case, the collection as a whole can be described. A good example of this is the <a href=\"http:\/\/researchdata.ands.org.au\/search\/#!\/q=\/class=collection\">Research Data Australia<\/a> service from the <a href=\"http:\/\/www.ands.org.au\/\">Australian National Data Service<\/a>.<\/p>\n<p>We\u2019ll need to decide how to describe the data.\u00a0 A likely initial candidate will be the <a href=\"http:\/\/schema.datacite.org\/\">DataCite Metadata Schema<\/a>, but we may find this needs to be extended to cover extra elements relevant to the University or the discipline of the data asset being described.\u00a0 There will also be requirements coming from a possible <a href=\"http:\/\/www.dcc.ac.uk\/blog\/scoping-approaches-uk-research-data-registry\">UK research data registry<\/a>, development of which is being led by the <a href=\"http:\/\/www.dcc.ac.uk\/\">Digital Curation Centre<\/a>.<\/p>\n<h2>Functions<\/h2>\n<p>In order to enable data asset description, a register will need certain functions.\u00a0 So far three have been identified:<\/p>\n<ol>\n<li>CRUD: Create \/ Read \/ Update \/ Delete are the basic functions required when manipulating data.\u00a0 The system should allow records of research data to be created, read later, updated, and if needed, deleted.<\/li>\n<li>User Interface (UI): In order to enable CRUD functionality, a user interface will be required.\u00a0 To be useful, this will need to provide search and display functionality, for example using faceted search and browse.<\/li>\n<li>Log: Some funders have requirements to keep data for certain lengths of time, or for periods of time that must be reset each time a data set is accessed.\u00a0 For this reason each access of a data asset must be logged by the system.\u00a0 An example is from the <a href=\"http:\/\/www.epsrc.ac.uk\/about\/standards\/researchdata\/Pages\/expectations.aspx\">EPSRC<\/a>:<\/li>\n<\/ol>\n<blockquote><p>\u201cResearch organisations will ensure that EPSRC-funded research data is securely preserved for a minimum of 10-years from the date that any researcher \u2018privileged access\u2019 period expires or, if others have accessed the data, from last date on which access to the data was requested by a third party;\u201d<\/p><\/blockquote>\n<p>It may also be that the Data Asset Register can be a front-end for our Data Vault too \u2013 <a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/2013\/12\/20\/thinking-about-a-data-vault\/\">more about that in another blog post<\/a>!<\/p>\n<h2>Services<\/h2>\n<p>Extra value-added services are required in order to make the Data Asset Register useful to people.\u00a0 Our initial thoughts about these services include the following:<\/p>\n<ul>\n<li>Identify: The ability to assign identifiers to data assets.\u00a0 Some of these identifiers will need to be persistent.\n<ul>\n<li>DOI: <a href=\"http:\/\/www.datacite.org\/\">DataCite DOIs<\/a> allow DOIs to be assigned to data assets, in the same way that DOIs are assigned to journal articles.\u00a0 This allows them to be persistently identified over time even if they move between systems, but also allow them to be cited using a well-known identifier.<\/li>\n<li>TinyURL: A short URL such as those provided by <a href=\"http:\/\/tinyurl.com\/\">TinyURL<\/a> or <a href=\"https:\/\/bitly.com\/\">bitly<\/a> are useful to give easy web identifiers to objects.\u00a0 For example it might be nice to be able to issue URLs such as http:\/\/data.ed.ac.uk\/abcd.<\/li>\n<li>Other: Are there any other identifier systems that we should consider using?<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<ul>\n<li>Discover: It is important that the data records held in the Data Asset Register are searchable and can be indexed by external services.\u00a0 This may be by national, international, or discipline-based data aggregators, or by normal web search engines.<\/li>\n<\/ul>\n<ul>\n<li>Share: Whilst often the data assets will be described online but kept offline by the researcher, they may wish to share the data.\u00a0 The Data Asset Register may need to facilitate this in a number of ways:\n<ul>\n<li>Deposit:\u00a0 If the data is held in the Data Vault, along with a description in the Data Asset Register, then using a deposit protocol such as <a href=\"http:\/\/swordapp.org\/\">SWORD<\/a> it would be possible to deposit the data into the institutional data repository, or into an external repository.\u00a0 The Data Asset Register can then record the identifier for the hosted data set.<\/li>\n<li>Redirect:\u00a0 Where the data is hosted online elsewhere, the Data Asset Register could automatically redirect users.\u00a0 For example visiting http:\/\/data.ed.ac.uk\/abcd could redirect a visitor directly to the repository, rather than showing them just the data asset record description.\u00a0 If the data is not shared openly, then contact details can be provided of the data owner.<\/li>\n<li>RCUK: Some funders, such as the RCUK members (Research Councils UK) require funded journal papers to include \u201ca statement on how the underlying research materials \u2013 such as data, samples or models \u2013 can be accessed\u201d.\u00a0 The data asset register could facilitate this by automatically writing statements such as \u201cDetails about accessing the data referenced in this paper may be found at http:\/\/data.ed.ac.uk\/abcd\u201d<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>It is very early days in our thinking about what features a Data Asset Register should offer, and like many components of a modern research data management infrastructure, there are very few existing examples to look at.\u00a0 Our thoughts will be refined over the coming months so that we can start looking at implementation options. \u00a0Is there an existing system that can do all of this for us, or is it better to build something new, either alone or with collaborators?<\/p>\n<p>Images available from\u00a0<a href=\"http:\/\/dx.doi.org\/10.6084\/m9.figshare.873617\">http:\/\/dx.doi.org\/10.6084\/m9.figshare.873617<\/a><\/p>\n<p>Stuart Lewis, Head of Research and Learning Services, Library &amp; University Collections.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>[Reposted from\u00a0https:\/\/libraryblogs.is.ed.ac.uk\/blog\/2013\/12\/12\/thinking-about-research-data-asset-registers\/] In my last blog post, I looked at the four quadrants of research data curation systems.\u00a0 This categorised systems that manage or describe research data assets by whether their primary role is to store metadata or data, and &hellip; <a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/2013\/12\/12\/thinking-about-research-data-asset-registers\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":175,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"advanced_seo_description":"","jetpack_seo_html_title":"","jetpack_seo_noindex":false,"jetpack_post_was_ever_published":false},"categories":[10,24],"tags":[40,77,81,85,92],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/posts\/782"}],"collection":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/users\/175"}],"replies":[{"embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/comments?post=782"}],"version-history":[{"count":0,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/posts\/782\/revisions"}],"wp:attachment":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/media?parent=782"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/categories?post=782"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/datablog\/wp-json\/wp\/v2\/tags?post=782"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}