{"id":49,"date":"2015-05-11T19:00:51","date_gmt":"2015-05-11T19:00:51","guid":{"rendered":"http:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/?p=49"},"modified":"2015-05-11T19:00:51","modified_gmt":"2015-05-11T19:00:51","slug":"describing-and-packaging-data","status":"publish","type":"post","link":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/2015\/05\/11\/describing-and-packaging-data\/","title":{"rendered":"Describing and Packaging data"},"content":{"rendered":"<p>The concept of the Data Vault service is to take research data that is no longer being actively used, and to archive it in long-term archival storage. \u00a0In order to facilitate this two processes need to take place as the data is prepared for storage in the vault:<\/p>\n<h2>Description<\/h2>\n<p>Metadata needs to be provided with any package that is being archived. \u00a0This means that the data can be found, understood, and any compliance issues complied with correctly (for example rights or retention). \u00a0Metadata needs to be applied at different levels, for example to the complete vault or container for a project, to deposits made into that vault, and to individual files.<\/p>\n<h2>Packaging<\/h2>\n<p>Rather than copying large structures of files into the archival storage, it has been decided to compile\u00a0them into a single packages. \u00a0This means that only single files need to be stored, and the packages can have extra information included, such as checksums of the files contained in the package and a copy of the metadata. \u00a0<a href=\"http:\/\/en.wikipedia.org\/wiki\/BagIt\">Bagit<\/a> seems to be the obvious choice for this, and there are many <a href=\"http:\/\/en.wikipedia.org\/wiki\/BagIt#Tools\">bagit libraries<\/a> available in different programming languages.<\/p>\n<p>As with the <a href=\"https:\/\/docs.google.com\/document\/d\/1k2XHlNBGR7sM6XBfyICIeGguoJc5uP3JJwJhrtgRvhI\/edit?usp=sharing\" target=\"_blank\">evolving project plan<\/a>, two openly editable documents have been created to discuss these two issues. \u00a0Please contribute if you have thoughts about these two issues!<\/p>\n<ul>\n<li>Metadata investigation:\n<ul>\n<li><a href=\"https:\/\/docs.google.com\/document\/d\/1K8c4Xm8saI4tbHOEKmJu5EbibsK5I44wm_l0UPBWUQU\/edit?usp=sharing\">https:\/\/docs.google.com\/document\/d\/1K8c4Xm8saI4tb&#8230;<\/a><\/li>\n<\/ul>\n<\/li>\n<li>Packaging investigation:\n<ul>\n<li><a href=\"https:\/\/docs.google.com\/document\/d\/1UIXjYVjgAHcDn0mZ1la7_CumlFjZ49ORJQCLWdBKoJQ\/edit?usp=sharing\">https:\/\/docs.google.com\/document\/d\/1UIXjYVjgAHcDn0&#8230;<\/a><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>The concept of the Data Vault service is to take research data that is no longer being actively used, and to archive it in long-term archival storage. \u00a0In order to facilitate this two processes need to take place as the &hellip; <a href=\"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/2015\/05\/11\/describing-and-packaging-data\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":8,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/posts\/49"}],"collection":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/comments?post=49"}],"version-history":[{"count":3,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/posts\/49\/revisions"}],"predecessor-version":[{"id":52,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/posts\/49\/revisions\/52"}],"wp:attachment":[{"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/media?parent=49"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/categories?post=49"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/libraryblogs.is.ed.ac.uk\/jiscdatavault\/wp-json\/wp\/v2\/tags?post=49"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}