US Government Data: Lost and Found

Image of rescue tube with floppy desk and words Data Rescue ProjectActions by the current US Trump administration (and others, including Trump’s first term) have spurred archivists, librarians and activists to archive, capture, collect, crawl, hoard, mirror, preserve, rescue, track and save datasets produced at taxpayer expense and until recently made available on government websites.

For example, just as US federal research into climate change, or even mentioning climate, has been paused and government agencies defunded, so the datasets produced from these activities have been removed from public reach or disappeared. The same is true for health data around vaccine research (National Institutes of Health, Centres for Disease Control and Prevention), human subject data deemed to be furthering EDI – equality, diversity, and inclusion – (USAID), and longitudinal educational data measuring attainment and social mobility (Department of Education). In some cases, as on this US government web page from the National Environmental Satellite, Data, and Information Service of the National Oceanic and Atmospheric Administration (NOAA), there are both items that are being decommissioned and archived, and others that are simply being decommissioned and deleted.

Particular challenges to archiving such data are capturing whole databases from scraping techniques, metadata loss, loss of provenance tracking (audit trail of changes), and the inability to add records or collect further data without massive government investment. Also, isolated efforts mean the data cannot easily be discovered.

Fortunately, the Data Rescue Project is coming to the rescue (along with other initiatives). It is a coordinated effort among data organisations and individuals, including librarians and data professionals. It serves as “a clearinghouse for data rescue-related efforts and data access points for public US governmental data that are currently at risk.” The web page provides a host of pointers to current efforts, resources, a tracker tool, and press coverage – including the New Yorker and Le Monde.

Researchers at University of Edinburgh who find that data they require for their research is being removed from publicly available sites may contact the Research Data Support team to discuss potential actions to take.

Robin Rice
Data Librarian and Head of Research Data Support
Library and University Collections

Leave a Reply

Your email address will not be published. Required fields are marked *