Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE

Authors

  • Hilary Jasmin The University of Tennessee Health Science Center, Memphis, Tennessee, United States of America

DOI:

https://doi.org/10.18438/eblip30378

Abstract

A Review of:

Federer, L. M. (2022). Long-term availability of data associated with articles in PLOS ONE. PLOS ONE 17(8), Article e0272845. https://doi.org/10.1371/journal.pone.0272845

Objective – To retrieve a range of PLOS ONE data availability statements and quantify their ability to point to the study data efficiently and accurately. Research questions focused on availability over time, availability of URLs versus DOIs, the ability to locate resources using the data availability statement and availability based on data sharing method.

Design – Observational study.

Setting – PLOS ONE archive.

Subjects – A corpus of 47,593 data availability statements from research articles in PLOS ONE between March 1, 2014, and May 31, 2016.

Methods – Use of custom R scripts to retrieve 47,593 data availability statements; of these, 6,912 (14.5%) contained at least one URL or DOI. Once these links were extracted, R scripts were run to fetch the resources and record HTTP status codes to determine if the resource was discoverable. To address the potential for the DOI or URL to fetch but not actually contain the appropriate data, the researchers selected at random and manually retrieved the data for 350 URLs and 350 DOIs.

Main Results – Of the unique URLs, 75% were able to be automatically retrieved by custom R scripts. In the manual sample of 350 URLs, which was used to test for accuracy of the URLs in containing the data, there was a 78% retrieval rate. Of the unique DOIs, 90% were able to be automatically retrieved by custom R scripts. The manual sample of 350 DOIs had a 98% retrieval rate.

Conclusion – DOIs, especially those linked with a repository, had the highest rate of success in retrieving the data attached to the article. While URLs were better than no link at all, URLs are susceptible to content drift and need more management for long-term data availability.

Downloads

Download data is not yet available.

References

Data Availability. (2019, December 5). PLOS ONE. Retrieved from https://journals.plos.org/plosone/s/data-availability

Federer, L. M. (2022). Long-term availability of data associated with articles in PLOS ONE. PLOS ONE 17(8), Article e0272845. https://doi.org/10.1371/journal.pone.0272845 DOI: https://doi.org/10.1371/journal.pone.0272845

Federer, L. M., Belter, C. W., Joubert, D. J., Livinski, A., Lu, Y-L., Snyders, L. N., & Thompson, H. (2018). Data sharing in PLOS ONE: An analysis of data availability statements. PLOS ONE 13(5), Article e0194768. https://doi.org/10.1371/journal.pone.0194768 DOI: https://doi.org/10.1371/journal.pone.0194768

Glynn, L. (2006). A critical appraisal tool for library and information research. Library Hi Tech, 24(3), 387–399. https://doi.org/10.1108/07378830610692154 DOI: https://doi.org/10.1108/07378830610692154

Downloads

Published

2023-09-24

How to Cite

Jasmin, H. (2023). Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE. Evidence Based Library and Information Practice, 18(3), 72–74. https://doi.org/10.18438/eblip30378

Issue

Section

Evidence Summaries