Repository Approaches to Improving the Quality of Shared Data and Code
Citations Over TimeTop 10% of 2021 papers
Abstract
Sharing data and code for reuse has become increasingly important in scientific work over the past decade. However, in practice, shared data and code may be unusable, or published results obtained from them may be irreproducible. Data repository features and services contribute significantly to the quality, longevity, and reusability of datasets. This paper presents a combination of original and secondary data analysis studies focusing on computational reproducibility, data curation, and gamified design elements that can be employed to indicate and improve the quality of shared data and code. The findings of these studies are sorted into three approaches that can be valuable to data repositories, archives, and other research dissemination platforms.
Related Papers
- → Identifying the Reusable Components from Component-Based System: Proposed Metrics and Model(2019)26 cited
- → Reusability Metrics for Program Source Code Written in C Language and Their Evaluation(2012)10 cited
- → On the reusability of samples in active learning(2022)
- → HOW NOT TO DATA ARCHIVE IN 10 WAYS(2023)