Verifying dataset accessibility in repositories like Figshare, Zenodo, and OSF is manual and inefficient. To address this, we introduce SciCiteCheck, an automated tool leveraging APIs such as re3data, which will be released under the AGPL 3.0 license.
Paper Abstract
In academic manuscripts, authors sometimes cite datasets stored in repositories such as Figshare, Dataverse, Zenodo, and the Open Science Framework (OSF); however, no efficient method exists to verify that these repositories contain the cited data beyond manual inspection.[https://oakland.libguides.com/c.php?g=1404215&p=10392907]. To address this issue, we have developed SciCiteCheck, a tool that checks dataset citations for accessibility status.
With SciCiteCheck, researchers can automatically query cited datasets and determine their accessibility status—whether they are open, under controlled access, or subject to embargo. This tool makes use of publicly available APIs from major data repositories like re3data, which is the most comprehensive source of reference for research data infrastructures [https://medium.com/towards-data-science/data-repositories-for-almost-every-type-of-data-science-project-7aa2f98128b].
SciCiteCheck offers a functional back-end and a friendly front-end interface to facilitate easier verification of dataset citations. Preliminary results suggest that SciCiteCheck helps researchers efficiently verify dataset citations, check accessibility status, and ensure proper referencing of datasets in their work. The tool will be released under the AGPL 3.0 license, ensuring open access and transparency.
Accepted Poster
Paper Short Abstract
Paper Abstract
In academic manuscripts, authors sometimes cite datasets stored in repositories such as Figshare, Dataverse, Zenodo, and the Open Science Framework (OSF); however, no efficient method exists to verify that these repositories contain the cited data beyond manual inspection.[https://oakland.libguides.com/c.php?g=1404215&p=10392907]. To address this issue, we have developed SciCiteCheck, a tool that checks dataset citations for accessibility status.
With SciCiteCheck, researchers can automatically query cited datasets and determine their accessibility status—whether they are open, under controlled access, or subject to embargo. This tool makes use of publicly available APIs from major data repositories like re3data, which is the most comprehensive source of reference for research data infrastructures [https://medium.com/towards-data-science/data-repositories-for-almost-every-type-of-data-science-project-7aa2f98128b].
SciCiteCheck offers a functional back-end and a friendly front-end interface to facilitate easier verification of dataset citations. Preliminary results suggest that SciCiteCheck helps researchers efficiently verify dataset citations, check accessibility status, and ensure proper referencing of datasets in their work. The tool will be released under the AGPL 3.0 license, ensuring open access and transparency.
Poster session
Session 1 Tuesday 1 July, 2025, -