Issue 14385

Tools support data citation needs of publishers and data users

14385
Reporter: ahahn
Assignee: trobertson
Type: Epic
Summary: Tools support data citation needs of publishers and data users
Priority: Blocker
Status: ToDo
Created: 2013-11-19 16:06:41.302
Updated: 2015-12-14 17:52:38.87
DueDate: 2014-09-30 00:00:00.0
        
Description: Consistent tools to access a resolvable citation text for all data sets and all data downloads. Update the portal and the IPT to support consistent data citation. Building on the introduction of DOIs as stable identifiers for all datasets, the basis is set to make sure that any data download can come with a citation file that persistently references and credits all contributing datasets, and that itself can be persisted and cited through a DOI, so that a user of data can reference a single DOI that will resolve into a full source citation.

*Portal and IPT updated to support data citation: milestone Sep 2014*

*Rationale*
Lack of stable identifiers for datasets and data records currently makes it difficult to cite data used in an analysis or to give credit to the data publishers, as it requires a literature-style citation naming all contributing datasets and in practice resulting in page-long citation texts. It is, however, good scientific practice as well as a requirement stated in GBIF's data use agreement to accurately cite the used resources. The contribution of individual datasets to such publications should be traceable for evaluation, so that the owners can follow and demonstrate the use made of their data. At the same time, a publication author needs an adequate procedure and tools to fulfil this requirement with a reasonable amount of effort.

*Scenarios*
1 - publisher: citation-style selection during publication. IPT to provide alternative of a free text block or selection of a default format for the citation
2 - user: citation for a data download
3 - user: citation for all data included in a map
4 - user:  citation for a modified download (i.e. only part of the download is used and needs to be cited). Proposal: service allows to submit dataset DOIs and generates/persists the citation for this set. NB: not at the level of occurrenceIDs
5 - publisher: usage tracking for publishers. Proposed strategy: piggy-back on DOI graph evaluations that other projects/DOI groups implement; use this to display portal metrics

*Required components*
[- policy on occurrence ids as stable identifiers for data records (milestone Mar 2014) -> not required here as only DOIs are concerned]
- process definition for the generation of data citations
- IPT supporting data citation (milestone Sept 2014)
- portal supporting data citation (milestone Sept 2014)

*Related*
- data citation working groups; first meeting Mar 2014. Feedback on submitted use case needs to be integrated into the work plan
- data citation working groups, second meeting on data flows: check whether further input is expected]]>