Issue 18859

DataCite citation metadata

18859
Reporter: mblissett
Type: Bug
Summary: DataCite citation metadata
Priority: Unassessed
Status: Open
Created: 2016-12-06 21:26:08.388
Updated: 2017-01-05 16:48:47.006
        
Description: A developer from OBIS reported a problem with DataCite metadata. I didn't fully follow (too noisy at lunch), but it could be the "occdownload gbif.org; at the beginning of the citation isn't the preferred citation format.

See http://data.datacite.org/10.15468/DL.3W4SGH

Alternatively, it may be that the citation in the datacite metadata for a dataset http://data.datacite.org/10.15468/LCBRCT is not what we put in the default citation on the dataset page http://www.gbif.org/dataset/107c4e84-79a1-4e3f-b96b-77d87d723628

Edit; nope, it was actually these, that show registry-migration in the citation: http://data.datacite.org/10.15468/6tkudz -- in any case, something that should probably be tidied up.

[~ahahn] was also listening.]]>
    


Author: cgendreau
Created: 2016-12-07 09:15:07.803
Updated: 2016-12-07 09:15:07.803
        
[~kbraak@gbif.org] can confirm but AFAIK the citation is not something we provide, DataCite generates it from the "creator".
We set the wrong creator in the past, it was fixed by this commit: https://github.com/gbif/registry/commit/10c25f1d1094cd359a318f6341f79308f49ed77f

I updated most of them but some have issues. The DOI 10.15468/6tkudz is linked to 2 datasets (this is our mistake when we accidentally harvest the Backbone constituents) so we need to clean the registry database.
    


Author: cgendreau
Created: 2016-12-07 10:02:09.135
Updated: 2016-12-07 10:02:09.135
        
I fixed the registry entry for this dataset and actually this is the real issue:

In the EML document:
http://api.gbif.org/v1/dataset/0938172b-2086-439c-a1dd-c21cb0109ed5/document

There is no  specified, this field is mandatory in the schema[1]. We accept it anyway but in absence of creator we use the user who created the resource (which is registry-migration in that case) to send to DataCite.

[1] http://rs.gbif.org/schema/eml-gbif-profile/1.1/eml-gbif-profile.xsd
    


Author: kbraak@gbif.org
Comment: Indeed you are correct Christian. [DataCite's metadata schema|https://schema.datacite.org/meta/kernel-3.1/doc/DataCite-MetadataKernel_v3.1.pdf] requires a prioritized list of one or more creators. They then use this list to produce the citation in the desired format, see http://search.datacite.org/works?query=10.15468%2Fdl.3w4sgh 
Created: 2016-12-07 12:32:25.616
Updated: 2016-12-07 12:32:25.616


Author: mblissett
Created: 2017-01-05 16:48:47.006
Updated: 2017-01-05 16:48:47.006
        
Comment from Daphnis de Pooter:

I wanted to comment on the Jira issue http://dev.gbif.org/issues/browse/POR-3198

It seems that most of the issue I reported was already addressed in the commit you mentioned.

Looking at another dataset  http://data.datacite.org/10.15468/lcbrct, I think there might be another problem.

The dataCite metadata correctly lists Alain Jaures Gbetoho as the only data creator (as this is the only data creator listed in IPT, if GBIF Benin wants Gangnibo, N.C. & Ganglo, C.J. to appear in the cition as well, these should also be listed as data creators).

However the default citation at http://www.gbif.org/dataset/107c4e84-79a1-4e3f-b96b-77d87d723628 lists

GBIF Benin: Agroforestry plant species found in protected areas and riparian lands in Sudano-guinean and Sudanian agroecological zones. Data mobilized in the framework of BID National project BID-AF2015-0065-NAC and funded by EU. doi:10.15468/lcbrct

Accessed via http://www.gbif.org/dataset/107c4e84-79a1-4e3f-b96b-77d87d723628 on 2016-12-19

Shouldn’t this be the following?

Alain Jaures Gbetoho: Agroforestry plant species found in protected areas and riparian lands in Sudano-guinean and Sudanian agroecological zones. Data mobilized in the framework of BID National project BID-AF2015-0065-NAC and funded by EU. doi:10.15468/lcbrct

Accessed via http://www.gbif.org/dataset/107c4e84-79a1-4e3f-b96b-77d87d723628 on 2016-12-19

Currently you do have a mismatch between your DOI metadata and your generated citation.

I don’t see a default citation at the new cite pages http://demo.gbif.org/dataset/107c4e84-79a1-4e3f-b96b-77d87d723628#citation. So these default citation will be deprecated?

Thnx!