Issue 12985

Checklist dataset with occurrence dwca indexed and in cube

12985
Reporter: mdoering
Assignee: ahahn
Type: Bug
Summary: Checklist dataset with occurrence dwca indexed and in cube
Priority: Major
Resolution: Fixed
Status: Closed
Created: 2013-03-11 17:27:58.506
Updated: 2013-03-14 16:07:06.614
Resolved: 2013-03-14 16:07:06.583
        
Description: http://uat.gbif.org/dataset/86cf5e96-9668-403d-81cc-59eeff861060
is typed as a checklist dataset, but
http://gbrds.gbif.org/browse/agent?uuid=86cf5e96-9668-403d-81cc-59eeff861060
says it is an occurrence DwC-A.
The cube also has occurrences for it:
http://staging.gbif.org:8080/metrics-ws/occurrence/count?datasetKey=86cf5e96-9668-403d-81cc-59eeff861060

http://uat.gbif.org/portal/occurrence/search?DATASET_KEY=86cf5e96-9668-403d-81cc-59eeff861060
The occurrences are in SOLR too, and therefore in HBase, meaning that at some point this dataset had occurrences.

So we have finally hit a real inconsistency: a dataset with an occurrence DwC-A whose dataset type is checklist.

    
Attachment dwca-wii_herpetological.zip


Author: trobertson@gbif.org
Created: 2013-03-11 17:30:07.272
Updated: 2013-03-11 17:30:07.272
        
The source IPT [1] says it is an occurrence dataset with 274 records, so it looks like HBase, the cube, and SOLR might be correct.

The metadata confirms it is an occurrence dataset, opening with "This dataset contains the records and associated media (pictures) of 274 herpetological specimens".

[1] http://ibif.gov.in:8080/ipt/

    


Author: trobertson@gbif.org
Comment: The actual archive (in case it goes offline again)
Created: 2013-03-11 17:31:00.528
Updated: 2013-03-11 17:31:00.528


Author: ahahn@gbif.org
Comment: Historical: the dataset was first (erroneously) registered as a checklist. This is held as the visible "DWC-ARCHIVE-CHECKLIST" type of the endpoint and, derived from that, in the value of agent.category (enum: http://staging.gbif.org:8080/enum-web/enum?id=org.gbif.api.vocabulary.DatasetType). When this was corrected on the publisher side, the visible value changed to "DWC-ARCHIVE-OCCURRENCES", but the category value did not follow, causing the inconsistency. This workflow should already be corrected for similar cases in the future (to be checked with JC). The agent.category has now been updated to 18010 in the live registry.
Created: 2013-03-14 13:57:21.059
Updated: 2013-03-14 13:58:54.298
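
The mismatch described in the comment above could be caught with a check along the following lines. This is only a minimal sketch, not the registry's actual code: DatasetRecord, ENDPOINT_TO_DATASET_TYPE and find_inconsistencies are hypothetical names, the dataset type strings "CHECKLIST" and "OCCURRENCE" are assumed stand-ins for the DatasetType enum values, and no registry or metrics service is called. The numeric agent.category codes are not modelled, since only 18010 is mentioned in this issue.

```python
from dataclasses import dataclass

# Assumed mapping from the visible endpoint type (strings taken from the
# comment above) to the dataset type the category is expected to carry.
ENDPOINT_TO_DATASET_TYPE = {
    "DWC-ARCHIVE-CHECKLIST": "CHECKLIST",
    "DWC-ARCHIVE-OCCURRENCES": "OCCURRENCE",
}


@dataclass
class DatasetRecord:
    """Hypothetical snapshot of what the registry and the cube report."""
    key: str
    endpoint_type: str      # visible endpoint type
    category: str           # dataset type derived from the endpoint type
    occurrence_count: int   # count reported for this dataset key


def find_inconsistencies(record: DatasetRecord) -> list[str]:
    """Return a human-readable message for every mismatch found."""
    problems = []

    # The category should follow the endpoint type when it changes.
    expected = ENDPOINT_TO_DATASET_TYPE.get(record.endpoint_type)
    if expected is not None and expected != record.category:
        problems.append(
            f"{record.key}: endpoint type {record.endpoint_type} implies "
            f"{expected}, but category is {record.category}"
        )

    # A checklist-typed dataset should not have indexed occurrences.
    if record.category == "CHECKLIST" and record.occurrence_count > 0:
        problems.append(
            f"{record.key}: category is CHECKLIST but "
            f"{record.occurrence_count} occurrences are indexed"
        )

    return problems


# State reported in this issue: endpoint corrected, category left behind.
before_fix = DatasetRecord(
    key="86cf5e96-9668-403d-81cc-59eeff861060",
    endpoint_type="DWC-ARCHIVE-OCCURRENCES",
    category="CHECKLIST",
    occurrence_count=274,
)
for message in find_inconsistencies(before_fix):
    print(message)
```

For the state reported here the sketch flags both problems; once the category follows the endpoint type, the same record yields no messages.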


Author: ahahn@gbif.org
Comment: to do: check with JC
Created: 2013-03-14 14:00:45.584
Updated: 2013-03-14 14:00:45.584