Issue 17653

DwC archives block subsequent crawls

17653
Reporter: jlegind
Type: Bug
Summary: DwC archives block subsequent crawls
Priority: Critical
Resolution: Fixed
Status: Closed
Created: 2015-06-30 15:03:08.891
Updated: 2018-05-31 16:28:01.915
Resolved: 2018-05-31 16:28:01.861
        
Description: There are at least one IPT dataset that updates daily and it seems that there are some DwC datasets that block subsequent crawls.
This is one of the daily updaters:
http://b6g8.gbif.org:5601/index.html#eyJzZWFyY2giOiJAZmllbGRzLmRhdGFzZXRLZXk9XCJlMjQyZjU5OC02OWYzLTRkYmYtYjRmNi1kMmExYzVmNjk4NmJcIiIsImZpZWxkcyI6WyJAdHlwZSIsIkBmaWVsZHMubGV2ZWwiLCJAbWVzc2FnZSJdLCJvZmZzZXQiOjAsInRpbWVmcmFtZSI6Ijg2NDAwIiwiZ3JhcGhtb2RlIjoiY291bnQiLCJ0aW1lIjp7InVzZXJfaW50ZXJ2YWwiOjB9fQ==

The dataset contains some species NULL values. (Not sure of the impact of this)

DINA http://www.gbif.org/dataset/e242f598-69f3-4dbf-b4f6-d2a1c5f6986b ]]>
    


Author: mblissett
Created: 2018-05-31 16:28:01.894
Updated: 2018-05-31 16:28:01.894
        
It's deleted, and the logs are long gone, I'm not sure if this is still an issue.

Please open in Github if so.