Issue 16114

Update dwca validator to work with checklists

16114
Reporter: mdoering
Assignee: mdoering
Type: NewFeature
Summary: Update dwca validator to work with checklists
Priority: Major
Resolution: Fixed
Status: Closed
Created: 2014-07-14 10:27:13.63
Updated: 2014-09-09 14:24:28.628
Resolved: 2014-09-09 14:24:28.59
        
Description: In the crawler the dwca validator service is specific for occurrences. Checklist datasets are processed, but only verified for occurrences in extensions.

It needs to be updated to also validate checklists. Subtasks to be done:
- update DwcaValidationReport to have 2 distinct booleans to indicate an occurrence and checklist validation result.
- validate uniqueness of taxonID (fall back to core id if no taxonID is mapped). NULL ids are acceptable, but should be reported
- validate referential integrity of acceptedNameUsageID, parentNameUsageID and originalNameUsageID which must point to an existing core record (via taxonID if exists, otherwise coreid)]]>
    


Author: mdoering@gbif.org
Comment: https://github.com/gbif/gbif-api/commit/e5fe9d4a0005b7e2a7e535a0f52f7545cc0e482a
Created: 2014-09-04 17:37:49.865
Updated: 2014-09-04 17:37:49.865


Author: mdoering@gbif.org
Created: 2014-09-05 12:05:08.814
Updated: 2014-09-05 12:05:08.814
        
Basic Validation of unique taxonIDs done. Empty/NULL ids not allowed right now:
https://github.com/gbif/crawler/commit/13c9b3bc5b71748c8fc9543b0a2990e661a56bf4

Further referential integrity is not verified as it will be flagged in processing anyways
    


Author: mdoering@gbif.org
Comment: dwc archives with zero occurrence records validate now too, see https://github.com/gbif/gbif-api/commit/df50c95aadb055c8912c2f596557325fae2addb1
Created: 2014-09-09 14:24:23.679
Updated: 2014-09-09 14:24:23.679