Issue 16115

Create ChecklistNormalizer

16115
Reporter: mdoering
Assignee: mdoering
Type: NewFeature
Summary: Create ChecklistNormalizer
Priority: Major
Resolution: Fixed
Status: Resolved
Created: 2014-07-14 10:33:22.393
Updated: 2014-09-17 11:59:15.175
Resolved: 2014-08-01 15:58:35.819
        
Description: see https://github.com/gbif/checklistbank/blob/master/INDEXING.md#checklistnormalizer

listens to DwcaMetasyncFinishedMessage and filters relevant datasets by their DatasetType and validation result. After normalization is completed it returns a ChecklistNormalizedMessage with the relative location of the neo db directory and some stats:
 - record counts by ORIGIN: https://github.com/gbif/gbif-api/blob/master/src/main/java/org/gbif/api/vocabulary/Origin.java
 - record counts by KINGDOM
 - record counts by NAME_TYPE
]]>
    


Author: mdoering@gbif.org
Comment: Done, but specific normalization logic missing which are tracked by individual jiras
Created: 2014-08-01 15:58:35.848
Updated: 2014-08-01 15:58:35.848