Issue 18190

Missing occurrences due to incomplete DiGiR harvesting

18190
Reporter: rdmpage
Assignee: jlegind
Type: Feedback
Summary: Missing occurrences due to incomplete DiGiR harvesting
Priority: Major
Resolution: Fixed
Status: Closed
Created: 2016-01-31 14:01:27.605
Updated: 2016-02-01 15:25:15.741
Resolved: 2016-02-01 15:25:15.643
        
Description: Dataset "University of Alberta Museums, Ichthyology Collection http://www.gbif.org/dataset/4f2873fe-7a4e-4fac-9c77-a719049dfa65 has 1000 occurrences, which is a suspiciously rounded number. Looking at the DiGiR provider http://project.macs.ualberta.ca/digir/DiGIR.php it claims 8336 so GBIF harvesting has missed most of the specimens. The provider also states:

1000
1000

is this a limitation of the DiGiR provider or the harvester?  Ca the harvester page through DiGiR records?]]>
    


Author: cgendreau
Created: 2016-02-01 09:23:30.145
Updated: 2016-02-01 09:23:30.145
        
University of Alberta Museums, Ichthyology Collection is also available as Dwc-A
University of Alberta Ichthyology Collection (UAMZ)
http://www.gbif.org/dataset/84f3b06c-f762-11e1-a439-00145eb45e9a

Should we simply remove the harvesting of the DiGIR endpoint if they really represent the same collection?
    


Author: jlegind@gbif.org
Created: 2016-02-01 15:25:15.739
Updated: 2016-02-01 15:25:15.739
        
Redundant DiGIR datasets removed.
Related to http://dev.gbif.org/issues/browse/PF-2327