Issue 18026

Indexing error resulting in duplication of taxa

Reporter: rdmpage
Type: Bug
Summary: Indexing error resulting in duplication of taxa
Priority: Major
Status: Open
Created: 2015-11-11 09:37:17.474
Updated: 2016-04-27 06:51:27.373
        
Description: The checklist dataset http://www.gbif.org/dataset/7a9bccd4-32fc-420e-a73b-352b92267571 "Checklist of Beetles (Coleoptera) of Canada and Alaska. Second Edition" has multiple records for the same taxa, e.g. the genus _Trypodendron_ http://www.gbif.org/species/search?q=Trypodendron&dataset_key=7a9bccd4-32fc-420e-a73b-352b92267571 is listed *six times*. Five of these genus-level records have a single species descendant.

Looking at the original DwC-A, this genus appears just once, so I think the error is in processing the data. Given that there are five species of _Trypodendron_ in the dataset, and there are five extra entries in GBIF for _Trypodendron_, I suspect that there's a problem indexing the *higherClassification* field, and that this is the source of the extra entries. For a more extreme example, see the results for _Pityophthorus_ http://www.gbif.org/species/search?q=Pityophthorus&dataset_key=7a9bccd4-32fc-420e-a73b-352b92267571
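
The check against the source archive described above could be sketched roughly as follows. This is only an illustration, not the GBIF indexing code: it assumes the taxon table is tab-delimited and uses the Darwin Core column names scientificName, taxonRank and higherClassification. The sample rows, the ";" separator and the function names are made up for the example and are not taken from the dataset.

```python
import csv
import io
from collections import Counter


def count_taxa(rows):
    """Count how many records share the same (scientificName, taxonRank)."""
    return Counter((row["scientificName"], row["taxonRank"]) for row in rows)


def duplicated_taxa(rows):
    """Return only the (name, rank) pairs that occur more than once."""
    return {key: n for key, n in count_taxa(rows).items() if n > 1}


# Illustrative rows only (hypothetical, not copied from the checklist).
SAMPLE = (
    "scientificName\ttaxonRank\thigherClassification\n"
    "Trypodendron\tgenus\tColeoptera;Curculionidae\n"
    "Trypodendron lineatum\tspecies\tColeoptera;Curculionidae;Trypodendron\n"
    "Trypodendron betulae\tspecies\tColeoptera;Curculionidae;Trypodendron\n"
)

source_rows = list(csv.DictReader(io.StringIO(SAMPLE), delimiter="\t"))

# In the source table the genus occurs once, so nothing is reported.
print(duplicated_taxa(source_rows))

# Simulate an index that holds an extra copy of the genus record.
indexed_rows = source_rows + [source_rows[0]]
print(duplicated_taxa(indexed_rows))
```

Running the same count over the source table and over the indexed records would show whether the extra genus entries exist in the archive or only appear after processing.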
    


Author: rdmpage
Comment: This bug appears to be fixed, so this issue can be closed (I don't seem to have permission to do that myself).
Created: 2016-04-27 06:51:27.373
Updated: 2016-04-27 06:51:27.373