Issue 18378

Panopoda interpreted as Noctuidae

18378
Reporter: donald hobern
Type: Feedback
Summary: Panopoda interpreted as Noctuidae
Status: Open
Created: 2016-04-07 09:30:10.663
Updated: 2016-04-07 10:39:05.796
        
        
Description: See: http://www.gbif-uat.org/occurrence/search?HAS_GEOSPATIAL_ISSUE=false&GEOMETRY=-73.12%20-68.08%2C-73.12%20-60.53%2C-45.57%20-60.53%2C-45.57%20-68.08%2C-73.12%20-68.08&TAXON_KEY=7015

These are not insects ...

*Reporter*: Donald Hobern
*E-mail*: [mailto:dhobern]
    


Author: rdmpage
Comment: It's a fuzzy match problem: the verbatim scientificName "Pantopoda and Crustacea species" has been mapped to the genus _Panopoda_. Perhaps we need to filter out names that are not properly formed before attempting a fuzzy match?
Created: 2016-04-07 09:52:23.433
Updated: 2016-04-07 09:52:23.433


Author: mdoering@gbif.org
Created: 2016-04-07 10:38:58.214
Updated: 2016-04-07 10:38:58.214
        
We did not match those in the old index, but I assume that is only because they were interpreted with older code.
http://www.gbif.org/occurrence/344675417

The current matching service, even in the old portal, does link them to Panopoda:
http://api.gbif-uat.org/v1/species/match?kingdom=&family=&name=Pantopoda%20and%20Crustacea%20species

It boils down to 2 things: a fuzzy match of the genus Pantopoda to Panopoda and, before that, a cleaned and parsed scientific name resulting in "Pantopoda spec.": http://api.gbif.org/v1/parser/name?name=Pantopoda%20and%20Crustacea%20species
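
To illustrate why a fuzzy matcher would consider the two genus names close, here is a minimal sketch of a plain edit-distance calculation. This thread does not say which similarity measure the matching service actually uses, so Levenshtein distance is an assumption chosen only for illustration, and the function name `levenshtein` is hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character insertions, deletions and substitutions."""
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca from a
                current[j - 1] + 1,      # insert cb into a
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


# The genus parsed from the verbatim name vs. the genus it was matched to.
print(levenshtein("Pantopoda", "Panopoda"))
```

The two strings differ by a single deleted "t", so any matcher that tolerates one edit on a genus name would treat them as candidates for the same taxon.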

Unfortunately, detecting names that are not well formed isn't that simple, unless we want to be very strict and reject all sorts of names we receive. Case, for example, cannot be used, as we see names in all upper case. But "and" could be a good hint in this case.
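
A rough sketch of what such an "and" hint could look like as a pre-filter before fuzzy matching. This is not the parser's or the matching service's actual code; the function name `looks_composite` and the token list `CONJUNCTION_HINTS` are hypothetical, and anything beyond "and" in that list is an assumption.

```python
# Hypothetical tokens suggesting that a verbatim string names more than one taxon.
# Only "and" comes from the discussion above; "&" is an assumed addition.
CONJUNCTION_HINTS = {"and", "&"}


def looks_composite(verbatim_name: str) -> bool:
    """Return True if the name contains a standalone conjunction token.

    Comparison is case-insensitive because all upper case names occur,
    and whole tokens are compared so a genus such as "Andrena" is not flagged.
    """
    tokens = verbatim_name.lower().split()
    return any(token in CONJUNCTION_HINTS for token in tokens)


for name in ["Pantopoda and Crustacea species",
             "PANTOPODA AND CRUSTACEA SPECIES",
             "Andrena"]:
    print(name, looks_composite(name))
```

A name flagged this way could be excluded from fuzzy matching, or allowed exact matches only, rather than rejected outright, which would keep the check from being overly strict.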