Issue 18480

Brazilian Flora contains far too many unmatched names

18480
Reporter: mdoering
Assignee: mdoering
Type: Bug
Summary: Brazilian Flora contains far too many unmatched names
Priority: Critical
Status: InProgress
Created: 2016-05-19 10:24:39.856
Updated: 2016-08-09 17:47:40.971
        
Description: The Brazilian flora has more than 100.000 unmatched names:
http://www.gbif.org/species/search?dataset_key=aacd816d-662c-49d2-ad1a-97e66e2a2908&issue=BACKBONE_MATCH_NONE

This seems wrong as the flora is used to build the backbone and random name test do actually exist in our backbone, e.g the genus Piper:

Brazilian Flora: http://www.gbif.org/species/114761199
Backbone: http://www.gbif.org/species/3075433

https://github.com/gbif/checklistbank/blob/master/checklistbank-nub/nub-sources.tsv#L24

So really pretty much all names should match.]]>
    


Author: mdoering@gbif.org
Created: 2016-07-04 16:10:49.226
Updated: 2016-07-04 16:10:49.226
        
An example that has not matched but should according to the service is Chamaecrista desvauxii:
http://api.gbif.org/v1/species/match?verbose=true&strict=true&family=Fabaceae&name=Chamaecrista%20desvauxii%20(Collad.)%20Killip

http://www.gbif.org/species/114721344
    


Author: mdoering@gbif.org
Created: 2016-08-04 13:31:25.88
Updated: 2016-08-04 22:16:20.16
        
there are only 2609 unmatched in the august 2016 backbone and it appears they all are from unsupported or higher ranks like TRIBE, CLASS:
http://api.gbif-uat.org/v1/species/search?dataset_key=aacd816d-662c-49d2-ad1a-97e66e2a2908&issue=BACKBONE_MATCH_NONE

still 1546 species:
http://api.gbif-uat.org/v1/species/search?dataset_key=aacd816d-662c-49d2-ad1a-97e66e2a2908&issue=BACKBONE_MATCH_NONE&rank=SPECIES
    


Author: rdmpage
Comment: At first glance lots of species with the name "inaccurate cedes", in other words identified to genus level only, and lots of _Euglena_ which have been lost with the removal of Algaebase (sigh).
Created: 2016-08-04 23:10:50.935
Updated: 2016-08-04 23:10:50.935


Author: joaomlanna@gmail.com
Created: 2016-08-08 22:40:30.111
Updated: 2016-08-08 22:40:30.111
        
Dear Markus,

I suppose that our list are already on the GBIF backbone. Am I right?

And about the 73.365 "Nomenclatural status invalid" in our statistics? What that means? Is it related to this POR-3107?

Regards,
    


Author: mdoering@gbif.org
Comment: [~joaomlanna@gmail.com] these "invalid" nomenclatural status flags mean that GBIF could not interpret them into our enumeration. This is nothing important and I will need to make sure we better understand them. But it's a bit down the priority line right now I am afraid. Nothing to worry about on your side
Created: 2016-08-09 17:47:40.971
Updated: 2016-08-09 17:47:40.971