Issue 13774

":) ia" is not a recognized kingdom

13774
Reporter: gaurav
Assignee: jlegind
Type: Bug
Summary: ":) ia" is not a recognized kingdom
Description: I've been wondering which kingdom http://uat.gbif.org/species/124484006 refers to. It appears to be a root node of some sort in EOL, maybe? Searching for that name on EOL finds no records. Looking up http://uat.gbif.org/species/124484006/verbatim gives me a 500 error.
Priority: Minor
Status: Reopened
Created: 2013-09-05 02:42:47.283
Updated: 2015-03-06 13:31:04.961


Author: mdoering@gbif.org
Created: 2013-09-05 10:14:59.298
Updated: 2013-09-05 10:14:59.298
        
The EOL data is some months old, and the record might have been removed in the meantime.
Scanning the underlying DwC archive for that name yields 2 entries in EOL at that time:

{noformat}
31473960        Tornoceratinae Arthaber 1911
31475139        :)ia    kingdom
31475140        :)      order
31476758        Aeschna caerulea
{noformat}

    


Author: mdoering@gbif.org
Comment: The 500 error on the verbatim page is fixed; you should now get a 404 instead on http://portaldev.gbif.org
Created: 2013-09-05 10:17:25.866
Updated: 2013-09-05 10:17:39.749


Author: gaurav
Comment: It's still there, with 360,000+ species: http://www.gbif.org/species/124484006
Created: 2013-10-21 23:04:41.345
Updated: 2013-10-21 23:04:41.345


Author: mdoering@gbif.org
Created: 2013-10-22 11:14:35.503
Updated: 2013-10-22 11:14:35.503
        
Yes, it's there because EOL published it. We will not modify or clean the data they have published.
Maybe someone should get in touch with EOL to ask them to republish an updated version?
    


Author: jlegind@gbif.org
Comment: EOL has been contacted.
Created: 2013-10-22 12:36:42.243
Updated: 2013-10-22 12:36:42.243


Author: jlegind@gbif.org
Created: 2015-03-06 11:26:11.474
Updated: 2015-03-06 11:26:11.474
        
[~mdoering@gbif.org] Here is a list of some of the scientific name values in the EOL DwC archive that I found suspicious:

+
* *
??
A.
2
x
H.
P
?
???
×
S
F
:)
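
A minimal sketch of how values like these could be flagged automatically. The rule is an assumption made for illustration (a plausible name starts with a capital letter followed by at least one lowercase letter); it is not how the list above was produced, and `is_suspicious` is a hypothetical helper. The rule is deliberately crude: a hybrid formula that legitimately starts with "×" would be flagged as well.

```python
import re

# Assumed heuristic, not an official rule: a plausible scientific name
# starts with an uppercase ASCII letter followed by one or more lowercase
# letters, as in "Aeschna" or "Tornoceratinae".
_PLAUSIBLE_START = re.compile(r"^[A-Z][a-z]+")


def is_suspicious(name: str) -> bool:
    """Return True when a name does not start like a Latin taxon name."""
    return _PLAUSIBLE_START.match(name.strip()) is None


# The values listed above, plus two ordinary names from the archive excerpt.
samples = [
    "+", "* *", "??", "A.", "2", "x", "H.", "P", "?", "???", "×", "S", "F", ":)",
    "Aeschna caerulea", "Tornoceratinae Arthaber 1911",
]

flagged = [name for name in samples if is_suspicious(name)]
for name in flagged:
    print(repr(name))
```

Run over the scientific name column of the archive, a filter of this kind would give a candidate list to review by hand before contacting the publisher.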

    


Author: mdoering@gbif.org
Created: 2015-03-06 13:31:04.961
Updated: 2015-03-06 13:31:04.961
        
Yeah. Seems to still be the old copy then. Unless they publish an updated version we should keep the dataset empty as it is now.
An HTTP HEAD request gives:
Last-Modified: Tue, 25 Sep 2012 21:06:00 GMT

curl -I http://content60.eol.org/downloads/eol_names_archive.tar.gz
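
The same check could be scripted so that a republished archive is noticed without a manual request. This is only a sketch using the Python standard library: `fetch_last_modified` and `is_newer` are hypothetical helper names, and the reference timestamp is the Last-Modified value quoted above.

```python
from datetime import datetime
from email.utils import parsedate_to_datetime
from typing import Optional
from urllib.request import Request, urlopen

ARCHIVE_URL = "http://content60.eol.org/downloads/eol_names_archive.tar.gz"

# Timestamp of the copy already seen, taken from the header quoted above.
KNOWN_COPY = parsedate_to_datetime("Tue, 25 Sep 2012 21:06:00 GMT")


def fetch_last_modified(url: str) -> Optional[str]:
    """Send a HEAD request and return the raw Last-Modified header, if any."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=30) as response:
        return response.headers.get("Last-Modified")


def is_newer(last_modified: str, known: datetime) -> bool:
    """Return True when the header value is later than the known copy."""
    return parsedate_to_datetime(last_modified) > known


if __name__ == "__main__":
    header = fetch_last_modified(ARCHIVE_URL)
    if header is None:
        print("No Last-Modified header returned")
    elif is_newer(header, KNOWN_COPY):
        print("Archive has been republished:", header)
    else:
        print("Still the old copy:", header)
```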