Issue 17418

typifiedName parsing bug for "Part of Holotype"

17418
Reporter: mdoering
Assignee: mdoering
Type: Bug
Summary: typifiedName parsing bug for "Part of Holotype"
Priority: Major
Resolution: Fixed
Status: Resolved
Created: 2015-03-13 10:37:27.137
Updated: 2015-03-17 15:12:54.895
Resolved: 2015-03-17 15:12:54.872
        
Description: Specimens from the NHM London contain typeStatus: "Part of Holotype which is then parsed to extract a potential typifiedName:
http://api.gbif.org/v1/occurrence/1056102202/verbatim

As a result "Holotype" is extracted thought of as a genus:
http://api.gbif.org/v1/occurrence/1056102202


Other erroneously parsed records of the NHM include:

GBIFID	TYPIFIED_NAME_PARSED
1056104193	Figured Specimen
1056111946	holotype
1056219539	syntype


Correctly parsed NHM examples:
614340713	Abutilon longicuspe Hochst. ex A.Rich.]]>
    


Author: mdoering@gbif.org
Created: 2015-03-17 15:08:55.159
Updated: 2015-03-17 15:08:55.159
        
The proper solution is to get the new typifiedName term ratified and ask NHM to make use of that, reserving dwc:typeStatus just for the status alone:
https://github.com/tdwg/dwc/issues/28
    


Author: mdoering@gbif.org
Created: 2015-03-17 15:12:51.111
Updated: 2015-03-17 15:12:51.111
        
Avoid all typified names that end in -type:
https://github.com/gbif/parsers/commit/e36553879a5cd9f28535f6aeb4e6ad10cc3f2c3d

This should fix most cases before we cab ask NHM to use dwc:typifiedName