Issue 18165
not existing parent name usage ids in the latest dwca
18165
Reporter: mdoering
Type: Bug
Summary: not existing parent name usage ids in the latest dwca
Priority: Major
Status: ToDo
Created: 2016-01-20 22:58:27.78
Updated: 2016-01-21 12:02:07.957
Description: not existing parent name usage ids in the latest dwca:
http://www.gbif.org/species/search?dataset_key=7ddf754f-d193-4cc9-b351-99906754a03b&issue=PARENT_NAME_USAGE_ID_INVALID
I have tried to grep the id mentioned in this record:
http://www.gbif.org/species/116192967/verbatim
And indeed there is no entry.
Just 2 records using it as a parentid (btw, is it correct to have 2 records for the same name?)
[crap@bla6 7ddf754f-d193-4cc9-b351-99906754a03b]$ grep -n "25576691" taxa.txt
3182441:25579865 urn:lsid:catalogueoflife.org:taxon:82b886e0-6ae5-11e5-9d43-bc764e092680:col20160114 106 WoRMS Echinoidea in Species 2000 & ITIS Catalogue of Life: 15th January 2016 25576691 accepted name infraspecies Agassizia cyrenaica pseudoinflala Desio, 1929 Animalia Echinodermata Echinoidea Spatangoida Prenasteridae Agassizia Agassizia cyrenaica pseudoinflala Desio, 1929 Kroh, Andreas 11-Mar-2014 W-Ech-757599 http://www.catalogueoflife.org/annual-checklist/details/species/id/ae994262fca554fdbf19d249fc1be051 false
3182443:25579866 urn:lsid:catalogueoflife.org:taxon:82b7eb46-6ae5-11e5-9d43-bc764e092680:col20160114 106 WoRMS Echinoidea in Species 2000 & ITIS Catalogue of Life: 15th January 2016 25576691 accepted name infraspecies Agassizia cyrenaica pseudoclevei Desio, 1929 Animalia Echinodermata Echinoidea Spatangoida Prenasteridae Agassizia Agassizia cyrenaica pseudoclevei Desio, 1929 Kroh, Andreas 11-Mar-2014 W-Ech-757598 http://www.catalogueoflife.org/annual-checklist/details/species/id/39115010b31dc63fe16330ef9ac7a377 false
]]>
Author: mdoering@gbif.org
Created: 2016-01-21 12:02:07.957
Updated: 2016-01-21 12:02:07.957
The problem consists of subspecies labelled extant, but with an extinct species as a parent. As currently extinct taxa are excluded from the export, this results in orphaned taxa in the download.
This issue for the download can be fixed quite easily by allowing extinct taxa in the download. Is there a good reason not to include extinct taxa?
The problem however also permeates to the CoL itself:
http://www.catalogueoflife.org/col/details/species/id/1ba6fa9b7197091f724e34cd59ce8606
If the "Include extinct taxa" is off, clicking the species or genus name in the classification leads to a tree opened up only to the family. As the genus and species are extinct, they will not appear.
Apparently this is an extra check we need to add somewhere in the conversion chain. The most logical step would be to "demote" taxa to extinct when their parent is extinct.
So two questions:
1. Do you we allow extinct taxa in the DCA archives
2. Do we post-process extant taxa with extinct parents?