Issue 17689

Include hybrid, strain & cultivar names in backbone

Reporter: mdoering
Type: NewFeature
Summary: Include hybrid, strain & cultivar names in backbone
Priority: Major
Status: Open
Created: 2015-07-14 22:03:21.912
Updated: 2017-02-03 17:30:55.649
        
Description: Should we include hybrid names, bacterial strain names and plant cultivar names in the backbone? All of them are found in occurrences, even if they are pretty rare.

As a start for the new backbone I would exclude them for now and only add them when needed.
    


Author: mdoering@gbif.org
Comment: [~trobertson@gbif.org], [~rdmpage] I'd be interested to know your opinion
Created: 2015-07-14 22:04:10.234
Updated: 2015-07-14 22:04:10.234


Author: trobertson@gbif.org
Created: 2015-07-15 09:23:20.432
Updated: 2015-07-15 09:23:20.432
        
I would suggest this is not high priority in this round of work.  We used to have them in the backbone pre-2012, when we assembled it using occurrence names, and there has been no user feedback commenting on this.
It's unlikely to be a small amount of work, and likely to require data model changes.  I would defer that and focus on the really critical issues we have currently.
    


Author: mdoering@gbif.org
Created: 2015-07-15 10:11:24.436
Updated: 2015-07-15 10:11:38.269
        
Agreed, Tim, though coding-wise this is not a big thing, as CLB and the name parser handle this already.
The current nub actually does have most of those kinds of names (even though many of them seem to be parsed badly):

http://www.gbif.org/species/search?q=&dataset_key=d7dddbf4-2cf0-4f39-9b2a-bb099caae36c&name_type=HYBRID
http://www.gbif.org/species/search?q=&dataset_key=d7dddbf4-2cf0-4f39-9b2a-bb099caae36c&name_type=INFORMAL
http://www.gbif.org/species/search?q=&dataset_key=d7dddbf4-2cf0-4f39-9b2a-bb099caae36c&name_type=CULTIVAR
http://www.gbif.org/species/search?q=&dataset_key=d7dddbf4-2cf0-4f39-9b2a-bb099caae36c&name_type=VIRUS
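
The four links above differ only in the name_type parameter. A minimal sketch of building them, assuming nothing beyond the query parameters visible in the links themselves; the helper name nub_search_url is hypothetical and not part of any GBIF library:

```python
from urllib.parse import urlencode

# Base URL and dataset key copied from the links above.
BASE_URL = "http://www.gbif.org/species/search"
NUB_DATASET_KEY = "d7dddbf4-2cf0-4f39-9b2a-bb099caae36c"


def nub_search_url(name_type: str) -> str:
    """Build a species search link for one name type (hypothetical helper)."""
    params = {"q": "", "dataset_key": NUB_DATASET_KEY, "name_type": name_type}
    return f"{BASE_URL}?{urlencode(params)}"


if __name__ == "__main__":
    for name_type in ("HYBRID", "INFORMAL", "CULTIVAR", "VIRUS"):
        print(nub_search_url(name_type))
```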

Bacterial strains are not being stored properly now, but hybrid, cultivar and virus names are.

I would suggest we just include regular Latin names, virus names and hybrid names as a start (not hybrid formulas, which are the name type HYBRID in the first link above)? Hybrid names are those that use the hybrid marker before a name part, for example ??Carex ×cayouettei??, which conforms to regular binomial nomenclature. A hybrid formula, by contrast, combines several names and therefore does not fit into the regular name model, for example ??Carex comosa × C. lupulina??.
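
To make the distinction concrete, here is a rough sketch of how the two cases could be told apart by where the hybrid marker sits. It is not how the GBIF name parser works; the function name and the returned labels are made up for illustration. It assumes the marker is the × character, that a marker attached to (or leading) a name part means a hybrid name, and that a standalone marker between two name parts means a formula. Names written with the letter "x" as the marker, or with a space between × and the epithet, are not handled.

```python
HYBRID_MARKER = "\u00d7"  # the multiplication sign used as hybrid marker


def classify_hybrid(name: str) -> str:
    """Return 'HYBRID_FORMULA', 'HYBRID_NAME' or 'REGULAR' (hypothetical labels)."""
    tokens = name.split()

    # A standalone marker with name parts on both sides joins several names,
    # so the string is treated as a hybrid formula.
    for i, token in enumerate(tokens):
        if token == HYBRID_MARKER and 0 < i < len(tokens) - 1:
            return "HYBRID_FORMULA"

    # A marker prefixed to a name part (or leading the whole name) marks a
    # hybrid name that still fits the regular name model.
    if any(token.startswith(HYBRID_MARKER) for token in tokens):
        return "HYBRID_NAME"

    return "REGULAR"


if __name__ == "__main__":
    for example in ("Carex \u00d7cayouettei", "Carex comosa \u00d7 C. lupulina", "Carex comosa"):
        print(example, "->", classify_hybrid(example))
```

With this rule the first example from the comment is kept as a hybrid name and the second is set aside as a formula.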