Issue 17709

unknown.org being added to fields

17709
Reporter: trobertson
Assignee: cgendreau
Type: Improvement
Summary: unknown.org being added to fields
Priority: Minor
Status: Open
Created: 2015-07-21 18:59:01.327
Updated: 2016-06-23 19:49:56.99
        
Description: The occurrenceDetails field somehow got `http://unknown.org/` prepended to its field name, e.g. http://api.gbif.org/v1/occurrence/1088909149/

This is obviously nonsense.
    


Author: mdoering@gbif.org
Created: 2015-07-21 21:07:38.614
Updated: 2015-07-21 21:07:38.614
        
This is because "occurrenceDetails" is not covered in our enumeration (although it is an old dwc term that the TermFactory should know about).
When the factory encounters an unqualified, unknown term, it assigns it the unknown.org namespace. The namespace is silly; we should move it to some well-known placeholder namespace (maybe gbif.org/unknown) so it does not conflict with others.
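
To illustrate the fallback described above, here is a minimal Python sketch. It is not the dwc-api TermFactory code: `resolve_term`, `KNOWN_TERMS` and `UNKNOWN_NAMESPACE` are hypothetical names, and the two known terms are only an assumed sample.

```python
# Illustrative only: hypothetical names, not the dwc-api TermFactory.
UNKNOWN_NAMESPACE = "http://unknown.org/"  # fallback namespace seen in this issue

# Assumed, deliberately tiny set of recognised unqualified terms.
KNOWN_TERMS = {
    "catalogNumber": "http://rs.tdwg.org/dwc/terms/catalogNumber",
    "scientificName": "http://rs.tdwg.org/dwc/terms/scientificName",
}


def resolve_term(name: str) -> str:
    """Return a qualified name for a term as it appears in the source data."""
    if name.startswith(("http://", "https://")):
        return name  # already qualified, keep as published
    if name in KNOWN_TERMS:
        return KNOWN_TERMS[name]  # unqualified but recognised
    # Unqualified and unrecognised: prepend the fallback namespace.
    return UNKNOWN_NAMESPACE + name
```

Under these assumptions, `resolve_term("occurrenceDetails")` falls through to the last branch, which is how a field name can end up with the unknown.org prefix.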
    


Author: rdmpage
Comment: This is also happening with Audubon Core, e.g. http://www.gbif.org/occurrence/814093223/verbatim looks a mess. Is there a mechanism to register this namespace with GBIF so that the row headings for multimedia don't get the "unknown.org" prefix?
Created: 2015-12-22 16:20:01.348
Updated: 2015-12-22 16:20:01.348


Author: mdoering@gbif.org
Created: 2015-12-22 18:02:35.292
Updated: 2015-12-22 18:02:35.292
        
The latest dwc-api release contains all those namespaces:
https://github.com/gbif/dwc-api/tree/master/src/main/java/org/gbif/dwc/terms
https://github.com/gbif/dwc-api

The problem with our occurrences is that we are using a pretty old version of that library, which even predates last year's Darwin Core changes, see POR-2598

Another problem with that Audubon Core record could be that the terms were published without any namespace - at least it appears so at first glance.
Still, the latest TermFactory should be able to understand most if not all of them - if we used the latest dwc-api

    


Author: rdmpage
Created: 2015-12-22 18:48:09.582
Updated: 2015-12-22 18:48:09.582
        
The raw fragment http://www.gbif.org/occurrence/814093223/fragment includes the URL http://rs.tdwg.org/ac/terms/Multimedia so I'm assuming that http://rs.tdwg.org/ac/terms/ is the namespace for Audubon Core (see also http://terms.tdwg.org/wiki/Audubon_Core_Term_List#ac:Multimedia )
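
As a rough sketch of that assumption, the namespace can be read off a term URI by cutting after the last `/` or `#`. The helper name `split_term_uri` is hypothetical and is not part of any GBIF library.

```python
from typing import Tuple


def split_term_uri(uri: str) -> Tuple[str, str]:
    """Split a term URI into (namespace, local name) at the last '/' or '#'."""
    cut = max(uri.rfind("/"), uri.rfind("#")) + 1
    return uri[:cut], uri[cut:]


namespace, local_name = split_term_uri("http://rs.tdwg.org/ac/terms/Multimedia")
# namespace is "http://rs.tdwg.org/ac/terms/" and local_name is "Multimedia"
```

This only shows how the namespace is inferred from the URL in the fragment; whether GBIF registers that namespace is a separate question.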

The day we all forget about custom vocabularies and use http://schema.org will be a happy day ;)