Issue 11101

registry-sync: Select most prioritized namespace from list of supported namespaces

11101
Reporter: kbraak
Assignee: fmendez
Type: Bug
Summary: registry-sync: Select most prioritized namespace from list of supported namespaces
Priority: Major
Resolution: Fixed
Status: Closed
Created: 2012-05-15 15:36:10.937
Updated: 2013-12-16 17:50:23.047
Resolved: 2012-05-29 11:11:46.437
TimeEstimate: 0
TimeSpent: 21600
        
Description: Applies to TAPIR and BioCASE

For example, in a BioCASE capabilities response there can be 2 supported namespaces: 2.06 and 1.2

2.06 has a higher priority than 1.2, and since they are both supported, 2.06 should be saved as the resource's content namespace, not 1.2]]>
    


Author: fmendez@gbif.org
Comment: Bug in getting searchables titles of capabilities response, XPath expression is now used to filter elements by namespace and searchable=1/true
Created: 2012-05-21 17:18:03.781
Updated: 2012-05-21 17:18:03.781


Author: kbraak@gbif.org
Created: 2012-05-25 16:10:13.039
Updated: 2012-05-25 16:10:13.039
        
Take for example this dataset, the content namespace should be the curatorial  version of DwC: http://staging.gbif.org:8080/registry-ws/dataset/f0f8e840-1df9-11de-be11-b8a03c50a862 reason being, according to the capabilities response: http://data1.gbif.fi/tapirlink/tapir.php/www_trichoptera?op=c there are 3 options, and we want to choose the one with the highest priority.

To be more clear, for the DwC namespaces, from highest to lowest, the namespaces are:

http://digir.net/schema/conceptual/darwin/2003/1.0 (rarely occurs for TAPIR, but if it does it trumps the others)
http://rs.tdwg.org/dwc/curatorial/
http://rs.tdwg.org/dwc/geospatial/
http://rs.tdwg.org/dwc/dwcore/

And for ABCD, it's

2.06 (2.05 also defaults to 2.06)
1.2

PLEASE NOTE: the output model varies as per the namespace. Please see here: http://code.google.com/p/gbif-indexingtoolkit/source/browse/trunk/harvest-webapp/src/main/resources/org/gbif/harvest/tapir/mapping/outputModelMapping.properties