Issue 15622

dwca-reader cant handle multiple core files

15622
Reporter: mdoering
Type: NewFeature
Summary: dwca-reader cant handle multiple core files
Priority: Major
Status: Open
Created: 2014-05-19 10:37:54.365
Updated: 2014-05-19 10:38:25.465
        
Description: Moved from https://code.google.com/p/darwincore/issues/detail?id=95

-----
The dwca specifications allow to list several files for the core or any extension. The dwca reader can't handle this setup so far because when extensions are present it needds to sort all files to be able to iterate over merged "star records". This needs to be fixed.

Option
a) first temporarly merge files into 1. On unix via shell trivial, on other systems hm
b) keep one iterator per file open and compare all of them when doing the final iteration. Far more difficult but portable...

-----
Feb 18, 2011 #1 wixner
Easy entry to this is in Archive.sortFiles() which does the sorting and could also do a concatenation beforehand.
I think we should definitely create a temp concat file instead of solution b).
Concatenation of files could also easily be written in pure java, avoiding shell calls
]]>