Issue 14531

Download not a valid DwC Archive

14531
Reporter: feedback bot
Assignee: mdoering
Type: Bug
Summary: Download not a valid DwC Archive
Priority: Blocker
Resolution: Fixed
Status: Resolved
Created: 2014-01-06 22:00:53.484
Updated: 2014-01-09 15:26:27.952
Resolved: 2014-01-09 15:26:27.927
        
        
Description: At first, naively, I thought the download was a DwC Archive, but now I think that was mistaken: the GBIF validator fails it. Given that, is it safe to assume that downloads going forward will be stable? ie, they'll still be zips with an occurenct.txt which is actually a CSV, where the column names seem to be DwC terms, except with underscores instead of camel case?

Or if it will change with v1.0, awesome, I'm just trying to understand where it stands, and how fragile I can expect my reader code to be.
*E-mail*: [mailto:mccalluc@yahoo.com]]]>
    


Author: mdoering@gbif.org
Created: 2014-01-07 14:30:23.086
Updated: 2014-01-07 14:30:23.086
        
Indeed the current download archives are invalid archives and lack the very important archive descriptor meta.xml
This is a bug to be fixed very soon and they used to be valid archives.

Stability is guaranteed to remain a valid archive, thus reading the meta.xml is highly recommended as header rows, their ordering and filenames might change over time. 
    


Author: mdoering@gbif.org
Created: 2014-01-09 15:26:27.949
Updated: 2014-01-09 15:26:27.949
        
https://code.google.com/p/gbif-occurrencestore/source/detail?r=2285

Sneaked latest 2 snapshot jars into dev and prod hdfs to fix the online issue