Issue 11604

Implement backfill process for name strings

11604
Reporter: trobertson
Assignee: mdoering
Type: Bug
Summary: Implement backfill process for name strings
Priority: Major
Resolution: WontFix
Status: Closed
Created: 2012-07-16 09:47:39.055
Updated: 2013-12-05 11:05:08.798
Resolved: 2012-09-05 16:33:34.65
        
Description: With the cube defined (see MET-1) we need to populate the cube.  We choose to do this using the backfill process, which allows us to repeat the operation in the future when we (e.g.) want to add a dimension to the cube.
This will require:
a) implementing the BackFill interface
b) iterating over the source data calling the cubeIO.writeSync(...)

See the recent GBIF blog as an introduction, which shows how to do a backfill using a MR job.  Here we would not do a MR job though.

Name strings might be more complex as it looks like distinct counts are going to be required.]]>
    


Author: mdoering@gbif.org
Comment: Not planned to use the cube for CLB now
Created: 2012-09-05 16:33:34.681
Updated: 2012-09-05 16:33:34.681