Issue 12644

Optionally interpret html entities in dwca reader

12644
Reporter: mdoering
Assignee: mdoering
Type: Improvement
Summary: Optionally interpret html entities in dwca reader
Priority: Major
Resolution: Fixed
Status: Resolved
Created: 2013-01-25 10:53:57.13
Updated: 2015-03-03 14:05:05.887
Resolved: 2015-03-03 14:05:05.86
        
Description: Quite a few data contains named html entities like {code}ä{code} or entities with unicode values like {code}©{code}
To avoid having to deal with unescaping these entities in various places it would be nice if the dwca reader did already. This should be an optional feature that is turned off by default.]]>
    


Author: mdoering@gbif.org
Comment: https://github.com/gbif/dwca-reader/commit/e57c26d651e383325c699f97c3fb66d704b838d8
Created: 2015-03-03 14:05:05.884
Updated: 2015-03-03 14:05:05.884