Issue 13019

Wikipedia Species Pages en, de, es (ECAT Development Publisher migration)

13019
Reporter: ahahn
Assignee: mdoering
Type: SubTask
Summary: Wikipedia Species Pages en, de, es (ECAT Development Publisher migration)
Priority: Major
Status: Open
Created: 2013-03-18 11:53:35.677
Updated: 2013-09-27 17:39:18.999
        
Description: Note:  The Wikimedia Foundation does not own copyright on Wikipedia article texts and illustrations. The text of Wikipedia is copyrighted (automatically, under the Berne Convention) by Wikipedia editors and contributors and is formally licensed to the public under one or several liberal licenses.

Wikipedia text content can be used under the terms of the Creative Commons Attribution ShareAlike license (CC-BY-SA).  This means two things when generating the DwC archive.  1) we should include the source URL to the taxon page and include it n the core file or appropriate extensions and 2) since we are deriving the data from the page content, we should indicate in the EML metadata that the source content has been modified.  The metadata file should also contain a link to the CC-BY-SA via http://creativecommons.org/licenses/by-sa/3.0/

http://uat.gbif.org/dataset/cbb6498e-8927-405a-916b-576d00a6289b
http://uat.gbif.org/dataset/16c3f9cb-4b19-4553-ac8e-ebb90003aa02
http://uat.gbif.org/dataset/cd9fa1dd-d29f-47c6-bac1-31245a9f08e9

Actions needed:
- include source URL to the taxon page in core file or extensions
- indicate change of source content in metadata
- add CC-BY-SA via http://creativecommons.org/licenses/by-sa/3.0/ to rights statement in metadata
]]>
    


Author: mdoering@gbif.org
Created: 2013-03-18 12:05:14.838
Updated: 2013-03-18 12:05:14.838
        
there always has been a url to the source and the description indicates how the data was extracted, see current EMLs:

http://api.gbif.org/uat/dataset/cd9fa1dd-d29f-47c6-bac1-31245a9f08e9/eml
http://api.gbif.org/uat/dataset/16c3f9cb-4b19-4553-ac8e-ebb90003aa02/eml
http://api.gbif.org/uat/dataset/cbb6498e-8927-405a-916b-576d00a6289b/eml

Well, that can be improved for sure. The eml should also indicate the exact details as listed here:
https://github.com/mdoering/wikipedia-dwca/blob/master/src/main/java/org/tdwg/dwca/wikipedia/taxonbox/TaxonboxWikiModel.java
    


Author: ahahn@gbif.org
Comment: There is no natural "owning organization" for this dataset. It will have to stay under this entity (which has been renamed to "GBIF Secretariat"
Created: 2013-09-27 17:39:18.999
Updated: 2013-09-27 17:39:18.999