Issue 18739

Meta data only datasets miss key information from the EML

18739
Reporter: hoefft
Assignee: kbraak
Type: Feedback
Summary: Meta data only datasets miss key information from the EML
Priority: Critical
Resolution: Fixed
Status: Closed
Created: 2016-09-19 14:30:55.278
Updated: 2018-05-31 16:21:21.825
Resolved: 2018-05-31 16:21:21.75
        
Description: http://www.gbif.org/dataset/d92e7617-616e-4928-9051-3d19cfb2ff0a

this meta data only dataset shows no information about geographical coverage nor taxonomy.

it isn't in the api response either
http://api.gbif.org/v1/dataset/d92e7617-616e-4928-9051-3d19cfb2ff0a

but it is in the original eml
http://ipt.medobis.eu/eml.do?r=danishexpedition_mediterranean_scopelidae

that is a shame - we should parse that if possible. IT seems to be the case for all meta data only datasets I have looked at]]>
    


Author: mdoering@gbif.org
Created: 2016-09-20 10:11:23.928
Updated: 2016-09-20 10:11:23.928
        
There are several points in our crawler architecture that need to change to actually index EML endpoints. We don't do this currently at all.
Currently:
 - the crawl coordinator ignores EML endpoints to be crawled
 - the dwca-metasync service listens only to DwcaValidationFinishedMessage requiring a dwc archive to exist

We might need to redesign the message workflow or just add new ones for pure EML/Metadata endpoints.

    


Author: mblissett
Comment: Metadata only datasets were added to crawling properly some time last year.
Created: 2018-05-31 16:21:21.802
Updated: 2018-05-31 16:21:21.802