Issue 10640

Populate Dataset.dataAvailable

10640
Reporter: mdoering
Type: Improvement
Summary: Populate Dataset.dataAvailable
Priority: Major
Resolution: Invalid
Status: Closed
Created: 2012-01-17 10:46:59.931
Updated: 2013-12-16 17:50:37.469
Resolved: 2013-12-09 16:31:18.195
        
Description: dataAvailable should be true if there is a service registered that provides data (biocase,digit,tapir,dwca).

Questions:
1) If the dataset hasn't been indexed yet, should dataAvailable be true?
2) If the service is offline, is it still considered available?
3) Should we also set it to true if the GBIF index doesn't make use of it, e.g. when external data provides links to excel sheets or other formats?

I will start by setting it to true if a service exists]]>
    


Author: ahahn@gbif.org
Created: 2012-01-17 11:50:28.276
Updated: 2012-01-17 11:50:28.276
        
dataAvailable was intended to allow the user to exclude datasets that are metadata-only, so that someone targeting a data download does not get bogged down in documents and links that do not help that goal. In this sense:
1) no; only datasets where the data are available through the data portal should be dataAvailabe=true
2) yes; as the index caches the data, they are still available to the search
3) no; again, availability to a search is the key

Another thing is the online-availability of services, which we have plans to log and track (probably in the registry), and would like to display (on the publishers' stats pages).

Do we need an additional facet that covers for general online availability of data, regardless of format and content?
    


Author: mdoering@gbif.org
Created: 2012-01-17 20:57:44.709
Updated: 2012-01-17 20:57:44.709
        
Agree.

I would refrain from yet another facet though as we already have isRegistered which is 95% the same as dataAvailable if not exactly the same.
Hm, or maybe drop isRegistered and have isIndexed (as outlined above) and dataAvailable (aka data with gbif or external link to download available)
    


Author: mdoering@gbif.org
Comment: dataAvailable is currently not checking if a dataset indeed has been indexed. 
Created: 2012-01-18 12:17:10.833
Updated: 2012-01-18 12:17:10.833


Author: mdoering@gbif.org
Created: 2012-01-18 12:18:27.623
Updated: 2012-01-18 12:18:27.623
        
We need to know if or better when a dataset has been indexed.
We need to consider to modify the registry database for such a property
    


Author: kbraak@gbif.org
Comment: dataset.dataAvailable no longer exists in the Solr dataset index. Closing issue.
Created: 2013-12-09 16:31:18.219
Updated: 2013-12-09 16:31:18.219