Issue 17706

Crawling: BioCASe declaredCount

17706
Reporter: trobertson
Type: Improvement
Summary: Crawling: BioCASe declaredCount
Priority: Major
Resolution: Fixed
Status: Closed
Created: 2015-07-21 16:07:38.595
Updated: 2018-05-31 16:41:57.815
Resolved: 2018-05-31 16:41:57.789
        
Description: At the start of crawling, or during the inventory, try and determine the count of records and store it as a machineTag like so.

{code}
{"namespace": "metasync.gbif.org", "name": "declaredCount", "value": "10"}
{code}

Code is in place to determine sensible page ranges in the scientific name range crawl strategy already, but it relies on the existence of the machine tag.  It would save a huge amount of time for the smaller datasets.]]>