[API-users] Difference between /occurrence/search and /occurrence/count?
trobertson at gbif.org
Thu Apr 9 09:21:50 CEST 2015
2 things can cause this:
1. Eventual consistency
The count service is an insanely high throughput service, while search is lower throughput - they have different backends, and a messaging bus keeps them in sync. Because of this there is often a short period (up to 1 hr but normally < 5 mins) where they can differ during indexing runs. Issues can creep in and they drift and occasionally we rebuild the count service. The search service is always the correct one.
2. Geospatial issues
The isGeoreferenced only counts records with coordinates and no known geospatial issues - i.e. records we’d consider suitable for using the coordinates.
In this case it is 2. that provides the difference, and the search service should be using the &hasGeospatialIssue parameter.
Both report 4515 records.
I hope this helps - please feel free to quote me verbatim on the issue.
On 09 Apr 2015, at 07:40, Scott Chamberlain <myrmecocystus at gmail.com> wrote:
> A user of the R client we make for GBIF reports different number of occurrences for the /occurrence/search endpoint and the /occurrence/count endpoint with the same taxonkey, and limiting to georeferenced data only. See https://discuss.ropensci.org/t/rgbif-occ-count-and-occ-search-results-differ/174 for the discussion.
> I imagine there's a good explanation for this, but I'm not sure what it is right now.
> Thanks! Scott
> API-users mailing list
> API-users at lists.gbif.org
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the API-users