Hi Srini,

the species count on the species page represents the species our backbone taxonomy contains for this phylum regardless whether we have occurrences or not.
The distinct species in your occurrences are less cause for some species we do not have any evidence.
Btw, in the upcoming portal you can see the list of distinct, accepted species right away: 
https://demo.gbif.org/occurrence/species?taxon_key=79 (select species instead of table view)

Best,
Markus

On 4 Apr 2017, at 18:06, Srini Anand <srianand@umail.iu.edu> wrote:

Hello,
I'm a student from Indiana University, working towards a Masters in Data Science. As part of a class project, we are working with the team from Global Biointeractions (GloBI) to compare how much their data covers the known species list. To that end we are using GBIF as the reference system. 

I have worked out the details of navigating your site and extracting information for us to compare against. However, I'm unable to reconcile counts between the download and what I see on the species page.

Mismatch in species count
Using Kingdom = Archaea and Phylum = Crenarchaeota, I navigated to the Occurrences page and downloaded the 3,824 occurrences. When I extract unique Species from this list, there are 59, whereas the main page indicates there should be 68. 

I see a similar issue with 
a) Kingdom = Archaea (should be 523, see 485) 
b) Class = Pinopsida (should be 2,366 species, but see 983) 

There is probably a small and obvious mistake I'm making, but not sure I know how to identify that.


Any guidance would be must appreciated.

Regards,
Srini Anand
Masters of Data Science student
_______________________________________________
API-users mailing list
API-users@lists.gbif.org
https://lists.gbif.org/mailman/listinfo/api-users