<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">
Hi Eduardo,
<div class=""><br class="">
</div>
<div class="">Unfortunately this is a commonly encountered problem, usually because data providers change the data GBIF harvests. In this case, the difference between the data sets is that one has the field “occurrenceID” set and the other doesn’t, so the records
 appear to be two different records to GBIF. In an ideal world the older dataset would be deleted or otherwise deprecated, and only the newer data displayed.</div>
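<div class="">A minimal sketch of how such duplicates might be flagged locally after download. It assumes each record is a dict of Darwin Core fields and that institutionCode, collectionCode and catalogNumber together identify a physical specimen. That matching key is an assumption for illustration, not GBIF's own logic, and the sample values are invented.</div>
<pre>
```python
from collections import defaultdict


def specimen_key(record):
    """Build a matching key that deliberately ignores occurrenceID.

    Assumption: institutionCode + collectionCode + catalogNumber identify
    one physical specimen. Values are trimmed and lower-cased to reduce
    trivial mismatches between datasets.
    """
    parts = (
        record.get("institutionCode", ""),
        record.get("collectionCode", ""),
        record.get("catalogNumber", ""),
    )
    return tuple(p.strip().lower() for p in parts)


def find_duplicates(records):
    """Group records by specimen key; keep only groups with two or more members."""
    groups = defaultdict(list)
    for rec in records:
        key = specimen_key(rec)
        if any(key):  # skip records with no usable identifiers at all
            groups[key].append(rec)
    return {k: v for k, v in groups.items() if len(v) > 1}


# Invented sample: records 1 and 2 describe the same specimen, but only
# the first carries an occurrenceID.
records = [
    {"gbifID": "1", "institutionCode": "NY", "collectionCode": "Herbarium",
     "catalogNumber": "12345", "occurrenceID": "urn:example:12345"},
    {"gbifID": "2", "institutionCode": "NY", "collectionCode": "Herbarium",
     "catalogNumber": "12345"},
    {"gbifID": "3", "institutionCode": "NY", "collectionCode": "Herbarium",
     "catalogNumber": "99999"},
]
duplicates = find_duplicates(records)
```
</pre>
<div class="">If the two datasets use different collection codes for the same material, the key would need to be loosened accordingly.</div>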
<div class=""><br class="">
</div>
<div class="">Regards</div>
<div class=""><br class="">
</div>
<div class="">Rod</div>
<div class=""><br class="">
<div apple-content-edited="true" class="">
<div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">
<div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">
<div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">
<div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">
---------------------------------------------------------<br class="">
Roderic Page<br class="">
Professor of Taxonomy<br class="">
Institute of Biodiversity, Animal Health and Comparative Medicine<br class="">
College of Medical, Veterinary and Life Sciences<br class="">
Graham Kerr Building<br class="">
University of Glasgow<br class="">
Glasgow G12 8QQ, UK<br class="">
<br class="">
Email: <span class="Apple-tab-span" style="white-space: pre;"> </span><a href="mailto:Roderic.Page@glasgow.ac.uk" class="">Roderic.Page@glasgow.ac.uk</a><br class="">
Tel: <span class="Apple-tab-span" style="white-space: pre;"> </span>+44 141 330 4778<br class="">
Skype: <span class="Apple-tab-span" style="white-space: pre;"> </span>rdmpage<br class="">
Facebook: <span class="Apple-tab-span" style="white-space: pre;"> </span>http://www.facebook.com/rdmpage<br class="">
LinkedIn: <span class="Apple-tab-span" style="white-space: pre;"> </span>http://uk.linkedin.com/in/rdmpage<br class="">
Twitter: <span class="Apple-tab-span" style="white-space: pre;"> </span>http://twitter.com/rdmpage<br class="">
Blog: <span class="Apple-tab-span" style="white-space: pre;"> </span>http://iphylo.blogspot.com<br class="">
ORCID: <span class="Apple-tab-span" style="white-space: pre;"> </span>http://orcid.org/0000-0002-7101-9767</div>
<div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">
Citations: <span class="Apple-tab-span" style="white-space: pre;"> </span><a href="http://scholar.google.co.uk/citations?hl=en&user=4Z5WABAAAAAJ" class="">http://scholar.google.co.uk/citations?hl=en&user=4Z5WABAAAAAJ</a></div>
<div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">
ResearchGate<span class="Apple-tab-span" style="white-space: pre;"> </span><a href="https://www.researchgate.net/profile/Roderic_Page" class="">https://www.researchgate.net/profile/Roderic_Page</a><br class="">
<br class="">
</div>
</div>
</div>
</div>
</div>
<br class="">
<div>
<blockquote type="cite" class="">
<div class="">On 14 Sep 2015, at 17:25, Eduardo Dalcin <<a href="mailto:edalcin@jbrj.org" class="">edalcin@jbrj.org</a>> wrote:</div>
<br class="Apple-interchange-newline">
<div class="">
<div dir="ltr" class="">
<div class="gmail_default" style="font-size:small">Good points Markus, Thanks!</div>
<div class="gmail_default" style="font-size:small"><br class="">
</div>
<div class="gmail_default" style="font-size:small">However, other publishers are *very* online, like this example:</div>
<div class="gmail_default" style="font-size:small"><br class="">
</div>
<div class="gmail_default">
<div class="gmail_default" style="font-size:16px">"The New York Botanical Garden Herbarium (NY) - Vascular Plant Collection" (<a href="http://www.gbif.org/dataset/d415c253-4d61-4459-9d25-4015b9084fb0" target="_blank" class="">http://www.gbif.org/dataset/d415c253-4d61-4459-9d25-4015b9084fb0</a>)
 and the "Herbarium of The New York Botanical Garden" (<a href="http://www.gbif.org/dataset/7133ff0a-f762-11e1-a439-00145eb45e9a" target="_blank" class="">http://www.gbif.org/dataset/7133ff0a-f762-11e1-a439-00145eb45e9a</a>).</div>
<div class="gmail_default" style="font-size:16px"><br class="">
</div>
<div class="gmail_default" style="font-size:16px">Same stuff, twice.</div>
<div class="gmail_default" style="font-size:16px"><br class="">
</div>
<div class="gmail_default" style="font-size:16px">The thing is that when we search for, say, "Belemia fucsioides", we get duplicate records of the same entity: </div>
<div class="gmail_default" style="font-size:16px"><br class="">
</div>
<div class="gmail_default" style="font-size:16px"><span id="cid:ii_idrlrq8a0_14f65d191f35c5b7">[Screenshot: GBIF occurrence search results, TAXON_KEY=5553637]</span><br class="">
</div>
<div class="gmail_default" style="font-size:16px"><a href="http://www.gbif.org/occurrence/216419815" target="_blank" class="">http://www.gbif.org/occurrence/216419815</a></div>
<div class="gmail_default" style="font-size:16px"><a href="http://www.gbif.org/occurrence/1098393958" target="_blank" class="">http://www.gbif.org/occurrence/1098393958</a></div>
<div class="gmail_default" style="font-size:16px"><br class="">
</div>
<div class="gmail_default"><span style="font-size:16px" class="">This is very annoying and gives us a lot of cleanup work.</span><br class="">
</div>
<div class="gmail_default"><span style="font-size:16px" class=""><br class="">
</span></div>
<div class="gmail_default"><span style="font-size:16px" class="">Cheers,</span></div>
<div class="gmail_default"><span style="font-size:16px" class=""><br class="">
</span></div>
<div class="gmail_default"><span style="font-size:16px" class="">Eduardo</span></div>
<div class="gmail_default"><span style="font-size:16px" class=""><br class="">
</span></div>
<div class="gmail_default"><span style="font-size:16px" class=""><br class="">
</span></div>
</div>
</div>
<div class="gmail_extra"><br clear="all" class="">
<div class="">
<div class="gmail_signature">
<div dir="ltr" class="">
<div class="">
<div dir="ltr" class="">
<div class="">
<div dir="ltr" class=""><font face="verdana, sans-serif" class=""><br class="">
</font><font face="verdana, sans-serif" class=""><font size="1" class="">--------------------------------</font></font>
<div class=""><b class=""><span style="line-height:14.720000267028809px" class=""><font face="verdana, sans-serif" size="1" class=""><a href="http://eduardo.dalc.in" target="_blank" class="">Eduardo
 Dalcin</a></font></span></b></div>
<div class=""><font face="verdana, sans-serif" size="1" class=""><b class=""><span style="line-height:11.5px" class=""></span></b><span style="line-height:11px" class="">Instituto de Pesquisas Jardim Botânico do Rio de Janeiro - JBRJ</span></font></div>
<div class=""><span style="font-size:x-small" class="">e-mail: </span><a href="mailto:edalcin@jbrj.gov.br" style="font-size:x-small" target="_blank" class="">edalcin@jbrj.gov.br</a><br class="">
</div>
<div class=""><font face="verdana, sans-serif" size="1" class=""><span style="line-height:11px" class=""></span><span style="line-height:11px" class="">Trabalho / Work: +55 21 3204 2116</span></font></div>
<div class=""><span style="font-family:verdana,sans-serif;font-size:x-small" class="">--------------------------------</span><br class="">
</div>
<div class="">
<div style="font-family:verdana,sans-serif" class=""><font face="verdana, sans-serif" size="1" color="#ff0000" style="font-size:x-small" class=""><b class="">e-mail alternativo / </b></font><span style="color:rgb(68,68,68);line-height:16px" class=""> </span><b class=""><span style="line-height:16px" class=""><font size="1" color="#ff0000" class="">alternate
 email</font></span><span style="font-size:x-small;color:rgb(255,0,0)" class="">:</span></b><b style="font-size:x-small;color:rgb(255,0,0)" class=""> <a href="mailto:edalcin@jbrj.org" target="_blank" class="">edalcin@jbrj.org</a></b></div>
</div>
<div style="font-family:verdana,sans-serif" class=""><span style="font-size:x-small" class="">--------------------------------</span></div>
<div class=""><span style="font-family:verdana,sans-serif;font-size:x-small" class="">Agendar reunião / </span><font face="verdana, sans-serif" size="1" class="">Schedule a meeting: <a href="http://agendar.dalc.in" target="_blank" class="">http://agendar.dalc.in</a></font></div>
<div style="font-family:verdana,sans-serif" class=""></div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<br class="">
<div class="gmail_quote">On Thu, Sep 10, 2015 at 4:50 AM, Markus Döring <span dir="ltr" class="">
<<a href="mailto:mdoering@gbif.org" target="_blank" class="">mdoering@gbif.org</a>></span> wrote:<br class="">
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div style="word-wrap:break-word" class="">Eduardo,
<div class=""><br class="">
</div>
<div class="">another difference in using downloads periodically is that you get the interpreted data from us (together with the original if you want to).</div>
<div class="">That already contains quite a bit of data cleaning and aligning to controlled vocabularies that might be painful to reproduce otherwise. </div>
<div class="">Also, publishers are *very* often offline. Especially with the long-running XML harvesting protocols (BioCASe, TAPIR, DiGIR), indexing a dataset in its entirety can be a bit of a challenge.</div>
<span class="HOEnZb"><font color="#888888" class="">
<div class=""><br class="">
</div>
<div class="">Markus</div>
</font></span>
<div class="">
<div class="h5">
<div class=""><br class="">
</div>
<div class=""><br class="">
<div class="">
<blockquote type="cite" class="">
<div class="">On 09 Sep 2015, at 20:02, Eduardo Dalcin <<a href="mailto:edalcin@jbrj.org" target="_blank" class="">edalcin@jbrj.org</a>> wrote:</div>
<br class="">
<div class="">
<div dir="ltr" class="">
<div class="gmail_default" style="font-size:small">Thanks Alex. Food for thought.</div>
<div class="gmail_default" style="font-size:small"><br class="">
</div>
<div class="gmail_default" style="font-size:small">Best,</div>
<div class="gmail_default" style="font-size:small"><br class="">
</div>
<div class="gmail_default" style="font-size:small">Eduardo</div>
</div>
<div class="gmail_extra"><br clear="all" class="">
<br class="">
<div class="gmail_quote">On Wed, Sep 9, 2015 at 2:28 PM, Alex Thompson <span dir="ltr" class="">
<<a href="mailto:godfoder@acis.ufl.edu" target="_blank" class="">godfoder@acis.ufl.edu</a>></span> wrote:<br class="">
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div bgcolor="#FFFFFF" text="#000000" class="">I'm kind of seconding Rod here.<br class="">
<br class="">
It might make more sense, depending on your use case and local computer resources, to just get a download of Plantae *AND* Brazil from GBIF periodically, then process that to exclude existing Brazilian datasets. You could then use something like Apache Hadoop
 / Spark to efficiently split the file by dataset or by institution code.<br class="">
<br class="">
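<div class="">For files that fit in memory, the splitting step could be sketched in plain Python rather than Hadoop / Spark. This assumes the download is a tab-delimited table with a header row and columns named datasetKey and institutionCode; the column names and the sample rows are assumptions for illustration.</div>
<pre>
```python
import csv
import io
from collections import defaultdict


def split_by_column(tsv_text, column="datasetKey"):
    """Group the rows of a tab-delimited table by the value of one column."""
    reader = csv.DictReader(
        io.StringIO(tsv_text),
        delimiter="\t",
        quoting=csv.QUOTE_NONE,  # treat quote characters as ordinary data
    )
    groups = defaultdict(list)
    for row in reader:
        groups[row[column]].append(row)
    return dict(groups)


# Tiny invented sample standing in for a real download file.
sample = (
    "gbifID\tdatasetKey\tinstitutionCode\n"
    "1\taaa\tNY\n"
    "2\tbbb\tRB\n"
    "3\taaa\tNY\n"
)
by_dataset = split_by_column(sample)
by_institution = split_by_column(sample, column="institutionCode")
```
</pre>
<div class="">For a download of several million records, the same grouping would more likely be streamed to one output file per key instead of being held in memory.</div>
<br class="">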
This would greatly simplify your interactions with GBIF (down to just periodically generating a download programmatically) and you would have an easy place to insert any additional data transformations you want. This is the path I take for my work at least
 - the incremental cost of a couple million more records is worth the reduction in complexity overall.<span class=""><font color="#888888" class=""><br class="">
<br class="">
- Alex</font></span>
<div class="">
<div class=""><br class="">
<br class="">
<div class="">On 09/09/2015 12:16 PM, Eduardo Dalcin wrote:<br class="">
</div>
<blockquote type="cite" class="">
<div dir="ltr" class="">
<div class="gmail_default">
<div class="gmail_default">
<div class="gmail_default">Hi Rod,</div>
<div class="gmail_default"><br class="">
</div>
<div class="gmail_default">The real purpose is to have a list of UUIDs and the "source web page" for each dataset. One way to do that is to select those resources whose counts are non-zero for Plantae *AND* Brazil.</div>
<div class="gmail_default"><br class="">
</div>
<div class="gmail_default">I don't want to do any statistical analysis, just feed a local harvester / aggregator.</div>
<div class="gmail_default"><br class="">
</div>
<div class="gmail_default">The problem is that, given Jan Legind's reply of Sep 3, we have to go through the datasets one by one (<a href="https://goo.gl/3wysaA" target="_blank" class="">https://goo.gl/3wysaA</a>) to see whether each is a Herbarium / Preserved Specimen (Plantae) dataset
 or not, starting from the request <a href="http://api.gbif.org/v1/occurrence/counts/datasets?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN" target="_blank" class="">
http://api.gbif.org/v1/occurrence/counts/datasets?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN</a>.</div>
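<div class="gmail_default">A rough sketch of the listing step, assuming the counts response can be read as a mapping of dataset UUID to occurrence count. The response shape and the api.gbif.org/v1/dataset/ metadata address are assumptions; the UUIDs and counts below are invented placeholders.</div>
<pre>
```python
def datasets_with_records(counts):
    """Return (uuid, portal_url, api_url) for every dataset with a non-zero count.

    `counts` is assumed to be a dict mapping dataset UUID to occurrence count.
    """
    rows = []
    for uuid, n in sorted(counts.items()):
        if n != 0:
            rows.append((
                uuid,
                "http://www.gbif.org/dataset/" + uuid,     # dataset page on the portal
                "http://api.gbif.org/v1/dataset/" + uuid,  # metadata record (assumed endpoint)
            ))
    return rows


# Invented placeholder data, not real GBIF output.
sample_counts = {
    "00000000-0000-0000-0000-000000000001": 1520,
    "00000000-0000-0000-0000-000000000002": 0,
}
listing = datasets_with_records(sample_counts)
for uuid, portal_url, api_url in listing:
    print(uuid, portal_url, api_url)
```
</pre>
<div class="gmail_default">Whether each listed dataset is a herbarium / preserved specimen collection would still have to be read from its metadata.</div>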
<div class="gmail_default"><br class="">
</div>
<div class="gmail_default">Does it make sense?<br class="">
</div>
<div class="gmail_default"><br class="">
</div>
<div class="gmail_default">Thanks for your curiosity! :)</div>
<div class="gmail_default"><br class="">
</div>
<div class="gmail_default">Cheers,</div>
<div class="gmail_default"><br class="">
</div>
<div class="gmail_default">Eduardo</div>
</div>
</div>
</div>
<div class="gmail_extra"><br clear="all" class="">
<br class="">
<div class="gmail_quote">On Mon, Sep 7, 2015 at 12:33 PM, Roderic Page <span dir="ltr" class="">
<<a href="mailto:Roderic.Page@glasgow.ac.uk" target="_blank" class="">Roderic.Page@glasgow.ac.uk</a>></span> wrote:<br class="">
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div style="word-wrap:break-word" class="">Hi Eduardo,
<div class=""><br class="">
</div>
<div class="">I’m curious, is the purpose to get counts by dataset by country, or to get all the plant occurrences for Brazil? The latter can be obtained by downloading all plant occurrences in Brazil <a href="http://www.gbif.org/occurrence/search?TAXON_KEY=6&COUNTRY=BR" target="_blank" class="">http://www.gbif.org/occurrence/search?TAXON_KEY=6&COUNTRY=BR</a> (you
 could then compute the per-dataset stats locally). I realise that this isn’t as convenient as having GBIF slice the data for you in the API.<br class="">
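<div class="">Computing the per-dataset counts locally might look like the sketch below. It assumes the download is tab-delimited with a header row and a datasetKey column; the column name and the sample rows are assumptions.</div>
<pre>
```python
import csv
import io
from collections import Counter


def count_by_dataset(lines, column="datasetKey"):
    """Count records per dataset from an iterable of tab-delimited lines."""
    reader = csv.DictReader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    return Counter(row[column] for row in reader)


# Invented sample; a real run would pass an open file handle instead.
sample = io.StringIO(
    "gbifID\tdatasetKey\n"
    "1\taaa\n"
    "2\tbbb\n"
    "3\taaa\n"
)
counts = count_by_dataset(sample)
```
</pre>
<div class="">Because the reader consumes lines one at a time, the same function should work on a large file opened with open(path, encoding="utf-8", newline=""), where path is whatever the extracted download is called locally.</div>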
<div class=""><br class="">
</div>
<div class="">Regards</div>
<div class=""><br class="">
</div>
<div class="">Rod</div>
<div class=""><br class="">
<br class="">
<div class="">
<blockquote type="cite" class="">
<div class="">
<div class="">
<div class="">On 4 Sep 2015, at 10:39, Eduardo Dalcin <<a href="mailto:edalcin@jbrj.org" target="_blank" class="">edalcin@jbrj.org</a>> wrote:</div>
<br class="">
</div>
</div>
<div class="">
<div class="">
<div class="">
<div dir="ltr" class="">
<div class="gmail_default" style="font-size:small">Hi Markus,</div>
<div class="gmail_default" style="font-size:small"><br class="">
</div>
<div class="gmail_default" style="font-size:small">Yes, it's a shame I can't have country and "nub" together. Is there any hope of that changing?</div>
<div class="gmail_default" style="font-size:small"><br class="">
</div>
<div class="gmail_default" style="font-size:small">Eduardo</div>
</div>
<div class="gmail_extra"><br clear="all" class="">
<br class="">
<div class="gmail_quote">On Thu, Sep 3, 2015 at 4:29 PM, Markus Döring <span dir="ltr" class="">
<<a href="mailto:mdoering@gbif.org" target="_blank" class="">mdoering@gbif.org</a>></span> wrote:<br class="">
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Eduardo,<br class="">
<br class="">
as you might have seen from my issue comment, the webservice uses a different parameter name for taxonKey, which is a bug we need to fix at some point.<br class="">
Please use nubKey for now, like this:<br class="">
<br class="">
<a href="http://api.gbif.org/v1/occurrence/counts/datasets?nubKey=6" rel="noreferrer" target="_blank" class="">http://api.gbif.org/v1/occurrence/counts/datasets?nubKey=6</a><br class="">
<br class="">
The real problem for you will be that we do not support combining the country and taxon filters, only one of the two at a time. So you cannot search for plants in Brazil, I am afraid, just for datasets about Brazil and datasets with plant records.<br class="">
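<div class="">As a rough client-side workaround, one could issue the two single-filter requests separately and intersect the dataset keys. The sketch below only builds the request URLs and does the intersection; it assumes each response can be read as a mapping of dataset UUID to count, and the sample mappings are invented.</div>
<pre>
```python
from urllib.parse import urlencode

BASE = "http://api.gbif.org/v1/occurrence/counts/datasets"


def counts_url(**params):
    """Build a request URL with exactly one filter, e.g. nubKey=6 or country='BR'."""
    if len(params) != 1:
        raise ValueError("the service accepts one filter at a time")
    return BASE + "?" + urlencode(params)


def candidate_datasets(country_counts, taxon_counts):
    """Return the sorted dataset keys present in both responses.

    This is only a candidate list: a dataset can hold Brazilian records and
    plant records without holding a single Brazilian plant record.
    """
    return sorted(set(country_counts).intersection(taxon_counts))


plants_url = counts_url(nubKey=6)
brazil_url = counts_url(country="BR")

# Invented stand-ins for the two parsed responses.
brazil = {"aaa": 10, "bbb": 3}
plants = {"bbb": 7, "ccc": 1}
candidates = candidate_datasets(brazil, plants)
```
</pre>
<div class="">Each candidate would still need a per-dataset check to confirm that it really contains plant records from Brazil.</div>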
<span class=""><font color="#888888" class=""><br class="">
Markus<br class="">
</font></span><span class=""><br class="">
<br class="">
<br class="">
> On 03 Sep 2015, at 14:12, Eduardo Dalcin <<a href="mailto:edalcin@jbrj.org" target="_blank" class="">edalcin@jbrj.org</a>> wrote:<br class="">
><br class="">
> Thanks Jan. I'll keep exploring and I'll be in touch, if I need.<br class="">
><br class="">
> Best,<br class="">
><br class="">
> Eduardo<br class="">
><br class="">
><br class="">
><br class="">
</span>
<div class="">
<div class="">
> On Thu, Sep 3, 2015 at 4:51 AM, Jan Legind [GBIF] <<a href="mailto:jlegind@gbif.org" target="_blank" class="">jlegind@gbif.org</a>> wrote:<br class="">
> Dear Eduardo,<br class="">
><br class="">
><br class="">
><br class="">
> Thanks for getting in touch with us about these issues.<br class="">
><br class="">
><br class="">
><br class="">
> The first request <a href="http://api.gbif.org/v1/occurrence/count?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN" rel="noreferrer" target="_blank" class="">
http://api.gbif.org/v1/occurrence/count?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN</a> returns the number of records located in Brazil for the facets in the request.<br class="">
><br class="">
> The second query <a href="http://api.gbif.org/v1/occurrence/counts/datasets?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN" rel="noreferrer" target="_blank" class="">
http://api.gbif.org/v1/occurrence/counts/datasets?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN</a> uses the Occurrence Inventories web service
<a href="http://www.gbif.org/developer/occurrence#inventories" rel="noreferrer" target="_blank" class="">
http://www.gbif.org/developer/occurrence#inventories</a> which does not support the basis-of-record facet in the /datasets request. I understand that it would be better if the API response yielded an error message in this instance.<br class="">
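Since the inventory returns a result rather than an error when given a parameter it does not support, a client-side check can flag such parameters before the request is sent. The following Python sketch is only an illustration: ignored_params is a hypothetical helper, and the supported set is an assumption assembled from the documentation excerpt and the replies in this thread, not from a specification.<br class="">
<pre class="">
```python
# Parameter names the /occurrence/counts/datasets inventory is described as
# accepting in this thread. Treat this set as an assumption, not a spec.
INVENTORY_PARAMS = frozenset({"country", "taxonKey", "nubKey"})


def ignored_params(params, supported=INVENTORY_PARAMS):
    """Return, sorted, the parameter names that are not in the supported set."""
    return sorted(name for name in params if name not in supported)


# The query from the original question, expressed as a dict.
query = {"country": "BR", "taxonKey": 6, "basisOfRecord": "PRESERVED_SPECIMEN"}
dropped = ignored_params(query)
if dropped:
    print("not supported by /counts/datasets:", ", ".join(dropped))
```
</pre>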
><br class="">
><br class="">
><br class="">
> Concerning the other issues – you are indeed right that the counts do not make sense in the context of taxon key 6 which is Plantae. Actually the API does not handle the taxonKey search at all, contrary to what the documentation states:<br class="">
><br class="">
><br class="">
><br class="">
> /occurrence/counts/datasets | GET | Counts | Lists occurrence counts for datasets that cover a given taxon or country. | country, taxonKey<br class="">
><br class="">
><br class="">
><br class="">
> As you can see here, <a href="http://api.gbif.org/v1/occurrence/counts/datasets?taxonKey=6" rel="noreferrer" target="_blank" class="">
http://api.gbif.org/v1/occurrence/counts/datasets?taxonKey=6</a> , this request doesn’t return anything.<br class="">
><br class="">
><br class="">
><br class="">
> The GBIF developers will handle this issue in due time.<br class="">
><br class="">
> You can follow the issue in our bug tracking service here: <a href="http://dev.gbif.org/issues/browse/POR-2828" rel="noreferrer" target="_blank" class="">
http://dev.gbif.org/issues/browse/POR-2828</a><br class="">
><br class="">
> With best regards,<br class="">
><br class="">
><br class="">
><br class="">
> Jan K. Legind<br class="">
><br class="">
> Data manager, GBIF Secretariat<br class="">
><br class="">
> From: API-users [mailto:<a href="mailto:api-users-bounces@lists.gbif.org" target="_blank" class="">api-users-bounces@lists.gbif.org</a>] On Behalf Of Eduardo Dalcin<br class="">
> Sent: 2. september 2015 20:06<br class="">
> To: <a href="mailto:api-users@lists.gbif.org" target="_blank" class="">api-users@lists.gbif.org</a>;
<a href="mailto:dev@gbif.org" target="_blank" class="">dev@gbif.org</a><br class="">
> Cc: João Monnerat Lanna; Natália Queiroz; Diogo Silva; Laura; Ricardo Avancini<br class="">
> Subject: [API-users] Some questions from a begginer<br class="">
><br class="">
><br class="">
><br class="">
> Hi folks,<br class="">
><br class="">
><br class="">
><br class="">
> This is my first message to the list. So, please, be nice :)<br class="">
><br class="">
><br class="">
><br class="">
> I'm working here at the Rio de Janeiro Botanical Garden, together with the guys at the National Center for Flora Conservation. We are carrying out the risk assessment of the Brazilian flora for the government. So far we have assessed the risk of ca. 6,000 species, but we
 still have ca. 35,000 to assess. Access to occurrence records for Brazil is crucial, and every occurrence is important.<br class="">
><br class="">
><br class="">
><br class="">
> That means we have to put together occurrence data from different sources and, after the first batch of the risk assessment, we realized that we need to build our own aggregator. We are planning to do this with the Lontra-harvester, with the help of the
 guys at the Brazilian GBIF Node.<br class="">
><br class="">
><br class="">
><br class="">
> So, one of the first steps was to list the available resources to understand the scale of the task, and that brings me to my questions.<br class="">
><br class="">
><br class="">
><br class="">
> First:<br class="">
><br class="">
><br class="">
><br class="">
> The request:<br class="">
><br class="">
><br class="">
><br class="">
> <a href="http://api.gbif.org/v1/occurrence/count?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN" rel="noreferrer" target="_blank" class="">
http://api.gbif.org/v1/occurrence/count?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN</a><br class="">
><br class="">
><br class="">
><br class="">
> returns 4,982,689 records<br class="">
><br class="">
><br class="">
><br class="">
> And the request:<br class="">
><br class="">
><br class="">
><br class="">
> <a href="http://api.gbif.org/v1/occurrence/counts/datasets?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN" rel="noreferrer" target="_blank" class="">
http://api.gbif.org/v1/occurrence/counts/datasets?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN</a><br class="">
><br class="">
><br class="">
><br class="">
> returns (here) 7,406,310 records<br class="">
><br class="">
><br class="">
><br class="">
> Comments?<br class="">
><br class="">
><br class="">
><br class="">
> Second:<br class="">
><br class="">
><br class="">
><br class="">
> The request:<br class="">
><br class="">
><br class="">
><br class="">
> <a href="http://api.gbif.org/v1/occurrence/count?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN" rel="noreferrer" target="_blank" class="">
http://api.gbif.org/v1/occurrence/count?country=BR&taxonKey=6&basisOfRecord=PRESERVED_SPECIMEN</a><br class="">
><br class="">
><br class="">
><br class="">
> returns things like this:<br class="">
><br class="">
><br class="">
><br class="">
> "197908d0-5565-11d8-b290-b8a03c50a862":27629<br class="">
><br class="">
><br class="">
> But querying the same dataset:<br class="">
><br class="">
><br class="">
><br class="">
> <a href="http://www.gbif.org/occurrence/search?TAXON_KEY=6&DATASET_KEY=197908d0-5565-11d8-b290-b8a03c50a862" rel="noreferrer" target="_blank" class="">
http://www.gbif.org/occurrence/search?TAXON_KEY=6&DATASET_KEY=197908d0-5565-11d8-b290-b8a03c50a862</a><br class="">
><br class="">
><br class="">
><br class="">
> Returns "null" (of course, it is FishBase!)<br class="">
><br class="">
><br class="">
><br class="">
> I have plenty of examples like this, highlighted in yellow here (not finished!):<br class="">
><br class="">
><br class="">
><br class="">
> <a href="https://docs.google.com/spreadsheets/d/1msUjwMLoKwnXxJFzF20SeN_C65RIkGLbwaYyj459VTc/edit?usp=sharing" rel="noreferrer" target="_blank" class="">
https://docs.google.com/spreadsheets/d/1msUjwMLoKwnXxJFzF20SeN_C65RIkGLbwaYyj459VTc/edit?usp=sharing</a><br class="">
><br class="">
><br class="">
><br class="">
> Comments?<br class="">
><br class="">
><br class="">
><br class="">
> I think those two questions are a good start. Please let me know if I'm doing something wrong.<br class="">
><br class="">
><br class="">
><br class="">
> Cheers,<br class="">
><br class="">
><br class="">
><br class="">
> Eduardo<br class="">
><br class="">
> --------------------------------<br class="">
><br class="">
> Eduardo Dalcin<br class="">
><br class="">
> Instituto de Pesquisas Jardim Botânico do Rio de Janeiro - JBRJ<br class="">
><br class="">
> e-mail: <a href="mailto:edalcin@jbrj.gov.br" target="_blank" class="">edalcin@jbrj.gov.br</a><br class="">
><br class="">
> Trabalho / Work: <a href="tel:%2B55%2021%203204%202116" value="+552132042116" target="_blank" class="">
+55 21 3204 2116</a><br class="">
><br class="">
> --------------------------------<br class="">
><br class="">
> e-mail alternativo /  alternate email: <a href="mailto:edalcin@jbrj.org" target="_blank" class="">
edalcin@jbrj.org</a><br class="">
><br class="">
> --------------------------------<br class="">
><br class="">
> Agendar reunião / Schedule a meeting: <a href="https://mailtrack.io/trace/link/db57b837be515d4b7caefe43d55b60467cd7c2c1?url=http%3A%2F%2Fagendar.dalc.in&signature=69b244942739c0f5" rel="noreferrer" target="_blank" class="">
http://agendar.dalc.in</a><br class="">
><br class="">
><br class="">
><br class="">
><br class="">
<br class="">
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
</div>
_______________________________________________<br class="">
API-users mailing list<br class="">
<a href="mailto:API-users@lists.gbif.org" target="_blank" class="">API-users@lists.gbif.org</a><br class="">
<a href="http://lists.gbif.org/mailman/listinfo/api-users" target="_blank" class="">http://lists.gbif.org/mailman/listinfo/api-users</a><br class="">
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
<br class="">
</blockquote>
<br class="">
</div>
</div>
</div>
<br class="">
<br class="">
</blockquote>
</div>
<br class="">
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
</body>
</html>