Asynchronous download DwCA format no publishingOrgKey
data:image/s3,"s3://crabby-images/5bd09/5bd093560a49554f2e0b30fcc4a9dcfd51e56f25" alt=""
Dear GBIF, Is there a reason why the Asynchronous download DwCA format doesn't have publishingOrgKey while the CSV format has it. We want to use the DwCA format because of the multimedia addition but realized the publishingOrgKey was missing. Hopefully publishingOrgKey will be included in the DwCA format so we don't need to download both formats. Thanks, Eli
data:image/s3,"s3://crabby-images/1e279/1e2799c1f4f68735197b86e73dab96ef82a8c135" alt=""
Dear Eli, I don't know why it is this way, according to our API the term publishingOrgKey is "not exposed in user downloads": http://gbif.github.io/dwc-api/apidocs/org/gbif/dwc/terms/GbifInternalTerm.ht... Given a datasetKey (which is in the DwCA format), you can query for the publishingOrganizationKey with a query like http://api.gbif.org/v1/dataset/2814930a-3b93-48d6-b887-00979481ac9e, for example using "curl" and "jq": curl -Ss http://api.gbif.org/v1/dataset/2814930a-3b93-48d6-b887-00979481ac9e | jq .publishingOrganizationKey There are 20,000 datasets in a complete download, so making these queries and caching the result will probably be faster than using both download formats. Best regards, Matt Blissett On 12/08/18 20:40, Eli Agbayani wrote:
Dear GBIF, Is there a reason why the Asynchronous download DwCA format doesn't have publishingOrgKey while the CSV format has it.
We want to use the DwCA format because of the multimedia addition but realized the publishingOrgKey was missing.
Hopefully publishingOrgKey will be included in the DwCA format so we don't need to download both formats.
Thanks, Eli
_______________________________________________ API-users mailing list API-users@lists.gbif.org https://lists.gbif.org/mailman/listinfo/api-users
data:image/s3,"s3://crabby-images/5bd09/5bd093560a49554f2e0b30fcc4a9dcfd51e56f25" alt=""
Thanks Matt. Yes, that will work for us. Will use the dataset API to get publishingOrgKey. We've actually already cached the ~20K dataset calls, so we are good. Thanks much. Eli On Thu, Aug 16, 2018 at 9:22 AM, Matthew Blissett <mblissett@gbif.org> wrote:
Dear Eli,
I don't know why it is this way, according to our API the term publishingOrgKey is "not exposed in user downloads": http://gbif.github.io/dwc-api/apidocs/org/gbif/dwc/terms/ GbifInternalTerm.html
Given a datasetKey (which is in the DwCA format), you can query for the publishingOrganizationKey with a query like http://api.gbif.org/v1/ dataset/2814930a-3b93-48d6-b887-00979481ac9e, for example using "curl" and "jq":
curl -Ss http://api.gbif.org/v1/dataset/2814930a-3b93-48d6- b887-00979481ac9e | jq .publishingOrganizationKey
There are 20,000 datasets in a complete download, so making these queries and caching the result will probably be faster than using both download formats.
Best regards,
Matt Blissett
On 12/08/18 20:40, Eli Agbayani wrote:
Dear GBIF, Is there a reason why the Asynchronous download DwCA format doesn't have publishingOrgKey while the CSV format has it.
We want to use the DwCA format because of the multimedia addition but realized the publishingOrgKey was missing.
Hopefully publishingOrgKey will be included in the DwCA format so we don't need to download both formats.
Thanks, Eli
_______________________________________________ API-users mailing listAPI-users@lists.gbif.orghttps://lists.gbif.org/mailman/listinfo/api-users
_______________________________________________ API-users mailing list API-users@lists.gbif.org https://lists.gbif.org/mailman/listinfo/api-users
participants (2)
-
Eli Agbayani
-
Matthew Blissett