Hi Markus,
Thanks for the reply. I've actually finished our basic implementation of DwC-A writing already.
I used the ConceptTerm interface and DwcTerm enum which took a lot of the work out making sure I was using correct identifiers. The main task was mapping our model to the appropriate concept term. After that I wrote some xml and csv files and hey presto - DwC-A :)
My implementation is coupled very closely to our own model, admittedly it's naive as it doesn't handle the relationship between core and extension records as your approach does.
Unfortunately I haven't been assigned to continue working on this feature of our system but if due to requirements we need to revisit our DwC implementation I will review your DwcaWriter.java code again. I remember have a look at it before but it didn't seem appropriate for some reason (sorry it was a few weeks ago) and I'll get back to you :)
The BDRS is on google code. When we push up our latest public build I'll put a link on this mailing list to the source of our implementation - I guess it might be useful to someone out there?
Regards, Aaron
On 10/19/2011 05:14 AM, "Markus Döring (GBIF)" wrote:
Hi Aaron, unfortunately the archive generation inside the IPT does not exist as a separate library. Its rather tight to the IPT right now and not that easy to rip out I think. For this reason I started to write another small library while I was trying to export the species descriptions of the various wikipedias into dwc archives. Its also not entirely separate now, but I wouldn't mind making it a completely separate project.
The project is on github right now with the main class DwcaWriter here: https://github.com/mdoering/wikipedia-dwca/blob/master/src/main/java/org/tdw...
This writer is creating tab data files, the meta.xml descriptor and some xml metatda if demanded. The writer allows you to add core records one after the other while being able to add any number of extension records to the current core record. Once you are done with a record you simply call newRecord() and pass it a core id. At the very end you call finalize to write the meta.xml and bundle the archive.
The writer is being used here: https://github.com/mdoering/wikipedia-dwca/blob/master/src/main/java/org/tdw...
Sorry for the incomplete codebase, but its a side project I never had time to finish properly. If you like the idea Id be glad to move the dwca writer to a project on its own that we can both work on?
best, Markus
On 26.09.2011, at 09:39, Aaron Low wrote:
Hi,
I'm a developer working on the BDRS - http://code.google.com/p/ala-citizenscience/.
I'm in the process of adding the ability to export observations of species as a darwin core archive. I've been perusing your code files at http://repository.gbif.org hoping to find a jar file that I can leverage some of the functionality of the IPT without having to run whole application. To achieve this I would like to include one (or more) of the org.gbif jars as a dependency in our maven build.
Unfortunately, I can't seem to find the jar that constructs a new archive.
Could someone please point me in the right direction? If the functionality I'm looking for doesn't exist, that would be great for me to know as well.
Thanks for your time,
Regards,
Aaron Low Gaia Resources p +61 8 9227 7309 w www.gaiaresources.com.au e aaron.low@gaiaresources.com.au
IPT mailing list IPT@lists.gbif.org http://lists.gbif.org/mailman/listinfo/ipt