[IPT] Looking for JAR file that is responsible for DwC-A creation.

Wed Oct 19 04:16:53 CEST 2011

Hi Markus,

Thanks for the reply. I've actually finished our basic implementation of 
DwC-A writing already.

I used the ConceptTerm interface and DwcTerm enum, which took a lot of
the work out of making sure I was using the correct identifiers. The
main task was mapping our model to the appropriate concept terms. After
that I wrote some xml and csv files and hey presto - DwC-A :)
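
For anyone reading along, the steps above can be sketched roughly as
follows. This is a minimal illustration under stated assumptions, not
the BDRS code: the class name MiniDwca, the file name occurrence.csv and
the three chosen columns are made up for the example, and it uses plain
term URIs and the Java standard library rather than the ConceptTerm and
DwcTerm types.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

// Hypothetical sketch: class, method and file names are illustrative only.
final class MiniDwca {
    static final String DWC = "http://rs.tdwg.org/dwc/terms/";

    // Column order of the core file; column 0 doubles as the record id.
    static final List<String> TERMS = List.of(
            DWC + "occurrenceID", DWC + "scientificName", DWC + "eventDate");

    // Quote every value and double embedded quotes; null becomes an empty field.
    static String csvRow(List<String> values) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < values.size(); i++) {
            if (i > 0) sb.append(',');
            String v = values.get(i);
            if (v != null) sb.append('"').append(v.replace("\"", "\"\"")).append('"');
        }
        return sb.append('\n').toString();
    }

    // Descriptor that tells a reader how the columns map to Darwin Core terms.
    static String metaXml(String dataFile) {
        StringBuilder sb = new StringBuilder();
        sb.append("<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n");
        sb.append("<archive xmlns=\"http://rs.tdwg.org/dwc/text/\">\n");
        sb.append("  <core encoding=\"UTF-8\" fieldsTerminatedBy=\",\"")
          .append(" linesTerminatedBy=\"\\n\" fieldsEnclosedBy='\"'")
          .append(" ignoreHeaderLines=\"0\" rowType=\"").append(DWC).append("Occurrence\">\n");
        sb.append("    <files><location>").append(dataFile).append("</location></files>\n");
        sb.append("    <id index=\"0\"/>\n");
        for (int i = 0; i < TERMS.size(); i++) {
            sb.append("    <field index=\"").append(i)
              .append("\" term=\"").append(TERMS.get(i)).append("\"/>\n");
        }
        return sb.append("  </core>\n</archive>\n").toString();
    }

    // Bundle the data file and the descriptor into one zip archive.
    static void writeArchive(Path zip, List<List<String>> rows) throws IOException {
        try (OutputStream out = Files.newOutputStream(zip);
             ZipOutputStream z = new ZipOutputStream(out)) {
            z.putNextEntry(new ZipEntry("occurrence.csv"));
            for (List<String> row : rows) {
                z.write(csvRow(row).getBytes(StandardCharsets.UTF_8));
            }
            z.closeEntry();
            z.putNextEntry(new ZipEntry("meta.xml"));
            z.write(metaXml("occurrence.csv").getBytes(StandardCharsets.UTF_8));
            z.closeEntry();
        }
    }

    public static void main(String[] args) throws IOException {
        Path zip = Files.createTempFile("dwca-sketch", ".zip");
        writeArchive(zip, List.of(
                List.of("occ-1", "Macropus rufus", "2011-10-01"),
                List.of("occ-2", "Dromaius novaehollandiae", "2011-10-02")));
        System.out.println("Wrote " + zip);
    }
}
```

The model-to-term mapping is reduced here to a fixed column list; in a
real exporter each column would be filled from the corresponding field
of the application's own record type.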

My implementation is coupled very closely to our own model. Admittedly
it's naive, as it doesn't handle the relationship between core and
extension records the way your approach does.
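
To make the core/extension relationship concrete, here is a rough sketch
of the idea: every extension row repeats the id of the core record it
belongs to. It is not the API of DwcaWriter.java. The class
LinkedRecordsSketch, its method signatures and the choice of a
MeasurementOrFact extension are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of linking extension rows to a core record by id.
// Values are assumed to contain no tabs or newlines.
final class LinkedRecordsSketch {
    private final List<String> coreRows = new ArrayList<>();
    private final List<String> extensionRows = new ArrayList<>();
    private String currentCoreId;

    // Start a new core record; later extension rows attach to this id.
    void newRecord(String coreId, String scientificName) {
        currentCoreId = coreId;
        coreRows.add(coreId + "\t" + scientificName);
    }

    // Add one extension row for the current core record.
    void addExtensionRecord(String measurementType, String measurementValue) {
        if (currentCoreId == null) {
            throw new IllegalStateException("call newRecord first");
        }
        extensionRows.add(currentCoreId + "\t" + measurementType + "\t" + measurementValue);
    }

    List<String> coreRows() { return List.copyOf(coreRows); }

    List<String> extensionRows() { return List.copyOf(extensionRows); }

    // The matching meta.xml fragment: <coreid> marks the linking column.
    static String extensionMeta(String dataFile) {
        String dwc = "http://rs.tdwg.org/dwc/terms/";
        return "  <extension encoding=\"UTF-8\" fieldsTerminatedBy=\"\\t\""
             + " linesTerminatedBy=\"\\n\" ignoreHeaderLines=\"0\""
             + " rowType=\"" + dwc + "MeasurementOrFact\">\n"
             + "    <files><location>" + dataFile + "</location></files>\n"
             + "    <coreid index=\"0\"/>\n"
             + "    <field index=\"1\" term=\"" + dwc + "measurementType\"/>\n"
             + "    <field index=\"2\" term=\"" + dwc + "measurementValue\"/>\n"
             + "  </extension>\n";
    }

    public static void main(String[] args) {
        LinkedRecordsSketch w = new LinkedRecordsSketch();
        w.newRecord("occ-1", "Macropus rufus");
        w.addExtensionRecord("weight", "35 kg");
        w.addExtensionRecord("sex", "male");
        w.extensionRows().forEach(System.out::println);
        System.out.print(extensionMeta("measurements.txt"));
    }
}
```

A reader of the archive joins the two files on column 0, so one core
record can carry any number of extension rows.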

Unfortunately I haven't been assigned to continue working on this
feature of our system, but if requirements mean we need to revisit our
DwC implementation I will review your DwcaWriter.java code again and
get back to you :) I remember having a look at it before, but it didn't
seem appropriate for some reason (sorry, it was a few weeks ago).

The BDRS is on Google Code. When we push up our latest public build I'll
put a link on this mailing list to the source of our implementation - I 
guess it might be useful to someone out there?

Regards,
Aaron

On 10/19/2011 05:14 AM, "Markus Döring (GBIF)" wrote:
> Hi Aaron,
> unfortunately the archive generation inside the IPT does not exist as a separate library. It's rather tightly coupled to the IPT right now and not that easy to rip out, I think.
> For this reason I started to write another small library while I was trying to export the species descriptions of the various Wikipedias into dwc archives.
> It's also not entirely separate now, but I wouldn't mind making it a completely separate project.
>
> The project is on github right now with the main class DwcaWriter here:
> https://github.com/mdoering/wikipedia-dwca/blob/master/src/main/java/org/tdwg/dwca/wikipedia/DwcaWriter.java
>
> This writer creates tab-delimited data files, the meta.xml descriptor and some xml metadata if demanded.
> The writer allows you to add core records one after the other while being able to add any number of extension records to the current core record. Once you are done with a record you simply call newRecord() and pass it a core id. At the very end you call finalize to write the meta.xml and bundle the archive.
>
> The writer is being used here:
> https://github.com/mdoering/wikipedia-dwca/blob/master/src/main/java/org/tdwg/dwca/wikipedia/WikipediaParser.java
>
> Sorry for the incomplete codebase, but it's a side project I never had time to finish properly.
> If you like the idea I'd be glad to move the dwca writer to a project of its own that we can both work on?
>
> best,
> Markus
>
> On 26.09.2011, at 09:39, Aaron Low wrote:
>
>> Hi,
>>
>> I'm a developer working on the BDRS -
>> http://code.google.com/p/ala-citizenscience/.
>>
>> I'm in the process of adding the ability to export observations of
>> species as a Darwin Core archive. I've been perusing your code files at
>> http://repository.gbif.org hoping to find a jar file that lets me use
>> some of the functionality of the IPT without having to run the whole
>> application. To achieve this I would like to include one (or more) of
>> the org.gbif jars as a dependency in our maven build.
>>
>> Unfortunately, I can't seem to find the jar that constructs a new archive.
>>
>> Could someone please point me in the right direction? If the
>> functionality I'm looking for doesn't exist, that would be great for me
>> to know as well.
>>
>> Thanks for your time,
>>
>> Regards,
>>
>> Aaron Low
>> Gaia Resources
>> p +61 8 9227 7309
>> w www.gaiaresources.com.au
>> e aaron.low at gaiaresources.com.au
>>

-- 
Aaron Low
Gaia Resources
p +61 8 9227 7309
w www.gaiaresources.com.au
e aaron.low at gaiaresources.com.au