<html dir="ltr">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<style type="text/css" id="owaParaStyle"></style>
</head>
<body fpstyle="1" ocsi="0">
<div style="direction: ltr;font-family: Tahoma;color: #000000;font-size: 10pt;">Thanks Daniel.
<div><br>
</div>
<div>The unique key is specified in the collectory.&nbsp;</div>
<div><span style="font-size: 10pt;">For the dr0 resource, what are the settings in your instance of the collectory?</span></div>
<div><span style="font-size: 10pt;">If it's possible, please send through a URL to your collectory instance.</span></div>
<div><br>
</div>
<div>Cheers</div>
<div><br>
</div>
<div>Dave</div>
<div><br>
<div style="font-family: Times New Roman; color: #000000; font-size: 16px">
<hr tabindex="-1">
<div id="divRpF348296" style="direction: ltr;"><font face="Tahoma" size="2" color="#000000"><b>From:</b> Daniel Lins [daniel.lins@gmail.com]<br>
<b>Sent:</b> 09 May 2014 13:59<br>
<b>To:</b> Martin, Dave (CES, Black Mountain)<br>
<b>Cc:</b> ala-portal@lists.gbif.org; dos Remedios, Nick (CES, Black Mountain); Pedro Corrêa; Nicholls, Miles (CES, Black Mountain)<br>
<b>Subject:</b> Re: [Ala-portal] DwC-A loading problems<br>
</font><br>
</div>
<div></div>
<div>
<div dir="ltr">Thanks David,
<div><br>
</div>
<div>We use the DwC term &quot;occurrenceID&quot; to identify the records. It's a unique&nbsp;key.</div>
<div><br>
</div>
<div>
<div>However, when I reload a dataset to update some DwC terms of the records, the system duplicates the records (it keeps the old record and creates another with the changes).&nbsp;<br>
</div>
</div>
<div><br>
</div>
<div>For instance (update of locality).</div>
<div><br>
</div>
<div>Load 1 ($ java -cp .:biocache.jar au.org.ala.util.DwcCSVLoader dr0 -l dataset.csv -b true)</div>
<div><br>
</div>
<div><font color="#38761d">{<span id="0d5a3d17-eb60-4df8-8eba-98ab1dbf8292" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 1,&nbsp;municipality: Sao Paulo,<span id="d9be3258-859e-4b79-acff-17bc926b45b0" class="GINGER_SOFTWARE_mark">&nbsp;...</span>},</font></div>
<div><font color="#3d85c6">{<span id="29b8fd7e-ea33-4f2a-bd34-5b1abf76240e" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 2,&nbsp;municipality: Sao Paulo,<span id="406859bf-9ddc-4eea-b77c-6d9fcc7741c4" class="GINGER_SOFTWARE_mark">&nbsp;...</span>}</font></div>
<div><br>
</div>
<div>Process 1 (biocache$ process dr0)</div>
<div>Index 1 (biocache$ index dr0)</div>
<div><br>
</div>
<div>Load 2 (updated records and new records) ($ java -cp .:biocache.jar au.org.ala.util.DwcCSVLoader dr0 -l dataset-updated.csv -b true)</div>
<div>
<div><br>
</div>
<div><font color="#38761d">{<span id="af714c92-e9cd-4a3b-9790-626cbad14b1f" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 1,&nbsp;</font><span style="color:rgb(56,118,29)">municipality</span><font color="#38761d">: Rio de Janeiro,<span id="a0217845-7eeb-4755-9247-183626153370" class="GINGER_SOFTWARE_mark">&nbsp;...</span>},</font></div>
<div><font color="#3d85c6">{<span id="da447ce1-dee2-4584-b4d2-10cf2b699302" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 2,&nbsp;</font><span style="color:rgb(61,133,198)">municipality</span><font color="#3d85c6">: Rio de&nbsp;Janeiro,<span id="7df68567-83c4-4653-a56d-874e65e40fad" class="GINGER_SOFTWARE_mark">&nbsp;...</span>},</font></div>
</div>
<div>{<span id="04340ea5-32fa-4b56-99e5-a84a831a2ae5" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 3, municipality: Sao Paulo,<span id="d35ff14b-cbc6-456f-9981-4e98ac79a547" class="GINGER_SOFTWARE_mark">&nbsp;...</span>}<br>
</div>
<div><br>
</div>
<div>
<div>Process 2 (biocache$ process dr0)</div>
<div>Index&nbsp;2&nbsp;(biocache$ index dr0)</div>
</div>
<div><br>
</div>
<div>Results shown by ALA:</div>
<div><br>
</div>
<div>
<div><font color="#38761d">{<span id="82163306-055e-446a-b419-a4b2833dc642" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 1,&nbsp;</font><span style="color:rgb(56,118,29)">municipality</span><font color="#38761d">:&nbsp;</font><span style="color:rgb(56,118,29)">Sao
 Paulo</span><font color="#38761d">,<span id="b243e37e-8d69-4d90-b8b5-d138a2667e8c" class="GINGER_SOFTWARE_mark">&nbsp;...</span>},</font></div>
<div><font color="#3d85c6">{<span id="97902fda-0410-41eb-a3d5-6afc8c894146" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 2,&nbsp;</font><span style="color:rgb(61,133,198)">municipality</span><font color="#3d85c6">:&nbsp;</font><span style="color:rgb(61,133,198)">Sao
 Paulo</span><font color="#3d85c6">,<span id="dedde570-ce77-47c7-bc9a-4bb518b7884c" class="GINGER_SOFTWARE_mark">&nbsp;...</span>},</font></div>
</div>
<div>
<div>
<div><font color="#38761d">{<span id="8e9b8e3f-d21f-468a-972e-5f21b569d46a" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 1,&nbsp;</font><span style="color:rgb(56,118,29)">municipality</span><font color="#38761d">:&nbsp;</font><span style="color:rgb(56,118,29)">Rio
 de Janeiro</span><font color="#38761d">,<span id="1b84c27d-47ca-43f5-a22d-51da444f1e40" class="GINGER_SOFTWARE_mark">&nbsp;...</span>},</font></div>
<div><font color="#3d85c6">{<span id="90ad797f-ebd4-4501-81f4-f33200e11fab" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 2,&nbsp;</font><span style="color:rgb(61,133,198)">municipality</span><font color="#3d85c6">:&nbsp;</font><span style="color:rgb(61,133,198)">Rio
 de&nbsp;Janeiro</span><font color="#3d85c6">,<span id="813b02f3-e9ba-44f1-82c8-9c270b4b5292" class="GINGER_SOFTWARE_mark">&nbsp;...</span>}</font></div>
</div>
</div>
<div>{<span id="9efc202b-9746-4e15-965a-deeee1803fe7" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 3,&nbsp;municipality:&nbsp;Sao Paulo,<span id="8217fb6a-e53e-43fb-b778-0661a2c12843" class="GINGER_SOFTWARE_mark">&nbsp;...</span>}<br>
</div>
<div><br>
</div>
<div>But I expected:</div>
<div><br>
</div>
<div>
<div><span style="color:rgb(56,118,29)">{<span id="b1087731-22c3-46fe-aac5-b47a7149ff3b" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 1,&nbsp;</span><span style="color:rgb(56,118,29)">municipality</span><span style="color:rgb(56,118,29)">:&nbsp;</span><span style="color:rgb(56,118,29)">Rio
 de Janeiro</span><span style="color:rgb(56,118,29)">,<span id="1bcca153-06c5-441a-b07b-128cc02e8006" class="GINGER_SOFTWARE_mark">&nbsp;...</span>},</span><br>
</div>
<div>
<div><font color="#3d85c6">{<span id="66dd30b3-bc93-44d5-821c-e91cd600b01f" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 2,&nbsp;</font><span style="color:rgb(61,133,198)">municipality</span><font color="#3d85c6">:&nbsp;</font><span style="color:rgb(61,133,198)">Rio
 de&nbsp;Janeiro</span><font color="#3d85c6">,<span id="cc097dbb-acf0-44d0-a610-6beda16e8061" class="GINGER_SOFTWARE_mark">&nbsp;...</span>}</font></div>
</div>
<div>{<span id="f6d00885-eaac-4db8-b0c4-f4e8f7697fd4" class="GINGER_SOFTWARE_mark">OccurrenceID</span>: 3,&nbsp;municipality:&nbsp;Sao Paulo,<span id="186d1073-0a9b-410a-851e-07d36737a7d8" class="GINGER_SOFTWARE_mark">&nbsp;...</span>}</div>
</div>
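<div><br>
</div>
<div>As a minimal sketch of the behaviour expected above (this is not the biocache implementation): if records are stored under their occurrenceID, the second load overwrites records 1 and 2 and adds record 3. The load function and the in-memory store are hypothetical and only illustrate the idea.</div>
<pre>
```python
def load(store, records, key_field="occurrenceID"):
    """Insert each record, or overwrite the one already stored under the same key."""
    for record in records:
        store[record[key_field]] = record
    return store


load_1 = [
    {"occurrenceID": "1", "municipality": "Sao Paulo"},
    {"occurrenceID": "2", "municipality": "Sao Paulo"},
]
load_2 = [
    {"occurrenceID": "1", "municipality": "Rio de Janeiro"},
    {"occurrenceID": "2", "municipality": "Rio de Janeiro"},
    {"occurrenceID": "3", "municipality": "Sao Paulo"},
]

store = {}
load(store, load_1)
load(store, load_2)

# Three records remain: 1 and 2 were updated, 3 was added.
for key in sorted(store):
    print(key, store[key]["municipality"])
```
</pre>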
<div><br>
</div>
<div>Do I need to delete the existing data (delete-resource function) before the reload? If not,&nbsp;what did I do wrong to cause this data duplication?</div>
<div><br>
</div>
<div>Thanks!&nbsp;</div>
<div><br>
</div>
<div><br>
</div>
<div>Regards,</div>
<div><br>
</div>
<div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
Daniel Lins da Silva</div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
(Mobile)&nbsp;<a href="tel:55%2011%2096144-4050" value="&#43;5511961444050" target="_blank">55 11 96144-4050</a></div>
<div style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
<div class="gmail_extra"><font color="#000000">Research Center&nbsp;<span id="94fa7a93-d70b-4b91-9671-29699a404506" class="GINGER_SOFTWARE_mark">on</span>&nbsp;Biodiversity and Computing (Biocomp)</font></div>
<div class="gmail_extra"><font color="#000000">University of Sao Paulo, Brazil</font></div>
</div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
<a href="mailto:daniellins@usp.br" target="_blank">daniellins@usp.br</a></div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
<a href="mailto:daniel.lins@gmail.com" target="_blank">daniel.lins@gmail.com</a></div>
</div>
<div><br>
</div>
<div><br>
</div>
<div>&nbsp;</div>
</div>
<div class="gmail_extra"><br>
<br>
<div class="gmail_quote">2014-05-07 0:46 GMT-03:00 <span dir="ltr">&lt;<a href="mailto:David.Martin@csiro.au" target="_blank">David.Martin@csiro.au</a>&gt;</span>:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex; border-left:1px #ccc solid; padding-left:1ex">
<div>
<div style="direction:ltr; font-family:Tahoma; color:#000000; font-size:10pt">
<div>
<div>Thanks Daniel. Natasha has now left the ALA.</div>
<div><br>
</div>
<div>The uniqueness of records is determined by information stored in the collectory. See screenshot [1].</div>
<div><span style="font-size:10pt">By default, &quot;catalogNumber&quot; is used but you can change this to any number of fields that should be stable in the data.&nbsp;</span></div>
<div>Using unstable fields for the ID isn't recommended (e.g. scientificName). &nbsp;To update the records, the process is to just re-load the dataset.</div>
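<div><br>
</div>
<div>A minimal sketch of why stable key fields matter. It assumes the key is formed by joining the data resource UID and the key field values with "|"; that format, the unique_key helper and the catalogNumber value are assumptions for illustration, and the real key format may differ.</div>
<pre>
```python
def unique_key(resource_uid, record, key_terms):
    """Join the data resource UID and the key field values into one string."""
    values = [record[term] for term in key_terms]
    return "|".join([resource_uid] + values)


record = {"occurrenceID": "1", "catalogNumber": "A-100", "municipality": "Sao Paulo"}
updated = dict(record, municipality="Rio de Janeiro")

# A stable key field gives the same key before and after the update,
# so a re-load can find and overwrite the existing record.
stable_before = unique_key("dr0", record, ["occurrenceID"])
stable_after = unique_key("dr0", updated, ["occurrenceID"])
print(stable_before)                  # prints dr0|1
print(stable_before == stable_after)  # prints True

# An unstable key field changes with the data, so the updated record
# looks like a new record and the old one is left in place.
unstable_before = unique_key("dr0", record, ["municipality"])
unstable_after = unique_key("dr0", updated, ["municipality"])
print(unstable_before == unstable_after)  # prints False
```
</pre>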
<div><br>
</div>
<div>Automatically loaded - this isn't in use and we may remove it from the UI in future iterations.</div>
<div>Incremental Load - affects the sample/process/index steps to only run these against the new records. &nbsp;Load is always incremental based on the key field(s) but if the incremental load box isn’t checked it runs the sample/process/index steps against the
 whole data set. This can cause a large processing overhead when there’s a minor update to a large data set.</div>
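<div><br>
</div>
<div>The effect of the Incremental Load box can be sketched as follows. This only illustrates the description above and is not biocache code; records_to_process and the key values are hypothetical.</div>
<pre>
```python
def records_to_process(all_keys, new_keys, incremental):
    """Return the keys that the sample/process/index steps would cover."""
    if incremental:
        # Only the records added by the latest load.
        return sorted(new_keys)
    # Every record in the data set, changed or not.
    return sorted(all_keys)


all_keys = {"dr0|1", "dr0|2", "dr0|3"}
new_keys = {"dr0|3"}

print(records_to_process(all_keys, new_keys, incremental=True))
print(records_to_process(all_keys, new_keys, incremental=False))
```
</pre>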
<div><br>
</div>
<div>Cheers</div>
<div><br>
</div>
<div>Dave Martin</div>
</div>
<div>ALA</div>
<div><br>
</div>
<div>[1]&nbsp;<a href="http://bit.ly/1g72HFN" style="font-size:10pt" target="_blank">http://bit.ly/1g72HFN</a></div>
<div><br>
</div>
<div style="font-family:Times New Roman; color:#000000; font-size:16px">
<hr>
<div style="direction:ltr"><font face="Tahoma" color="#000000"><b>From:</b> Daniel Lins [<a href="mailto:daniel.lins@gmail.com" target="_blank">daniel.lins@gmail.com</a>]<br>
<b>Sent:</b> 05 May 2014 15:39<br>
<b>To:</b> Quimby, Natasha (CES, Black Mountain)<br>
<b>Cc:</b> <a href="mailto:ala-portal@lists.gbif.org" target="_blank">ala-portal@lists.gbif.org</a>; dos Remedios, Nick (CES, Black Mountain); Martin, Dave (CES, Black Mountain); Pedro Corrêa<br>
<b>Subject:</b> Re: [Ala-portal] DwC-A loading problems<br>
</font><br>
</div>
<div>
<div class="h5">
<div></div>
<div>
<div dir="ltr">
<div>
<div>Hi Natasha,&nbsp;</div>
<div><br>
</div>
<div>I managed to import the DwC-A file following the steps reported in the previous email. Thank you!</div>
<div><br>
</div>
<div>However, when I tried to update some metadata of an occurrence record (already stored in the database), the system created a new record with this duplicated information. So I now have several records with the same
<span>occurrenceID</span> (I set the data resource configuration to use &quot;OcurrenceID&quot; to uniquely identify a record).</div>
<div><br>
</div>
<div>How can I update existing records in the database? For instance, the location metadata of an occurrence record stored in my database?</div>
<div><br>
</div>
<div>I would also like to better understand the behavior of the properties &quot;Automatically loaded&quot; and &quot;Incremental Load&quot;.&nbsp;</div>
</div>
<div><br>
</div>
<div>Thanks!!</div>
<div><br>
</div>
<div>Regards,</div>
<div><br>
</div>
<div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
Daniel Lins da Silva</div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
(Mobile)&nbsp;<a href="tel:55%2011%2096144-4050" value="&#43;5511961444050" target="_blank">55 11 96144-4050</a></div>
<div style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
<div class="gmail_extra"><font color="#000000">Research Center&nbsp;<span>on</span>&nbsp;Biodiversity and Computing (Biocomp)</font></div>
<div class="gmail_extra"><font color="#000000">University of Sao Paulo, Brazil</font></div>
</div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
<a href="mailto:daniellins@usp.br" target="_blank">daniellins@usp.br</a></div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px; color:rgb(136,136,136)">
<a href="mailto:daniel.lins@gmail.com" target="_blank">daniel.lins@gmail.com</a></div>
</div>
<div class="gmail_extra"><br>
<br>
<div class="gmail_quote">2014-04-28 3:52 GMT-03:00 Daniel Lins <span dir="ltr">&lt;<a href="mailto:daniel.lins@gmail.com" target="_blank">daniel.lins@gmail.com</a>&gt;</span>:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex; border-left-width:1px; border-left-color:rgb(204,204,204); border-left-style:solid; padding-left:1ex">
<div dir="ltr">Thanks Natasha!
<div><br>
</div>
<div>I will try your recommendations. Once finished, I will contact you.</div>
<div><br>
</div>
<div>Regards<br>
</div>
<div class="gmail_extra">
<div><br>
<div class="gmail_extra" style="color:rgb(136,136,136); font-family:arial,sans-serif; font-size:13px">
Daniel Lins da Silva</div>
<div class="gmail_extra" style="color:rgb(136,136,136); font-family:arial,sans-serif; font-size:13px">
(Mobile)&nbsp;<a href="tel:55%2011%2096144-4050" value="&#43;5511961444050" target="_blank">55 11 96144-4050</a></div>
<div style="color:rgb(136,136,136); font-family:arial,sans-serif; font-size:13px">
<div class="gmail_extra"><font color="#000000">Research Center&nbsp;<span><span>on</span></span>&nbsp;Biodiversity and Computing (Biocomp)</font></div>
<div class="gmail_extra"><font color="#000000">University of Sao Paulo, Brazil</font></div>
</div>
<div class="gmail_extra" style="color:rgb(136,136,136); font-family:arial,sans-serif; font-size:13px">
<a href="mailto:daniellins@usp.br" target="_blank">daniellins@usp.br</a></div>
<div class="gmail_extra" style="color:rgb(136,136,136); font-family:arial,sans-serif; font-size:13px">
<a href="mailto:daniel.lins@gmail.com" target="_blank">daniel.lins@gmail.com</a></div>
<div class="gmail_extra" style="color:rgb(136,136,136); font-family:arial,sans-serif; font-size:13px">
<br>
</div>
<div class="gmail_extra" style="color:rgb(136,136,136); font-family:arial,sans-serif; font-size:13px">
<br>
</div>
<br>
</div>
<div class="gmail_quote">2014-04-28 3:26 GMT-03:00 <span dir="ltr">&lt;<a href="mailto:Natasha.Quimby@csiro.au" target="_blank">Natasha.Quimby@csiro.au</a></span><span dir="ltr"></span><span dir="ltr"></span><span dir="ltr"></span><span dir="ltr">&gt;</span><span>:</span><span></span><span></span><span></span>
<div>
<div><br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex; border-left-width:1px; border-left-color:rgb(204,204,204); border-left-style:solid; padding-left:1ex">
<div style="font-size:14px; font-family:Calibri,sans-serif; word-wrap:break-word">
<div>Hi Daniel,</div>
<div><br>
</div>
<div>When you specify a local DwC-A load, the archive needs to be unzipped. Try unzipping&nbsp;<b>2f676abc-4503-489e-8f0c-fcb6e1bc554b.zip</b> and then running the following:</div>
<div>
<div><b>sudo java -cp .:biocache.jar au.org.ala.util.DwCALoader dr7 -l /data/collectory/upload/1398658607824/2f676abc-4503-489e-8f0c-fcb6e1bc554b</b></div>
<div><b><br>
</b></div>
</div>
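<div>If it helps, the unzip step can also be scripted. The following is a minimal sketch using Python's standard zipfile module; unzip_archive and the demo archive are hypothetical, and for the real archive you would pass the path to the uploaded .zip file instead.</div>
<pre>
```python
import tempfile
import zipfile
from pathlib import Path


def unzip_archive(zip_path):
    """Extract zip_path into a sibling directory named after the file stem."""
    zip_path = Path(zip_path)
    target = zip_path.with_suffix("")  # archive.zip becomes archive
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
    return target


# Demo with a throwaway archive so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    demo_zip = Path(tmp) / "demo-archive.zip"
    with zipfile.ZipFile(demo_zip, "w") as archive:
        archive.writestr("meta.xml", "placeholder")
        archive.writestr("occurrence.txt", "occurrenceID\tmunicipality\n1\tSao Paulo\n")
    extracted = unzip_archive(demo_zip)
    names = sorted(path.name for path in extracted.iterdir())

print(names)
```
</pre>
<div>The extracted directory (without the .zip suffix) is then the path to pass to the loader with -l.</div>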
<div>If you configure the collectory to provide the DwC-A, the biocache automatically unzips the archive for you. &nbsp;You would need to configure dr7 with the following connection parameters:</div>
<div><br>
</div>
<div>&quot;protocol&quot;:&quot;DwCA&quot;,</div>
<div>&quot;termsForUniqueKey&quot;:[&quot;occurrenceID&quot;],</div>
<div>&quot;url&quot;:&quot;file:////data/collectory/upload/1398658607824/2f676abc-4503-489e-8f0c-fcb6e1bc554b.zip&quot;</div>
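<div><br>
</div>
<div>Put together as one JSON object, the parameters above would look like the output of this small sketch. It only assembles and prints the three values listed above; how the collectory stores them internally is not shown here, and the variable name is hypothetical.</div>
<pre>
```python
import json

# The three connection parameters listed above, as one object.
connection_parameters = {
    "protocol": "DwCA",
    "termsForUniqueKey": ["occurrenceID"],
    "url": "file:////data/collectory/upload/1398658607824/2f676abc-4503-489e-8f0c-fcb6e1bc554b.zip",
}

print(json.dumps(connection_parameters, indent=2))
```
</pre>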
<div><b><br>
</b></div>
<div>You could then load the resource by:</div>
<div>
<div><b>sudo java -cp .:biocache.jar au.org.ala.util.DwCALoader dr7</b></div>
<div><b><br>
</b></div>
</div>
<div>If you continue to have issues, please let us know.&nbsp;</div>
<div><br>
</div>
<div>Hope that this helps.</div>
<div><br>
</div>
<div>Regards</div>
<div>Natasha</div>
<div><br>
</div>
<span>
<div style="border-width:1pt medium medium; border-style:solid none none; padding:3pt 0in 0in; text-align:left; font-size:11pt; font-family:Calibri; border-top-color:rgb(181,196,223)">
<span style="font-weight:bold">From: </span>Daniel Lins &lt;<a href="mailto:daniel.lins@gmail.com" target="_blank">daniel.lins@gmail.com</a>&gt;<br>
<span style="font-weight:bold">Date: </span>Monday, 28 April 2014 3:54 PM<br>
<span style="font-weight:bold">To: </span>&quot;<a href="mailto:ala-portal@lists.gbif.org" target="_blank">ala-portal@lists.gbif.org</a>&quot; &lt;<a href="mailto:ala-portal@lists.gbif.org" target="_blank">ala-portal@lists.gbif.org</a>&gt;,
 &quot;dos Remedios, Nick (CES, Black Mountain)&quot; &lt;<a href="mailto:Nick.Dosremedios@csiro.au" target="_blank">Nick.Dosremedios@csiro.au</a>&gt;, &quot;Martin, Dave (CES, Black Mountain)&quot; &lt;<a href="mailto:David.Martin@csiro.au" target="_blank">David.Martin@csiro.au</a>&gt;<br>
<span style="font-weight:bold">Subject: </span>[Ala-portal] DwC-A loading <span><span>problems</span></span><br>
</div>
<div>
<div>
<div><br>
</div>
<blockquote style="border-left:#b5c4df 5 solid; padding:0 0 0 5; margin:0 0 0 5">
<div>
<div>
<div dir="ltr">Hi Nick and Dave,
<div><br>
</div>
<div>We are having some problems in Biocache during the upload of DwC-A files.</div>
<div><br>
</div>
<div>
<div>As shown below, after running &quot;au.org.ala.util.DwCALoader&quot;, our system returns the error message
<font color="#ff0000">&quot;Exception in thread &quot;main&quot; org.gbif.dwc.text.UnkownDelimitersException: Unable to detect field delimiter&quot;</font></div>
<div><br>
</div>
<div>I ran tests using DwC-A files with tab-delimited and comma-delimited text files. In both cases the same error was generated.</div>
</div>
<div><br>
</div>
<div>What causes these problems? (Note: the CSV loader works fine.)<br>
</div>
<div><br>
</div>
<div><b><i><span><span><span>tab</span></span></span>-delimited file test</i></b></div>
<div><b><br>
</b></div>
<div>
<div>poliusp@poliusp-VirtualBox:~/dev/biocache$ <b>sudo java -cp .:biocache.jar au.org.ala.util.DwCALoader dr7 -l /data/collectory/upload/1398658607824/2f676abc-4503-489e-8f0c-fcb6e1bc554b.zip</b></div>
<div>2014-04-28 01:44:02,837 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] - Loading configuration from /data/biocache/config/biocache-config.properties</div>
<div>2014-04-28 01:44:03,090 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] - Initialise SOLR</div>
<div>2014-04-28 01:44:03,103 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] -
<span><span><span>Initialise</span></span></span> name matching indexes</div>
<div>2014-04-28 01:44:03,605 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] - Initialise persistence manager</div>
<div>2014-04-28 01:44:03,606 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] - Configure complete</div>
<div>Loading archive /data/<span><span><span>collectory</span></span></span>/upload/1398658607824/2f676abc-4503-489e-8f0c-fcb6e1bc554b<span><span><span>.</span></span></span>zip for resource dr7 with unique terms List<span><span><span>(</span></span></span><span><span><span>dwc</span></span></span><span><span><span>:</span></span></span><span><span><span>occurrenceID</span></span></span>)
 stripping spaces false incremental false testing <span><span><span>false</span></span></span></div>
<div><b>Exception in thread &quot;main&quot; org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span><span><span><span>dwc</span></span></span><span><span><span>.</span></span></span>text<span><span><span>.</span></span></span>UnkownDelimitersException:
 Unable to detect field delimiter</b></div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span>file<span><span><span>.</span></span></span>CSVReaderFactory<span><span><span>.</span></span></span><span><span><span>buildArchiveFile</span></span></span><span><span><span>(</span></span></span>CSVReaderFactory.java<span><span><span>:</span></span></span>129)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span>file<span><span><span>.</span></span></span>CSVReaderFactory<span><span><span>.</span></span></span>build<span><span><span>(</span></span></span>CSVReaderFactory.java<span><span><span>:</span></span></span>46)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span><span><span><span>dwc</span></span></span><span><span><span>.</span></span></span>text<span><span><span>.</span></span></span>ArchiveFactory<span><span><span>.</span></span></span><span><span><span>readFileHeaders</span></span></span><span><span><span>(</span></span></span>ArchiveFactory.java<span><span><span>:</span></span></span>344)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span><span><span><span>dwc</span></span></span><span><span><span>.</span></span></span>text<span><span><span>.</span></span></span>ArchiveFactory<span><span><span>.</span></span></span><span><span><span>openArchive</span></span></span><span><span><span>(</span></span></span>ArchiveFactory.java<span><span><span>:</span></span></span>289)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> au.org.ala.util.DwCALoader.loadArchive(DwCALoader.scala:129)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> au.org.ala.util.DwCALoader.loadLocal(DwCALoader.scala:106)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> au.org.ala.util.DwCALoader$.main(DwCALoader.scala:52)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> au.org.ala.util.DwCALoader.main(DwCALoader.scala)</div>
<div><br>
</div>
<div><br>
</div>
<div><b><i><span><span><span>comma</span></span></span>-delimited file test</i></b><br>
</div>
<div><br>
</div>
<div>
<div>poliusp@poliusp-VirtualBox:~/dev/biocache$ <b>sudo java -cp .:biocache.jar au.org.ala.util.DwCALoader dr7 -l ./<span><span><span>dwca</span></span></span>-teste3<span><span><span>.</span></span></span>zip</b></div>
<div>2014-04-28 01:56:04,683 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] - Loading configuration from /data/biocache/config/biocache-config.properties</div>
<div>2014-04-28 01:56:04,940 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] - Initialise SOLR</div>
<div>2014-04-28 01:56:04,951 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] -
<span><span><span>Initialise</span></span></span> name matching indexes</div>
<div>2014-04-28 01:56:05,437 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] - Initialise persistence manager</div>
<div>2014-04-28 01:56:05,438 INFO<span><span><span> :</span></span></span> [<span><span><span>ConfigModule</span></span></span>] - Configure complete</div>
<div>Loading archive<span><span><span> .</span></span></span>/<span><span><span>dwca</span></span></span>-teste3<span><span><span>.</span></span></span>zip for resource dr7 with unique terms List<span><span><span>(</span></span></span><span><span><span>dwc</span></span></span><span><span><span>:</span></span></span><span><span><span>occurrenceID</span></span></span>)
 stripping spaces false incremental false testing <span><span><span>false</span></span></span></div>
<div><b>Exception in thread &quot;main&quot; org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span><span><span><span>dwc</span></span></span><span><span><span>.</span></span></span>text<span><span><span>.</span></span></span>UnkownDelimitersException:
 Unable to detect field delimiter</b></div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span>file<span><span><span>.</span></span></span>CSVReaderFactory<span><span><span>.</span></span></span><span><span><span>buildArchiveFile</span></span></span><span><span><span>(</span></span></span>CSVReaderFactory.java<span><span><span>:</span></span></span>129)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span>file<span><span><span>.</span></span></span>CSVReaderFactory<span><span><span>.</span></span></span>build<span><span><span>(</span></span></span>CSVReaderFactory.java<span><span><span>:</span></span></span>46)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span><span><span><span>dwc</span></span></span><span><span><span>.</span></span></span>text<span><span><span>.</span></span></span>ArchiveFactory<span><span><span>.</span></span></span><span><span><span>readFileHeaders</span></span></span><span><span><span>(</span></span></span>ArchiveFactory.java<span><span><span>:</span></span></span>344)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> org<span><span><span>.</span></span></span><span><span><span>gbif</span></span></span><span><span><span>.</span></span></span><span><span><span>dwc</span></span></span><span><span><span>.</span></span></span>text<span><span><span>.</span></span></span>ArchiveFactory<span><span><span>.</span></span></span><span><span><span>openArchive</span></span></span><span><span><span>(</span></span></span>ArchiveFactory.java<span><span><span>:</span></span></span>289)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> au.org.ala.util.DwCALoader.loadArchive(DwCALoader.scala:129)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> au.org.ala.util.DwCALoader.loadLocal(DwCALoader.scala:106)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> au.org.ala.util.DwCALoader$.main(DwCALoader.scala:52)</div>
<div>&nbsp; &nbsp; &nbsp; &nbsp; <span><span><span>at</span></span></span> au.org.ala.util.DwCALoader.main(DwCALoader.scala)</div>
</div>
<div><br>
</div>
<div><br>
</div>
<div>Thanks!</div>
<div><br>
</div>
<div>Regards.</div>
-- <br>
<div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px">Daniel Lins da Silva</div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px">(Mobile)&nbsp;<a href="tel:55%2011%2096144-4050" value="&#43;5511961444050" target="_blank">55 11 96144-4050</a></div>
<div style="font-family:arial,sans-serif; font-size:13px">
<div class="gmail_extra"><font color="#000000">Research Center&nbsp;<span><span><span>on</span></span></span>&nbsp;Biodiversity and Computing (Biocomp)</font></div>
<div class="gmail_extra"><font color="#000000">University of Sao Paulo, Brazil</font></div>
</div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px"><a href="mailto:daniellins@usp.br" target="_blank">daniellins@usp.br</a></div>
<div class="gmail_extra" style="font-family:arial,sans-serif; font-size:13px"><a href="mailto:daniel.lins@gmail.com" target="_blank">daniel.lins@gmail.com</a></div>
</div>
<div><br>
</div>
</div>
</div>
</div>
</div>
</blockquote>
</div>
</div>
</span></div>
</blockquote>
</div>
</div>
</div>
<span><font color="#888888"><br>
<br clear="all">
<div><br>
</div>
-- <br>
<div>Daniel Lins da Silva<br>
(<span>Cel</span>) 11 6144-4050</div>
<div><a href="mailto:daniel.lins@gmail.com" target="_blank">daniel.lins@gmail.com</a><br>
</div>
</font></span></div>
</div>
</blockquote>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</blockquote>
</div>
</div>
</div>
</div>
</div>
</div>
</body>
</html>