Re: [IPT] GBIF Integrated Publishing Toolkit

20 Sep 2010

      Dear Jonas,

Thank you for your questions.  Answers given inline:
...
Dear Tim
Thanks for the information, and the great work on the IPT. I think  
it will be a good move, just seen from my perspective, to  
concentrate on the core functionality for better performance.
Thank you for this feedback - this is important to help steer the IPT  
process.
...
After having done some promotion and held some training and  
installation sessions of the IPT under the GBIF - NordGen evaluation  
project I see that the general interest in the genebank  
documentation community is quite strong and the installation and  
configuration is very user-friendly. We just this week had a  
workshop in Estonia introducing the IPT and the GBIF network to  
people from genebanks in Estonia, Latvia and Lithuania and the  
response was very good.
To be able to make the most out of these training sessions, there  
will be a few more during the remainder of this year, I have a  
couple of questions about the coming version:
if the GUI for mapping imported dataset fields to the DwC / DwC  
extension fields will be significantly different?
It will be slightly different, but familiar enough for people who  
understand the older version to pick it up easily.  We received  
feedback during the IPT experts training event last year, and have  
implemented their recommendations.  In short, the terms are simply  
provided in a long list.  I attach a screenshot (dwc-mapping.png)  
showing the mapping page for occurrences.

Please remember that we will initiate a testing phase in October, and  
there will be the ability to comment on this and have it revised if  
there are problems.
...
if the mapping through SQL / direct database connection will work  
significantly different?
No changes here.  You create the SQL statement and then do a mapping  
the same, regardless of whether it is a SQL source or text file.
...
if it will be possible to host multiple resources registered under  
different GBIF nodes on the same IPT webapp or if all resources on  
one IPT webapp must be under the same GBIF node?
The revised version will handle this:

The IPT instance is registered with the Institution/Organisation/Node  
to which it is associated.
The ADMIN user can then associate further Institution/Organisation/ 
Nodes from which data MANAGERs can choose when registering a data  
RESOURCE.
Therefore the following could be handled:

- ADMIN in Node1 installs the IPT
- ADMIN enables Node2 and Institution3 as available entities for  
MANAGERs to use
- ADMIN creates accounts for MANAGERs
- MANAGERs can log in and create resources
- when a MANAGER creates a resource they see a choice of Node1, Node2  
or Institution3 to associate the resource with

In addition to this, the ADMIN can provide an alias to each  
Organisation/Node/Institution, so that MANAGERs see something  
familiar.  For example the GBIF Registry might have "The Botanical  
holdings of the Harvard Herbarium Museum" but internally the ADMIN and  
MANAGERs all call this "HUH".  The ADMIN will be able to declare this  
and MANAGERs will simply see "HUH" in the choice of Institutions to  
associate the resource with.

I hope this helps, and thanks again for the feedback,
Tim
...
Best regards
Jonas Nordling, NordGen
From: trobertson@gbif.org
To: jonas.nordling@nordgen.org
Sent: Mon, 13 Sep 2010 16:10:36 +0200
Subject: GBIF Integrated Publishing Toolkit
[FOR INFORMATION]
Dear Node Managers and Data Publishers,
The objective of this communication is to update you on the current  
situation of the GBIF Integrated Publishing Toolkit (IPT). The IPT  
is a tool providing publishing capabilities for primary biodiversity  
occurrence data, taxonomic checklists and the associated metadata  
for these resource types.
Since its first introduction in March 2009, the IPT has been used  
with some success by a limited number of users to publish primary  
biodiversity data into the GBIF network, along with descriptive  
metadata at the dataset level.
The GBIF Secretariat (GBIFS) has solicited and received a vast  
amount of feedback during the first year of the IPT use, and would  
like to thank all those who provided this. The feedback received  
overwhelmingly confirms that the concept of the IPT is sound.  
However, feedback has also made it clear that it is over-specified  
for the majority of user needs, is not yet sufficiently robust, is  
too slow in operation, and (due to the excessive functionality) has  
high server requirements creating a barrier for many to adopt and  
use it. In addition, the feedback has made it clear that our  
communications surrounding the status of the IPT releases have been  
too infrequent and unclear concerning which release candidate we  
were offering and what functionalities this release candidate would  
include or not include, as per user needs/expectations.
Based upon the extensive feedback GBIFS has revised our planned  
development roadmap for the IPT and are currently performing a major  
refactoring effort to:
• Reduce the server requirements significantly
• Increase the data import performance by removing the embedded  
database
• Remove the dependency on heavy libraries and tools such as Geoserver
• Remove all data interfaces that are not necessary for data  
publication through GBIF
• Improve the robustness of the tool
When complete, the refactored IPT will offer through an intuitive  
interface:
• Authoring of metadata according to the GBIF metadata profile
• Import and mapping of checklist and occurrence data either from  
file upload or by connection of a database
• Registration to and publication onto the GBIF network
• Importing and mapping of data to the DarwinCore, and DarwinCore  
extensions
• Output formats of EML (2.1.0) metadata and DarwinCore Archive
• Improved customisation options for the “About this IPT”
• Improved GBIF Registry integration
• Improved DarwinCore extension and vocabulary organization
• Improved management of the organisations to which the IPT and  
resources are related, to enable co-hosting capabilities
• Enhanced dataset metadata authoring
Whilst this is a reduction in existing functionality this has been  
deemed necessary to ensure the IPT is meeting the core requirements  
as expressed by the user community. The following features will be  
removed and only reintroduced if necessary through consultation with  
the IPT community:
• TAPIR interface
• TCS output format
• Search and browse web interface
• OGC web services
All IPT development efforts are now focused on this revised roadmap,  
and it is anticipated that early testing of the new codebase will  
commence in October 2010 with the involvement of willing Nodes. The  
release of a fully stable IPT is targeted for the end of 2010.  
However, this will only occur if the testing community is satisfied  
that the product is ‘release-ready’ together with the required user  
manual and technical documentation. Should that not be the case, as  
it may take more time to implement and test the improvements  
outlined above, you will be informed as soon as possible.
Further developments of the IPT beyond this release will only be  
implemented after a further scoping with the GBIF community. An area  
for further IPT discussion will be set up on the GBIF community  
website (http://community.gbif.org)
If you have any questions or would like to know more about the GBIF  
IPT and/or revised process, feel free to contact:
Tim Robertson
GBIF Information Systems Architect
trobertson@gbif.org
With best regards,
Tim Robertson
GBIF Secretariat

Tim Robertson (GBIF)

tags

participants (1)