Main Page

B-HIT: Berlin Harvesting and Indexing Toolkit

Check out our publication on B-HIT: [Kelbert et al. 2015 PLoS ONE].

B-HIT is based on the GBIF HIT (Harvesting and Indexing Toolkit) and is used to harvest data for GGBN, BiNHum and WorldFloraOnline. The old documentation from GBIF can be accessed here.

It has been actively extended for the BiNHum and GGBN projects in order to support:

  • multiple identifications (without duplication)
  • unit associations
  • ABCD2.1
  • ABCD-Archives
  • extended multimedia fields
  • Supported schemas and protocols:
    • ABCD: 2.06, 2.1, EFG, GGBN, GGBN Enviro, ABCD - Archive
    • DwC: DwC 1.0, 1.4, 1.4-Geospatial, 1.4-Curatorial, MaNIS 1.0, MaNIS 1.21, DwC Archive, DwC GGBN

The underlying technology has also been updated to address security issues (latest Java Spring).

The latest version can be downloaded from our SVN repository: http://ww2.biocase.org/svn/binhum/trunk/BinHum/Harvester/
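
As a minimal sketch, the download could be done with the Subversion command-line client. This assumes `svn` is installed and that the repository allows anonymous read access. The function name `checkout_bhit` and the directory name `bhit-harvester` are hypothetical, chosen only for illustration.

```shell
# Repository URL as given on this page.
REPO_URL="http://ww2.biocase.org/svn/binhum/trunk/BinHum/Harvester/"

# Hypothetical default name for the local working copy.
TARGET_DIR="bhit-harvester"

# Check out the harvester sources into the given directory
# (falls back to TARGET_DIR when no argument is passed).
checkout_bhit() {
  svn checkout "$REPO_URL" "${1:-$TARGET_DIR}"
}
```

Calling `checkout_bhit` with no argument would create the working copy in `bhit-harvester`. Running `svn update` inside that directory later fetches newer revisions.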

[Figure: Overview.jpg]

Table of Contents

Beginner's Guide
What B-HIT is all about.
Installation
How to install and use it.
Indexed fields
Detailed list of ABCD and DwC fields that are indexed.
Database
How the database is organized: what is stored where, how, and why.
Edit the parser, index new elements
How to add new elements to the index and how to edit the parser.
Debug
A few debugging hints.