Exporting XML files from BPS

From reBiND Documentation
Revision as of 15:33, 14 January 2015 by AgnesKirchhoff (talk | contribs) (The following steps describe how to do this:)
Jump to: navigation, search

Uploading large data files from BPS to reBiND

After the mapping of the data to the ABCD elements (http://wiki.bgbm.org/bps/index.php/ABCD2Mapping) the data needs to be either exported as XML files from BPS or uploaded directly to reBiND (see the following sections for details).

When the XML data files are large (> 1000 records or abcd:Units) it is currently necessary to export multiple files from the BPS and run an xquery script to join them into a single ABCD file.

The following steps describe how to do this:

1. export the data from BPS using ABCD2 search. After the mapping is complete within the BPS then select the "QueryForms" on the top menu as shown in the screenshot below:


BPS ABCD mapping.PNG


After selecting "QueryForms" click on "ABCD2 search" from the list below the textfield, see the screenshot below. After selecting the "ABCD2 search" the field is automatically filled with the Query tags for the ABCD search. Please remove the <filter> element from the text as shown in the screenshot. Then set the limit to 999 in the following tag: <responseFormat start='0' limit='998'> (the default limit is set to 10). Click the "submit" button. An XML file is displayed in the browser.


BPS ABCD query.PNG


2. Save the file as {dataset name}_1.xml. For the next file click on the "back" button on your browser to return to the query form and modify the tag to <responseFormat start='999' limit='998'> . This retrieves the next block of data records. Press the "submit" button again and then save the XML file as {dataset name}_2.xml ...

3. Continue to export XML files as in step 2, in each case modifying the start (1998, 2997, 3996 and so on). When saving the file naming it with a sequential suffix (_1, _2, _3 ...) until their are no further records. In each step you retrieve 999 records.

4. Create a new project in rebind (see also the instructions here and name it with a unique name e.g. HBGnummeriert (dataset name) and then create a collection called ‘abcd’ by pressing 'Create collection'. Click on 'abcd' to navigate into the new collection. Store all of the individual ABCD files in the 'abcd' collection by pressing 'Upload File' and selecting each file - one at a time until all files are uploaded: e.g. HBGnummeriert _1.XML, HBGnummeriert_2.XML etc.

5. In another browser tab, call the following script using the following URL (replace the final part of the URL with the name of your collection e.g. if your collection is called HBGnummeriert then the URL should end with coll=HBGnummeriert)

http://http://data-rebind.bgbm.org/rebind/join_abcd.xquery?coll=HBGnummeriert