GeoPortal REST Harvesting

This harvester will connect to a remote GeoPortal version 9.3.x server and retrieve metadata records that match the query parameters specified using the GeoPortal REST API.

Adding a GeoPortal REST harvester

../../../_images/web-harvesting-geoportal-rest.png

Adding a GeoPortal REST harvester

The figure above shows the options available:

  • Site - Options about the remote site.

    • Name - This is a short description of the remote site. It will be shown in the harvesting main page as the name for this instance of the GeoPortal REST harvester.
    • Base URL - The base URL of the GeoPortal server to be harvested. eg. http://yourhost.com/geoportal. The harvester will add the additional path required to access the REST services on the GeoPortal server.
    • Icon - An icon to assign to harvested metadata. The icon will be used when showing harvested metadata records in the search results.
  • Search criteria - Using the Add button, you can add several search criteria. You can query any field on the GeoPortal server using the Lucene query syntax described at http://webhelp.esri.com/geoportal_extension/9.3.1/index.htm#srch_lucene.htm.

  • Options - Scheduling options.

  • Run at - The time when the harvester will run.
  • Will run again every - Choose an interval from the drop down list and then select the days for which this scheduling will take place.
  • One run only - Checking this box will cause the harvester to run only when manually started using the Run button on the Harvesting Management page.
  • Harvested Content - Options that are applied to harvested content.
    • Apply this XSLT to harvested records - Choose an XSLT here that will convert harvested records to a different format. See notes section below for typical usage.
    • Validate - If checked, the metadata will be validated after retrieval. If the validation does not pass, the metadata will be skipped.
  • Privileges - Assign privileges to harvested metadata.
  • Groups - Groups can be selected from the scrolling list. When the Add button is pushed, a row of privileges will be created below the scrolling list for each group. Privileges can then be checked/unchecked for each group as required.
  • Remove - To remove a row click on the Remove button on the right of the row.
  • Categories
  • Select one or more categories from the scrolling list. The harvested metadata will be assigned to the selected categories.

Notes

  • in order to be successfully harvested, metadata records retrieved from the remote site must match a metadata schema in the local GeoNetwork instance
  • this harvester uses two REST services from the GeoPortal API:
  • rest/find/document with searchText parameter to return an RSS listing of metadata records that meet the search criteria (maximum 100000)
  • rest/document with id parameter from each result returned in the RSS listing
  • this harvester has been tested with GeoPortal 9.3.x. It should be used for that version of GeoPortal in preference to the CSW harvester
  • typically ISO19115 metadata produced by the Geoportal software will not have a ‘gmd’ prefix for the namespace http://www.isotc211.org/2005/gmd. GeoNetwork XSLTs will not have any trouble understanding this metadata but will not be able to map titles and codelists in the viewer/editor. To fix this problem, please select the Add-gmd-prefix XSLT for the Apply this XSLT to harvested records in the Harvested Content set of options described earlier