Settings hierarchy

Introduction

GeoNetwork stores configuration settings in the Settings table. The settings are organized into a ‘tree’ structure where the leaf nodes of the tree are key/value pairs.

A key is limited to 64 characters whilst a value is effectively unlimited in most databases supported by GeoNetwork.

The root of the tree contains two nodes:

  • system: contains nodes that describe system configuration settings
  • harvesting: contains nodes that describe the settings used by harvesters

In the following sections:

  • the indentation is used to show hierarchy
  • names in bold represent keys
  • the datatype of the value is shown in parenthesis after the key.
  • an italic font is used to indicate types: string, integer, boolean, enumerated (values separated by a ‘|’ represent the set of allowed values).
  • a missing datatype means that the value of the node is not used.
  • square brackets indicate cardinality. If they are missing, a cardinality of [1..1] should be inferred.

System node

Some examples of system configuration settings nodes contained in the system node are shown in the hierarchy below.

  • system: information on system configuration settings
  • site: Contains information about the site
  • name (string): Name used to present this site to other sites. Used to fill comboboxes or lists.
  • organisation (string): Name of the organization/company/institute that is running GeoNetwork
  • siteId (string): A UUID that uniquely identifies the site. It is generated by the installer.
  • platform: Contains information about the current software version
  • version (string): GeoNetwork’s version in the X.Y.Z format
  • subVersion (string): A small extra description about the version, like ’alpha-1’, ’beta’ etc...
  • server: Used when it is necessary to build absolute URLs to the GeoNetwork server. This is the case, for example, when creating links inside a metadata record or when providing CSW capabilities.
  • host (string): HTTP server address
  • port (integer): HTTP server port (can be empty which means port 80)
  • Intranet: specify the network address details of the Intranet
  • network (string): Intranet address
  • netmask (string): Intranet netmask
  • z3950: A container for Z39.50 server parameters
  • enable (boolean): If true, GeoNetwork will start the Z30.50 server
  • port (integer): The port opened by GeoNetwork to listen to Z39.50 requests. Usually 2100.
  • proxy: This container specifies the proxy configuration for GeoNetwork to use when making outgoing connections
  • use (boolean): If true, GeoNetwork will use the given proxy for outgoing connections
  • host (string): Proxy host name
  • port (integer): Proxy host port
  • username (string): Proxy credentials.
  • password (string): Proxy credentials.
  • feedback: email messages sent via web feedback form or when downloading a resource.
  • email (string): email address of person who will receive feedback
  • mailServer: This container configures the mail server that will be used to send email
    • host (string): Address of the SMTP server to use
    • port (string): SMTP port on server to use
  • removedMetadata: This container contains settings about removed metadata.
  • dir: This folder will contain removed metadata in MEF format. It gets populated when the user deletes a metadata using the web interface.
  • LDAP: Parameters for LDAP authentication by GeoNetwork
  • use (boolean): If true, GeoNetwork will use LDAP authentication for usernames and passwords
  • host (string): LDAP host
  • port (integer): Port number on LDAP host
  • defaultProfile (string): Default GeoNetwork profile to use when the profile user attribute does not exist in the LDAP attributes.
  • login - optional credentials used to obtain access to the LDAP service
    • userDN (string)
    • password (string)
  • distinguishedNames - optional LDAP info used by GeoNetwork to locate user credentials
    • base (string)
    • users (string)
  • userAttribs: A container for user attributes present into the LDAP directory that must be retrieved and used to create the user in GeoNetwork.
    • name (string)
    • password (string)
    • profile (string)

Harvesting nodes

All harvesters share a common set of nodes, which are retrieved by the harvesting engine for all harvesters. These common nodes are described in the hierarchy below.

  • harvesting
  • node [0..n] (geonetwork|csw|arcsde|filesystem|geonetwork20|oaipmh| ogcwxs|thredds|webdav|wfsfeatures|z3950|z3950Config|geoPREST): Type of harvesting node
    • site: A container for site information.
      • name (string): Name of harvest instance (shown in the harvester user interface).
      • uuid (string): A unique identifier assigned by the system when the harvesting node is created.
      • useAccount (boolean): Indicates if the harvester has to authenticate to access the data.
        • username (string): credentials to be used for authentication
        • password (string): credentials to be used for authentication
    • options:
      • every (string): Harvesting schedule - similar to the Unix CRON format
      • oneRunOnly (boolean): If true, the harvester will harvest just once and then it will set the status to inactive.
      • status (active|inactive): Indicates if the harvesting from this node is stopped (inactive) or if the harvester is waiting for the next scheduled harvest (active).
    • content: A container for options relating to processing content after harvesting but before storage in the GeoNetwork database
      • importxslt (string): Name of XSLT to apply to metadata records before being stored and indexed in GeoNetwork (note: not every harvester supports this option - see notes section for each harvester in the harvester settings descriptions below). A list of the possible XSLTs that can be used for this parameter can be obtained from the xml.harvesting.info service - see importStylesheets.
      • validate (boolean): If true, the harvester will validate the metadata record against the matching metadata schema in GeoNetwork. (note: not every harvester supports this option - see notes section for each harvester in the harvester settings descriptions below)
    • privileges [0..1]: This is a container for privileges to assign to each harvested metadata record.
      • group (integer) [0..n]: A local group. The value is the local identifier of the group. There can be several group nodes each with its own set of privileges.
        • operation (integer) [0..n]: Privilege to assign to the group. The value is the numeric id of the operation like 0=view, 1=download, 2=edit etc...
    • categories [0..1]: This is a container for categories to assign to each harvested metadata record.
      • category (integer) [0..n]: A local category. The value is the local identifier of the category.
    • info: Container for status information about harvesting from this node.
      • lastRun (string): When the harvester was last run. The value is the current time in milliseconds since 1 January, 1970. If empty then the harvester has not yet been run.

Settings for each of the different harvesters that show the additional elements specific to those harvesters are in the following sections.

Harvesting node geonetwork

This is the native harvesting supported by GeoNetwork 2.1 and above.

  • node (string): geonetwork
    • site: Contains host and account information
      • host (string) eg. http://localhost:8080/geonetwork
      • createRemoteCategory (boolean) True: If local category exists with same name as the remote category, then assign harvested metadata to that category. False: Ignore categories.
      • mefFormatFull (boolean) True: harvest remote metadata as a full MEF file. ie. metadata plus thumbnails and data files uploaded with metadata.
      • importXslt (string) Name and parameters of a metadata processing XSLT - see Metadata Processing services for more details.
    • search [0..n]: Contains the search parameters. If this element is missing, an unconstrained search will be performed.
      • freeText (string) - free text search on remote instance
      • title (string) - search title on remote instance
      • abstract (string) - search abstract on remote instance
      • keywords (string) - search keywords on remote instance
      • digital (boolean) - search for records marked ‘digital’ on remote instance
      • hardcopy (boolean) - search for records marked ‘hardcopy’ on remote instance
      • sourceUuid (string) - search for records that come from source with this uuid on the remote instance
      • sourceName (string) - search for records that come from source with this name on the remote instance
    • groupsCopyPolicy [0..n]: Represents a copy policy for a remote group. It is used to maintain remote privileges on harvested metadata.
      • name (string): Internal name (not localised) of a remote group.
      • policy (string): Copy policy. For the group all, policies are: copy, copyToIntranet. For all other groups, policies are: copy, createAndCopy. The Intranet group is not considered.

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services.

Example of GeoNetwork harvester settings in XML::

<node id="954" type="geonetwork">
 <site>
   <name>test</name>
   <uuid>683bfc02-73e2-4100-a601-369936b6f82a</uuid>
   <account>
     <use>true</use>
     <username>admin</username>
     <password>admin</password>
   </account>
   <host>http://localhost:8080/geonetwork</host>
   <createRemoteCategory>true</createRemoteCategory>
   <mefFormatFull>true</mefFormatFull>
   <xslfilter />
 </site>
 <content>
   <validate>true</validate>
   <importxslt>none</importxslt>
 </content>
 <options>
   <every>0 0 0/3 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
 </options>
 <searches>
   <search>
     <freeText>Maps</freeText>
     <title></title>
     <abstract></abstract>
     <keywords></keywords>
     <digital>false</digital>
     <hardcopy>true</hardcopy>
     <source>
       <uuid>b7ce20f2-888a-4139-8802-916730c4be06</uuid>
       <name>My GeoNetwork catalogue</name>
     </source>
   </search>
 </searches>
 <categories>
   <category id="5" />
 </categories>
 <groupsCopyPolicy />
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Notes

  • this harvester does not use the content/importXslt setting - instead a more sophisticated approach using the metadata processing services can be applied through the importXslt setting - for more details and an example see Metadata Processing services for more details.

Harvesting node webdav

This harvester type is capable of connecting to a web server which is WebDAV enabled (WebDAV is WEB Distributed Authoring and Versioning) or WAF (Web Accessible Folder) enabled.

  • node (string): webdav
    • site: Contains the URL to connect to and account information
      • url (string): URL to connect to. Must be well formed, starting with http://, file:// or a supported protocol.
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.
    • options
      • recurse (boolean): Indicates if the remote folder must be recursively scanned for metadata.
      • validate (boolean): If set, the harvester will validate the metadata against its schema and the metadata will be harvested only if it is valid.
      • subtype (waf|webdav): Indicates the type of server being harvested

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services.

Example of WEBDAV/WAF harvester settings in XML::

<node id="1039" type="webdav">
 <site>
   <name>burbble</name>
   <uuid>b64e006a-a26c-4268-85ec-0bf0450be966</uuid>
   <account>
     <use>true</use>
     <username>apples</username>
     <password>oranges</password>
   </account>
   <url>http://davtest.com/webdav</url>
   <icon>webdav.gif</icon>
 </site>
 <content>
   <validate>true</validate>
   <importxslt>none</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
   <validate>true</validate>
   <recurse>true</recurse>
   <subtype>webdav</subtype>
 </options>
 <privileges>
   <group id="1">
     <operation name="view" />
   </group>
 </privileges>
 <categories>
   <category id="2" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Notes

  • this harvester does not use the content/importXslt setting

Harvesting node csw

This harvester type is capable of querying an OGC Catalogue Services for the Web (CSW) server and harvesting metadata.

  • node (string): csw
    • site
      • capabUrl (string): URL of the GetCapabilities statement. This will be used to retrieve the operations and server address.
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.
    • search [0..n]: Contains search parameters. If this element is missing, an unconstrained search will be performed.
      • freeText (string)
      • title (string)
      • abstract (string)
      • subject (string)
      • minscale (integer) - minimum scale denominator
      • maxscale (integer) - maximum scale denominator

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of CSW harvester settings in XML::

<node id="1122" type="csw">
 <site>
   <name>blonks</name>
   <uuid>723953dc-1308-4905-abc0-9869f18af132</uuid>
   <account>
     <use>true</use>
     <username>frogmouth</username>
     <password>tawny</password>
   </account>
   <capabilitiesUrl>http://cswserver.com/services/csw?request=GetCapabilities&amp;service=CSW</capabilitiesUrl>
   <icon>csw.gif</icon>
 </site>
 <content>
   <validate>false</validate>
   <importxslt>none</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
 </options>
 <searches>
   <search>
     <freeText>cobblers</freeText>
     <title />
     <abstract />
     <subject />
     <minscale>25000</minscale>
     <maxscale>400000</maxscale>
   </search>
 </searches>
 <privileges>
   <group id="1">
     <operation name="view" />
   </group>
 </privileges>
 <categories>
   <category id="2" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Notes

  • this harvester does not use the content/importXslt setting

Harvesting node z3950

This harvester type is capable of querying one or more Z3950 servers and harvesting metadata.

  • node (string): z3950
    • site
      • query (string): Z3950 query in Prefix Query Format (mandatory). See the discussion on PQF in the Z3950 Harvester section of the Users Manual and/or http://www.indexdata.com/yaz/doc/tools.html#PQF.
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.
      • repositories
        • repository [0..n]: Contains the Z3950 repository ids that will be harvested. Z3950 repository ids in GeoNetwork can be obtained through the Site Information (xml.info) service.

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of Z3950 harvester settings in XML::

<node id="1090" type="z3950">
 <site>
   <name>testz3950</name>
   <uuid>8554ec2b-2b80-4f9e-9d9d-028e2407ee89</uuid>
   <account>
     <use>true</use>
     <username />
     <password />
   </account>
   <query>@attrset geo @attr 1=1016 @attr 4=6 @attr 2=3 "water"</query>
   <icon>Z3950.gif</icon>
   <repositories>
     <repository id="act" />
     <repository id="geonetwork-auscope" />
     <repository id="product" />
     <repository id="publication" />
     <repository id="source" />
     <repository id="geonetwork-aims" />
   </repositories>
 </site>
 <content>
   <validate>false</validate>
   <importxslt>none</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
 </options>
 <privileges>
   <group id="1">
     <operation name="view" />
     <operation name="dynamic" />
     <operation name="featured" />
   </group>
 </privileges>
 <categories>
   <category id="2" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Harvesting node oaipmh

This harvester type is capable of querying an OAIPMH (Open Archives Initiative Protocol for Metadata Harvesting version 2.0) server and harvesting metadata.

  • node (string): oaipmh
    • site
      • url (string): OAIPMH server URL
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.
    • options
      • validate (boolean): Validate metadata before saving to database. This option will be deprecated in favour of content/validate.
    • search [0..n]: Contains search parameters which because they are constrained to follow the OAIPMH version 2.0 standard, are curiously limited to a date range, prefix and metadata set name. If this element is missing, an unconstrained search will be performed.

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of OAIPMH harvester settings in XML::

<node id="1152" type="oaipmh">
 <site>
   <name>oaitest</name>
   <uuid>0a3e4a4a-4c4c-4f82-860f-551b6cf12341</uuid>
   <account>
     <use>true</use>
     <username />
     <password />
   </account>
   <url>http://localhost:6060/joai</url>
   <icon>oai-mhp.gif</icon>
 </site>
 <content>
   <validate>false</validate>
   <importxslt>{IMPORTXSLT}</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
   <validate>false</validate>
 </options>
 <searches>
   <search>
     <from>2013-01-08</from>
     <until>2013-01-30</until>
     <set>iso19139</set>
     <prefix>gmd</prefix>
     <stylesheet />
   </search>
 </searches>
 <privileges>
   <group id="1">
     <operation name="view" />
   </group>
 </privileges>
 <categories>
   <category id="2" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Notes

  • this harvester does not use the content/importXslt setting
  • this harvester does not use the content/validate setting - this will change

Harvesting node thredds

This harvester type is capable of harvesting metadata from a THREDDS (Thematic Real-time Environmental Data Directory Service). The metadata fragments are:

  • transformed into ISO19115/19139 (or profile) metadata fragments
  • copied or linked into an ISO19115/19139 (or profile) template

The harvester supports metadata fragments following the Unidata Data Discovery Conventions (http://www.unidata.ucar.edu/software/netcdf-java/formats/DataDiscoveryAttConvention.html) from two types of THREDDS datasets:

  • collections: essentially datasets that contain other datasets (like directories in a filesystem tree)
  • atomics: datasets that do not contain other datasets (like files in a filesystem tree)

The settings for a THREDDS harvester type are:

  • node (string): thredds
    • site
      • url (string): THREDDS catalog server URL - should always be the URL of an XML THREDDS catalog eg. http://motherlode.ucar.edu:8080/thredds/catalog/satellite/3.9/WEST-CONUS_4km/catalog.xml - HTML URLs will not work!
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.
    • options
      • lang (string) - three letter language code describing language of metadata being harvested (almost always eng) - this value is used to set the metadata language in the records harvested from THREDDS
      • topic (string) - ISO topic category describing the harvested metadata
      • createThumbnails (boolean) - If true and the THREDDS catalog delivers WMS images for a dataset whose metadata is being harvested then a thumbnail will be created from the WMS for the harvested metadata
      • createServiceMd (boolean) - If true then a skeleton ISO19119 service metadata record will be created for the services specified in the THREDDS catalog.
      • createCollectionDatasetMd (boolean) - If true then metadata records will be created using harvested metadata fragments from collection datasets in the THREDDS catalog.
      • collectionGeneration (fragments|default) - Use fragments. default refers to DIF metadata extraction which will be deprecated.
      • outputSchemaOnCollectionsFragments (string) Metadata schema of ISO records that will be created from THREDDS catalog collection metadata. A list of possible values for this parameter can be obtained from the xml.harvesting.info service, see threddsFragmentSchemas.
      • collectionMetadataTemplate (integer) Template record that harvested fragments will copied or linked into to create metadata records. Use http://your_geonetwork/geonetwork/srv/eng/xml.info?type=templates to get a list of template metadata record ids that can be used for this setting.
      • createCollectionSubtemplates (boolean) - If true then subtemplates are created from the metadata fragments harvested from the THREDDS catalog datasets. The subtemplates are stored in the GeoNetwork database and linked into the collectionMetadataTemplate template record to create the harvested metadata records. If false, then subtemplates are not created, instead the harvested fragments are copied into the collectionMetadataTemplate record to create normal metadata records.
      • collectionFragmentStylesheet (string) Stylesheet that turns collection metadata fragments in the THREDDS catalog into ISO metadata fragments. A list of possible values for this parameter can be obtained from the xml.harvesting.info service, see threddsFragmentStylesheets.
      • createAtomicDatasetMd (boolean) - If true then metadata records will be created using harvested metadata fragments from atom datasets in the THREDDS catalog.
      • atomicGeneration (fragments|default) - Use fragments. default refers to DIF metadata extraction which will be deprecated.
      • outputSchemaOnAtomicFragments (string) Metadata schema of ISO records that will be created from THREDDS catalog atomic metadata. A list of possible values for this parameter can be obtained from the xml.harvesting.info service, see threddsFragmentSchemas.
      • atomicMetadataTemplate (integer) Template record that harvested fragments will copied or linked into to create metadata records. Use http://your_geonetwork/geonetwork/srv/eng/xml.info?type=templates to get a list of template metadata record ids that can be used for this setting.
      • createAtomicSubtemplates (boolean) - If true then subtemplates are created from the metadata fragments harvested from the THREDDS catalog atoms. The subtemplates are stored in the GeoNetwork database and linked into the atomicMetadataTemplate template record to create the harvested metadata records. If false, then subtemplates are not created, instead the harvested fragments are copied into the atomicMetadataTemplate record to create normal metadata records.
      • atomicFragmentStylesheet (string) Stylesheet that turns atom metadata fragments in the THREDDS catalog into ISO metadata fragments. A list of possible values for this parameter can be obtained from the xml.harvesting.info service, see threddsFragmentStylesheets.

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of THREDDS harvester settings in XML:

<node id="977" type="thredds">
 <site>
   <name>test thredds with motherlode</name>
   <uuid>b3307257-6b2a-48e7-80f5-74acadeff66f</uuid>
   <account>
     <use>true</use>
     <username />
     <password />
   </account>
   <url>http://motherlode.ucar.edu:8080/thredds/catalog/satellite/3.9/WEST-CONUS_4km/catalog.xml</url>
   <icon>thredds.gif</icon>
 </site>
 <content>
   <validate>false</validate>
   <importxslt>{IMPORTXSLT}</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
   <lang>eng</lang>
   <topic>environment</topic>
   <createThumbnails>true</createThumbnails>
   <createServiceMd>true</createServiceMd>
   <createCollectionDatasetMd>true</createCollectionDatasetMd>
   <createAtomicDatasetMd>true</createAtomicDatasetMd>
   <ignoreHarvestOnAtomics>true</ignoreHarvestOnAtomics>
   <atomicGeneration>fragments</atomicGeneration>
   <modifiedOnly>true</modifiedOnly>
   <atomicFragmentStylesheet>thredds-metadata.xsl</atomicFragmentStylesheet>
   <atomicMetadataTemplate>7</atomicMetadataTemplate>
   <createAtomicSubtemplates>true</createAtomicSubtemplates>
   <outputSchemaOnAtomicsDIF />
   <outputSchemaOnAtomicsFragments>iso19139</outputSchemaOnAtomicsFragments>
   <ignoreHarvestOnCollections>true</ignoreHarvestOnCollections>
   <collectionGeneration>fragments</collectionGeneration>
   <collectionFragmentStylesheet>thredds-metadata.xsl</collectionFragmentStylesheet>
   <collectionMetadataTemplate>7</collectionMetadataTemplate>
   <createCollectionSubtemplates>true</createCollectionSubtemplates>
   <outputSchemaOnCollectionsDIF />
   <outputSchemaOnCollectionsFragments>iso19139</outputSchemaOnCollectionsFragments>
   <datasetCategory>2</datasetCategory>
 </options>
 <privileges>
   <group id="1">
     <operation name="view" />
   </group>
 </privileges>
 <categories>
   <category id="3" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Notes

  • this harvester does not use the content/importXslt setting
  • this harvester does not use the content/validate setting

Harvesting node wfsfeatures

This harvester type is capable of querying and harvesting metadata from the GetFeature response from an OGC Web Feature Service (WFS). The metadata fragments are:

  • transformed into ISO19115/19139 (or profile) metadata fragments
  • copied or linked into an ISO19115/19139 (or profile) template

The settings for a WFS Features harvester type are:

  • node (string): wfsfeatures
    • site
      • url (string): OGC WFS service URL. eg. http://localhost:7070/deegree/services - the harvester adds the necessary parameters to call the GetFeature service with the query provided from the options.
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.
    • options
      • lang (string) - three letter language code describing language of metadata being harvested (almost always eng) - this value is used to set the metadata language in the records harvested from THREDDS
      • query (string) - An OGC WFS GetFeature query
      • outputSchema (string) Metadata schema of ISO records that will be created from WFS GetFeature harvester. A list of possible values for this parameter can be obtained from the xml.harvesting.info service, see wfsFragmentSchemas.
      • stylesheet (string) Stylesheet that turns WFS Features into ISO metadata fragments. A list of possible values for this parameter can be obtained from the xml.harvesting.info service, see wfsFragmentStylesheets.
      • streamFeatures (boolean) - If true then responses will be saved to disk and features will be extracted using the streaming XML parser. Use for large WFS GetFeatures responses.
      • createSubtemplates (boolean) - If true then subtemplates are created from the metadata fragments harvested from the WFS GetFeatures response. The subtemplates are stored in the GeoNetwork database and linked into the templateId template record to create the harvested metadata records. If false, then subtemplates are not created, instead the harvested fragments are copied into the templateId record to create normal metadata records.
      • templateId (integer) Template record that harvested fragments will copied or linked into to create metadata records. Use http://your_geonetwork/geonetwork/srv/eng/xml.info?type=templates to get a list of template metadata record ids that can be used for this setting.
      • recordsCategory (integer) Category id of GeoNetwork category that will be assigned to the metadata records created by the harvester. Use http://your_geonetwork/geonetwork/srv/eng/xml.info?type=categories to get a list of category ids that can be used for this setting.

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of WFS Features harvester settings in XML:

<node id="1201" type="wfsfeatures">
 <site>
   <name>test wfs</name>
   <uuid>588ea4fa-a105-40fd-9697-8082d23cc967</uuid>
   <account>
     <use>true</use>
     <username />
     <password />
   </account>
   <url>http://localhost:6060/deegree/services</url>
   <icon>wfs.gif</icon>
 </site>
 <content>
   <validate>false</validate>
   <importxslt>{IMPORTXSLT}</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
   <lang>eng</lang>
   <query>&lt;wfs:GetFeature service="WFS" version="1.1.0"
       xmlns:wfs="http://www.opengis.net/wfs"&gt;
    &lt;wfs:Query typeName="gboundaries"/&gt;&lt;/wfs:GetFeature&gt;</query>
   <outputSchema>iso19139</outputSchema>
   <stylesheet>geoserver_boundary_fragments.xsl</stylesheet>
   <streamFeatures>false</streamFeatures>
   <createSubtemplates>true</createSubtemplates>
   <templateId>2</templateId>
   <recordsCategory>2</recordsCategory>
 </options>
 <privileges>
   <group id="1">
     <operation name="view" />
     <operation name="dynamic" />
     <operation name="featured" />
   </group>
 </privileges>
 <categories>
   <category id="2" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Notes

  • this harvester does not use the content/importXslt setting
  • this harvester does not use the content/validate setting

Harvesting node filesystem

This harvester type is capable of querying and harvesting metadata (filename *.xml) from a directory tree on a filesystem accessible to the GeoNetwork server.

The settings for the filesystem harvester type are:

  • node (string): filesystem
    • site
      • directory (string): This is the filesystem directory (on the server) from which metadata files (*.xml) will be harvested.
      • recurse (boolean): If true, then any directories in the fileystem directory will be checked for *.xml files (and any directories in those directories etc etc).
      • nodelete (boolean): If true, then metadata from previous executions of the filesystem harvester will be left in the catalog (ie. they will not be deleted)
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of Filesystem harvester settings in XML:

<node id="1234" type="filesystem">
 <site>
   <name>/Users filesystem harvester</name>
   <uuid>c7b7660a-337a-4aa1-843b-648280ad8d86</uuid>
   <account>
     <use>false</use>
     <username />
     <password />
   </account>
   <directory>/Users</directory>
   <recurse>true</recurse>
   <nodelete>true</nodelete>
   <icon>filesystem.gif</icon>
 </site>
 <content>
   <validate>true</validate>
   <importxslt>DIF-to-ISO19139.xsl</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
 </options>
 <searches />
 <privileges>
   <group id="1">
     <operation name="view" />
   </group>
 </privileges>
 <categories>
   <category id="2" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Harvesting node arcsde

This harvester type is capable of harvesting metadata from an arcsde server.

The settings for the arcsde harvester type are:

  • node (string): arcsde
    • site
      • server (string): This is the name of the arcde server.
      • port (integer): Port number of arcsde server.
      • username (string): arcsde username
      • password (string): arcsde password
      • database (string): Name of arcsde database
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of arcsde harvester settings in XML:

<node id="1259" type="arcsde">
 <site>
   <name>test arcsde harvester</name>
   <uuid>5b5070d3-464a-484e-941a-e56812e46431</uuid>
   <account>
     <use>false</use>
     <username />
     <password />
   </account>
   <server>arcsde_server.esri.com</server>
   <port>5151</port>
   <username>sde_user</username>
   <password>sde_password</password>
   <database>esri_sde</database>
   <icon>esri.gif</icon>
 </site>
 <content>
   <validate>true</validate>
   <importxslt>none</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>false</oneRunOnly>
   <status>inactive</status>
 </options>
 <searches />
 <categories>
   <category id="4" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Notes

  • this harvester does not use the content/importXslt setting
  • ArcSDE java API libraries need to be installed by the user in GeoNetwork (folder INSTALL_DIR/web/geonetwork/WEB-INF/lib), as these are proprietary libraries not distributed with GeoNetwork. The following jars are required:
  • jpe_sdk.jar
  • jsde_sdk.jar

Note

dummy-api-XXX.jar must be removed from INSTALL_DIR/web/geonetwork/WEB-INF/lib

Harvesting node ogcwxs

This harvester type is capable of harvesting metadata from the GetCapabilities response of OGC services.

The settings for an OGCWxS harvester type are:

  • node (string): ogcwxs
    • site
      • url (string): OGC WxS service URL entry point. eg. http://localhost:7070/deegree/services - the harvester adds the necessary parameters to call the GetCapabilities service.
      • ogctype (WMS1.0.0|WMS1.1.1|WMS1.3.0|WFS1.0.0|WFS1.1.0| |WCS1.0.0|WPS0.4.0|WPS1.0.0|CSW2.0.2|SOS1.0.0): Type and version of OGC service being harvested.
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.
    • options
      • lang (string) - three letter language code describing language of metadata being harvested - this value is used to set the metadata language in the records created by this harvester. Use http://your_geonetwork/geonetwork/srv/eng/xml.info?type=isolanguages to get a list of language codes that can be used for this setting.
      • topic (string) - ISO Topic Category. List of values comes from codelist of ISO topic category element (gmd:topicCategory).
      • createThumbnails (boolean) - if true and service is a WMS then thumbnails will be created for dataset metadata, if false then thumbnails will not be created
      • useLayer (boolean) - Create metadata for layer elements using GetCapabilities information
      • useLayerMd (boolean) - Create metadata for layer elements using remote metadata provided in the MetadataURL attribute (if the remote metadata doesn’t exist/isn’t recognized then GetCapabilities info will be used).
      • datasetCategory - (integer) Category id of GeoNetwork category that will be assigned to the dataset metadata records created by the harvester. Use http://your_geonetwork/geonetwork/srv/eng/xml.info?type=categories to get a list of category ids that can be used for this setting.
      • outputSchema (string) Metadata schema of ISO records that will be created from OGCWxS GetCapabilities harvester. A list of possible values for this parameter can be obtained from the xml.harvesting.info service, see ogcwxsOutputSchemas.

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of OGCWxS GetCapabilities harvester settings in XML:

<node id="1138" type="ogcwxs">
 <site>
   <name>test ogcwxs</name>
   <uuid>87852766-536a-40b7-875a-3ab5ff24fa78</uuid>
   <account>
     <use>false</use>
     <username />
     <password />
   </account>
   <url>http://localhost:6060/deegree2/services</url>
   <ogctype>WFS1.1.0</ogctype>
   <icon>default.gif</icon>
 </site>
 <content>
   <validate>false</validate>
   <importxslt>none</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>true</oneRunOnly>
   <status>inactive</status>
   <lang>eng</lang>
   <topic>elevation</topic>
   <createThumbnails>true</createThumbnails>
   <useLayer>true</useLayer>
   <useLayerMd>true</useLayerMd>
   <datasetCategory>2</datasetCategory>
   <outputSchema>iso19139</outputSchema>
 </options>
 <privileges>
   <group id="1">
     <operation name="view" />
     <operation name="dynamic" />
     <operation name="featured" />
   </group>
 </privileges>
 <categories>
   <category id="3" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
 </info>
</node>

Harvesting node geoPREST

This harvester type is capable of querying a GeoPortal 9.3.x site and harvesting metadata using the GeoPortal REST API.

  • node (string): geoPREST
    • site
      • baseUrl (string): Base URL of the GeoPortal server site. eg. http://yoursite.com/geoportal. REST URLs will be built from this URL.
      • icon (string): This is the icon that will be used as the metadata source logo. The image is taken from the images/harvesting folder and copied to the images/logos folder. A list of the possible values that can be used for this parameter can be obtained from the xml.harvesting.info service - see icons.
    • search [0..n]: Contains search parameters. If this element is missing, no results will be returned. You can use the Lucene query syntax documented at http://webhelp.esri.com/geoportal_extension/9.3.1/index.htm#srch_lucene.htm.
      • freeText (string)

XML Example

Harvester settings in XML are used by the harvester services - see Harvesting services. Example of geoPREST harvester settings in XML::

<node id="978" type="geoPREST">
 <site>
   <name>qldtest</name>
   <uuid>4b9de966-1d5f-4133-97da-f4232fb7761a</uuid>
   <account>
     <use>true</use>
     <username />
     <password />
   </account>
   <baseUrl>http://qspatial.information.qld.gov.au/geoportal</baseUrl>
   <icon>default.gif</icon>
 </site>
 <content>
   <validate>false</validate>
   <importxslt>none</importxslt>
 </content>
 <options>
   <every>0 0 0 ? * *</every>
   <oneRunOnly>true</oneRunOnly>
   <status>inactive</status>
 </options>
 <searches>
   <search>
     <freeText>Barrier Reef</freeText>
   </search>
   <search>
     <freeText>Cadastral</freeText>
   </search>
 </searches>
 <privileges>
   <group id="1">
     <operation name="view" />
     <operation name="dynamic" />
     <operation name="featured" />
   </group>
 </privileges>
 <categories>
   <category id="2" />
 </categories>
 <info>
   <lastRun />
   <running>false</running>
   <result>
     <total>119</total>
     <added>87</added>
     <updated>32</updated>
     <unchanged>0</unchanged>
     <unknownSchema>0</unknownSchema>
     <removed>0</removed>
     <unretrievable>0</unretrievable>
     <badFormat>0</badFormat>
     <doesNotValidate>0</doesNotValidate>
   </result>
 </info>
</node>

Notes

  • this harvester uses two REST services from the GeoPortal API:
  • rest/find/document with searchText parameter to return an RSS listing of metadata records that meet the search criteria
  • rest/document with id parameter from each result returned in the RSS listing
  • this harvester has been tested with GeoPortal 9.3.x. It should be used for that version of GeoPortal in preference to the CSW harvester.