Search service configuration parameters
Learn about the portal search service parameters and possible values.
To configure a portal search service, use the parameters
given in the following list. For details about how to set the values
for these parameters, read Managing search services or the Manage Search administration portlet
help.
Notes for configuring a search service:
- Unless otherwise stated, the values that you set for parameters of a portal search service apply to that search service and all its collections. They do not affect other search services of the portal or their search collections.
- Unless otherwise stated, changing the value of a parameter apply to both the existing search collections and newly created search collections. Some parameters affect only newly created search collections. These parameters cannot be updated for existing search collections.
- The search administration portlet Manage Search lists the Default Portal Search Service and its collection Portal Content or other collections in the default portal language. It does not list these items in the language that the user selected as preferred language for the portal or set in the browser. Example: The portal default language is set to English and the user selected German as the preferred portal language or as the browser language. In this case, the Default Portal Search Service and its collections show in English.
- SOAP support for remote search services was deprecated with WebSphere® Portal Version 8.0. EJB is still supported.
- If you delete a search service, the portal does not delete the search collections that are related to this search service. Delete the search collections by using the Manage Search administration portlet. If you delete the default search service, it is re-created new when you restart the portal.
Notes related to the search service configuration parameter list:
- The parameter list in both the Search Services pane of the Manage Search portlet and in the following information shows several parameters that end with the suffix _EXAMPLE. These example parameters are not used by the portal. They serve as an example for the same parameter without the suffix _EXAMPLE. They give an example value that you might use. Deleting these parameters or modifying their value has no effect.
- If you want to set a parameter that is listed here, but not listed in the portlet, add it. To add a parameter, type the parameter and the value in the entry fields Parameter key: and New parameter value: and click Add Parameter.
- In the following list, the abbreviation pse in parameters or values stands for Portal Search Engine.
- The following list is arranged in alphabetical order. Parameters might be listed in a different order in the portlet.
- boostingSettings
- Use this parameter to specify which metadata fields are given extra weight in an overall rank
score during a search. You can also specify how much the selected metadata fields contribute to
relevance circulation when you run a search. Specify the following values:
- fieldBoost
- This value defines which metadata fields have extra weight when search results are returned, and
how much extra weight is given to the specified fields. Provide the following attributes:
- field
- The relevant string-based metadata field that you want search to focus on. Some common or default field values are title, description, and keywords.
- boost
- A factor that increases the relevancy for matches in the specified metadata field. This value can be set between a range of 1.0 and 10.0. However, it is good practice to set it between a range of 1.0 and 3.0. Value 1.0 means no extra relevancy.
- phraseBoost
- This parameter is not mandatory. If enabled, it improves the ranking of a document when the search terms are found as a phrase in that document. For instance, if the user is searching for "John Smith" and a document contains exactly that phrase, this document gets a higher ranking. However, if that document contains "Smith, John", then the ranking is not improved.
- CLEAN_UP_TIME_OF_DAY_HOURS
- Time of day at which the portal runs the maintenance process for search collections to remove
outdated files and broken links. Possible values are positive integers 0 - 24 for the full hours of
the day. The default value is 0, which runs the cleanup at
midnight.Note: If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
- DefaultCollectionsDirectory
- You can use this parameter to specify the default directory for search collections. If you use Portal Search locally, this parameter is optional. If you specify no value for this parameter, the default collection directory is wp_profile_root/PortalServer/collections. If you set up a remote search service, this parameter is mandatory. For details about setting this parameter, read Configuring the default location for search collections.
- DEFAULT_SEARCH_OPERATOR
- Use this parameter to specify how the Portal search engine responds
to search queries with two or more terms. The default value
is or. Only one search term
must be in the document in order for that document to be displayed
in the search results list. Change this value to and to
retrieve only those documents that contain all of the search
terms that are listed in the query. Note: After you change this parameter, you must restart the portal server and remote search service.
- CONFIG_FOLDER_PATH
- Use this parameter to determine where the configuration data for search collections is stored. The default is wp_profile_root/CollectionsConfig.
- EJB
- If you set up a remote search service by using EJB, use this parameter
to specify the EJB name in JNDI. An example value is ejb/com/ibm/hrl/portlets/WsPse/WebScannerLiteEJBHome .
If you set this parameter, you also need to set the IIOP_URL parameter.
- EJB_Example
- This parameter is an example that gives an example value for the parameter EJB . The example value is ejb/com/ibm/hrl/portlets/WsPse/WebScannerLiteEJBHome .
- ExternalSecurityResolverUrl
- Use this parameter to configure the Portal Search service with the information about an external security resolver. This parameter is required for security filtering of Connections resources to function properly. An example value of the resolver URL is https://host:port/ConnectionsResourceId/seedlist/authverify/getACLTokens where ConnectionsResourceID is any Connections resource identifier.
- HTTP_MAX_BODY_SIZE_MB
- Use this parameter to limit how much content is fetched during
a crawl from application files, such as PDF or Microsoft Word. The
specified unit is MB. The default value is 20 MB.
If a file exceeds the specified limit, the document is truncated,
and Portal Search indexes the fetched portion as is possible. However,
indexing might fail on truncated documents. In this case, the document
is not listed under search results at all. Notes:
- If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
- Document Conversion Services might not be able to convert the content of truncated application files. If Document Conversion Services fails to convert a truncated application file, it logs an error to the SystemErr.log file. If the tracing feature tracing is enabled for the portal, Portal Search logs a warning message to the portal log file.
- HTTP_MAX_SEEDLIST_SIZE_MB
- This parameter limits how much portal content is fetched during
a crawl from your own portal site. It determines the amount of space
that is reserved for listing portal site resources or managed web
content resources. The specified unit is MB. The default value is 4 MB.
If a crawl exceeds the limit set for this parameter, the crawl fails,
and Portal Search logs an error message. In this case, or if returned
search results do not represent to complete extent of your portal
site resources, increase this value.Note: If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
- HTTP_NON_APPL_MAX_BODY_SIZE_MB
- Use this parameter to limit how much content of each HTML page
is fetched from websites of collections that belong to this search
service. The specified unit is MB. The default value is 0.2 MB. This value means that the amount of
content that is sent for indexing is always the first 0.2 MB of text.Note: If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
- IIOP_URL
- If you set up a remote search service by using EJB, use this parameter to specify the IIOP URL. An example value is iiop://localhost:2811.
- IIOP_URL_Example
- This example gives an example value for the parameter IIOP_URL. The example value is iiop://localhost:2811.
- PSE_TYPE
- Use this parameter to specify the type of search service. Possible
values are localhost, ejb, and soap.
The default value is localhost for
local search service.
If you use Portal Search locally, this parameter is optional.
If you set up a remote search, this parameter is mandatory. In this case, specify the type of remote service that you use, EJB, or SOAP. If you specify ejb here, you also need to specify the values for the parameters EJB and IIOP_URL. If you specify soap here, you also need to specify the values for the parameter SOAP_URL.
- SEARCH_SECURITY_MODE
- This parameter defines access control enforcement during search.
Three filter modes are supported. Specify one of the following values,
depending on the filter mode that you want to use:
- SECURITY_MODE_PREFILTER
- Specify this value to use pre-filtering mode. Pre-filtering provides
the fastest filtering, as it is performed in the search index
level. An extra advantage of this filtering mode is that
remote secured content sources can be searched from portal.
However, it is based on search index only. The search result
list can be temporarily inconsistent with user access rights
if these access rights were changed after the last crawl:
- Example 1:
- A user's access rights were restricted after the last crawl. In this case, the use might get search results listed to which the user had access before, but to which the user no longer has access. When the user clicks such a link in the search result list, the user cannot access the document.
- Example 2:
- A user was given access rights on documents after the last crawl. In this case, the user will not get these documents listed among the search results until after the next crawl.
Note: If the search service contains Portal content (a collection that contains a content source of type Portal site), then this security mode is invalid and must not be used. - SECURITY_MODE_POSTFILTER
- Post-filtering
- SECURITY_MODE_PRE_POST_FILTER
- Pre-post-filtering
- SEEDLIST_PAGE_TIMEOUT
- Use this parameter to increase the timeout for fetching the seedlist
page. The specified unit for the value is seconds. The default value
is 150 seconds. This value means that the portal search attempts to
fetch the seedlist main URL for 150 seconds. Note: If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
- SOAP_URL
- If you set up a remote search service by using SOAP, use this parameter to specify the SOAP URL. An example value is http://localhost:10000/WebScannerSOAP/servlet/rpcrouter.
- SOAP_URL_Example
- This example gives an example value for the parameter SOAP_URL. The example value is http://localhost:10000/WebScannerSOAP/servlet/rpcrouter.
- dateFieldPattern
- By default, portal search does not know
whether a field contains a date. Use this parameter to enable
search for documents by date. A regular expression is used to
check whether a field must be handled as a date field or not.
The default pattern is ".*date$", and matches all
fields that end with the word "date".Note: After you change this parameter, you must restart the Portal server and remote search service.
- dateFormat
- Specify the format that is used for date
queries. The default is yyyy-MM-dd. You can
specify a different format by using the Java date syntax with
the exception that spaces cannot be used, since that would break
the date range queries. Make sure to communicate any changes
in format to search users. To verify that the format is supported,
you can enable tracing for com.ibm.lotus.search.index.lucene.search.PseSiapiQueryParser=all and
then perform a series of searches. Do not forget to disable the trace
after you verify that the format is supported. Note: After you change this parameter, you must restart the Portal server and remote search service.
- dateTimeFormat
- Specify the format that is used for date
queries with a time part. The default is yyyy-MM-dd,hh:mm.
You can specify a different format by using the Java date syntax
with the exception that spaces cannot be used, since that would
break the date range queries. Make sure to communicate any changes
in format to search users. The typical letters that are used
in the format are listed here:
- yyyy
- Specifies the year.
- MM
- Specifies the numerical month in the year. For instance, the month of December would be represented by 12.
- dd
- Specifies the numerical day in the month.
- hh
- Specifies the hour in the day. By default, the hour is specified in the 24-hour format. For instance, the number 18 specifies 6 PM. However, you can specify a custom format that uses the 12-hour time format instead.
- mm
- Specifies the minute in the hour.
- ss
- Specifies the second in the minute.
- Z
- Specifies the time zone. For example, -0800.
Note: After you change this parameter, you must restart the Portal server and remote search service. - dateFormatLocale
- This parameter specifies the locale that
is used when a date is parsed. The portal default locale is used as
the default value. Note: After you change this parameter, you must restart the portal server and the remote search service.
The following parameters are reserved for internal use only. Do not change their values.
- CONTENT_SOURCE_TYPE_FEATURE_NAME
- This parameter is reserved for internal use only. Do not change its value. The default value is ContentSourceType.
- CONTENT_SOURCE_TYPE_FEATURE_VAL_PORTAL
- This parameter is reserved for internal use only. Do not change its value. The default value is Portal.
- CONTENT_SOURCE_TYPE_FEATURE_VAL_WEB
- This parameter is reserved for internal use only. Do not change its value. The default value is Web.
- SecurityResolverId
- This parameter is reserved for internal use only. Do not change its value. The default value is com.ibm.lotus.search.plugins.provider.core.PortalSecurityResolverFactory.
- SetProperties
- This parameter is reserved for internal use only. Do not change its value. Possible values are on or off. The default value is on.
- startup
- This parameter is reserved for internal use only. Do not change its value. The default value is false.
- VALIDATE_COOKIE
- This parameter is reserved for internal use only. Do not change its value. The default value is 123.
- WORK_MANAGER
- You can use this parameter to specify the work manager. This parameter is reserved for internal use only. Do not change its value. The default value is wps/searchIndexWM.
- WORK_MANAGER_DEPLOY
- This parameter is an example of the deployed WORK_MANAGER parameter. The example value is wps/searchIndexWM.
- WORK_MANAGER_NATIVE
- This parameter is an example of the parameter WORK_MANAGER for native threads for debug purposes only. The example value is force.hrl.work.manager.use.native.threads.
- WORK_MANAGER_NAME
- This parameter specifies the JNDI name of the work manager that Portal Search uses.