Search service configuration parameters
Learn about the portal search service parameters and possible values.
To configure a portal search service, use the parameters given in the following list. For details about how to set the values for these parameters, read Managing search services or the Manage Search administration portlet help.
Configuring a search service
- Unless otherwise stated, the values that you set for parameters of a portal search service apply to that search service and all its collections. They do not affect other search services of the portal or their search collections.
- Unless otherwise stated, changing the value of a parameter apply to both the existing search collections and newly created search collections. Some parameters affect only newly created search collections. These parameters cannot be updated for existing search collections.
- The search administration portlet Manage Search lists the Default Portal Search Service and its collection Portal Content or other collections in the default portal language. It does not list these items in the language that the user selected as preferred language for the portal or set in the browser. Example: The portal default language is set to English and the user selected German as the preferred portal language or as the browser language. In this case, the Default Portal Search Service and its collections show in English.
- SOAP support for remote search services was deprecated with HCL Digital Experience Version 8.0. EJB is the only type supported for remote search.
- If you delete a search service, the portal does not delete the search collections that are related to this search service. Delete the search collections by using the Manage Search administration portlet. If you delete the default search service, it is re-created new when you restart the portal.
Search service configuration parameter list
The parameter list in both the Search Services pane of the Manage Search portlet and in the following information shows several parameters that end with the suffix _EXAMPLE
. These example parameters are not used by the portal. They serve as an example for the same parameter without the suffix _EXAMPLE
. They give an example value that you might use. Deleting these parameters or modifying their value has no effect.
If you want to set a parameter that is listed here, but not listed in the portlet, add it. To add a parameter, type the parameter and the value in the entry fields Parameter key: and New parameter value: and click Add Parameter.
In the following list, the abbreviation pse in parameters or values stands for Portal Search Engine.
The following list is arranged in alphabetical order. Parameters might be listed in a different order in the portlet.
-
boostingSettings
Use this parameter to specify which metadata fields are given extra weight in an overall rank score during a search. You can also specify how much the selected metadata fields contribute to relevance circulation when you run a search. Specify the following values:
-
fieldBoost
This value defines which metadata fields have extra weight when search results are returned, and how much extra weight is given to the specified fields. Provide the following attributes:
-
field
The relevant string-based metadata field that you want search to focus on. Some common or default field values are
title
,description
, andkeywords
. -
boost
A factor that increases the relevancy for matches in the specified metadata field. This value can be set between a range of 1.0 and 10.0. However, it is good practice to set it between a range of 1.0 and 3.0. Value 1.0 means no extra relevancy.
-
-
phraseBoost
This parameter is not mandatory. If enabled, it improves the ranking of a document when the search terms are found as a phrase in that document. For instance, if the user is searching for "John Smith" and a document contains exactly that phrase, this document gets a higher ranking. However, if that document contains "Smith, John", then the ranking is not improved.
The following example shows the boost parameters and some sample values.
{"phraseBoost": {"Enabled":"true"}, "fieldBoost": [{"field":"title", "boost": 3.0} , {"field":"description", "boost":3.0}, {"field":"keywords", "boost":2.0}]}
Important
After you set these values, restart the portal server. If you changed your content items to test your settings, recrawl your content to see the effects of the settings. Readjust the settings to find the best values for your search data.
-
-
CLEAN_UP_TIME_OF_DAY_HOURS
Time of day at which the portal runs the maintenance process for search collections to remove outdated files and broken links. Possible values are positive integers 0 - 24 for the full hours of the day. The default value is
0
, which runs the cleanup at midnight.Note
If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
-
DefaultCollectionsDirectory
You can use this parameter to specify the default directory for search collections. If you use Portal Search locally, this parameter is optional. If you specify no value for this parameter, the default collection directory is
wp_profile\PortalServer\collections
. If you set up a remote search service, this parameter is mandatory. For details about setting this parameter, read Configuring the default location for search collections. -
DEFAULT_SEARCH_OPERATOR
Use this parameter to specify how the Portal search engine responds to search queries with two or more terms. The default value is
or
. Only one search term must be in the document in order for that document to be displayed in the search results list. Change this value toand
to retrieve only those documents that contain all of the search terms that are listed in the query.Note
After you change this parameter, you must restart the portal server and remote search service.
-
CONFIG_FOLDER_PATH
Use this parameter to determine where the configuration data for search collections is stored. The default is
wp_profile/CollectionsConfig
. -
EJB
If you set up a remote search service by using EJB, use this parameter to specify the EJB name in JNDI. An example value is
ejb/com/ibm/hrl/portlets/WsPse/WebScannerLiteEJBHome
.If you set this parameter, you also need to set the
IIOP_URL
parameter. -
EJB_Example
This parameter is an example that gives an example value for the parameter
EJB
. The example value isejb/com/ibm/hrl/portlets/WsPse/WebScannerLiteEJBHome
. -
ExternalSecurityResolverUrl
Use this parameter to configure the Portal Search service with the information about an external security resolver. This parameter is required for security filtering of HCL Connections resources to function properly. An example value of the resolver URL is
https://host:port/ConnectionsResourceId/seedlist/authverify/getACLTokens
where ConnectionsResourceID is any HCL Connections resource identifier. -
HTTP_MAX_BODY_SIZE_MB
Use this parameter to limit how much content is fetched during a crawl from application files, such as PDF or Microsoft Word. The specified unit is MB. The default value is
20 MB
. If a file exceeds the specified limit, the document is truncated, and Portal Search indexes the fetched portion as is possible. However, indexing might fail on truncated documents. In this case, the document is not listed under search results at all.Note
- If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
- Document Conversion Services might not be able to convert the content of truncated application files. If Document Conversion Services fails to convert a truncated application file, it logs an error to the
SystemErr.log
file. If the tracing feature tracing is enabled for the portal, Portal Search logs a warning message to the portal log file.
-
HTTP_MAX_SEEDLIST_SIZE_MB
This parameter limits how much portal content is fetched during a crawl from your own portal site. It determines the amount of space that is reserved for listing portal site resources or managed web content resources. The specified unit is MB. The default value is
4
MB. If a crawl exceeds the limit set for this parameter, the crawl fails, and Portal Search logs an error message. In this case, or if returned search results do not represent to complete extent of your portal site resources, increase this value.Note
If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
-
HTTP_NON_APPL_MAX_BODY_SIZE_MB
Use this parameter to limit how much content of each HTML page is fetched from websites of collections that belong to this search service. The specified unit is MB. The default value is
0.2
MB. This value means that the amount of content that is sent for indexing is always the first 0.2 MB of text.Note
If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
-
IIOP_URL
If you set up a remote search service by using EJB, use this parameter to specify the IIOP URL. An example value is
iiop://localhost:2811
. -
IIOP_URL_Example
This example gives an example value for the parameter
IIOP_URL
. The example value isiiop://localhost:2811
. When looking at the WebSphere console, this port is also know as theBOOTSTRAP_ADDRESS
. -
PSE_TYPE
Use this parameter to specify the type of search service. Possible values are
localhost
,ejb
, andsoap
. The default value islocalhost
for local search service.If you use Portal Search locally, this parameter is optional.
If you set up a remote search, this parameter is mandatory. In this case, specify the type of remote service that you use, EJB, or SOAP. If you specify
ejb
here, you also need to specify the values for the parametersEJB
andIIOP_URL
. If you specifysoap
here, you also need to specify the values for the parameterSOAP_URL
. -
SEARCH_SECURITY_MODE
This parameter defines access control enforcement during search. Three filter modes are supported. Specify one of the following values, depending on the filter mode that you want to use:
-
SECURITY_MODE_PREFILTER
Specify this value to use pre-filtering mode. Pre-filtering provides the fastest filtering, as it is performed in the search index level. An extra advantage of this filtering mode is that remote secured content sources can be searched from portal. However, it is based on search index only. The search result list can be temporarily inconsistent with user access rights if these access rights were changed after the last crawl:
-
Example 1:
A user's access rights were restricted after the last crawl. In this case, the use might get search results listed to which the user had access before, but to which the user no longer has access. When the user clicks such a link in the search result list, the user cannot access the document.
-
Example 2:
A user was given access rights on documents after the last crawl. In this case, the user will not get these documents listed among the search results until after the next crawl.
Note
If the search service contains Portal content (a collection that contains a content source of type Portal site), then this security mode is invalid and must not be used.
-
-
SECURITY_MODE_POSTFILTER
Post-filtering
Specify this value to use post-filtering mode. Post-filtering provides the safest but costly filtering approach. It checks access permission in real time for each returned search result against Portal Access Control. As a result you can use it only for local content sources. This filtering mode was the only filtering mode available before portal V 7.0.
-
SECURITY_MODE_PRE_POST_FILTER
Pre-post-filtering
Specify this value to use pre-post-filtering mode. This value is the default value. Pre-post-filtering combines the two filter modes mentioned earlier. It provides a balanced method for enforced access control. It filters most irrelevant documents at the pre-filtering phase based on the search index. This behavior results in fewer rejections in the post-filtering phase. As it still uses post-filtering, you can apply it only for local content sources. As it uses pre-filtering, search result lists might be temporarily inconsistent with users' access rights until after the next crawl.
-
-
SEEDLIST_PAGE_TIMEOUT
Use this parameter to increase the timeout for fetching the seedlist page. The specified unit for the value is seconds. The default value is 150 seconds. This value means that the portal search attempts to fetch the seedlist main URL for 150 seconds.
Note
If you modify the value for this parameter, the new value is applied only to newly created collections of the search service. You cannot update this parameter for existing search collections.
-
SOAP_URL
If you set up a remote search service by using SOAP, use this parameter to specify the SOAP URL. An example value is
http://localhost:10000/WebScannerSOAP/servlet/rpcrouter
. -
SOAP_URL_Example
This example gives an example value for the parameter
SOAP_URL
. The example value ishttp://localhost:10000/WebScannerSOAP/servlet/rpcrouter
. -
dateFieldPattern
By default, portal search does not know whether a field contains a date. Use this parameter to enable search for documents by date. A regular expression is used to check whether a field must be handled as a date field or not. The default pattern is
".*date$"
, and matches all fields that end with the word “date”.Note
After you change this parameter, you must restart the Portal server and remote search service.
-
dateFormat
Specify the format that is used for date queries. The default is
yyyy-MM-dd
. You can specify a different format by using the Java date syntax with the exception that spaces cannot be used, since that would break the date range queries. Make sure to communicate any changes in format to search users. To verify that the format is supported, you can enable tracing forcom.ibm.lotus.search.index.lucene.search.PseSiapiQueryParser=all
and then perform a series of searches. Do not forget to disable the trace after you verify that the format is supported.Note
After you change this parameter, you must restart the Portal server and remote search service.
-
dateTimeFormat
Specify the format that is used for date queries with a time part. The default is
yyyy-MM-dd,hh:mm
. You can specify a different format by using the Java date syntax with the exception that spaces cannot be used, since that would break the date range queries. Make sure to communicate any changes in format to search users. The typical letters that are used in the format are listed here:-
yyyy
Specifies the year.
-
MM
Specifies the numerical month in the year. For instance, the month of December would be represented by
12
. -
dd
Specifies the numerical day in the month.
-
hh
Specifies the hour in the day. By default, the hour is specified in the 24-hour format. For instance, the number
18
specifies 6 PM. However, you can specify a custom format that uses the 12-hour time format instead. -
mm
Specifies the minute in the hour.
-
ss
Specifies the second in the minute.
-
Z
Specifies the time zone. For example,
-0800
.
To verify that the format is supported, you can enable tracing for
com.ibm.lotus.search.index.lucene.search.PseSiapiQueryParser=all
and then perform a series of searches. Do not forget to disable the trace after you verify that the format is supported.Note
After you change this parameter, you must restart the Portal server and remote search service.
-
-
dateFormatLocale
This parameter specifies the locale that is used when a date is parsed. The portal default locale is used as the default value.
Note
After you change this parameter, you must restart the portal server and the remote search service.
The following parameters are reserved for internal use only. Do not change their values.
-
CONTENT_SOURCE_TYPE_FEATURE_NAME
This parameter is reserved for internal use only. Do not change its value. The default value is
ContentSourceType
. -
CONTENT_SOURCE_TYPE_FEATURE_VAL_PORTAL
This parameter is reserved for internal use only. Do not change its value. The default value is
Portal
. -
CONTENT_SOURCE_TYPE_FEATURE_VAL_WEB
This parameter is reserved for internal use only. Do not change its value. The default value is
Web
. -
SecurityResolverId
This parameter is reserved for internal use only. Do not change its value. The default value is
com.ibm.lotus.search.plugins.provider.core.PortalSecurityResolverFactory
. -
SetProperties
This parameter is reserved for internal use only. Do not change its value. Possible values are
on
oroff
. The default value ison
. -
startup
This parameter is reserved for internal use only. Do not change its value. The default value is
false
. -
VALIDATE_COOKIE
This parameter is reserved for internal use only. Do not change its value. The default value is
123
. -
WORK_MANAGER
You can use this parameter to specify the work manager. This parameter is reserved for internal use only. Do not change its value. The default value is
wps/searchIndexWM
. -
WORK_MANAGER_DEPLOY
This parameter is an example of the deployed
WORK_MANAGER
parameter. The example value iswps/searchIndexWM
. -
WORK_MANAGER_NATIVE
This parameter is an example of the parameter
WORK_MANAGER
for native threads for debug purposes only. The example value isforce.hrl.work.manager.use.native.threads
. -
WORK_MANAGER_NAME
This parameter specifies the JNDI name of the work manager that Portal Search uses.
-
JCR search service configuration parameters
The following search service configuration parameters can be modified to enable and configure searching for content that is stored in the JCR database. These JCR search service configuration parameters can be modified by accessing the JCR ConfigService PortalContent resource environment provider.