StemmingFile

The name of a file to use to control stemming of the words that it contains. The specified StemmingFile overrides the standard IDOL stemming algorithms for the language type MyLanguage. For words that are not in the stemming file, IDOL Content Component stems them according to the value of the Stemming parameter for the language type.

IDOL Content Component applies the stemming file rules before it performs transliteration (if it is enabled).

The stemming file must be in the LanguageDirectory. You must format it like a stop list file, except that each line contains two terms: the original term followed by its stem. For example:

[ASCII]
MICE MOUSE
MOUSE MOUSE
CHILDREN CHILD
...

The stemming terms can contain only alphanumeric characters.

To ensure that two words stem to the same value, you must add both words to the stemming file, with the appropriate stem. For example:

MICE MOUSE
MOUSE MOUSE

In this example, MICE and MOUSE both stem to MOUSE. If you specify only MICE MOUSE in the stemming file, MOUSE stems to MOUS, according to IDOL stemming rules.
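
The format rules above can be checked before you deploy a stemming file. The following Python sketch is an illustration only and is not part of IDOL: the function name check_stemming_file is hypothetical, it assumes that "alphanumeric" means ASCII letters and digits, and it assumes that a bracketed first line such as [ASCII] is an encoding header. It reports lines that do not contain exactly two alphanumeric terms, and it warns about any stem that has no entry of its own (the MICE MOUSE case described above).

```python
import re

# Assumption: "alphanumeric" means ASCII letters and digits only.
TERM = re.compile(r"^[A-Za-z0-9]+$")


def check_stemming_file(text):
    """Return (mapping, problems) for the text of a stemming file."""
    mapping, problems = {}, []
    for number, line in enumerate(text.splitlines(), start=1):
        line = line.strip()
        # Skip blank lines and a bracketed header such as [ASCII].
        if not line or (line.startswith("[") and line.endswith("]")):
            continue
        parts = line.split()
        if len(parts) != 2:
            problems.append(f"line {number}: expected 'TERM STEM', got {line!r}")
            continue
        term, stem = parts
        if not (TERM.match(term) and TERM.match(stem)):
            problems.append(f"line {number}: non-alphanumeric term in {line!r}")
            continue
        mapping[term] = stem
    # A stem without its own entry is stemmed by the standard algorithm,
    # so it might not stem to itself (for example, MOUSE to MOUS).
    for stem in sorted(set(mapping.values())):
        if stem not in mapping:
            problems.append(f"stem {stem} has no entry of its own; consider adding '{stem} {stem}'")
    return mapping, problems
```

A file that contains only MICE MOUSE produces one warning for MOUSE. A file that contains both MICE MOUSE and MOUSE MOUSE produces none.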

NOTE:

Use a stemming file only for unusual terms in special situations. HPE does not recommend that you use a stemming file as a replacement for the IDOL stemming algorithms.

If you change the stemming file after you have indexed content into IDOL Content Component, you must reindex the content.

If you specify a full path to the Stoplist, you must also use a full path in the DecompositionFile and StemmingFile parameters. If you do not specify a path to the Stoplist, you must use only the file name in the DecompositionFile and StemmingFile parameters (that is, you must place these files in the LanguageDirectory).
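
The path rule above can be expressed as a small consistency check. This Python sketch is illustrative and is not an IDOL tool: the function names are hypothetical, the settings are assumed to have been read from one language type section into a dictionary, and any value that contains a path separator is treated as a path (the sketch does not verify that the path is absolute).

```python
def has_path(value):
    # Assumption: a value with a path separator is a path, not a bare file name.
    return "/" in value or "\\" in value


def check_language_file_paths(settings):
    """settings: dict of parameter name to value for one language type section."""
    stoplist = settings.get("Stoplist")
    if stoplist is None:
        return []
    problems = []
    for name in ("DecompositionFile", "StemmingFile"):
        value = settings.get(name)
        # The other file parameters must follow the same style as Stoplist.
        if value is not None and has_path(value) != has_path(stoplist):
            expected = "a full path" if has_path(stoplist) else "a file name only"
            problems.append(f"{name} must use {expected}, to match Stoplist")
    return problems
```

For example, a section that sets Stoplist to a full path but sets StemmingFile=english_stem.dat returns one problem for StemmingFile.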

Type: String
Default:  
Required: No
Configuration Section: LanguageTypes or MyLanguage
Example:
[English]
StemmingFile=english_stem.dat
See Also: Stemming, Transliteration
NOTE:

If you change this setting after you have indexed content into IDOL Server, the new setting applies only to new content, and the server logs a warning. To clear the warning and ensure that your change applies to all your content, you must initialize your index and reindex the content.

