The name of a file that controls the stemming of the words that it contains. The specified StemmingFile overrides the standard IDOL stemming algorithms for the language type MyLanguage. For words that are not in the stemming file, IDOL Content Component stems them according to the value of the Stemming parameter for the language type.
IDOL Content Component applies the stemming file rules before it performs transliteration (if it is enabled).
The stemming file must be in the LanguageDirectory. You must format it like a stop list file, except that each line contains two terms: the original term followed by its stem. For example:
[ASCII]
MICE MOUSE
MOUSE MOUSE
CHILDREN CHILD
...
The stemming terms can contain only alphanumeric characters.
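The format rules above can be checked before you deploy a stemming file. The following is a minimal sketch in Python, not part of IDOL: the function name `parse_stemming_file` is hypothetical, and the sketch assumes that the file starts with an optional encoding header in square brackets (as in the example), that terms are separated by whitespace, and that blank lines can be ignored. Note that Python's `isalnum` also accepts non-ASCII letters and digits.

```python
def parse_stemming_file(text):
    """Parse stemming-file text into (encoding, {term: stem}).

    Assumes the layout shown in the example: an optional [ENCODING]
    header line, then one "TERM STEM" pair per line.
    """
    encoding = None
    stems = {}
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.strip()
        if not line:
            continue  # assumption of this sketch: blank lines are ignored
        # Treat a bracketed first line as the encoding header.
        if encoding is None and not stems and line.startswith("[") and line.endswith("]"):
            encoding = line[1:-1]
            continue
        parts = line.split()
        if len(parts) != 2:
            raise ValueError(f"line {lineno}: expected 2 terms, found {len(parts)}")
        term, stem = parts
        if not (term.isalnum() and stem.isalnum()):
            raise ValueError(f"line {lineno}: terms must be alphanumeric")
        stems[term] = stem
    return encoding, stems
```

A line such as `MICE MOUSE MOUSE MOUSE` (two pairs run together) or a term such as `MI-CE` raises a `ValueError` that names the offending line.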
To ensure that two words stem to the same value, you must add both words to the stemming file, with the appropriate stem. For example:
MICE MOUSE
MOUSE MOUSE
In this example, MICE and MOUSE both stem to MOUSE. If you specify only MICE MOUSE in the stemming file, MOUSE stems to MOUS, according to IDOL stemming rules.
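One way to catch this mistake is to check that every stem in the file also has its own entry that maps it to itself. The following Python sketch illustrates the check. The function name `stems_missing_identity_entry` is hypothetical, and the input is assumed to be a list of (term, stem) pairs that you have already read from the stemming file.

```python
def stems_missing_identity_entry(pairs):
    """Return the stems that do not have an entry mapping them to themselves.

    pairs: iterable of (term, stem) tuples read from a stemming file.
    """
    mapping = dict(pairs)
    # A stem needs its own "STEM STEM" line; otherwise the standard
    # stemming algorithm applies to it and might produce a different value.
    return sorted({stem for stem in mapping.values() if mapping.get(stem) != stem})


# Only MICE is listed, so MOUSE has no entry of its own.
print(stems_missing_identity_entry([("MICE", "MOUSE")]))
# Both words are listed, so nothing is reported.
print(stems_missing_identity_entry([("MICE", "MOUSE"), ("MOUSE", "MOUSE")]))
```

The first call reports `MOUSE`, and the second call reports nothing.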
Use a stemming file only for unusual terms in special situations. HPE does not recommend that you use a stemming file as a replacement for the IDOL stemming algorithms.
If you change the stemming file after you have indexed content into IDOL Content Component, you must reindex the content.
If you specify a full path to the Stoplist, you must also use a full path in the DecompositionFile and StemmingFile parameters. If you do not specify a path to the Stoplist, you must use only the file name in the DecompositionFile and StemmingFile parameters (that is, you must place these files in the LanguageDirectory).
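The path rule above can be checked mechanically. The following Python sketch is an illustration only: the function names, the dictionary of settings, and the file names and paths in the usage lines are all hypothetical. As a simplification, it treats any value that contains a directory component as "a path", and does not separately verify that the path is a full (absolute) path.

```python
import os.path


def has_path(value):
    """Return True if the setting value includes a directory component."""
    return os.path.basename(value) != value


def check_language_file_paths(settings):
    """Return the parameter names whose path style differs from Stoplist.

    settings: dict of parameter name to value for one language type.
    """
    stoplist = settings.get("Stoplist")
    if stoplist is None:
        return []  # nothing to compare against
    expected = has_path(stoplist)
    return [
        name
        for name in ("DecompositionFile", "StemmingFile")
        if name in settings and has_path(settings[name]) != expected
    ]


# Hypothetical values: Stoplist uses a path, but StemmingFile does not.
print(check_language_file_paths({
    "Stoplist": "/opt/idol/langfiles/english.dat",
    "StemmingFile": "english.stem",
}))
```

In this usage example, the check reports `StemmingFile`, because its value is a bare file name while the Stoplist value includes a path.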
|Configuration Section:||LanguageTypes or
If you change this setting after you have indexed content into IDOL Server, the new setting applies only to new content, and the server logs a warning. To clear the warning and ensure that your change applies to all your content, you must initialize your index and reindex the content.