Open topic with navigation
The name of a file to use to control stemming of the words that it contains. The specified
StemmingFile overrides the standard HPE IDOL stemming algorithms for the language type
MyLanguage. For words that are not in the stemming file, HPE Content Component stems values according to the value of the Stemming parameter for the language type.
HPE Content Component applies the stemming file rules before it performs transliteration (if it is enabled).
The stemming file must be in the LanguageDirectory. You must format it like a stop list file, except that each line contains two terms; the original term followed by its stem. For example:
[ASCII] MICE MOUSE MOUSE MOUSE CHILDREN CHILD ...
The stemming terms can contain only alphanumeric values.
To ensure that two words stem to the same value, you must add both words to the stemming file, with the appropriate stem. For example:
MICE MOUSE MOUSE MOUSE
In this example,
MOUSE both stem to
MOUSE. If you specify only
MOUSE in the stemming file,
MOUSE stems to
MOUS, according to HPE IDOL stemming rules.
Use a stemming file only for unusual terms in special situations. HPE does not recommend that you use a stemming file as a replacement for the HPE IDOL stemming algorithms.
If you change the stemming file after you have indexed content into HPE Content Component, you must reindex the content.
If you specify a full path to the Stoplist, you must also use a full path in the DecompositionFile and
StemmingFile parameters. If you do not specify a path to the Stoplist, you must use only the file name in the DecompositionFile and
StemmingFile parameters (that is, you must place these files in the LanguageDirectory).
|Configuration Section:||LanguageTypes or