NGram

The size of the character N-grams to use to tokenize Asian text.

NOTE:

You must not use NGram with the SentenceBreaking configuration parameter.

If you set NGram for Japanese, you can use SentenceBreakingOptions for normalization.

Type: Long
Default: 0 (off)
Required: No
Configuration Section: LanguageTypes or MyLanguage
Example: Encodings=UTF8:JapaneseUTF8
NGram=2

In this example, all text is indexed as N-grams of two characters.
See Also: NGramMultiByteOnly
NGramOrientalOnly
SentenceBreaking
SentenceBreakingOptions
NOTE:

If you change this setting after you have indexed content into IDOL Server, the new setting applies only to new content, and the server logs a warning. To clear the warning and ensure that your change applies to all your content, you must initialize your index and reindex the content.


_HP_HTML5_bannerTitle.htm