SentenceBreakingOptions

Additional options to use for the Chinese and Japanese sentence-breaking libraries.

Options for the Chinese Sentence-Breaking Library

Options for the Japanese Sentence-Breaking Library

For the Japanese sentence-breaking library, SentenceBreakingOptions specifies how IDOL Content Component normalizes equivalent multi-byte characters, to ensure that it tokenizes terms that contain equivalent characters in the same way. You can use one or more of the following options:

Separate multiple options with commas. Do not put a space before or after any comma.

Type: String
Default:  
Required: No
Configuration Section: LanguageTypes or MyLanguage
Example: SentenceBreakingOptions=kana,dbcs,number,weakeol
See Also: SentenceBreaking, NGram
NOTE:

If you change this setting after you have indexed content into IDOL Server, the new setting applies only to new content, and the server logs a warning. To clear the warning and ensure that your change applies to all your content, you must initialize your index and reindex the content.
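
To show where the parameter sits in a configuration file, the following fragment is a minimal sketch rather than a complete language definition. It assumes a language section named MyLanguage (the placeholder used under Configuration Section) and reuses the value from the Example line. The // comment syntax and the surrounding layout are assumptions about your configuration file, and the sentence-breaking library for the language is selected separately (see SentenceBreaking).

```
[MyLanguage]
// Value copied from the Example line: options separated by commas only.
SentenceBreakingOptions=kana,dbcs,number,weakeol

// Incorrect form, shown commented out: spaces around a comma are not allowed.
// SentenceBreakingOptions=kana, dbcs, number, weakeol
```

As the note above describes, changing this value on a server that already holds indexed content affects only new content until you initialize the index and reindex.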

