HyphenChars

A list of the characters (in unencoded UTF-8) to use as hyphens.

Set HyphenChars to NONE if you do not want to use any characters as hyphens.

HPE Content Component treats hyphenated words specially when it indexes them, and indexes both the subparts and the whole hyphenated word. For example, if HyphenChars is set to - (the default value), HPE Content Component indexes second-hand as second, hand, and secondhand, and you can query all three terms.

At query time, HPE Content Component tokenizes the hyphenated term as the whole term and individual subterms, all separated by the configured DefaultQueryOperator. For example, if the DefaultQueryOperator is OR (the default value), a query for second-hand becomes secondhand OR second OR hand.

NOTE:

If you change the DefaultQueryOperator, you might not get expected query results for hyphenated terms.

The separate parts of a hyphenated term (such as second and hand) are available only for basic keyword searches, and they do not match proximity expressions or field-restricted searches. For example, if a document contains the phrase second-hand car, a keyword search for hand matches the document, but an exact phrase search for "hand car" does not.

In most situations, HPE recommends that you set HyphenChars to NONE, and use the hyphen as a separator (for example, set AugmentSeparators to -). For more information, refer to IDOL Expert.

Type: String
Default: -
Required: No
Configuration Section: LanguageTypes or MyLanguage
Example: HyphenChars=-/.
See Also: AugmentSeparators
TangibleCharacters
NOTE:

If you change this setting after you have indexed content into HPE Content Component, you must reindex the content.


_HP_HTML5_bannerTitle.htm