MainRangeRegex<N>

The MainRangeRegex<N> parameter defines the main part of a document. The main part of the document includes the content and all of the fields that are extracted to the main document. This parameter returns the entire document by default.

This parameter accepts one or more regular expressions. The regular expressions can contain sub-matches (enclosed in parentheses). If multiple matches are found, the content is concatenated.

For example, to define the main part of the document as all content that is enclosed by <html> </html> tags, set the parameter to:

MainRangeRegex0=<html>(.*)</html>
Type: String
Default: *
Required No
Configuration Section: Any section that you have defined for TextToDocs settings
Example: MainRangeRegex0=<html>(.*)</html>
See Also:  

_HP_HTML5_bannerTitle.htm