The MainContentRegex<N> parameter defines the content that is extracted as the main document content. The content must be located in the range defined by the MainRangeRegex parameter.

This parameter accepts one or more regular expressions. The regular expression can contain sub-matches (enclosed in parentheses). If multiple matches are found, the content is concatenated (separated by new line characters).

For example, to define the main document content as all content that is enclosed by <p> </p> tags, set the parameter to:

Type: String
Default: None
Required No
Configuration Section: Any section that you have defined for TextToDocs settings
Example: MainContentRegex0=<p>(.*)</p>
See Also: MainRangeRegex<N>