CantHaveCSVs
 
Type

String

Default

 

Required

no

Configuration section

[Default] and [SpiderJob]

Description

Enter one or more strings that a page must not contain. If the connector finds any of these strings in the part of the document specified by the CantHaveCheck parameter, it discards the page. Use CantHaveCheck to specify which parts of the page the connector checks for the strings, and whether it checks before or after the page is downloaded.

Separate multiple strings with commas, with no space before or after each comma. You can use wildcards in the strings that you specify.

Example

CantHaveCSVs=*archive*,*test*

In this example, the connector checks whether the part of the page specified by the CantHaveCheck parameter contains the string archive or the string test, either as a whole word or as part of a longer word. If the page contains either string, the connector discards it.
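
The matching behavior described above can be illustrated with a short Python sketch. This is not the connector's implementation. The function names parse_csvs and is_discarded are hypothetical, and the sketch assumes that matching is case-sensitive and that the wildcard syntax follows Python's fnmatch rules (where ? and [...] are also special), which the connector may not share.

```python
from fnmatch import fnmatchcase


def parse_csvs(value):
    """Split a CantHaveCSVs-style value on commas, keeping each pattern verbatim.

    No whitespace is stripped, because the parameter expects no spaces
    around the commas.
    """
    return [pattern for pattern in value.split(",") if pattern]


def is_discarded(checked_text, cant_have_csvs):
    """Return True if any pattern matches the text being checked.

    Assumption: '*' matches any run of characters and matching is
    case-sensitive. The real connector's rules may differ.
    """
    return any(
        fnmatchcase(checked_text, pattern)
        for pattern in parse_csvs(cant_have_csvs)
    )


if __name__ == "__main__":
    setting = "*archive*,*test*"
    for text in ("http://example.com/archive/2001.html",
                 "latest news",
                 "http://example.com/news.html"):
        print(text, "->", "discard" if is_discarded(text, setting) else "keep")
```

Because each pattern is wrapped in wildcards, the string "latest news" is discarded as well: test matches as part of the word latest.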

See also

CantHaveCheck

MustHaveCheck

MustHaveCSVs