DateCheck
 
Type

Long

Default

0

Allowed range

Minimum: 0

Maximum: 461

Recommended range

Minimum: 0

Maximum: 461

Required

no

Configuration section

[Default] and [individual_spider]

Description

A bitwise mask that determines which date the connector uses to index a page. The value specifies where and how the connector checks for dates in the formats specified by the parameter DateFormats. To create the value, add together the appropriate numbers from the following list:

URL: 1
If you enter 1, the connector checks whether the URL of a page contains any of the date formats specified in the parameter DateFormats. If the URL does not contain any of these date formats, the connector indexes the page using the date you specify in the parameter DefaultDate (or today's date, if DefaultDate is not set).

Page header: 4
If you enter 4, the connector determines whether the HTML <HEAD> tag of a page contains any of the date formats specified in the parameter DateFormats. If the header does not contain any of these date formats, the connector indexes the page using the date you specify in the parameter DefaultDate (or today's date, if DefaultDate is not set).

Page content: 8
If you enter 8, the connector checks whether the content of a page contains any of the date formats specified in the parameter DateFormats. If the content does not contain any of these date formats, the connector indexes the page using the date you specify in the parameter DefaultDate (or today's date, if DefaultDate is not set).

Case insensitive: 64
If you add 64 to the DateCheck value, the connector checks the appropriate page parts for a case-insensitive match for any of the date formats specified in the parameter DateFormats. If the connector does not find any of these date formats, it indexes the page using the date you specify in the parameter DefaultDate (or today's date, if DefaultDate is not set).

Note: If you specify 64, you must also specify another value to indicate which part of the page the connector should check for the DateFormats strings.

Before download: 128
If you add 128 to the DateCheck value, the connector checks the URL to determine whether the date for the page is in the range specified by the parameters BeforeDate and AfterDate. If a page's date is not in this range, the connector does not download the page.

Note: If you specify 128, you must also include 1 in the DateCheck value.

Discard if not found in specified formats: 256
If you add 256 to the DateCheck value, the connector checks the appropriate page parts for any of the date formats specified in the parameter DateFormats, and discards any page that does not contain one of the specified formats.

Note: If you specify 256, you must also define another value to specify which parts of the page the connector should check for the DateFormats strings.

If you enter 0, the connector does not check for any of the date formats specified in DateFormats.
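
The addition described above can be sketched in Python. This is only an illustration and not part of the connector: the constant names and the build_date_check helper are hypothetical, and the checks simply mirror the notes given for 64, 128, and 256.

```python
# Flag values as listed in the description above.
# The constant names are hypothetical; only the numbers come from this page.
URL = 1
PAGE_HEADER = 4
PAGE_CONTENT = 8
CASE_INSENSITIVE = 64
BEFORE_DOWNLOAD = 128
DISCARD_IF_NOT_FOUND = 256

# Flags that say which part of the page to check (1 + 4 + 8).
LOCATION_FLAGS = URL | PAGE_HEADER | PAGE_CONTENT


def build_date_check(*flags):
    """Combine flag values into a DateCheck number (hypothetical helper)."""
    value = 0
    for flag in flags:
        value |= flag  # equivalent to adding, since each flag is a distinct bit

    # 64 and 256 only modify a check, so a page part must also be selected.
    if value & (CASE_INSENSITIVE | DISCARD_IF_NOT_FOUND) and not value & LOCATION_FLAGS:
        raise ValueError("64 and 256 require at least one of 1, 4, or 8")

    # 128 works on the URL, so 1 must be included.
    if value & BEFORE_DOWNLOAD and not value & URL:
        raise ValueError("128 requires 1 (URL)")

    return value


if __name__ == "__main__":
    # URL + page content + before download + discard if not found
    print(build_date_check(URL, PAGE_CONTENT, BEFORE_DOWNLOAD, DISCARD_IF_NOT_FOUND))  # 393
```

Selecting every option gives 1 + 4 + 8 + 64 + 128 + 256 = 461, which matches the maximum allowed value.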

Example

DateCheck=393

In this example (393 = 1 + 8 + 128 + 256), the connector checks the URL and content of each page for the date formats specified in DateFormats. Before downloading a page, it checks the URL to determine whether the page's date is in the range specified by BeforeDate and AfterDate. If neither the URL nor the content of a page contains any of the specified date formats, the connector discards the page.
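
To see which options a given DateCheck value enables, the mask can be decoded bit by bit. The following Python sketch is illustrative only: decode_date_check and the FLAG_NAMES table are hypothetical names, with the values and labels taken from the description above.

```python
# Values and labels from the Description section; the table name is hypothetical.
FLAG_NAMES = {
    1: "URL",
    4: "Page header",
    8: "Page content",
    64: "Case insensitive",
    128: "Before download",
    256: "Discard if not found in specified formats",
}

# Sum of all documented flag values (461, the documented maximum).
KNOWN_BITS = sum(FLAG_NAMES)


def decode_date_check(value):
    """Return the labels of the options enabled in a DateCheck value."""
    # Reject negative numbers and bits that this page does not document (2, 16, 32, ...).
    if value < 0 or value & ~KNOWN_BITS:
        raise ValueError("DateCheck value uses undocumented bits: %d" % value)
    return [name for bit, name in FLAG_NAMES.items() if value & bit]


if __name__ == "__main__":
    for name in decode_date_check(393):
        print(name)
    # URL
    # Page content
    # Before download
    # Discard if not found in specified formats
```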

See also

AfterDate

BeforeDate

DateFormats

DefaultDate