DateFormats
 
Type

String

Default

 

Allowed range

Minimum:

Maximum:

Recommended range

Minimum:

Maximum:

Required

no

Configuration section

[Default] and [individual_spider]

Description

Specify the date formats that can be extracted from a Web page. Use DateCheck to specify the parts of a page the connector should check for dates and whether the connector should discard a page when it does not contain any dates in the specified formats.

You can specify the following strings:

YY
Year (2 digits). For example, 99, 00, 01, and so on.

YYYY
Year (4 digits). For example, 1999, 2000, 2001, and so on.

LONGMONTH
Full month name. For example, January, February, March, and so on. Use DateLongMonthCSVs to define the month names that are recognized.

SHORTMONTH
Abbreviated month name. For example, Jan, Feb, and so on. Use DateMonthCSVs to define the abbreviations that are recognized.

MM
Month (2 digits). For example, 01, 10, 12, and so on.

M+
Month (1 or 2 digits). For example, 1, 2, 3, 10, and so on.

DD
Day (2 digits). For example, 01, 02, 03, 12, 23, and so on.

D+
Day (1 or more digits). For example, 1, 2, 12, 13, 31, and so on.

LONGDAY
Day (1 or 2 digits) with a postfix. For example, 1st, 2nd, 3rd, and so on. Use DatePostfixCSVs to define the postfix strings.

HH
Hour (2 digits). For example, 01, 12, 13, and so on.

H+
Hour (1 or 2 digits). For example, 1, 12, 23, and so on.

NN
Minute (2 digits). For example, 01, 10, 59, and so on.

N+
Minute (1 or 2 digits). For example, 1, 10, 59, and so on.

SS
Second (2 digits). For example, 01, 10, 59, and so on.

S+
Second (1 or 2 digits). For example, 1, 10, 59, and so on.

ZZZ
Time Zone. For example, GMT, EST, PST, and so on.

#S
Whitespace.

Note: To specify multiple formats, separate them with commas (with no space before or after a comma).
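
To make the token list concrete, the following Python sketch translates a DateFormats value into regular expressions. It is an illustration only, not the connector's implementation. The function names (format_to_regex, compile_date_formats, matches_any) are hypothetical, and the English month names, the postfix list, the three-or-four-letter time zone pattern, and the treatment of #S as one or more whitespace characters are assumptions. In the connector, month names and postfixes come from DateLongMonthCSVs, DateMonthCSVs, and DatePostfixCSVs.

```python
import re

# Assumed English values; the connector reads these from DateLongMonthCSVs,
# DateMonthCSVs, and DatePostfixCSVs.
LONG_MONTHS = ["January", "February", "March", "April", "May", "June", "July",
               "August", "September", "October", "November", "December"]
SHORT_MONTHS = [name[:3] for name in LONG_MONTHS]
POSTFIXES = ["st", "nd", "rd", "th"]

# Each documented token mapped to a regex fragment (an illustrative mapping).
TOKENS = {
    "LONGMONTH": "(?:" + "|".join(LONG_MONTHS) + ")",
    "SHORTMONTH": "(?:" + "|".join(SHORT_MONTHS) + ")",
    "LONGDAY": r"\d{1,2}(?:" + "|".join(POSTFIXES) + ")",
    "YYYY": r"\d{4}",
    "YY": r"\d{2}",
    "MM": r"\d{2}", "M+": r"\d{1,2}",
    "DD": r"\d{2}", "D+": r"\d+",      # the text says "1 or more digits"
    "HH": r"\d{2}", "H+": r"\d{1,2}",
    "NN": r"\d{2}", "N+": r"\d{1,2}",
    "SS": r"\d{2}", "S+": r"\d{1,2}",
    "ZZZ": r"[A-Z]{3,4}",              # assumption: upper-case abbreviation
    "#S": r"\s+",                      # assumption: a run of whitespace
}

# Try longer tokens first so that YYYY is not read as YY followed by YY.
_TOKEN_NAMES = sorted(TOKENS, key=len, reverse=True)


def format_to_regex(fmt):
    """Translate one format string, such as D+/SHORTMONTH/YYYY, to a regex."""
    parts = []
    i = 0
    while i < len(fmt):
        for name in _TOKEN_NAMES:
            if fmt.startswith(name, i):
                parts.append(TOKENS[name])
                i += len(name)
                break
        else:
            # Any other character (for example, "/") is matched literally.
            parts.append(re.escape(fmt[i]))
            i += 1
    return re.compile("".join(parts))


def compile_date_formats(value):
    """Split a comma-separated DateFormats value and compile each format."""
    return [format_to_regex(fmt) for fmt in value.split(",")]


def matches_any(text, patterns):
    """Return True if the whole of text matches at least one format."""
    return any(p.fullmatch(text) for p in patterns)
```

For instance, compile_date_formats("LONGDAY#SLONGMONTH#SYYYY") yields a single pattern that, under these assumptions, accepts a string such as "3rd March 1999".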

Example

DateFormats=D+/SHORTMONTH/YYYY,DDMMYYYY

In this example, only dates that have the format D+/SHORTMONTH/YYYY (for example, 1/Jan/2001) or DDMMYYYY (for example, 01012001) can be extracted.
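
As a rough illustration of what this example setting accepts, the Python sketch below scans a piece of text for the two formats and parses each hit into a date. The regular expressions and strptime format strings are hand-written approximations of D+/SHORTMONTH/YYYY and DDMMYYYY, and the extract_dates function is hypothetical. How the connector itself scans a page is controlled by DateCheck and is not modeled here.

```python
import re
from datetime import datetime

# Hand-written approximations of the two formats in the example.
# %b expects English month abbreviations under the default C locale.
CANDIDATES = [
    (re.compile(r"\b\d{1,2}/[A-Za-z]{3}/\d{4}\b"), "%d/%b/%Y"),  # D+/SHORTMONTH/YYYY
    (re.compile(r"\b\d{8}\b"), "%d%m%Y"),                        # DDMMYYYY
]


def extract_dates(text):
    """Return every date in text that matches one of the example formats."""
    found = []
    for pattern, strptime_format in CANDIDATES:
        for match in pattern.finditer(text):
            try:
                found.append(datetime.strptime(match.group(0), strptime_format))
            except ValueError:
                # Right shape but not a real date (for example, 99999999).
                continue
    return found


sample = "Posted 1/Jan/2001, revised 15022003, archived 2001-01-01."
dates = extract_dates(sample)
```

In this sample, the first two dates are picked up, while 2001-01-01 is ignored because it matches neither configured format.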

See also

AfterDate

BeforeDate

DateCheck

DefaultDate