Define Eduction Settings in the .CFG Configuration File
The Eduction configuration settings that can be defined in the IDOL Server format .CFG configuration file are described in Eduction Parameters.
The .CFG configuration file consists of several sections, each identified by a name in square brackets. Each section contains parameters as name/value pairs. For example:
[Eduction]
ResourceFiles=C:\MyGrammar\gram1.ecr
 
TIP If you use edktool with a configuration file for extraction, you can omit the [IndexTasks] section when the [Eduction] section is present. The minimal configuration file is therefore two lines long:
[Eduction]
ResourceFiles=<CSV of grammars>
To define Eduction settings in the .CFG configuration file
1.
2.
 
EnableComponents (used by edktool only)
OutputScores (used by edktool and educemodule)
OutputSimpleMatchInfo (used by edktool only)
 
NOTE If you set CaseNormalization to Lower or Upper, set MatchCase to true.
3.
Set the following parameters in the [Server] section of the configuration file. These settings are critical for the correct reading of documents.
CantHaveFieldCSVs
DocumentDelimiterCSVs
4.
Modify Configuration Parameter Values
The following section describes how to enter parameter values in the configuration file.
Enter Boolean Values
The following settings for Boolean parameters are interchangeable:
TRUE = true = True = ON = on = Y = y = 1
FALSE = false = False = OFF = off = N = n = 0
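The equivalence above can be sketched as a small lookup in Python. This is an illustration only, not part of edktool or Eduction: the helper name parse_cfg_bool is hypothetical, and the sketch accepts only the spellings listed above, because this page does not say whether other capitalizations are valid.

```python
# Illustrative helper; parse_cfg_bool is a hypothetical name, not an Eduction API.
# Only the spellings documented on this page are accepted.
TRUE_VALUES = {"TRUE", "true", "True", "ON", "on", "Y", "y", "1"}
FALSE_VALUES = {"FALSE", "false", "False", "OFF", "off", "N", "n", "0"}


def parse_cfg_bool(raw: str) -> bool:
    """Map a documented Boolean spelling to a Python bool."""
    value = raw.strip()  # ignore whitespace around the value
    if value in TRUE_VALUES:
        return True
    if value in FALSE_VALUES:
        return False
    raise ValueError(f"not a documented Boolean value: {raw!r}")


print(parse_cfg_bool("ON"), parse_cfg_bool("n"))
```

Rejecting unlisted spellings is a deliberate choice in this sketch, so that a typo in a configuration value is reported rather than silently treated as false.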
Enter String Values
Some parameters require string values that contain quotation marks. Escape each quotation mark by inserting a backslash before it.
For example:
FIELDSTART0="<font face=\"arial\"size=\"+1\"><b>"
Here, quotation marks indicate the beginning and end of the string, while every quotation mark inside the string is escaped with a backslash.
If you want to enter a comma-separated list of strings for a parameter, and one of the strings contains a comma, you must indicate the start and the end of this string with quotation marks.
For example:
ParameterName=cat,dog,bird,"wing,beak",turtle
If any string in a comma-separated list contains quotation marks, you must enclose that string in quotation marks and escape each quotation mark inside it by inserting a backslash before it.
For example:
ParameterName="<font face=\"arial\"size=\"+1\"><b>",dog,bird,"wing,beak",turtle
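The quoting rules above can be checked with a short Python sketch that splits a parameter value into its list items. It is an approximation written from the rules on this page, not the parser that Eduction uses; the function name split_cfg_list is hypothetical, and the sketch assumes that a backslash is special only when it precedes a quotation mark inside a quoted string.

```python
# Illustrative splitter; split_cfg_list is a hypothetical name, not an Eduction API.
def split_cfg_list(value: str) -> list[str]:
    """Split a comma-separated value, honoring quoted strings and \\" escapes."""
    items = []
    current = []
    in_quotes = False
    i = 0
    while i < len(value):
        ch = value[i]
        # Inside a quoted string, backslash + quotation mark is a literal quotation mark.
        if ch == "\\" and in_quotes and i + 1 < len(value) and value[i + 1] == '"':
            current.append('"')
            i += 2
            continue
        if ch == '"':
            in_quotes = not in_quotes  # opening or closing delimiter, not part of the item
        elif ch == "," and not in_quotes:
            items.append("".join(current))  # a comma outside quotes ends the item
            current = []
        else:
            current.append(ch)
        i += 1
    items.append("".join(current))
    return items


print(split_cfg_list('cat,dog,bird,"wing,beak",turtle'))
```

Because backslashes outside quoted strings are kept as they are, an unquoted Windows path such as the ResourceFiles value on this page passes through unchanged.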
 
Sample Configuration File
 
The following shows the configuration for a sample Eduction task:
[Eduction]
ResourceFiles=C:\MyGrammar\gram1.ecr,C:\MyGrammar\gram2.ecr
ZoneStart0=<TEXT>
ZoneEnd0=</TEXT>
ZoneStart1=acknowledgements
ZoneEnd1=introduction
Entity0=common/aus_holidays
EntityField0=HOLIDAYS
EntityZone0=0
Entity1=common/us_holidays
EntityField1=HOLIDAYS
EntityZone1=0
Entity2=us/social_security_number
EntityField2=SS_NUMBER
EntityZone2=1
SearchFields=DRECONTENT
AllowDuplicates=HOLIDAYS
 
[Logging]
LogLevel=Full
This sample uses two grammar files. It searches for all Australian and U.S. holidays in the DRECONTENT field between the text <TEXT> and </TEXT>, and adds each match as a HOLIDAYS field. It also searches for a single social security number in DRECONTENT between the text acknowledgements and introduction, and adds the result as a new field, SS_NUMBER.
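To see how the numbered parameters in the sample relate to each other, the following Python sketch reads the [Eduction] section and pairs each EntityN with its EntityFieldN and with the zone selected by EntityZoneN. It assumes that the file is close enough to INI syntax for Python's standard configparser module to read it, which might not hold for every .CFG file; the function name summarize_entities is hypothetical and nothing here is part of edktool.

```python
import configparser

# Copy of the [Eduction] section from the sample on this page.
SAMPLE_CFG = r"""
[Eduction]
ResourceFiles=C:\MyGrammar\gram1.ecr,C:\MyGrammar\gram2.ecr
ZoneStart0=<TEXT>
ZoneEnd0=</TEXT>
ZoneStart1=acknowledgements
ZoneEnd1=introduction
Entity0=common/aus_holidays
EntityField0=HOLIDAYS
EntityZone0=0
Entity1=common/us_holidays
EntityField1=HOLIDAYS
EntityZone1=0
Entity2=us/social_security_number
EntityField2=SS_NUMBER
EntityZone2=1
SearchFields=DRECONTENT
AllowDuplicates=HOLIDAYS
"""


def summarize_entities(cfg_text: str) -> list[dict]:
    """Pair each EntityN with its output field and zone delimiters."""
    # Assumption: '=' is the only name/value separator, so ':' in paths is safe.
    parser = configparser.ConfigParser(interpolation=None, delimiters=("=",))
    parser.optionxform = str  # preserve the case of parameter names
    parser.read_string(cfg_text)
    eduction = parser["Eduction"]

    rows = []
    n = 0
    while f"Entity{n}" in eduction:
        zone = eduction[f"EntityZone{n}"]  # index of the ZoneStart/ZoneEnd pair
        rows.append({
            "entity": eduction[f"Entity{n}"],
            "field": eduction[f"EntityField{n}"],
            "zone_start": eduction[f"ZoneStart{zone}"],
            "zone_end": eduction[f"ZoneEnd{zone}"],
        })
        n += 1
    return rows


for row in summarize_entities(SAMPLE_CFG):
    print(row)
```

Each printed row shows one entity, the field that receives its matches, and the start and end text of the zone it searches, which matches the description of the sample above.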