Introduction to Eduction > Eduction Components

Eduction Components
Eduction consists of the following components:
*
Eduction Engine. The core logic in Eduction that supports the extraction of predefined entities from text, using Eduction grammar files compiled from XML source.
*
Command-line tool (edktool). edktool compiles grammar files, performs extraction from the command line, lists entities in a grammar file, and tests the accuracy of the extraction. See edktool Command-Line Tool.
*
Eduction module (educemodule). The Eduction module integrates with the Index Tasks component of IDOL to perform extraction on IDOL IDX files and add fields to the documents based on the text matches found. Refer to the IDOL Eduction User Guide*.
*
Standard grammars. A collection of ECR grammar files covering commonly sought entities such as Social Security numbers, postal addresses, telephone numbers, people names, and so on. See Standard Grammars.
You can license standard grammar files by category and by language, so that it is possible to be licensed for any combination of category (for example, sentiment, place, or person) and language.
The license is now required when using edktool to compile any grammar files that use the standard grammars as resources, as well as for extraction.
 
*
User grammar. XML files created by the user that describe entities that can locate patterns in text using the Eduction grammar language. See Grammar Reference.
*
IDOL configuration file (AutonomyIDOLConfig.cfg). Eduction is controlled by settings contained in a configuration file which can be read by both IDOL server, the IDOL Index Tasks component, Eduction module (educemodule), edktool, and the ACI server.
 
NOTE The configuration file is not required when controlling IDOL using the SDK.
*
Eduction SDK. This is the C and Java programming interface for Eduction.
For information on the IDOL configuration file for the Eduction module, refer to the IDOL Eduction User Guide*.
For information on the IDOL configuration file for the ACI server, see ACI Server.
*
Linguistic Sentiment Analysis (LSA) grammars (ECR). These standard grammars are designed to identify those phrases in a passage of text that indicate positive or negative sentiment. These grammars can also identify which sentiments are expressed for which topics.
*
Eduction ACI server (eductionserver). This component is a stand-alone Eduction server, included in the SDK and typically used with the LSA grammar to identify sentiment in passages of text.
* Not all distributions of Eduction contain this component.