Building the Taxonomy

By organizing information into a hierarchy of categories, a taxonomy, you can define a view of your content. The result of this stage is a skeletal navigation structure of categories, each with a name. You can have a category “Animal” with child categories labeled “Cats” and “Birds.” These child categories might have descendants, and so on. Several techniques exist for building a taxonomy:

Using human domain experts

 

Taxonomies built by domain experts are quite common. A domain expert typically builds a skeleton taxonomy and assigns names to its categories.

Importing from an existing hierarchy

 

This technique allows users to create a taxonomy by extracting the implicit hierarchies from existing URLs or file system hierarchies, or hierarchies defined in metadata such as the Dewey decimal number in a library catalog, and mirror them in a taxonomy.

Using Verity taxonomy packages

 

Verity provides taxonomy packages in a number of industries to help companies start their categorization initiatives. Other vendors also provide taxonomies for particular industry segments.

Using concept extraction and naming

 

Thematic mapping automatically extracts key concepts contained in a set of documents and organizes them into a hierarchy called a concept tree. The Verity thematic mapping engine analyzes your documents and groups together concepts that recur throughout the corpus into categories. The engine then automatically creates a taxonomy structure for these concepts. Automatic naming generates labels for these categories using linguistic analysis.