To specify knowledge base properties, use the Management Console.
Specifies a display name the knowledge base. If you are adding a knowledge base to the IBM® Content Classification server, type a name that complies with 7-bit Unicode Transformation Format (UTF-7) character encoding.
You can also add an empty knowledge base here, and then export a new knowledge base from Classification Workbench. If you use this approach to add a new knowledge base to the Content Classification server, the empty knowledge base must exist before you export the knowledge base from Classification Workbench for the first time.
The default path for knowledge bases that you create in Classification Workbench is Classification_Home\Classification Workbench\Projects_Unicode\project_name\project_name.kb, where project_name is the name that was given to the knowledge base project in Classification Workbench.
To import statistics to an existing knowledge base, ensure that the knowledge base is running before you click OK. These options are available:
d:\directory_path\knowledge_base_name.kb
\\computer_name\directory_path\knowledge_base_name.kb
Categories can be organized under a top-level classifier and be physically separated from other classifiers through principal nodes. For example, categories named Honda, Ford, or Saab might be organized under a classifier named Cars. When training a cached knowledge base, each principal node, which contains the set of categories that comprise a classifier, is loaded into memory and trained separately from the other principal nodes. After a classifier is trained, its statistical data is saved to disk, a digest of the results is created, and the next principal node is loaded into memory and trained.
Digests are all that is needed to run matches on a knowledge base. A read-only knowledge base, which contains only digests, does not require a cache. Because digests do not require much memory, all digests can be loaded into memory at the same time.
A read/write knowledge base can handle both matching and training. With a very large category set, a read/write knowledge base might require a cache so that the categories can be partitioned into classifiers (through principal nodes) which can be separately trained.
The read/write instance of a cached knowledge base is primarily suited for submitting feedback. In addition, a cached knowledge base should have at least one read-only instance to handle Suggest requests.
The backups are created in the Classification_Home/dserverdir/VERSIONS directory on the data server. The file name is the name of the knowledge base concatenated with the backup version number.
If
your knowledge base has associated learning data, you might have several identical
knowledge bases with different version numbers. This occurs when the value of the
global feedback frequency is less than the value of the learning data retrain
frequency setting.
Associate learning data (SARC file) 
Select this option to store learning data with the knowledge base.
Learning data is configured by using Classification Workbench. 
SARC files are stored in Classification Workbench knowledge base project
folders: Classification_Home\Classification
Workbench\Projects_Unicode\knowledge_base_project_name\KBCache.
SARC files on the server are stored in the following directory on the computer where the
read\write instance of the knowledge base is running:
Classification_Home/data/rw_knowledge_base_name. 
Retrain frequency 
Specifies the minimum number of new feedbacks that are sent by the server to the learning data
file that are required to trigger an update of knowledge base statistics. The rate that
the server sends the feedback to the learning data file is controlled by the Knowledge
Base Training settings in the Management Console's Global
Properties window. 

For more information about learning data, see Saving learning data. 
A read/write instance is a server component that can handle read/write and read-only requests on a knowledge base. A single read/write instance must exist for each knowledge base.
A read-only instance is a server component that can handle read-only requests on a knowledge base. Read-only requests can be processed by the read/write instance or forwarded to a read-only instance depending on the current workload of the read/write instance. Read-only instances are optional components. To provide scalability and enhance performance, you can configure the system to run any number of read-only instances of a specific knowledge base on multiple computers.
Specifies the languages that the knowledge base is required to support.
Each knowledge base has its own set of supported languages. The classification technology provides suggestions only for texts that are written in a language that the knowledge base supports. A knowledge base can be either monolingual or multilingual, but must support at least one language. If a knowledge base is monolingual, all questions submitted to the knowledge base are assumed to be in that language.
The GenericLanguage option is provided for basic processing of texts in unsupported or partially supported languages. This option is available for monolingual knowledge bases and cannot be selected for use with other languages.
The set of languages that you define for the knowledge base determines memory consumption. Each language requires approximately 20 MB.