You can create a project by importing a content set: a
collection of sample data (for example, documents or emails) that
you use to create and analyze a knowledge base or decision plan. You
can also import an existing knowledge base or decision plan, or create
an empty project and import data later.
About this task
The following procedure describes how to create a project
by importing a content set. The supported data formats are described
in the topics that are referenced from the format list in the procedure.
To create a project:
Procedure
- Open the New Project wizard: click New in
the Open Project window, or click the New
Project button on the toolbar.
- Type a project name (English only), select
a project type (decision plan or knowledge base), enter a description
(optional), and click Next.
- Select one of the following options and
click Next:
- Create a project by importing a content set
- Choose this option to import a content set (for example, DOC files
in file system folders, XML files, emails in PST format, and so on).
- Create a project by importing a knowledge base or decision
plan
- Choose this option if you want to import an existing knowledge
base or decision plan.
- Create an empty project
- Choose this option to create a blank project. You can import a
content set later by using the Import wizard, or manually
create a content set by adding content items. For more information,
see Creating an empty project.
- If you chose to import a content set, select
one of the following formats.
- External formats
- Files from a file system folder
- Files such as Microsoft Word documents,
PDF documents, XML files, and HTML documents (see Importing files from a file system folder).
- XML files that conform to the Content Classification XML schema
- One or more XML files that contain content set data (see Importing XML files that conform to the Content Classification schema).
- CSV (Comma Separated Values file)
- A comma-delimited file (see Importing CSV files).
- PST (Outlook file)
- A PST file created from Microsoft Outlook email messages (see Importing PST files).
- Internal formats
- Classification Workbench Content
Set (COR file)
- A content set that was created by using Classification Workbench (see Classification Workbench Content Set (COR file)).
- Click Next. Depending
on the format, a screen is displayed that allows you to navigate to
the appropriate folder or file.
If you are importing
data that is not in Unicode format, you must specify the character
set that was used to create the data.
- Click Finish.
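The character-set note in the procedure raises a practical question: which source files are not in Unicode format? The following Python sketch is one way to check text-based files (for example CSV, XML, or HTML) before you run the wizard. It is not part of the product. The function names and the list of file suffixes are assumptions made for illustration, and the check is a heuristic: a file passes if it starts with a Unicode byte order mark or decodes cleanly as UTF-8. Binary formats such as DOC, PDF, and PST are skipped because the test is not meaningful for them.

```python
from pathlib import Path

# Hypothetical helper names; suffix list is an assumption, adjust as needed.
TEXT_SUFFIXES = (".csv", ".xml", ".txt", ".htm", ".html")

# Byte order marks for UTF-8, UTF-16 little-endian, and UTF-16 big-endian.
UNICODE_BOMS = (b"\xef\xbb\xbf", b"\xff\xfe", b"\xfe\xff")


def looks_like_unicode(data: bytes) -> bool:
    """Return True if data starts with a Unicode BOM or decodes as UTF-8."""
    if data.startswith(UNICODE_BOMS):
        return True
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def files_needing_charset(folder: str, suffixes=TEXT_SUFFIXES) -> list[str]:
    """List text files under folder that do not look like Unicode."""
    return sorted(
        str(path)
        for path in Path(folder).rglob("*")
        if path.is_file()
        and path.suffix.lower() in suffixes
        and not looks_like_unicode(path.read_bytes())
    )


if __name__ == "__main__":
    import sys

    # Usage: python check_charset.py <folder-to-import>
    for name in files_needing_charset(sys.argv[1] if len(sys.argv) > 1 else "."):
        print(name)
```

Any file that the script lists probably uses a legacy encoding, so you would need to know which character set produced it (for example, a Windows code page) and specify that character set during the import.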
Results
When the import process is complete, the
system automatically generates a content set file (project_name.cor)
based on the imported data. The Field Definitions panel
displays all predefined fields and their properties, and the Categories panel
shows all categories identified in the content set. You can use this
content set to analyze a decision plan, or create and analyze a knowledge
base by using the Create, Analyze, and Learn wizard.