Creating a project

You can create a project by importing a content set: a collection of sample data (for example, documents or emails) that you use to create and analyze a knowledge base or decision plan. You can also import an existing knowledge base or decision plan, or create an empty project and import data later.

About this task

The following procedure describes how to create a project by importing a content set. See these topics for descriptions of supported data formats:

To create a project:

Procedure

  1. Open the New Project wizard: click New on the Open Project window, or click the New Project button the toolbar.
  2. Type a project name (English only), select a project type (decision plan or knowledge base), enter a description (optional), and click Next.
  3. Select one of the following options and click Next:
    Create a project by importing a content set
    Choose this option to import a content set (for example, DOC files in file system folders, XML files, emails in PST format, and so on).
    Create a project by importing a knowledge base or decision plan
    Choose this option if you want to import an existing knowledge base or decision plan.
    Create an empty project
    Choose this option to create a blank project. You can import a content set in the future by using the Import wizard, or manually create a new content set by adding new content items. For more information, see Creating an empty project.
  4. If you chose to import a content set, select one of the following formats.
    External formats
    Files from a file system folder
    Files such as Microsoft Word documents, PDF documents, XML files, and HTML documents (see Importing files from a file system folder).
    XML files that conform to the Content Classification XML schema
    One or more XML files that contain content set data (see Importing XML files that conform to the Content Classification schema).
    CSV (Comma Separated Values file)
    A comma-delimited file (see Importing CSV files).
    PST (Outlook file)
    A PST file created from Microsoft Outlook email messages (see Importing PST files).
    Internal formats
    Classification Workbench Content Set (COR file)
    A content set that was created by using Classification Workbench (see Classification Workbench Content Set (COR file)).
  5. Click Next. Depending on the format, a screen is displayed that allows you to navigate to the appropriate folder or file.

    If you are importing data that is not in Unicode format, you must specify the character set that was used to create the data.

  6. Click Finish.

Results

When the import process is complete, the system automatically generates a content set file (project_name.cor) based on the imported data. The Field Definitions panel displays all predefined fields and their properties, and the Categories panel shows all categories identified in the content set. You can use this content set to analyze a decision plan, or create and analyze a knowledge base by using the Create, Analyze, and Learn wizard.