Downloading and Installing the Sample Data

Once you have downloaded this archive file, you should extract it into the directory /data/enron/maildir. (You can actually use any directory, but this tutorial assumes that you have used the directory /data/enron/maildir in its instructions and examples.) How you extract the contents of this file this depends on the operating system that you are using:

  • Linux:
    1. If it does not already exist, create the directory /data/enron on the system where the Watson™ Explorer Engine is installed. You may need administrative privileges to create this directory.

      The directory /data/training/maildir80 has already been created for you to follow along with this tutorial in a Watson Explorer Engine training class.

    2. Search for the latest version of the archive file, such as enron_mail_20150507.tar.gz at http://www.cs.cmu.edu/~enron/enron_mail_20150507.tar.gz. Download the file into this directory and extract its contents using the following command: tar -zxvf enron_mail_20150507.tar.gz

      The file enron_80MB.zip has already been downloaded and installed for you to follow along with this tutorial in a Watson Explorer Engine training class. You will not need to unzip this file.

      This command displays a verbose list of each file and directory as it is extracted from the archive file.

  • Microsoft Windows:
    1. Create the directory c:\data\enron on the system where Watson Explorer Engine is installed.
    2. Search for the latest version of the archive file, such as enron_mail_20150507.tar.gz at http://www.cs.cmu.edu/~enron/enron_mail_20150507.tar.gz and download it into c:\data\enron.
    3. If you do not already have a program that is capable of extracting the contents of a compressed Unix TAR archive on your Windows system, download and install a file archiving and compression tool.
    4. Extract the contents of the downloaded archive file into the directory \data\enron.
      Note: Some utilities, such as 7-Zip, will extract all of the files and directories in the archive file into a new subdirectory with the uncompressed name of the archive file, which would be enron_mail_20150507.tar in this example. If this is the case, you must rename that directory to be called \data\enron\maildir on your system. You can do this from the Windows Explorer by dragging, dropping, and right-clicking to access the rename command, or from a Windows command prompt by using the command: rename \data\enron\enron_mail_20150507.tar.gz\enron_mail_20150507 \data\enron\maildir

After you have created the Linux directory /data/enron/maildir or the Windows directory c:\data\enron\maildir and have successfully extracted the contents of the archive file, you can safely delete the archive file.

To proceed to the next section, click Creating and Customizing a Display.