Skip to main content


developerWorks  >  Information Management  >

Leverage OmniFind text analysis results with your relational database, Part 2: Combine semantic search with structured query results

developerWorks

Level: Introductory

Sebastian Nelke (snelke@de.ibm.com), OmniFind and UIMA developer, IBM 

18 Oct 2007

Register now or sign in using your IBM ID and password.

IBM® OmniFind™ Enterprise Edition is full of features and functionalities that allow you to build up powerful, high value applications and solutions based on IBM's enterprise search functionality. Part 1 of this series described in detail how to store selected text analysis results and their relations among each other in a relational database. Part 2 focuses on how to combine the results of structured database queries with the powerful capabilities of semantic search.

Objectives

  • Build up powerful, high value applications and solutions based on IBM's enterprise search functionality.

  • Combine the results of structured database queries with the powerful capabilities of semantic search.

Prerequisites

This tutorial is written for people who want to build custom solutions based on IBM OmniFind Enterprise Edition V8.4. To understand this tutorial and get the maximum benefit of it, you should already be familiar with the following topics:

  • Unstructured Information Management Architecture (UIMA):
    Basic UIMA skill is needed. You should know about UIMA typesystems:
    • What is an UIMA typesystem?
    • What does it look like?
    • what it is needed for?

    It's not necessary for this tutorial to know how an UIMA annotator works internally, but things will be easier for you if you have at least a vague idea of how annotators work and what they are good for.

    If you already have a base understanding of annotators, you may want to have a look at the source of the sample annotators found with this tutorial after you've finished the tutorial.

  • DB2®:
    Again, only basic skills are needed. You should be able to install DB2 and know how to create a database and some sample tables. The database and all tables needed to run this tutorial will be created by a script.
  • OmniFind:
    You only need basic skills. During this tutorial you will make use of the OmniFind administration GUI to create OmniFind collections and upload a custom annotator and a database mapping file for the text analysis results. Each of these tasks is explained in detail and there is a step-by-step description with screenshots for each of them. However, you should already have a basic understanding of how OmniFind works and how to use it. A look at the Text Analysis Integration book found with the documentation of your OmniFind installation is also highly recommended.
    • Cas2Jdbc
      You should be familiar with the features and the functionality of Cas2Jdbc. It is strongly recommended to do the first tutorial of this series before continuing with this tutorial. To become familiar with Cas2Jdbc, refer to Part 1 of this series.
    • Semantic search and custom index mappings
      You should have a basic understanding of what semantic search means and what custom index mappings are needed for. To build your own applications using semantic search, you should have a deeper understanding of custom index mappings. It is recommended to read the short chapter "Index mapping for custom analysis results" of the "Text Analysis Integration" handbook coming with your OmniFind installation before continuing with this tutorial. In case you prefer a more down-to-earth approach, you might find "Semantic search in WebSphere Information Integrator OmniFind Edition: Deploy a semantic search solution" (developerWorks, August 2005) useful.
    • SIAPI
      Again, you should already be familiar with the usage of OmniFind's SIAPI. Basic skill is recommended for this tutorial. To get a better understanding of the source code found with the sample search application in this tutorial, refer to "IBM Search and Index APIs (SIAPI) for WebSphere Information Integrator OmniFind Edition" (developerWorks, January 2006).
  • Linux® and Windows®:
    This tutorial is described for both Linux and Windows platforms, you should have basic skills regarding how to copy files, create directories, run scripts, and change file and directory permissions. In some parts of the tutorial, you have to edit files found in the Download section. It is recommended to use an XML editor of your choice to manipulate XML files and validate them before you upload them to your OmniFind system. The OmniFind system will reject invalid XML files with an error message but it's more comfortable to see these errors in a good XML editor (that can, for example, highlight the affected XML elements).

System requirements

Share this tutorial

digg Digg this story
del.icio.us Post to del.icio.us
Slashdot Slashdot it!

To run this tutorial, you need to have IBM OmniFind Enterprise Edition V8.4 installed and running. Make sure to have the latest fix pack installed. You also need DB2 Version 8.2 or later installed. (Starting with OmniFind 8.4 this can be one and the same machine). It is not recommended to run this tutorial in a production environment. The sample document set found in the Download section of this tutorial contains around 20 documents and consumes around 1MB of space. The sample database on your DB2 machine needs around 20MB of space.



Duration

2-3 hours


Formats

html, pdf


Tutorial overview

This tutorial demonstrates how to combine the search results delivered by the OmniFind Search and Index API (SIAPI) with the results of structured database queries. The combination of semantic search and SQL queries allows you to build powerful applications that can close the gap between structured and unstructured information.

Share this....

digg Digg this story del.icio.us del.icio.us Slashdot Slashdot it!


Back to top


Document options

Document options requiring JavaScript are not displayed


More in this series:
Leverage OmniFind text analysis results with your relational database