Text analysis in InfoSphere Warehouse, Part 2: Dictionary-based information extraction combined with IBM Cognos reporting

Gain business insights from unstructured data

From the developerWorks archives

Stefan Abraham, Simone Daum, and Benjamin Leonhardi

Date archived: January 11, 2017 | First published: July 09, 2009

Unstructured information is the largest, most current, and fastest-growing source of information available today. It comes from many sources, such as call center records, repair reports, product reviews, and e-mails. The text analysis features of IBM InfoSphere Warehouse can help you uncover the hidden value in this unstructured data. This series of articles covers the general architecture and business opportunities of analyzing unstructured data with the text analysis capabilities of InfoSphere Warehouse. Integrating this capability with IBM Cognos reporting lets people across the company use the text analysis results.

The first article of this series gave an overview of the text analysis capabilities in InfoSphere Warehouse and showed how to use regular expressions to extract concepts from free-form text. This second article shows you how to use dictionaries for concept extraction and how you can use taxonomies to structure them. It also explains how you can present the results in an interactive Cognos report.
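
To make the idea of dictionary-based concept extraction concrete, here is a minimal sketch in plain Python. It does not use the InfoSphere Warehouse tooling or any of its APIs; the taxonomy contents, the path-style category names, and the helper names `build_lookup` and `extract_concepts` are all hypothetical and chosen only for illustration. It assumes a dictionary is a list of terms and a taxonomy groups those dictionaries under category paths.

```python
import re

# Hypothetical taxonomy: category path -> dictionary terms (illustrative data only).
TAXONOMY = {
    "Hardware/Display": ["screen", "monitor", "display"],
    "Hardware/Power": ["battery", "charger", "power supply"],
}

def build_lookup(taxonomy):
    """Map each lower-cased dictionary term to its taxonomy path."""
    return {term.lower(): path
            for path, terms in taxonomy.items()
            for term in terms}

def extract_concepts(text, taxonomy):
    """Return (matched term, taxonomy path) pairs in order of appearance."""
    lookup = build_lookup(taxonomy)
    # Try longer terms first so multi-word entries are not split up.
    terms = sorted(lookup, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b",
        re.IGNORECASE,
    )
    return [(m.group(1).lower(), lookup[m.group(1).lower()])
            for m in pattern.finditer(text)]

results = extract_concepts("The battery died and the Screen flickers.", TAXONOMY)
for term, path in results:
    print(term, "->", path)
```

Each match carries its taxonomy path, so a report could count mentions per category (for example, everything under `Hardware`) instead of per individual term.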

This content is no longer being updated or maintained. The full article is provided "as is" in a PDF file. Given the rapid evolution of technology, some steps and illustrations may have changed.


