Rich text and proprietary format support

Db2® Text Search supports indexing and searching of documents in rich text format and proprietary formats within a properly configured Db2 Text Search instance.

Db2 Text Search supports TEXT, XML, and HTML text index formats to prepare indexes for full-text search on text data. In addition, the INSO format enables indexing and searching in documents with rich text or proprietary formatting:
  • Rich text documents are documents that contain text as well as formatting instructions such as bold, italics, font types, font sizes, spacing, and more.
  • Proprietary formats encompass a variety of common office products, such as, pdf, doc, ppt, ods.

For information about the enablement and configuration of the INSO format feature, see the topic about setting up Db2 Text Search for rich text and proprietary formats.