Net Search Extender needs to know the format (or type) of text documents that you intend to search.
This information is necessary for indexing text documents.
Document format XML is the default for column data type XML, and is the only supported document format for that data type.
For the document formats HTML, XML, GPP, and the Outside In filter formats, searching can be restricted to specific parts of a document.
Where Outside In filters can not be used because the format of your document is not supported, you can write a User Defined Function (UDF) that does its own filtering. This UDF must be specified at index creation time and converts the data from the unsupported format to a supported format.
You can index documents if they are stored in one of the supported Coded Character Set Identifiers (CCSIDs). See the DB2® documentation for a list of these code pages.
db2 GET DB CFG for dbname
and
take the value written for Database code page.For consistency, DB2 normally converts the code page of a document to the code page of the database. However, when you store data in a DB2 database in a column with a binary data type, such as BLOB or FOR BIT DATA, DB2 does not convert the data, and the documents retain their original CCSIDs.
Note that incompatible code pages might cause problems when creating a text index or searching.