IBM Support

Information Analyzer - Unix - Analyzing non-ASCII flat file data

Question & Answer


Question

When I use Information Analyzer to analyze non-ASCII data from a flat file, I see corrupted data in "view data sample", in the Column Analysis details and drill-down, or in both.

Cause

Information Analyzer is a Unicode-enabled program, so it needs to know which encoding the flat file uses in order to convert the data correctly.

Answer

To display non-ASCII (possibly multi-byte) data correctly, two settings are required: an IANAAppCodePage setting must be added to the DSN in the .odbc.ini file, and the LANG definition in the dsenv file must be set to the encoding of the file you are analyzing.
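As a minimal sketch of the two settings, assuming the flat file is encoded in UTF-8: the DSN name MyFlatFileDSN is hypothetical, the value 106 is the IANA identifier for UTF-8 and should be confirmed against the ODBC reference guide, and the locale name en_US.UTF-8 is only an example whose spelling differs between platforms. The .odbc.ini entry uses INI syntax and is shown as comments so that the block stays valid shell; the LANG lines are what would go in dsenv.

```shell
# --- .odbc.ini (INI syntax, shown here as comments) ---
# [MyFlatFileDSN]          <- hypothetical DSN name; keep its existing settings
# IANAAppCodePage=106      <- assumed value: 106 = UTF-8; confirm in odbcref.pdf

# --- dsenv (Bourne shell syntax) ---
# Assumed locale name; use a name that "locale -a" actually lists on this host.
LANG=en_US.UTF-8
export LANG
```

Both values should describe the same encoding as the file being analyzed; a file in another encoding would need a different IANAAppCodePage value and a matching LANG locale.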

IANAAppCodePage values are listed in the ODBC reference guide, which is installed on the client at IBM\InformationServer\ODBCDrivers\odbcref.pdf.

Valid values for the LANG variable can be listed by running the "locale -a" command at the Unix shell prompt.
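The locale list can be long, so it may help to filter it. The following is a small sketch in which the helper name find_locales is hypothetical and the search pattern "utf" is only an example; it relies on nothing beyond the standard locale and grep commands.

```shell
# Hypothetical helper: print installed locale names that contain the given
# text, ignoring case (spellings vary, e.g. en_US.UTF-8 or en_US.utf8).
find_locales() {
    locale -a | grep -i -- "$1"
}

# Example: look for UTF-8 locales; print a note if none are installed.
find_locales 'utf' || echo "No matching locale is installed on this host"
```

A name printed by this command can be used as the LANG value, provided it corresponds to the encoding of the flat file.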

Product: InfoSphere Information Analyzer
Platform: AIX, HP-UX, Solaris, Linux
Version: 11.3, 11.5, 8.5, 8.7, 9.1

Document Information

Modified date:
16 June 2018

UID

swg21599567