alphaWorks  >  Forums  >  IBM Unstructured Information Modeler  >  developerWorks

Supported Code Page
Replies: 1 - Pages: 1 - Last Post: Jan 16, 2008 11:36 AM - Last Post By: spangles

Posts: 2
Registered: Oct 14, 2005 03:27:34 AM
Supported Code Page
Posted: Sep 16, 2007 10:04:44 AM
Hello,

I have tried using the modeler with non-US characters encoded in UTF-8. After running K-Means I get only 2 clusters, and the text (in text.dat) does not display correctly in the GUI. Is there a workaround for this?

Thanks in advance

spangles

Posts: 4
Registered: Jul 25, 2005 09:43:59 AM
Re: Supported Code Page
Posted: Jan 16, 2008 11:36:18 AM
First, download the ICU4J package ("icu4j.jar"), which contains the Unicode parser:

http://www.icu-project.org/download/

Then start UIM with the following command line (the ";" classpath separator is the Windows form; on Linux or Mac OS X use ":" instead):

java -cp uimodeler.jar;icu4j.jar -Dunicode=yes com.ibm.cv.text.EAdvisor

This should cause UIM to parse the text input as Unicode instead of ASCII.
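
If the text still looks wrong after that, it may be worth confirming that the input file really is well-formed UTF-8 before it reaches the modeler. The following is only a minimal sketch and is not part of UIM or ICU4J: the class name Utf8Check and its two helper methods are made up for illustration, and it uses nothing but the standard Java library. It reports whether a file decodes as strict UTF-8 and whether it starts with a byte order mark; whether UIM tolerates a BOM is not stated anywhere in this thread.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper, not part of UIM or ICU4J.
public class Utf8Check {

    // Returns true if the bytes decode as strict UTF-8 (no malformed sequences).
    static boolean isValidUtf8(byte[] data) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(data));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    // Returns true if the data starts with the UTF-8 byte order mark EF BB BF.
    static boolean hasUtf8Bom(byte[] data) {
        return data.length >= 3
                && (data[0] & 0xFF) == 0xEF
                && (data[1] & 0xFF) == 0xBB
                && (data[2] & 0xFF) == 0xBF;
    }

    public static void main(String[] args) throws Exception {
        if (args.length == 0) {
            // No file given: print usage instead of guessing a path.
            System.out.println("usage: java Utf8Check <file>");
            return;
        }
        byte[] data = Files.readAllBytes(Paths.get(args[0]));
        System.out.println(args[0] + ": valid UTF-8 = " + isValidUtf8(data)
                + ", BOM present = " + hasUtf8Bom(data));
    }
}
```

Run it against the input file, for example "java Utf8Check text.dat". If it reports that the file is not valid UTF-8, the file was probably saved in a different code page and needs converting before the -Dunicode=yes switch can help.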

--Scott Spangler
