Translate Node
The Translate node can be used to translate text from supported languages, such as Arabic, Chinese, and Persian, into English for analysis using IBM® SPSS® Modeler Text Analytics. This makes it possible to mine documents in double-byte languages that would not otherwise be supported and allows analysts to extract concepts from foreign-language documents even if they are unable to comprehend the language in question. Note that you must be able to connect to SDL's Software as a Service (SaaS) to be able to use the Translate node.
When mining text in any of these languages, simply add a Translate node prior to the Text Mining modeling node in your stream. You can also enable caching in the Translate node to avoid repeating the translation each time the stream is executed.
You can find this node on the IBM SPSS Modeler Text Analytics tab of nodes palette at the bottom of the IBM SPSS Modeler window. See the topic IBM SPSS Modeler Text Analytics Nodes for more information.
Caching the translation. If you cache the translation, the translated text is stored in the stream rather than in external files. To avoid repeating the translation each time the stream is executed, select the Translate node and from the menus choose, Edit > Node > Cache > Enable. The next time the stream is executed, the output from the translation is cached in the node. The node icon displays a tiny "document" graphic that changes from white to green when the cache is filled. The cache is preserved for the duration of the session. To preserve the cache for another day (after the stream is closed and reopened), select the node and from the menus choose, Edit > Node > Cache > Save Cache. The next time you open the stream, you can reload the saved cache rather than running the translation again.
Alternatively, you can save or enable a node cache by right-clicking the node and choosing Cache from the context menu.
Important! If you are trying to retrieve information over the web through a proxy server, you must enable the proxy server in the net.properties file for both the IBM SPSS Modeler Text Analytics Client and Server. Follow the instructions detailed inside this file. This applies when accessing the web through the Web Feed node or retrieving an SDL Software as a Service (SaaS) license since these connections go through Java. This file is located in C:\Program Files\IBM\SPSS\Modeler\18\jre\lib\net.properties by default.