Text Mining modeling node

The Text Mining node uses linguistic and frequency techniques to extract key concepts from the text and create categories with these concepts and other data. The node can be used to explore the text data contents or to produce either a concept model nugget or category model nugget. When you execute this modeling node, an internal linguistic extraction engine extracts and organizes the concepts, patterns, and/or categories using natural language processing methods.

You can execute the Text Mining node and automatically produce a concept or category model nugget using the Generate directly option. Alternatively, you can use a more hands-on, exploratory approach using the Build interactively mode in which not only can you extract concepts, create categories, and refine your linguistic resources, but also perform text link analysis and explore clusters. See the topic Text Mining Node: Model Tab for more information.

You can find this node on the IBM® SPSS® Modeler Text Analytics tab of nodes palette at the bottom of the IBM SPSS Modeler window. See the topic IBM SPSS Modeler Text Analytics nodes for more information.

Requirements. Text Mining modeling nodes accept text data from a Web Feed node, File List node, or any of the standard source nodes. This node is installed with IBM SPSS Modeler Text Analytics and can be accessed on the IBM SPSS Modeler Text Analytics palette.

Note: This node replaces the Text Extraction node, which was offered in old versions of the product. If you have older streams that use the old nodes or model nuggets, you must rebuild your streams using the Text Mining node.