Text Mining model nugget: TMWBModelApplier

You can use the properties in the following table for scripting. The nugget itself is called TMWBModelApplier.

Table 1. Text Mining Model Nugget Properties
Scripting properties Data type Property description
scoring_mode Fields Records  
field_values Flags Counts This option is not available in the Category model nugget. For Flags, set to TRUE or FALSE
true_value string With Flags, define the value for true.
false_value string With Flags, define the value for false.
extension_concept string Specify an extension for the field name. Field names are generated by using the concept name plus this extension. Specify where to put this extension using the add_as value.
extension_category string Field name extension. You can choose to specify an extension prefix/suffix for the field name or you can choose to use the category codes. Field names are generated by using the category name plus this extension. Specify where to put this extension using the add_as value.
add_as Suffix Prefix  
fix_punctuation flag  
excluded_subcategories_descriptors RollUpToParent Ignore For category models only. If a subcategory is unselected. This option allows you to specify how the descriptors belonging to subcategories that were not selected for scoring will be handled. There are two options.
  • Ignore. The option Exclude its descriptors completely from scoring will cause the descriptors of subcategories that do not have checkmarks (unselected) to be ignored and unused during scoring.
  • RollUpToParent. The option Aggregate descriptors with those in parent category will cause the descriptors of subcategories that do not have checkmarks (unselected) to be used as descriptors for the parent category (the category above this subcategory). If several levels of subcategories and unselected, the descriptors will be rolled up under the first available parent category
check_model flag Deprecated in version 14
text field  
method ReadText ReadPath  
docType integer With possible values (0,1,2) where 0 = Full Text, 1 = Structured Text, and 2 = XML
encoding Automatic "UTF-8" "UTF-16" "ISO-8859-1" "US-ASCII" "CP850" "EUC-JP" "SHIFT-JIS" "ISO2022-JP" Note that values with special characters, such as "UTF-8", should be quoted to avoid confusion with a mathematical operator.
language de en es fr it ja nl pt