The Distribution node shows the occurrence of symbolic (categorical)
values, such as mortgage type or gender. Typically, you might use
the Distribution node to show imbalances in the data, which you could
then rectify using a Balance node before creating a model.
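To make the idea concrete, the following is a minimal plain-Python sketch of the kind of summary a distribution chart presents: the number of records per categorical value and each value's share of the total. It does not use the Modeler API; the function name `value_distribution` and the sample data are hypothetical and serve only as illustration.

```python
from collections import Counter


def value_distribution(values):
    """Return {value: (count, proportion)} for a list of categorical values."""
    counts = Counter(values)
    total = sum(counts.values())
    return {v: (n, n / float(total)) for v, n in counts.items()}


# Hypothetical sample: mortgage type for six records.
mortgage = ["fixed", "fixed", "variable", "fixed", "none", "fixed"]
dist = value_distribution(mortgage)

# List values from most to least frequent to expose any imbalance.
for value, (count, share) in sorted(dist.items(), key=lambda kv: -kv[1][0]):
    print("%-10s %d  %.2f" % (value, count, share))
```

A value that dominates the listing, as "fixed" does in this sample, is the kind of imbalance a Balance node could then correct.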
Example
stream = modeler.script.stream()  # current stream in the Modeler scripting environment
node = stream.create("distribution", "My node")
# "Plot" tab
node.setPropertyValue("plot", "Flags")
node.setPropertyValue("x_field", "Age")
node.setPropertyValue("color_field", "Drug")
node.setPropertyValue("normalize", True)
node.setPropertyValue("sort_mode", "ByOccurence")  # spelled as listed in Table 1
node.setPropertyValue("use_proportional_scale", True)
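When several Distribution nodes share the same settings, it can be convenient to hold the properties in a dictionary and apply them in a loop. The sketch below assumes only the `setPropertyValue(name, value)` method used in the example above. `apply_properties` and `StubNode` are hypothetical names, and `StubNode` exists solely so that the snippet can run outside Modeler.

```python
class StubNode(object):
    """Hypothetical stand-in for a Modeler node; records property values."""

    def __init__(self):
        self.properties = {}

    def setPropertyValue(self, name, value):
        self.properties[name] = value


def apply_properties(node, settings):
    """Call node.setPropertyValue for each name/value pair in settings."""
    for name, value in settings.items():
        node.setPropertyValue(name, value)
    return node


# Same settings as the example above, held in one place.
distribution_settings = {
    "plot": "Flags",
    "x_field": "Age",
    "color_field": "Drug",
    "normalize": True,
    "sort_mode": "ByOccurence",
    "use_proportional_scale": True,
}

node = apply_properties(StubNode(), distribution_settings)
```

Inside Modeler, the node returned by `stream.create("distribution", "My node")` would presumably be passed in place of `StubNode()`.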
Table 1. distributionnode properties
distributionnode properties | Data type | Property description |
plot | SelectedFields, Flags | |
x_field | field | |
color_field | field | Overlay field. |
normalize | flag | |
sort_mode | ByOccurence, Alphabetic | |
use_proportional_scale | flag | |
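Because `plot` and `sort_mode` accept only the values listed in Table 1, a script can check its settings before applying them. The following sketch shows one way to do that. `ALLOWED_VALUES`, `FLAG_PROPERTIES`, and `check_settings` are hypothetical names, not part of the Modeler API, and the check on flag properties assumes they take Python booleans, as in the example.

```python
# Allowed values copied from Table 1.
ALLOWED_VALUES = {
    "plot": ("SelectedFields", "Flags"),
    "sort_mode": ("ByOccurence", "Alphabetic"),
}
FLAG_PROPERTIES = ("normalize", "use_proportional_scale")


def check_settings(settings):
    """Return a list of problems found in a dict of property settings."""
    problems = []
    for name, value in settings.items():
        if name in ALLOWED_VALUES and value not in ALLOWED_VALUES[name]:
            problems.append("%s: %r is not one of %s"
                            % (name, value, ", ".join(ALLOWED_VALUES[name])))
        elif name in FLAG_PROPERTIES and not isinstance(value, bool):
            problems.append("%s: expected True or False, got %r" % (name, value))
    return problems


# "ByOccurrence" (double r) is an easy typo; the table lists "ByOccurence".
print(check_settings({"plot": "Flags", "sort_mode": "ByOccurrence"}))
```

An empty list means no problems were detected. Field properties such as `x_field` are not checked here, since valid field names depend on the data.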