OPTIMAL BINNING
OPTIMAL BINNING
is available in Statistics Base Edition.
The OPTIMAL BINNING
procedure discretizes one or more scale
variables by distributing the values of each variable into bins. Bins can then be used instead of
the original data values of the binning input variables for further analysis. OPTIMAL
BINNING
is useful for reducing the number of distinct values in the given binning input
variables.
OPTIMAL BINNING
/VARIABLES [GUIDE = variable] BIN = varlist [SAVE = {NO** }]
{YES [(INTO = new varlist)]}
[/CRITERIA
[PREPROCESS = {EQUALFREQ**[(BINS = {1000**})]}]
{n }
{NONE }
[METHOD = {MDLP** }]
{EQUALFREQ [(BINS = {10**})]}
{n }
[LOWEREND = {UNBOUNDED**}] [UPPEREND = {UNBOUNDED**}]
{OBSERVED } {OBSERVED }
[LOWERLIMIT = {INCLUSIVE**}]
{EXCLUSIVE }
[FORCEMERGE = {0** }]]
{value}
[/MISSING [SCOPE = {PAIRWISE**}]]
{LISTWISE }
[/OUTFILE RULES = filespec]
[/PRINT [ENDPOINTS**] [DESCRIPTIVES] [ENTROPY] [NONE]]
** Default if the subcommand or keyword is omitted.
This command reads the active dataset and causes execution of any pending commands. See the topic Command Order for more information.
Syntax for the OPTIMAL
BINNING
command can be generated from the Optimal Binning
dialog.
Release History
Release 15.0
- Command introduced.
Example
OPTIMAL BINNING
/VARIABLES GUIDE = guide-variable BIN = binning-input-variable