ADP

ADP is available in Statistics Base Edition.

ADP (Automated Data Preparation) helps to prepare data for analysis by automating tedious and repetitive data preparation tasks that would otherwise be done manually. The operations it performs improve analysis speed, predictive power, and robustness.

ADP
  /FIELDS [TARGET=targetField]
          INPUT=predictorFieldlist
          [ANALYSISWEIGHT=analysisweightField]
  [/PREPDATETIME 
          [DATEDURATION={YES**(REFERENCE={CURRENT**    }
                                         {YMD(datespec)}
                               UNIT={AUTO**                       })}]
                                    {YEARS[(SUFFIX={'_years'  })] }
                                                   {suffixname}
                                    {MONTHS[(SUFFIX={'_months  })]}
                                                    {suffixname}
                                    {DAYS[(SUFFIX={'_days'   })]  }
                                                  {suffixname}
                        {NO                                         }
          [TIMEDURATION={YES**(REFERENCE={CURRENT**    } 
                                         {HMS(timespec)} 
                               UNIT={AUTO**                        })}]
                                    {HOURS[(SUFFIX={'_hours'  })]  }
                                                   {suffixname}
                                    {MINUTES[(SUFFIX={'_minutes'})]}
                                                     {suffixname}
                                    {SECONDS[(SUFFIX={'_seconds'})]}
                                                     {suffixname}
                        {NO                                          }
          [EXTRACTYEAR={YES[(SUFFIX={'_year'   })]}]
                                    {suffixname}
                       {NO**                      }
          [EXTRACTMONTH={YES[(SUFFIX={'_month'  })]}]
                                     {suffixname}
                        {NO**                      }
          [EXTRACTDAY={YES(SUFFIX={'_day'    })}]
                                  {suffixname}
                       {NO**                   }
          [EXTRACTHOUR={YES(SUFFIX={'_hour'   })}] 
                                   {suffixname}
                       {NO**                    }
          [EXTRACTMINUTE={YES(SUFFIX={'_minute' })}] 
                                     {suffixname}
                       {NO**                      }
          [EXTRACTSECOND={YES(SUFFIX={'_second' })}]
                                     {suffixname}
                         {NO**                    }
  [/SCREENING [PCTMISSING={YES**(MAXPCT={50**})}]
                                        {value}
                          {NO                   }
              [UNIQUECAT={YES**(MAXCAT={100**   })}]
                                       {integer}
                         {NO                     }
              [SINGLECAT={YES(MAXPCT={95** })}]
                                     {value}
                         {NO**               }   
  [/ADJUSTLEVEL [INPUT={YES**}] [TARGET={YES**}]]
                       {NO   }          {NO   }
            [MAXVALORDINAL={10**   }]  
                           {integer}
            [MINVALCONTINUOUS={5**    }]
                              {integer}
  [/OUTLIERHANDLING [INPUT={YES**}] [TARGET={YES**}]]
                           {NO   }          {NO   }
                    [CUTOFF=SD({3**  })]
                               {value}
                    [REPLACEWITH={CUTOFFVALUE**}]
                                 {MISSING      }
  [/REPLACEMISSING [INPUT={YES**[(EXCLUDE([CONTINUOUS] [NOMINAL] [ORDINAL]))]}]
                          {NO                                                }
                   [TARGET={YES[(EXCLUDE([CONTINUOUS] [NOMINAL] [ORDINAL]))]}]
                           {NO**                                            }
  [/REORDERNOMINAL [INPUT={YES }] [TARGET={YES }]
                          {NO**}          {NO**}
  [/RESCALE [INPUT={ZSCORE**([MEAN={0**  }] [SD={1**  }])}]
                                   {value}      {value}
                   {MINMAX([MIN={0**  }] [MAX={100**}])}
                                    {value}       {value}
                   {NONE                               }
            [TARGET={BOXCOX**([MEAN={0**  }] [SD={1**  }])}]
                                    {value}      {value}
                    {NONE                               }
  [/TRANSFORM [MERGESUPERVISED={YES**(PVALUE={0.05**})}]
                                             {value }
                               {NO                    }
              [MERGEUNSUPERVISED={YES{([ORDINAL] [NOMINAL] [MINPCT={10** })}]
                                                                   {value}
                                 {NO**                                     }
              [BINNING={SUPERVISED**(PVALUE={0.05**})}]
                                            {value }
                       {NONE                         }
              [SELECTION={YES**(PVALUE={0.05**})}]
                         {NO                    }
              [CONSTRUCTION={YES }(ROOT={feature })]
                                        {rootname}
                            {NO**} 
  [/CRITERIA [SUFFIX(TARGET={'_transformed'} INPUT={'_transformed'})]
                            {suffixname    }        {suffixname    }
  /OUTFILE PREPXML='filespec'

** Default if the subcommand or keyword is omitted.

This command reads the active dataset and causes execution of any pending commands. See the topic Command Order for more information.

Release History

Release 18

  • Command introduced.

Example

ADP
  /FIELDS TARGET=targetVar
          INPUT=var1 var2 var3
  /OUTFILE PREPXML='file.xml'.