Anomaly Detection Model Settings

Use the Settings tab to specify options for scoring the model nugget.

Indicate anomalous records with Specifies how anomalous records are treated in the output.

  • Flag and index Creates a flag field that is set to True for all records that exceed the cutoff value included in the model. The anomaly index is also reported for each record in a separate field. See the topic Anomaly Detection Model Options for more information.
  • Flag only Creates a flag field but without reporting the anomaly index for each record.
  • Index only Reports the anomaly index without creating a flag field.

Number of anomaly fields to report Specifies the number of fields to report as an indication of why a particular record is flagged as an anomaly. The most anomalous fields are reported, defined as those that show the greatest deviation from the field norm for the cluster to which the record is assigned.

Discard records Select this option to discard all Non anomalous records from the stream, making it easier to focus on potential anomalies in any downstream nodes. Alternatively, you can choose to discard all Anomalous records in order to limit the subsequent analysis to those records that are not flagged as potential anomalies based on the model.

Note: Due to slight differences in rounding, the actual number of records flagged during scoring may not be identical to those flagged while training the model even if run on the same data.

Generate SQL for this model When using data from a database, SQL code can be pushed back to the database for execution, providing superior performance for many operations.

Select one of the following options to specify how SQL generation is performed.

  • Default: Score using Server Scoring Adapter (if installed) otherwise in process If connected to a database with a scoring adapter installed, generates SQL using the scoring adapter and associated user defined functions (UDF) and scores your model within the database. When no scoring adapter is available, this option fetches your data back from the database and scores it in SPSS® Modeler.
  • Score outside of the Database If selected, this option fetches your data back from the database and scores it in SPSS Modeler.