Example Streams: Decision Trees

The following streams can be used together in sequence as an example of the database mining process using the Decision Trees algorithm provided by MS Analysis Services.

Table 1. Decision Trees - example streams
Stream Description
1_upload_data.str Used to clean and upload data from a flat file into the database.
2_explore_data.str Provides an example of data exploration with IBM® SPSS® Modeler
3_build_model.str Builds the model using the database-native algorithm.
4_evaluate_model.str Used as an example of model evaluation with IBM SPSS Modeler
5_deploy_model.str Deploys the model for in-database scoring.

Note: In order to run the example, streams must be executed in order. In addition, source and modeling nodes in each stream must be updated to reference a valid data source for the database you want to use.

The dataset used in the example streams concerns credit card applications and presents a classification problem with a mixture of categorical and continuous predictors. For more information about this dataset, see the crx.names file in the same folder as the sample streams.

This dataset is available from the UCI Machine Learning Repository at ftp://ftp.ics.uci.edu/pub/machine-learning-databases/credit-screening/.