Condition Monitoring (Neural Net/C5.0)

This example concerns monitoring status information from a machine and the problem of recognizing and predicting fault states. The data is created from a fictitious simulation and consists of a number of concatenated series measured over time. Each record is a snapshot report on the machine in terms of the following:

This example uses the streams named condplot.str and condlearn.str, which reference the data files named COND1n and COND2n. These files are available from the Demos directory of any IBM® SPSS® Modeler installation. This can be accessed from the IBM SPSS Modeler program group on the Windows Start menu. The condplot.str and condlearn.str files are in the streams directory.

For each time series, there is a series of records from a period of normal operation followed by a period leading to the fault, as shown in the following table:

Time Power Temperature Pressure Uptime Status Outcome
0 1059 259 0 404 0 0
1 1059 259 0 404 0 0
      ...      
51 1059 259 0 404 0 0
52 1059 259 0 404 0 0
53 1007 259 0 404 0 303
54 998 259 0 404 0 303
      ...      
89 839 259 0 404 0 303
90 834 259 0 404 303 303
0 965 251 0 209 0 0
1 965 251 0 209 0 0
      ...      
51 965 251 0 209 0 0
52 965 251 0 209 0 0
53 938 251 0 209 0 101
54 936 251 0 209 0 101
      ...      
208 644 251 0 209 0 101
209 640 251 0 209 101 101

The following process is common to most data mining projects:

Next