Data sets that RUNSTATS uses
The RUNSTATS utility uses a number of data sets during its operation.
The following table lists the data sets that RUNSTATS uses. The table lists the DD name that is used to identify the data set, a description of the data set, and an indication of whether it is required. Include statements in your JCL for each required data set and any optional data sets that you want to use.
| Data set | Description | Required? |
|---|---|---|
| SYSIN | Input data set that contains the utility control statement. | Yes |
| SYSPRINT | Output data set for messages. | Yes |
| RNPRIN01 | A data set that contains messages from the sort program (usually, SYSOUT or DUMMY). This data set is used when distribution statistics are collected for column groups. | No1 |
| STPRIN01 | A data set that contains messages from the sort program (usually, SYSOUT or DUMMY). This data set is used when frequency statistics are collected on data-partitioned secondary indexes, or when TABLESPACE TABLE COLGROUP FREQVAL is specified. | Yes1,2,5 |
| Sort work data sets6 | Temporary data sets for sort input and output when collecting statistics on at least one data-partitioned secondary index. This data set is used when the COLGROUP option is specified or the COLGROUP and FREQVAL options are specified. The DD names have the form ST01WKnn. | No3,4 |
| Sort work data sets6 | Temporary data sets for sort input and output when collecting distribution statistics for column groups. The DD names have the form STATWK01. | No1,4 |
| Sort work data sets6 | Temporary data sets for sort input and output when collecting frequency statistics. The DD names have the form SORTWK01 and ST02WKnn. | No4 |
Note:
|
||
- Table space or index
- Object that is to be scanned.
Calculating the size of the sort work data sets
Depending on the type of statistics that RUNSTATS collects, the utility uses the ST01WKnn data sets, the SORTWK01 data set, both types of data sets, or neither.
The ST01WKnn data sets are used when collecting statistics on at least one data-partitioned secondary index. To calculate the approximate size (in bytes) of the ST01WKnn data set, use the following formula:
2 ×(maximum record length × numcols × (count + 2) × number of indexes)
The variables in the preceding formula have the following values:
- maximum record length
- Maximum record length of the SYSCOLDISTSTATS record that is processed when collecting frequency statistics (You can obtain this value from the RECLENGTH column in SYSTABLES.)
- numcols
- Number of key columns to concatenate when you collect frequent values from the specified index.
- count
- Number of frequent values that RUNSTATS is to collect.
The SORTWK01 data set is used when collecting distribution statistics. To calculate the approximate size (in bytes) of the SORTWK01 data set, use the following formula:
(longest_record_length + prefix) × sum from 1 to N (#colgroupsn × #rows - n)
The variables in the preceding formula have the following values:
- N
- Number of tables for which distribution statistics are collected
- #colgroupsn
- Number of column groups that are specified for the nth table
- #rows
- Number of rows for the nth table
The ST02WKnn data sets are used when collecting frequency statistics on at least one COLGROUP. To calculate the approximate size (in bytes) of the ST02WKnn data set, use the following formula:
2 ×(maximum record length × (count + 2) × number of parts)
The variables in the preceding formula have the following values:
- maximum record length
- Maximum record length of the SYSCOLDISTSTATS record that is processed when collecting frequency statistics (You can obtain this value from the RECLENGTH column in SYSTABLES.)
- count
- Number of frequent values that RUNSTATS is to collect.
Sort work data sets cannot span volumes. Smaller volumes require more sort work data sets to sort the same amount of data; therefore, large volume sizes can reduce the number of needed sort work data sets. When you allocate sort work data sets on disk, the recommended amount of space to allow provides at least 1.2 times the amount of data that is to be sorted.
Tape devices are not
supported for sort work data sets.