Automatic monitoring of JES3 functions

JES3 provides automatic monitoring of JES3 functions on the global and local processors and also in the CI FSS address spaces. Two types of monitors are used:
JES3 loop and wait monitor
The JES3 loop and wait monitor ensures the two main JES3 tasks, the Nuc and the Aux task, are not suspended or in an infinite loop. When that happens, no other JES3 function (called FCTs) can be dispatched besides the currently active one. JES3 periodically examines the status of those two functions and informs the operator, through WTOR IAT6410, about the condition.

The operator can take action based on that information by either failing the currently active FCT or by taking the whole JES3 global, local, or CI FSS down. The operator can also request that monitoring be temporarily suspended by providing a one-time interval in response to the WTOR.

The loop and wait monitor is activated automatically on the global, local, and all CI FSS address spaces. Two parameters can be modified by an operator command:
  • The interval at which JES3 checks what the active FCT is.
  • The threshold for how long an FCT can be continuously active before the operator is notified.

Those parameters can be modified by a MODIFY (F) JES3 command.

Operator messages will be issued periodically even during the times a WTOR is outstanding, for example, during Hot/Warmstart with analysis. You can temporarily suspend the monitoring by responding with an interval value to the IAT6410 WTOR.

JES3 monitor DSP
The monitor DSP runs as an FCT under the JES3 Nuc task. It monitors unavailable JES3 resource. A JES3 resource is anything that can use an FCT or a job that can become unavailable. The monitor DSP allows you to monitor the following JES3 resources:
  • Generalized subtasks
  • AENQ resources
  • JQEs
  • Job numbers
  • File directory entries
  • JSAM buffers
  • Spool space
The monitor DSP also monitors resources that a job is waiting for (data set allocation requests and tape mounts).

Typically the monitoring focuses on 'unusual' AWAITS (the JES3 equivalent of an MVS™ WAIT). Each of the areas has a threshold and interval associated with them. Whenever an AWAIT is detected, lasting more than the threshold specifies, the operator is informed.

A complement to both monitors is an exception analysis function. This function is invoked by an operator command while JES3 is running or offline during dump analysis. The function provides a summary report of 'unusual' conditions as defined by JES3. This function will be enhanced and fine-tuned over time as more and more checks are implemented.

The same function can be invoked by DC and IPCS.