Failsoft *FAIL

Function

JES3 failsoft processing diagnoses errors and determines whether abnormal termination should continue or whether processing can be resumed.

When a JES3 failure occurs, a message identifies the failure by a unique failure number. Detailed failure information is also displayed in the hardcopy log. No operator intervention is required unless WANTDUMP=ASK is coded on the OPTIONS initialization statement, or WANTDUMP=YES is coded but WANTDUMP=ASK is assumed because the number of dumps in the specified interval has exceeded the specified limit.

You can use the *FAIL command to immediately terminate a DSP and enter failsoft recovery. Failsoft recovery returns the resources held by the DSP. Use this command when the *CANCEL command fails to end the DSP.

CAUTION:

Use this command with care; it can cause a critical JES3 DSP to end with abend code DM133. Other DSPs may fail during the recovery process if you issue a *FAIL command. If you issue multiple *FAIL commands for the same device or DSP, wait for JES3 to display the results of each *FAIL command before issuing the next one. Otherwise, recovery or cleanup processing can be interrupted at a critical point, which can lead to loss of function and to abends other than the expected DM133 in the DSP being failed.
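
The rule in the caution (one *FAIL per device or DSP at a time) could be enforced in operator automation along the following lines. This is a minimal sketch only: the FailGuard class and its method names are hypothetical and are not part of JES3, and the caller is assumed to decide when JES3 has displayed the results of the earlier command.

```python
class FailGuard:
    """Hypothetical helper: remembers targets with a *FAIL still in progress."""

    def __init__(self):
        self._pending = set()

    def may_issue(self, target):
        # True when no earlier *FAIL for this target is still outstanding.
        return target.upper() not in self._pending

    def issued(self, target):
        # Record that a *FAIL was sent; refuse a second one for the same target.
        key = target.upper()
        if key in self._pending:
            raise RuntimeError("previous *FAIL for %s has not completed" % key)
        self._pending.add(key)

    def completed(self, target):
        # Call once the results of the earlier *FAIL have been displayed.
        self._pending.discard(target.upper())
```

A caller would check may_issue before sending the command, call issued after sending it, and call completed only after the results appear on the console.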

Syntax

>>---*FAIL----+-,--devname-------+--+---------+----------------><
              +-,--+---+--devnum-+  '-,--DUMP-'   
              |    '-/-'         |                
              +-,--dspname-------+                
              +-,--fssname-------+                
              '-,--J--=--jobno---'                
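
Read left to right, the diagram yields a command of the form *FAIL,operand with an optional trailing ,DUMP. The sketch below assembles such a string. The function name build_fail_command and its arguments are illustrative assumptions, and nothing here validates that the operand names a real device, DSP, FSS, or job.

```python
def build_fail_command(target=None, jobno=None, dump=False):
    """Assemble a *FAIL command string (illustrative helper, not a JES3 API).

    target: a devname, devnum, /devnum, dspname, or fssname operand.
    jobno:  a JES3 job number, emitted as J=jobno.
    dump:   append the optional DUMP keyword.
    """
    # The diagram allows exactly one operand choice per command.
    if (target is None) == (jobno is None):
        raise ValueError("specify exactly one of target or jobno")
    operand = "J=%s" % jobno if jobno is not None else str(target)
    command = "*FAIL," + operand
    if dump:
        command += ",DUMP"
    return command
```

For instance, build_fail_command("PRT002") produces the command used in the example in this topic, and build_fail_command(jobno=1234, dump=True) adds the J= form and the DUMP keyword.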

Parameters

devname or devnum or /devnum
Specifies the device name or device number (by 3-digit or 4-digit hexadecimal number) associated with the DSP to be ended. If the device you are failing is the last device running under a functional subsystem (FSS), the FSS will also fail. A slash (/) preceding the device number is not required. Device numbers can be specified by any of the following formats:
ddd
dddd
/ddd
/dddd
where ddd and dddd represent the device numbers.
dspname
Specifies the name of the DSP or the FSS to be ended. Any DSP name defined by the IATYDSD macro instruction is valid. (IATYDSD is used in the generation of the IATGRPT module; this module is part of the JES3 nucleus and contains the JES3 DSP dictionary.)
fssname
Specifies the name of the FSS to be ended. All devices running under the FSS are varied offline. Use the devname parameter to fail a specific device running under the FSS.
J=jobno
Specifies the JES3 job number of the DSP.
DUMP
Specifies that the JES3 abend dump is to be taken. When an active TCP DSP is being failed, the associated NETSERV address space is also dumped. The system continues to process normally.
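
The four device number formats listed under devnum (ddd, dddd, /ddd, /dddd) can be checked before a command is sent. The following is a minimal sketch; the function name is_device_number is an assumption made for illustration.

```python
import re

# Optional leading slash, then exactly 3 or 4 hexadecimal digits.
_DEVNUM = re.compile(r"/?[0-9A-Fa-f]{3,4}")


def is_device_number(operand):
    """Return True if operand has the form ddd, dddd, /ddd, or /dddd."""
    return _DEVNUM.fullmatch(operand) is not None
```

Note that a three- or four-character name made up entirely of hexadecimal digits also passes this check; how JES3 resolves such an operand is not covered by this sketch.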

Example

The writer for device 0002 (JNAME=PRT002) is not responding to commands. Issue a *FAIL command to end it:
*FAIL,PRT002
IAT3713  FAILURE LOGOUT
IAT3713  *************************************************************
IAT3713  *************************************************************
IAT3713  DATE = 1998092 TIME = 1608372   JES3 OS260
IAT3713  JES3 FAILURE NUMBER = 0002   FAILED  DM133
IAT3729  FAILURE EXPLANATION:
IAT3731  OPERATOR ISSUED AN *FAIL DSPNAME COMMAND.
IAT3713  ACTIVE FCT = WTR        DEVICE = PRT002    FCT FAIL NO  = 0001
IAT3713  JOB IEESYSAS (JOB33518)
IAT3713  MODULE = IATOSPS        MOD BASE = 0519C000  DISP = 00065C
IAT3713  APAR NUMBER =           PTF NUMBER = OS260
IAT3713  CALLING SEQUENCE (HIGHEST LEVEL MODULE LISTED LAST)
IAT3713  MODULE = IATOSPR        MOD BASE = 0519902C  DISP = 00044A
IAT3713  APAR NUMBER =           PTF NUMBER = OS260
IAT3713  MODULE = IATOSWD        MOD BASE = 05191000  DISP = 00038E
IAT3713  APAR NUMBER =           PTF NUMBER = OS260
IAT3713  MODULE = IATOSWC        MOD BASE = 0518E000  DISP = 000876
IAT3713  APAR NUMBER =           PTF NUMBER = OS260
IAT3713  PSW AT TIME OF FAILURE  071C1000 80013AE4  ILC  2  INTC   000D
IAT3713  THE FAILING INSTRUCTION IS   0A0D
IAT3713  REGISTERS AT TIME OF FAILURE
IAT3713  REGS  0- 3   05194618   00000085   050ABCD0   10000004
IAT3713  REGS  4- 7   0000004C   00000000   00000000   050ABC98
IAT3713  REGS  8-11   05194A9C   0519D000   0519C000   050937D8
IAT3713  REGS 12-15   04F01000   05194000   00013AE0   04F26148
IAT3713  *************************************************************
IAT3713  *************************************************************
IAT3702 WTR     (PRT002  ) FAILED  DM133 - JES3 FAILURE NO. 0002
IAT3708 WTR     (PRT002  ) IS BEING REINSTATED
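
When the log is processed by a program, the IAT3702 summary line carries the FCT name, device, abend code, and failure number in one place. The sketch below extracts them. The pattern is inferred from the single IAT3702 line shown above and is an assumption, not a documented message layout; the function name parse_iat3702 is likewise illustrative.

```python
import re

# Pattern inferred from the one IAT3702 line in the example (assumption).
_IAT3702 = re.compile(
    r"IAT3702\s+(?P<fct>\S+)\s+\((?P<device>[^)]*)\)\s+FAILED\s+"
    r"(?P<abend>\S+)\s+-\s+JES3 FAILURE NO\.\s+(?P<number>\d+)"
)


def parse_iat3702(line):
    """Return the fields of an IAT3702 line as a dict, or None if no match."""
    match = _IAT3702.search(line)
    if match is None:
        return None
    return {
        "fct": match.group("fct"),
        # The device field is blank-padded inside the parentheses.
        "device": match.group("device").strip(),
        "abend": match.group("abend"),
        "failure_number": match.group("number"),
    }
```

Applied to the IAT3702 line in the example, this returns the FCT name WTR, the device PRT002, the abend code DM133, and the failure number 0002.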