APAR status
Closed as program error.
Error description
The issue is that FTA status does not become INITTED on BDM, if FTA is not able to link to BDM by the first autolink attempt. Environment: MDM: TWS84+FP03 on AIX (DR84) enSwFaultTol / sw = YES BDM: TWS84+FP02 on AIX (DR84Z) NGFTA: TWS84+PF02 on AIX (APAT84A) (The issue occurs) OKFTA: TWS83+FP01 on AIX (APAT83Z) (The issue does not occur) This cpu is added so that logs can be compared with NGFTA logs. FTA: Dummy FTA. (DR84A) (Only a definition is defined) Replication steps: 1. Define cpus. Notes: Prerequisite of defining cpus The following definition setting forces test FTAs to receive new Symphonys and try to link to BDM before BDM receives a new Symphony. 1. Define cpus in alphabetical order, so that BDM is the last cpu to receive the new Symphony during FINAL. 2. Define a cpu with an invalid host name to force delay to occur on BDM to receive a new Symphony. (DR84A) 2. Perform JnextPlan 3. Wait until BDM receives a new Symphony and FTA retry links to BDM. 4. Check "conman" sc on BDM. The FTA APAT84A status is "F I W" instead of "F I JW". BDM:DR84Z %cs CPUID RUN NODE LIMIT FENCE DATE TIME STATE METHOD DOMAIN DR84 478 UNIX MASTER 0 0 10/08/09 10:54 LTI JW M EA MASTERDM APAT83Z 478 UNIX FTA 0 0 10/08/09 10:53 F I JW MASTERDM APAT84A 0 UNIX FTA 0 0 F W MASTERDM DR84A 0 UNIX FTA 0 0 MASTERDM DR84Z 478 *UNIX FTA 0 0 10/08/09 10:56 F I J MASTERDM % Findings: By comparing BDM traces/20091008_TWSMERGE.log regards OKFTA and NGFTA, BDM receives the following "My" messages from OK FTA, MY:WRITER-UP MY:JOBMAN-UP MY:INIT MY:LINK where as only the following "My" messages from NG FTA. MY:WRITER-UP MY:LINK OKFTA: APAT83Z 10:57:10 08.10.2009|APAT83Z:WRITER:INFO:ipc_get_connection, local host address:(::ffff:9.188.197.103,30841) has established connection with remote host:(::ffff:9.188.197.163,54602), using family IPv6 10:57:10 08.10.2009|APAT83Z:WRITER:AWSBCW028I Started by MAILMAN/8.3 from APAT83Z; workstation type: UNIX 10:57:10 08.10.2009|APAT83Z:WRITER:AWSBAT001I The event counter is successfully initializing: AWSBAT051I During its initialization, the process copied the records for the event counter table from the version stored in the EventCounter file. 10:57:10 08.10.2009|APAT83Z:WRITER:AWSBAT006I The event counter table was successfully filled. Initialization is complete. 10:57:10 08.10.2009|APAT83Z:WRITER:INFO:ipc_bind, architecture type:BIG ENDIAN 10:57:10 08.10.2009|APAT83Z:WRITER:AWSDEB001I Getting a new socket: 11 10:57:10 08.10.2009|APAT83Z:WRITER:INFO:Setting check run number 478 10:57:10 08.10.2009|APAT83Z:WRITER:AWSBCW031I Handshake command_type StartMailbox 10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :My, FM:DR84Z #J565478, TO:DR84Z LEN:588 ID:0 10:57:10 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My, FM:DR84Z #J565478, TO:DR84Z LEN:588 ID:0 10:57:10 08.10.2009|BATCHMAN:AWSBDY112I Received command MY:WRITER-UP for run number -1 for workstation APAT83Z from workstation DR84Z. 10:57:10 08.10.2009|BATCHMAN:Workstation APAT83Z State is being changed: WRITER 10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :My, FM:APAT83Z #J565416, TO: LEN:588 ID:1 10:57:10 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My, FM:APAT83Z #J565416, TO: LEN:588 ID:1 10:57:10 08.10.2009|BATCHMAN:AWSBDY110I Received command MY:JOBMAN-UP for run number 477 from workstation APAT83Z. 10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :My, FM:APAT83Z #J565416, TO: LEN:588 ID:2 10:57:10 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My, FM:APAT83Z #J565416, TO: LEN:588 ID:2 10:57:10 08.10.2009|BATCHMAN:AWSBDY106I Received command MY:INIT for run number 478 from workstation APAT83Z. 10:57:10 08.10.2009|BATCHMAN:Workstation APAT83Z State is being changed: INITTED 10:57:10 08.10.2009|BATCHMAN:AWSBHT033I Workstation APAT83Z is now active, scheduling is resuming. 10:57:10 08.10.2009|BATCHMAN:AWSBHT035I Workstation APAT83Z completed its INIT process. 0 jobs running. 10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :Hi, FM:APAT83Z #J499962, TO:DR84 LEN:56 ID:3 10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :Hi, FM:APAT83Z #J499962, TO:DR84 LEN:56 ID:4 10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :My, FM:APAT83Z #J1, TO: LEN:588 ID:5 10:57:10 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My, FM:APAT83Z #J1, TO: LEN:588 ID:5 10:57:10 08.10.2009|BATCHMAN:AWSBDY104I Received command MY:LINK for run number 478 for workstation APAT83Z from workstation APAT83Z. 10:57:10 08.10.2009|BATCHMAN:Workstation APAT83Z State is being changed: FULLY LINKED=LINKED=TCP NGFTA: APAT84A 10:57:25 08.10.2009|APAT84A:WRITER:INFO:ipc_get_connection, local host address:(::ffff:9.188.197.103,30841) has established connection with remote host:(::ffff:9.188.197.163,54603), using family IPv6 10:57:25 08.10.2009|APAT84A:WRITER:AWSBCW028I Started by MAILMAN/8.4 from APAT84A; workstation type: UNIX 10:57:25 08.10.2009|APAT84A:WRITER:AWSBAT001I The event counter is successfully initializing: AWSBAT051I During its initialization, the process copied the records for the event counter table from the version stored in the EventCounter file. 10:57:25 08.10.2009|APAT84A:WRITER:AWSBAT006I The event counter table was successfully filled. Initialization is complete. 10:57:25 08.10.2009|APAT84A:WRITER:INFO:ipc_bind, architecture type:BIG ENDIAN 10:57:25 08.10.2009|APAT84A:WRITER:AWSDEB001I Getting a new socket: 11 10:57:25 08.10.2009|APAT84A:WRITER:INFO:Setting check run number 478 10:57:25 08.10.2009|APAT84A:WRITER:AWSBCW031I Handshake command_type StartMailbox 10:57:25 08.10.2009|MAILMAN:AWSBCV096I Read input, :My, FM:DR84Z #J565478, TO:DR84Z LEN:588 ID:0 10:57:25 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My, FM:DR84Z #J565478, TO:DR84Z LEN:588 ID:0 10:57:25 08.10.2009|MAILMAN:AWSBCV096I Read input, :Hi, FM:APAT84A #J426214, TO:DR84 LEN:56 ID:3 10:57:25 08.10.2009|BATCHMAN:AWSBDY112I Received command MY:WRITER-UP for run number -1 for workstation APAT84A from workstation DR84Z. 10:57:25 08.10.2009|BATCHMAN:Workstation APAT84A State is being changed: WRITER 10:57:25 08.10.2009|MAILMAN:AWSBCV096I Read input, :Hi, FM:APAT84A #J426214, TO:DR84 LEN:56 ID:4 10:57:25 08.10.2009|MAILMAN:AWSBCV096I Read input, :My, FM:APAT84A #J1, TO: LEN:588 ID:5 10:57:25 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My, FM:APAT84A #J1, TO: LEN:588 ID:5 10:57:25 08.10.2009|BATCHMAN:AWSBDY104I Received command MY:LINK for run number 478 for workstation APAT84A from workstation APAT84A. 10:57:25 08.10.2009|BATCHMAN:Workstation APAT84A State is being changed: FULLY LINKED=LINKED=TCP
Local fix
n/a
Problem summary
See apar description. Problem occurs even if the enSwFaultTol / sw = NO. It is not related to this feature.
Problem conclusion
This apar will be fixed into 8.3.0-TIV-TWS-FP0008, 8.4.0-TIV-TWS-FP0005, 8.5.0-TIV- TWS-FP0001.
Temporary fix
Comments
APAR Information
APAR number
IZ64479
Reported component name
TIV WKLD SCHDL
Reported component ID
5698WKB84
Reported release
84A
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt
Submitted date
2009-11-04
Closed date
2009-11-10
Last modified date
2009-11-10
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
TIV WKLD SCHDL
Fixed component ID
5698WKB84
Applicable component levels
R84A PSY
UP
[{"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Product":{"code":"SSGSPN","label":"IBM Workload Scheduler"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"84A","Edition":"","Line of Business":{"code":"LOB45","label":"Automation"}}]
Document Information
Modified date:
10 November 2009