IBM Support

IZ64479: FTA STATUS DOES NOT BECOME INITTED ON BDM, IF FTA IS NOT ABLE TO LINK TO BDM BY THE FIRST AUTOLINK ATTEMPT.

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • The issue is that FTA status does not become INITTED on BDM, if
    FTA is not able to link to BDM by the first autolink attempt.
    
    Environment:
    MDM: TWS84+FP03 on AIX (DR84)
         enSwFaultTol / sw = YES
    
    BDM: TWS84+FP02 on AIX (DR84Z)
    NGFTA: TWS84+PF02 on AIX (APAT84A) (The issue occurs)
    OKFTA: TWS83+FP01 on AIX (APAT83Z) (The issue does not occur)
           This cpu is added so that logs can be compared with NGFTA
           logs.
    FTA: Dummy FTA. (DR84A) (Only a definition is defined)
    
    Replication steps:
    
    1. Define cpus.
       Notes: Prerequisite of defining cpus
         The following definition setting forces test FTAs to
         receive new Symphonys and try to link to BDM before
         BDM receives a new Symphony.
    
        1. Define cpus in alphabetical order, so that BDM is the
           last cpu to receive the new Symphony during FINAL.
        2. Define a cpu with an invalid host name to force delay to
           occur on BDM to receive a new Symphony. (DR84A)
    
    2. Perform JnextPlan
    3. Wait until BDM receives a new Symphony and FTA retry links to
       BDM.
    4. Check "conman" sc on BDM.  The FTA APAT84A status is "F I  W"
       instead of "F I JW".
    
      BDM:DR84Z
      %cs
      CPUID   RUN  NODE   LIMIT FENCE DATE TIME   STATE  METHOD
    DOMAIN
      DR84    478  UNIX MASTER 0 0 10/08/09 10:54 LTI JW M EA
    MASTERDM
      APAT83Z 478  UNIX FTA    0 0 10/08/09 10:53 F I JW
    MASTERDM
      APAT84A   0  UNIX FTA    0 0                F    W
    MASTERDM
      DR84A     0  UNIX FTA    0 0
    MASTERDM
      DR84Z   478 *UNIX FTA    0 0 10/08/09 10:56 F I J
    MASTERDM
      %
    
    
    Findings:
    By comparing BDM traces/20091008_TWSMERGE.log regards OKFTA and
    NGFTA,
    BDM receives the following "My" messages from OK FTA,
    
     MY:WRITER-UP
     MY:JOBMAN-UP
     MY:INIT
     MY:LINK
    
    where as only the following "My" messages from NG FTA.
    
     MY:WRITER-UP
     MY:LINK
    
    OKFTA: APAT83Z
    10:57:10 08.10.2009|APAT83Z:WRITER:INFO:ipc_get_connection,
    local host
    address:(::ffff:9.188.197.103,30841) has established connection
    with
    remote host:(::ffff:9.188.197.163,54602), using family IPv6
    10:57:10 08.10.2009|APAT83Z:WRITER:AWSBCW028I Started by
    MAILMAN/8.3
    from APAT83Z; workstation type: UNIX
    10:57:10 08.10.2009|APAT83Z:WRITER:AWSBAT001I The event counter
    is
    successfully initializing: AWSBAT051I During its initialization,
    the
    process copied the records for the event counter table from the
    version
    stored in the EventCounter file.
    10:57:10 08.10.2009|APAT83Z:WRITER:AWSBAT006I The event counter
    table
    was successfully filled. Initialization is complete.
    10:57:10 08.10.2009|APAT83Z:WRITER:INFO:ipc_bind, architecture
    type:BIG
    ENDIAN
    10:57:10 08.10.2009|APAT83Z:WRITER:AWSDEB001I Getting a new
    socket: 11
    10:57:10 08.10.2009|APAT83Z:WRITER:INFO:Setting check run number
    478
    10:57:10 08.10.2009|APAT83Z:WRITER:AWSBCW031I Handshake
    command_type
    StartMailbox
    10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :My, FM:DR84Z
    #J565478, TO:DR84Z LEN:588 ID:0
    10:57:10 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My,
    FM:DR84Z
    #J565478, TO:DR84Z LEN:588 ID:0
    10:57:10 08.10.2009|BATCHMAN:AWSBDY112I Received command
    MY:WRITER-UP
    for run number -1 for workstation APAT83Z from workstation
    DR84Z.
    10:57:10 08.10.2009|BATCHMAN:Workstation APAT83Z State is being
    changed:
    WRITER
    10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :My,
    FM:APAT83Z
    #J565416, TO: LEN:588 ID:1
    10:57:10 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My,
    FM:APAT83Z
    #J565416, TO: LEN:588 ID:1
    10:57:10 08.10.2009|BATCHMAN:AWSBDY110I Received command
    MY:JOBMAN-UP
    for run number 477 from workstation APAT83Z.
    10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :My,
    FM:APAT83Z
    #J565416, TO: LEN:588 ID:2
    10:57:10 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My,
    FM:APAT83Z
    #J565416, TO: LEN:588 ID:2
    10:57:10 08.10.2009|BATCHMAN:AWSBDY106I Received command MY:INIT
    for run
    number 478 from workstation APAT83Z.
    10:57:10 08.10.2009|BATCHMAN:Workstation APAT83Z State is being
    changed:
    INITTED
    10:57:10 08.10.2009|BATCHMAN:AWSBHT033I Workstation APAT83Z is
    now
    active, scheduling is resuming.
    10:57:10 08.10.2009|BATCHMAN:AWSBHT035I Workstation APAT83Z
    completed
    its INIT process. 0 jobs running.
    10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :Hi,
    FM:APAT83Z
    #J499962, TO:DR84 LEN:56 ID:3
    10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :Hi,
    FM:APAT83Z
    #J499962, TO:DR84 LEN:56 ID:4
    10:57:10 08.10.2009|MAILMAN:AWSBCV096I Read input, :My,
    FM:APAT83Z #J1,
    TO: LEN:588 ID:5
    10:57:10 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My,
    FM:APAT83Z
    #J1, TO: LEN:588 ID:5
    10:57:10 08.10.2009|BATCHMAN:AWSBDY104I Received command MY:LINK
    for run
    number 478 for workstation APAT83Z from workstation APAT83Z.
    10:57:10 08.10.2009|BATCHMAN:Workstation APAT83Z State is being
    changed:
    FULLY LINKED=LINKED=TCP
    
    
    NGFTA: APAT84A
    10:57:25 08.10.2009|APAT84A:WRITER:INFO:ipc_get_connection,
    local host
    address:(::ffff:9.188.197.103,30841) has established connection
    with
    remote host:(::ffff:9.188.197.163,54603), using family IPv6
    10:57:25 08.10.2009|APAT84A:WRITER:AWSBCW028I Started by
    MAILMAN/8.4
    from APAT84A; workstation type: UNIX
    10:57:25 08.10.2009|APAT84A:WRITER:AWSBAT001I The event counter
    is
    successfully initializing: AWSBAT051I During its initialization,
    the
    process copied the records for the event counter table from the
    version
    stored in the EventCounter file.
    10:57:25 08.10.2009|APAT84A:WRITER:AWSBAT006I The event counter
    table
    was successfully filled. Initialization is complete.
    10:57:25 08.10.2009|APAT84A:WRITER:INFO:ipc_bind, architecture
    type:BIG
    ENDIAN
    10:57:25 08.10.2009|APAT84A:WRITER:AWSDEB001I Getting a new
    socket: 11
    10:57:25 08.10.2009|APAT84A:WRITER:INFO:Setting check run number
    478
    10:57:25 08.10.2009|APAT84A:WRITER:AWSBCW031I Handshake
    command_type
    StartMailbox
    10:57:25 08.10.2009|MAILMAN:AWSBCV096I Read input, :My, FM:DR84Z
    #J565478, TO:DR84Z LEN:588 ID:0
    10:57:25 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My,
    FM:DR84Z
    #J565478, TO:DR84Z LEN:588 ID:0
    10:57:25 08.10.2009|MAILMAN:AWSBCV096I Read input, :Hi,
    FM:APAT84A
    #J426214, TO:DR84 LEN:56 ID:3
    10:57:25 08.10.2009|BATCHMAN:AWSBDY112I Received command
    MY:WRITER-UP
    for run number -1 for workstation APAT84A from workstation
    DR84Z.
    10:57:25 08.10.2009|BATCHMAN:Workstation APAT84A State is being
    changed:
    WRITER
    10:57:25 08.10.2009|MAILMAN:AWSBCV096I Read input, :Hi,
    FM:APAT84A
    #J426214, TO:DR84 LEN:56 ID:4
    10:57:25 08.10.2009|MAILMAN:AWSBCV096I Read input, :My,
    FM:APAT84A #J1,
    TO: LEN:588 ID:5
    10:57:25 08.10.2009|MAILMAN:AWSBCV091I Wrote intercom, :My,
    FM:APAT84A
    #J1, TO: LEN:588 ID:5
    10:57:25 08.10.2009|BATCHMAN:AWSBDY104I Received command MY:LINK
    for run
    number 478 for workstation APAT84A from workstation APAT84A.
    10:57:25 08.10.2009|BATCHMAN:Workstation APAT84A State is being
    changed:
    FULLY LINKED=LINKED=TCP
    

Local fix

  • n/a
    

Problem summary

  • See apar description. Problem occurs
    even if the  enSwFaultTol / sw = NO. It is not
    related to this feature.
    

Problem conclusion

  • This apar will be fixed
    into 8.3.0-TIV-TWS-FP0008, 8.4.0-TIV-TWS-FP0005, 8.5.0-TIV-
    TWS-FP0001.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IZ64479

  • Reported component name

    TIV WKLD SCHDL

  • Reported component ID

    5698WKB84

  • Reported release

    84A

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2009-11-04

  • Closed date

    2009-11-10

  • Last modified date

    2009-11-10

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TIV WKLD SCHDL

  • Fixed component ID

    5698WKB84

Applicable component levels

  • R84A PSY

       UP

[{"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Product":{"code":"SSGSPN","label":"IBM Workload Scheduler"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"84A","Edition":"","Line of Business":{"code":"LOB45","label":"Automation"}}]

Document Information

Modified date:
10 November 2009