IBM Support

IV83403: POWERHA: C-SPOC SENDS COMMANDS TO UNREACHABLE NODES

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • The C-SPOC component of PowerHA checks to see if any of
    the
    target nodes of commands are noted as down by CAA, but
    does
    not check to see if the nodes are reachable via clcomd.
    In
    particular, a node that is reachable only through the
    repository disk is effectively down, and commands should
    not be sent to it.
    

Local fix

Problem summary

  • PowerHA resource group(RG) failover may take long time if all
    the network interfaces are down in active node.
    
    PowerHA log files may show below message.  hacmp.out ---------
    WARNING: Cluster mem78_cluster has been running recovery
    program 'TE_RG_MOVE_ACQUIRE' for 360 seconds. Please check
    cluster status.  WARNING: Cluster mem78_cluster has been
    running recovery program 'TE_RG_MOVE_ACQUIRE' for 390 seconds.
    Please check cluster status.
    ....
    ....
    WARNING: Cluster mem78_cluster has been running recovery
    program 'TE_RG_MOVE_ACQUIRE' for 1860 seconds. Please check
    cluster status
    
    The events sequence in hacmp.out
    --------------------------------- EVENT START: network_down
    cloverint0 net_ether_01 EVENT START: network_down_complete
    cloverint0 net_ether_01 EVENT START: resource_state_change
    cloverint1 EVENT START: rg_move_release cloverint1 1 EVENT
    START: rg_move cloverint1 1 RELEASE EVENT START:
    config_too_long 1200 TE_RG_MOVE_RELEASE
    
    errpt log:  --------- <DateTime>cluster0   T
    CL_NETWORK_ISSUE    Node is heartbeating solely over disk or FC
    for more than 15 minutes
    

Problem conclusion

  • Optimized the code to reduce duration of failover in
    node down scenario.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IV83403

  • Reported component name

    POWERHA SYSMIR

  • Reported component ID

    5765H3900

  • Reported release

    712

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Submitted date

    2016-04-05

  • Closed date

    2016-04-05

  • Last modified date

    2016-08-11

  • APAR is sysrouted FROM one or more of the following:

    IV82155

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    POWERHA SYSMIR

  • Fixed component ID

    5765H3900

Applicable component levels

  • R712 PSY U873739

       UP16/08/11 I 1000

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSLM9V","label":"PowerHA SystemMirror Standard Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"712","Edition":"","Line of Business":{"code":"LOB57","label":"Power"}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SSXU4N","label":"PowerHA SystemMirror Enterprise Edition for AIX"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"712","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSLM9V","label":"PowerHA SystemMirror Standard Edition for AIX"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"712","Edition":"","Line of Business":{"code":"LOB57","label":"Power"}},{"Business Unit":{"code":"BU008","label":"Security"},"Product":{"code":"SGL4G4","label":"PowerHA"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"712","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11R","label":"APARs - AIX 7.1 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"712","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11Q","label":"AIX 6.1 HIPERS, APARs and Fixes"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"712","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
19 October 2021