IBM Support

IJ22962: POTENTIAL UNDETECTED DATA CORRUPTION IN A POWERHA PARTITIONED CL

APAR status

  • Closed as program error.

Error description

  • **************************************************************
    * USERS AFFECTED:
    * Systems running PowerHA 7.2.4 with
    * cluster.es.server.rte at 7.2.4.0.
      **************************************************************
    * ERROR DESCRIPTION:
    * If a partition occurs in a PowerHA cluster, there is
    * potential for undetected data corruption if both sides of
    * the partition attempt to access the shared storage.
    *
    * Under certain timing and load conditions, the existing
    * mechanisms used to prevent this problem may not act quickly
    * enough to isolate the shared storage effectively.
    *
    * This problem can occur with all versions of PowerHA and
    * all cluster types (standard, stretched or linked) with
    * shared storage configurations.
    *
    * You will need to apply the PowerHA fix to avoid this
    * problem.
    * You may also need a fix for CAA depending on the version of
    * AIX you are using.
    * If you need to do any additional tuning of the fix you will
    * also need a fix for RSCT.
    * See the tables below for the specific combination of fixes
    * you will need.
      **************************************************************
    * RECOMMENDATION:
    * Install APAR IJ22962.
    * Prior to fix availability, an interim fix is available from
    * either
    * ftp://aix.software.ibm.com/aix/ifixes/ij22962/
    * https://aix.software.ibm.com/aix/ifixes/ij22962/
    * Installation of the ifix does not require a reboot.
      **************************************************************
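
The "USERS AFFECTED" entry above names one fileset and level (cluster.es.server.rte at 7.2.4.0). The following is a minimal sketch of how that check might be scripted, not an IBM-provided tool. It assumes colon-separated `lslpp` output in which the second field is the fileset name and the third is the level; the function names and the sample line are hypothetical. A result of False only means that this exact level was not found. It does not show that a system is unaffected, since the notice states the problem can occur with all versions of PowerHA.

```python
# Sketch only: helper names and the sample line are illustrative assumptions.
AFFECTED_FILESET = "cluster.es.server.rte"  # fileset named in this APAR
AFFECTED_LEVEL = "7.2.4.0"                  # level named in this APAR


def fileset_levels(lslpp_output):
    """Map fileset name to level from colon-separated lslpp output.

    Assumes field 2 is the fileset name and field 3 is the level.
    Header lines starting with '#' and blank lines are skipped.
    """
    levels = {}
    for line in lslpp_output.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split(":")
        if len(fields) < 3:
            continue  # not a record in the assumed format
        levels[fields[1]] = fields[2]
    return levels


def is_affected(lslpp_output):
    """Return True only if the named fileset is at exactly the named level."""
    return fileset_levels(lslpp_output).get(AFFECTED_FILESET) == AFFECTED_LEVEL


if __name__ == "__main__":
    # Hypothetical sample record, not captured from a real system.
    sample = "cluster.es.server:cluster.es.server.rte:7.2.4.0: : :C: :Base Server Runtime"
    print(is_affected(sample))
```

On an AIX node, the input text could be captured with a command such as `lslpp -Lqc cluster.es.server.rte` and passed to `is_affected`; confirm the field layout on your own system before relying on it.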
    

Local fix

  • NA
    

Problem summary

  • In a PowerHA cluster configuration that includes
    mutual-takeover resource groups, where more than one resource
    group exists on the same critical resource group node, disk
    fencing and other split/merge policies malfunction in a
    particular cluster scenario.
    

Problem conclusion

  • Modified to address the problems with split/merge policies,
    with and without the quarantine policies.
    This fix addresses all the problems with split/merge scenarios
    and makes sure there is no data corruption when a split or
    merge occurs.
    

Temporary fix

  • Not available
    

Comments

APAR Information

  • APAR number

    IJ22962

  • Reported component name

    POWERHA SYSMIR

  • Reported component ID

    5765H3900

  • Reported release

    724

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    YesHIPER

  • Submitted date

    2020-02-21

  • Closed date

    2020-03-26

  • Last modified date

    2020-07-22

  • APAR is sysrouted FROM one or more of the following:

    IJ22627

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    POWERHA SYSMIR

  • Fixed component ID

    5765H3900

Applicable component levels

  • Business unit: IBM Infrastructure w/TPS (BU058)
  • Product: PowerHA SystemMirror Standard Edition for AIX (SSLM9V)
  • Platform: Power Systems (PF053)
  • Version: 724
  • Line of business: Power (LOB57)

Document Information

Modified date:
23 July 2020