IBM Support

IJ10418: FIXED RG RESIGN DUE TO INCORRECT MEDIA/CHECKSUM ERROR


APAR status

  • Closed as program error.

Error description

  • If pdisk corruption occurs (for example, when a bad SAS HBA
    card or a bad CPU chip causes silent data corruption on writes
    to pdisks), then even after the faulty hardware has been
    repaired, the system can continue to report misleading
    "I/O error", "err 110" messages. It may also repeatedly resign
    and recover service of the recovery group, so that recovery
    from the corruption takes an unexpectedly long time.
    

Local fix

  • N/A
    

Problem summary

  • If pdisk corruption occurs (for example, when a bad SAS HBA
    card or a bad CPU chip causes silent data corruption on writes
    to pdisks), then even after the faulty hardware has been
    repaired, the system can continue to report misleading
    "I/O error", "err 110" messages. It may also repeatedly resign
    and recover service of the recovery group, so that recovery
    from the corruption takes an unexpectedly long time.
    

Problem conclusion

  • The fix eliminates the misleading "I/O error", "err 110"
    messages and prevents the system from continually resigning
    and restarting.
    
    Work around:
    None.
    
    Problem trigger:
    The problem is triggered by checksum errors detected on pdisks.
    Such errors can result from faulty hardware that writes
    incorrect data to disk without reporting an error, or from a
    malicious program overwriting the disk drives.
    
    Symptom:
    Performance Impact/Degradation
    
    Platforms affected:
    ALL Operating System environments
    
    Functional Area affected:
    ESS/GNR
    
    Customer Impact:
    High Importance
    
    Changed Externals:
    None.
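
    The trigger described above (data written incorrectly with no
    error reported, then caught later by a checksum) can be
    illustrated with a minimal sketch. This is not the GNR
    implementation: the APAR does not state which checksum
    algorithm is used, so CRC32 is an assumed stand-in, and the
    names write_block, faulty_store, and verify_block are
    hypothetical.

```python
import zlib
from typing import Tuple


def write_block(data: bytes) -> Tuple[bytes, int]:
    """Compute the checksum on the host, before the data reaches the device."""
    return data, zlib.crc32(data)


def faulty_store(data: bytes) -> bytes:
    """Hypothetical faulty hardware: flips one bit and reports no error."""
    corrupted = bytearray(data)
    corrupted[0] ^= 0x01
    return bytes(corrupted)


def verify_block(stored: bytes, expected_crc: int) -> bool:
    """Recompute the checksum on read and compare it with the recorded one."""
    return zlib.crc32(stored) == expected_crc


payload, crc = write_block(b"recovery group data")

# A healthy write path returns the data unchanged, so verification passes.
good = verify_block(payload, crc)

# The silent corruption is only visible because the checksum no longer matches.
bad = verify_block(faulty_store(payload), crc)

print("clean read verified:", good)
print("corrupted read verified:", bad)
```

    The point of the sketch is that the device never signals a
    failure; the mismatch on read is the only evidence of the
    corruption, which is why the reported condition is a checksum
    error and not a genuine media I/O error.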
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ10418

  • Reported component name

    SPEC SCALE STD

  • Reported component ID

    5737F33AP

  • Reported release

    502

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2018-10-11

  • Closed date

    2018-10-11

  • Last modified date

    2019-02-12

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE STD

  • Fixed component ID

    5737F33AP

Applicable component levels

  • R502 PSY U883600

       UP18/12/18 I 1000


Document Information

Modified date:
12 February 2019