IBM Support

IJ43790: GSKIT ISSUE WITH SPECIFIC AMD EPYC PROCESSORS

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • Customer run mmcrcluster command hang on tsgskkm due to
    an known problem in GSKit (used by tsgskkm) with specific
    AMD EPYC processors.
    This is CPU family 25.
    Earlier fix covers only CPU family 23 and only for EPYC
    7F72 and EPYC 7302 model.
    

Local fix

  • The known fix is to use the ICC_SHIFT=3 env variable,
    either in the process environment or in the ICCSIG.txt
    file for the GSKit library used (FIPS-certifed or not.)
    
    
    
    By setting the ICC_SHIFT=3 env variable in the root's
    profile; or by adding ICC_SHIFT=3 to the following files,
    as follows:
    
    
    
    /usr/lpp/mmfs/lib/gsk8/C/icc/icclib/ICCSIG.txt
    
    /usr/lpp/mmfs/lib/gsk8/N/icc/icclib/ICCSIG.txt
    
    
    
    # IBM Crypto for C.
    
    # ICC Version 8.6.0.0
    
    ...
    
    #
    
    #Do not edit before this line
    
    #
    
    # Global Settings
    
    ICC_ALLOW_2KEY3DES=1
    
    ICC_SHIFT=3
    
    #
    

Problem summary

  • Commands like mmcrcluster or mmaddnode may hang in GSKIT
    layer on AMD EPYC family 25 processors.  A particular model
    from family 25 that is known to hang in GSKIT layer is
    AMD EPYC 7343.
    

Problem conclusion

  • This problem is fixed in 5.1.6.1
    To see all Spectrum Scale APARs and their respective
    Fix solutions refer to page:
    https://public.dhe.ibm.com/storage/spectrumscale/spectrum_scale_
    apars.html
    
    Benefits of the solution:
    Apply the GSKIT hang workaround automatically on AMD EPYC
    family 25 processors
    
    Work Around:
    Add "ICC_SHIFT=3" line in
    /usr/lpp/mmfs/lib/gsk8/Cicc/icclib/ICCSIG.txt
    file on problem nodes.
    
    Problem trigger:
    This problem affects AMD EPYC family 25 processors
    
    Symptom:
    Admin commands hangs
    
    Platforms affected:
    Linux OS environments
    
    Functional Area affected:
    Admin Commands, gskit
    
    Customer Impact:
    High Importance
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ43790

  • Reported component name

    SPEC SCALE ADV

  • Reported component ID

    5737F35AP

  • Reported release

    511

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2022-10-12

  • Closed date

    2023-01-17

  • Last modified date

    2023-01-17

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE ADV

  • Fixed component ID

    5737F35AP

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"STXKQY"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"511","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
17 January 2023