IBM Support

IJ41758: LINUX OS CRASH CAUSED BY MMCCR AND TRACEDEV MODULE

APAR status

  • Closed as program error.

Error description

  • GPFS includes kernel modules that are loaded at
    startup and used by other components. The tracedev
    module did not maintain its usage counters correctly,
    so the module could be unloaded while still in use,
    resulting in a kernel crash. One case where this can
    happen is running the "mmvdisk server configure" and
    "mmvdisk server unconfigure" commands with the
    --recycle option.
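
    The role of a usage counter can be illustrated with a small user-space model. This is only a sketch: the APAR does not say exactly how the tracedev counters were mishandled, so the "missing reference" case below is an assumed example of one way such a bug can arise, and the class and function names are hypothetical rather than taken from GPFS.

```python
class ModuleUsage:
    """Toy user-space model of a kernel module's usage counter."""

    def __init__(self, name):
        self.name = name
        self.users = 0      # number of consumers currently holding a reference
        self.loaded = True

    def get(self):
        """Take a reference before using the module."""
        if not self.loaded:
            raise RuntimeError(f"{self.name} is not loaded")
        self.users += 1

    def put(self):
        """Drop a reference once the consumer is done."""
        if self.users == 0:
            raise RuntimeError(f"{self.name}: put() without matching get()")
        self.users -= 1

    def unload(self):
        """Unload only when no references remain; return True if unloaded."""
        if self.users > 0:
            return False
        self.loaded = False
        return True


def unloaded_under_active_user(take_reference):
    """Return True if the module was unloaded while a consumer still used it."""
    mod = ModuleUsage("tracedev")
    if take_reference:
        mod.get()           # correct handling: the counter reflects the consumer
    # The consumer is logically still using the module at this point.
    return mod.unload()
```

    With `take_reference=True` the unload is refused; with `take_reference=False` the counter stays at zero and the unload goes through even though a consumer is still active, which in a real kernel is the situation that leads to a crash.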
    

Local fix

  • Avoid stopping GPFS immediately after starting up.
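
    In automation, this advice could be enforced with a simple minimum-uptime guard. The sketch below is an assumption-laden illustration: the APAR gives no specific waiting interval, so the 60-second default is arbitrary, and the class name and methods are hypothetical. It does not invoke any GPFS commands; the caller is expected to call `mark_started()` after starting GPFS and `wait_until_safe()` before stopping it.

```python
import time


class ShutdownGuard:
    """Delay a shutdown until a minimum time has passed since startup."""

    def __init__(self, min_uptime_s=60.0, clock=time.monotonic):
        # min_uptime_s is an arbitrary placeholder, not a value from the APAR.
        self.min_uptime_s = min_uptime_s
        self._clock = clock
        self._started_at = None

    def mark_started(self):
        """Record the moment GPFS startup was issued."""
        self._started_at = self._clock()

    def seconds_until_safe(self):
        """Return how long to wait before a shutdown is allowed."""
        if self._started_at is None:
            return 0.0      # no startup recorded, nothing to wait for
        elapsed = self._clock() - self._started_at
        return max(0.0, self.min_uptime_s - elapsed)

    def wait_until_safe(self, sleep=time.sleep):
        """Sleep for the remaining time, if any, and return that duration."""
        remaining = self.seconds_until_safe()
        if remaining > 0:
            sleep(remaining)
        return remaining
```

    The clock and sleep functions are injectable so the guard can be exercised without real delays.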
    

Problem summary

  • GPFS includes kernel modules that are loaded at
    startup and used by other components. The tracedev
    module did not maintain its usage counters correctly,
    so the module could be unloaded while still in use,
    resulting in a kernel crash. One case where this can
    happen is running the "mmvdisk server configure" and
    "mmvdisk server unconfigure" commands with the
    --recycle option.
    

Problem conclusion

  • This problem is fixed in 5.1.2 PTF 1.
    For all Spectrum Scale APARs and their
    respective fix solutions, see
    https://public.dhe.ibm.com/storage/spectrumscale/spectrum_scale
    apars.html
    
    
    Benefits of the solution:
    Avoids the kernel crash by handling the usage
    counters of the tracedev module correctly.
    
    Workaround:
    Avoid stopping GPFS immediately after starting up.
    
    Problem trigger:
    Shutting down and starting up GPFS. The problem is rare,
    so reproducing it requires running the shutdown and startup
    sequence, or the "mmvdisk server" commands named in the
    problem summary, in a loop.
    
    Symptom: Abend/Crash
    Platforms affected: ALL Linux OS environments
    Functional Area affected: All Scale Users
    Customer Impact: Suggested
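
    The fix level stated above can be compared against an installed level with a short helper. This is a sketch under assumptions: it treats "5.1.2 PTF 1" as the dotted level 5.1.2.1 and compares levels numerically. The function names are hypothetical, and a numeric comparison says nothing about whether release streams other than 5.1.2 are affected or where they are fixed.

```python
# Assumed dotted form of "5.1.2 PTF 1"; not stated in this form by the APAR.
FIX_LEVEL = (5, 1, 2, 1)


def parse_level(text):
    """Parse a dotted level such as '5.1.2.1' into a 4-part integer tuple."""
    parts = tuple(int(p) for p in text.strip().split("."))
    # Pad short levels such as '5.1.2' with zeros so comparisons line up.
    return parts + (0,) * (4 - len(parts))


def has_fix(installed):
    """Return True if the installed level is at or above the assumed fix level."""
    return parse_level(installed) >= FIX_LEVEL
```

    For example, `has_fix("5.1.2.0")` is false and `has_fix("5.1.2.1")` is true under these assumptions.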
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ41758

  • Reported component name

    SPEC SCALE STD

  • Reported component ID

    5737F33AP

  • Reported release

    512

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2022-08-23

  • Closed date

    2022-08-23

  • Last modified date

    2022-08-23

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE STD

  • Fixed component ID

    5737F33AP

Applicable component levels

Document Information

Modified date:
23 August 2022