IBM Support

IT31179: STORAGE INSIGHTS INSTABILITY

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • Storage Insights becomes unstable when monitoring large numbers
    of storage systems. In an observed case, monitoring 100 storage
    devices, 44 of which are from the SVC/Storwize family, the heap
    grows to over 1.5 gigabytes and causes the device server
    component to become unstable and for services running within it
    to stop.
    The heapdumps generated when OOM errors occur show as
    leak suspects :
    - arrays of PdPerfStatsData objects which
    accumulated the performance metrics for subsystems entities:
    volumes, ports, nodes, host, etc. and
    - ArrayList of
    MapperResult objects of which parents are
    NAPISVCEventMiniprobePostProcessor objects.
     The
    NAPISVCEventMiniprobePostProcessor keeps the results of
    intermediary processing in a list of MapperResult and this
    becomes very large.
    RECREATE STEPS:
    Observe occasional offline
    services resulting in failed probe/performance monitor jobs and
    a failure to load the web GUI.
    Problem as described by
    customer:
    Storage Insights instability
    Initial customer impact
    (low/med/high): high
    

Local fix

  • Storage Insights support/operations can increase device server
    JVM size to 3gb to provide some relief
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * IBM Spectrum Control & Storage Insights users monitoring     *
    * large number of storage systems from SVC family.             *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * Spectrum Control or Storage Insights may become unstable     *
    * when monitoring large numbers of storage systems. In an      *
    * observed case, monitoring 100 storage devices, 44 of which   *
    * are from the SVC/Storwize family, the heap grows to over     *
    * 1.5 gigabytes and causes the device server component to      *
    * become unstable and for services running within it to stop.  *
    ****************************************************************
    * RECOMMENDATION:                                              *
    ****************************************************************
    During SVC Events processing keep in the heap only relevant
    information for final post processing.
    

Problem conclusion

  • The fix for this APAR is targeted for the following release:
    
    IBM Spectrum Control 5.3.6   [ 5.3.6-IBM-SC ]
    IBM Storage Insights 1Q20   [ 5.3.6-IBM-SC ]
    
    ( release target February 2020 )
    
    http://www.ibm.com/support/docview.wss?&uid=swg21320822
    
    The target dates for future releases do not represent a formal
    commitment by IBM. The dates are subject to change without
    notice.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT31179

  • Reported component name

    STORAGE INSIGHT

  • Reported component ID

    5608TPCSI

  • Reported release

    535

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2019-12-05

  • Closed date

    2020-01-17

  • Last modified date

    2020-01-17

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    STORAGE INSIGHT

  • Fixed component ID

    5608TPCSI

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSQRB8","label":"IBM Storage Insights"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"535","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
08 February 2022