IBM Support

IC92568: Platform Agent and CIM provider agent (cimprovagt) crashes after several network I/O operation failures.

 

APAR status

  • Closed as fixed if next.

Error description

  • The IBM Systems Director Platform Agent may crash following
    several network Input/Output (I/O) operation failures. The crash
    occurs in Standards Based Linux Instrumentation Module (SBLIM)
    and this impacts the Common Information Model (CIM) provider
    agent (cimprovagt) which crashes and is unable to restart.
    

Local fix

  •      There is no usage change that will avoid this issue.
    

Problem summary

  • The SBLIM implements the CIM monitoring model. Broadly it is
    composed of the Gatherer Daemon (gatherd) responsible for
    gathering metric data and the Repository Daemon (reposd)
    responsible for centralized data collection. The gatherd is
    further composed of multiple Metric Retrieval Plugins that do
    the work of data retrieval. When Director Server Platform Agent
    starts and encounters several network I/O operation failures,
    this metric data is gathered by the Gatherer Plugin for IP
    Protocol Endpoint Metrics. The IPProtocolEndpoint and
    NetworkPort plugins collect raw network counters from
    /proc/net/dev.  Processing of this data results in a buffer
    overflow and generates a segmentation fault (segfault). This is
    reported as a failure in the OSBase_MetricValueProvider module
    and subsequent segfault in CIM provider agent (cimprovagt). Then
     the Platform Agent Watchdog daemon will determine that the
    Gatherer services are not running and will try to restart the
    Gatherer services.  This issue will be fixed in a future release
     of the product.
    

Problem conclusion

Temporary fix

Comments

APAR Information

  • APAR number

    IC92568

  • Reported component name

    IBM DIR AGT XLI

  • Reported component ID

    5765DRXLA

  • Reported release

    630

  • Status

    CLOSED FIN

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2013-05-23

  • Closed date

    2013-07-18

  • Last modified date

    2013-07-18

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

Applicable component levels

  • R631 PSY

       UP

[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SUPPORT","label":"IBM Worldwide Support"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"630","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SGZ2Z3","label":"IBM Systems Director"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"630","Edition":"","Line of Business":{"code":"LOB35","label":"Mainframe SW"}}]

Document Information

Modified date:
22 August 2022