IBM Support

IZ49810: HWSVRRMD CORE DUE TO UNRESERVE IS BEING CALLED MORE THAN ONCE

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • After all HMCs reboot, IBM.HWSVRRM on all HMCs CORE DUMP.
    CRHS is not working
    
    # frame -l
    2610-422 Cannot execute the command on node xx.xx.xx.xx.
    The
    resource manager IBM.HWSVRRM is not available.
    
    (gdb) where
    #0 0xffffe410 in ?? ()
    #1 0x406ebb59 in *__GI_abort () from /lib/tls/libc.so.6
    #2 0x406e3b2a in *__GI___assert_fail () from
    /lib/tls/libc.so.6
    #3 0x401ad372 in rsct_rmf2v::RMRcp::unreserve()
    (this=0x818bea0)
    at
    /project/spreldvoh/build/rdvohs003a/src/rsct/SDK/rmf/RMCl
    asses.C:10209
    #4 0x0804f429 in set_manager_configured() ()
    #5 0x0804f17d in HWSVRRMDaemon::mainInitThread(void*) ()
    #6 0x401f5c9e in rsct_rmf2v::RMInitThread::run(void*)
    (this=0x80be9b0, theParameters=0x80b9868)
    at
    /project/spreldvoh/build/rdvohs003a/src/rsct/SDK/rmf/RMDa
    emon.C:927
    #7 0x403835ac in rsct_base::CRunnable::threadMain()
    (this=0x80be9b0)
    at
    /project/spreldvoh/build/rdvohs003a/src/rsct/SDK/base/CRu
    nnable.C:664
    #8 0x40381bc2 in stubCRunnable (pToken=0x80be9b0) at
    /project/spreldvoh/build/rdvohs003a/src/rsct/SDK/base/CRu
    nnable.C:81
    #9 0x40052be3 in start_thread () from
    /lib/tls/libpthread.so.0
    
    cat IBM.HWSVRRM.stderr
    (/data3/rootpl/19316/19316.001.806-trace.out)
    IBM.HWSVRRMd:
    /project/spreldvoh/build/rdvohs003a/src/rsct/SDK/rmf/RMCl
    asses.C:10209: void rsct_rmf2v::RMRcp::unreserve():
    Assertion '(pDataInt->itsProperties & 0x80000000) != 0'
    failed.
    
    The trace file is showing:
    
     02:06:02 PM.638338 T(1083370416) _RMF
    RMVerUpdGbl::doUpdates Entered.
     02:06:02 PM.638354 T(1083370416) _RMF             Error
    98305 was returned from
     RMVerUpdGbl::doUpdates on line 786 of
    /project/spreldvoh/build/rdvohs003a/src/rsct/SDK/rmfg/RMV
    erUpdGbl.C.
    
    Message=2645-000 Operation failed due to error 0 returned
    from RMVerUpdGbl::evalQuorum.
    
    FFDCID= ^A
    

Local fix

  • Remove the hmcs from the peer.
    Ensure that CSM is at the latest level.
    Add the HMCs back into the peer one at a time.
    Make sure that the Manager_Configured attribute is "1"
    for the HMCs - run lsrhws -m.
    If it is not "1" then force it to be "1" by using the
    chrhws command.
    

Problem summary

  • It is possible for IBM.HWSVRRM to core due to a code bug.
    In the set_manager_configured function in HWSVRRMDaemon.C,
    a call is made to unreserve twice with the same pointer.
    This second call can potentially cause an assert.
    

Problem conclusion

  • HWSVRRMDaemon.C has been modified to only call unreserve
    once in the set_manager_configured function.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IZ49810

  • Reported component name

    CSM HMC

  • Reported component ID

    5765E88LH

  • Reported release

    171

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Submitted date

    2009-04-22

  • Closed date

    2009-04-22

  • Last modified date

    2016-03-23

  • APAR is sysrouted FROM one or more of the following:

    IZ48069

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    CSM HMC

  • Fixed component ID

    5765E88LH

Applicable component levels

  • R170 PSY

       UP

PTF to Fileset Mapping

[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SGS882","label":"Cluster Systems Management"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"171","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SUPPORT","label":"IBM Worldwide Support"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"171","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
22 August 2022