IBM Support

IT27298: ON PURESCALE SYSTEMS WITH A LARGE NUMBER OF MEMBERS OR FILESYSTEMS, DB2CLUSTER -CFS -ENTER -MAINTENANCE -ALL MAY HANG

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • On Purescale systems with a large number of file systems or
    cluster
    nodes, the output of underlying Spectrum Scale command
    mmshutdown
    can output more than default system pipe buffer capacity. This
    causes
    the mmshutdown command to block and causing the db2cluster
    command
    to hang.
    
    When encountering this error, you may see a procstack and
    proctree on the db2cluster command similar to the following:
    
    procstack:
    waitpid(??, ??, ??) + 0x200
    GPFSCommandAdaptor::executeCommand(const char*,const char*,const
    unsigned int)(0x8, 0x11006a1b0, 0xffffffffffc3040, 0x0) + 0xc4c
    GPFSCommandAdaptor::executeCommandV2(const char*,const
    char*,const unsigned int)(??, ??, ??, ??) + 0x5c
    GPFSCluster::stop(int,char(*)[255])(??, ??, ??) + 0x258
    db2haCFSEnterMaintenanceMode(0x18a8800018a88, 0x110031da0) +
    0x54c
    sqlhaCFSCallFunctionDirect(0x1a0000001a, 0x110171050) + 0x438
    sqlhaCFSCallFunction(??, ??, ??) + 0xc00
    sqlhaCFSEnterMaintenanceMode(??, ??) + 0x3c
    sqlhaUIEnterMaintenanceModeForCFSAll(sqlhaUICB*)(??) + 0xcac
    main(??, ??) + 0x4820
    __start() + 0x70
    
    proctree:
    3932170     /usr/sbin/srcmstr
          7077918     /usr/sbin/sshd
                22937706     sshd:  db2inst  [priv]
                      2496318     sshd:  db2inst@pts/2
                          24772836     -ksh
                                22741122    -ksh
                                      24117288     ./db2cluster -cfs
    -enter -maintenance -all
                                            21692660
    /usr/lpp/mmfs/bin/mmksh  /usr/lpp/mmfs/bin/mmshutdown -a
                                                  25296900
    /usr/lpp/mmfs/bin/mmksh   /usr/lpp/mmfs/bin/mmcommon onall
    /var/mmfs/tmp/nodefile
                                                        20185254
    /usr/bin/perl  /usr/lpp/mmfs/bin/mmdsh  -vF
    /var/mmfs/tmp/cmdTmpDir.mmcommon.2529
    
    22544406     <defunct>
    
    The output of mmshutdown command will also exceed 32k.
    

Local fix

  • Redirect STDOUT and STDERR of mmshutdown command.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * ALL                                                          *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See Error Description                                        *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Upgrade to Db2 11.1 Mod 4 Fixpack 5 or higher                *
    ****************************************************************
    

Problem conclusion

  • First fixed in Db2 11.1 Mod 4 Fixpack 5
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT27298

  • Reported component name

    DB2 FOR LUW

  • Reported component ID

    DB2FORLUW

  • Reported release

    B10

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2018-12-12

  • Closed date

    2020-01-16

  • Last modified date

    2020-01-16

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    DB2 FOR LUW

  • Fixed component ID

    DB2FORLUW

Applicable component levels

  • RB10 PSN

       UP

[{"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Product":{"code":"SSEPGG","label":"Db2 for Linux, UNIX and Windows"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"11.1","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}}]

Document Information

Modified date:
16 January 2020