IBM Support

IT30095: OFFLOAD JOB HANGS DURING CANCEL AND VSNAP SHOWS KERNEL PANIC ERROR

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • When attempting to cancel an offload job in IBM Spectrum Protect
    Plus, the job hangs or takes a very long time to cancel. The job
    log shows messages like:
    
    CTGGA0360 Aborting replication data transfer
    CTGGA0361 Aborting replication data transfer failed
    
    The vSnap server
    
    On the vSnap server from which data is being offloaded, the
    following messages are shown on the console and in the system
    log:
    
    kernel: print_req_error: critical medium error, dev sdd, sector
    1649269361792
    kernel: VERIFY((db = dbuf_hold(dn, blkid, FTAG)) != NULL) failed
    kernel: PANIC at dmu.c:1497:dmu_assign_arcbuf()
    

Local fix

  • Reboot the vSnap server while no other jobs are active apart
    from the hung offload job. Rebooting will resolve the kernel
    panic and will cause the hung job to go into failed state. The
    offload job can then be rerun.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * IBM Spectrum Protect Plus level 10.1.4.                      *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See Error Description.                                       *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing level when available. This problem is currently *
    * projected to be fixed in IBM Spectrum Protect Plus level     *
    * 10.1.5. Note that this is subject to change at the           *
    * discretion of IBM.                                           *
    ****************************************************************
    

Problem conclusion

  • The kernel panic was caused by a null pointer dereference in the
    ZFS filesystem module on vSnap. The problem has been resolved by
    adding appropriate checks for the null pointer during cleanup or
    cancellation of an offload job.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT30095

  • Reported component name

    SP PLUS

  • Reported component ID

    5737SPLUS

  • Reported release

    A14

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2019-08-27

  • Closed date

    2019-09-04

  • Last modified date

    2019-09-04

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SP PLUS

  • Fixed component ID

    5737SPLUS

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSNQFQ","label":"IBM Spectrum Protect Plus"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"A14","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
30 January 2024