IBM Support

IZ82660: TAPE DRIVE EIO ERROR MAY CRASH SYSTEM APPLIES TO AIX 5300-10

A fix is available

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • If a tape drive returns an EIO error during an open,
    (gets hung or non responsive)
    and the tape drive is later removed from the system,
    ie: (rmdev -l rmt0)
    the system may crash later on when sysdumpdev -e
    command gets executed.
    If, after the above event, another device driver gets
    loaded,
    then this specifically sets up the conditions for the
    crash.
    The crash MAY be an ISI_PROC with an errpt entry such as:
    ---------------------------------------------------------
    ------------------
    LABEL:          ISI_PROC
    IDENTIFIER:     3BBD4751
    
    Date/Time:       Wed Mar 17 15:06:21 2010
    Sequence Number: 59
    Machine Id:      002B51CF4C00
    Node Id:         chicis09a
    Class:           S
    Type:            PERM
    WPAR:            Global
    Resource Name:   SYSVMM
    
    Description
    INSTRUCTION STORAGE INTERRUPT
    
    Probable Causes
    SOFTWARE PROGRAM
    
    Failure Causes
    SOFTWARE PROGRAM
    
            Recommended Actions
            IF PROBLEM PERSISTS THEN DO THE FOLLOWING
            CONTACT APPROPRIATE SERVICE REPRESENTATIVE
    
    Detail Data
    ISISR
    0000 0000 0000 9032
    SEGMENT REGISTER, SEGREG
    0000 7FFF FFFF D000
    ISSR0
    6079 0000 4182 00B0
    EXVAL
    0000 0000 0000 000E
    ---------------------------------------------------------
    ------------------
    
    The ISSR0 number may or may not match exactly the
    60790000418200B0 it just so happens this has been
    the number in 5 cases to date.
    But the stack will definitely look like this:
    CRASH INFORMATION:
    CPU 0 CSA F00000002FF47600 at time of crash, error code
    for LEDs: 40000000
    pvthread+015F00 STACK:
    WARNING: bad IAR: 60790000418200B0, display stack from
    LR: 006173EC
    [006173EC]dmp_size+0000EC (0000000000000004 [??])
    [00616990]dmpioctl+000410 (??, ??, ??, ??)
    [004EAAE0]rdevioctl+0000C0 (??, ??, ??, ??, ??, ??)
    [0069E7C0]spec_ioctl+000080 (??, ??, ??, ??, ??, ??)
    [0054A890]vnop_ioctl+000050 (??, ??, ??, ??, ??, ??)
    [0055CCDC]vno_ioctl+00009C (??, ??, ??, ??, ??)
    [005E8F38]common_ioctl+0000F8 (??, ??, ??, ??)
    [00003850]ovlya_addr_sc_flih_main+000130 ()
    [kdb_get_virtual_memory] no real storage @ 2FF22690
    [D0123AB4]D0123AB4 ()
    [kdb_read_mem] no real storage @ FFFFFFFFFFF6460
    (0)>
    

Local fix

  • To prevent this crash, until a fix can be obtained, if
    a tape drive gets hung - do NOT remove it with rmdev
    command.
    

Problem summary

  • crash
    possible stack trace of:
     006173EC dmp_size+0000EC (0000000000000004  ?? )
     00616990 dmpioctl+000410 (??, ??, ??, ??)
     004EAAE0 rdevioctl+0000C0 (??, ??, ??, ??, ??, ??)
     0069E7C0 spec_ioctl+000080 (??, ??, ??, ??, ??, ??)
     0054A890 vnop_ioctl+000050 (??, ??, ??, ??, ??, ??)
     0055CCDC vno_ioctl+00009C (??, ??, ??, ??, ??)
     005E8F38 common_ioctl+0000F8 (??, ??, ??, ??)
     00003850 ovlya_addr_sc_flih_main+000130 ()
     kdb_get_virtual_memory  no real storage @ 2FF22690
     D0123AB4 D0123AB4 ()
     kdb_read_mem  no real storage @ FFFFFFFFFFF6460
    

Problem conclusion

  • Move component dump table function registration in stropen()
    to after all of the error processing, where the open has been
    determined to be successful.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IZ82660

  • Reported component name

    AIX 5.3

  • Reported component ID

    5765G0300

  • Reported release

    530

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Submitted date

    2010-08-11

  • Closed date

    2010-08-11

  • Last modified date

    2013-03-28

  • APAR is sysrouted FROM one or more of the following:

    IZ77002

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    AIX 5.3

  • Fixed component ID

    5765G0300

Applicable component levels

  • R530 PSY U837905

       UP10/09/20 I 1000

PTF to Fileset Mapping

[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11P","label":"APARs - AIX 5.3 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"530","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
28 March 2013