Topic
  • 6 replies
  • Latest Post - ‏2013-03-21T03:22:11Z by SystemAdmin
SystemAdmin
SystemAdmin
6902 Posts

Pinned topic SAN-Boot with ds8100 LUN report disk operation error

‏2013-03-14T09:46:42Z |
All disk are from DS8100,but only below two disk report disk operation error on server after server reboot.
xxx>errpt
B6267342 0310103313 P H hdisk2 DISK OPERATION ERROR
B6267342 0310103313 P H hdisk0 DISK OPERATION ERROR
B6267342 0310103313 P H hdisk2 DISK OPERATION ERROR
B6267342 0310103313 P H hdisk0 DISK OPERATION ERROR
xxx>lsdev -Cc disk
hdisk0 Available 03-08-02 IBM MPIO FC 2107
hdisk1 Available 03-08-02 IBM MPIO FC 2107
hdisk2 Available 03-08-02 IBM MPIO FC 2107
hdisk3 Available 03-08-02 IBM MPIO FC 2107
hdisk4 Available 03-08-02 IBM MPIO FC 2107
hdisk5 Available 03-08-02 IBM MPIO FC 2107
hdisk6 Available 03-08-02 IBM MPIO FC 2107
hdisk7 Available 03-08-02 IBM MPIO FC 2107
hdisk8 Available 03-08-02 IBM MPIO FC 2107
hdisk9 Available 03-08-02 IBM MPIO FC 2107
hdisk10 Available 03-08-02 IBM MPIO FC 2107
hdisk11 Available 03-08-02 IBM MPIO FC 2107
hdisk12 Available 03-08-02 IBM MPIO FC 2107
hdisk13 Available 03-08-02 IBM MPIO FC 2107
hdisk14 Available 03-08-02 IBM MPIO FC 2107
hdisk15 Available 03-08-02 IBM MPIO FC 2107
hdisk16 Available 03-08-02 IBM MPIO FC 2107
xxx>oslevel -s
5300-12-06-1216
xxx>lslpp -L |grep sdd
devices.sddpcm.53.rte 2.6.0.3 C F IBM SDD PCM for AIX V53
xxx>lspv
hdisk1 000316af88acb0bd rootvg active
hdisk3 000316af88acb3fb pagevg active
hdisk4 000316af8747798d sapvg active
hdisk5 000316af87477ce3 sapvg active
hdisk0 000316af7e1be1b0 archbkp active
hdisk2 000316af7e1cdd34 archbkp active
hdisk6 00ca99ea9d278d7c sapvg active
hdisk7 00ca99ea9d2bd209 sapvg active
hdisk8 00ca99ea9d2c1229 sapvg active
xxx>
Updated on 2013-03-21T03:22:11Z at 2013-03-21T03:22:11Z by SystemAdmin
  • ColombianJoker
    ColombianJoker
    68 Posts

    Re: SAN-Boot with ds8100 LUN report disk operation error

    ‏2013-03-14T15:12:38Z  
    Show us the output of

    lspath
    pcmpath query device 0
    pcmpath query device 2
  • SystemAdmin
    SystemAdmin
    6902 Posts

    Re: SAN-Boot with ds8100 LUN report disk operation error

    ‏2013-03-15T07:18:59Z  
    Show us the output of

    lspath
    pcmpath query device 0
    pcmpath query device 2
    xxx>lspath
    Enabled hdisk0 fscsi0
    Enabled hdisk1 fscsi0
    Enabled hdisk2 fscsi0
    Enabled hdisk3 fscsi0
    Enabled hdisk4 fscsi0
    Enabled hdisk5 fscsi0
    Enabled hdisk0 fscsi2
    Enabled hdisk1 fscsi2
    Enabled hdisk2 fscsi2
    Enabled hdisk3 fscsi2
    Enabled hdisk4 fscsi2
    Enabled hdisk5 fscsi2
    ...
    xxx>sudo datapath query device
    ...
    DEV#: 0 DEVICE NAME: hdisk0 TYPE: 2107900 ALGORITHM: Load Balance
    SERIAL: 75W94711100
    ==========================================================================
    Path# Adapter/Path Name State Mode Select Errors
    0 fscsi2/path1 OPEN NORMAL 4304 0
    1 fscsi0/path0 OPEN NORMAL 8202 0

    DEV#: 2 DEVICE NAME: hdisk2 TYPE: 2107900 ALGORITHM: Load Balance
    SERIAL: 75W94711200
    ==========================================================================
    Path# Adapter/Path Name State Mode Select Errors
    0 fscsi2/path1 OPEN NORMAL 85 0
    1 fscsi0/path0 OPEN NORMAL 103 0
    ...
    xxx>
  • dukessd
    dukessd
    345 Posts

    Re: SAN-Boot with ds8100 LUN report disk operation error

    ‏2013-03-19T00:36:13Z  
    xxx>lspath
    Enabled hdisk0 fscsi0
    Enabled hdisk1 fscsi0
    Enabled hdisk2 fscsi0
    Enabled hdisk3 fscsi0
    Enabled hdisk4 fscsi0
    Enabled hdisk5 fscsi0
    Enabled hdisk0 fscsi2
    Enabled hdisk1 fscsi2
    Enabled hdisk2 fscsi2
    Enabled hdisk3 fscsi2
    Enabled hdisk4 fscsi2
    Enabled hdisk5 fscsi2
    ...
    xxx>sudo datapath query device
    ...
    DEV#: 0 DEVICE NAME: hdisk0 TYPE: 2107900 ALGORITHM: Load Balance
    SERIAL: 75W94711100
    ==========================================================================
    Path# Adapter/Path Name State Mode Select Errors
    0 fscsi2/path1 OPEN NORMAL 4304 0
    1 fscsi0/path0 OPEN NORMAL 8202 0

    DEV#: 2 DEVICE NAME: hdisk2 TYPE: 2107900 ALGORITHM: Load Balance
    SERIAL: 75W94711200
    ==========================================================================
    Path# Adapter/Path Name State Mode Select Errors
    0 fscsi2/path1 OPEN NORMAL 85 0
    1 fscsi0/path0 OPEN NORMAL 103 0
    ...
    xxx>
    Can you post the first line of the sense data?

    SC_DISK_ERR2 is a problem accessing the disk down a given path but there can be several reasons for the error.

    Or you can go here:
    http://publibfp.dhe.ibm.com/epubs/pdf/c2343293.pdf

    Page 89 tells you how to decode it your self.

    HTH
  • SystemAdmin
    SystemAdmin
    6902 Posts

    Re: SAN-Boot with ds8100 LUN report disk operation error

    ‏2013-03-21T03:11:32Z  
    • dukessd
    • ‏2013-03-19T00:36:13Z
    Can you post the first line of the sense data?

    SC_DISK_ERR2 is a problem accessing the disk down a given path but there can be several reasons for the error.

    Or you can go here:
    http://publibfp.dhe.ibm.com/epubs/pdf/c2343293.pdf

    Page 89 tells you how to decode it your self.

    HTH
    I post the error data as below, and this alert had been fixed by command.
    After issue command
    xxx:/>pcmquerypr -Vh /dev/hdisk2
    connection type: fscsi0
    open dev: /dev/hdisk2

    Attempt to read reservation key...
    *> ioctl(PR_READ) error; errno = 16 (Device busy)
    *> status_validity=0x1, scsi_bus_status=0x18

    Attempt to read reservation key...
    *> ioctl(PR_READ) error; errno = 16 (Device busy)
    *> status_validity=0x1, scsi_bus_status=0x18

    Attempt to read reservation key...
    *> ioctl(PR_READ) error; errno = 16 (Device busy)
    *> status_validity=0x1, scsi_bus_status=0x18

    Attempt to read reservation key...
    *> ioctl(PR_READ) error; errno = 16 (Device busy)
    *> status_validity=0x1, scsi_bus_status=0x18
    xxx:/>

    We found something wrong with the reservation as this is standalone server.
    issue command "relbootrsv vg_name";shutdown -Fr ; pcmquerypr -Vh /dev/hdisk2
    Then, this problem fixed.

    xxx>errpt -aj B6267342 |more

    LABEL: SC_DISK_ERR2
    IDENTIFIER: B6267342

    Date/Time: Sun Mar 17 17:38:53 TAIST 2013
    Sequence Number: 19411
    Machine Id: 00CA99EA4C00
    Node Id: xxx
    Class: H
    Type: PERM
    Resource Name: hdisk2
    Resource Class: disk
    Resource Type: 2107
    Location: U7879.001.DQD0EJ5-P1-C5-T1-W5005076306180665-L4012400000000000
    VPD:
    Manufacturer................IBM
    Machine Type and Model......2107900
    Serial Number...............75W94711200
    EC Level.....................149
    Device Specific.(Z0)........10
    Device Specific.(Z1)........0100
    Device Specific.(Z2)........075
    Device Specific.(Z3)........26007
    ...skipping...

    LABEL: SC_DISK_ERR2
    IDENTIFIER: B6267342

    Date/Time: Sun Mar 17 17:38:53 TAIST 2013
    Sequence Number: 19411
    Machine Id: 00CA99EA4C00
    Node Id: r3dev
    Class: H
    Type: PERM
    Resource Name: hdisk2
    Resource Class: disk
    Resource Type: 2107
    Location: U7879.001.DQD0EJ5-P1-C5-T1-W5005076306180665-L4012400000000000
    VPD:
    Manufacturer................IBM
    Machine Type and Model......2107900
    Serial Number...............75W94711200
    EC Level.....................149
    Device Specific.(Z0)........10
    Device Specific.(Z1)........0100
    Device Specific.(Z2)........075
    Device Specific.(Z3)........26007
    Device Specific.(Z4)........08
    Device Specific.(Z5)........00

    Description
    DISK OPERATION ERROR

    Probable Causes
    DASD DEVICE

    Failure Causes
    DISK DRIVE
    DISK DRIVE ELECTRONICS

    Recommended Actions
    PERFORM PROBLEM DETERMINATION PROCEDURES

    Detail Data
    PATH ID
    1
    SENSE DATA
    0600 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0118 0000 0000 0000
    ...skipping...
    Device Specific.(Z3)........26007
    Device Specific.(Z4)........08
    Device Specific.(Z5)........00

    Description
    DISK OPERATION ERROR

    Probable Causes
    DASD DEVICE

    Failure Causes
    DISK DRIVE
    DISK DRIVE ELECTRONICS

    Recommended Actions
    PERFORM PROBLEM DETERMINATION PROCEDURES
  • SystemAdmin
    SystemAdmin
    6902 Posts

    Re: SAN-Boot with ds8100 LUN report disk operation error

    ‏2013-03-21T03:22:07Z  
    • dukessd
    • ‏2013-03-19T00:36:13Z
    Can you post the first line of the sense data?

    SC_DISK_ERR2 is a problem accessing the disk down a given path but there can be several reasons for the error.

    Or you can go here:
    http://publibfp.dhe.ibm.com/epubs/pdf/c2343293.pdf

    Page 89 tells you how to decode it your self.

    HTH
    SENSE DATA
    0600 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0118 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0200 0000 0000 0000 0000 0000 0000 0000 0003 0000
    0000 003D 001A
    To Duke: Yes,you are right. Sense data: 0118 indicate that the scsi disk reservation error.
  • SystemAdmin
    SystemAdmin
    6902 Posts

    Re: SAN-Boot with ds8100 LUN report disk operation error

    ‏2013-03-21T03:22:11Z  
    • dukessd
    • ‏2013-03-19T00:36:13Z
    Can you post the first line of the sense data?

    SC_DISK_ERR2 is a problem accessing the disk down a given path but there can be several reasons for the error.

    Or you can go here:
    http://publibfp.dhe.ibm.com/epubs/pdf/c2343293.pdf

    Page 89 tells you how to decode it your self.

    HTH
    SENSE DATA
    0600 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0118 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
    0000 0000 0000 0000 0000 0000 0200 0000 0000 0000 0000 0000 0000 0000 0003 0000
    0000 003D 001A
    To Duke: Yes,you are right. Sense data: 0118 indicate that the scsi disk reservation error.