Topic
3 replies Latest Post - ‏2013-01-21T14:30:40Z by Novikov_Alexander
SystemAdmin
SystemAdmin
9111 Posts
ACCEPTED ANSWER

Pinned topic ServRAID 8k controller does not recognize WD10EFRX

‏2013-01-18T15:22:28Z |
Hi,

I originally posted this to the absolutely wrong WebSphere support forum, so reposting here once more.

two of our Samsung SpinPoint F1 harddisks failed in the last couple of weeks in one of our x3650 servers. We replaced the first one with a Western Digital WD1003FBYX drive that got recognized instantly without problems.

The second replacement is a Western Digital WD10EFRX drive and as it seems the server just does not see the drive.

Firmware and BIOS of both the server and the RAID controller are up to date.

What I've tried so far:

  • simple plug it in as a replacement for the failing drive:
the drive's lights flash a couple of times and the server's DASD LED turns off, but nevertheless the drive is not seen

  • reboot the server and make a CTRL-A "Drive Scan" from within the controller's BIOS
the drive does not show up in the list of found drives

  • reboot the server and leave only the single WD10EFRX drive in the server
the server refuses to boot because no harddrive/array is found

  • try the same procedures with another spare WD10EFRX drive
with exactly the same results as above

  • try the WD10EFRX drive in a completely different server (Fujitsu)
the drive is recognized instantly

This is what arconf says about the controller:

---------CUT--------
% ./arconf GETCONFIG 1
Controllers found: 1
Controller information
Controller Status : Okay
Channel description : SAS/SATA
Controller Model : IBM ServeRAID 8k
Controller Serial Number : 1BDE6A20
Physical Slot : 0
Installed memory : 256 MB
Copyback : Enabled
Data scrubbing : Enabled
Defunct disk drive count : 0
Logical drives/Offline/Critical : 1/0/1
Controller Version Information
BIOS : 5.2-0 (17003)
Firmware : 5.2-0 (17003)
Driver : 1.1-7 (28000)
Boot Flash : 5.1-0 (15411)
Controller Battery Information
Status : Okay
Over temperature : No
Capacity remaining : 100 percent
Time remaining (at current draw) : 3 days, 20 hours, 36 minutes
Controller Vital Product Data
VPD Assigned# : 39R8875
EC Version# : J85096
Controller FRU# : 25R8076
Battery FRU# : 25R8088

Logical drive information
Logical drive number 1
Logical drive name : main
RAID level : 1
Status of logical drive : Critical
Size : 953699 MB
Read-cache mode : Enabled
Write-cache mode : Enabled (write-back)
Write-cache setting : Enabled (write-back) when protected by battery
Partitioned : Yes
Number of segments : 2
Drive(s) (Channel,Device) : DDD 0,0
Defunct segments : No

Physical Device information
Device #0
Device is a Hard drive
State : Online
Supported : Yes
Transfer Speed : SATA 3.0 Gb/s
Reported Channel,Device : 0,0
Reported Location : Enclosure 0, Slot 0
Reported ESD : 2,0
Vendor : WDC
Model : WD1003FBYX-01Y7B
Firmware : 01.01V02
Serial number : WD-WCAW34316439
Size : 953869 MB
Write Cache : Enabled (write-back)
FRU : None
PFA : No
Device #2
Device is an Enclosure services device
Reported Channel,Device : 2,0
Enclosure ID : 0
Type : SES2
Vendor : IBM-ESXS
Model : VSC7160
Firmware : 1.06
Status of Enclosure services device
Temperature : Normal
---------CUT--------

and for the servers BIOS:

---------CUT--------
% dmidecode -t bios
dmidecode 2.11
SMBIOS 2.4 present.

Handle 0x0000, DMI type 0, 24 bytes
BIOS Information
Vendor: IBM
Version: -http://GGEL49AUS-1.19-
Release Date: 02/11/2011
Address: 0xF17A0
Runtime Size: 59488 bytes
ROM Size: 4096 kB
http://...
---------CUT--------

the adapter's devlice log shows something that I don't understand, however:

---------CUT--------
% ./arcconf GETLOGS 1 DEVICE
Controllers found: 1
<ControllerLog controllerID="0" type="0" time="1358516156" version="1">
<driveErrorEntry adapterID="0" channelID="0" deviceID="0" numParityErrors="0" linkFailures="1" hwErrors="0" slotNum="0" enclIndex="0" abortedCmds="0" mediumErrors="0"/>
<driveErrorEntry adapterID="0" channelID="0" deviceID="1" numParityErrors="0" linkFailures="79" hwErrors="0" slotNum="1" enclIndex="0" abortedCmds="0" mediumErrors="0"/>
</ControllerLog>
---------CUT--------

Any ideas on how to use those drives with the controller?

Thanks in advance!
Updated on 2013-01-21T14:30:40Z at 2013-01-21T14:30:40Z by Novikov_Alexander
  • Novikov_Alexander
    Novikov_Alexander
    6863 Posts
    ACCEPTED ANSWER

    Re: ServRAID 8k controller does not recognize WD10EFRX

    ‏2013-01-18T17:30:40Z  in response to SystemAdmin
    Dear Udo,

    could you provide DSA log of your server.

    Regards,
    Alexander Novikov
    Russia, Moscow
    • SystemAdmin
      SystemAdmin
      9111 Posts
      ACCEPTED ANSWER

      Re: ServRAID 8k controller does not recognize WD10EFRX

      ‏2013-01-21T11:55:28Z  in response to Novikov_Alexander
      Hi Alexander,

      the logs are attached.

      Thanks

      Udo
      • Novikov_Alexander
        Novikov_Alexander
        6863 Posts
        ACCEPTED ANSWER

        Re: ServRAID 8k controller does not recognize WD10EFRX

        ‏2013-01-21T14:30:40Z  in response to SystemAdmin
        Hi Udo,

        first of all update all server's software (BMC, ...) and HDDs backplane:

        System x3650 and x3655 SAS/SATA hot-swap backplane firmware ...
        http://www.ibm.com/support/entry/portal/docdisplay?lndocid=MIGR-5072146

        Inform us about p/n and FRU p/n of your HDD.

        Pay attention that non-supported OS used. Use any supported OS version from compatibility list:

        Red Hat Operating System
        OS and OS Extension Information
        http://www-03.ibm.com/systems/info/x86servers/serverproven/compat/us/nos/redchatold.html

        Regards,
        Alexander Novikov
        Russia, Moscow