IBM Support

IBM hard drives go offline during error recovery - IBM ServeRAID

Troubleshooting


Problem

Microsoft Server 2000 or Server 2003 environments may report the following system (hardware) event messages, while other operating systems will report similar events: - Lost Delayed-Write Data: The system was attempting to transfer file data from buffersto \Device\HarddiskVolumex. The write operation failed, and only some of the data may have been written to the file. - The device, \Device\SCSI\nfrd960, did not respond within the timeout period. An error was detected on device \Device\Harddiskx\DR1 during a paging operation. Other symptoms that may occur: - Command timeouts may be present which would eventually cause the array to become offline. - The controller may report the array offline due to a drive failure. - The defined Hot Spare drive may fail during a rebuild process that may cause another drive in the array to fail prematurely when trying to stripe data across other drives. - A drive may fail to rebuild when reaching 100% complete. - IBM ServeRAID Manager or the IPSSEND ut

Resolving The Problem

Source

RETAIN tip: H192308

Symptom

Microsoft Server 2000 or Server 2003 environments may report the following system (hardware) event messages, while other operating systems will report similar events:

  • Lost Delayed-Write Data: The system was attempting to transfer file data from buffers to \Device\HarddiskVolumex. The write operation failed, and only some of the data may have been written to the file.
  • The device, \Device\SCSI\nfrd960, did not respond within the timeout period. An error was detected on device \Device\Harddiskx\DR1 during a paging operation.

Other symptoms that may occur:

  • Command timeouts may be present which would eventually cause the array to become offline.
  • The controller may report the array offline due to a drive failure.
  • The defined Hot Spare drive may fail during a rebuild process that may cause another drive in the array to fail prematurely when trying to stripe data across other drives.
  • A drive may fail to rebuild when reaching 100% complete.
  • IBM ServeRAID Manager or the IPSSEND utility may show the state of a drive rebuilding, but the percentage remains at zero percent for a long period of time.
Affected configurations

This tip is not machine specific.

The following firmware level(s) are affected:

JP83, JP84, JP85

The system is configured with one or more of the following IBM Options:

  • Hard Drive SCSI (Hot-swap) - 146 GB, Option 32P0728
  • Hard Drive SCSI (Hot-swap) - 146 GB, Option 90P1382
  • Hard Drive SCSI (Hot-swap) - 300 GB, Option 90P1307
  • Hard Drive SCSI (Hot-swap) - 73 GB, Option 90P1305
  • Hard Drive SCSI (Hot-swap) - 73 GB, Option 90P1381
  • ServeRAID-4Lx Ultra160 SCSI Controller, Option 06P5740, replacement part number 06P5741
  • ServeRAID-4Mx Ultra160 SCSI Controller, Option 06P5736, replacement part number 06P5737
  • ServeRAID-5i Controller, Option 25P3492, replacement part number 32P0016
  • ServeRAID-6M Controller (128MB Cache), Option 32P0033, replacement part number 02R0985
  • ServeRAID-6M Controller (256MB Cache), Option 02R0988, replacement part number 02R0998
  • ServeRAID-6i Controller, Option 71P8595, replacement part number 71P8627
  • ServeRAID-7k Controller, Option 71P8642, replacement part number 71P8644
  • System Storage EXP300, Option 35311RU
  • System Storage EXP300, Option 35311RX
  • System Storage EXP300, Option 35314RX

This tip is not software specific.

Solution

Download the IBM Critical Hard Drive Update Program CD, version 1.19b.

Boot to the CD to update the affected drives to firmware level JP86.

Additional information

Under certain work conditions, when the hard drive begins to experience an unusually increase of heavy I/O activity, drives may begin to fail at high failure rates.

Updating these drives, that are identified as GNSxxxx (Genesis) and BBDxxxx (BlackBird), to firmware JP86, corrects the drives from getting marked defunct prematurely.

The affected drives may be in any of these options, but not all options contain the affected drives.

Other drives affected are:

  • Hard Drive SCSI (Hot-swap) 300 GB, Option 26K5761
  • Hard Drive SCSI (Hot-swap) 146 GB, Option 25R4912
  • Hard Drive SCSI (Hot-swap) 146 GB, Option 25R4863
  • Hard Drive SCSI (Hot-swap) 73 GB, Option 25R4906
  • Hard Drive SCSI (Hot-swap) 73 GB, Option 25R4903
  • Hard Drive SCSI (Hot-swap) 73 GB, Option 25R4909
  • Hard Drive SCSI (Hot-swap) 73 GB, Option 90P1318
  • Hard Drive SCSI (Hot-swap) 73 GB, Option 25R4860
  • Hard Drive SCSI (Hot-swap) 36 GB, Option 26K5745
  • Hard Drive SCSI (Hot-swap) 36 GB, Option 26K5748
  • Hard Drive SCSI (Hot-swap) 36 GB, Option 90P1318
  • Hard Drive SCSI (Hot-swap) 36 GB, Option 90P1380

Other expansion units, that the drives are supported in, would be the IBM EXP400 Storage Expansion Unit (Type 1733).

Document Location

Worldwide

Operating System

System x Hardware Options:Windows Server 2003

System x Hardware Options:Windows Server 2003 x86-64

System x Hardware Options:Windows 2000

[{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"QU00HFE","label":"System x Hardware Options->Storage expansion->EXP300->35311RU"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU00HFF","label":"System x Hardware Options->Storage expansion->EXP300->35311RX"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"QU00JXU","label":"System x Hardware Options->ServeRAID->ServeRAID-4x->06P5740"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU00JXV","label":"System x Hardware Options->ServeRAID->ServeRAID-4x->06P5736"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"QU00MIW","label":"System x Hardware Options->ServeRAID->ServeRAID-5x->25P3492"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU00NWU","label":"System x Hardware Options->Hard drive - SCSI (Hot-Swap)->146 GB->32P0728"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"QU00PGO","label":"System x Hardware Options->ServeRAID->ServeRAID-6x->32P0033"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU00PGP","label":"System x Hardware Options->ServeRAID->ServeRAID-6x->02R0988"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SUNSET","label":"PRODUCT REMOVED"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"SUNSET","label":"PRODUCT REMOVED"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU00WSX","label":"System x Hardware Options->Hard drive - SCSI (Hot-Swap)->73 GB->90P1305"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU00WSZ","label":"System x Hardware Options->Hard drive - SCSI (Hot-Swap)->36 GB->90P1318"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU01AZZ","label":"System x Hardware Options->Hard drive - SCSI (Hot-Swap)->300 GB->90P1307"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU01BOD","label":"System x Hardware Options->Hard drive - SCSI (Hot-Swap)->73 GB->90P1381"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU01BOF","label":"System x Hardware Options->Hard drive - SCSI (Hot-Swap)->146 GB->90P1382"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU01BOQ","label":"System x Hardware Options->Hard drive - SCSI (Hot-Swap)->36 GB->90P1380"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QUOFDV2","label":"System x Hardware Options->Storage expansion->EXP300->35314RX"},"Platform":[{"code":"PF033","label":"Windows"}],"Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
29 January 2019

UID

ibm1MIGR-5069889