Troubleshooting
Problem
IBM ServeRAID 8k controllers, Option 25R8064 (FRU 25R8076), may report a "Controller Kernel Stopped Running" error during POST, a "Battery failed" assertion in the BladeCenter management module logs, a lit RAID or DASD/RAID LED on the Diagnostic panel, and battery state changes in the ServeRAID logs. The full messages are listed under Symptom.
Resolving The Problem
Source
RETAIN tip: H191923
Symptom
The following failure symptoms are related to IBM ServeRAID 8k controllers, Option 25R8064 (replacement part number 25R8076):
When System x and BladeCenter servers are powered on, or after the Adaptec (Ctrl+A) BIOS utility is entered, the server may display the following message in POST:
| Error: Controller Kernel Stopped Running << Press any key to continue >> |
The following message may appear in the BladeCenter Advanced Management Module (AMM) or Management Module (MM) logs:
| System internal expansion board 2 (Battery - BSE3 RAID BAT): Assertion: Battery failed. |
When some System x and BladeCenter servers are restarted, the RAID or DASD/RAID LED on the Diagnostic panel may be lit, and the Baseboard Management Controller (BMC)/IPMI logs may show the following message:
| GenerationID= 0020 Sensor Type= 0D Deassertion Event/Assertion Event Drive Slot (Bay) Discrete |
The ServeRAID Controller 1 logs.txt file may report the following messages:
| INF localhost Bad battery improved to the good battery state: controller 1 |
| WRN localhost Good battery degraded to the low battery state: |
| INF localhost Drive inserted: controller 1, channel 0, SCSI ID 0 |
| WRN localhost An error occurred while accessing the logical drive: controller 1, logical drive 1 |
| INF localhost Drive inserted: controller 1, channel 0, SCSI ID 0 |
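The log messages above can be screened for the battery pattern with a short script. The following is a minimal sketch, not an IBM tool: it assumes each log entry has the layout "LEVEL host message" as in the sample messages, and the function names battery_events and battery_state_is_flapping are hypothetical.

```python
import re

# Assumed entry layout, inferred from the sample messages: "<LEVEL> <host> <message>"
ENTRY_RE = re.compile(r"^([A-Z]{3})\s+(\S+)\s+(.*)$")


def battery_events(lines):
    """Return (level, message) pairs for entries that mention the battery."""
    events = []
    for raw in lines:
        match = ENTRY_RE.match(raw.strip())
        if not match:
            continue  # skip lines that do not follow the assumed layout
        level, _host, message = match.groups()
        if "battery" in message.lower():
            events.append((level, message))
    return events


def battery_state_is_flapping(lines):
    """True when the log holds both an INF and a WRN battery entry,
    which is the improve/degrade pattern described in this tip."""
    levels = {level for level, _ in battery_events(lines)}
    return {"INF", "WRN"} <= levels


sample = [
    "INF localhost Bad battery improved to the good battery state: controller 1",
    "WRN localhost Good battery degraded to the low battery state:",
    "INF localhost Drive inserted: controller 1, channel 0, SCSI ID 0",
]
print(battery_state_is_flapping(sample))
```

A positive result only suggests that the log matches the symptom; it does not confirm the root cause.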
Affected configurations
The system may be any of the following IBM servers:
- BladeCenter HS21 XM, type 1915, any model
- BladeCenter HS21 XM, type 7995, any model
- BladeCenter HS21, type 1885, any model
- BladeCenter HS21, type 8853, any model
- BladeCenter LS41, type 7972, any model
- System x3400, type 7973, any model
- System x3400, type 7974, any model
- System x3400, type 7975, any model
- System x3400, type 7976, any model
- System x3500, type 7977, any model
- System x3550, type 1913, any model
- System x3550, type 7978, any model
- System x3650, type 1914, any model
- System x3650, type 7979, any model
- System x3655, type 7985, any model
- System x3755, type 8877, any model
The system is configured with one or more of the following IBM Options:
- ServeRAID-8k SAS Controller, Option 25R8064
This tip is not software specific.
Solution
Download and install ServeRAID firmware revision 15414 or higher.
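Where several servers are affected, the revision requirement can be checked against an inventory. This is a minimal sketch under stated assumptions: the inventory is a hypothetical dictionary of server name to installed firmware revision, collected by whatever means is available, and servers_needing_update is an illustrative name.

```python
REQUIRED_REVISION = 15414  # minimum ServeRAID firmware revision named in this tip


def servers_needing_update(inventory):
    """Return the names of servers whose revision is below the required one.

    `inventory` is a hypothetical mapping: server name -> installed revision (int).
    """
    return sorted(
        name for name, revision in inventory.items()
        if revision < REQUIRED_REVISION
    )


# Example with made-up server names and revisions
print(servers_needing_update({"blade01": 15000, "x3650-a": 15414, "x3550-b": 16000}))
```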
Workaround
OPTION 1:
Shut down the failing server and remove the original battery.
Remove a known-good, fully charged ServeRAID 8K battery from another system and install it on the ServeRAID 8K controller in the failing server. Power on the server, press F1 to enter Setup, and leave the system idle for five minutes. Then power down the server, remove the fully charged battery, and reinstall the original battery so that it begins charging.
Follow the steps below to check the battery status from ServeRAID Manager 8.40 or higher:
- Open the ServeRAID Manager 8.40 application or boot to the ServeRAID Support CD v8.40.
- In the Enterprise View panel, right-click Controller 1 (IBM ServeRAID 8K), and select Properties.
- Select the Status tab to view the Battery status, temperature readings, and charge information.
OPTION 2:
Order one replacement battery, replacement part number 25R8088, which has a 30 percent charge, enough to allow the Programmable Interrupt Controller (PIC) to be flashed. If multiple machines are affected, reuse this charged battery to flash the other systems once the PIC on the first system has been flashed successfully.
Additional information
The ServeRAID 8K battery has not failed and does not require replacement. The system board has not failed and does not require replacement.
The failure occurs because the on-board SAS controller initially cannot flash the PIC that is built into the ServeRAID 8K controller while the battery is in a very low, discharged state. This state may result from batteries not being connected properly to the 8K controllers in IBM System x and BladeCenter systems.
The ServeRAID 8K controller contains the PIC code and the battery charging circuitry. The PIC code is currently at version 103 and must be flashed to version 106 or higher by the SAS controller firmware that resides on the planar. A charged battery must supply at least 3 volts before the PIC code update can begin. Once that voltage is reached and the PIC has been updated successfully, the controller can start the expected charging process on a ServeRAID 8K battery.
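The dependency between PIC version and battery voltage described above can be expressed as a small decision function. This is an illustrative sketch only: the thresholds come from this tip, while the function name, the state labels, and the idea of supplying the readings as plain numbers are assumptions, not part of any ServeRAID interface.

```python
MIN_PIC_VERSION = 106    # PIC code must be at this version or higher
MIN_FLASH_VOLTAGE = 3.0  # volts required before the PIC update can begin


def pic_update_state(pic_version, battery_volts):
    """Classify a controller using the conditions described in this tip.

    Returns one of three hypothetical labels:
      "current"          PIC code is already at the required version
      "ready_to_flash"   PIC is old, but the battery can support the update
      "battery_too_low"  PIC is old and the battery is below 3 volts
    """
    if pic_version >= MIN_PIC_VERSION:
        return "current"
    if battery_volts >= MIN_FLASH_VOLTAGE:
        return "ready_to_flash"
    return "battery_too_low"


print(pic_update_state(103, 2.1))
```

The "battery_too_low" case corresponds to the situation the workaround options address by temporarily supplying a charged battery.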
The new firmware fix will wait for the battery to become fully charged before trying to erase and reprogram the PIC.
Document Location
Worldwide
Document Information
Modified date:
29 January 2019
UID
ibm1MIGR-5072922