IBM Support

Power module transient error over current fault - Servers

Troubleshooting


Problem

What is the issue? - Proper FRU isolation and fault determination when a power module shuts down for an over-current. When does it occur? - When a component on the system loads external of the power supply and fails causing a short circuit. Are there specific steps to create the issue? - This condition will only occur if a system component or FRU has a critical failure that causes a short circuit on the 12V or 48V power domains. How will the customer recognize the problem? - The system error log will have one of two entries depending on whether the over-current was on the 12V or 48V power domains: - "Power module %d over current fault" - "Power module %d 48V over current fault" The power module and it's LEDs will be in the following states during thisevent: - The affected power supply will be shutdown. - The input power good LED will be on. - The output power good LED will be off. - The amber power module fault LED will be off. Are there specific error messages or indicators? - The s

Resolving The Problem

Source

RETAIN tip: H191283

Symptom

What is the issue?

  • Proper FRU isolation and fault determination when a power module shuts down for an over-current.

When does it occur?

  • When a component on the system loads external of the power supply and fails causing a short circuit.

Are there specific steps to create the issue?

  • This condition will only occur if a system component or FRU has a critical failure that causes a short circuit on the 12V or 48V power domains.

How will the customer recognize the problem?

  • The system error log will have one of two entries depending on whether the over-current was on the 12V or 48V power domains:
    • "Power module %d over current fault"
    • "Power module %d 48V over current fault"
  • The power module and it's LEDs will be in the following states during this event:
    • The affected power supply will be shutdown.
    • The input power good LED will be on.
    • The output power good LED will be off.
    • The amber power module fault LED will be off.

Are there specific error messages or indicators?

  • The system error log will have one of two entries depending on whether the over-current was on the 12V or 48V power domains:
    • "Power module %d over current fault"
    • "Power module %d 48V over current fault"
  • The power module and it's LEDs will be in the following states during this event:
    • The affected power supply will be shutdown.
    • The input power good LED will be on.
    • The output power good LED will be off.
    • The amber power module fault LED will be off.

Can you give an example of the problem?

  • The main problem is in determining what FRU failed and caused the short circuit which in turn caused the power supply to over-current.

    Proper FRU failure isolation procedures for an over-current fault:

  1. If in a redundant power configuration, look for any blades, system cards (mux, media tray, etc.), or fan modules that are powered off. The blades, system cards (mux, media tray, etc.), and fan modules will protect themselves and shutdown if they sense a short circuit on their load. If any of the above system components is in this state, remove it, and reset the power module. The power module should turn on and remain on.
  2. If no system components can be isolated per step 1, remove the fan shuttle and look for any physical signs of damage on system connectors (bent pins) and on the system midplane.
  3. If there are no signs of physical damage per step 2, replace the affected power module.
Affected configurations

- The system may be any of the following IBM servers:

BladeCenter Chassis, type 8677, any model
BladeCenter H, type 8852, any model
BladeCenter HT, type 8740, any model
BladeCenter HT, type 8750, any model
BladeCenter T, type 8720, any model
BladeCenter T, type 8730, any model

- This Tip is not software specific.

- This Tip is not option specific.

- The system has the symptom described above.

Solution

None. The event is generated as designed.

Workaround

None.

Additional Information

The event is for informational purposes only. It is not a hardware or firmware failure.

Document Location

Worldwide

Operating System

BladeCenter:Operating system independent / None

[{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"HW20T","label":"BladeCenter E Chassis"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"HW20M","label":"BladeCenter T Chassis"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"LOB57","label":"Power"}},{"Type":"HW","Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"HW20M","label":"BladeCenter T Chassis"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"LOB57","label":"Power"}},{"Type":"HW","Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"HW21Y","label":"BladeCenter H Chassis"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"LOB57","label":"Power"}},{"Type":"HW","Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"HW22Q","label":"BladeCenter HT Chassis"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"LOB57","label":"Power"}},{"Type":"HW","Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"HW22Q","label":"BladeCenter HT Chassis"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"LOB57","label":"Power"}}]

Document Information

Modified date:
10 April 2023

UID

ibm1MIGR-5071602