IBM Support

NMI may occur when network cables are removed or connected - IBM System x

Troubleshooting


Problem

Non-Maskable Interrupt (NMI) and Peripheral Component Interconnect (PCI) errors may occur on System x3550 M2 and System x3650 M2 when removing or connecting the Ethernet cable into the Ethernet ports of the Dual-Port Gigabit Ethernet Daughter Card, Option part number 46M1076. When failure symptom occurs, the NMI and PCI error Light Emitting Diode (LED) will be illuminated on the Light Path Diagnostics (LPD) panel. The Integrated Management Module (IMM) will log the following errors: I -- 7/22/2009:19:8:0 -- The System "S/N ABCDEFG" encountered a POST Error I -- 7/22/2009:19:7:19 -- Remote Login Successful. Login ID: USERID from Web at IP address 0.1.1.1 I -- 7/22/2009:19:0:16 -- RECOVERY:A Uncorrectable Bus Error has occurred on system "S/N ABCDEFG" E -- 7/22/2009:18:59:50 -- A Uncorrectable Bus Error has occurred on system "S/N ABCDEFG" E -- 7/22/2009:18:58:40 -- A software NMI has occurred on system "S/N ABCDEFG" The server may reboot after logging the NMI.

Resolving The Problem

Source

RETAIN tip: H197009

Symptom

Non-Maskable Interrupt (NMI) and Peripheral Component Interconnect (PCI) errors may occur on System x3550 M2 and System x3650 M2 when removing or connecting the Ethernet cable into the Ethernet ports of the Dual-Port Gigabit Ethernet Daughter Card, Option part number 46M1076.

When failure symptom occurs, the NMI and PCI error Light Emitting Diode (LED) will be illuminated on the Light Path Diagnostics (LPD) panel.

The Integrated Management Module (IMM) will log the following errors:

 

I -- 7/22/2009:19:8:0 -- The System "S/N ABCDEFG" encountered a POST Error

I -- 7/22/2009:19:7:19 -- Remote Login Successful. Login ID: USERID from Web at IP address 0.1.1.1

I -- 7/22/2009:19:0:16 -- RECOVERY:A Uncorrectable Bus Error has occurred on system "S/N ABCDEFG"

E -- 7/22/2009:18:59:50 -- A Uncorrectable Bus Error has occurred on system "S/N ABCDEFG"

E -- 7/22/2009:18:58:40 -- A software NMI has occurred on system "S/N ABCDEFG"

The server may reboot after logging the NMI.

Affected configurations

The system may be any of the following IBM servers:

  • System x3550 M2, type 4198, any model
  • System x3550 M2, type 7946, any model
  • System x3550 M3, type 4254, any model
  • System x3550 M3, type 7944, any model
  • System x3650 M2, type 4199, any model
  • System x3650 M2, type 7947, any model
  • System x3650 M3, type 4255, any model
  • System x3650 M3, type 7945, any model

The system is configured with one or more of the following IBM options:

  • Dual-Port Giabit Ethernet Daughter Card, Option part number 46M1076, replacement part number (FRU) 43V7073

This tip is not software specific.

The system is configured with Light Path.

The system's Light Path diagnostic did not complete.

The system's Light Path diagnostic fails.

The system has the symptom described above.

Solution

  1. Order Miscellaneous Parts Kit Customer Replaceable Unit (replacement part number) 69Y4505 for System x3650 M2 and System x3650 M3, or Miscellaneous Kit replacement part number 69Y4506 for System x3550 M2, and Miscellaneous Kit replacement part number 69Y5639 for System x3550 M3.

    This kit contains a metal clip and rubber bumper and two plastic standoffs in addition to screws, latches, baffles, etc.

  2. Follow the instructions in the following .PDF files to install a metal clip on the rear Inoput/Output (I/O) panel and rubber bumper on the system board tray to reduce the movement of the Ethernet card while removing and connecting the Ethernet cable.

    The files are available from the following URL:

Workaround

Re-seat the daughter card.

Additional information

The issue is caused by PCI Express (PCIe) signal degradation due to an impedance change of the connector for the Broadcom Dual-Port daughter card during movement of the Ethernet card.

 

Document Location

Worldwide

Operating System

System x:Operating system independent / None

System x Hardware Options:Operating system independent / None

[{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"HWX30","label":"System x->System x3550 M2"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"HWX40","label":"System x->System x3650 M2"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"HWX90","label":"System x->System x3550 M3"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"HWXA0","label":"System x->System x3650 M3"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QUOFFCB","label":"System x Hardware Options->Ethernet->Gigabit->46M1076"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
30 January 2019

UID

ibm1MIGR-5084146