Troubleshooting
Problem
Non-Maskable Interrupt (NMI) and Peripheral Component Interconnect (PCI) errors may occur on System x3550 M2 and System x3650 M2 when removing or connecting the Ethernet cable into the Ethernet ports of the Dual-Port Gigabit Ethernet Daughter Card, Option part number 46M1076. When failure symptom occurs, the NMI and PCI error Light Emitting Diode (LED) will be illuminated on the Light Path Diagnostics (LPD) panel. The Integrated Management Module (IMM) will log the following errors: I -- 7/22/2009:19:8:0 -- The System "S/N ABCDEFG" encountered a POST Error I -- 7/22/2009:19:7:19 -- Remote Login Successful. Login ID: USERID from Web at IP address 0.1.1.1 I -- 7/22/2009:19:0:16 -- RECOVERY:A Uncorrectable Bus Error has occurred on system "S/N ABCDEFG" E -- 7/22/2009:18:59:50 -- A Uncorrectable Bus Error has occurred on system "S/N ABCDEFG" E -- 7/22/2009:18:58:40 -- A software NMI has occurred on system "S/N ABCDEFG" The server may reboot after logging the NMI.
Resolving The Problem
Source
RETAIN tip: H197009
Symptom
Non-Maskable Interrupt (NMI) and Peripheral Component Interconnect (PCI) errors may occur on System x3550 M2 and System x3650 M2 when removing or connecting the Ethernet cable into the Ethernet ports of the Dual-Port Gigabit Ethernet Daughter Card, Option part number 46M1076.
When failure symptom occurs, the NMI and PCI error Light Emitting Diode (LED) will be illuminated on the Light Path Diagnostics (LPD) panel.
The Integrated Management Module (IMM) will log the following errors:
|
I -- 7/22/2009:19:8:0 -- The System "S/N ABCDEFG" encountered a POST Error I -- 7/22/2009:19:7:19 -- Remote Login Successful. Login ID: USERID from Web at IP address 0.1.1.1 I -- 7/22/2009:19:0:16 -- RECOVERY:A Uncorrectable Bus Error has occurred on system "S/N ABCDEFG" E -- 7/22/2009:18:59:50 -- A Uncorrectable Bus Error has occurred on system "S/N ABCDEFG" E -- 7/22/2009:18:58:40 -- A software NMI has occurred on system "S/N ABCDEFG" |
The server may reboot after logging the NMI.
Affected configurations
The system may be any of the following IBM servers:
- System x3550 M2, type 4198, any model
- System x3550 M2, type 7946, any model
- System x3550 M3, type 4254, any model
- System x3550 M3, type 7944, any model
- System x3650 M2, type 4199, any model
- System x3650 M2, type 7947, any model
- System x3650 M3, type 4255, any model
- System x3650 M3, type 7945, any model
The system is configured with one or more of the following IBM options:
- Dual-Port Giabit Ethernet Daughter Card, Option part number 46M1076, replacement part number (FRU) 43V7073
This tip is not software specific.
The system is configured with Light Path.
The system's Light Path diagnostic did not complete.
The system's Light Path diagnostic fails.
The system has the symptom described above.
Solution
- Order Miscellaneous Parts Kit Customer Replaceable Unit
(replacement part number) 69Y4505 for System x3650 M2 and System
x3650 M3, or Miscellaneous Kit replacement part number 69Y4506 for
System x3550 M2, and Miscellaneous Kit replacement part number
69Y5639 for System x3550 M3.
This kit contains a metal clip and rubber bumper and two plastic standoffs in addition to screws, latches, baffles, etc.
- Follow the instructions in the following .PDF files to install
a metal clip on the rear Inoput/Output (I/O) panel and rubber
bumper on the system board tray to reduce the movement of the
Ethernet card while removing and connecting the Ethernet cable.
The files are available from the following URL:
Workaround
Re-seat the daughter card.
Additional information
The issue is caused by PCI Express (PCIe) signal degradation due to an impedance change of the connector for the Broadcom Dual-Port daughter card during movement of the Ethernet card.
Document Location
Worldwide
Was this topic helpful?
Document Information
Modified date:
30 January 2019
UID
ibm1MIGR-5084146