Troubleshooting
Problem
Error codes can be used to resolve problems that have definite symptoms
Resolving The Problem
|
Error symptoms
|
|---|
Click here for system service part numbers (if part replacement is needed).
You can use the following information to find solutions to problems that have definite symptoms.
Note: If diagnostic error messages appear that are not listed in the following tables, make sure that your BladeCenter unit has the latest level of firmware code installed.
If you have just added a new option and your system is not working, complete the following procedure before using the troubleshooting charts:
- Remove the option that you just added.
- Run the diagnostic tests to determine if your system is running correctly.
- Reinstall the new device.
|
Device
|
Suggested action
|
|---|---|
| Blade server problem | |
|
Blade servers turn off for no apparent reason.
|
All blade bays must have a blade server, expansion unit, or filler blade in them. Blade bays that do not have these items installed or have them installed improperly disturb airflow in the BladeCenter unit with an adverse effect on BladeCenter unit cooling. If the BladeCenter unit begins to overheat, blade server processors will begin to slow down and will eventually turn off the system. |
| Blade server does not turn off during Red Hat Linux 7.3 shutdown. |
Issuing any of the Linux shutdown commands shuts down the operating system, but does not turn off the blade server. The blade server shuts down to the point where a message is displayed on the management module saying that the blade server is off, but the blade server power-on LED remains lit. Complete the instructions provided for Red Hat Linux in "Configuring Wake on LAN in the BladeCenter unit" on page 25 of the BladeCenter Hardware Maintenance Manual to enable Wake On LAN for both onboard Broadcom integrated Ethernet controllers. This will allow the blade server to restart using Wake on LAN even if the Linux shutdown is incomplete and the blade server does not turn off. Forcing a blade server to turn off. Complete the following steps to turn off a blade server after an incomplete Linux shutdown:
|
| Blade server does not turn off during SUSE LINUX v8.0 shutdown. |
Issuing any of the Linux shutdown commands shuts down the operating system but does not turn off the blade server. The blade server shuts down to the point where a message is displayed on the management module saying that the blade server is off, but the blade server power-on LED remains lit. A blade server running SUSE Linux 8.0 will not turn off during shutdown if ACPI is not enabled. Complete the following steps to enable ACPI:
|
| Blade server does not turn on, the amber system-error LED on the BladeCenter system-LED panel is lit, the amber blade error LED on the blade server LED panel is lit, and the system-error log contains the following message: "CRUs MisMatched" |
The problem occurs after installing the second microprocessor option or after replacing a failed microprocessor in a two-way Blade server. The processor with the lowest feature set and stepping level must be used as the Bootstrap Processor (BSP). This is Microprocessor 1 in location U66. Move the microprocessor in location U66 to location U70, and move the microprocessor in location U70 to location U66. |
| CD-ROM cannot be configured with a JS20 blade server but works with an HSxx blade server. | The customer interface card (replacement part number (FRU) 59P6629) is not compatible with the JS20 blade server and must be replaced with the latest level FRU. Note: The customer interface card installed in the BladeCenter is listed as the "media tray" instead of the "customer interface card" on the Hardware VPD view of the management-module web interface. The FRU number is also printed on the customer interface card. Replace the customer interface card with the latest level. Click System service parts listing - IBM eServer BladeCenter (Type 8677) for the correct FRU number. |
| Some components do not report environmental status (temperature, voltage). | The green status dot for a component is not automatically a link to environmental information (temperature and voltage) for the component. Only the management module and blade servers have environmental information, and only the green dot for those components contains a link to environmental information. |
| Switching KVM control between blade servers gives USB device error. | If a blade server is under heavy load, it can take several minutes before it enumerates the USB devices connected to it. If control of the KVM and media tray is switched away from the blade server before this enumeration is complete, a USB device installation error might be displayed. Do not switch KVM control between blade servers until the mouse and keyboard are both working on the blade server that has control of the KVM and media tray. |
| "Unsafe Removal of Device" error message appears on blade server running Microsoft Windows 2000. |
Before switching ownership of the media tray to another blade server, safely stop the media tray devices on the blade server that currently owns the media tray, as follows:
|
| "Media not found" error message and other file system error occur on a blade server running Linux or DOS. |
Attempting to access the mounted CD-ROM drive or diskette drive (media tray) after it has been switched to another blade server results in I/O errors, even if the media tray has been switched back. Note: Because the BladeCenter unit uses a USB bus to communicate with the media tray devices, switching ownership of the media tray to another blade server is the same as unplugging a USB device.
|
| Linux does not install form the BladeCenter CD-ROM drive or will not start afterward. |
Neither Red Hat Linux 7.3 nor SUSE Linux version 8.0 will install locally, although Red Hat Linux might appear to install.
Download the latest operating system installation instructions for your operating system from the IBM Support Web page at http://www.ibm.com/systems/support/. The necessary workaround is described in the instructions for your operating system. |
| X does not start in Red Hat 7.3 when the blade server does not own the KVM. |
The X Window does not start in a blade server if the blade server is not the current owner of the keyboard, video, and mouse (KVM). Because the blade server does not own the KVM, the X Window cannot find the mouse. Depending on the Red Hat Linux kernel version, do one of the following:
|
| Remote control does not work with default SUSE Linux, version 8.0, display settings. |
The remote console requires a display setting of 1024x768@60Hz in the blade server operating system. The default resolution in SUSE is 1024x768, but the default refresh rate falls somewhere between 50Hz and 60Hz. The remote console does not work for a blade server running SUSE with a display refresh rate other than exactly 60Hz. The message "eServer/No video available" displays. Other operating systems do not exhibit the problem. Set the refresh rate in the XF86Config file to exactly 60Hz. There are two methods.
|
| Diskette drive problems | |
| Diskette drive is seen as /dev/sdx by Red Hat Linux 7.3 and SUSE version 8.0 |
Both Red Hat Linux and SUSE Linux see the diskette drive as /dev/sdx (where x is the last SCSI device in the line) instead of /dev/fd0. When you try to mount the drive as fd0, the Linux operating system does not recognize the device ("not a valid block device"). There are two options:
|
| Diskette drive problem. |
|
| CD-ROM drive problems | |
| CD-ROM drive is seen as /dev/sr0 by SUSE. |
If the SUSE Linux operating system is installed remotely onto a blade server that is not the current owner of the media tray (CD-ROM drive, diskette drive, and USB port), SUSE sees the CD-ROM drive as /dev/sr0 instead of /dev/cdrom, establish a link between /dev/sr0 and /dev/cdrom as follows:
|
| CD-ROM drive is not recognized after being switched back to blade server running on Windows 2000 Advanced Server with SP3 applied. |
When the CD-ROM drive is owned by blade server x, is switched to another blade server, then is switched back to blade server x, the operating system in blade server x no longer recognizes the CD-ROM drive. This happens when you have not safely stopped the drives before switching ownership of the CD-ROM drive, diskette drive, and USB port (media tray). Note: Because the BladeCenter unit uses a USB bus to communicate with the media tray devices, switching ownership of the media tray to another blade server is the same as unplugging a USB device. Before switching ownership of the CD-ROM drive (media tray) to another blade server, safely stop the media tray devices on the blade server that currently owns the media tray, as follows:
You can now safely switch ownership of the media tray to another blade server. |
| CD-ROM problem |
|
| Ethernet controller problems | |
| Operating systems number Ethernet controllers differently. |
Enumeration of the Ethernet controllers in a blade server is operating-system dependent. In the blade server Configuration/Setup Utility program, the Ethernet port designated as Planar Ethernet 1 is routed to Ethernet switch module 2 and the Ethernet port designated as Planar Ethernet 2 is routed to Ethernet switch module 1. Verify the designations through your operating system settings or by testing:
|
| Ethernet switch module problems | |
| First ping from Ethernet switch module through Telnet reports failure. |
When you use the Ethernet switch module Telnet interface to request the switch module to ping something, the first ping response reports a failure, although the other repetitions might report success. This occurs regardless of whether the switch module port the pinged object is connected to is internal or external, and applies to pinging blade servers but not to pinging the management module or objects connected to its external Ethernet port, such as the network management station. To get accurate results, always specify multiple repetitions (>1) in the ping request, and ignore the first ping response from that request. See the IBM 4-Port Gb Ethernet Switch Module for BladeCenter Installation Guide for instructions on how to ping through the Telnet interface. |
| Ethernet switch-module firmware graphics shows a blank panel when the blade server is present but powered off. |
If the Wake-on-LAN (WOL) feature is disabled on a blade server, and the blade server is turned off, the switch module internal port link to that blade is down. This is not an error, but the graphic of the BladeCenter unit might show a blank panel instead of a blade server in that bay. Note: You can enable or disable the WOL feature on a blade server through the management-module Web interface or through the IBM Director console. Do not rely on the BladeCenter graphic in the Ethernet switch-module firmware Web interface to determine the presence or absence of blade servers in the BladeCenter unit. |
| Ethernet switch-module log reports elapsed time, not time of day. | The timestamp on entries in the Ethernet switch module log uses elapsed time (since last switch restart). The timestamp on entries restarts from 0 each time the switch is restarted, although the entries do remain in order of occurrence. |
| Ethernet disconnect notice will not appear when running Windows 2000. | If an Ethernet cable is accidentally removed from the back of the BladeCenter unit, the small red X (disconnect notice) that normally would appear to indicate that the cable was disconnected will not appear in the bottom right of the screen. The disconnect notice does not appear because the blade server Ethernet controller connects to the Ethernet switch module through integrated circuitry inside the BladeCenter unit. When troubleshooting Ethernet-related problems, ensure that the Ethernetcables on the back of the BladeCenter unit are connected properly. |
| The default IP address set by the Ethernet switch module does not match the one assigned by the management module. | When troubleshooting Ethernet-related problems, ensure that the Ethernet cables on the back of the BladeCenter unit are connected properly. |
| Updating the Ethernet switch module configuration through the management module does not save the switch NVRAM. |
When you use the management-module Web interface to update the Ethernet switch module configuration, the management module firmware writes its settings for the switch module only to the management module NVRAM; it does not write its settings for the switch module to the switch-module NVRAM. If the switch module restarts when the management module is not able to apply the IP address it has in NVRAM for the switch module, the switch module will use whatever IP address it has in its own NVRAM. If the two IP addresses are not the same, you might not be able to manage the Ethernet switch module any more. The management module cannot apply the switch IP address from its NVRAM if:
When you use the management-module Web interface to update the Ethernet switch module configuration, the management module firmware writes its settings for the switch module only to the management module NVRAM; it does not write its settings for the switch module to the switch module NVRAM. If the switch module restarts when the management module is not able to apply the IP address it has in NVRAM for the switch module, the switch module will use whatever IP address it has in its own NVRAM. If the two IP addresses are not the same, you might not be able to manage the Ethernet switch module any more. The management module cannot apply the switch IP address from its NVRAM if:
|
| Keyboard problems | |
| The keyboard is very slow when using an operating system that does not have USB drivers. |
Although the keyboard attached to the BladeCenter unit is a PS/2-style keyboard, communication with it is through a USB bus. When you are running an operating system that does not have USB drivers, such as in the following instances, the keyboard responds very slowly.
|
| The keyboard is very slow when using an operating system that does not have USB drivers. | Sometimes when switching ownership of the KVM to a blade server, the video for the blade server appears almost immediately, but it takes up to 10 or 20 seconds for the mouse and keyboard to be usable. No action required. |
| Pressing F1 brings up browser help instead of performing BladeCenter management functions. | Connecting to the BladeCenter management module through the Web interface does not provide proper coverage for the F1 key. In particular, pressing F1 to access the Configuration/Setup Utility when a blade server is started brings up browser help instead of the Configuration/Setup Utility. This problem is peculiar to the Sun Java browser plug-in. Use the Microsoft virtual machine (VM) that is built into the browser. |
| Remote console has keyboard entry problems with Sun Java plug. |
When you are redirecting the server console (remote console function) of a blade server that is running Microsoft Windows 2000 or Windows XP and using the Sun Java plug-in (Java Virtual Machine), the remote console can have keyboard entry problems. Use the Microsoft Java Virtual Machine (JVM) or Java Runtime Environment (JRE) on the blade server instead of using the Sun Java Virtual Machine. The Microsoft JVM comes with the Windows XP Service Pack You can obtain the Microsoft JVM for Windows 2000 from the Microsoft corporation. If you are using the Internet Explorer browser, version 6.0 or later, to log into the management module and use the remote control function, you must also adjust the browser settings:
|
| Management module problems | |
| The management module password cannot be reset. | If you forget the management-module password, you will not be able to access the BladeCenter management module. The management-module password cannot be overridden, and the management module will need to be replaced. |
| Management module does not complete changeover to redundant module on hardware failure. | Replace the management module. |
| Media tray problems | |
| Media tray access is lost temporarily during management module restart. |
When the BladeCenter management module is restarted, use of the media tray (CD-ROM drive, diskette drive, and USB port) is lost temporarily. If you or a failure condition initiates a management module restart while I/O activity is taking place on the media tray, the disruption can interrupt reads to the CD-ROM drive or lose data being written to a diskette. Note: You can restart the management module through the Web interface to the management module or from a network management station such as the IBM Director console. Some failures on the BladeCenter unit can result in the management module restarting automatically. Make sure there is no I/O activity on the media tray before you restart the management module. |
| Monitor problems | |
| The monitor works when you start the BladeCenter unit, but goes blank when you start some application programs in the blade servers. | See the IBM 4-Port Gb Ethernet Switch Module for BladeCenter Installation Guide for instructions on how to ping through the Telnet interface. |
| The monitor displays video for blade server 14 during management module restart. | The monitor attached to the BladeCenter management module normally shows the video output from the blade server that is the current owner of the keyboard, video, and mouse (KVM). When there is no actively selected video from any blade server, the video from blade server 14 is routed to the management module. While the managementmodule is restarting, there is temporarily no current KVM owner. The video from blade server 14 displays on the monitor briefly until the management module uses its NVRAM values to reestablish ownership of the KVM and media tray (CD-ROM drive, diskette drive, and USB port). After that, the video from the blade server that is the current KVM owner displays on the monitor. |
| The screen is blank. |
Verify that:
|
| Only the cursor appears. | See "Undetermined problems" on page 90 of the BladeCenter Hardware Maintenance Manual. |
| The screen is wavy, unreadable, rolling, distorted, or has screen jitter. |
If the monitor self-tests show the monitor is working properly, consider the location the monitor. Magnetic fields around other devices (such as transformers, appliances, fluorescent lights, and other monitors) can cause screen jitter or wavy, unreadable, rolling, or distorted screen images. If this happens, turn off the monitor. (Moving a color monitor while it is turned on might cause screen discoloration.) Then move the device and the monitor at least 305mm (12in) apart. Turn on the monitor. NOTES:
|
| Wrong characters appear on the screen. | If the wrong language is displayed, update the firmware in the management module with the correct language. If the problem remains, replace the management module. |
| Mouse problems | |
| Mouse function lost during Red Hat installation. | If, while installing Red Hat Linux 7.3 to a blade server, you or someone else selects a different blade server as owner of the keyboard, video, and monitor (KVM), you might lose mouse function for the installation process. Do not switch KVM owners until the installation process begins to install the packages (after the "About to Install" window). |
| Mouse is not detected during SUSE installation. | The installation of the SUSE Linux version 8.0 operating system does not detect the mouse. You will need to select the mouse manually. Download the latest operating system installation instructions for your operating system from the IBM Support Web page at http://www.ibm.com/systems/support/. The steps for selecting the mouse are described in the instructions for your operating system. |
| Mouse offset problems occur when using remote console and running X. | When you are using the remote console on a blade server that is running X Windows in Red Hat Linux 7.3 or SUSE Linux version 8.0, you see two cursor arrows on the screen, widely spaced, one white and one black. Configure Linux and X Windows for accurate mouse tracking. See the online help information in the management module configuration and management software for instructions (Blade Tasks . Remote Control, click the circled question mark next to Redirect Server Console, read the section titled "Notes on mouse support under Linux"). |
| Power problems | |
| The system does not power on. |
Verify that:
|
| Option problems | |
| An IBM option that was just installed does not work. |
Verify that:
|
| An IBM option that used to work does not work now. | Verify that all of the option hardware and cable connections are secure. If the option comes with its own test instructions, use those instructions to test the option. If the problem remains, replace the option. |
| Service processor problems | |
| Service processor in the management module reports a general monitor failure. | Disconnect the BladeCenter unit from all electrical sources, wait for 30 seconds, reconnect the BladeCenter unit to the electrical sources, and restart the server. If a problem remains, replace the management module. |
| Switch module problems | |
| Updating the switch-module configuration through the switch does not save the management-module NVRAM. | If you log in to the Ethernet switch module directly (through the Ethernet switch-module Web interface or Telnet interface instead of through the management module Web interface) and update the switch module configuration, saving the new configuration saves only to the switch NVRAM, not to the management-module NVRAM. The management module will not be able to communicate with the switch module. In the management module Web interface, in Switch Tasks . Management, change the New Static IP Configuration values to match the ones in Current IP Configuration, and apply the configuration. |
Note: See the BladeCenter Hardware Maintenance Manual to determine which components should be replaced by a field service technician.
| Support forums |
| Submit a technical question |
| Before you call IBM Service |
Document Location
Worldwide
Was this topic helpful?
Document Information
Modified date:
29 January 2019
UID
ibm1MIGR-55302