subscribe iconSubscribe to this information

Linux server or Linux partition symptoms

Use the following tables to find the symptom you are experiencing. If you cannot find your symptom, contact your next level of support.

Choose the description that best describes your situation:

You have a service action to perform

Symptom What you should do:
You have an open service event in the service action event log. Go to Start of call procedure.
You have parts to exchange or a corrective action to perform.
  1. Go to Removing and replacing parts.
  2. Go to the End-of-call procedure.
You need to verify that a part exchange or corrective action corrected the problem.
  1. Go to Verifying the repair.
  2. Go to the End-of-call procedure.
You need to verify correct system operation.
  1. Go to Verifying the repair.
  2. Go to the End-of-call procedure.

Integrated Virtualization Manager (IVM) Problem

Symptom What you should do:

The partitions do not activate - partition configuration is damaged

Restore the partition configuration data using IVM. See Backing up and restoring partition data
Other partition problems when the server is managed by IVM Perform Troubleshooting with the Integrated Virtualization Manager

An LED is not operating as expected

Symptom What you should do:
The system attention LED on the control panel is on. Go to Linux fast-path problem isolation.
The rack identify LED does not operate properly. Go to the Linux fast-path problem isolation.
The rack indicator LED does not turn on, but a drawer identify LED is on.
  1. Make sure the rack indicator LED is properly mounted to the rack.
  2. Make sure that the rack identify LED is properly cabled to the bus bar on the rack and to the drawer identify LED connector.
  3. Replace the following parts one at a time:
    • Rack LED to bus bar cable
    • LED bus bar to drawer cable
    • LED bus bar
  4. Contact your next level of support

Control (operator) panel problems

Symptom What you should do:
01 does not appear in the upper-left corner of the operator panel display after the power is connected and before pressing the power-on button. Other symptoms appear in the operator panel display or LEDs before the power on button is pressed. Go to Power problems.
A bouncing or scrolling ball (moving row of dots) remains on the operator panel display, or the operator panel display is filled with dashes or blocks.

Verify that the operator panel connections to the system backplane are connected and properly seated. Also, reseat the Service Processor card.

If a client computer (such as a PC with Ethernet capability and a Web browser) is available, connect it to the service processor in the server that is displaying the symptom.

To connect a personal computer with Ethernet capability and a Web browser, or an ASCII terminal, to access the Advanced System Management Interface (ASMI), go to Managing your server using the Advanced System Management Interface.

  • If you can successfully access the ASMI, replace the operator panel assembly. Refer to Finding part locations to determine the part number and correct exchange procedure.
  • If you cannot successfully access the ASMI, replace the service processor. Refer to Finding part locations to determine the part number and correct exchange procedure.

If you do not have a PC or ASCII terminal, replace the following one at a time (go to Finding part locations to determine the part number and correct exchange procedure):

  1. Operator panel assembly.
  2. Service processor.
You have a blank display on the operator panel. Other LEDs on the operator panel appear to behave normally.

Verify that the operator panel connections to the system backplane are connected and properly seated.

If a client computer (such as a PC with Ethernet capability and a Web browser) is available, connect it to the service processor in the server that is displaying the symptom.

To connect a personal computer with Ethernet capability and a Web browser, or an ASCII terminal, to access the Advanced System Management Interface (ASMI), go to Managing your server using the Advanced System Management Interface.

  • If you can successfully access the ASMI, replace the operator panel assembly. Refer to Finding part locations to determine the part number and correct exchange procedure.
  • If you cannot successfully access the ASMI, replace the service processor. Refer to Finding part locations to determine the part number and correct exchange procedure.

If you do not have a PC or ASCII terminal, replace the following one at a time (go to Finding part locations to determine the part number and correct exchange procedure):

  1. Control (operator) panel assembly.
  2. Service processor.
You have a blank display on the operator panel. Other LEDs on the operator panel are off. Go to Power problems.

Reference codes

Symptom What you should do:
You have an 8-digit error code displayed. Look up the reference code in the Reference codes section of the information center.
Note: If the repair for this code does not involve replacing a FRU (for instance, running an AIX® command that fixes the problem or changing a hot-pluggable FRU), then update the AIX error log after the problem is resolved by performing the following steps:
  1. In the online diagnostics, select Task SelectionLog Repair Action.
  2. Select resource sysplanar0.

On systems with a fault indicator LED, this changes the "fault indicator" LED from the "fault" state to the "normal" state.

The system stops with an 8-digit error code displayed when booting. Look up the reference code in the Reference codes section of the information center.
The system stops and a 4-digit code displays on the control panel that does not begin with 0 or 2. Look up the reference code in the Reference codes section of the information center.
The system stops and a 4-digit code displays on the control panel that begins with 0 or 2 is displayed in the operator panel display. Record SRN 101-xxxx where xxxx is the 4-digit code displayed in the control panel, then look up this reference code in the Reference codes section of the information center. Follow the instructions given in the Description and Action column for your SRN.
The system stops and a 3-digit number displays on the control panel.

Add 101– to the left of the three digits to create an SRN, then look up this reference code in the Reference codes section of the information center. Follow the instructions given in the Description and Action column for your SRN.

If there is a location code displayed under the 3-digit error code, look at the location to see if it matches the failing component that the SRN pointed to. If they do not match, perform the action given in the error code table. If the problem still exists, then replace the failing component from the location code.

If there is a location code displayed under the 3-digit error code, record the location code.

Record SRN 101-xxx, where xxx is the 3-digit number displayed in the operator panel display, then look up this reference code in the Reference codes section of the information center. Follow the instructions given in the Description and Action column for your SRN.

Hardware Management Console (HMC) Problem

Symptom What you should do:
Hardware Management Console (HMC) cannot be used to manage a managed system, or the connection to the managed system is failing. If the managed system is operating normally (no error codes or other symptoms), the HMC might have a problem, or the connection to the managed system might be damaged or incorrectly cabled. Do the following:
  1. Check the connections between the HMC and the managed system. Correct any cabling errors if found. If another cable is available, connect it in place of the existing cables and refresh the HMC interface. You may have to wait up to 30 seconds for the managed system to reconnect.
  2. Verify that any connected HMC is connected to the managed system by checking the Management Environment of the HMC.
    Note: The managed system must have power connected and the system running, or waiting for a power-on instruction (01 is in the upper-left corner of the operator panel.)

    If the managed system does not appear in the Navigation area of the HMC Management Environment, the HMC or the connection to the managed system might be failing.

  3. Go to the Managing your server using the Hardware Management Console section.
  4. There might be a problem with the service processor card or the HMC system backplane. If you cannot fix the problem using the HMC tests in the Managing your server using the Hardware Management Console section:
    1. Replace the service processor card. Refer to Removing and replacing parts.
    2. Replace the HMC system backplane. Refer to Removing and replacing parts.
Hardware Management Console (HMC) cannot call out using the attached modem and the customer's telephone line. If the managed system is operating normally (no error codes or other symptoms), the HMC might have a problem, or the connection to the modem and telephone line might have a problem. Do the following:
  1. Check the connections between the HMC and the modem and telephone line. Correct any cabling errors if found.
  2. Go to the Managing your server using the Hardware Management Console for information about the HMC.

There is a display problem (for example, distortion or blurring)

Symptom What you should do:
All display problems.
  1. If you are using the Hardware Management Console, go to the Managing your server using the Hardware Management Console section.
  2. If you are using a graphics display:
    1. Go to the problem determination procedures for the display.
    2. If you do not find a problem:
There appears to be a display problem (distortion, blurring, and so on) Go to the problem determination procedures for the display.

Power and cooling problems

Symptom What you should do:
The system will not power on and no error codes are available. Go to Power problems.
The power LEDs on the operator panel and the power supply do not come on or stay on.
  1. Check the service processor error log.
  2. Go to Power problems.
The power LEDs on the operator panel and the power supply come on and stay on, but the system does not power on.
  1. Check the service processor error log.
  2. Go to Power problems.
A rack or a rack-mounted unit will not power on.
  1. Check the service processor error log.
  2. Go to Power problems.
The cooling fan(s) do not come on, or come on but do not stay on.
  1. Check the service processor error log.
  2. Go to Power problems.
The system attention LED on the operator panel is on and there is no error code displayed.
  1. Check the service processor error log.
  2. Go to Power problems.

Other symptoms or problems

Symptom What you should do:
The system stopped and a code is displayed on the operator panel. Go to Start of call procedure.
01 is displayed in the upper-left corner of the operator panel and the fans are off. The service processor is ready. The system is waiting for power-on. Boot the system. If the boot is unsuccessful, and the system returns to the default display (indicated by 01 in the upper-left corner of the operator panel), go to MAP 0020: Problem determination procedure.
The operator panel displays STBY. The service processor is ready. The server was shut down by the operating system and is still powered on. This condition can be requested by a privileged system user with no faults. Go to Start-of-call.
Note: See the service processor error log for possible operating system fault indications.
All of the system POST indicators are displayed on the firmware console, the system pauses and then restarts. The term POST indicators refers to the device mnemonics (the words memory, keyboard, network, scsi, and speaker) that appear on the firmware console during the power-on self-test (POST). Go to Problems with loading and starting the operating system.
The system stops and all of the POST indicators are displayed on the firmware console. The term POST indicators refers to the device mnemonics (the words memory, keyboard, network, scsi, and speaker) that appear on the firmware console during the power-on self-test (POST). Go to Problems with loading and starting the operating system.
The system stops and the message starting software please wait...is displayed on the firmware console. Go to Problems with loading and starting the operating system.
The system does not respond to the password being entered or the system login prompt is displayed when booting in service mode.
  1. If the password is being entered from the Hardware Management Console (HMC), go to the Managing your server using the Hardware Management Console.
  2. If the password is being entered from a keyboard attached to the system, the keyboard or its controller may be faulty. In this case, replace these parts in the following order:
    1. Keyboard
    2. Service processor

If the problem is fixed, go to MAP 0410: Repair checkout.

The system stops with a prompt to enter a password. Enter the password. You cannot continue until a correct password has been entered. When you have entered a valid password, go to the beginning of this table and wait for one of the other conditions to occur.
The system does not respond when the password is entered. Go to Step 1020-2.
No codes are displayed on the operator panel within a few seconds of turning on the system. The operator panel is blank before the system is powered on. Reseat the operator panel cable. If the problem is not resolved, replace in the following order:
  1. Operator panel assembly. Refer to Removing and replacing parts
  2. Service processor. Refer to Removing and replacing parts.

If the problem is fixed, go to MAP 0410: Repair checkout.

If the problem is still not corrected, go to MAP 0020: Problem determination procedure.

The SMS configuration list or boot sequence selection menu shows more SCSI devices attached to a controller/adapter than are actually attached. A device may be set to use the same SCSI bus ID as the control adapter. Note the ID being used by the controller/adapter (this can be checked and/or changed through an SMS utility), and verify that no device attached to the controller is set to use that ID.

If settings do not appear to be in conflict:

  1. Go to MAP 0020: Problem determination procedure.
  2. Replace the SCSI cable.
  3. Replace the device.
  4. Replace the SCSI adapter
Note: In a "twin-tailed" configuration where there is more than one initiator device (normally another system) attached to the SCSI bus, it may be necessary to use SMS utilities to change the ID of the SCSI controller or adapter.
You suspect a cable problem. Go to Adapters, Devices and Cables for Multiple Bus Systems.
You have a problem that does not prevent the system from booting. The operator panel is functional and the rack indicator LED operates as expected. Go to MAP 0020: Problem determination procedure.
All other symptoms. Go to MAP 0020: Problem determination procedure.
All other problems. Go to MAP 0020: Problem determination procedure.
You do not have a symptom. Go to MAP 0020: Problem determination procedure.
You have parts to exchange or a corrective action to perform.
  1. Go to Start of call procedure.
  2. Go to End of call procedure.
You need to verify that a part exchange or corrective action corrected the problem. Go to MAP 0410: Repair checkout.
You need to verify correct system operation. Go to MAP 0410: Repair checkout.
The system stopped. A POST indicator is displayed on the system console and an eight-digit error code is not displayed. If the POST indicator represents:
  1. Memory, go to PFW1548: Memory and processor subsystem problem isolation procedure.
  2. Keyboard
    1. Replace the keyboard.
    2. Replace the service processor, location: model dependent.
    3. Go to PFW1548: Memory and processor subsystem problem isolation procedure.
  3. Network, go to PFW1548: Memory and processor subsystem problem isolation procedure.
  4. SCSI, go to PFW1548: Memory and processor subsystem problem isolation procedure.
  5. Speaker
    1. Replace the control panel. The location is model dependent; refer to Installing features
    2. Replace the service processor. The location is model dependent.
    3. Go to PFW1548: Memory and processor subsystem problem isolation procedure.
The diagnostic operating instructions are displayed. Go to MAP 0020: Problem determination procedure.
The system login prompt is displayed. If you are loading the diagnostics from a CD-ROM, you may not have pressed the correct key or you may not have pressed the key soon enough when you were trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of this step.
Note: Perform the system shutdown procedure before turning off the system.

If you are sure you pressed the correct key in a timely manner, go to Step 1020-2.

If you are loading diagnostics from a Network Installation Management (NIM) server, check for the following:

  • The bootlist on the client may be incorrect.
  • Cstate on the NIM server may be incorrect.
  • There may be network problems preventing you from connecting to the NIM server.

Verify the settings and the status of the network. If you continue to have problems refer to Problems with loading and starting the operating system and follow the steps for network boot problems.

The System Management Services (SMS) menu is displayed when you were trying to boot standalone diagnostics. If you are loading diagnostics from the CD-ROM, you may not have pressed the correct key when you were trying to indicate a service mode IPL of the diagnostic programs. If this is the case, try to boot the CD-ROM again and press the correct key.

If you are sure you pressed the correct key, the device or media you are attempting to boot from may be faulty.

  1. Try to boot from an alternate boot device connected to the same controller as the original boot device. If the boot succeeds, replace the original boot device (for removable media devices, try the media first).

    If the boot fails, go to problems with loading and starting the operating system.

  2. Go to PFW1548: Memory and processor subsystem problem isolation procedure.
The SMS boot sequence selection menu or remote IPL menu does not show all of the bootable devices in the partition or system. If an AIX or Linux partition is being booted, verify that the devices that you expect to see in the list are assigned to this partition. If they are not, use the HMC to reassign the required resources. If they are assigned to this partition, go to Problems with loading and starting the operating system to resolve the problem.

Send feedback | Rate this page

Last updated: Fri, Oct 30, 2009