HMC problem analysis

Use this information to diagnose and repair problems that are related to the Hardware Management Console (HMC).

DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard:
  • Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Do not open or service any power supply assembly.
  • Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm.
  • The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords.
  • Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate.
  • Connect any equipment that will be attached to this product to properly wired outlets.
  • When possible, use one hand only to connect or disconnect signal cables.
  • Never turn on any equipment when there is evidence of fire, water, or structural damage.
  • Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures.
  • Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices.
    To Disconnect:
    1. Turn off everything (unless instructed otherwise).
    2. Remove the power cords from the outlets.
    3. Remove the signal cables from the connectors.
    4. Remove all cables from the devices
    To Connect:
    1. Turn off everything (unless instructed otherwise).
    2. Attach all cables to the devices.
    3. Attach the signal cables to the connectors.
    4. Attach the power cords to the outlets.
    5. Turn on the devices.

    (D005)

If you were directed here from the Beginning problem analysis procedure because your HMC is not functioning correctly, continue with the Entry point for HMC problem determination.

To perform other maintenance tasks on your HMC, refer to the following procedures:

Entry point for HMC problem determination

Find the symptom you are having in the Symptom column of the following table. Then perform the action described in the Action column.
Symptom Action
Operator reported that the HMC did not start, but no other problems were reported. Go to Beginning HMC problem determination.
Operator reported "Communication not active" on the HMC. Go to Testing the HMC's Ethernet adapter.
Operator reported communication problems with a remotely connected HMC or a managed system. Go to Testing the modem connection to the managed system.
Power problems Go to Testing for a power problem.
HMC boot problems Go to Beginning HMC problem determination.
Display problem Go to Testing the HMC's display.
DVD-RAM drive problem Go to Testing the HMC's DVD-RAM.
Diskette drive problem Go to Testing the HMC's diskette drive.
Ethernet LAN problem Go to Testing the HMC's Ethernet adapter.
A problem with any of the following:
  • display
  • diskette drive
  • DVD-RAM Drive
  • disk drive
  • Ethernet LAN
Go to Performing system unit testing using PC-Doctor.
Eight character error code beginning with HMC was received when using the HMC graphical user interface. Go to the HMC error codes.
HMC does not communicate through the modem. Go to Testing the HMC's modem connection.
PC-Doctor diagnostic tests detected an error. For PC hardware maintenance manuals to help isolate the problem to a failing part, refer to Equivalent maintenance information for the HMC's personal computer hardware.
Problems understanding the usage of the HMC. Go to Managing your server using the Hardware Management Console.
All other problems (for example: HMC GUI is unresponsive, parity errors, power, POST codes, blank display, mouse, or keyboard). Go to Beginning HMC problem determination.
Symptoms not in this list. Go to Beginning HMC problem determination.

Beginning HMC problem determination

Use this procedure to determine if there is a problem with the HMC hardware. This procedure might direct you to procedures in various sections of this information or to personal computer maintenance information.

Step 1. HMC problem determination

  1. If the HMC is running, shut down the console by exiting the graphical user interface, the PC should turn off the power automatically. If the PC cannot turn off the power, then turn the power switch off.
  2. Turn on the HMC power.
  3. Watch the console and allow enough time for the system to complete the POST and load the HMC machine code.
  4. Watch and listen for the following failing symptoms during power-on:
    • POST error condition.
    • A series of beeps that indicate an error condition.
    • The HMC login screen and user interface fails to start.
    • A reference code or any other error information is displayed.
  5. Do you have any of the failing symptoms during power on?
  6. Perform system unit testing using PC-Doctor. For details, see Performing system unit testing using PC-Doctor.
  7. Does the PC-Doctor diagnostics diskette start and allow tests to be run on the PC system unit?

Step 2. HMC problem determination

  1. Perform system unit testing using PC-Doctor. For details, see Performing system unit testing using PC-Doctor. Select System Unit > Run All Selected.
  2. Did the system unit tests detect errors?
  3. Use the PC-Doctor diagnostics and the maintenance procedures for the type of PC that you are working on to isolate the failure and exchange customer replaceable units (CRUs). Refer to the publications listed in the Equivalent maintenance information for the HMC's personal computer hardware.
  4. When the problem is repaired, or if the problem cannot be isolated, continue with Step 4. HMC problem determination.

Step 3. HMC problem determination

Attention: This step requires HMC support assistance. Contact HMC support before continuing.
  1. If you are directed to reload the HMC from the recovery DVD and then reload the backup profile and configuration data, see Recovering the HMC.
  2. After reloading the machine code from the recovery DVD, does the HMC start correctly?
    • No: Contact your next level of support.
    • Yes: This ends the procedure.

Step 4. HMC problem determination

Note: If you reach this step and you have not been able to isolate a failure, contact your next level of support for assistance.
  1. Reinstall all CRUs that did not fix the problem.
  2. You must have performed a repair action to continue. If you have not already done so, verify the repair. For instructions, see Performing system unit testing using PC-Doctor.
  3. Did the system unit tests run without errors?
  4. Does the HMC communicate with all connected managed systems?

Testing the HMC

Use these procedures when you are directed to them from the HMC problem analysis procedure to test the HMC. If a failure is detected, you will be instructed to fix the failing part and then verify the repair.

Testing for a power problem

To troubleshoot a power problem on the PC, refer to the service documentation for the PC on which your HMC is based. For PC hardware maintenance manuals to help isolate the problem to a failing part, refer to the publications listed in Equivalent maintenance information for the HMC's personal computer hardware.

Performing diagnostic procedures

You should have been directed here to test a specific part of the HMC. For the following problem areas, see Performing system unit testing using PC-Doctor:
  • Display
  • Keyboard
  • Mouse
  • Floppy Drive
  • DVD-RAM
  • DASD (disk drive)
  • Memory
  • Power
  • Run All Selected
  • SCSI
  • System Port/Modem
  • 16/4 Port Serial
  • Ethernet

To access the HMC diagnostic information, follow the procedures in HMC diagnostics overview.

Performing system unit testing using PC-Doctor

This section provides information to help you use the PC-Doctor diagnostics. For information on running PC-Doctor, go to HMC diagnostics overview.
Note: Performing procedures other than those specified in the following steps might cause errors.
  1. Start the PC-Doctor diagnostics.
  2. From the diagnostic selection display, select either Diagnostics or Interactive Tests for a list of devices to test:
    • Devices that require manual intervention (keyboard, video, mouse, diskette, optical drive) might be selected from the interactive tests task bar.
    • Devices that do not require manual intervention (processor, system board, I/O ports, fixed disks, memory) and predefined test sequences (Run Normal/Quick Test) can be selected from the diagnostic task bar.
  3. Select the task containing the device or devices or test or tests and follow the instructions. If you were instructed to Run All Selected, continue with that selection.
  4. If the diagnostics report a failure on the PC hardware, refer to the service documentation for the PC on which your HMC is based. For PC hardware maintenance manuals to help isolate the problem to a failing part, refer to the publications listed in Equivalent maintenance information for the HMC's personal computer hardware.
  5. If the diagnostics do not report a failure, the HMC PC hardware is functioning correctly. If you still have a problem, go to Beginning HMC problem determination.

Testing the modem connection to the managed system

Use this procedure to test the modem connection to the PC for the HMC.
  1. Can the HMC be used to communicate through the modem?
    • No: Go to step 2.
    • Yes: This ends the procedure.
  2. Is a device other than a modem attached to system port 2 on the HMC?
    Note: If the HMC is a rack-mounted model, answer no to this question.
  3. System port 2 of the HMC is reserved for external modem use only. Move the serial cable from serial port 2 of the HMC to another HMC system port. Connect the modem to system port 2 and go to step 1.
    Note: If the HMC is a rack-mounted model, an internal modem is installed to handle the connection of the HMC to the telephone line. If a redundant telephone connection is required, connect an external modem to the system port on the rear of the HMC. If needed, go to the MultiTech MultiModemII installation guide for modem installation instructions. If you do not have access to a hardcopy version of the MultiTech MultiModemII documentation, go to the MultiTechLink outside Information Center Web site at http://www.multitech.com/documents/families/multimodemII/manuals.asp.

Testing the HMC's modem connection

Use this procedure to test the modem connection to the PC for the HMC.
  1. Verify that the modem and the telephone line are functioning correctly by performing the following steps:
    1. On the HMC console, open the Service Agent application.
    2. Select Test Tools.
    3. Initiate a Test PMR.
    4. Monitor the call log to verify that the call is completed successfully. If the call is completed successfully, the modem is functioning correctly.
  2. Is the installed modem currently functioning on the HMC?
    • No: Go to step 3.
    • Yes: The problem is not in the modem. This ends the procedure.
  3. Are the HMC configurations, relating to the modem operation, correct?
    • No: Correct the HMC configuration arguments. Return to step 1.
    • Yes: Continue with step 4.
  4. Is the modem powered on? (Are any indicators lit?)
    • No: Ensure the modem is powered on. For details, see 1. After power on verification is completed, continue with step 5.
    • Yes: Go to step 5.
  5. Is the serial cable between the serial (COM) port connector of the HMC and the modem attached?
    • No: Attach the serial cable between the serial (COM) port connector of the HMC and the modem.
    • Yes: Load the PC-Doctor diagnostics.
      • If you are working on a desktop model HMC, load the PC-Doctor from diskette or CD.
      • If you are working on a rack-mounted model HMC, load the PC-Doctor from diskette using an external USB diskette drive, CD or selecting F12 during a system shut down and restart.
      Note: If you are using a CD or diskette to load PC-Doctor, shut down and restart the HMC and wait until the PC-Doctor's main menu is displayed.
  6. Click Diagnostics > Other Devices > Modem to perform the modem diagnostic test.
  7. Did the diagnostic pass?
    • No: Go to step 8.
    • Yes: Go to 9.
  8. Is the modem connected correctly to a working telephone line (or equivalent)?
    Note: This can be checked by connecting a known good telephone to the line in place of the modem and making a telephone call.
    • No: Correctly connect the telephone line (or equivalent) to the modem. Go to step 1. After completing the telephone line and modem verification test, continue with step 9.
    • Yes: Go to step 9.
  9. Verify the COM port by performing the following steps:
    1. Disconnect the modem cable from the COM port of the HMC.
    2. Select Diagnostics from the top menu.
    3. Select System Ports from the pull-down menu. The SERIAL PORT TEST CATEGORY screen displays.
    4. Ensure the following:
      • On desktop HMC models - IRQ numbers 4 and 3 are assigned to COM 1 and COM 2, and the planar to COM 2 connector cable is present and correctly installed.
      • On rack-mounted models - IRQ number 4 is assigned from COM 1.
      Note: If the preceding information is not correct, the COM port might be disabled, or might be incorrectly configured. This can be resolved by accessing the setup utility (by pressing the F1 key) during power on.
    5. Ensure all diagnostics except "External Loopback" are selected
    6. Select Run Screen from the bottom menu.
    7. Ensure all selected diagnostics show Passed
      Note: If any diagnostics fail, replace the planar.
  10. Verify the external modem by performing the following steps:
    1. Reconnect the modem cable to the correct COM port.
    2. Ensure that the modem is powered on, connected to a working telephone line, and is securely cabled to the communications cable.
    3. Press Esc to return to the PC-Doctor main menu or shut down and restart the HMC.
    4. Select Hardware Info from the top menu
    5. Select COM and LPT Ports from the pull-down menu. The hardware query displays COM and LPT port information.
    6. Verify the following:
      • A modem was detected on the correct COM port
      • The modem test returned Passed
      • Dial tone: Detected
      • ATI1: Displays the modem's model information
    Note:
    1. If you were not able to get the desired results in step 10.f, refer to the MultiTech MultiModemII user's guide, installation guide, or reference guide for your modem. To access the MultiTech MultiModemII documentation, go to the MultiTechLink outside Information Center Web site at http://www.multitech.com/documents/families/multimodemII/manuals.asp.
    2. If necessary, after you have completed the MultiTech MultiModemII documentation, come back here to complete the final step of this procedure.
  11. Choose from the following options:
    • If the modem was not detected, replace and verify the following customer replaceable parts in the order listed:
      1. Communications cable
      2. Modem
    • If the modem test did not return with Passed, replace the modem.
    • If the dial tone did not return with Detected, verify the telephone line operation then retest. If the failure reoccurs, replace the modem.

Testing the HMC's Ethernet adapter

Use this procedure to test the Ethernet adapter in the HMC.
  1. Is the Ethernet port currently functioning though normal operation of the HMC?
    • No: Go to step 2.
    • Yes: This ends the procedure.
  2. Are the Ethernet configuration values set correctly? (IP address, subnet mask, and so on.)
    • No: Set the Ethernet configuration values to their correct settings. Then go back to step 1.
    • Yes: Go to step 3.
  3. Can the IP address of the HMC be 'pinged' by another system that should be able to 'see' the HMC on the network?
    • No: Go to step 4.
    • Yes: Go to step 10.
  4. Is the Ethernet cable attached correctly to the HMC and the network?
    • No: Attach the HMC to the network using an Ethernet cable with the correct pinout. Then go to step 1.
    • Yes: Go to step 5.
  5. Is the Ethernet cable the correct pinout? (There are two types of Ethernet cables in use, which are distinguished by different pinouts. The network will determine which version of cable to use.)
    • No: Replace the Ethernet cable with the correct version. Then go to step 1.
    • Yes: Go to step 6.
  6. Perform the following steps:
    1. Install the PC-Doctor diskette into drive A of the HMC: (floppy drive).
    2. Shut down and restart the HMC and wait until the PC-Doctor main menu is displayed.
    3. To run the Ethernet diagnostics, select Diagnostics > Other Devices > Intel Ethernet.
      Did the diagnostic pass?
      • No: Go to step 7.
      • Yes: Go to step 10.
  7. Refer to the Ethernet hardware's hardware maintenance manual to determine if there are any internal settings or jumpers that might disable the Ethernet port.
    Are there any internal settings or jumpers?
    • No: Go to step 8.
    • Yes: Go to step 9.
  8. Replace the Ethernet hardware in the HMC. (This might be a PCI card or system board replacement, depending on the HMC hardware.) Go to step 1.
  9. Set the internal settings/jumpers to enable the Ethernet port on the HMC. Go to step 1.
  10. The failure does not seem to be in the HMC.

Testing the HMC's disk drive

Use this procedure to test the disk drive in the HMC.
  1. Did the disk drive test fail?
    • No: Go to step 5.
    • Yes: Continue with the next step.
  2. Perform the following steps:
    1. Exchange the CRUs called by the diagnostics one at a time. For CRU removal and replacement instructions, refer to the PC hardware maintenance manual for the system on which you are working. See Equivalent maintenance information for the HMC's personal computer hardware to access the hardware maintenance manual for your HMC personal computer model.
    2. After each CRU is exchanged, test the repair using the information in Performing system unit testing using PC-Doctor. Select Hard Disk Drive problem area.
      Did the disk drive test fail?
      • No: Continue with the next step.
      • Yes: Contact your next level of support.
  3. Ensure the following and then continue with the next step:
    • If you exchanged the disk drive and there are jumpers or tab settings on the new disk drive, ensure that the settings are the same as the old drive.
    • If there is a SCSI cable-terminating resistor device, ensure it is secured to the cable and (if necessary) reattached to its original location on the PC.

      Go to "Hard Disk Jumper Settings" in the PC hardware maintenance manual. See Equivalent maintenance information for the HMC's personal computer hardware to access the hardware maintenance manual for your HMC personal computer model.

  4. If you exchanged the disk drive, restore the HMC image to the new disk drive.
  5. Use the information in Performing system unit testing using PC-Doctor to test the PC. Select Run All Selected problem area.
    • If the tests fail, isolate the problem using the Beginning HMC problem determination procedure.
    • If the tests run without errors, turn off the PC power and then turn on the power. Ensure that the system boots and the HMC screen displays. This ends the procedure.

Testing the HMC's DVD-RAM

Use this procedure to test the DVD drive in the PC for the HMC.
  1. Determine the media in the DVD-RAM drive:
    • Compact Disk Recordable (CD-R) similar to a CD
    • DVD-RAM media cartridge
    Is the media a CD-R?
    • No: Go to step 4.
    • Yes: Continue with the next step.
  2. Perform the following steps:
    1. Clean the compact disk as follows:
      • Hold the disk by its edges. Do not touch the surface.
      • Remove dust and fingerprints from the surface by wiping from the center to the outside using a dry, soft cloth.
    2. Reinstall the CD, with the label-side facing up.
    3. Continue with the next step.
  3. Retry the failing task using the original media.
    Does the failure occur again?
    • No: Continue with the next step.
    • Yes: Go to step 4.

    This ends the procedure.

  4. Ensure the write protect tab is in the "disabled" (down) position.
    Was the write protect tab in the "disabled" (down) position?
    • No: Go to step 3.
    • Yes: Continue with the next step.
  5. Perform the following steps:
    1. With the original media in the drive, note the following:
      • If you are attempting a restore procedure, turn off the PC power.
      • For any other operation, shut down the HMC, and then turn off the PC power.
        Note: For shutdown procedures, see Powering on and off.
    2. Turn on the PC power and test the DVD-RAM drive using the information in Performing system unit testing using PC-Doctor.
    3. Select DVD-RAM Drive problem area.
    4. When the test is complete, return here and continue with the next step.
  6. Did the DVD-RAM test fail while testing with the original media?
    • No: Go to step 1.
    • Yes: Continue with the next step.
  7. Exchange the original media with a new one.
    Note: If you are replacing DVD-RAM media, the new cartridge must be formatted. If possible, use another HMC to format the new cartridge.
  8. Turn off the PC power.
  9. Turn on the PC power, and test the DVD-RAM drive with the new media. Use the information in Performing system unit testing using PC-Doctor to test the PC. Select DVD-RAM Drive problem area.
  10. Did the DVD-RAM test fail while testing with the new media?
    • No: The original media was defective. This ends the procedure.
    • Yes: Continue with the next step.
  11. Verify the following:
    • All DVD-RAM drive data and power cables are secure.
    • The DVD-RAM drive is jumpered as "master" and is cabled to the secondary IDE bus.
  12. If the diagnostics continue to fail, exchange the DVD-RAM drive. When complete, run the DVD-RAM test again.
    Note: If there are any jumpers or tab settings on the new drive, ensure that the settings match the old drive.
  13. Did the DVD-RAM Drive test continue to fail?
    • No: The original DVD-RAM drive was defective. This ends the procedure.
    • Yes: Continue with the next step.
  14. Continue exchanging CRU from the CRU list and running the DVD-RAM drive tests.
    • If the CRUs fix the problem, this ends the procedure.
    • If you cannot isolate the problem, call your next level of support for assistance.
  15. The PC resources (for example: interrupt, I/O address) might be configured incorrectly. Verify the PC resources are correctly configured.
    1. Select System Unit for the configuration area, and verify configuration for the system unit and all adapters.
    2. When you complete the verification, retry the failing procedure and continue with the next step.
  16. Does the failing procedure continue to fail?
    • No: The resource settings were incorrect. This ends the procedure.
    • Yes: If you cannot isolate the problem, contact your next level of support for assistance. This ends the procedure.

Testing the HMC's diskette drive

Use this procedure to test the diskette drive in the PC for the HMC.
  1. Perform the following steps:
    1. Turn on the PC power and test the diskette drive using the information in Performing system unit testing using PC-Doctor.
    2. Select System Unit problem area and the test of Diskette Drive.
      Note: Do not test with the diskette on which the errors occurred. Use a new diskette.
    3. When the test is complete, continue with the next step.
  2. Did the diskette test fail while testing with a new diskette?
    • No: Go to step 7.
    • Yes: Continue with the next step.
  3. Exchange the diskette drive and run the diskette test again.
  4. Did the diskette test fail again?
    • No: The original diskette drive was failing. This ends the procedure.
    • Yes: Continue with the next step.
  5. Continue exchanging CRUs from the CRU list and running tests. If the CRUs fix the problem, this ends the procedure.
  6. Choose from the following options:
    • If the problem is resolved by replacing a CRU, this ends the procedure.
    • If you cannot resolve the problem, contact your next level of support for assistance. This ends the procedure.
  7. Did the original failure occur while writing to a diskette?
    • No: Go to step 9.
    • Yes: Continue with the next step.
  8. Retry the original task using a new diskette.
    • If the failure occurs again, go to step 11.
    • If no failures occur, the original diskette was failing. This ends the procedure.
  9. Re-create the information on the diskette, or obtain a new diskette with the information.
  10. Retry the original task.
    • If the failure occurs again, continue with the next step.
    • If no failures occur, the original diskette was failing. This ends the procedure.
  11. Test the diskette drive, using the information in Performing system unit testing using PC-Doctor. Select System Unit problem area and the test of Diskette Drive.
    • If the tests fail, isolate the problem using the procedures found in the PC hardware maintenance manual. For additional PC maintenance information, see Equivalent maintenance information for the HMC's personal computer hardware to access the hardware maintenance manual for your HMC personal computer model.
      For the latest information regarding maintenance of your HMC PC, system board specifications, and parts, refer to the PC hardware maintenance manuals. The latest versions of these publications are available through the IBM® Web site:
      1. Go to the IBMLink outside Information Center Web site at http:\\www.ibm.com.
      2. Select Support & downloads.
      3. In the Search Technical support field, type the order number of the manual you want to access.
      4. Click Search.
      5. In the search results, click the link for the document you want.
    • If the tests do not isolate the problem, contact your next level of support for assistance.

Testing the HMC's display

Use this procedure when you are having a display problem with the HMC.
  1. Is the display type a 95xx (17P, 17X, 21P)?
    • No: Continue with the next step.
    • Yes: 95xx-xxx repairs might require replacing internal display CRUs.

      Repair and test the display using the procedures in Monitor Hardware Maintenance Manual Vol 2, S41G-3317.

  2. Is the display type a 65xx (P70, P200)?
    • No: Continue with the next step.
    • Yes: 65xx-xxx repairs might require replacing the entire display. There are no internal display CRUs. Repair and test the display using the procedures in Monitor Hardware Maintenance Manual Vol 3, P and G series, S52H-3679.

      When the test and repair are complete, continue with step 5.

  3. Is the display type a 65xx (P72, P202)?
    • No: Continue with the next step.
    • Yes: 65xx-xxx repairs might require replacing the entire display. There are no internal display CRUs. Repair and test the display using the procedures in Color Monitor Operating Instructions.

      When the test and repair are complete, continue with step 5.

  4. Repair and test the display using the documentation included with the display. When the test and repair are complete, continue with step 6.
  5. Verify the repair using the information in Performing system unit testing using PC-Doctor. Select System Unit for the problem area and the Test for Display.

    When the test and repair are complete, continue to step 6.

  6. Return the system to normal operations. This ends the procedure.