Resolving an over temperature problem for a water-cooled 8335-GTB system

Learn how to identify the service action that is needed to resolve an over temperature problem.

  1. Go to Water cooling system specification and requirements. Are all of the requirements for water-cooled systems met?
    Note: For information specific to the 8335-GTB, see Model 8335-GTB water cooling option (Feature code E2RD).
    If Then
    Yes: Continue with the next step.
    No: Work with the customer to ensure that all of the requirements for water-cooled systems are met. This ends the procedure.
  2. Is the room temperature less than 40°C (104°F)?
    If Then
    Yes: Continue with the next step.
    No: Notify the customer. The customer must bring the room temperature within normal range. Continue with the next step.
  3. Ensure that the following requirements are met:
    1. The quick-connects between the 8335-GTB system and the water manifold are mated and connected to the proper circuits of the manifold. The supply hose must be connected to the supply manifold circuit, which is the manifold circuit that is located toward the inside of the rack. The return hose must be connected to the return manifold circuit, which is the manifold circuit that is located toward the outside of the rack.
    2. The facility water supply hose is properly connected to the supply hose on the manifold and the return hose on the manifold is properly connected to the facility water return hose.
      • The ball valves that connect the facility water supply hose to the manifold supply hose and the facility water return hose to the manifold return hose are open. For more information about connecting the facility water hoses to the manifold hoses, see Replacing the water manifold in the 8335-GTB.
      • All of the valves that might restrict the flow of water through the hoses are open in the facility water system.
      • The pumping unit of the facility water system is on and does not have errors.
    3. The facility water system is supplying water at the required temperature and flow. For instructions, see Model 8335-GTB water cooling option (Feature code E2RD).
    Does the problem persist?
    If Then
    Yes: Continue with the next step.
    Note: Steps 1- 3 resolve most problems. Ensure that you carefully check steps 1 - 3 before you continue with the next step.
    No: This ends the procedure.
  4. Is a processor over heating, but the other processor and the graphics processing units (GPUs) are not over heating?
    If Then
    Yes: Check the thermal interface material (TIM) between the cold plate and the processor that is over heating. Go to Removing a system processor module from a water-cooled 8335-GTB system and complete the steps to lift the cold plate off the processor. If the TIM pad is damaged, replace the TIM pad. To replace a TIM pad, go to Replacing a system processor module in a water-cooled 8335-GTB system and complete the steps for removing and installing a new TIM pad. This ends the procedure.
    No: Continue with the next step.
  5. Is a GPU over heating, but the other GPUs and the processors are not over heating?
    If Then
    Yes: Replace the thermal interface material (TIM) between the cold plate and the GPU that is over heating. Go to Removing the graphics processing unit from a water-cooled 8335-GTB system and complete the steps to lift the cold plate off the GPU. Then, go to Replacing the graphics processing unit in a water-cooled 8335-GTB system and complete the steps for installing a new TIM pad. If the problem is not resolved, replace the GPU. For instructions about replacing a GPU, see Removing and replacing a graphics processing unit in the 8335-GTB. This ends the procedure.
    No: Continue with the next step.
  6. Replace the cold plates. For instructions about how to replace the cold plates, see Removing and replacing the cold plates in the 8335-GTB. Does the problem persist?
    If Then
    Yes: Go to Contacting IBM service and support. This ends the procedure.
    No: This ends the procedure.



Last updated: Thu, December 02, 2021