subscribe iconSubscribe to this information
POWER7 information

CEC hot-node add and hot-node repair maintenance operations checklists for models 8412-EAD, 9117-MMB, 9117-MMC, 9117-MMD, 9179-MHB, 9179-MHC, and 9179-MHD

The planning checklists contain a list of tasks that must be completed before you perform CEC hot-node add and hot-node repair maintenance operations on IBM® Power® ESE (8412-EAD), IBM Power 770 (9117-MMB, 9117-MMC, and 9117-MMD) and IBM Power 780 (9179-MHB, 9179-MHC, and 9179-MHD) systems. It also lists which tasks are associated with each type of operation.

For guidelines that apply to all CEC adapter hot-add and hot-repair maintenance operations, see Prerequisites for CEC hot-add and hot-repair. These general guidelines and the following checklist must be followed.

Table 1. CEC hot-node add and hot-node repair maintenance operations checklists for models 8412-EAD, 9117-MMC, 9117-MMD, 9179-MHB, 9179-MHC, and 9179-MHD
Operations Model 8412-EAD, 9117-MMB, 9117-MMC, 9117-MMD, 9179-MHB, 9179-MHC, and 9179-MHD tasks Responsibility

Hot-node add

Ensure that sufficient space is available in the rack that contains the system for another node to be installed under the existing nodes.

Customer

Hot-node add

Adding a node increases the power requirements of the system. If necessary, the power circuits that supply the system must be upgraded to handle these additional power requirements. This upgrade must be complete before the hot-node add begins.

The power requirements of additional nodes are detailed in Chapter 4, Server Specifications and Chapter 8, Planning for Power in Site and Hardware Planning.

Customer responsibility to perform the required site planning activities

Hot-node add, hot-node upgrade (memory), hot-node repair, concurrent GX adapter add, and hot GX adapter repair

If the system is managed by a Hardware Management Console (HMC), verify that the HMC level is the same as, or later than, the level shown in Model 8412-EAD system firmware and HMC levels, Model 9117-MMB and 9179-MHB system firmware and HMC levels, Model 9117-MMC and 9179-MHC system firmware and HMC levels, or Model 9117-MMD and 9179-MHD system firmware and HMC levels.

Verify that the system firmware level is the same as, or later than, the level shown in Model 8412-EAD system firmware and HMC levels, Model 9117-MMB and 9179-MHB system firmware and HMC levels, Model 9117-MMC and 9179-MHC system firmware and HMC levels, or Model 9117-MMD and 9179-MHD system firmware and HMC levels.

For information about determining firmware levels using the Hardware Management Console (HMC), see Using the HMC to view the existing firmware levels.

For information about disconnecting an HMC from a managed system, see Removing HMC connection data.

Customer

Service representative must verify

Hot-node upgrade (memory), hot-node add, and hot-node repair

CAUTION:

Observe the following with the Symmetric Multiprocessing (SMP) and the service processor cables on IBM Power ESE, Power 770, and Power 780 systems when performing a CEC hot add or repair operation:

  • Verify that the SMP cable (on the front of the system) and the service processor cable (on the rear of the system) are layered correctly. See Model 9117-MMB, 9117-MMC, 9117-MMD, 9179-MHB, 9179-MHC, and 9179-MHD multidrawer external SMP and FSP flex cabling. Do not start the CEC hot add or repair operation if the layering is not correct. If the layering is not correct, schedule a system outage with the customer to correct the cabling and perform the CEC hot add or repair operation with the power turned off.
  • Do not disturb the SMP and the service processor cables on active nodes. If a node repair is being performed, remove only the cables on the node that is being repaired (powered off) unless otherwise directed by WCII instructions or the removal and replacement procedure.

Service representative must verify

Hot-node upgrade (memory), hot-node repair, and hot GX adapter repair

If there is only one partition running on the system, the system can be powered down if that partition must be shut down during the CEC hot add or repair operation. To verify or change the setting that controls this behavior, see System power off policy in Powering the system on and off.

Service representative

Hot-node add, hot-node upgrade (memory), hot-node repair, concurrent GX adapter add, and hot GX adapter repair

Verify that service processor redundancy is enabled, except on a system with a single node.

To enable service processor redundancy on the managed system on an HMC, in the navigation area, select Serviceability > FSP Failover > Setup. Failover status is displayed in the contents area and can be enabled or disabled.

Note: The server must be power cycled for the enabling of service processor redundancy to take effect. If power cycling is required, the node can be added nonconcurrently (with power off).

For information about redundancy on the HMC, see FSP Failover.

Customer

Service representative must verify

Hot-node add, hot-node upgrade (memory), hot-node repair, concurrent GX adapter add, and hot GX adapter repair

If the system is managed by an HMC, verify the network connections between the system service processors and the HMC before the service representative arrives by using the service processor status function on the HMC. For information about testing the network connections on the HMC, see Testing the connection between the HMC and the managed system.

Customer

Service representative must verify

Hot-node add, hot-node upgrade (memory), hot-node repair, concurrent GX adapter add, and hot GX adapter repair

Ensure that Electronic Service Agent™ (ESA) is enabled to call home.

For additional details about the call home function on the HMC, see Manage Systems Call-Home.

Customer

Service representative must verify

Hot-node add, hot-node upgrade (memory), hot-node repair, and hot GX adapter repair

For 9117-MMB and 9179-MHB systems, you should quiesce the server before the hot-node operation. This means that all critical application should be halted or moved to another server using Live Partition Mobility (LPM) before the operation begins. Noncritical applications can be left running. The partitions can be left running at the operating system command prompt.

For 8412-EAD, 9117-MMC, 9117-MMD, 9179-MHC, and 9179-MHD systems, you must quiesce the server before the hot-node operation. This means that all critical applications must be halted or moved to another server using Live Partition Mobility (LPM) before the operation begins. Noncritical applications can be left running. The partitions can be left running at the operating system command prompt.

For additional information on a system managed by an HMC, see Mobility in the Systems management for partitions topic in Managing the Hardware Management Console.

Note: IBM recommends that adequate protection is in place for business critical applications before you perform CEC hot-node add and hot-node repair maintenance operations. To mitigate the possibility of an unexpected outage, you can manually move critical workloads to another server before the operation begins. Performing CEC hot-node add and hot-node repair maintenance operations during a regularly scheduled maintenance window or during off-peak hours reduces the impact of an unplanned outage.

Customer

Hot-node add and hot-node upgrade (memory)

Obtain and apply Capacity Upgrade on Demand (CUoD) activation codes prior to the operation so that the memory resources are available immediately after the hot add operation. For information about CUoD activation codes, see Capacity Upgrade on Demand activation codes.

Customer

Hot-node add and hot-node upgrade (memory)

If the customer wants to dynamically add the new processors and memory resources to activated partitions after the Hot-node add, the profiles that were used to activate the partitions must have the maximums for memory set to the appropriate values. When the Hot-node add is complete, the processor and memory resources can be added to the partitions by using dynamic logical partitions (DLPAR). For information about DLPAR, see Dynamic logical partitioning.

Customer

Hot-node upgrade (memory)

Verify that more than one node is present in the system. Hot-node upgrade (memory) cannot be done if the system has only one node.

Customer

Service representative must verify

Hot-node repair, hot GX adapter repair, and hot-node upgrade (memory)

Run the Preparing for Hot Repair or Upgrade utility on the HMC to determine if the system is prepared for a node upgrade (memory). For more information about the utility, see Preparing for hot repair or upgrade.

Customer

Service representative must verify

Hot-node add and concurrent GX adapter add

System firmware enforces the node and GX adapter plugging order. Only the next GX adapter slot or node position based on the plugging order is available.

Customer

Service representative must verify

Concurrent GX adapter add

Ensure that there is a GX adapter slot reservation for each GX adapter that is being added. Each installed GX adapter and GX adapter slot reservation consumes approximately 128 MB of memory. Additional GX adapter slots can be reserved by using the Advanced System Management Interface (ASMI). New GX adapter slot reservations take effect the next time the system is powered on.

Customer

Concurrent GX adapter add

The GX adapter cannot be added to a node that has been deconfigured. The node must be repaired and reconfigured before another GX adapter can be added. To verify that all nodes in the system are configured, see Using the ASMI to view deconfigured resources.

Customer

Service representative must verify onsite1

Hot-node add

When multiple hot adds are planned that include node and GX adapter adds, the GX adapter(s) should be installed in the node before the node is installed in the system.

Service representative

Hot-node add and concurrent GX adapter add

For multiple upgrades that include node or GX adapter adds, as well as I/O drawer adds, the node or GX adapter add must be completed first. The I/O drawer can then be added as a separate concurrent I/O drawer add.

Service representative

Hot-node repair and hot GX adapter repair

Ensure that the replacement parts are the same type as the parts that are being removed.

Service representative

Hot-node repair and hot GX adapter repair

Replace only the existing parts. New hardware must not be added during a Hot-node repair or a hot GX adapter repair.

Service representative

Hot-node repair

Ensure that existing node hardware is not moved during the Hot-node repair.

Service representative

1 Hardware in a node can fail at any time. When this happens a node can become unconfigured during system IPL to isolate the failure. The failure and unconfigured event can occur after the hardware upgrade package has arrived at the customer site and the service representative has been dispatched. The service representative must ask the customer if the system has any outstanding deferred repairs.

Note: The hot-node add function adds and enables the I/O adapters only in the new node.


Send feedback Rate this page

Last updated: Thu, July 23, 2015