Verifying fault resolutions in the TS7610 or TS7620 ProtecTIERProtecTIER Service menu

About this task

After you have resolved a hardware fault, you will need to verify that the resolution was successful and the fault no longer exists. You can do so from the server by using the ProtecTIER® Service Menu, or from the ProtecTIER Manager workstation, using the Hardware faults window in the GUI.
Note: It is recommended that if you first verify the resolution using the ProtecTIER Service Menu, that you also make sure that the fault information is cleared from ProtecTIER Manager, as described in Rechecking faults in the TS7610 Appliance Express, ProtecTIER V3.3.6 Hardware Faults window.

Using the ProtecTIER Service Menu

Procedure

  1. Access the ProtecTIER Service Menu with a monitor and keyboard plugged into the TS7610 Appliance Express®. Log on with ID ptconfig, password ptconfig
  2. When the ProtecTIER Service Menu appears, select the ProtecTIER Configuration option.
    ------------------------------------------------------------------------------
     ProtecTIER Service Menu running on rassmx                                    
    ------------------------------------------------------------------------------
      1) ProtecTIER Configuration (...)                                           
      2) Manage ProtecTIER services (...)                                         
      3) Health Monitoring (...)                                                  
      4) Problem Alerting (...)                                                   
      5) Version Information (...)                                                
      6) Generate a service report                                                
      7) Generate a system view                                                   
      8) Update ProtecTIER code                                                   
                                                                                  
      E) Exit                                                                     
    ------------------------------------------------------------------------------
    >>> Your choice?
  3. Select Health Monitoring. Type: 3 <enter>.

    The Health Monitoring sub-menu displays:

    ------------------------------------------------------------------------------
     ProtecTIER Service Menu running on rassmx                                    
     Health Monitoring (...)                                                      
    ------------------------------------------------------------------------------
      1) Display system health summary                                            
      2) Display detailed system health				                                    
      3) Run a full system check                                                  
      4) List open problems                                                       
      5) Service Mode                                                             
                                                                                  
      B) Back                                                                     
      E) Exit                                                                     
    ------------------------------------------------------------------------------
    >>> Your choice?
  4. Select Run a full system check. Type: 3 <enter>.

    The Begin Processing Procedure message displays.

    Note: This menu option may take several seconds to complete.

    When the check completes, a Checkout summary displays. The summary indicates whether the checked items (components, applications, and system utilities) are functioning properly or in a compromised (NON-OK) state, and lists the individual items that were included in the check. Items that are functioning properly are listed as Normal. Items not in use by the system are listed as Unconfigured. Items with faults are listed as Failed, Degraded, etc., and include additional details about the fault. An example checkout summary with no items in NON-OK status, is shown below:

    TS7610 Checkout Version 7121.130-0 executed on: 2010-06-22T16:44:09

    =====================================================================

    Summary of NON-OK Statuses:

    Offline 0

    Failed 0

    Unknown 0

    Degraded 0

    Rebuilding 0

    Missing 0

    =====================================================================

    Verify state of Server 1 (Node 0/Enclosure 78) ..........Normal

    Verify state of CPU 1 (Node 0/Enclosure 78) .............Normal

    Verify state of Memory 1 (Node 0/Enclosure 78) ..........Normal

    .

    .

    Verify state of Memory 6 (Node 0/Enclosure 78) ..........Normal

    Verify state of Fan 1 (Node 0/Enclosure 78) .............Normal

    .

    .

    Verify state of Fan 10 (Node 0/Enclosure 78) ............Normal

    Verify state of Boot Drive 1 (Node 0/Enclosure 78) ......Normal

    Verify state of Boot Drive 2 (Node 0/Enclosure 78) ......Normal

    Verify state of Eth Card 1 (Node 0/Enclosure 78) ........Normal

    Verify state of Eth Card 2 (Node 0/Enclosure 78) ........Unconfigured

    Verify state of Eth Card 3 (Node 0/Enclosure 78) ........Unconfigured

    Verify state of PowerSupply 1 (Node 0/Enclosure 78) .....Normal

    Verify state of PowerSupply 2 (Node 0/Enclosure 78) .....Normal

    .

    .

    Verify state of Local FS (Node 0) .......................Normal

    End Processing Procedure

    Press any key to continue

     

     

    An example checkout summary with one item in Failed status, is shown below. In this example, a PSU failure was detected and reported. Detailed information about the failed PSU is provided below the PSU's entry in the list:

    TS7610 Checkout Version 7121.130-0 executed on: 2010-06-22T16:44:09

    =====================================================================

    Summary of NON-OK Statuses:

    Offline 0

    Failed 1

    Unknown 0

    Degraded 0

    Rebuilding 0

    Missing 0

    =====================================================================

    Verify state of Server 1 (Node 0/Enclosure 78) ..........Normal

    .

    .

    Verify state of PowerSupply 1 (Node 0/Enclosure 78) .....Normal

    Verify state of PowerSupply 2 (Node 0/Enclosure 78) .....Failed

    ------------------------------------------------------------

    *Failed: Component Location: Node 0/Enclosure 78/PowerSupply 2

    *Failed: FRU ID: 45W0425

    *Failed: FRU ID: 45W0425

    *Failed: SRN: 0xAB030001

    *Failed: Power Supply status not OK

    *Failed: SRN: 0xAB030004

    *Failed: Power supply: AC fail

    *Failed: SRN: 0xAB030002

    *Failed: Power supply is off

    *Failed: SRN: 0xAB030005

    *Failed: Power supply: DC fail

    *Failed: SRN: 0xAB030003

    *Failed: Power supply is failed

    ------------------------------------------------------------

    .

    .

    End Processing Procedure

    Press any key to continue

  5. After reviewing the summary and list, press any key to return to the parent menu.
  6. Exit the ProtecTIER Service Menu. Type: E <enter>.

    You are returned to the server command prompt.

  7. If a problem was reported on the component you just repaired or replaced, repeat the fault resolution process and repeat the verification.
Terms of use Support Feedback
Copyright IBM Corporation 2010, 2011.
Powered by Eclipse Technology. This product includes software developed by the Eclipse Project (http://www.eclipse.org/).