subscribe iconSubscribe to this information
POWER6 information

Fault reporting mechanisms

Problems with the cluster can be identified through several mechanisms that are part of the management subsystem.

Faults (problems) can be surfaced through the fault reporting mechanisms found in the following table.

Table 1. Fault reporting mechanisms
Reporting mechanism Description

Cluster Systems Management (CSM) event management fabric log

The CSM event management fabric log is used to monitor and consolidate Fabric Manager and switch error logs in one location.

This log is located on the Cluster Systems Management/Management Server (CSM/MS) in the following file:

/var/log/csm/errorlog/CSM/MS hostname

CSM audit log

This log is part of the standard event management function. It is accessed by using the lsevent command. It is a summary point for Reliable Scalable Cluster Technology (RSCT) and CSM event management. It can help point to activity in the /var/log/csm/errorlog file and serviceable events on the Hardware Management Console (HMC).

Hardware light emitting diodes (LEDs)

The switches and host channel adapters (HCAs) have LEDs.

Manage serviceable events task

This task is the standard reporting mechanism for IBM® Power Systems™ servers that are managed by HMCs.

Chassis viewer LED

This user interface runs on the switch and is accessible from a Web browser. It provides virtual LEDs that represent the switch hardware LEDs.

Fast Fabric Toolset

The Fast Fabric Toolset reports fabric problems in two ways. The first is from a report output. The other is in a health check output.

Customer reported problem

This action is any problem that the customer reports without using any of the reporting mechanisms.

Fabric viewer

This user interface provides a view into current fabric status.

The following logs typically do not have to be accessed when remote logging and CSM Event Management are enabled. However, sometimes they must be captured for debugging purposes.

Fabric notices log on CSM/MS

This intermediate log is where notice or higher severity log entries from switches and subnet managers are received through the syslogd command on the CSM/MS.

This log is located on the CSM/MS in the following file:

/var/log/csm/errorlog/syslogd.fabric.notices

This log is a pipe on a Linux® CSM/MS and cannot be viewed normally. Reading from the pipe causes event management to lose events.

Information log on CSM/MS

This log is an optional intermediate log where info or higher severity log entries from switches and subnet managers are received through the syslogd command on the CSM/MS.

This log is located on the CSM/MS in the following file:

/var/log/csm/errorlog/syslogd.fabric.info

Switch log

This log includes any errors reported by the chassis manager (for example, internal switch chassis problems such as power and cooling, or logic errors.)

This log is accessed through the switch command-line interface (CLI) or Fast Fabric tools.

/var/log/messages on fabric management server

This log is the syslog command on the fabric management server where host-based subnet manager logs reside. This is the log for the entire fabric management server; therefore, there might be entries in it from components other than the subnet manager.


Send feedback | Rate this page

Last updated: Tue, February 08, 2011