Switch monitoring

Switch monitoring provides continuous tracking of the health, performance, and availability of switches within a hardware appliance. With switch monitoring, you can track switch status and quickly identify used and unused ports, fans, and loose or unseated connections of the switches in the IBM Fusion HCI System. This topic provides instructions and guidelines to monitor switches from the Overview dashboard page.

  1. Go to Infrastructure > Overview in the IBM Fusion HCI System user interface to see a graphical view of the hardware appliance.

    Under the Resource summary section, you can find the total number of switches along with their health statuses.

    Figure 1. Example Overview page showing error for a switch
    The rack shows a degraded state for the AFM node in RU23 and a critical state for a storage node at RU15.
    In the graphical view of the hardware appliance, the color indicates the health status of a node.
  2. Hover over a switch in the graphical view of the hardware to identify switch details.

    For a switch, the hover text shows the name of the hardware, the health status, the type of switch, and the rack unit.

  3. Review the color indicators and decode their statuses, as described in Understanding color indicators and decoding status.
  4. Fix errors, failures, and warnings based on the guidance.
  5. After all the errors and failures are fixed, go to the Overview dashboard page and check the health status of the problematic switches. For more information about switches, see Switch details.

Understanding color indicators and decoding status

Important: The color gray is not used for switches because failed, degraded, and normal are the only available switch states.
Switch in Green color
It indicates that the switch is in a healthy or normal state, and no action is required.
Switch in Red or Yellow color
It indicates that the switch has failed or is in a degraded state. Complete the following steps to resolve the issue:
  • Click the component that is in red or yellow color.

    A slide-out pane is displayed with the hardware status, type, firmware, serial number (S/N), rack, and rack unit.

  • Review the details in the slide-out pane. If you want more information about the switch, click View full details. Alternatively, go to the Network page > Switches tab and click the switch name link.

    The Network page opens and shows the graphical view, recent events, and other details of the switch and its internal components.

  • Select Front in the Components section to see the front view of the switch, and hover over the graphical view to check the status of internal components, such as used and unused port slots along with their working status.
  • Select Back in the Components section to see the back view of the switch, and hover over the graphical view to check the status of internal components, such as used and unused fan slots along with their working status.

    To debug further, click the internal components, such as fans and PSUs, that are in red or yellow color. A new slide-out pane opens for the selected fan or PSU with more details such as slot, speed, and state.

  • Click View table to check all internal components and their statuses in table format.

    The Components table page opens with three tabs: Ports, Fans, and Power supply. For network details, see Switch details.

  • Review the Recent events section to understand the error.
  • Click View all to go to the events page and view all recent events on the hardware component.

    The BMYxxx code and the error message describe the error.

  • Review the Details section to get more details of the switch, such as type, model, IBM S/N, manufacturer S/N, rack unit, rack, and firmware.
  • To diagnose and take corrective action, try the following options:
Switch in Blue color with diagonal stripes
It indicates that a firmware upgrade is in progress. See Upgrading switch firmware.
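
The color indicators above can be summarized as a small lookup table. The following is a minimal sketch for illustration only. It is not part of the IBM Fusion HCI System or any IBM API; the names SWITCH_COLOR_GUIDE and decode_switch_color and the lowercase color keys are hypothetical. Because this topic describes red and yellow together, the sketch treats them as one case.

```python
from typing import Dict, Tuple

# Hypothetical lookup table built only from the color descriptions in this topic.
# Each entry maps a color to (what it indicates, what to do next).
SWITCH_COLOR_GUIDE: Dict[str, Tuple[str, str]] = {
    "green": ("healthy or normal", "No action is required."),
    "red": ("failed or degraded", "Click the switch and review its components and recent events."),
    "yellow": ("failed or degraded", "Click the switch and review its components and recent events."),
    "blue with diagonal stripes": ("firmware upgrade in progress", "See Upgrading switch firmware."),
}


def decode_switch_color(color: str) -> Tuple[str, str]:
    """Return (meaning, next step) for a switch color shown on the Overview page."""
    key = color.strip().lower()
    if key not in SWITCH_COLOR_GUIDE:
        # Gray, for example, is not used for switches.
        raise ValueError(f"'{color}' is not a switch color indicator")
    return SWITCH_COLOR_GUIDE[key]


if __name__ == "__main__":
    for shown_color in ("Green", "Yellow", "Blue with diagonal stripes"):
        meaning, next_step = decode_switch_color(shown_color)
        print(f"{shown_color}: {meaning}. {next_step}")
```

A color that is not listed, such as gray, raises a ValueError in this sketch, which mirrors the note that gray is not used for switches.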