IBM Support

Unexpected Brocade Enterprise/Entry SAN Fibre Switch Module (FSM) failover or reboot after 49.71 days - IBM BladeCenter

Troubleshooting


Problem

Unexpected Brocade Enterprise/Entry SAN Fibre Switch Module (FSM) failover or reboot after 49.71 days

Resolving The Problem

 

Source

Retain tip: H085852

Symptom

When the Brocade Enterprise/Entry SAN Fibre Switch Module (FSM) running Fabric OS version 4.4.0, version 4.4.0a, or version 4.4.1 has been up for longer than 49.71 days, the FSM may panic, resulting in a reboot or failover.

Affected configuration

The system may be any of the following IBM BladeCenters:

  • IBM BladeCenter, type 8677, any model
  • IBM BladeCenter HS40, type 8839, any model
  • IBM BladeCenter HS20, type 8678, any model
  • IBM BladeCenter HS20, type 8832, any model
  • IBM BladeCenter JS20, type 8842, any model
  • IBM BladeCenter T, type 8730, any model
  • IBM BladeCenter T, type 8720, any model

The following firmware levels are affected:

  • Fabric OS 4.4.0 firmware for the FSM
  • Fabric OS 4.4.0a firmware for the FSM
  • Fabric OS 4.4.1 firmware for the FSM

Solution

Upgrade the switch firmware to Fabric OS version 4.4.0b or later, or to Fabric OS version 4.4.1a or later, both scheduled for release in the second quarter of 2005.

Workaround

Possible disruption of the fabric can be minimized by ensuring that switches logically adjacent to the switch modules are running Fabric OS version 4.4.0, version 4.4.0a or version 4.4.1.

To restart the switch module without disrupting the fabric before its uptime reaches 49.71 days:
  1. Log in to the switch as USERID.
     
    Note: To view the current uptime, use the command uptime at the command prompt of the switch telnet screen.
  2. Issue the hareboot command.
     
    Note: The hareboot command reboots the switch without disruption to the fabric. It takes approximately 60 seconds, with no interruption to traffic.
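
When planning the hareboot in step 2, it can help to work out how much time is left before the 49.71-day mark from the uptime reported in step 1. The following Python sketch shows that arithmetic only. It does not connect to the switch or parse the output of the uptime command; the function names and the seven-day safety margin are hypothetical and not part of this tip.

```python
# Illustrative helper: the names and the default margin are assumptions.
THRESHOLD_DAYS = 49.71  # uptime at which the problem may occur, per this tip

def hours_until_threshold(days, hours=0, minutes=0):
    """Hours remaining before uptime reaches the 49.71-day mark."""
    uptime_hours = days * 24 + hours + minutes / 60
    return THRESHOLD_DAYS * 24 - uptime_hours

def reboot_due(days, hours=0, minutes=0, margin_days=7):
    """True when the remaining time is within the chosen safety margin."""
    return hours_until_threshold(days, hours, minutes) <= margin_days * 24

if __name__ == "__main__":
    # Example: uptime reads 45 days, 3 hours, 20 minutes.
    print(hours_until_threshold(45, 3, 20))
    print(reboot_due(45, 3, 20))
```

A margin of several days leaves room to schedule the hareboot in a maintenance window rather than waiting until the threshold is close.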

Additional information

In Fabric OS version 4.4.0, version 4.4.0a, or version 4.4.1, the Simple Network Management Protocol (SNMP) daemon incorrectly computes its refresh time because of an msticks counter wraparound. As a result, the software watchdog begins recovery procedures, which results in a switch failover or reboot. The wraparound occurs once the switch has exceeded 49.71 days of uptime.
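
The 49.71-day figure is consistent with a millisecond tick counter stored as an unsigned 32-bit value, which returns to zero after 2^32 milliseconds. This tip does not state the width of the msticks counter or show the daemon's code, so the 32-bit width and the function names in the following Python sketch are assumptions. The sketch only illustrates the arithmetic and how a plain subtraction of tick values goes wrong across the wrap.

```python
# Illustrative only: assumes msticks is an unsigned 32-bit millisecond counter.
MS_PER_DAY = 24 * 60 * 60 * 1000
WRAP = 2 ** 32  # number of distinct values in an unsigned 32-bit counter

def days_until_wrap():
    """Uptime, in days, at which a 32-bit millisecond counter returns to zero."""
    return WRAP / MS_PER_DAY

def elapsed_naive(now_ticks, last_ticks):
    """Plain subtraction: goes negative once now_ticks has wrapped past zero."""
    return now_ticks - last_ticks

def elapsed_wrap_safe(now_ticks, last_ticks):
    """Modular subtraction: stays correct across a single wraparound."""
    return (now_ticks - last_ticks) % WRAP

if __name__ == "__main__":
    print(round(days_until_wrap(), 2))
    last = WRAP - 10  # sampled 10 ms before the wrap
    now = 5           # sampled 5 ms after the wrap
    print(elapsed_naive(now, last), elapsed_wrap_safe(now, last))
```

Under this assumption, a refresh interval computed with the plain subtraction becomes a large negative number at the moment of the wrap, which is one way a timer calculation of this kind can fail.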



Document Location

Worldwide

Operating System

BladeCenter:Operating system independent / None

Document Information

Modified date:
29 January 2019

UID

ibm1MIGR-58862