Flashes (Alerts)
Abstract
Marvell QLogic Fibre Channel adapters might fail to be removed properly with DLPAR operations on Red Hat Enterprise Linux (RHEL) 9.5.
Content
Linux Releases Affected
RHEL 9.5
IBM Systems Affected
All POWER9 and Power10 systems with QLogic FC adapters on RHEL 9.5.
Symptoms
When you perform a dynamic logical partition (DLPAR) remove operation, there is a possibility that the adapter is not removed correctly from the system, which causes a future DLPAR add operation to fail. Typical symptoms include an EEH error followed by timeout errors.
EEH: Recovering PHB#0-PE#10000
EEH: PE location: N/A, PHB location: N/A
EEH: Frozen PHB#0-PE#10000 detected
EEH: Call Trace:
EEH: [c00000000005007c] __eeh_send_failure_event+0x7c/0x160
EEH: [c000000000048d64] eeh_dev_check_failure.part.0+0x254/0x650
EEH: [c008000005862250] qla24xx_read_flash_dword+0x198/0x1f0 [qla2xxx]
EEH: [c008000005866110] qla24xx_read_flash_data+0x78/0xf0 [qla2xxx]
EEH: [c0080000058048e4] qla24xx_load_risc_flash+0x24c/0x680 [qla2xxx]
EEH: [c00800000581aa80] qla81xx_load_risc+0x178/0x1e0 [qla2xxx]
EEH: [c008000005812280] qla2x00_setup_chip+0x208/0x8f8 [qla2xxx]
EEH: [c00800000581b4e8] qla2x00_initialize_adapter+0x3f0/0x8a0 [qla2xxx]
EEH: [c0080000057fae58] qla2x00_probe_one+0xe40/0x1db0 [qla2xxx]
EEH: [c000000000939de0] local_pci_probe+0x80/0x120
EEH: [c00000000093ae1c] pci_call_probe+0x8c/0x1f0
EEH: [c00000000093b71c] pci_device_probe+0xbc/0x1a0
EEH: [c000000000a65d44] really_probe+0x104/0x540
EEH: [c000000000a662fc] __driver_probe_device+0x17c/0x220
EEH: [c000000000a663f4] driver_probe_device+0x54/0x130
EEH: [c000000000a665dc] __device_attach_driver+0x10c/0x1d0
EEH: [c000000000a626e4] bus_for_each_drv+0xb4/0x130
EEH: [c000000000a66cd8] __device_attach+0xe8/0x2a0
EEH: [c000000000925748] pci_bus_add_device+0x78/0xf0
EEH: [c000000000925814] pci_bus_add_devices+0x54/0xb0
EEH: [c000000000072fc8] pcibios_finish_adding_to_bus+0x68/0xe0
EEH: [c000000000103ed0] init_phb_dynamic+0xd0/0x110
EEH: [c00800000ae80624] dlpar_add_slot+0x18c/0x380 [rpadlpar_io]
EEH: [c00800000ae80cac] add_slot_store+0xa4/0x150 [rpadlpar_io]
EEH: [c0000000008de95c] kobj_attr_store+0x2c/0x50
EEH: [c0000000006c1694] sysfs_kf_write+0x64/0x80
EEH: [c0000000006c0108] kernfs_fop_write_iter+0x1b8/0x2a0
EEH: [c0000000005c6cd4] vfs_write+0x364/0x4e0
EEH: [c0000000005c7154] ksys_write+0x84/0x140
EEH: [c00000000002ef54] system_call_exception+0x164/0x310
EEH: [c00000000000bfe8] system_call_vectored_common+0xe8/0x278
EEH: This PCI device has failed 1 times in the last hour and will be permanently disabled after 5 failures.
EEH: Notify device drivers to shutdown
EEH: Beginning: 'error_detected(IO frozen)'
And timeout errors such as the following.
qla2xxx [0000:01:00.0]-d04c:4: MBX Command timeout for cmd b, iocontrol=ffffffff jiffies=10000829f mb[0-3]=[0xffff 0xffff 0xffff 0xffff] mb7 0xffff host_status 0xffffffff hccr 0xffffffff
qla2xxx [0000:01:00.0]-d04c:4: MBX Command timeout for cmd b, iocontrol=ffffffff jiffies=100008e64 mb[0-3]=[0xffff 0xffff 0xffff 0xffff] mb7 0xffff host_status 0xffffffff hccr 0xffffffff
qla2xxx [0000:01:00.0]-d04c:4: MBX Command timeout for cmd b, iocontrol=ffffffff jiffies=100009a29 mb[0-3]=[0xffff 0xffff 0xffff 0xffff] mb7 0xffff host_status 0xffffffff hccr 0xffffffff
qla2xxx [0000:01:00.0]-d04c:4: MBX Command timeout for cmd b, iocontrol=ffffffff jiffies=10000a5ee mb[0-3]=[0xffff 0xffff 0xffff 0xffff] mb7 0xffff host_status 0xffffffff hccr 0xffffffff
Workaround
There is no workaround available for this issue currently. Instead of using the DLPAR operation, you can shutdown the logical partition before removing or adding Qlogic FC adapters to the configuration.
For more information about DLPAR, see: Dynamic Logical Partitioning.
Fix Outlook
There is no fix available for this issue currently. Resolution for this issue will be pursued in a future qla2xxx driver update. After a fix is available, then the relevant update will be available to resolve this issue.
I/O device impacted
QLogic FC adapters
[{"Type":"MASTER","Line of Business":{"code":"LOB26","label":"Storage"},"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SGMV157","label":"IBM Support for Red Hat Enterprise Linux Server"},"ARM Category":[{"code":"a8m0z000000Gnl7AAC","label":"Red Hat Enterprise Linux"}],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"}]
Was this topic helpful?
Document Information
Modified date:
08 October 2024
UID
ibm17170348