IBM Support

MegaRAID virtual disk drive missing in VMWare ESXi virtual machine operating systems - System x3650 M4, x3750 M4

Troubleshooting


Problem

[This abstract has been truncated due to length constraints] Users running VMware ESXi 5.5 with the ServeRAID M5110e in pass-through mode and firmware version 23.16.0-0027 or later might find the virtual disk missing from the virtual machine operating system. The following information related to this issue is found in the logs below:

Resolving The Problem

Source

RETAIN tip: H212857

Symptom

Users running VMware ESXi 5.5 with the ServeRAID M5110e in pass-through mode and firmware version 23.16.0-0027 or later might find the virtual disk missing from the virtual machine operating system. The following information related to this issue is found in the logs below:

firmware EvtLog:

  Fatal Firmware Error has occurred

FWTermLog:

 

_MonTask: Online Controller Reset

[0]: fp=c03ffdf8, lr=c186df78 - _MonTask+1b8

[1]: fp=c03fff28, lr=c196a6bc - enterMonTask+78

[2]: fp=c03fff38, lr=c03f90e4 - exceptionDataTlbError+3c

[3]: fp=c03fff50, lr=c03f8920 - _CommonNonCriticalException Handler+60

[4]: fp=c130ad98, lr=c136c0e8 - LdInfo+0

[5]: fp=c130ae28, lr=c1993e10 - sscd_associate_disassociate_ld+18c

[6]: fp=c130aec8, lr=c182089c - CmdBlocked+2b8

[7]: fp=c130aef0, lr=c1318230 - set_state+b24

[8]: fp=c130af30, lr=c1318330 - raid_task_idle_loop+20

[9]: fp=c130af40, lr=c18961a8 - raid_task+e68

[10]: fp=c130bfc8, lr=c196a19c - _main+a58

[11]: fp=c130bff8, lr=fc801e60 - __start+e08

MonTask: line 280 in file ../../raid/1078int.c

UIC_ER=95e80:8070b07, UIC_MSR=0:0, MSR=21000, sp=c03ffdf8

online reset - no input needed 00/00/00 00:00:00: online reset - wait for Reset to occur

Operating System boot.msg file:

  <6>[5.835556] megasas: 00.00.05.38-rc1 <DATE/TIME>
<6>[5.835656] megasas: 0x1000:0x005b:0x1014:0x040b: bus 3:slot 0:func 0
<6>[5.835838] megaraid_sas 0000:03:00.0: PCI INT A -> GSI 18 (level, low) -> IRQ 18
<6>[5.836058] megasas: Waiting for firmware to come to ready state
<7>[5.836141] megasas: firmware in FAULT state!!
<6>[5.836195] megaraid_sas 0000:03:00.0: PCI INT A disabled

When VMware ESXi 5.5 with the ServeRAID M5110e in pass-through mode is working correctly, the boot.msg log shows:

  [ 4.624327] megasas: firmware now in Ready state
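The failing and working boot.msg excerpts above differ in the firmware state that megasas reports. As a minimal sketch, assuming the log text has already been read into a string, the check can be automated by searching for the two messages quoted in this tip. The function name `megasas_firmware_state` is hypothetical and not part of any IBM or VMware tool.

```python
import re

# Message fragments copied from the boot.msg excerpts in this tip.
FAULT = re.compile(r"megasas: firmware in FAULT state")
READY = re.compile(r"megasas: firmware now in Ready state")


def megasas_firmware_state(log_text):
    """Return 'fault', 'ready', or 'unknown' based on the last matching line.

    Hypothetical helper for illustration only.
    """
    state = "unknown"
    for line in log_text.splitlines():
        if FAULT.search(line):
            state = "fault"
        elif READY.search(line):
            state = "ready"
    return state


sample = """<6>[5.836058] megasas: Waiting for firmware to come to ready state
<7>[5.836141] megasas: firmware in FAULT state!!
<6>[5.836195] megaraid_sas 0000:03:00.0: PCI INT A disabled"""

print(megasas_firmware_state(sample))  # prints "fault"
```

A result of "fault" matches the symptom described in this tip. A result of "unknown" only means that neither message was found in the text supplied.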

This issue is present when:

Affected configurations

The system can be any of the following IBM servers:

The system is configured with at least one of the following:

The system is configured with one or more of the following IBM options:

The version 23.16.0-0027 or higher level of firmware for the ServeRAID M5110e SAS/SATA controller is affected.
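To check whether an installed firmware level falls in the affected range, the version strings can be compared field by field. The sketch below assumes that every dot- or hyphen-separated field is numeric and that a higher level compares greater in the usual left-to-right order. That ordering is an assumption, not something this tip states. The function names are hypothetical, and the versions used in the usage lines (other than 23.16.0-0027) are made-up strings for illustration, not real firmware releases.

```python
import re


def parse_fw_version(version):
    """Split a version such as '23.16.0-0027' into a tuple of integers."""
    return tuple(int(part) for part in re.split(r"[.-]", version))


# First affected level, taken from this tip.
AFFECTED_FROM = parse_fw_version("23.16.0-0027")


def is_affected(version):
    """Return True if the version is at or above the first affected level.

    Assumes plain numeric, left-to-right ordering of the fields.
    """
    return parse_fw_version(version) >= AFFECTED_FROM


print(is_affected("23.16.0-0027"))  # prints True
print(is_affected("23.15.0-0001"))  # prints False (made-up version string)
print(is_affected("23.17.0-0001"))  # prints True (made-up version string)
```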

Note: This does not imply that the network operating system will work under all combinations of hardware and software.

Please see the compatibility page for more information:

http://www.ibm.com/systems/info/x86servers/serverproven/compat/us/

Solution

Users should follow the VMware guidelines at the link below when configuring a Peripheral Component Interconnect (PCI) device as pass-through. This document is from VMware, and any user attempting to configure a PCI device as pass-through should consult it first.

http://www.vmware.com/pdf/vsp_4_vmdirectpath_host.pdf

Note: The linked document is from VMware and is a generic document for all PCI devices. It states that users can add an entry in the passthrough.map file for the D3 to D0 reset method.

To resolve this issue, modify the passthrough.map file by adding a new line for the card being used, save the file, and restart the VMware ESXi host server:

 

# passthrough attributes for devices

# LSILogic 1068 based SAS controllers

1000 0056 d3d0 default

1000 0058 d3d0 default

1000 005b d3d0 default ==> Add this line
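The edit above can be sketched as a small idempotent text transformation. This is a minimal illustration that works on the file contents as a string, so the location of passthrough.map on the host is deliberately left out. The function name `ensure_passthrough_entry` and its parameters are hypothetical. The fourth column is copied verbatim from this tip without interpreting it.

```python
def ensure_passthrough_entry(map_text, vendor="1000", device="005b",
                             reset="d3d0", last_field="default"):
    """Return map_text with an entry for vendor/device, appending one if missing.

    Hypothetical helper: it only transforms text and never touches the host.
    """
    lines = map_text.splitlines()
    for line in lines:
        # Ignore comments, then compare the first two columns.
        fields = line.split("#", 1)[0].split()
        if [f.lower() for f in fields[:2]] == [vendor, device]:
            # An entry for this device already exists; leave the text
            # unchanged so that it can be reviewed by hand.
            return map_text
    lines.append(f"{vendor} {device} {reset} {last_field}")
    return "\n".join(lines) + "\n"


existing = """# LSILogic 1068 based SAS controllers
1000 0056 d3d0 default
1000 0058 d3d0 default
"""

updated = ensure_passthrough_entry(existing)
print(updated, end="")
```

Running the function a second time on its own output returns the text unchanged, so the line is never added twice. Back up the original file before writing any modified contents to the host, and restart the host afterward as described above.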

Workaround

No valid workaround is available for this issue because the VMware ESXi host server must be corrected and restarted.

Additional information

After adding the line 1000 005b d3d0 default to the passthrough.map file and restarting the VMware ESXi host, PCI pass-through for the ServeRAID M5110e works properly in SLES 11 and Microsoft Windows Server 2012 virtual machines.
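The values 1000 and 005b in that line match the first two hexadecimal fields of the megasas line in the boot.msg excerpt (0x1000:0x005b:0x1014:0x040b). As a sketch, assuming those two fields are the PCI vendor and device IDs of the controller, the entry can be derived from the log line. That reading is consistent with the entry given in this tip but is not stated by it. The function name is hypothetical.

```python
import re

# Four 4-digit hex fields as printed in the megasas line of boot.msg.
PCI_ID = re.compile(
    r"megasas: 0x([0-9a-fA-F]{4}):0x([0-9a-fA-F]{4})"
    r":0x([0-9a-fA-F]{4}):0x([0-9a-fA-F]{4})"
)


def passthrough_line_from_log(line, reset="d3d0", last_field="default"):
    """Build a passthrough.map entry from a megasas log line, or return None.

    Assumes the first two hex fields are the PCI vendor and device IDs.
    """
    match = PCI_ID.search(line)
    if match is None:
        return None
    vendor = match.group(1).lower()
    device = match.group(2).lower()
    return f"{vendor} {device} {reset} {last_field}"


log_line = "<6>[5.835656] megasas: 0x1000:0x005b:0x1014:0x040b: bus 3:slot 0:func 0"
print(passthrough_line_from_log(log_line))  # prints "1000 005b d3d0 default"
```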

This special configuration change in VMware ESXi is required for this configuration to work properly. When configuring a Host Bus Adapter (HBA) as a pass-through device under VMware, the user must make sure that the proper reset method is used so that the reset completes properly.

For PCI devices, the D3D0 reset is the proper method, which uses PCI power management features (D3 to D0 level transition method) to reset pass-through devices.

Without this reset method, the other functions of the pass-through device might work correctly, but a reset issued to the device by a virtual machine might not complete properly.

Document Location

Worldwide

Operating System

System x:SUSE Linux Enterprise Server 11

System x:SUSE Linux Enterprise Server 11 x86-64

System x:VMware vSphere 5.0

System x:VMware vSphere 5.0 x64

System x:Windows Server 2012

System x:Windows Server 2012 R2

Lenovo x86 servers:SUSE Linux Enterprise Server 11

Lenovo x86 servers:SUSE Linux Enterprise Server 11 x86-64

Lenovo x86 servers:Operating system independent / None

Lenovo x86 servers:VMware vSphere 5.0

Lenovo x86 servers:VMware vSphere 5.0 x64

Lenovo x86 servers:Windows Server 2012

Lenovo x86 servers:Windows Server 2012 R2


Document Information

Modified date:
30 January 2019

UID

ibm1MIGR-5095905