IBM Support

ServeRAID M5100 Series: Drive(s) in Protection Information Virtual Disk Marked Offline - Servers

Troubleshooting


Problem

During a read-intensive drive operation, such as a consistency check, a drive is marked offline, which in turn degrades a Protection Information (PI) enabled virtual drive. If multiple drives are marked offline, the virtual drive will no longer be available. This symptom occurs when Protection Information is enabled on the virtual drive (VD).

To verify whether PI is enabled, run the following MegaCLI/StorCLI command and look for the 'PI type' field. If PI is enabled, the value will be anything other than 'None'.

  MegaCLI -CfgDsply -aALL

Protection Information can also be verified through MegaRAID Storage Manager (MSM) by selecting the virtual drive(s) under the Logical tab and looking for the 'Data Protection' field, or alternatively through WebBIOS under the Disk Group field.

In the ServeRAID M51xx Controller's firmware/termlog, one of the following entries will be logged, noting that the issue has occurred.

  SAS core detected EEDP error in application tag PD=

  SAS core detected EEDP error in guard field PD=

Resolving The Problem

Source

RETAIN tip: H211416

Symptom

During a read-intensive drive operation, such as a consistency check, a drive is marked offline, which in turn degrades a Protection Information (PI) enabled virtual drive. If multiple drives are marked offline, the virtual drive will no longer be available.

This symptom occurs when Protection Information is enabled on the virtual drive (VD).

To verify whether PI is enabled, run the following MegaCLI/StorCLI command and look for the 'PI type' field. If PI is enabled, the value will be anything other than 'None'.

  MegaCLI -CfgDsply -aALL
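
On controllers with many virtual drives, the 'PI type' check can be scripted against saved command output. The following is a minimal sketch in Python. It assumes the output was redirected to a text file and that each virtual drive reports a line of the form 'PI type: <value>'; the exact layout is an assumption and may differ between MegaCLI versions. The function names are hypothetical.

```python
import re

# Assumed line shape: "PI type: <value>". Adjust the pattern if your
# MegaCLI version prints the field differently.
PI_LINE = re.compile(r"^[ \t]*PI type[ \t]*:[ \t]*(.+?)[ \t]*$",
                     re.IGNORECASE | re.MULTILINE)


def pi_types(cfg_text):
    """Return the value of every 'PI type' line found in the saved output."""
    return PI_LINE.findall(cfg_text)


def pi_enabled(cfg_text):
    """True if any 'PI type' value is something other than 'None'."""
    return any(value.lower() != "none" for value in pi_types(cfg_text))
```

To use it, read the saved output into a string and pass it to pi_enabled(). A result of True means at least one virtual drive reports a PI type other than 'None'.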

Protection Information can also be verified through MegaRAID Storage Manager (MSM) by selecting the virtual drive(s) under the Logical tab and looking for the 'Data Protection' field, or alternatively through WebBIOS under the Disk Group field.

In the ServeRAID M51xx Controller's firmware/termlog, one of the following entries will be logged, noting that the issue has occurred.

  SAS core detected EEDP error in application tag PD=

  SAS core detected EEDP error in guard field PD=

The following MegaCLI and StorCLI commands will retrieve the firmware/termlog.

  MegaCLI -FwTermLog -Dsply -a0

  StorCLI /c0 show termlog

Both MegaCLI and StorCLI can be downloaded from IBM Fix Central.
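
Because the termlog can be long, it may help to filter it for the two EEDP entries quoted above. The sketch below assumes the termlog has already been saved to a text file and read into a string. The function name is hypothetical, and only the two message strings come from this tip.

```python
# The two log messages quoted in this tip. Any text that follows "PD=" in a
# real log is left untouched, since only the fixed prefix is matched.
EEDP_MARKERS = (
    "SAS core detected EEDP error in application tag",
    "SAS core detected EEDP error in guard field",
)


def find_eedp_errors(termlog_text):
    """Return every termlog line that contains one of the EEDP messages."""
    return [
        line.strip()
        for line in termlog_text.splitlines()
        if any(marker in line for marker in EEDP_MARKERS)
    ]
```

An empty list means neither message was found in the supplied text. Any returned line indicates that the symptom described in this tip has been logged.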

Affected configurations

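The system can be any of the following IBM servers:

- System x3300 M4, type 7382
- System x3500 M4, type 7383
- System x3530 M4, type 7160
- System x3550 M4, type 7914
- System x3630 M4, type 7158
- System x3650 M4, type 7915
- System x3650 M4 BD
- System x3650 M4 HD
- System x3690 X5, type 7147, 7148, 7149
- System x3750 M4, type 8722, 8733
- System x3850 X5, type 7143, 7145, 7146

The system is configured with one or more of the following IBM options:

- ServeRAID M and MR10 Series, option part numbers 00D7082, 81Y4478, 81Y4481, 90Y4304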

This tip is not software specific.

The 23.7.0-0029 and later firmware levels for the ServeRAID M5100 Series SAS/SATA Controller are affected.

The system has the symptom described above.

Solution

This behavior has been corrected in ServeRAID M5100 Series SAS/SATA Controller firmware version 23.22.0-0024.
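
The affected range can be checked programmatically once the installed firmware level is known. The sketch below is an illustration only: it assumes firmware levels always have the shape shown in this tip (dot-separated numbers followed by a hyphen and a build number) and that they order numerically field by field. The function names are hypothetical.

```python
import re

FIRST_AFFECTED = "23.7.0-0029"   # first affected level named in this tip
FIRST_FIXED = "23.22.0-0024"     # level that corrects the behavior


def parse_fw(version):
    """Split a level such as '23.22.0-0024' into a tuple of integers."""
    return tuple(int(part) for part in re.split(r"[.-]", version.strip()))


def is_affected(version):
    """True if the level is at or above FIRST_AFFECTED and below FIRST_FIXED."""
    return parse_fw(FIRST_AFFECTED) <= parse_fw(version) < parse_fw(FIRST_FIXED)
```

The fields are compared as integers rather than as strings because a plain string comparison would sort '23.7' after '23.22' and misclassify the fixed level.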

The file is or will be available by selecting the appropriate Product Group, type of System, Product name, Product machine type, and Operating system on IBM Support's Fix Central web page.

Workaround

Upgrade the ServeRAID M5100 Series SAS/SATA Controller to firmware level 23.22.0-0024 or later.

Alternatively, Data Protection can be disabled through WebBIOS, MegaRAID Storage Manager, or MegaCLI/StorCLI.

EFI WebBIOS

  1. When prompted, press F1 during system startup
  2. Go into System Settings, Adapters and UEFI Drivers
  3. Select the first PCI Root line under LSI EFI SAS Driver
  4. Press 1 for EFI WebBIOS
  5. Select the controller, and then Start
  6. Select the Drive Group under the Virtual Disk with PI enabled
  7. Select the 'Disable Protection Information' option

Legacy WebBIOS

- When prompted, press CTRL-C during system startup and follow the same steps as under Extensible Firmware Interface (EFI) WebBIOS.

MegaRAID Storage Manager (MSM)

- Under the Logical tab, right click the Drive Group under the Virtual Disk with PI enabled, and select 'Disable Data Protection'.

MegaCLI/StorCLI

- LDSetProp DsblPI -LX -aZ
  (where 'X' is the logical volume with PI enabled, and 'Z' is the controller where the logical volume is located)

There is no option currently to disable Data Protection through EFI Human Interface Infrastructure (HII).

Changes to the PI setting take effect immediately, and a restart is not required. Existing data is not affected by this change.

Additional information

If a PI drive that is part of a PI-enabled virtual drive has not completed a Full Initialization, then when a read failure occurs and the firmware receives a PI CRC error, the firmware will mark the drive as bad.

The default initialization is a quick initialization followed by a background initialization, and in this default scenario the issue will be encountered. A firmware issue was discovered whereby Cyclic Redundancy Check (CRC) checking was being conducted on an uninitialized portion of the virtual drive during a Consistency Check. The Consistency Check was not honoring the watermark associated with a Background Initialization. Firmware version 23.22.0-0024 resolves this issue.
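
The idea of honoring the watermark can be pictured with a toy model. The sketch below is not the controller's firmware logic. It assumes, purely for illustration, that background initialization proceeds from block 0 upward and that the watermark is the first block not yet initialized. The function and parameter names are hypothetical.

```python
def pi_checkable_range(cc_start, cc_end, bgi_watermark):
    """Clamp a consistency-check range [cc_start, cc_end) to the initialized
    region [0, bgi_watermark).

    Returns the (start, end) range on which PI checking is meaningful, or
    None if the whole range lies in the uninitialized region.
    """
    end = min(cc_end, bgi_watermark)
    if end <= cc_start:
        return None  # nothing in this range has been initialized yet
    return (cc_start, end)
```

In this model, a check that ignores the watermark would also verify PI fields on blocks at or beyond bgi_watermark, which have never been written with valid protection data. That corresponds to the uninitialized-region CRC checking described above.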

A read failure occurs when there is a non-recoverable media error on the disk.

Note: In MegaRAID Storage Manager, the term 'Full Initialize' is used whilst the WebBIOS of the ServeRAID adapter uses the term 'Slow Initialize'. Both terms refer to the same procedure and are interchangeable.


Document Location

Worldwide

Operating System

System x: Operating system independent / None

System x Hardware Options: Operating system independent / None

Lenovo x86 servers: Operating system independent / None


Document Information

Modified date:
30 January 2019

UID

ibm1MIGR-5094050