Tuesday 7 February 2017

IBM X Series IMM Error Message - Failure Predicted on drive Drive 1 for array


How to Troubleshoot Predictive Failures in IBM Servers:-  


IMM Message:-

There was warning  message observed on IMM - Failure Predicted on drive Drive 1 for array

Steps: -

1. Login into server CLI and run below mega raid command.


ibmser1# /opt/MegaRAID/MegaCli/MegaCli64 -PdList -aAll
                                     
Adapter #0

Enclosure Device ID: 252
Slot Number: 0
Drive's postion: DiskGroup: 0, Span: 0, Arm: 0
Enclosure position: 0
Device Id: 9
WWN: 50000395580A7DA1
Sequence Number: 2
Media Error Count: 0
Other Error Count: 0
Predictive Failure Count: 0
Last Predictive Failure Event Seq Number: 0
PD Type: SAS
Raw Size: 279.396 GB [0x22ecb25c Sectors]
Non Coerced Size: 278.896 GB [0x22dcb25c Sectors]
Coerced Size: 278.464 GB [0x22cee000 Sectors]
Firmware state: Online, Spun Up
Is Commissioned Spare : NO
Device Firmware Level: SC2E
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x50000395580a7da2
SAS Address(1): 0x0
Connected Port Number: 3(path0) 
Inquiry Data: IBM-ESXSMK3001GRRB      SC2E2471GC92SC2ESC2ESC2E
IBM FRU/CRU: 81Y9671     


Enclosure Device ID: 252
Slot Number: 1
Drive's postion: DiskGroup: 0, Span: 0, Arm: 1
Enclosure position: 0
Device Id: 8
WWN: 50000395580A7D99
Sequence Number: 2
Media Error Count: 93
Other Error Count: 0
Predictive Failure Count: 202
Last Predictive Failure Event Seq Number: 18023
PD Type: SAS
Raw Size: 279.396 GB [0x22ecb25c Sectors]
Non Coerced Size: 278.896 GB [0x22dcb25c Sectors]
Coerced Size: 278.464 GB [0x22cee000 Sectors]
Firmware state: Online, Spun Up
Is Commissioned Spare : NO
Device Firmware Level: SC2E
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x50000395580a7d9a
SAS Address(1): 0x0
Connected Port Number: 2(path0) 
Inquiry Data: IBM-ESXSMK3001GRRB      SC2E2471GA92SC2ESC2ESC2E
IBM FRU/CRU: 81Y9671     


2. From the output we understand predictive failure count on drive 2 is 202 no' s
3. If predictive failure counts are excess we need to go ahead and replace the drive.

Explanation from IBM

Failure predicted (PFA) on the hard drive. May also be shown as 806f020d0401ffff or 0x806f020d0401ffff

Severity Warning

Alert Category System - Predicted Failure

Serviceable Yes

SNMP Trap ID -27

Automatically notify Support Yes

User response - Replace hard disk drive 1 at the next maintenance period.





No comments:

Post a Comment