Seperate the disk failure from MegaRAID

Server machine usually connected with the external storage device via a PCI-E storage card, and then built a RAID array with these disks. it’s good job that can separate the storage ,but how to find and replace a bad disk especially when it’ raid mode.

Linux OS would say something error in its messages file, such as ’Megaraid_disk_35 SMART FAILURE’. This article is to introduce how to find this disk,NO 35?May be.

 

Since the raid controller is LSI MegarRaid controller, we can install 8-07-14_MegaCLI.zip (https://docs.broadcom.com/docs/12351587) to check the raid info under Linux .

 

# rpm -ivh MegaCli-8.07.14.noarch.rpm

 

#/opt/MegaRAID/MegaCli/MegaCli64

Fatal error – Command Tool invoked with wrong parameters

Exit Code: 0×01

 

#ln -sf /opt/MegaRAID/MegaCli/MegaCli64 /usr/bin/megacli

Or cp /opt/MegaRAID/MegaCli/MegaCli64 /usr/bin/

 

MegaCli command
(1)display all of RAID level ,setting and the logical disk info

#megacli -LdInfo -LALL -aAll
(2)display the RAID module info , RAID setting , physical disk info

#megacli -cfgdsply -aALL | more
(3)check the RAID module specify info
#MegaCli -AdpAllInfo -aALL
(4)check the adapter count
#megacli -adpCount
(5)check the logical disk count
#megacli -LdGetNum -aALL
(6)check the helpful info
#megacli –help

 

Then we will get a list of disk info if run the command , find the Device ID:35 info.

#megacli -cfgdsply –aALL

 

Physical Disk: 1

Enclosure Device ID: 65

Slot Number: 11

Drive’s position: DiskGroup: 2, Span: 1, Arm: 1

Enclosure position: 1

Device Id: 35

WWN: 5000039698D882BD

Sequence Number: 2

Media Error Count: 0

Other Error Count: 0

Predictive Failure Count: 1

Last Predictive Failure Event Seq Number: 2991

PD Type: SAS

 

Raw Size: 931.512 GB [0x74706db0 Sectors]

Non Coerced Size: 931.012 GB [0x74606db0 Sectors]

Coerced Size: 930.390 GB [0x744c8000 Sectors]

Sector Size: 512

Logical Sector Size: 512

Physical Sector Size: 512

Firmware state: Online, Spun Up

Device Firmware Level: 0108

Shield Counter: 0

Successful diagnostics completion on : N/A

SAS Address(0): 0x5000039698d882be

SAS Address(1): 0x0

Connected Port Number: 8(path0)

Inquiry Data: TOSHIBA MG03SCA100     0108X5Q0A06HFTQ5

FDE Capable: Not Capable

FDE Enable: Disable

Secured: Unsecured

Locked: Unlocked

Needs EKM Attention: No

Foreign State: None

Device Speed: 6.0Gb/s

Link Speed: 12.0Gb/s

Media Type: Hard Disk Device

Drive Temperature :34C (93.20 F)

PI Eligibility: No

Drive is formatted for PI information: No

PI: No PI

Port-0 :

Port status: Active

Port’s Linkspeed: 12.0Gb/s

Port-1 :

Port status: Active

Port’s Linkspeed: 12.0Gb/s

Drive has flagged a S.M.A.R.T alert : Yes

 

We would like to find out the module info: Slot Number: 11

Drive’s position: DiskGroup: 2, Span: 1, Arm: 1; TOSHIBA MG03SCA100     0108X5Q0A06HFTQ5 .and find it in the storage device.