[CentOS] Hardware raid health?

Mon Aug 25 20:52:31 UTC 2014
Jason Pyeron <jpyeron at pdinc.us>

> -----Original Message-----
> From: Digimer
> Sent: Monday, August 25, 2014 16:23
> 
> On 25/08/14 04:11 PM, Jason Pyeron wrote:
> >> -----Original Message-----
> >> From: centos-bounces at centos.org
> >> [mailto:centos-bounces at centos.org] On Behalf Of Les Mikesell
> >> Sent: Monday, August 25, 2014 16:03
> >> To: CentOS mailing list
> >> Subject: [CentOS] Hardware raid health?
> >>
> >> I just had an IBM in a remote location with a hardware raid1 have both
> >> drives go bad.  With local machines I probably would have caught it
> >> from the drive light before the 2nd one died...  What is the state of
> >> the art in linux software monitoring for this?   Long ago when that
> >> box was set up I think the best I could have done was a Java GUI tool
> >> that IBM had for their servers - and that seemed like overkill for a
> >> simple monitor.    Is there anything more lightweight that knows about
> >> the underlying drives in a hardware raid set on IBM's - and also
> >> recent HP servers?
> >
> > We use MegaCLI, but it has the risk of hanging the box (observed only once).
> >
> > Just changed out a drive last night because of it.
> >
> > -Jason
> 
> Can you share any detail on this? Controller/drive model? MegaCli 
> version? How exactly did it lock up?

It locked up the OS, not the array. I power cycled the box after the array finished syncing the new drive, 6 hours later.

On a Dell PE2970
Product Name    : PERC 6/i Integrated
FW Package Build: 6.2.0-0013

                    Mfg. Data
                ================
Mfg. Date       : 06/24/08
Rework Date     : 06/24/08
Revision No     :
Battery FRU     : N/A

                Image Versions in Flash:
                ================
FW Version         : 1.22.02-0612
BIOS Version       : 2.04.00
WebBIOS Version    : 1.1-46-e_15-Rel
Ctrl-R Version     : 1.02-015B
Preboot CLI Version: 01.00-023:#%00006
Boot Block Version : 1.00.00.01-0011

      MegaCLI SAS RAID Management Tool  Ver 8.05.71 Apr 30, 2013


$ while MegaCli64 -PDRbld -ShowProg -PhysDrv [32:1] -aALL; do sleep 1; done

Polling every second (the sleep 1) was abusive!
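
A gentler version of that loop might look like the sketch below. It is untested against a real controller and only changes how often the CLI is called. The names poll_rebuild and POLL_INTERVAL, and the 60 second default, are illustrative assumptions and not part of MegaCli. Like the loop above, it stops as soon as the command exits non-zero.

```shell
#!/bin/sh
# Sketch: poll rebuild progress at a gentler interval than "sleep 1".
# POLL_INTERVAL and poll_rebuild are hypothetical names, not MegaCli features.
POLL_INTERVAL="${POLL_INTERVAL:-60}"   # seconds between polls (assumed default)

# Run the given command repeatedly until it exits non-zero,
# sleeping POLL_INTERVAL seconds between runs.
poll_rebuild() {
    while "$@"; do
        sleep "$POLL_INTERVAL"
    done
}

# Usage, with the same MegaCli64 arguments as the loop above
# (brackets quoted so the shell cannot treat them as a glob pattern):
# poll_rebuild MegaCli64 -PDRbld -ShowProg -PhysDrv '[32:1]' -aALL
```

Whether a longer interval would have avoided the lockup is unknown; it only reduces how hard the controller is queried.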

> 
> I use it extensively so this worries me. :)
> 
> -- 
> Digimer
> Papers and Projects: https://alteeve.ca/w/
> What if the cure for cancer is trapped in the mind of a person without
> access to education?