On 25/08/14 04:11 PM, Jason Pyeron wrote:
-----Original Message-----
From: centos-bounces@centos.org [mailto:centos-bounces@centos.org] On Behalf Of Les Mikesell
Sent: Monday, August 25, 2014 16:03
To: CentOS mailing list
Subject: [CentOS] Hardware raid health?
I just had an IBM in a remote location with a hardware RAID1 have both drives go bad. With local machines I probably would have caught it from the drive light before the 2nd one died... What is the state of the art in Linux software monitoring for this? Long ago when that box was set up I think the best I could have done was a Java GUI tool that IBM had for their servers - and that seemed like overkill for a simple monitor. Is there anything more lightweight that knows about the underlying drives in a hardware RAID set on IBMs - and also on recent HP servers?
We use MegaCLI, but it has the risk of hanging the box (observed only once).
Just changed out a drive last night because of it.
-Jason
Can you share any detail on this? Controller/drive model? MegaCli version? How exactly did it lock up?
I use it extensively so this worries me. :)
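
For reference, a minimal sketch of the kind of lightweight check discussed above, assuming an LSI MegaRAID-based controller and the MegaCli `-PDList -aALL` listing. The binary path, the field names ("Enclosure Device ID", "Slot Number", "Firmware state", "Media Error Count", "Predictive Failure Count") and the helper names are assumptions that may differ between MegaCli versions and controllers; HP Smart Array controllers use a different tool and are not covered. The timeout only helps if the stuck process can still be killed, so it would not necessarily protect against the kind of hang described above.

```python
import subprocess

# Assumed install path; it varies (MegaCli, MegaCli64, other directories).
MEGACLI = "/opt/MegaRAID/MegaCli/MegaCli64"


def parse_pdlist(text):
    """Split -PDList style output into one dict per physical drive."""
    drives, current = [], None
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # not a "Key: value" line
        key, value = key.strip(), value.strip()
        if key == "Enclosure Device ID":
            # Assumed to be the first field of each drive record.
            current = {}
            drives.append(current)
        if current is not None:
            current[key] = value
    return drives


def unhealthy(drives):
    """Return (slot, reason) pairs for drives that look suspect."""
    problems = []
    for d in drives:
        slot = d.get("Slot Number", "?")
        state = d.get("Firmware state", "unknown")
        # Anything that is not an online member or a hot spare is reported.
        if not state.startswith(("Online", "Hotspare")):
            problems.append((slot, "state: " + state))
        for counter in ("Media Error Count", "Predictive Failure Count"):
            if d.get(counter, "0") != "0":
                problems.append((slot, counter + ": " + d[counter]))
    return problems


def check(timeout=60):
    """Run MegaCli with a timeout and return the list of suspect drives."""
    out = subprocess.run(
        [MEGACLI, "-PDList", "-aALL"],
        capture_output=True, text=True, timeout=timeout, check=True,
    ).stdout
    return unhealthy(parse_pdlist(out))
```

Run from cron, `check()` could mail its result whenever the returned list is non-empty. The parsing is kept separate from the MegaCli call so it can be tried against saved output without touching the controller.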