[CentOS] LSI MegaRAID system status

Thu Jul 30 03:17:17 UTC 2009
Raymond Lillard <rlillard at sonic.net>

Dear list,

I have inherited a CentOS-5 box with a LSI Sata Raid controller.
It is configured as a RAID5 w/hot-standby.  My concern is that
I have no present means of determining the RAID status w/o
downing the system and going into the bios to get status.  If
a disk has failed and brought the standby disk online, I would
like to know about it, the day it happens by seeing it in syslog
or getting an e-mail message.

The only solution I have found is on the LSI web-site.  It is
tool called "MegaMon for Linux".  It most recent release was
in 2005 and it requires that I use their driver.  Just the
package title gives me chills.  It runs as a daemon and is
almost constantly polling the h/w .

I would much prefer a simple command line tool that I can
wrap a script around and run it under cron.

My questions are:

1.  Does anyone know of a command line tool ?

2.  Failing a positive response to #1, has anyone any experience
    with "MegaMon for Linux", good or otherwise ?

Thanks to all,
Ray


The h/w is a dual Opteron dual-core system with lots of memory.
The relevant dmesg lines and lspci output are shown below:

dmesg:

 megaraid cmm: 2.20.2.7 (Release Date: Sun Jul 16 00:01:03 EST 2006)
 megaraid: 2.20.5.1 (Release Date: Thu Nov 16 15:32:35 EST 2006)
 megaraid: probe new device 0x1000:0x0409:0x1000:0x3008: bus 3:slot 14:func 0
 GSI 17 sharing vector 0xA8 and IRQ 17
 ACPI: PCI Interrupt 0000:03:0e.0[C] -> GSI 18 (level, low) -> IRQ 17
 megaraid: fw version:[814B] bios version:[H431]
 scsi5 : LSI Logic MegaRAID driver
 scsi[5]: scanning scsi channel 0 [Phy 0] for non-raid devices
 scsi[5]: scanning scsi channel 1 [virtual] for logical drives
   Vendor: MegaRAID  Model: LD 0 RAID5 1144G  Rev: 814B
   Type:   Direct-Access                      ANSI SCSI revision: 02
*SCSI device sdc: 2343745536 512-byte hdwr sectors (1199998 MB)
*sdc: Write Protect is off
*sdc: Mode Sense: 00 00 00 00
*sdc: asking for cache data failed
*sdc: assuming drive cache: write through
*SCSI device sdc: 2343745536 512-byte hdwr sectors (1199998 MB)
*sdc: Write Protect is off
*sdc: Mode Sense: 00 00 00 00
*sdc: asking for cache data failed
*sdc: assuming drive cache: write through
* sdc: sdc1 sdc2 sdc3
*sd 5:1:0:0: Attached scsi disk sdc


lspci -v

 03:0e.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID (rev 0a)
     Subsystem: LSI Logic / Symbios Logic MegaRAID SATA 300-8X RAID Controller
     Flags: bus master, stepping, 66MHz, medium devsel, latency 64, IRQ 17
     Memory at cfcf0000 (32-bit, prefetchable) [size=64K]
     Memory at fe500000 (32-bit, non-prefetchable) [size=1M]
     Expansion ROM at fe4e0000 [disabled] [size=128K]
     Capabilities: [c0] Power Management version 2
     Capabilities: [d0] Message Signalled Interrupts: 64bit+ Queue=0/1 Enable-
     Capabilities: [e0] PCI-X non-bridge device