Is there a way on Dell R320 (two power supplies and hardware RAID-1) on centos to get a message that a power supply is failing or that one of the hardware RAID disks is failing? Sure there is the front panel - but no-one is there ...
Is that reported to linux/centos some way? This is my first R320.
Thanks,
Jerry
----- Original Message ----- | Is there a way on Dell R320 (two power supplies and hardware RAID-1) | on | centos | to get a message that a power supply is failing or that one of the | hardware RAID disks is failing? | Sure there is the front panel - but no-one is there ... | | Is that reported to linux/centos some way? | This is my first R320. | | Thanks, | | Jerry | _______________________________________________ | CentOS mailing list | CentOS@centos.org | http://lists.centos.org/mailman/listinfo/centos |
You can use IPMI if the machine has it. That way you get a consistent method to get many errors from the machine.
you should certainly have a DRAC in that machine. Look for the network port with a wrench below/above it. DRAC is similar to HP ILO or IBM MMI
On Thu, May 23, 2013 at 5:05 PM, James A. Peltier jpeltier@sfu.ca wrote:
----- Original Message ----- | Is there a way on Dell R320 (two power supplies and hardware RAID-1) | on | centos | to get a message that a power supply is failing or that one of the | hardware RAID disks is failing? | Sure there is the front panel - but no-one is there ... | | Is that reported to linux/centos some way? | This is my first R320. | | Thanks, | | Jerry | _______________________________________________ | CentOS mailing list | CentOS@centos.org | http://lists.centos.org/mailman/listinfo/centos |
You can use IPMI if the machine has it. That way you get a consistent method to get many errors from the machine.
-- James A. Peltier Manager, IT Services - Research Computing Group Simon Fraser University - Burnaby Campus Phone : 778-782-6573 Fax : 778-782-3045 E-Mail : jpeltier@sfu.ca Website : http://www.sfu.ca/itservices
“A successful person is one who can lay a solid foundation from the bricks others have thrown at them.” -David Brinkley via Luke Shaw _______________________________________________ CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos
How about installing dell omsa and using check_openmanage with nagios?
2013/5/24 Jerry Geis geisj@pagestation.com
Is there a way on Dell R320 (two power supplies and hardware RAID-1) on centos to get a message that a power supply is failing or that one of the hardware RAID disks is failing? Sure there is the front panel - but no-one is there ...
Is that reported to linux/centos some way? This is my first R320.
Thanks,
Jerry _______________________________________________ CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos
On 05/24/2013 12:18 AM, Eero Volotinen wrote:
How about installing dell omsa and using check_openmanage with nagios?
2013/5/24 Jerry Geis geisj@pagestation.com
Is there a way on Dell R320 (two power supplies and hardware RAID-1) on centos to get a message that a power supply is failing or that one of the hardware RAID disks is failing? Sure there is the front panel - but no-one is there ...
Is that reported to linux/centos some way? This is my first R320.
Thanks,
Jerry _______________________________________________
Please don't top post.
You can find the OMSA here http://downloads.dell.com/Pages/Drivers/poweredge-r320.html at Systems Management - Application.
On Fri, 24 May 2013, Bazy wrote:
On 05/24/2013 12:18 AM, Eero Volotinen wrote:
How about installing dell omsa and using check_openmanage with nagios?
2013/5/24 Jerry Geis geisj@pagestation.com
Is there a way on Dell R320 (two power supplies and hardware RAID-1) on centos to get a message that a power supply is failing or that one of the hardware RAID disks is failing? Sure there is the front panel - but no-one is there ...
Is that reported to linux/centos some way? This is my first R320.
Thanks,
Jerry _______________________________________________
Please don't top post.
You can find the OMSA here http://downloads.dell.com/Pages/Drivers/poweredge-r320.html at Systems Management - Application.
Or better yet
http://linux.dell.com/wiki/index.php/Repository/OMSA
Regards,
On 05/23/13 17:04, Jerry Geis wrote:
Is there a way on Dell R320 (two power supplies and hardware RAID-1) on centos to get a message that a power supply is failing or that one of the hardware RAID disks is failing? Sure there is the front panel - but no-one is there ...
Is that reported to linux/centos some way? This is my first R320.
First, the tiny LCD screen will most certainly tell you if a PSU is failing - it will go from the normal blue background to orange, and tell you which PSU. It will also say if the RAID controller is having problems.
The disks themselves are another issue. Find and d/l MegaRaid, which will let you manage the RAID (I don't know which PERC this has - we have 4xx, 6xx, and 8xx servers) if it's a PERC 6 or 7, while the system is running, without having to reboot and use the firmware software.
mark
You can use IPMI if the machine has it. That way you get a consistent method to get many errors from the machine.
James - I did the following...
yum install ipmitool modprobe ipmi_devintf ipmitool chassis status
and I got this below.
System Power : on Power Overload : false Power Interlock : inactive Main Power Fault : false Power Control Fault : false Power Restore Policy : previous Last Power Event : Chassis Intrusion : inactive Front-Panel Lockout : inactive Drive Fault : false Cooling/Fan Fault : false Sleep Button Disable : not allowed Diag Button Disable : allowed Reset Button Disable : not allowed Power Button Disable : allowed Sleep Button Disabled: false Diag Button Disabled : true Reset Button Disabled: false Power Button Disabled: false
Then I unplugged the power to 1 of the two supplies and waited for the "orange" display so the machine noticed the power fault. Then ran the command "ipmitool chassis status" again and got the same output. It was not reporting the power supply issue.
Is there something I am missing?
Thanks,
jerry
On 05/24/2013 08:44 AM, Jerry Geis wrote:
You can use IPMI if the machine has it. That way you get a consistent method to get many errors from the machine.
James - I did the following...
yum install ipmitool modprobe ipmi_devintf ipmitool chassis status
and I got this below.
System Power : on Power Overload : false Power Interlock : inactive Main Power Fault : false Power Control Fault : false Power Restore Policy : previous Last Power Event : Chassis Intrusion : inactive Front-Panel Lockout : inactive Drive Fault : false Cooling/Fan Fault : false Sleep Button Disable : not allowed Diag Button Disable : allowed Reset Button Disable : not allowed Power Button Disable : allowed Sleep Button Disabled: false Diag Button Disabled : true Reset Button Disabled: false Power Button Disabled: false
Then I unplugged the power to 1 of the two supplies and waited for the "orange" display so the machine noticed the power fault. Then ran the command "ipmitool chassis status" again and got the same output. It was not reporting the power supply issue.
Is there something I am missing?
Thanks,
jerry
I finally found what I needed...
ipmitool sdr type "Power Supply"
Gives me correct status for the dual power supply.
jerry
----- Original Message ----- | > | > | > You can use IPMI if the machine has it. That way you get a | > consistent method to get many errors from the machine. | > | > - | James - I did the following... | | yum install ipmitool | modprobe ipmi_devintf | ipmitool chassis status | | and I got this below. | | System Power : on | Power Overload : false | Power Interlock : inactive | Main Power Fault : false | Power Control Fault : false | Power Restore Policy : previous | Last Power Event : | Chassis Intrusion : inactive | Front-Panel Lockout : inactive | Drive Fault : false | Cooling/Fan Fault : false | Sleep Button Disable : not allowed | Diag Button Disable : allowed | Reset Button Disable : not allowed | Power Button Disable : allowed | Sleep Button Disabled: false | Diag Button Disabled : true | Reset Button Disabled: false | Power Button Disabled: false | | | Then I unplugged the power to 1 of the two supplies and waited for | the "orange" display so | the machine noticed the power fault. Then ran the command "ipmitool | chassis status" again | and got the same output. It was not reporting the power supply issue. | | Is there something I am missing? | | Thanks, | | jerry | | _______________________________________________ | CentOS mailing list | CentOS@centos.org | http://lists.centos.org/mailman/listinfo/centos |
Have a look at ipmitool sel list as well. SEL=System Event Log.