Can anyone recommend an enterprise-class monitoring system for both Linux and Windows servers? Here are my requirements:
- SNMP trap collection, ability to import custom MIBs
- isup/isdown monitoring of ports and daemons
- Server health monitors (CPU, Disk, Memory, etc.)
- SLA reporting with nice graphs
- Pager/Email/SMS alerts with groups, filters and escalations
- Built-in MTBF and MTTR reporting
- Robust parent-child relationships between monitors or probes. For example, the system must be smart enough to know that if 25 URLs have gone down all at once, they belong to an apache process that has died. I don't want 25 alerts, I want *one* alert telling me that the parent apache daemon is down.
- Ability to easily create dashboards from various monitors. We want this so we can see all components of a website in one place, e.g., apache URL, database server, disk storage, etc.
- Attractive, easy to use GUI. We don't want a homebrew project with ugly graphs and a web 1.0 GUI.
So far the products I have looked at are:
- NimBUS
- SolarWinds IP Monitor
- WhatsUPGold
- GroundWork Open Source
- Nagios
And none of them have met all my requirements. Any suggestions?
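The MTBF and MTTR reporting requirement above comes down to simple arithmetic over an outage log. The following is a minimal sketch, not the method of any product named in this thread. It assumes MTBF is total uptime divided by the number of failures and MTTR is total downtime divided by the number of failures; the function name `mtbf_mttr` and the sample outage data are hypothetical.

```python
from datetime import datetime, timedelta


def mtbf_mttr(outages, period_start, period_end):
    """Return (MTBF, MTTR) as timedeltas for one monitored service.

    outages is a list of (down_at, up_at) datetime pairs that fall
    inside the reporting period. Assumed definitions:
      MTBF = total uptime / number of failures
      MTTR = total downtime / number of failures
    """
    if not outages:
        return None, None  # no failures, so neither figure is defined
    downtime = sum((up - down for down, up in outages), timedelta())
    uptime = (period_end - period_start) - downtime
    failures = len(outages)
    return uptime / failures, downtime / failures


# Hypothetical sample data: a 30-day period with two outages (1 h and 3 h).
period_start = datetime(2008, 1, 1)
period_end = datetime(2008, 1, 31)
outages = [
    (datetime(2008, 1, 5, 10, 0), datetime(2008, 1, 5, 11, 0)),
    (datetime(2008, 1, 20, 2, 0), datetime(2008, 1, 20, 5, 0)),
]

mtbf, mttr = mtbf_mttr(outages, period_start, period_end)
print("MTBF:", mtbf)
print("MTTR:", mttr)
```

A product with built-in reporting would presumably derive the outage pairs from its own up/down event history rather than from a hand-written list.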
On Tue, Feb 05, 2008 at 11:49:54AM -0600, Sean Carolan wrote:
> Can anyone recommend an enterprise-class monitoring system for both Linux and Windows servers?
> [...]
You might take a look at OpenNMS and ZenOSS. I'm not sure if either could do everything you're asking for out of the box, however.
Ray
Sean Carolan wrote:
> > You might take a look at OpenNMS and ZenOSS. I'm not sure if either could do everything you're asking for out of the box, however.
> Thanks, ZenOSS just might fit the bill.
I tried to use Zenoss (a version from September 2007) to monitor a small network (about 5 subnets), and I had a really hard time with relationships.
Zenoss seems to work out its relationships by itself when it scans the network and components. I was unable to edit or set relationships manually, so I was flooded by emails!
I finally gave up and went with Nagios, in which I have full control.
I heard that the Zenoss team was supposed to work on that eventually.
Regards,
Guy Boisvert IngTegration inc.
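The parent-child behaviour discussed in this thread (one alert for the dead apache daemon instead of 25 for its URLs, with relationships assigned by hand) can be sketched as follows. This is only an illustration of the idea, not how Zenoss, Nagios, or any other product mentioned here implements it; the `Monitor` class, the `root_causes` function, and the monitor names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Monitor:
    name: str
    up: bool = True
    parent: "Monitor | None" = None  # manually assigned dependency


def root_causes(monitors):
    """Return the failed monitors that have no failed ancestor.

    A failed monitor is suppressed when any monitor further up its
    parent chain is also down, so only the topmost failure alerts.
    """
    alerts = []
    for monitor in monitors:
        if monitor.up:
            continue
        ancestor = monitor.parent
        suppressed = False
        while ancestor is not None:
            if not ancestor.up:
                suppressed = True
                break
            ancestor = ancestor.parent
        if not suppressed:
            alerts.append(monitor)
    return alerts


# Hypothetical scenario: apache dies and takes 25 URL checks down with it.
apache = Monitor("apache daemon", up=False)
urls = [Monitor(f"url-{i}", up=False, parent=apache) for i in range(25)]
database = Monitor("database server")

alerts = root_causes([apache, database, *urls])
print([m.name for m in alerts])  # prints ['apache daemon']
```

The sketch assumes the parent links are set explicitly by the administrator, which is the kind of manual control described above; a tool that discovers relationships automatically would have to populate `parent` itself.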
Sean Carolan wrote:
> > I tried to use Zenoss (a version from September 2007) to monitor a small network (about 5 subnets), and I had a really hard time with relationships.
> Did you use the 'enterprise' or the OS version?
Open source. Sorry, I forgot to mention it.
Is the OS version crippled to prevent relationships?! IMHO, relationships are simply mandatory! But they can do whatever they want, and we can simply do the same!
Guy Boisvert IngTegration inc.
Quoting Sean Carolan scarolan@gmail.com:
> Can anyone recommend an enterprise-class monitoring system for both Linux and Windows servers?
> [...]
Zabbix http://www.zabbix.com
Sean Carolan wrote:
> Can anyone recommend an enterprise-class monitoring system for both Linux and Windows servers?
> [...]
What was wrong with IP Monitor?
It seems to have everything you mentioned, plus a SOAP interface for designing external dashboards.
We use it here and it works well.
-Ross
Sean Carolan wrote:
> Can anyone recommend an enterprise-class monitoring system for both Linux and Windows servers?
> [...]
http://www.opennms.org - and they have a yum repo for easy installation on CentOS. Expect to do some work setting up relationships, but the framework is all there.