
On Fri, Sep 23, 2016 at 04:06:42PM +1000, russell@coker.com.au wrote:
The Nagios model is to have a single very complex monitoring system while the mon model tends towards multiple simple installations. Nagios has a nrpe daemon on each monitored server while with Mon you have Mon on each server and a master Mon monitoring them all.
and for logging and graphing all sorts of info about systems (disk space, memory utilisation, cpu load, network traffic etc) and the services they're running (e.g. postgres/mysql query load, VMs/containers running), munin isn't bad. some prefer cricket or cacti or still use the ancient mrtg, but I find munin's easier to set up and write plugins for (e.g. a simple plugin I wrote was a small sh + awk script to query slurm to graph the list of running, cancelled, failed, queued, etc jobs for a HPC cluster) craig -- craig sanders <cas@taz.net.au>