Answer the question
In order to leave comments, you need to log in
What is the best way to implement monitoring of the availability of several servers for each other?
given: there are 10 servers in different countries and 1 server with a monitoring system, no matter what, nagios, cacti, etc.
task: to organize monitoring of the availability of all servers within this network. so that server "1" polls all 10 servers from its host and says which ones are not available for it, and so on all servers.
I imagine the implementation of this task as follows:
commands are sent from the monitoring server to the servers on some open port to some daemon, the command parameters indicate which hosts need to be polled (query-1-2-3-4...), server "1" queries all 10 servers that were listed in the parameters (query-1-2-3-4...) for availability and returns the result to the monitoring server in the form of the response time of each server (respon-1.32-1.35-1.11-0.245.. .)
the monitoring server receives results from 10 servers and processes
the polling interval = every minute
protocol = TCP
polled port = HTTP which is open on all servers
, so the question is how best to implement this task, is there ready-made software.
Answer the question
In order to leave comments, you need to log in
Perhaps the answer is in the question:
...no matter what, nagios...
Simple enough. Install collectd on all servers. It has a plugin https://collectd.org/wiki/index.php/Plugin:Ping
In the plugin configuration, set pings to all your servers.
You give the result to nagios via, for example , https://collectd.org/documentation/manpages/collec... (I haven't tried it!!!)
You can also pour the result into mongo, redis, and rabbitmq. Well, you can get nagios plugins from everywhere, since they are written in 5 minutes.
Or more radically, put graphite (we had it) or unfluxdb + grafana (now) and pour data from collecd servers there.
PS. As a bonus of installing collectd, you will get a fast and resource-free collection of almost any statistics.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question