Answer the question
In order to leave comments, you need to log in
Is there a sensible replacement for monit?
The company uses the monitoring system monit.
On his own, he's the worst shit. But he has an agent that allows you to reboot a service that has hung.
There is a question to pass to other system of monitoring. But I would like to leave the ability to automatically reboot the service.
Is there such a thing on the market?
I haven't found anything like this yet.
Answer the question
In order to leave comments, you need to log in
there was a similar task
, it was necessary to do certain actions on hosting and on hosts
, since all monitoring was built around Prometheus + Grafana + Alertmanager + a bunch of exporters, there was a desire to screw everything into this scheme, a
solution that covered all tasks
https://github.com/adnanh /webhook/
in short, there is an alert rule with a certain label, when the rule is triggered, the alert manager sends a message (POST) via routes to the receiver - webhook endpoint, which launches the execute-command that is configured for this webhook endpoint, and then as a fantasy and opportunities allow, I had work on API with hosting, running jobs through the API on Ansible Tower, just running commands via SSH
For monitoring - prometheus. He does not have the ability to boot the service. But it can be screwed with the help of any crutches, like this one: https://github.com/imgix/prometheus-am-executor
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question