System administration
yiicoder, 2015-10-01 21:24:14

What server monitoring system do you use?

Today a server went down: its SSD died. Tech support showed the SSD diagnostics and a counter
The question arose: how could this have been prevented? At the very least, monitoring would have helped.
The SSD failed on the Wear Leveling Count attribute, which is easy to track from the system, but Zabbix does not track it out of the box.
Of course, you can write all the user parameters, discovery rules, and templates yourself. But it seems strange to me that monitoring systems do not track disk health by default.
Even now, it is not so easy to google something ready-made for smartctl in Zabbix.
Are there more advanced monitoring systems that do more of this on their own?


4 answer(s)
Oioraen, 2015-10-02
@Oioraen

> Even now, it is not so easy to google something ready-made for smartctl in Zabbix
Congratulations, citizen, you are lying!
habrahabr.ru/post/162627
habrahabr.ru/company/zabbix/blog/196218
These are the second and third Google results for "zabbix smart".
After them come dozens more links, all with ready-made solutions. I won't post my own example on principle; you should at least learn how to use Google.

Azazel PW, 2015-10-02
@azazelpw

I use Zabbix.
It does not do this out of the box.
You have to set it up by hand. I did mine long ago and rather crudely, listing the disks manually, and I'm too lazy to rewrite it. :)
Add this to the agent config:
# Each key returns the RAW_VALUE column (field 10) of one SMART attribute for /dev/sda.
# Note: smartctl normally needs root, so the agent user may need a sudo rule (assumption about your setup).
UserParameter=custom.rse.sda,smartctl --all /dev/sda | grep Reallocated_Sector | awk '{print $10}'
UserParameter=custom.ree.sda,smartctl --all /dev/sda | grep Reallocated_Eve | awk '{print $10}'
UserParameter=custom.cpe.sda,smartctl --all /dev/sda | grep Current_Pen | awk '{print $10}'
UserParameter=custom.ue.sda,smartctl --all /dev/sda | grep Uncorrect | awk '{print $10}'
UserParameter=custom.udmae.sda,smartctl --all /dev/sda | grep UDMA | awk '{print $10}'
UserParameter=custom.temphdd.sda,smartctl --all /dev/sda | grep Temp | awk '{print $10}'
On the server, create items with the keys
custom.rse.sda and so on.
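
The same grep/awk idea can be wrapped in one small helper so that the attribute from the question, Wear Leveling Count, is covered too. This is only a sketch: the function name smart_attr is made up for illustration, and it assumes the usual ten-column attribute table printed by smartctl -A, where field 2 is the attribute name, field 4 is the normalized VALUE, and field 10 is RAW_VALUE. Matching the name exactly, instead of grepping a substring, avoids picking up two lines when several attribute names contain the same fragment.

```shell
# smart_attr NAME [COLUMN]
# Reads `smartctl -A` output on stdin and prints one field of the row whose
# attribute name (field 2) equals NAME exactly.
# COLUMN defaults to 10 (RAW_VALUE); 4 is the normalized VALUE column.
# Hypothetical helper name; column layout is an assumption about smartctl's table.
smart_attr() {
    awk -v name="$1" -v col="${2:-10}" '
        $2 == name { print $(col + 0); exit }
    '
}

# Example calls (need smartctl and usually root, so they are left commented out):
#   smartctl -A /dev/sda | smart_attr Reallocated_Sector_Ct
#   smartctl -A /dev/sda | smart_attr Wear_Leveling_Count 4
```

Which column is meaningful for Wear_Leveling_Count depends on the drive vendor, so it is worth checking both the normalized and the raw value against the drive's documentation before setting a trigger on either.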

Annette, 2015-10-01
@laduwka

There is no perfect monitoring system.
I used to ask the same questions; now I just adapt the tools I have to my needs.
Write a plugin for Zabbix.

Denis Khabarov, 2015-10-01
@Saymon21

mdadm, smartmontools
