On 21/10/2018 23.05, Bruce Ferrell wrote:
On 10/21/18 1:44 PM, Carlos E. R. wrote:
smartmon is "sensitive" to some disks.... Not so much others.
something is running it periodically in default mode, without the parameters you used to get it to read the disk properly.
I'd say cron, but with systemd in the mix who knows.
It is smartd daemon, and it sends an email to me on another machine. This is configured in smartd.conf: /dev/disk/by-id/usb-WD_My_Book_25EE_32544B5354325344-0:0 -d sat,16 -a -o on -S on -s (S/../.././02|L/../../6/03) -m root@telcontar.valinor No cron job or systemd job can do it, because they don't know the mail address of the second machine. The sequence is this: <3.6> 2018-10-21T21:15:24.879374+02:00 Isengard smartd 11255 - - Device: /dev/disk/by-id/wwn-0x5000c5009399305f [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 74 to 75 One disk working. <3.6> 2018-10-21T21:15:31.890721+02:00 Isengard smartd 11255 - - Device: /dev/disk/by-id/usb-WD_My_Book_25EE_32544B5354325344-0:0 [SAT], not capable of SMART self-check antoher disk, tested by smartd, says no smart self-check possible. <3.2> 2018-10-21T21:15:33.840630+02:00 Isengard smartd 11255 - - Device: /dev/disk/by-id/usb-WD_My_Book_25EE_32544B5354325344-0:0 [SAT], failed to read SMART Attribute Data <3.6> 2018-10-21T21:15:33.841283+02:00 Isengard smartd 11255 - - Sending warning via <mail> to root@telcontar.valinor ... smartd (on server) sends email to the desktop machine. <3.6> 2018-10-21T21:15:33.903519+02:00 Isengard smartd 11255 - - Warning via <mail> to root@telcontar.valinor: successful Half an hour later smartd routinely logs temperature of problematic disk - thus it has no problem reading it: <3.6> 2018-10-21T21:45:24.837650+02:00 Isengard smartd 11255 - - Device: /dev/disk/by-id/usb-WD_My_Book_25EE_32544B5354325344-0:0 [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 144 to 147 repeats <3.6> 2018-10-21T22:45:25.310794+02:00 Isengard smartd 11255 - - Device: /dev/disk/by-id/usb-WD_My_Book_25EE_32544B5354325344-0:0 [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 147 to 144 Another sequence, earlier - smartd sends email: <3.2> 2018-10-21T20:15:33.916416+02:00 Isengard smartd 11255 - - Device: /dev/disk/by-id/usb-WD_My_Book_25EE_32544B5354325344-0:0 [SAT], failed to read SMART Attribute Data <3.6> 2018-10-21T20:15:33.917008+02:00 Isengard smartd 11255 - - Sending warning via <mail> to root@telcontar.valinor ... <3.6> 2018-10-21T20:15:33.979416+02:00 Isengard smartd 11255 - - Warning via <mail> to root@telcontar.valinor: successful and the next thing is half an hour later: <3.6> 2018-10-21T20:45:24.602691+02:00 Isengard smartd 11255 - - Device: /dev/disk/by-id/usb-WD_My_Book_25EE_32544B5354325344-0:0 [SAT], read SMART Attribute Data worked again, warning condition reset after 1 email It is always smartd who talks. I see no messages from systemd; and anyway, systemd doesn't know the mail address. -- Cheers / Saludos, Carlos E. R. (from 42.3 x86_64 "Malachite" at Telcontar)