Z
Z
zehil2021-02-11 22:35:00
linux
zehil, 2021-02-11 22:35:00

Why does HDD randomly stop in Linux?

I noticed one very annoying moment. During the installation of the system itself and sometimes during the installation of many packages (namely, many, if there are up to 100 of them, then this effect was not observed), the hard drive spontaneously stops and starts almost immediately. This is repeated more than 70 times, after which the hard drive does not start until a hard reset from the button. Counting is easy, for each packet - 1 restart.
Installed system - openSUSE Leap 15.2 It was installed
on this hardware for the first time, on another distribution kit (ubuntu 16.04) this was not the case.
The computer is stationary. There is no TLP in the system, Windows does not suffer from this (Windows was installed on the computer most of the time)
I don’t want to try other distributions, I’m used to this, since it is installed on my laptop and has shown itself to be very reliable for quite a long time.

HDD - W&D 500Gb. I'll fix the info below in the spoiler.
The problem, as I understand it, is purely software? Or should you start worrying about your data?

spoiler
=== START OF INFORMATION SECTION ===
Model Family: Western Digital Blue
Device Model: WDC WD5000AAKX-001CA0
Serial Number: WD-WCAYUH771921
LU WWN Device Id: 5 0014ee 103c6d686
Firmware Version: 15.01H15
User Capacity: 500 107 862 016 bytes [500 GB]
Sector Size: 512 bytes logical/physical
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is: Thu Feb 11 21:31:09 2021 EET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

spoiler
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 683
3 Spin_Up_Time 0x0027 190 139 021 Pre-fail Always - 1458
4 Start_Stop_Count 0x0032 093 093 000 Old_age Always - 7362
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 062 062 000 Old_age Always - 28123
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 093 093 000 Old_age Always - 7016
192 Power-Off_Retract_Count 0x0032 199 199 000 Old_age Always - 1442
193 Load_Cycle_Count 0x0032 199 199 000 Old_age Always - 5919
194 Temperature_Celsius 0x0022 111 090 000 Old_age Always - 32
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 8
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 360
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 7

SMART Error Log Version: 1
No Errors Logged


6025877d058ab313984224.png

Answer the question

In order to leave comments, you need to log in

1 answer(s)
J
jcmvbkbc, 2021-02-11
@zehil

One can start by viewing through hdparm the Advanced Power Management (hdparm -B) and Automatic Acoustic Management (hdparm -M) options and setting them to the maximum (254 or 255, depending on the disk) if they are not already there.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question