T
T
tvoyadres2020-10-13 07:35:40
Solid State Drives
tvoyadres, 2020-10-13 07:35:40

Do I need to change Samsung SM951 NVME SSD?

In the messages logs, messages appeared on the Mysql disk, worked for about 5 years.

smartd[1206]: Device: /dev/nvme0, Critical Warning (0x04): Reliability

command
smartctl -a /dev/nvme0

Issued

smartctl 7.0 2018-12-30 r4883 [x86_64-linux-5.6.14-1.el7.elrepo .x86_64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number: SAMSUNG MZVPV256HDGL-00000
Serial Number: S1XWNY0HA01300
Firmware Version: BXW7300Q
PCI Vendor/Subsystem ID: 0x144d
IEEE OUI Identifier: 0x002538
Controller ID: 1
Number of Namespaces: 1
Namespace 1 Size/Capacity: 256 060 514 304 [256 GB]
Namespace 1 Utilization: 45 267 107 840 [45.2 GB]
Namespace 1 Formatted LBA Size: 512
Local Time is: Tue Oct 13 07:14:21 2020 MSK
Firmware Updates (0x06): 3 Slots
Optional Admin Commands (0x0007): Security Format Frmw_DL
Optional NVM Commands (0x001f): Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat
Maximum Data Transfer Size: 32 Pages

Supported Power States
St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat
0 + 9.00W - - 0 0 0 0 5 5
1 + 4.60W - - 1 1 1 1 30 30
2 + 3.80W - - 2 2 2 2 100 100
3 - 0.0700W - - 3 3 3 3 500 5000
4 - 0.0050W - - 4 4 4 4 2000 22000

Supported LBA Sizes (NSID 0x1)
Id Fmt Data Metadt Rel_Perf
0 + 512 0 0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
- NVM subsystem reliability has been degraded

SMART/Health Information (NVMe Log 0x02)
Critical Warning: 0x04
Temperature: 44 Celsius
Available Spare: 96%
Available Spare Threshold: 10%
Percentage Used: 123%
Data Units Read: 7,344,994 [3.76 TB]
Data Units Written: 308,698,968 [158 TB]
Host Read Commands: 231,630,418
Host Write Commands: 8,135,588,810
Controller Busy Time: 106,978
Power Cycles: 145
Power On Hours:
33,898 Unsafe Shutdowns: 79
Media and Data Integrity Errors: 0
Error Information Log Entries: 20

Error Information (NVMe Log 0x01, max 64 entries)
Num ErrCount SQId CmdId Status PELoc LBA NSID VS
0 20 0x000a 0x4016 0x000 0 1 - 1 19 0x000a
0x4016 0x000 0 0x000 0 1 - 6 14 0 0x001c 0x4004 0x000 0 0 - 7 13 0 0x001b 0x4004 0x000 0 0 - 8 12 0 0x001a 0x4004 0x000 0 0 - 9 11 0 0x001c 0 - 0 0x40
10 10 0x001b 0x4004 0x000 0 -
11 9 0x000a 0x4016 0x000 0 1 -
12 8 0x000a 0x4016 0x000 0x000 0x000 0x00 1 -
13 7 0x000a 0x4016 0x000
0x000a 0x4016 0x000 0 1 -
15 5 0x000a 0x4016 0x000 0 1 -
... (4 entries not shown)

Answer the question

In order to leave comments, you need to log in

3 answer(s)
R
Ronald McDonald, 2020-10-13
@tvoyadres

So far, I see no reason to change it.
Follow these options:

Available Spare: 96%
Available Spare Threshold: 10%
Percentage Used: 123%

Especially for the first one. When there remains 30% - look for a replacement, when 10% - change.

T
tvoyadres, 2020-10-13
@tvoyadres

Now another one today bought a Samsung 970Pro on 512GB formatted in EXT4
When you run iostat -xm -t 5 %util goes to 100% is it normal to highlight the line in bold?
I remind you mysql is spinning there with a lot of online
10/3/2020 22:59:40
avg-cpu: % user % nice % system % iowait % steal % idle
3.48 0.00 2.73 1.38 0.00 92.42
Device : rrqm/s wrqm/sr/sw/s rMB/s wMB/s avgrq-sz avgqu-sz await r_await w_await svctm %util
nvme1n1 29.00 0.00 8.00 1.00 0.16 0.00 37, 33 0.00 0.11 0.12 0.00 0.67 0.60
nvme0n1 0.00 72.00 2825.00 218.00 44.14 2.50 31.39 0.53 0.15 0.09 0.94 0.32 98.40
sdb 127.00 0.00 21.00 0.00 0.68 0.00 66.29 0.00 0.24 0.24 0.00 0.67 1.40
sda 0.00 0.00 1.00 0.00 0.00 0.00 0.00 0.00 1.00 1.00 0.00 2.00 0.20
sdc 0.00 0.00 1.00 0.00 0.00 0.00 0 .00 0.00 1.00 1.00 0.00 1.00 0.10

E
Evgeny Vorobyov, 2020-12-22
@astrave

If I'm not mistaken, this is an analogue of the 950 Pro for OEM. They have a real resource of 3-4 or more PBs. 150TB is like seeds to them.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question