Hi there,
I keep getting this notification in my Nextcloud instance, over and over again:
NextCloudPi HDD health CurrentPendingSector
Device: /dev/sda [SAT], 1 Currently unreadable (pending) sectors
NextCloudPi HDD health OfflineUncorrectableSector
Device: /dev/sda [SAT], 1 Offline uncorrectable sectors
I know this means my SSD is in bad condition, but I still have a few questions.
- How bad is it really? The count of "1" has not risen since the notifications started a couple of weeks ago.
I formatted the SSD in the hope that it would correct itself, but after a few days the notification appeared again.
I also performed a smartctl long test, with this result:
=== START OF INFORMATION SECTION ===
Device Model: Netac SSD 512GB
Serial Number: **removed**
Firmware Version: R0831B0
User Capacity: 512,110,190,592 bytes [512 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: Solid State Device
Form Factor: 2.5 inches
TRIM Command: Available, deterministic, zeroed
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: ACS-2 T13/2015-D revision 3
SATA Version is: SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Fri Apr 8 15:27:40 2022 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x02) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 120) seconds.
Offline data collection
capabilities: (0x11) SMART execute Offline immediate.
No Auto Offline data collection support.
Suspend Offline collection upon new
command.
No Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
No Selective Self-test supported.
SMART capabilities: (0x0002) Does not save SMART data before
entering power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 10) minutes.
SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x0032 100 100 050 Old_age Always - 0
5 Reallocated_Sector_Ct 0x0032 100 100 050 Old_age Always - 1
9 Power_On_Hours 0x0032 100 100 050 Old_age Always - 11274
12 Power_Cycle_Count 0x0032 100 100 050 Old_age Always - 445
160 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 1
161 Unknown_Attribute 0x0033 100 100 050 Pre-fail Always - 98
163 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 12
164 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 16136
165 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 58
166 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 9
167 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 30
168 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 7000
169 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 100
175 Program_Fail_Count_Chip 0x0032 100 100 050 Old_age Always - 0
176 Erase_Fail_Count_Chip 0x0032 100 100 050 Old_age Always - 0
177 Wear_Leveling_Count 0x0032 100 100 050 Old_age Always - 0
178 Used_Rsvd_Blk_Cnt_Chip 0x0032 100 100 050 Old_age Always - 1
181 Program_Fail_Cnt_Total 0x0032 100 100 050 Old_age Always - 0
182 Erase_Fail_Count_Total 0x0032 100 100 050 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age Always - 445
194 Temperature_Celsius 0x0022 100 100 050 Old_age Always - 40
195 Hardware_ECC_Recovered 0x0032 100 100 050 Old_age Always - 15272
196 Reallocated_Event_Count 0x0032 100 100 050 Old_age Always - 1
197 Current_Pending_Sector 0x0032 100 100 050 Old_age Always - 1
198 Offline_Uncorrectable 0x0032 100 100 050 Old_age Always - 1
232 Available_Reservd_Space 0x0032 100 100 050 Old_age Always - 98
241 Total_LBAs_Written 0x0030 100 100 050 Old_age Offline - 132440
242 Total_LBAs_Read 0x0030 100 100 050 Old_age Offline - 406115
245 Unknown_Attribute 0x0032 100 100 050 Old_age Always - 202236
SMART Error Log Version: 1
Warning: ATA error count 0 inconsistent with error log pointer 4
ATA Error Count: 0
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error -1 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
00 00 00 00 00 00 00
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d0 01 00 4f c2 00 00 00:00:00.000 SMART READ DATA
b0 d1 01 01 4f c2 00 00 00:00:00.000 SMART READ ATTRIBUTE THRESHOLDS [OBS-4]
b0 d5 01 00 4f c2 00 00 00:00:00.000 SMART READ LOG
b0 d5 01 06 4f c2 00 00 00:00:00.000 SMART READ LOG
b0 d5 01 01 4f c2 00 00 00:00:00.000 SMART READ LOG
Error -2 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
00 00 00 00 00 00 00
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d0 01 00 4f c2 00 00 00:00:00.000 SMART READ DATA
b0 d1 01 01 4f c2 00 00 00:00:00.000 SMART READ ATTRIBUTE THRESHOLDS [OBS-4]
b0 d5 01 00 4f c2 00 00 00:00:00.000 SMART READ LOG
b0 d5 01 06 4f c2 00 00 00:00:00.000 SMART READ LOG
b0 d5 01 01 4f c2 00 00 00:00:00.000 SMART READ LOG
Error -3 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
00 00 00 00 00 00 00
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d0 01 00 4f c2 00 00 00:00:00.000 SMART READ DATA
b0 d1 01 01 4f c2 00 00 00:00:00.000 SMART READ ATTRIBUTE THRESHOLDS [OBS-4]
b0 d5 01 00 4f c2 00 00 00:00:00.000 SMART READ LOG
b0 d5 01 06 4f c2 00 00 00:00:00.000 SMART READ LOG
b0 d5 01 01 4f c2 00 00 00:00:00.000 SMART READ LOG
Error -4 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
00 00 00 00 00 00 00
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
b0 d0 01 00 4f c2 00 00 00:00:00.000 SMART READ DATA
b0 d1 01 01 4f c2 00 00 00:00:00.000 SMART READ ATTRIBUTE THRESHOLDS [OBS-4]
b0 da 00 00 4f c2 00 00 00:00:00.000 SMART RETURN STATUS
b0 d5 01 00 4f c2 00 00 00:00:00.000 SMART READ LOG
b0 d5 01 01 4f c2 00 00 00:00:00.000 SMART READ LOG
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 11274 -
# 2 Extended offline Aborted by host 50% 11274 -
# 3 Short offline Aborted by host 60% 11274 -
# 4 Short offline Aborted by host 60% 11274 -
# 5 Short offline Aborted by host 00% 11274 -
# 6 Short offline Completed without error 00% 11260 -
# 7 Short offline Completed without error 00% 11236 -
# 8 Short offline Completed without error 00% 11213 -
# 9 Short offline Completed without error 00% 11189 -
#10 Short offline Completed without error 00% 11166 -
#11 Extended offline Completed without error 00% 11143 -
#12 Short offline Completed without error 00% 11142 -
#13 Short offline Completed without error 00% 11119 -
#14 Short offline Completed without error 00% 11095 -
#15 Short offline Completed without error 00% 11072 -
#16 Short offline Completed without error 00% 11048 -
#17 Short offline Completed without error 00% 11024 -
#18 Extended offline Completed without error 00% 11023 -
#19 Short offline Completed without error 00% 11001 -
#20 Short offline Completed without error 00% 10978 -
#21 Short offline Completed without error 00% 10954 -
Selective Self-tests/Logging not supported
- Can I get rid of the frequent notification? I really do want to know if my drive's condition is bad or getting worse, but if the count stays at "1", getting the notification twice a day is very annoying. Is there a way to make it notify only when the count changes?
- Any general advice? Would you change the drive immediately?
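To illustrate the behaviour asked for in the notification question, here is a minimal Python sketch that reads the attribute table from `smartctl -A` style text and reports attributes 197 and 198 only when their raw value has risen above a stored baseline. The function names and the baseline dictionary are hypothetical, and this is not how NextCloudPi or smartd actually performs its checks.

```python
# Attribute IDs taken from the smartctl output in this post.
WATCHED_IDS = {197: "Current_Pending_Sector", 198: "Offline_Uncorrectable"}


def parse_raw_values(smartctl_text):
    """Return {attribute_id: raw_value} for the watched attributes."""
    values = {}
    for line in smartctl_text.splitlines():
        fields = line.split()
        # Attribute rows have 10 columns: ID, name, flag, value, worst,
        # thresh, type, updated, when_failed, raw value.
        if len(fields) >= 10 and fields[0].isdigit():
            attr_id = int(fields[0])
            if attr_id in WATCHED_IDS and fields[9].isdigit():
                values[attr_id] = int(fields[9])
    return values


def increased_attributes(baseline, current):
    """Return {id: (old, new)} for attributes whose raw value rose.

    An attribute missing from the baseline is treated as 0 (assumption).
    """
    return {
        attr_id: (baseline.get(attr_id, 0), new)
        for attr_id, new in current.items()
        if new > baseline.get(attr_id, 0)
    }
```

With a baseline of `{197: 1, 198: 1}`, the output posted above would produce no report, while a later reading of 2 for either attribute would.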
Thank you very much (: