Hi. My boot SSD was purchased on May, 2015. It is a Crucial 128GB one. Cockpit is showing me an error of /sys/firmware/efi/efivars 0 free
and smartctl is telling me the following: ==== beginning of SMARTCTL INFO ======= === START OF INFORMATION SECTION === Model Family: Crucial/Micron Client SSDs Device Model: Crucial_CT128MX100SSD1 Serial Number: 15030E6E5A80 LU WWN Device Id: 5 00a075 10e6e5a80 Firmware Version: MU01 User Capacity: 128,035,676,160 bytes [128 GB] Sector Sizes: 512 bytes logical, 4096 bytes physical Rotation Rate: Solid State Device Form Factor: 2.5 inches TRIM Command: Available, deterministic, zeroed Device is: In smartctl database 7.3/5528 ATA Version is: ACS-2, ATA8-ACS T13/1699-D revision 6 SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s) Local Time is: Sat Nov 4 17:38:59 2023 EST SMART support is: Available - device has SMART capability. SMART support is: Enabled
=== START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED
General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 130) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 3) minutes. Conveyance self-test routine recommended polling time: ( 3) minutes. SCT capabilities: (0x0035) SCT Status supported. SCT Feature Control supported. SCT Data Table supported.
SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 100 100 000 Pre-fail Always - 31 5 Reallocate_NAND_Blk_Cnt 0x0033 100 100 000 Pre-fail Always - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 74107 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 241 171 Program_Fail_Count 0x0032 100 100 000 Old_age Always - 0 172 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0 173 Ave_Block-Erase_Count 0x0032 097 097 000 Old_age Always - 108 174 Unexpect_Power_Loss_Ct 0x0032 100 100 000 Old_age Always - 176 180 Unused_Reserve_NAND_Blk 0x0033 000 000 000 Pre-fail Always - 1036 183 SATA_Interfac_Downshift 0x0032 100 100 000 Old_age Always - 0 184 Error_Correction_Count 0x0032 100 100 000 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 194 Temperature_Celsius 0x0022 053 049 000 Old_age Always - 47 (Min/Max 22/51) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 16 197 Current_Pending_ECC_Cnt 0x0032 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 100 100 000 Old_age Always - 0 202 Percent_Lifetime_Remain 0x0031 097 097 000 Pre-fail Offline - 3 206 Write_Error_Rate 0x000e 100 100 000 Old_age Always - 0 210 Success_RAIN_Recov_Cnt 0x0032 100 100 000 Old_age Always - 0 246 Total_LBAs_Written 0x0032 100 100 000 Old_age Always - 23588083829 247 Host_Program_Page_Count 0x0032 100 100 000 Old_age Always - 738387194 248 FTL_Program_Page_Count 0x0032 100 100 000 Old_age Always - 172332261
SMART Error Log Version: 1 No Errors Logged
SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Vendor (0xff) Completed without error 00% 8509 - # 2 Vendor (0xff) Completed without error 00% 8490 - # 3 Vendor (0xff) Completed without error 00% 8385 - # 4 Vendor (0xff) Completed without error 00% 8333 - # 5 Vendor (0xff) Completed without error 00% 8251 - # 6 Vendor (0xff) Completed without error 00% 8146 - # 7 Vendor (0xff) Completed without error 00% 8092 - # 8 Vendor (0xff) Completed without error 00% 7843 - # 9 Vendor (0xff) Completed without error 00% 7765 - #10 Vendor (0xff) Completed without error 00% 7721 - #11 Vendor (0xff) Completed without error 00% 7607 - #12 Vendor (0xff) Completed without error 00% 7571 - #13 Vendor (0xff) Completed without error 00% 7535 - #14 Vendor (0xff) Completed without error 00% 7433 - #15 Vendor (0xff) Completed without error 00% 7193 - #16 Vendor (0xff) Completed without error 00% 7156 - #17 Vendor (0xff) Completed without error 00% 7121 - #18 Vendor (0xff) Completed without error 00% 7086 - #19 Vendor (0xff) Completed without error 00% 7050 - #20 Vendor (0xff) Completed without error 00% 7011 - #21 Vendor (0xff) Completed without error 00% 6971 -
SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Completed [00% left] (56994816-57060351) 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
The above only provides legacy SMART information - try 'smartctl ======== END OF SMARTCTL ====
Is it time to say good by and replace this disk? Should I get a larger one? It has the boot info and operating system.
Thanks for your insights
On 11/04/2023 04:48 PM, Javier Perez wrote:
Hi. My boot SSD was purchased on May, 2015. It is a Crucial 128GB one. Cockpit is showing me an error of /sys/firmware/efi/efivars 0 free
This may be redundant, but before you start, make a complete backup of that disk on removable media. Running Bleachbit as root on that to get rid of any cruft before the backup might also be a Good Idea.
On 4 Nov 2023, at 22:50, Javier Perez pepebuho@gmail.com wrote:
Cockpit is showing me an error of /sys/firmware/efi/efivars 0 free
I believe that this is referring to the NVRAM on the motherboard not the boot disk.
I had this happen to one of my machines and the fix was to reset the efivar’s on the motherboard and setup efi boot again.
Barry
On 11/4/23 23:48, Javier Perez wrote:
Hi. My boot SSD was purchased on May, 2015. It is a Crucial 128GB one.
It looks like this has been working for 8 years continuously:
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 74107
$ date --date "now - 74107 hours" Sat May 23 04:18:56 PM CEST 2015
and completely rewritten about 100 times:
246 Total_LBAs_Written 0x0032 100 100 000 Old_age Always - 23588083829
$ calc C-style arbitrary precision calculator (version 2.14.0.14) Calc is open software. For license details type: help copyright [Type "exit" to exit, or "help" for help.]
; 23588083829*512/128e9 94.352335316
but it is not showing signs of failures:
171 Program_Fail_Count 0x0032 100 100 000 Old_age Always - 0 172 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 16 197 Current_Pending_ECC_Cnt 0x0032 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0
The only strange thing is this:
202 Percent_Lifetime_Remain 0x0031 097 097 000 Pre-fail Offline - 3
which SHOULD mean that it has 3% of life time left. But I would not trust this parameter too much, there have been bugs in how it is reported. In particular, the SMART threshold is 0 and the current value is 97, which, in SMART world means that 97 has to go down to 0 to indicate a problem. So maybe the disk has spent 3% of its life, not 97% (100 rewrites are nothing, SLC are typically rated at 100,000)
Given all this, I would not throw away the SSD.It has proven to be reliable for almost 10 years, it is probably a very robust SLC flash, not common nowadays. Sure, it is only 128GB, so a replacement would be very cheap, but I think it may continue in its job. Pairing it with another young SSD in RAID1 would be the best option, or at least have some backups of data (you should always have).
Regards.
On Sun, Nov 5, 2023 at 3:37 AM Roberto Ragusa mail@robertoragusa.it wrote:
On 11/4/23 23:48, Javier Perez wrote:
Hi. My boot SSD was purchased on May, 2015. It is a Crucial 128GB one.
It looks like this has been working for 8 years continuously:
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 74107
$ date --date "now - 74107 hours" Sat May 23 04:18:56 PM CEST 2015
and completely rewritten about 100 times:
246 Total_LBAs_Written 0x0032 100 100 000 Old_age Always - 23588083829
$ calc C-style arbitrary precision calculator (version 2.14.0.14) Calc is open software. For license details type: help copyright [Type "exit" to exit, or "help" for help.]
; 23588083829*512/128e9 94.352335316
but it is not showing signs of failures:
171 Program_Fail_Count 0x0032 100 100 000 Old_age Always - 0 172 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 16 197 Current_Pending_ECC_Cnt 0x0032 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0
The only strange thing is this:
202 Percent_Lifetime_Remain 0x0031 097 097 000 Pre-fail Offline - 3
which SHOULD mean that it has 3% of life time left. But I would not trust this parameter too much, there have been bugs in how it is reported. In particular, the SMART threshold is 0 and the current value is 97, which, in SMART world means that 97 has to go down to 0 to indicate a problem. So maybe the disk has spent 3% of its life, not 97% (100 rewrites are nothing, SLC are typically rated at 100,000)
Given all this, I would not throw away the SSD.It has proven to be reliable for almost 10 years, it is probably a very robust SLC flash, not common nowadays. Sure, it is only 128GB, so a replacement would be very cheap, but I think it may continue in its job. Pairing it with another young SSD in RAID1 would be the best option, or at least have some backups of data (you should always have).
Note what he said. the % lifetime is unreliable.
I have a disk (I have it mirrored with another one). That went 0 - 100 (failing now), then spend 7-8 months at 100% FAILING_NOW, then dropped down to 78% and after 7 months back to 100% and stayed at 100% for 3 months, and then down to 29% and is currently 63%. I write about 50% of the size of the SSD each night and my disk registered "FAILING NOW" 3 years ago. I do not appear to have yet got any blocks on the ssd that have failed erase/rewrite, so since I have it mirrored I am going to keep running it and see how long it really lasts.
Hi. Thanks for the analysis. Indeed yes, this pc is practically always on. I like the RAID 1 idea, if one disk goes down I won't be caught up rebuilding from scratch even if it is just the OS. The Data is in a different disk.
JP
On Sun, Nov 5, 2023 at 4:37 AM Roberto Ragusa mail@robertoragusa.it wrote:
On 11/4/23 23:48, Javier Perez wrote:
Hi. My boot SSD was purchased on May, 2015. It is a Crucial 128GB one.
It looks like this has been working for 8 years continuously:
9 Power_On_Hours 0x0032 100 100 000 Old_age
Always - 74107
$ date --date "now - 74107 hours" Sat May 23 04:18:56 PM CEST 2015
and completely rewritten about 100 times:
246 Total_LBAs_Written 0x0032 100 100 000 Old_age
Always - 23588083829
$ calc C-style arbitrary precision calculator (version 2.14.0.14) Calc is open software. For license details type: help copyright [Type "exit" to exit, or "help" for help.]
; 23588083829*512/128e9 94.352335316
but it is not showing signs of failures:
171 Program_Fail_Count 0x0032 100 100 000 Old_age
Always - 0
172 Erase_Fail_Count 0x0032 100 100 000 Old_age
Always - 0
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age
Always - 16
197 Current_Pending_ECC_Cnt 0x0032 100 100 000 Old_age
Always - 0
198 Offline_Uncorrectable 0x0030 100 100 000 Old_age
Offline - 0
The only strange thing is this:
202 Percent_Lifetime_Remain 0x0031 097 097 000 Pre-fail
Offline - 3
which SHOULD mean that it has 3% of life time left. But I would not trust this parameter too much, there have been bugs in how it is reported. In particular, the SMART threshold is 0 and the current value is 97, which, in SMART world means that 97 has to go down to 0 to indicate a problem. So maybe the disk has spent 3% of its life, not 97% (100 rewrites are nothing, SLC are typically rated at 100,000)
Given all this, I would not throw away the SSD.It has proven to be reliable for almost 10 years, it is probably a very robust SLC flash, not common nowadays. Sure, it is only 128GB, so a replacement would be very cheap, but I think it may continue in its job. Pairing it with another young SSD in RAID1 would be the best option, or at least have some backups of data (you should always have).
Regards.
-- Roberto Ragusa mail at robertoragusa.it _______________________________________________ users mailing list -- users@lists.fedoraproject.org To unsubscribe send an email to users-leave@lists.fedoraproject.org Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue
On 4 Nov 2023, at 22:50, Javier Perez pepebuho@gmail.com wrote:
246 Total_LBAs_Written 0x0032 100 100 000 Old_age Always - 23588083829
From that counter you can calculate the total number of bytes written. If you know the drive spec for endurance total bytes written before the drive fails then you can estimate the amount of drive life used. Beware of SSD models that do not provide that figure in their specs.
For example https://semiconductor.samsung.com/consumer-storage/internal-ssd/870evo/ Lists the figures in the warranty section of the specification.
Barry
On Sat, Nov 4, 2023 at 8:11 PM Barry barry@barrys-emacs.org wrote:
On 4 Nov 2023, at 22:50, Javier Perez pepebuho@gmail.com wrote:
Cockpit is showing me an error of /sys/firmware/efi/efivars 0 free
I believe that this is referring to the NVRAM on the motherboard not the boot disk.
I also believe that. NVRAM on an older machine may have limited space, so if cruft has accumulated the NVRAM could be full. See: https://ruuucker.github.io/EFI-Basics-NVRAM-Variables/
I had this happen to one of my machines and the fix was to reset the efivar’s on the motherboard and setup efi boot again.
From https://en.wikipedia.org/wiki/Nonvolatile_BIOS_memory: "by many OEMs https://en.wikipedia.org/wiki/OEM' design, the UEFI settings are still lost if the CMOS battery fails." So removing the battery may free up space.
Hi. I could not find the specs back in the crucial web page. But from a review site I could find a figure, 72TB https://www.storagereview.com/review/crucial-mx100-ssd-review Looks like I have used up about 16% of its Endurance.
On Sun, Nov 5, 2023 at 3:42 PM Barry barry@barrys-emacs.org wrote:
On 4 Nov 2023, at 22:50, Javier Perez pepebuho@gmail.com wrote:
246 Total_LBAs_Written 0x0032 100 100 000 Old_age Always - 23588083829
From that counter you can calculate the total number of bytes written. If you know the drive spec for endurance total bytes written before the drive fails then you can estimate the amount of drive life used. Beware of SSD models that do not provide that figure in their specs.
For example https://semiconductor.samsung.com/consumer-storage/internal-ssd/870evo/ Lists the figures in the warranty section of the specification.
Barry
users mailing list -- users@lists.fedoraproject.org To unsubscribe send an email to users-leave@lists.fedoraproject.org Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue
On 11/5/23 23:39, Javier Perez wrote:
Hi. I could not find the specs back in the crucial web page. But from a review site I could find a figure, 72TB https://www.storagereview.com/review/crucial-mx100-ssd-review https://www.storagereview.com/review/crucial-mx100-ssd-review Looks like I have used up about 16% of its Endurance.
But this is entirely irrelevant to the original question. It has nothing to do with your SSD. efivars isn't on a disk as was explained.
I know, I am just digressing. As far as the original question, I understand I will have to reset the nvram and recheck.
On Mon, Nov 6, 2023, 02:52 Samuel Sieb samuel@sieb.net wrote:
On 11/5/23 23:39, Javier Perez wrote:
Hi. I could not find the specs back in the crucial web page. But from a review site I could find a figure, 72TB https://www.storagereview.com/review/crucial-mx100-ssd-review https://www.storagereview.com/review/crucial-mx100-ssd-review Looks like I have used up about 16% of its Endurance.
But this is entirely irrelevant to the original question. It has nothing to do with your SSD. efivars isn't on a disk as was explained. _______________________________________________ users mailing list -- users@lists.fedoraproject.org To unsubscribe send an email to users-leave@lists.fedoraproject.org Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue
To all who helped me out, Thanks. Mission accomplished, Just deleted one of the Boot Options at the Bios level and now I got 92.2KB Free
Thanks
On Mon, Nov 6, 2023 at 3:07 AM Javier Perez pepebuho@gmail.com wrote:
I know, I am just digressing. As far as the original question, I understand I will have to reset the nvram and recheck.
On Mon, Nov 6, 2023, 02:52 Samuel Sieb samuel@sieb.net wrote:
On 11/5/23 23:39, Javier Perez wrote:
Hi. I could not find the specs back in the crucial web page. But from a review site I could find a figure, 72TB https://www.storagereview.com/review/crucial-mx100-ssd-review https://www.storagereview.com/review/crucial-mx100-ssd-review Looks like I have used up about 16% of its Endurance.
But this is entirely irrelevant to the original question. It has nothing to do with your SSD. efivars isn't on a disk as was explained. _______________________________________________ users mailing list -- users@lists.fedoraproject.org To unsubscribe send an email to users-leave@lists.fedoraproject.org Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue