[ltp] Re: Error 2100 - SSD in good health, but not booting into laptop
Axel Braun
linux-thinkpad@linux-thinkpad.org
Wed, 04 Jan 2017 18:33:18 +0100
Joerg Bruehe wrote:
> Hi Fen, all!
>
> On 03.01.2017 16:39, Fen Labalme wrote:
>> [[...]]
>>
>> I have a T530 with Samsung 840 EVO SSD running Arch Linux. Been running
>> fine (for three years) then just wouldn't boot. Got the "2100: Detection
>> error" screen of death but the drive passes Lenovo diagnostics (all
>> SMART tests, etc.). With an Arch or Ubuntu Live CD it boots and I can
>> mount the SSD (and made a full backup).
>
> Congratulations!
>
>> I reformatted the SSD and put a
>> fresh OS on it, but it still drops right into the boot menu like it
>> doesn't see the bootable HDD at all.
>
> Not sure whether the below will help you ...
>
> I just read an article in c't (a German computer magazine) where they
> had tested SSDs for failure by writing data at full speed. All had a
> much longer lifetime than specified by the manufacturer (good!), but the
> sad fact remains that SMART data did not really indicate the increasing
> wear, so they didn't give advance warning.
[...]
Quite interesting, just yesterday my Evo 840, 1TB died das well. It was 3
years and one month old! I contacted Samsung service and they claim that it
has only 3 years warranty. WTF! Lets see what the outcome is.
First the Laptop (ThinkPad T520) was frozen completely. After a restart some
error messages came up:
-----
Exception Emask 0x50 SAct 0x8 SErr 0x4090000 action 0xe frozen
[...] irq_stat connection status changed
failed command: FLUSH CACHE EXT
ata1.00:status: {DRDY}
ata1: COMRESET failed (errno=-16)
-----
I reinstalled the original HD into the Laptop (openSUSE 13.1 - quite aged
but still working :-) and could run a smartctl:
-----
T520:/home/docb # smartctl --all /dev/sdb
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.11.6-4-desktop] (SUSE RPM)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Device Model: Samsung SSD 840 EVO 1TB
Serial Number: S1D9NEADA00473E
LU WWN Device Id: 5 002538 85009abf2
Firmware Version: EXT0DB6Q
User Capacity: 1.000.204.886.016 bytes [1,00 TB]
Sector Size: 512 bytes logical/physical
Rotation Rate: Solid State Device
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: ACS-2, ATA8-ACS T13/1699-D revision 4c
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is: Tue Jan 3 14:29:46 2017 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection:
Disabled.
Self-test execution status: ( 0) The previous self-test routine
completed
without error or no self-test has
ever
been run.
Total time to complete Offline
data collection: (15000) seconds.
Offline data collection
capabilities: (0x53) SMART execute Offline immediate.
Auto Offline data collection on/off
support.
Suspend Offline collection upon new
command.
No Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 250) minutes.
SCT capabilities: (0x003d) SCT Status supported.
SCT Error Recovery Control
supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED
WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always
- 0
9 Power_On_Hours 0x0032 098 098 000 Old_age Always
- 5369
12 Power_Cycle_Count 0x0032 096 096 000 Old_age Always
- 3564
177 Wear_Leveling_Count 0x0013 099 099 000 Pre-fail Always
- 11
179 Used_Rsvd_Blk_Cnt_Tot 0x0013 100 100 010 Pre-fail Always
- 0
181 Program_Fail_Cnt_Total 0x0032 100 100 010 Old_age Always
- 0
182 Erase_Fail_Count_Total 0x0032 100 100 010 Old_age Always
- 0
183 Runtime_Bad_Block 0x0013 100 100 010 Pre-fail Always
- 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always
- 0
190 Airflow_Temperature_Cel 0x0032 071 049 000 Old_age Always
- 29
195 Hardware_ECC_Recovered 0x001a 200 200 000 Old_age Always
- 0
199 UDMA_CRC_Error_Count 0x003e 099 099 000 Old_age Always
- 10
235 Unknown_Attribute 0x0012 099 099 000 Old_age Always
- 140
241 Total_LBAs_Written 0x0032 099 099 000 Old_age Always
- 19612688471
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
----
If I see it right, it had 19,6 TB written
I tried to reformat as well, but installation of a new system (Leap 42.2)
always fails die to HD errors.
So, lets wait for the outcome of the warranty case....
Cheers
Axel