Sample Header Ad - 728x90

Is my SSD dead? can't mount. I/O errors, 29k+ fsck errors

0 votes
1 answer
332 views
So I work with a couple of people and we ssh into a Linux server for work. There are several SSDs, and one of the non-boot Samsung 870 SSDs crashed the other day in the morning. This led to an inability to SSH into the server for those whose home directory are in this SSD. I did `dmesg and saw the following errors. what can I do at this point? besides restoring to a back up (I didn't make any recently... oops) [Thu Jun 27 09:50:12 2024] RTL8226 2.5Gbps PHY r8169-0-4600:00: attached PHY driver (mii_bus:phy_addr=r8169-0-4600:00, irq=MAC) [Thu Jun 27 09:50:13 2024] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 [Thu Jun 27 09:50:13 2024] ata3.00: irq_stat 0x40000001 [Thu Jun 27 09:50:13 2024] ata3.00: failed command: FLUSH CACHE EXT [Thu Jun 27 09:50:13 2024] ata3.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 2 res 51/04:00:00:00:00/00:00:00:00:00/a0 Emask 0x1 (device error) [Thu Jun 27 09:50:13 2024] ata3.00: status: { DRDY ERR } [Thu Jun 27 09:50:13 2024] ata3.00: error: { ABRT } [Thu Jun 27 09:50:13 2024] ata3.00: supports DRM functions and may not be fully accessible [Thu Jun 27 09:50:13 2024] ata3.00: failed to enable AA (error_mask=0x1) [Thu Jun 27 09:50:13 2024] ata3.00: supports DRM functions and may not be fully accessible [Thu Jun 27 09:50:13 2024] ata3.00: failed to enable AA (error_mask=0x1) [Thu Jun 27 09:50:13 2024] ata3.00: configured for UDMA/133 (device error ignored) [Thu Jun 27 09:50:13 2024] ata3.00: device reported invalid CHS sector 0 [Thu Jun 27 09:50:13 2024] ata3: EH complete [Thu Jun 27 09:50:13 2024] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 [Thu Jun 27 09:50:13 2024] ata3.00: irq_stat 0x40000001 [Thu Jun 27 09:50:13 2024] ata3.00: failed command: FLUSH CACHE EXT [Thu Jun 27 09:50:13 2024] ata3.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 10 res 51/04:00:00:00:00/00:00:00:00:00/a0 Emask 0x1 (device error) [Thu Jun 27 09:50:14 2024] ata3: EH complete [Thu Jun 27 09:50:14 2024] ata3.00: Enabling discard_zeroes_data [Thu Jun 27 09:50:16 2024] atlantic 0000:44:00.0 enp68s0: atlantic: link change old 0 new 1000 [Thu Jun 27 09:50:16 2024] IPv6: ADDRCONF(NETDEV_CHANGE): enp68s0: link becomes ready [Thu Jun 27 09:50:23 2024] rfkill: input handler disabled [Thu Jun 27 09:50:35 2024] Bluetooth: RFCOMM TTY layer initialized [Thu Jun 27 09:50:35 2024] Bluetooth: RFCOMM socket layer initialized [Thu Jun 27 09:50:35 2024] Bluetooth: RFCOMM ver 1.11 [Thu Jun 27 09:50:36 2024] rfkill: input handler enabled [Thu Jun 27 09:50:39 2024] rfkill: input handler disabled [Thu Jun 27 09:51:06 2024] EXT4-fs (sda): warning: mounting fs with errors, running e2fsck is recommended [Thu Jun 27 09:51:06 2024] EXT4-fs (sda): mounted filesystem with ordered data mode. Opts: (null). Quota mode: none. [Thu Jun 27 09:51:11 2024] EXT4-fs (sdc): mounted filesystem with ordered data mode. Opts: (null). Quota mode: none. [Thu Jun 27 09:51:16 2024] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 [Thu Jun 27 09:51:16 2024] ata3.00: irq_stat 0x40000001 [Thu Jun 27 09:51:16 2024] ata3.00: failed command: WRITE DMA [Thu Jun 27 09:51:16 2024] ata3.00: cmd ca/00:08:00:00:00/00:00:00:00:00/e0 tag 9 dma 4096 out res 51/04:08:00:00:00/00:00:00:00:00/e0 Emask 0x1 (device error) [Thu Jun 27 09:51:06 2024] EXT4-fs (sda): warning: mounting fs with errors, running e2fsck is recommended [Thu Jun 27 09:51:06 2024] EXT4-fs (sda): mounted filesystem with ordered data mode. Opts: (null). Quota mode: none. [Thu Jun 27 09:51:28 2024] Buffer I/O error on dev sdb, logical block 0, lost async page write [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] Read Capacity(10) failed: Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] Sense not available. [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#20 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#20 CDB: Write(16) 8a 00 00 00 00 00 00 00 02 80 00 00 00 08 00 00 [Thu Jun 27 09:51:28 2024] blk_update_request: I/O error, dev sdb, sector 640 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio cla ss 0 ...(repeats until sector 800)... [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#11 access beyond end of device [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#25 access beyond end of device [Thu Jun 27 09:51:28 2024] JBD2: recovery failed [Thu Jun 27 09:51:28 2024] EXT4-fs (sdb): error loading journal [Thu Jun 27 09:51:28 2024] Buffer I/O error on dev sdb, logical block 0, lost async page write [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] Read Capacity(10) failed: Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] Sense not available. [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#20 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#20 CDB: Write(16) 8a 00 00 00 00 00 00 00 02 80 00 00 00 08 00 00 [Thu Jun 27 09:51:28 2024] blk_update_request: I/O error, dev sdb, sector 640 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio cla ss 0 [Thu Jun 27 09:51:28 2024] Buffer I/O error on dev sdb, logical block 80, lost async page write [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#21 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#21 CDB: Write(16) 8a 00 00 00 00 00 00 00 02 90 00 00 00 08 00 00 [Thu Jun 27 09:51:28 2024] blk_update_request: I/O error, dev sdb, sector 656 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio cla ss 0 [Thu Jun 27 09:51:28 2024] Buffer I/O error on dev sdb, logical block 82, lost async page write [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] 0 512-byte logical blocks: (0 B/0 B) [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#22 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#22 CDB: Write(16) 8a 00 00 00 00 00 00 00 02 b0 00 00 00 08 00 00 [Thu Jun 27 09:51:28 2024] blk_update_request: I/O error, dev sdb, sector 688 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio cla ss 0 [Thu Jun 27 09:51:28 2024] Buffer I/O error on dev sdb, logical block 86, lost async page write [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#23 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#23 CDB: Write(16) 8a 00 00 00 00 00 00 00 02 e0 00 00 00 10 00 00 [Thu Jun 27 09:51:28 2024] blk_update_request: I/O error, dev sdb, sector 736 op 0x1:(WRITE) flags 0x800 phys_seg 2 prio cla ss 0 [Thu Jun 27 09:51:28 2024] Buffer I/O error on dev sdb, logical block 92, lost async page write [Thu Jun 27 09:51:28 2024] Buffer I/O error on dev sdb, logical block 93, lost async page write [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#26 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#26 CDB: Write(16) 8a 00 00 00 00 00 00 00 03 10 00 00 00 08 00 00 [Thu Jun 27 09:51:28 2024] blk_update_request: I/O error, dev sdb, sector 784 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio cla ss 0 [Thu Jun 27 09:51:28 2024] Buffer I/O error on dev sdb, logical block 98, lost async page write [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#27 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#27 CDB: Write(16) 8a 00 00 00 00 00 00 00 03 20 00 00 00 08 00 00 [Thu Jun 27 09:51:28 2024] sdb: detected capacity change from 7814037168 to 0 [Thu Jun 27 09:51:28 2024] blk_update_request: I/O error, dev sdb, sector 800 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio cla ss 0 [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#11 access beyond end of device [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#25 access beyond end of device [Thu Jun 27 09:51:28 2024] JBD2: recovery failed [Thu Jun 27 09:51:28 2024] EXT4-fs (sdb): error loading journal [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#27 CDB: Write(16) 8a 00 00 00 00 00 00 00 03 20 00 00 00 08 00 00 [Thu Jun 27 09:51:28 2024] sdb: detected capacity change from 7814037168 to 0 [Thu Jun 27 09:51:28 2024] blk_update_request: I/O error, dev sdb, sector 800 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio class 0 [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#11 access beyond end of device [Thu Jun 27 09:51:28 2024] sd 2:0:0:0: [sdb] tag#25 access beyond end of device ...... [Thu Jun 27 09:52:19 2024] ata2.00: Enabling discard_zeroes_data [Thu Jun 27 09:52:19 2024] ata7.00: Enabling discard_zeroes_data [Thu Jun 27 09:56:11 2024] EXT4-fs (sda): error count since last fsck: 29674 [Thu Jun 27 09:56:11 2024] EXT4-fs (sda): initial error at time 1677717172: __ext4_get_inode_loc_noinmem:4410: inode 179610110: block 718277183 and then there was one of these every 10 minutes throughout the day [Thu Jun 27 13:30:14 2024] sd 2:0:0:0: [sdb] tag#6 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s [Thu Jun 27 13:30:14 2024] sd 2:0:0:0: [sdb] tag#6 CDB: ATA command pass through(16) 85 06 20 00 00 00 00 00 00 00 00 00 00 00 e5 00 As far as I’m aware, the things to avoid are: - fsck Things I’m not sure I should avoid (please let me know what proper safe procedure is if I want to recover data) - fsck -n - Smartctl My options (which should I choose?) - ddrescue with the right options to make it fast and read good sectors first. Can this damage the ssd further? - Send to data recovery people
Asked by Derek Xiao (1 rep)
Jul 1, 2024, 07:56 PM
Last activity: Jul 1, 2024, 10:03 PM