Discussion
Loading...

Post

  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Anthropy
@anthropy@mastodon.derg.nz  ·  activity timestamp 4 days ago

Reminder that SMART is not magic, in my experience, disks are dead long before SMART stops reporting them as 'PASSED'.

I actually came across the perfect example today: Two disks in my backup server are having issues, one is clearly broken, the other gave 2 checksum errors.

Despite this, neither disk is 'failed' according to SMART, one has 456 offline uncorrectable errors / pending sectors, the other one is fine.

Don't rely on broken hardware to tell you that it's broken.

#storage #HDD #ZFS

the other disk's SMART info, a Seagate 4TB drive, showing 456 pending/offline uncorrectable sectors, however it is also marked as 'PASSED'. The disk has been used for 7 years (70k hours), and is currently 27C.
the other disk's SMART info, a Seagate 4TB drive, showing 456 pending/offline uncorrectable sectors, however it is also marked as 'PASSED'. The disk has been used for 7 years (70k hours), and is currently 27C.
the other disk's SMART info, a Seagate 4TB drive, showing 456 pending/offline uncorrectable sectors, however it is also marked as 'PASSED'. The disk has been used for 7 years (70k hours), and is currently 27C.
the disk that had 2 checksum errors, a Seagate 4TB drive, showing zero signs of issues in SMART, healthy temperature of 29C, though it has been on for 68145 hours (7.7 years)
the disk that had 2 checksum errors, a Seagate 4TB drive, showing zero signs of issues in SMART, healthy temperature of 29C, though it has been on for 68145 hours (7.7 years)
the disk that had 2 checksum errors, a Seagate 4TB drive, showing zero signs of issues in SMART, healthy temperature of 29C, though it has been on for 68145 hours (7.7 years)
ZFS pool status overview, showing 2 disks with issues, 1 being marked as FAULTED. The faulted one has 789 Read errors and 14 checksum errors. The other one only has 2 checksum errors.
ZFS pool status overview, showing 2 disks with issues, 1 being marked as FAULTED. The faulted one has 789 Read errors and 14 checksum errors. The other one only has 2 checksum errors.
ZFS pool status overview, showing 2 disks with issues, 1 being marked as FAULTED. The faulted one has 789 Read errors and 14 checksum errors. The other one only has 2 checksum errors.
  • Copy link
  • Flag this post
  • Block
Anthropy
@anthropy@mastodon.derg.nz replied  ·  activity timestamp 4 days ago

"haha seagate is bad"
these are enterprise seagate constellation SAS disks, not the OEM seagate disks used in cheap laptops, and they've ran for 7-8 years without issues.

"wait is that an SSD in the middle of that HDD pool"
yea at some point a disk failed and getting a 4TB QLC SSD was actually cheaper at the time woozy_king it's a mixed array anyway, it wouldn't be a Redundant Array of INEXPENSIVE Disks if I stuffed it full of new enterprise drives haha (smh companies always get this wrong)

  • Copy link
  • Flag this comment
  • Block
Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.0 no JS en
Automatic federation enabled
  • Explore
  • About
  • Members
  • Code of Conduct
Home
Login