As in data destroying major.
Originally via Tech Report.
http://blog.algolia.com/when-solid-state-drives-are-not-that-solid/
Another reason to never go near Samsung SSDs from now on
-
Spartan@HIDevolution Company Representative
-
Spartan@HIDevolution Company Representative
https://blog.algolia.com/when-solid-state-drives-are-not-that-solid/
The complete picture
At this moment we finally got a complete picture of what was going on. The system was issuing a TRIM to erase empty blocks, the command got misinterpreted by the drive and the controller erased blocks it was not supposed to. Therefore our files ended-up with 512 bytes of zeroes, files smaller than 512 bytes were completely zeroed. When we were lucky enough, the misbehaving TRIM hit the super-block of the filesystem and caused a corruption. After disabling the TRIM, the live big files were no longer corrupted but the small files that were once mapped to the memory and never changed since then had two states – correct content in the memory and corrupted one on the drive. Running a check on the files found nothing because they were never fetched again from the drive and just silently read from the memory. Massive reboot of servers came into play to restore the data consistency but after many weeks of hunting a ghost we came to the end.
As a result, we informed our server provider about the affected SSDs and they informed the manufacturer. Our new deployments were switched to different SSD drives and we don’t recommend anyone to use any SSD that is anyhow mentioned in a bad way by the Linux kernel. Also be careful, even when you don’t enable the TRIM explicitly, at least since Ubuntu 14.04 the explicit FSTRIM runs in a cron once per week on all partitions – the freeze of your storage for a couple of seconds will be your smallest problem.
=================================================================
Broken SSDs:
SAMSUNG MZ7WD480HCGM-00003
SAMSUNG MZ7GE480HMHP-00003
SAMSUNG MZ7GE240HMGR-00003
Samsung SSD 840 PRO Series
recently blacklisted for 8-series blacklist
Samsung SSD 850 PRO 512GB
recently blacklisted as 850 Pro and later in 8-series blacklist
Working SSDs:
Intel S3500
Intel S3700
Intel S3710Tinderbox (UK) likes this. -
Tinderbox (UK) BAKED BEAN KING
Well I have an Plextor, so i am alright, not good for Samsung`s reputation.
And Intel uses Samsung WHAT????
I recommended an Samsung to my nephew he got an 256GB 850 i dont think it was a pro, so i think he will be fine.
So will a firmware update fix this problem? , should you turn trim off until a fix is found.
John.Last edited: Jun 16, 2015 -
Spartan@HIDevolution Company Representative
Initially, I had a 256GB 850 PRO which had amazing performance.
Next, I buy two 850 PRO 1TB as an upgrade but they had the new firmware which was known to be buggy and was removed from the Samsung site few days after releasing it. Not only did the new firmware brick many people's 850s, it had worse performance. So I was waiting for 3 months hoping that Samsung would release a new firmware to fix their bad firmware but nope. So I got tired and sold it as the performance dropped from 520 / 500 to 480/460 on the sequential tests so I lost hope in Samsung after their recent firmware issues one after the other and sold em. And now this, the topping on the cake -
Tinderbox (UK) BAKED BEAN KING
Yeah, but now Samsung will test their SSD firmware a lot more, so you can expect slower releases once the current trim problem is fixed.
I will still buy Samsung if i am in the market for an larger SSD.
John. -
I am now in the market for a 1TB SSD. This is too replace the second 750GB HDD and eventually have Linux on it. I have a 480GB Mushkin Chronos drive with Windows 7 for the primary right now.
I may just tear the install down and use the 1TB with Linux as a primary drive and rebuild the Windows 7 install. Once I can convert all non essential system internet access to Linux I will no longer have an issue with the 2020 date for production work in Windows 7. -
Hmm, do you think this would affect Windows systems as well with the affected drives?
-
This seems like Linux only issue. I have 850 Pro 1TB OS drive and write/erase around 2-300GB par day due to work stuff. I run TRIM (Windows defrag & optimise utility) everyday and never saw any problem.
-
saturnotaku Notebook Nobel Laureate
Unless you're running Linux,
Molehill ---> Mountaindjembe likes this. -
According to the article, the error was in how Samsung SSDs interpreted some of the TRIM commands in the Linux kernel. In simple terms, they were saying these drives should have been listed on the blacklist, which would specify commands to avoid the problem, but were not listed. As noted in the list, all Samsung 830/840/850 drives were placed on the blacklist recently (presumably after the article was first published and/or after the company's experiences), which should ensure they work properly. According to the blacklist (linked in the article), both Micron/Crucial and Samsung SSDs exhibit some problems with the default implementation of queued TRIM in the Linux kernel, which is fixed by assigning specific actions to those drives.
So essentially it's a problem that only happens in Linux and either has been or is in the process of being addressed in the Linux kernel. I agree with Saturnotaku. This is only going to affect a very small number of users and there's a fix available, so it's not anywhere near the catastrophe the original poster made it out to be. -
I have too ask as well, is this all based form a Linux kernel under BSD? What kernel(s) will be affected by this?
-
It is not only Samsung drives - Crucial, and some other brands are also now blacklisted with horribly broken queued trim.alexhawker likes this. -
I see. I'll just hope that MS/Intel do some testing with a wide variety of hardware combinations before they decide to implement queued TRIM in their Windows 8/10 drivers. If they ever decide to.
-
Unqueued TRIM means that the host must stop all writes and wait for queued writes to finish, send the TRIM, then when it is complete, start writing to the drive again. This means that disk io freezes while the TRIM is processed. For an enterprise web server, this might be unacceptable (which is why the enterprise shop with the horror story was using drives allegedly implementing QUEUED TRIM). But for a consumer drive in a single user system, I'm going to run fstrim once a week at 2am, and I really don't care if disk io freezes for a few seconds.
QUEUED TRIM is a very tricky real-time task to get right. You have hundreds of queued writes in process, many of which will require erasing an erase block and copying in-use sectors - and updating the on-drive data structures that track all that. And now the queued trim is going to jump in there and update the same data structures in a different way at the same time. Multi-threaded code is notoriously hard to test and debug. So it really is no reflection on the manhood of Samsung engineers that they failed to get it working reliably in time for market. I suspect that the decision to make the drive advertise it anyway was made by a PHB.
One way to amortize the cost of getting queued trim to work right is to have it OFF on consumer drives, and only enabled on more expensive enterprise drives (where it had better work).Last edited: Jun 30, 2015
Major TRIM bug found in Samsung SSD's (Limited to Linux)
Discussion in 'Hardware Components and Aftermarket Upgrades' started by Spartan@HIDevolution, Jun 16, 2015.