Updated WD Black NVMe: what is 3D NAND really capable of?
By the standards of the market of computer components, Western Digital began to master the segment of solid-state drives relatively recently: if you omit the first attempts in the form of Silicon Edge Blue (2009) and hybrid WD Black SSHD (2013), you can call the starting point on October 11, 2016, when the announcement line of SATA-drives WD Green and Blue SSD, and a little later came the high-speed WD Black NVMe SSD, aimed at enthusiasts. 2 years have passed since then. During this time, we have redesigned production facilities to release an innovative 3D NAND, devoid of a number of flaws in planar flash memory, and updated the lineup of SSD drives. The “Black” series also evolved: on June 4, 2018, we introduced a new generation of WD Black 3D NVMe SSD. What benefits did the transition to the new process technology provide and how do NVMe devices differ from SATA?
WD Black 3D NVMe SSD clearly demonstrated all the advantages of the new architecture of flash memory chips and a redesigned controller, which significantly improved the performance of the drives, as the table below shows.
Comparison of WD Black NVMe SSD performance of the first and second generations using the example of 500 GB models
PCI Express 3.0 x4 - NVMe
SanDisk 15-nm 128-Gbps TLC NAND
SanDisk 64-layer 256-Gbps BiCS3 3D TLC NAND
LPDDR3-1600, 512 MB
DDR4-2400, 512 MB
Sequential read speed, MB / s
Sequential write speed, MB / s
Random read speed (4 KB blocks), IOPS
Random write speed (4 KB blocks), IOPS
Record resource, TB
The results are really impressive, and even more strange is the situation with SATA versions of solid-state drives. Surely, those who followed the development of 3D NAND technology were disappointed after reading the numbers:
- sequential read speed - 560 MB / s against 545 previously;
- sequential write speed - 530 MB / s against 525;
- random read speed - 95000 IOPS against 100000;
- random write speed - 84000 IOPS against 80,000.
Confusing, isn't it? Other things being equal, in the case of SATA, the performance remained almost unchanged, but then what was the point of modernization? In short, an increase in the volume in the former form factor, as well as an increase in the fault tolerance of the drives, which we wrote about in detail in one of the previous materials on 3D NAND . As for the speed indicators, alas, here everything rests on the interface itself, which simply do not allow to unleash the full potential of the technology. We will talk more about this today.
A few words about SATA, AHCI and NVMe
A long time ago, a few decades ago, the hard drives for personal computers used the ST-506/412 interface, named after the first 5.25-inch hard drive.
The first 5.25-inch Seagate ST-506 hard drive.
Its main feature was that the HDD board itself contained only engine control modules, an actuator and a head switch, as well as analog processing - everything else was located in the controller of the PC itself. This approach was rather cheap to implement, but even then the interface bandwidth was lower than the first personal computers could potentially provide. But in those days there was nowhere to hurry, but in terms of price and quality, everyone was happy with everything.
However, progress did not stand still, and the needs of private and commercial users grew inexorably. Neither the previous performance nor the number of devices (ST-506/412 allowed to connect only two disks) could not satisfy anyone, and the developers began to look for an alternative. This is how ATA emerged, which for many years became a standard for all PCs, which was largely supported by the expansion of ATAPI, which in essence was the implementation of a set of SCSI standards used in high-performance servers.
A few years later, the parallel interface ran into the 133 MB / s ceiling. The peculiarity of the ST-506/412 inherited from the aforementioned one was known: a pair of disk devices were connected to one channel, and it turned out to be impossible to level the mutual influence of each other on each other. This is how SATA appeared, the bandwidth of which increased to 150 MB / s with the possibility of further expansion, and thanks to the use of a star topology, each channel became independent.
By and large, it was the topology that was the trump card, since even one and a half hundred megabytes were redundant for slow drives of that time. To improve their performance, it was necessary to optimize read / write requests and minimize the number of head unit movements. And then the Advanced Host Controller Interface (AHCI) with the support of the Native Command Queuing (NCQ) technology, that is, the hardware installation of the order of commands, came at an opportune time. Of course, there is no real parallelism here: although NCQ can receive requests from several sources at the same time, their further reorganization is carried out within one queue, helping only to reduce the number of movement of writing heads and the waiting period for the desired sector on the track. On the other hand, more was not required
With the advent of solid-state drives, the situation has changed to the exact opposite. For SSD, even SATA III was cramped, and only the development of a new protocol helped change the situation. Crucially: if AHCI supported only one queue with a depth of 32 requests, then the Non-Volatile Memory Host Controller Interface (NVMe) introduced in August 2012 was already able to handle 65536 queues with a depth of 65536 each (ie, 64K) and use multi-core processors. Also added optimization of interrupt latency, providing almost a one and a half performance gain.
Comparing read / write delays between SAS, SATA and NVMe
However, NVMe is only a tool, and by itself could not provide such impressive results without proper hardware support.
Unleash potential: what are new controllers capable of?
Choosing controllers for the updated WD Black SSD line, we came to a disappointing conclusion: the solutions offered by Marvell, which were used earlier, as well as the products of their competitors, simply do not meet our current needs. The only right step in this situation was the creation of our own microcontrollers, so we got, firstly, complete independence from third-party developers, and secondly - the ability to subtly optimize the hardware platform for the features of specific drive modifications.
Operations are distributed between computational blocks.
So there was SanDisk 20-82-007011, which was based on the 28-nanometer tri-core processor on the ARM Cortex-R, which is already noticeably superior in power to the Marvell Eldora, previously used. The main innovation was the transfer of a part of operations processed at the program level to specialized computational units: for example, reading data from flash memory, processing NVMe commands, as well as LDPC coding. The latter, by the way, became three-level and acquired a layered character. In practice, this means that the most suitable algorithm is selected based on the degree of wear of memory cells. The new WD Black 3D NAND SSD that has just been installed on a computer will use a lightweight version of error correction, featuring fast execution and minimal energy consumption. On the contrary
The ECC scenario is selected depending on the state of the flash memory.
The controller itself supports the newest protocol NVM Express 1.3, while it implements 4 PCI-E 3.0 lanes, and the information is transmitted via eight channels, so there are no problems with throughput.
We also worked on eliminating another bottleneck, which was played by the cache. As before, there is an intermediate SLC buffer here, but the proprietary algorithm nCache 3.0 has now learned to switch to direct-to-TLC mode and write data directly to the TLC memory. What does this mean? You, as before, take full advantage of the high-speed cache, but if the buffer overflows, there will be no dramatic drop in performance, since the information will be flushed directly into memory, bypassing the cache. At the same time, in high-speed SLC-mode, WD Black NVMe demonstrates impressive 2.4 GB / s sequential recording, and direct recording to TLC-memory is performed at a speed of 840 MB / s, which is more than 2 times faster than the previous modification. The amount of cache memory remained the same, which made the device cheaper.
The dynamics of performance changes as the cache is filled.
Thus, the WD Black NVMe SSD turned out to be really balanced: the updated hardware platform perfectly complements the 3D NAND chips, fully unleashing their potential. The bottom line is a truly reliable solution, aimed at those who see SATA as too narrow, demonstrating decent performance indicators and capable of competing with the majority of reference models in the consumer segment due to the best price / quality ratio.