PCIe 4.0 Card Hosts 21 M.2 SSDs: Up To 168TB, 31 GB/s

eqvinox · on March 13, 2023

(Pulling up from child comment)

The chip this uses is likely a PM4x100 (x ∈ {0, 1, 2}) from Microchip (formerly Microsemi (formerly PMC-Sierra)):

https://www.microchip.com/en-us/product/PM40100

^ runs you $800 without bulk discounts [https://www2.mouser.com/ProductDetail/Microchip-Technology-A...] — if you can get them, that is.

https://www.microchip.com/en-us/product/PM41100

https://www.microchip.com/en-us/product/PM42100

^ these latter two I don't see publicly listed prices for anywhere.

The PCIe 5.0 equivalent is in "Samples available", i.e. not full production yet, which is likely why the card only does PCIe 4.0:

https://www.microchip.com/en-us/product/PM50100

h2odragon · on March 13, 2023

> the manufacturer confirmed that the X21 offers 100 PCIe lanes, suggesting the presence of a PCIe switch.

Almost like it's custom designed for a particular application where money's no concern... Perhaps someone in Utah needs big rainbow tables?

rektide · on March 13, 2023

PCIe switches just shouldn't be so damned expensive. A decade ago there was a lot more market competition but now there are, what, two companies with chips?

It has gotten a good bit harder to build, especially with so many of the tricks & tight timings in PCIe 5 and 6, but the lack of market competition has made getting any kind of parts at all much much more expensive.

bick_nyers · on March 13, 2023

With the cost of PLX PCIE Switches (allows you to e.g. PCIE 4.0 x16 -> PCIE 3.0 x32) it is actually worth considering just buying a second desktop and throwing in some high-bandwidth NIC and forming your own HPC. Or instead of using desktop parts just going EPYC/Xeon/Threadripper.

Of course it all comes down to the fact that if you need those PCIE lanes, there's a very good chance that it's for your job, meaning that businesses are the target market, not the enthusiast building a homelab for tinkering with LLM off the clock.

eqvinox · on March 13, 2023

> Almost like it's custom designed […]

https://www.microchip.com/en-us/product/PM42100

It's a standard COTS part.

Coincidentally, the PCIe 5.0 variant is in "Samples available", i.e. not full production yet, which is very likely the reason for this card only being PCIe 4.0.

https://www.microchip.com/en-us/product/PM50100

rasz · on March 13, 2023

Doesn't help that PM42100 is $7.5K and out of stock.

eqvinox · on March 13, 2023

That's the price of the development/evaluation kit. Those are produced in small numbers, have provisions for everything and debugging the kitchen sink, and are thus always this expensive.

With the PM40100 being $800 (single unit, no bulk pricing), the PM41100 / PM42100 are probably < $1500. (They do seem to have more features, not quite clear without proper datasheet sadly.)

amluto · on March 13, 2023

This actually sounds like it could be a nice mid-range product. For lots of money, you can get a fancy motherboard and enclosure that routes a ton of CPU PCIe lanes direct to the NVMe drives. This ends up with a lot of performance per unit storage, which one might not want.

With a card like this, one can get a ton of high-speed (much better than SATA but not as fast as direct NVMe) storage in a regular machine.

bick_nyers · on March 13, 2023

Could one hypothetically install an M.2 -> PCIE x16 riser and install a GPU?

amluto · on March 13, 2023

I don’t see why not. OTOH, if you try this on an NVMe “hardware RAID” slot, you may get hilarious results.

bick_nyers · on March 13, 2023

Stripe your GPU in a Raid 0 configuration for maximum performance, if your GPU doesn't have ECC VRAM, consider mirroring them :)

mikece · on March 13, 2023

This is No Such Agency who would buy as many of these as could be produced...

jeffbee · on March 13, 2023

People like to throw the innuendo around but the NSA's pathetic little datacenter is something that you would lose in a corner of a real datacenter operated by a real hyperscale system like Amazon or Google.

zamnos · on March 13, 2023

Just like a cluster of Bitcoin miners will run absolute circles around a similarly sized corner of an AWS data center, and the supercomputer at Oak Ridge will run circles around a similar sized corner of AWS of GPU EC2 instances connected via gigabit Ethernet, the NSA's cluster's got a different use case than running web services for every SaaS company that wants to run in AWS. I imagine it's aimed at saving and analyzing/decrypting large amounts of data, and thus is architected and tuned towards that purpose, and thus runs circles around a similarly sized corner of AWS for that particular task.

Unless you have experience with the NSA's cluster that you'd like to share with the rest of the class, that is.

jeffbee · on March 13, 2023

Needs citation. I think the idea that the NSA has stronger data storage and analysis infrastructure than commercial operators is not even conjecture, it's something weaker, a fantasy. Commercial hyperscale operators claimed the ability to sort 50PB datasets at 600GB/s, eleven years ago. Storing and analyzing bulk data is the #1 thing these guys are good at.

xen2xen1 · on March 13, 2023

But the NSA one is dedicated to invading everyone's privac... Hey, wait!

throitallaway · on March 13, 2023

Does anyone else marvel at data throughputs nowadays? People talk about 5GB/s NVME cards as being "slow." Same with Internet speeds. It's unreal the progress that we've made (and continue to make.)

DaiPlusPlus · on March 13, 2023

I marvel at 5GB/s mass-storage, but then I groan at the knowledge it will be used to run a denormalized Postgres database table containing JSON blobs for queries that will do full tablescans because apparently dropping $Lots on a high-end storage system is preferable to learning how to do things properly.

LeifCarrotson · on March 13, 2023

I'm in the automation/manufacturing industry, and while I know there are big servers running ERPs and SCMs and MES and other TLAs with poorly-optimized giant databases I don't personally touch those. But still I suspect the same truths apply whether you're talking about "big iron" computers or literal metals: when evaluating the strength of an average weldment on an ordinary conveyor or machine, we like to say "Steel is cheap, engineering is expensive."

Obviously there are situations when you have to employ more rigor and do the FEA, but typically, when choosing between a just-right solution and one that's obviously strong enough, just overbuilding it is a lot more efficient in terms of value.

With 21 of the pictured $150 Samsung 1TB 990 Pro SSDs and, hypothetically, $1000 for this card, you're looking at $4,150 for this storage solution. If that solves your problem and lets you apply off-the-shelf Postgres and JSON and unoptimized queries, do it! That money only buys a handful of site visits and maybe a week of engineering hours to change a system that may involve dozens of users, tens or hundreds of thousands of lines of code, and rigid requirements from upstream and downstream...maybe you can change those eventually, and it would definitely have been cheaper if all the stakeholders had a fundamental understanding of the compute requirements of full table scans and non-native blobs and designed their business around those mathematics, but that doesn't sound likely.

whoomp12342 · on March 13, 2023

reasoning: engineers can be lazy too

foepys · on March 13, 2023

The end result of this is something like Microsoft Teams where they are in the process of swapping out the entire underlying runtime because the team that wrote the application itself was so lazy (read: incompetent) that it is slow on literally every machine it runs on and this is apparently the only sane way to fix the whole mess - bar a whole rewrite of the entire application.

0cf8612b2e1e · on March 13, 2023

Is this true or confirmed somewhere official? Clearly the underlying architecture has major issues.

DaiPlusPlus · on March 14, 2023

According to this 2021 blog article[1], MS Teams is undergoing two simultaneous overhauls:

1. Teams will use a shared, Windows systemwide Blink-based WebView2-based host instead of using its own private Electron environment.

2. The Teams' UI is changing from Angular to ReactJS.

So Teams will remain a modern-day HTA, for better or for worse, but sourcing from my own experiences working with Angular, ReactJS, and MS's WebView2 vs. Electron, I'm not convinced any of these changes will substantially benefit the end-user experience except perhaps a modest reduction in memory-usage attributed to using WebView2 instead of Electron.

[1]: https://blog.thoughtstuff.co.uk/2021/08/stop-saying-microsof...

canucker2016 · on March 14, 2023

I assume it's a lot easier to find React developers and answers to React-related programming questions than for Angular in 2023+.

DaiPlusPlus · on March 14, 2023

Disclaimer: I'm a former Microsoft FTE SE.

Microsoft doesn't hire FTE SEs on the basis of their knowledge of a single platform or library - anyone who is good-enough overall will be able to familiarize themselves with Angular - or React - or any other framework, platform, or entire paradigm - that's how the industry works.

Employing people for knowledge with a specific library or platform can, and does, make sense, but only in a situation where a company needs a consultant or contractor(s) to make changes to an existing product for a short contract and then, poof, they no-longer work at the company.

While Microsoft does hire plenty of contractor staff (orange-badges, "v-dash trash", etc), only a minority of them are involved in product development, and an even tinier number of those are employed in any kind of consultancy role (which makes sense, considering that Microsoft almost entirely uses only its own platforms, frameworks and libraries for its consumer-facing products) - so the fact that Microsoft swallowed its pride and adopted Electron, Angular, React, Blink/Chromium in recent years marks a significant shift in the company's ideology (for want of a better word). No-one would have predicted this even as late as 2015.

runnerup · on March 13, 2023

Moreso, because this solution costs $4,000 so it only needs to save one week of one person's engineering time to be a more cost effective solution than “better software engineering”.

It’s laziness, but it’s cost effective laziness.

doubled112 · on March 13, 2023

Absolutely. I keep saying to people "it doesn't matter, everything is fast now".

Gigabit fibre to the home, NVMe that is way faster than RAM was not that long ago, CPUs in phones that make old desktops look like toasters.

The disconnect is that the numbers feel huge in comparison, and what my computer can do for me really, is not hugely different.

rnk · on March 14, 2023

That kind of reads like a sarcastic post. The US has so much terrible internet. I'm about to try triple channel bonding starlink, dsl, and an lte internet connection. Starlink and dsl drop packets a lot.

I'm in the US, pay $100 a month to get 5 mbit dsl. This is not in a big city though, 20 miles outside of a city, next to a highway. There's fiber that runs by the street at the small subdivision this is in, kind of in the woods. The company that owns it refuses to connect us to fiber, instead preferring to put 100 homes on 5mbit dsl at far more profit. There's one big commercial user that paid for the fiber. This is the story of the US of course, I'm not unique. A family member lives on the other side of the country, he's closer to a big city but has the same problem.

TheHappyOddish · on March 18, 2023

The important thing is you kept those damn socialists out of your country and let good old American free market forces solve the problem.

kiratp · on March 13, 2023

Let's all keep in mind that a very small portion of the global population has access to this. It is our responsibility to bring all of humanity forward with us as we write software.

https://perfnow.nl/speakers#alex

SketchySeaBeast · on March 13, 2023

While you're not wrong about the portions who can access it, I wonder at how many of us are doing work that brings humanity forward at all. Sure, let's not use fast systems as an excuse for bad code, but I'm not going to pretend my CRUD app is elevating humanity. Even the stuff that claims to feels like at best a lateral move a lot of the time.

tester756 · on March 13, 2023

Seems like we'll start using NVMe as RAM, so we'll be able to have higher RAM sizes

Night_Thastus · on March 13, 2023

Isn't NVMe storage much higher latency than RAM, which is no good for the CPU? IIRC, NVMe is also poor at random access.

bick_nyers · on March 13, 2023

Yup.

Now Optane on the other hand...

trentnelson · on March 13, 2023

Just found out Intel killed Optane (announced 2nd Aug 2022) :-(

bick_nyers · on March 14, 2023

Yes, but certain models are on a fire sale now, keep an eye on Newegg!

rhn_mk1 · on March 13, 2023

It's called swap.

tester756 · on March 13, 2023

Conceptually? yes, but I meant using NVMe fully as RAM, without your RAM sticks.

godelski · on March 13, 2023

Isn't the issue endurance? I'm pretty sure your drive would die before the year is over. Probably a month or two.

bravetraveler · on March 14, 2023

Latency would be significant as well

Endurance may surprise you, but then again - I haven't paid much attention to newer (cheaper/weaker/more dense) NAND types

doubled112 · on March 13, 2023

Could we go full circle and use that for a RAM disk?

sidpatil · on March 13, 2023

IIRC tmpfs already does that, by swapping to disk.

DaiPlusPlus · on March 13, 2023

> NVMe that is way faster than RAM was not that long ago

Citation?

doubled112 · on March 13, 2023

Depends on what you consider a long time, maybe.

https://www.samsung.com/us/computing/memory-storage/solid-st...

> Sequential read/write speeds up to 7,450/6,900 MB/s

https://en.wikipedia.org/wiki/DDR2_SDRAM

Lists DDR2-400 capable of 3200 MB/s of throughput.

ciupicri · on March 13, 2023

Yeah, but there's a random in RAM and SSDs aren't that great yet.

DaiPlusPlus · on March 14, 2023

Not to mention deletes/overwrites.

jakogut · on March 13, 2023

DDR2-800 has a maximum theoretical bandwidth of 6,400 MB/s, and was in common use well after 2010.

rektide · on March 13, 2023

DDR2-1066 was pretty rare (fast), and rated as PC2-8500, meaning 8.5GBps. DDR3 started here-about, in ~2007.

Consumer PCIe 5.0 ssds will in some cases likely surpass that.
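
The DDR2 figures quoted in this subthread all come from the same arithmetic: transfers per second times bus width. A minimal sketch, assuming a single 64-bit (8-byte) memory channel and peak theoretical rates only; the function name is illustrative:

```python
# Peak theoretical bandwidth of one DDR memory channel.
# Assumption: a single channel with a 64-bit (8-byte) data bus.
BUS_WIDTH_BYTES = 8


def ddr_peak_mb_per_s(mega_transfers_per_s: int) -> int:
    """Return peak bandwidth in MB/s for a module rated at the given MT/s."""
    return mega_transfers_per_s * BUS_WIDTH_BYTES


for rate in (400, 800, 1066):
    print(f"DDR2-{rate}: {ddr_peak_mb_per_s(rate)} MB/s")
```

This gives 3200 MB/s for DDR2-400, 6400 MB/s for DDR2-800, and 8528 MB/s for DDR2-1066 (sold as PC2-8500). Dual-channel setups double these numbers, and real-world throughput is lower than the peak.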

kickaha · on March 13, 2023

Got a VIC-20 for my 13th birthday in 1983. The local TV shop sold Commodore hardware and hosted a BBS that my little nerd cronies and I connected to at 300 bps. All of us knew the owner who ran the store and mentored us and hired some of us when we got old enough. On one visit he took me back, behind an actual curtain, and showed me the BBS machine. If you know the 1541 Disk Drive, you know. Well this beast had a Commodore branded 5 MB hard disk connected to a C-64. In my memory it was 6” by 8” by 24” long, with an enormous power supply, and must have cost thousands of Reagan-first-term US dollars.

Two or three years later another mentor hired me to put a 33 MB hard disk into his IBM PC. Not a clone. My memory tells me it was a DOS imposed limit, those 33 MB: the biggest drive available. I managed to plug the connector in upside down and released the magic smoke. That was a many-hundreds-of-dollars mistake. (And a good lesson in patient mentoring.)

In 1991 I obtained a used 80 MB drive (half height!) to put into my own PC XT clone, via a local Usenet group. I set the volume name to $1_PER_MB because going under that threshold was so impressive.

Those are my reference points for storage.

Mountain_Skies · on March 13, 2023

300 baud was comfortable reading speed, at least for me as a child. It was also fun to pick up the phone and be able to distinguish individual bytes of data (though not the actual content of the byte). Once we got to 1200 baud, it just became a stream of warbling.

kickaha · on March 13, 2023

I’m so old I forget that old memories can be found again with Wikipedia and Google.

https://en.m.wikipedia.org/wiki/Commodore_D9060

https://www.commodore-info.com/brochure/item/commodore_d9060...

Arrath · on March 13, 2023

> I set the volume name to $1_PER_MB because going under that threshold was so impressive.

Hah! Too funny, my own personal memory for 'cheap' storage was keeping an eye on the Fry's print ads in the Sunday newspaper while saving all my allowance and summer job money, finally buying the outrageously large 200GB HDD for a mere $1 a gig!

forinti · on March 13, 2023

Sometimes you have to move 15TB about and then nothing is fast enough.

At 5GB/s, that would take nearly an hour and 5GB/s would be the fastest part of the trip. If it has to land on tape or travel through the net, it's going to take days.

LeonM · on March 13, 2023

"Never underestimate the bandwidth of a station wagon full of magnetic tapes hurtling down the highway" - Andrew S. Tanenbaum

But you are right, writing to and reading from tape will take a long time. Modern tape drives can do ~500MB/s, so 15TB will still take ~9 hours. Though that may still be faster than a 1 gbit internet connection (depending on how far you must drive).
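
The transfer times in these two comments can be checked with a few lines. A rough sketch, assuming decimal units (1 TB = 10^12 bytes), a sustained rate for the whole transfer, and no protocol or filesystem overhead; the names are illustrative:

```python
TB = 10**12  # decimal terabyte, in bytes


def transfer_hours(size_bytes: float, rate_bytes_per_s: float) -> float:
    """Hours needed to move size_bytes at a constant rate."""
    return size_bytes / rate_bytes_per_s / 3600


size = 15 * TB
rates = {
    "5 GB/s NVMe": 5e9,
    "500 MB/s tape": 500e6,
    "1 Gbit/s network": 1e9 / 8,  # bits per second converted to bytes per second
}

for name, rate in rates.items():
    print(f"{name}: {transfer_hours(size, rate):.1f} h")
```

Under those assumptions, 15 TB takes about 0.8 hours at 5 GB/s, about 8.3 hours at 500 MB/s, and about 33.3 hours over a saturated 1 Gbit/s link.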

jgalt212 · on March 13, 2023

sneakernet will always be with us.

godelski · on March 13, 2023

Yes and no. Throughputs are crazy high these days but the rate at which they increase is slower than the rate at which compute increases, by a lot. So if we're discussing in a relative sense, then there's a growing divergence and thus one can argue that I/O is getting "slower." This is actually one of the major topics in HPC discussion and why there have been so many crazy hacks. Things like flashbuffers are pretty much essential these days. Even if you're doing multi-node ML training you see pretty big differences using infiniband due to the frequency in which nodes need to communicate (there are regularization interval tricks too). In scientific computing this is a big limitation to our ability to visualize at high resolutions and is why in situ visualization is growing popular.

As far as consumer hardware and consumer usages, yeah, everything just feels fast though.

Mountain_Skies · on March 13, 2023

It's starting to become difficult to comprehend much of it now. My ISP is pushing me to replace my 300 Mbps service with 2Gbps, but I never even saturate what I have. Maybe if I were a gamer with huge downloads it would make sense to upgrade.

SketchySeaBeast · on March 13, 2023

I'm a gamer on 300 Mbps. Even 100 GB huge games aren't much more than a half hour away on steam. I don't really see the value in being able to download that much in 5 minutes.

0cf8612b2e1e · on March 13, 2023

I do not know if it is still true, but originally the PlayStation did not perform patch diffs, and any kind of update could be 10s of gigabytes. If I were routinely having to wait to start a frequently patched game, that would get old pretty quickly.

Outside of that, yeah, I am not sure what use greater than gigabit would be for 95% of the population.

jlokier · on March 14, 2023

Meanwhile, my ISP is offering me 17 Mbps. This is in the UK, in the centre of a "SuperConnected City".

I use a phone hotspot for internet instead, because it's faster than any fixed line I can get.

runnerup · on March 13, 2023

Blizzard's servers never saturate my 1gbps even during extreme off hours.

alfiedotwtf · on March 13, 2023

Talking size and not speed, to be honest without joking, I still marvel that I have a 64GB USB stick. 64GB! HUGE!

vinyl7 · on March 13, 2023

It's a shame all our data is sent/received over HTTP these days, otherwise I'd be excited about it

organsnyder · on March 13, 2023

This is a very different HTTP than 1.0 or 1.1.

kevin_thibedeau · on March 13, 2023

The protocol implementation doesn't matter if everything is serialized into ASCII. At some point there's going to be a web 4.0 where people figure out the performance advantages of binary data.

jjoonathan · on March 13, 2023

What are the current strategies for leveraging NVMe speed & volume in a NAS?

When I look at NAS offerings, I see lots of 2.5" and 3.5" bays and 1Gbe (maaaybe 10Gbe at the high end) which is a bit stifling.

toast0 · on March 13, 2023

u.2 NVMe uses 2.5" bays and SATA Express connectors to offer up to 4 lanes of PCI-e or two lanes of SATA. That's probably where most of the enterprisy NAS is going.

jeffbee · on March 13, 2023

In my humble opinion some home NAS use cases are beneficially converted to Thunderbolt. It's a lot more practical than the faster varieties of ethernet. If you have two hosts that need fast access to the NAS you can do it with TB4, and one of those hosts can re-export the stored resources for applications with lesser performance requirements, over SMB or iSCSI or whatever.

jjoonathan · on March 13, 2023

Yeah, but then the primary host inherits the uptime requirements of a NAS and this gets really awkward when you have workflows in different operating systems that both want to use the fast storage. Both OSes can access thunderbolt, of course, but only one can be a good server. Now that I've experienced the separation of concerns that comes from an independent NAS I do not want to go back.

jeffbee · on March 13, 2023

Which host have you nominated as "primary" in this scenario? I'm only thinking of the TB4 link as a relatively cheap and simple point-to-point networking link. I think it's a lot more practical, and cheaper, than 25gbps ethernet. A dual-port 25gb ethernet NIC costs hundreds of dollars, while you can find all kinds of cheap computers with 2 TB4 ports.

jjoonathan · on March 13, 2023

The host connected to the NVMe storage.

rektide · on March 13, 2023

40Gbps host-to-host networking in thunderbolt/usb4 is so epic & great to have.

jjoonathan · on March 13, 2023

Does someone make a thunderbolt -> thunderbolt connector that exposes an ethernet PCIe endpoint to each host and forwards packets between them?

10 years ago I joked that someone should do this, but I thought high speed ethernet would trickle down and obviate the need. Evidently not, lol.

jeffbee · on March 13, 2023

IP-over-Thunderbolt is a thing. It's the thing we are discussing! All major operating systems can do it.

jjoonathan · on March 13, 2023

I didn't know you could just connect two thunderbolt hosts and get a network interface. Amazing!

Ok, so now I need to find a small cheap computer with a thunderbolt port or two and lots of NVMe.

MrFoof · on March 14, 2023

I really need to finish my YouTube video on this...

-- -----

So, the answer is you don't need a lot to get big impact.

Let's take some NICs like the Intel XL710-QDA1. These are about $250 used. This is a 40Gbps NIC using just PCIe 3.0 x8. I used DACs to drop power consumption through the floor, and to reduce latency. All of this is presented via iSCSI, and with jumbo frames to eke out a few extra percent throughput. If you're using passive DACs, figure 4W of power per port for the NIC, plus another 1.2W at the switch (assuming four SFP+ fan-out), for one end of that connection. You could also just direct connect between the server and the client.

At this point, you can basically shove older prosumer (say 980 Pro) PCIe 4.0 x4 NVMe sustained transfer over the network. Granted, if you're outside of that STR use case, you'll fundamentally be limited by IOPS. Figure for every 40Gbps, you can throw up to 1,200,000 4K IOPS worth of data over the wire.

If you hit the IOPS cap, increase your link speed. A PCIe 4.0 x16 card can handle two 100GbE ports just fine. Note that as you increase IOPS, you'll eventually hit your operating system's IO scheduler limits somewhere in the 10M-15M IOPS range.
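
The "1,200,000 4K IOPS per 40Gbps" rule of thumb is just link bandwidth divided by I/O size. A back-of-the-envelope sketch, assuming 4096-byte I/Os and ignoring Ethernet, TCP, and iSCSI framing overhead (so real numbers will be somewhat lower); the function name is illustrative:

```python
def max_iops(link_gbps: float, io_size_bytes: int = 4096) -> float:
    """Upper bound on I/Os per second that fit through a link of the given speed."""
    bytes_per_s = link_gbps * 1e9 / 8  # Gbit/s to bytes/s
    return bytes_per_s / io_size_bytes


for gbps in (10, 40, 100, 200):
    print(f"{gbps} Gbps: ~{max_iops(gbps):,.0f} 4K IOPS")
```

For 40 Gbps this works out to roughly 1.22 million 4K IOPS, which matches the figure above.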

-- -----

The question then is actually keeping the NIC fed if you're going over the network. If you're local, it's at least much easier.

First you likely have data buffered in memory for reads. So RAM will cover you there. For writes, you probably want a FUSE pass-through filesystem on NVMe in front of your real backing store (if it's disk, and you're not pure NVMe), or alternatively a writeback cache. The idea here is that pass-through filesystem is basically a storage tier that sits in front of another volume (or volumes) in an entirely transparent manner, so it still appears as if you're writing to a volume that might just be a RAID with a large amount of disks, but instead it's being written to the (presumably mirrored or striped+mirrored) NVMe first to move it to the final destination later. Alternatively, you could also double it as additional read cache, if you don't need to use it all as a write cache too.

-- -----

That's basically what I've done. I have QSFP+ NICs in the storage server and my HEDT (10GbE, 2.5GbE and 1GbE in the rest of the lab hosts), a mirrored NVMe cache/ingest tier, and a boatload of 16TB HDDs in ZFS across two ZFS volumes behind it. This is further backed by 128GB of ECC memory for ZFS ARC, and all powered by an AMD Ryzen 7 PRO 5750GE that maxes out at just under 39W. The whole system, with the NIC, HBA, two 2TB NVMe drives, eight 16TB HDDs, SATA DOM, 128GB ECC, 8-core 35W TDP CPU, and onboard BMC idles at about 43W, and loads that can primarily hit the cache, sits in the 60-65W range, with an absolute system peak under 140W. These figures can be confirmed by a metered PDU.

This gives me ~80TiB of usable redundant storage, with native prosumer PCIe 4.0 x4 NVMe performance, over the network... with an idle of 42-43W that bumps up to 60-65W for most work.

dhess · on March 15, 2023

Do you mind spelling out your NAS’s specs? Chassis, motherboard, etc.?

MrFoof · on March 18, 2023

Off the top of my head...

* Silverstone RM21-308 2U Chassis

* SilverStone RMS08-20 rail kit

* SilverStone EPDM Sound Dampening Foam

* Three NoiseBlocker BlackSilentPRO PC-P fans (80x10mm)

* ASRock Rack X570D4U motherboard

* ASRock Rack TPM2.0 (Infineon SLB9665) module

* AMD Ryzen 7 PRO 5750GE

* ARCTIC MX-6 thermal paste

* Noctua NH-L9a-AM4 cooler

* Corsair SF450 Platinum SFX power supply (backplane needs four Molex power connectors)

* Extra twin Molex power cable for PSU

* LSI/Broadcom 9500-8i HBA

* SFF-8654 8i 74pin to Dual SFF-8087 Mini SAS Cable

* Two Noctua NF-A4x10 PWM fans. One for X570 chipset, one for HBA, with 3D printed mounts

* Intel XL710-QDA1 QSFP+ (single port) network adapter

* FS.com Customized 40G QSFP+ to 4x10G SFP+ Passive Direct Attach Copper Breakout Cable

* One SuperMicro 64GB SuperDOM (SSD-DM064-SMCMVN1)

* Two SK Hynix P31 Platinum 2TB PCIe 3.0 x4 NVMe. Mirrored and using 45Drives autotier.

* Two Be Quiet! MC1 Pro M.2 SSD Heatsinks

* Eight 16TB HDDs. Two Seagate IronWolf Pro, six shucked WDs with kapton tape over pins 1-3. Two-disk mirror, six disk RAIDZ2.

* Two more 16TB HDDs (spares).

* Running TrueNAS Scale 22.12.1.

MrFoof · on March 18, 2023

Forgot the memory.

* Four Kingston 32GB DDR4-3200 CL22 2Rx8 ECC Unbuffered Memory (KSM32ED8/32ME)

Havoc · on March 13, 2023

Really hope they release a smaller version for home use.

Something with say 8 slots would turn all those gen 4 pcie gaming motherboards retiring soon into a great NAS.

Asus I think already makes a similar one but it isn't fanless

toast0 · on March 13, 2023

You can do a passive x16 -> 4 x4, iff your board supports pci-e bifurcation. Theoretically, you could do x16 -> 8 x2 also passively, but I haven't seen bifurcation go down to x2. PCI-e switches are probably too expensive for anything active though.

Havoc · on March 13, 2023

Yep - board supports it. Unfortunately even the passive cards seem to be minimum 150 bucks.

I have a feeling that by the time I get round to this 8TBs may be so cheap that dual of those in the mobo ports may be enough haha

toast0 · on March 13, 2023

Here's one for half that... https://www.amazon.com/ASUS-M-2-X16-V2-Threadripper/dp/B07NQ...

This isn't an endorsement. Just an encouragement to shop more. This one says pci-e 3.0, fwiw, but I don't know how important pci-e 4.0 is to you?

Havoc · on March 14, 2023

Thanks :)

> I don't know how important pci-e 4.0 is to you?

My understanding was that 3.0 would bottleneck the nvme while 4.0 would not. 4 lanes of 1GB/s vs modern gen 4 nvmes at 6ish read.

All a bit academic in NAS I suppose & still very much a concept...think I can get a bit more gaming/desktop use out of the old faithful still

lyind · on March 13, 2023

This nice approach has at least these drawbacks:

1. Swapping drives is hard

   * may be overcome by declaring failure domain = node

2. No powerloss protection advertised to OS, ie. slow synchronous writes

   * may be overcome by software hacks and whole-system battery supply

3. Potential slowdown on continuous write load (weeks or months, depending on drive)

   * may be overcome by software in _some_ situations

At least the last two points are a no-go for enterprise use-cases, if not addressed.

eqvinox · on March 13, 2023

I'm gonna claim that this card isn't aimed at enterprise use-cases that need this kind of service. I'd put it along the lines of "nearline SAS" HDDs, aimed at non-critical applications where you care more about bulk capacity than reliability.

Relatedly, M.2 SSDs are inherently slower than the same pile of silicon in an U.2/2.5" form factor — the power/heat budget is noticeably lower.

ciupicri · on March 13, 2023

> No powerloss protection advertised to OS

How can the OS (I'm interested in Linux) know about this feature?

wtallis · on March 14, 2023

Enterprise SSDs almost always include power loss protection capacitors on the drive itself, so the drive can either directly advertise that its write caches are non-volatile or simply ignore cache flush requests from the host since data in the cache can already be considered durable from the host's perspective.

Unfortunately for this product, enterprise M.2 SSDs are almost always 110mm long rather than 80mm long, precisely because of the space taken up by those capacitors.
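
On the Linux side of the question, the block layer exposes its view of each device's write cache in sysfs as `/sys/block/<dev>/queue/write_cache`, which reads either `write back` (volatile cache, flushes are sent) or `write through` (no volatile cache reported). A minimal sketch that reads it follows; the function names are made up for illustration, and this only shows what the kernel believes about the cache. It does not prove that power loss protection capacitors are present.

```python
from pathlib import Path


def cache_is_volatile(write_cache_value: str) -> bool:
    """Interpret the contents of /sys/block/<dev>/queue/write_cache."""
    return write_cache_value.strip() == "write back"


def report(sys_block: Path = Path("/sys/block")) -> dict[str, bool]:
    """Map each block device name to whether its write cache is reported volatile."""
    result: dict[str, bool] = {}
    if not sys_block.is_dir():  # not Linux, or sysfs not mounted
        return result
    for dev in sorted(sys_block.iterdir()):
        attr = dev / "queue" / "write_cache"
        if attr.is_file():
            result[dev.name] = cache_is_volatile(attr.read_text())
    return result


if __name__ == "__main__":
    for name, volatile in report().items():
        kind = "volatile (flushes needed)" if volatile else "non-volatile or none"
        print(f"{name}: {kind}")
```

A drive that silently ignores flush requests, the second option described above, would still show up here as `write back`, so the two cases look different to the OS even though both rely on the same capacitors.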

rasz · on March 13, 2023

>Apex Storage doesn't reveal the inner details of the X21

>In a single-card configuration, the X21 delivers sequential read and write speeds up to 30.5 GBps and 28.5 GBps, respectively.

did they test it or reprint press release?

>According to Apex Storage

ah

>The AIC has an average read and write access latency of 79us and 52us

that doesn't make sense unless it's additional latency of controller or they ship it populated with drives.

>However, Apex Storage didn't expose the type of RAID arrays. The X21 also flaunts "enterprise-grade reliability," NVMe 2.0 support, advanced EEC, data protection, and error recovery. Apex Storage didn't reveal the pricing or availability for the X21.

so only revealed performance figures and pictures

Their previous product was a fancy looking bracket for holding 16 SATA M.2 drives https://www.kickstarter.com/projects/storage-scaler/storage-....

>A cross platform drop in card that can massively increase the amount of storage for your computer using cost effective m.2 SSDs.

You had to read the fine print to realize it's just an m.2 stand requiring a proper 16 port SATA controller to function. It still hasn't shipped to this day. I'm mildly optimistic.

wdb · on March 13, 2023

Nice, I am currently in the jungle of SSD enclosures for my Apple M2 Pro device and it’s pretty confusing. But 31 GB/s seems wild.

AceJohnny2 · on March 13, 2023

Note that a Thunderbolt 4 Hub can be a bottleneck:

https://eclecticlight.co/2023/02/21/thunderbolt-4-hubs-can-s...

manav · on March 13, 2023

Might be able to eventually get 80Gbps out of USB4v2.

formerly_proven · on March 13, 2023

Note 32 Gb/s, not GB/s.

phonon · on March 13, 2023

No, it's GB/s. PCIe 4.0 x16 has a bandwidth of 32 Gigabytes/s.
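
The 32 GB/s figure is the rounded raw number. A short sketch of the arithmetic, assuming the per-lane signalling rates of 8, 16, and 32 GT/s for PCIe 3.0, 4.0, and 5.0, accounting only for 128b/130b line encoding and ignoring packet and protocol overhead; the names are illustrative:

```python
# Per-lane raw signalling rate in GT/s for each PCIe generation.
GT_PER_S = {3: 8.0, 4: 16.0, 5: 32.0}

# PCIe 3.0 and later use 128b/130b encoding: 128 payload bits per 130 line bits.
ENCODING_EFFICIENCY = 128 / 130


def pcie_gb_per_s(gen: int, lanes: int) -> float:
    """Approximate one-direction link bandwidth in GB/s, after line encoding."""
    return GT_PER_S[gen] * ENCODING_EFFICIENCY * lanes / 8  # bits to bytes


for gen, lanes in ((3, 4), (4, 4), (4, 16), (5, 16)):
    print(f"PCIe {gen}.0 x{lanes}: {pcie_gb_per_s(gen, lanes):.2f} GB/s")
```

By this estimate a PCIe 4.0 x16 link carries about 31.5 GB/s in each direction, and a PCIe 3.0 x4 link about 3.9 GB/s.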

vardump · on March 13, 2023

Same issue here, found anything good?

wdb · on March 13, 2023

Not yet. Orico seems pretty disappointing: 600MB/s for their 20Gbps enclosures. I would have thought you should already get that with their 10Gbps ones. So I'm a bit wary of trying out their 40Gbps enclosures.

qwertox · on March 13, 2023

I wonder how the SSDs are exposed to the OS.

While dealing with the Samsung Pro Firmware issue, I read that SSDs mounted on a hardware RAID controller need to be removed from the RAID in order to have their Firmware update applied, since Samsung's tool won't see the SSDs if they are placed on the controller.

rasz · on March 13, 2023

If it's indeed PM42100 based then you will see separate drives.

rektide · on March 13, 2023

CXL will also make attaching drives & ram a much easier experience, much more regular.

rkagerer · on March 14, 2023

Finally! A card worthy of replacing my Areca controllers.

Now, is there an NVMe equivalent?

_joel · on March 13, 2023

I'll take two please