Calculating Pi: My Attempt at Breaking the Pi World Record

simias · on Feb 17, 2020

I appreciate the author sharing his experience but I must admit that it was less exciting that I expected it to be. He basically used off-the-shelf hardware, ran an off-the-shelf program and then waited for a while. There's no obvious innovation, anybody who cares enough to break that record and has ~$20k lying around can do it.

If anything it got me a lot more curious about this y-cruncher program and all the fun optimizations it must implement.

darkwater · on Feb 17, 2020

The previous record, which was basically just corporate promotion [1], did use the exact same program.

For a sysadmin/hardware nerd this post was really interesting, although I would have probably have appreciated even more details (quick example, how many disks failed partially/totally during the process?).

[1] https://cloud.google.com/blog/products/compute/calculating-3...

krzepah · on Feb 17, 2020

I understand your disappointment but I'd like to point out this is a really nice achievement on it's portfolio as a sys admin

prox · on Feb 17, 2020

It also is a great side project; And having tested his system he donated it for STEM research. There’s lots to commend in this project.

blazespin · on Feb 17, 2020

I know it isn't probably polite, but would really love to know the bill of materials.

I have to say though, I think it's very cool if he came on budget. I'm very interested to see what he can do for STEM research.

These cloud vendors are way too expensive compared to what he can do off ebay.

Someone · on Feb 17, 2020

http://www.numberworld.org/y-cruncher/ (Mostly closed source)

tromp · on Feb 17, 2020

Reminds me of my attempt to compute the number of Go positions, which similarly needed many TB of disk space and ended up generating 30 petabytes of disk IO.

https://news.ycombinator.com/item?id=9167781 Number of legal 18x18 Go positions computed. One more to go

https://news.ycombinator.com/item?id=10950875 Number of legal Go positions computed

Not much point in breaking that record, as 19x19 is the largest (and standard size) Go board.

mikorym · on Feb 17, 2020

For those that are interested in reciting digits of Pi (I would guess you can either memorise the digits or calculate as you go) the current record is held by a South African: https://www.youtube.com/watch?v=_mGjJMVKWcU

las_balas_tres · on Feb 17, 2020

He only won the record for the fastest recall of pi. 1500 digits in 4 minutes. Daniel Tammet recited 22514 in 5 hours. https://en.wikipedia.org/wiki/Daniel_Tammet

begemotz · on Feb 17, 2020

the Guinness world record is over 70k digits set by Rajveer Meena: https://www.guinnessworldrecords.com/world-records/most-pi-p...

imglorp · on Feb 17, 2020

Such an awe inspiring achievement, to store that much structured data accurately. What else is the brain capable of doing?

Ma8ee · on Feb 17, 2020

Structured?

imglorp · on Feb 17, 2020

Lossless, ordered set of symbols with no errors.

It's much different than remembering a story or a travel route or an image.

Ma8ee · on Feb 17, 2020

Yes, but structured is in general in opposition to random, which the digits of pi seem to be. No one knows of any structure in the digits of pi.

athriren · on Feb 18, 2020

But pi defines its structure. By being seemingly random and irrational, it has created a specific ordered set of symbols and imbued that set with meaning because it has a referent: we call it pi.

Ma8ee · on Feb 18, 2020

It’s not random, but that doesn’t mean it is structured. And there’s absolutely no difference between the digits of pi and a completely random sequence for anyone trying to memorise them.

gesman · on Feb 17, 2020

>> As luck would have it, a transformer next to my house blew and the power went out yet again. This means I had to restart from the last checkpoint yet again, losing another 2 weeks worth of work.

Ouch!!!

nixpulvis · on Feb 17, 2020

Man, it fucking sucks.

I remember a time back freshman year of collage... It was an extra credit assignment. Who could factor the largest number (or something like this) in the least time before the due date. My friend and I devised an algorithm, and launched it on my computer. I honestly don't remember anything other than the fact the god damn power transformer downtown blew and fucked up our tests because of a large scale power outage. I assume we'd have not even come in top #5 for that assignment (who knows), but it's just so frustrating.

Basically, OSs should have better fail recovery mechanics than are the default.

malux85 · on Feb 17, 2020

The OS recovered right?

It was your program that did not

kragen · on Feb 17, 2020

Running under EUMEL, L3, or KeyKOS would have enabled the program to continue from a recent checkpoint, without requiring any logic for this in the program itself.

malux85 · on Feb 17, 2020

Oh of course! EUMEL, L3 or KeyKOS! Why didn’t I think of them?!

kragen · on Feb 17, 2020

Probably you're so familiar with modern computing environments — and unfamiliar with any others — that you take their drawbacks for granted, perhaps even assuming they are unavoidable.

malux85 · on Feb 17, 2020

This got me thinking, and I have a few weeks spare time - can you please recommend something that’s as fundamentally different as possible so that I can get way out of my comfort zone?

I can’t reply to your below comment I’m not sure why - but you seem experienced in alternative computing environments, whereas I’m mostly a HPC python / c++ developer that’s spent the last 10ish years doing deep learning and scientific computing - the newer environment doesn’t have to be practical at all, I’m interested to use it for a change in perspective

kragen · on Feb 17, 2020

Well, what's your comfort zone?

I wish I had a recommendation based on experience for one of these really strange operating systems like EUMEL, Guardian, OS/400, and L3. But I don't. I've used CP/M and MS-DOS, but those are just really limited, not really interesting. Although, with ZCPR and 4DOS, you could make them reasonably usable, it was like coming out of Plato's cave when I switched my primary operating environment from 4DOS to csh on Ultrix.

Squeak is a pretty different operating environment that isn't simply primitive. Oberon is another. They can both run as user processes on top of Linux, as well as on bare metal. Both of them are somewhat alien.

Are you comfortable with embedded development? If not, try Arduino. It starts out easy, since you program the boards in C++, but you have the opportunity to build things that will run for months on a AA battery with submicrosecond interrupt response time — because there's no OS. (It's routine for even programming novices to write their own interrupt handlers.) Arduino instantly gives you the ability to measure things on microsecond timescales, a thousand times faster than you can normally see. Modern boards like the Blue Pill have response latencies in the 100-nanosecond range when they're awake. That's the time it takes light to go 30 meters, as you're probably aware.

In retrocomputing land, VMS was the first OS I used that was really usable. The OpenVMS Hobbyist Program still exists, and it's actually possible to run old versions of Mozilla on it. F-83 was an interactive Forth IDE that provided higher-order programming, virtual memory, and multithreading under MS-DOS, in 1983 — without syntax or types. Turbo Pascal was also an IDE, in a way the first modern IDE, around the same time; the first versions ran on CP/M and MS-DOS. But I think that you kind of had to be grappling with the limitations of BASIC on those systems to appreciate that.

There are Pick systems that still have enthusiastic users: https://www.pickwiki.com/index.php/Pick_Operating_System but they don't sound appealing to me. Other systems with cult fanbases include FileMaker, HyperCard, and Lotus Agenda, which last I think you can run successfully under FreeDOS. Agenda is interesting in part because it's so alien. (It's easy to forget that it was normal at the time to have to use the program manual to figure out how to exit.)

There are a bunch of modern specialized development environments that can do strange things. Radare2 is an environment focused on reverse engineering. Emacs is focused on text editing, but for some reason it's also the main user interface for interactive proof assistants like Coq and Lean, which are shaping up to be pretty interesting. R is focused on statistics. Jupyter is sort of focused on data visualization, although not really. (Now I see you've been doing deep learning for 10 years, so I guess Jupyter is your best friend.) LibreOffice Calc is focused on rectangular arrays of mostly numerical data (although in many cases their most advanced users use Excel instead). You can develop applications in all of them.

How about math? It's one thing to invoke a Runge-Kutta integration method; it's another to be able to prove convergence bounds on it. And machine-checked formal proof is shaping up to be an interesting thing, like I said.

How about cryptography? That has the advantage that there are right answers and wrong answers, so you can test your code.

How about shaders? Shadertoy is accessible and super fun. Maybe that's too similar to HPC, but the shader parallelism model (similar to ispc) is pretty different from both AVX and MPI.

How about mobile development? SIGCHI papers are full of experimental user interface ideas to explore, and Android Studio is free and relatively usable, if clumsy. Have you seen Onyx Ashanti's Beatjazz?

In the neighborhood of beatjazz, there's livecoding. It's a thrill to get a nightclub full of people dancing to your code, and there are a bunch of different environments.

GNU Radio with an RTL-SDR makes it possible for you to run DSP algorithms on RF signals over a pretty wide frequency range, with applications in communications and sensing. Maybe if you've been doing HPC, DSP is already second nature, but if not it might be rewarding. And DSP has close connections to control theory and image processing, as well as the more obvious applications.

How about alternative programming paradigms? If you're comfortable in procedural and OO programming, how about extreme alternatives — answer-set programming like miniKANREN, constraint-logic programming (as supported by modern Prologs https://www.metalevel.at/prolog/clpz not just Mozart/Oz), Erlang-style fault-tolerance-focused programming, APL-style array programming (though maybe you're familiar enough with that to take it for granted), or Forth? How about strongly typed programming like Haskell, Rust, or OCaml? (And of course Haskell is purely functional, and OCaml is mostly so.)

And STM solvers like Z3 can easily solve problems now that were infeasible only a few years ago.

Also, wasm.

Or maybe try hacking together some games in Godot.

I don't know, myself I find that it's hard to avoid getting out of my comfort zone in some direction, just because the world is so big and my knowledge is so small. Deep learning is the out-of-my-comfort-zone programming thing I want to try next!

malux85 · on Feb 17, 2020

Holy moly. Over the past hour I've come back to your response and read it several times. There's so much to unpack there.

I started writing several replies but felt I wasn't able to give you the praise you deserved, but the passage of time compels me to respond so please know you have my full gratitude

Thank you so much for such a detailed reply

kragen · on Feb 17, 2020

I'm so glad it's helpful! I was worried it might be overwhelming.

kragen · on Feb 17, 2020

*SMT solvers

STM is either a scanning tunneling microscope or software transactional memory, both of which are pretty interesting, but Z3 is neither.

Also, it occurred to me that I didn't even mention browsers, probably because I myself am so comfortable with them. The big software platforms right now are POSIX, Android, Java, browsers, and whatever Microsoft is doing now — it's enough effort to reuse code written for one environment for another that people often just rewrite it. Browsers have by far the best GUI library — not that the DOM is a great or even acceptable interface, but things like React and D3 are — and the major advantage that if you write stuff in a browser you can immediately show it to many people. The development tools are insane, and HTML5 supports most cellphone peripherals in a cross-platform way.

tedunangst · on Feb 17, 2020

Any OS under VMware or KVM too, no?

kragen · on Feb 17, 2020

I don't think those automatically checkpoint the system state every 30 seconds, do they? They didn't use to. Hmm, maybe they can now? https://wiki.qemu.org/Features/MicroCheckpointing

acqq · on Feb 17, 2020

“ This implementation currently only supports buffering for the network. (Any help on implementing disk support would be greatly appreciated).”

Razengan · on Feb 17, 2020

macOS also provides auto-save, snapshots, and some crash recovery for most native apps.

unlinked_dll · on Feb 17, 2020

Same thing happened to me in college. I actually saw the transformer blow, it's kinda freaky (and that ozone smell is horrid, you only get to smell it when you weld or really screw the pooch).

Except I wasn't working on anything academically meaningful, more spiritual. I was the only one that could fit a keg in my trunk. Thankfully it was a day party in the spring.

psaux · on Feb 17, 2020

Or, the ozone from a Tesla coil. :) In eighth grade I built one, and I actually still like the smell to this day.

bArray · on Feb 17, 2020

> I actually still like the smell to this day.

Air-conditioning units with ionizers generate ozone, it's why they smell sweet.

P.S. Ozone is meant to be pretty bad for you.

fctorial · on Feb 17, 2020

Every os has a suspend feature.

tedunangst · on Feb 17, 2020

For a prolonged power outage, the hibernate feature is probably more useful. And a UPS to perform it. But generally not impossible to survive such an event.

kozak · on Feb 17, 2020

By "compressed Pi digits", does the author mean simply an efficient encoding of decimal digits in binary, or there is some way to compress Pi digits further? I thought they were incompressible like random.

Someone · on Feb 17, 2020

Typically, that means storing two digits in a byte (halves storage size compared to a text string), 9 in each 32 bits (gains another 11%), 19 in each 64 bits (gains another 5%), or something similar (at this scale, I would guess it uses at least the ‘19 digits in each 64 bits’)

Idea is to not use “one digit per byte”, but to keep addressing individual digits cheap.

”I thought they were incompressible like random.”

They’re easily compressed, if you accept taking this program and it’s configuration file as a compressed version.

(And yes, the output of a pseudo-random number generator compresses extremely well, too)

lifthrasiir · on Feb 17, 2020

> at this scale, I would guess it uses at least the ‘19 digits in each 64 bits’

Your guess is correct, y-cruncher uses that exact format [1].

[1] https://github.com/Mysticial/DigitViewer

maweki · on Feb 17, 2020

That's the notion of kolmogorov complexity. The smallest program that can generate this output. Pi and I guess any other algebraic number, no matter how randomly distributed its digits are, are not that complex.

sp332 · on Feb 17, 2020

Pi is transcendental, not algebraic.

maweki · on Feb 17, 2020

Yeah, sorry, I was not clear on that one. I meant the downwritable ones. Of course, pi has no finite extension.

wang_li · on Feb 17, 2020

Use base pi and the shortest string of digits representing all of the digits of pi can be written as '1'.

This project seems pointless to me. There's no scientific value in knowing this many digits of pi. He used off the shelf components and off the shelf software. So there was no new engineering that advance the state of the art. At the end of the day the only thing of meaning that happened her is he used a bunch of electricity (and a corresponding CO2 release) for no socially valuable purpose.

bArray · on Feb 17, 2020

If they're stored as ASCII characters, there will be tonnes of compression that could be done.

matsemann · on Feb 17, 2020

But

> Did you win the Putnam? [0]

[0]: https://news.ycombinator.com/item?id=35076

thinkloop · on Feb 17, 2020

> I also saw that it cost them around $200,000, which is very expensive. I’m aiming to stay below 5% of that overall amount.

He may have stayed below $10K in hardware, but there is no way that includes the electricity needed to run the machines 24/7 for half a year.

dkdk8283 · on Feb 17, 2020

Assuming 3kw and a PUE of 1.5 for cooling comes out to approx 20,000 kwh of power. Assuming a high-ish rate of $.13/kwh it comes out to around 2.5k for 6m. Not too bad.

fizixer · on Feb 17, 2020

19-digit-per-64-bit compression storage requires about 19TiB [0].

1-digit-per-byte requires about 45TiB [1].

Can anyone explain how it requires 38TiB for final output?

[0] https://www.google.com/search?q=8+*+%285e13+%2F+19%29+%2F+10...

[1] https://www.google.com/search?q=5e13+%2F+1048576+%2F+1048576

redcalx · on Feb 17, 2020

If you cross byte boundaries, i.e. compactly pack the bits such that there are no wasted bits; then you have 4 bits per decimal digit, 50T digits becomes 200T bits == 25TB. Technically there is still some slack in there because each 4 bit block represents 16 values, and we only use 10 of those.

Pigo · on Feb 17, 2020

I got curious about the BOINC platform he mentioned. It reminds me of the SETI@home screensaver I used to run many years ago. I guess I'd just forgotten about it at some point, but I used to enjoy watching the data it was processing. It's pretty cool that there's another platform out there that lets you contribute to other programs.

prox · on Feb 17, 2020

The boinc wiki mentions seti@home is part of this platform. It probably grew out of this initiative.

kbob · on Feb 18, 2020

Did he calculate the value in decimal? The record would probably be a lot easier to achieve if he'd gone with, say, base 14. Everybody does decimal, even though there's no theoretical advantage, and no practical advantage since nobody actually needs trillions of digits of pi.

mNovak · on Feb 17, 2020

Curious now to know how much would this cost to compute in AWS?

dannyw · on Feb 17, 2020

Probably 5x as much. Maybe 2-3x as much using spot instances.

With AWS, you pay (dearly) for the ability to scale at a moments notice. You do not save money for predictable workloads.

While not applicable to this case, bandwidth costs are absurdly marked up expensive as well; like by a factor of 10x.

thesandlord · on Feb 17, 2020

Using the setup for the 31.4 trillion digits that Emma calculated with GCP (previous world record): https://cloud.google.com/blog/products/compute/calculating-3...

GCP Pricing Page: https://cloud.google.com/products/calculator#id=2eca3cef-746...

So ~$200 - $250k

Could probably save a good bit with committed use discounts.

Spot / Preemptible instances would not work, in fact before Emma did this calculation a lot of people thought this kind of thing wasn't possible on public cloud because of perceived instabilities in a multi-tenant system.

mNovak · on Feb 17, 2020

Does AWS still give $100k credit to YC startups? I think a couple should team up..

paranoidrobot · on Feb 17, 2020

I'm trying to do some estimates using the AWS Pricing calculator.

You can't go with a single instance and a ton of EBS storage, because it caps out at 16TB of disk, and 19Gbit[0] of EBS bandwidth, even on an instance with 100Gbit networking.

So, depending on how you can allocate storage, you're probably going to need some kind of clustered filesystem like GlusterFS

It's also not clear how well the application can spread it's writes - if it's all focussed on writing one file at a time, we need the most throughput to a single node at a time.

Storage:

Option 1: "GlusterFS: Hope it spreads writes" 20x c5n.9xlarge (36x vcpu/96GB RAM/50Gbit NIC / 9.5Gbit EBS) + 16TB st1 HDD storage each = $43k

Option 2: "GlusterFS: more EBS IO" 20x c5n.9xlarge (72x vcpu/192GB RAM/100Gbit NIC / 19Gbit EBS) + 16TB gp2 SSD storage each = $89k

Option 3: "GlusterFS: hey local storage is faster/cheaper" 6x i3en.24xlarge (96x vcpu/768GB RAM/100Gbit NIC/ no EBS) = $47k

I was wondering about insane ideas like using mdadm in RAID0 over NFS Mounts presented by (say) 281 t3.2xlarge instances each with 1x 1TB EBS volume. That comes out at around 62k for the storage instances.

Compute: I don't know how important CPU vs Disk IO bandwidth is. The instance the author is using has 4x 15 cores (60 cores total). The most I can get with standard EC2 instances is 64 cores, but that has 25Gbit network, and the next down from that is 48 cores.

1x i3en.24xlarge (96x vcpu/768GB RAM/100Gbit NIC/ no EBS) = $7.9k

There are bare metal instances with more cores/memory and up to 100Gbit networking[1], but I can't find any pricing on them.

All up, I think $51-55k/month using standard instances would probaably do the job.

[0] There are the bare metal instances mentioned in the link below that get up to 28Gbit EBS per instance, but again no details on pricing.

[1] https://aws.amazon.com/blogs/aws/ec2-high-memory-update-new-...

jsjohnst · on Feb 17, 2020

> You can't go with a single instance and a ton of EBS storage, because it caps out at 16TB of disk

I just used the calculator to price out a single instance without issue. Just type in nineteen 16TB EBS volumes (you’d create an LVM volume group for them if launched). I used to have EC2 instances (albeit not by choice, I inherited the bad architecture) with 42TB total of EBS volumes using LVM without issue.

paranoidrobot · on Feb 17, 2020

Ah, my mistake.

I didn't realise they'd upped the maximum volume size from 1TB to 16TB, so thought the calculator was telling me it was capped at 16x EBS volumes per instance. The new calculator isn't helping things here[1] telling me that I can only assign 16TB to an instance.

So, given that - then the issue becomes is 25Gbit NIC / 19Gbit EBS bandwidth enough IO to at least equal the needs of the task. On paper the total bandwidth of the author's disk controller was 24Gbit, but that will depend on how the output is spread and whether the EBS limit includes any overheads that aren't present in DAS.

Interestingly, if the requirements are mostly sequential, you can get better performance/$ going with throughput-optimised HDDs rather than gp2 SSD.

Applying striping in either case will ensure you saturate the per-instance EBS bandwidth limits.

So, single instance calculations:

r5.24xlarge (96vCPU/768GB RAM/25Gbit NIC/19Gbit EBS) = $4.4k

Storage: 18x 16TB gp2 SSD (250MB/sec / 16K IOPS) = $31k 18x 16TB st1 HDD (500MB/sec / 500 IOPS) = $14.5k

[1] https://calculator.aws/

jsjohnst · on Feb 18, 2020

Those are per month costs, right? Considering it took multiple months to calculate the 50T digits of Pi, huge difference in cost.

tyingq · on Feb 17, 2020

OVH would be relatively cheap. They have 72TB storage servers for $485.99/month.

layer8 · on Feb 17, 2020

I wonder how many bit flips are to be expected on HDDs or in RAM for that amount of data.

bn7t · on Feb 17, 2020

Is there a link somewhere where he published the result?

ficklepickle · on Feb 17, 2020

I found it, but clicking it caused my computer to run out of available memory.

Edit: on a more serious note, a site[0] that tracks these records says:

> Downloading of digits is no longer available due to the massive bandwidth requirements. Your best bet is to directly contact one of the record holders and see if they still have a copy of the digits.

[0] http://www.numberworld.org/digits/Pi

LeifCarrotson · on Feb 17, 2020

The compressed representation requires 44 TB of disk space.

Assuming the author has a typical home internet connection with about 5 Mbps upload rate, the transfer would take 2 years longer than it took to actually run the calculation in the first place!

brightbeige · on Feb 17, 2020

Here’s more info about the race between calculating and downloading digits of pi

https://opendata.stackexchange.com/a/4024/1511

saagarjha · on Feb 17, 2020

Have the results been verified yet?

aaronbwebber · on Feb 17, 2020

It appears they have, the y-cruncher program he used does verification. Alexander Yee, who wrote the y-cruncher program (which has been used for the previous 4 world record calculations of digits of Pi), has accepted it and posted it on their site, along with screenshots of the output of the program.

http://www.numberworld.org/y-cruncher/

hamiltont · on Feb 17, 2020

One cool note per my reading here - prior record holder Emma Haruka Iwao was working for GCP as a developer advocate, and GCP makes her 31.4 trillion digits available via cloud images. Neat :-)

For those of us with less insane needs, they also exposed an API to grab the specific digits of interest - https://pi.delivery/

not2b · on Feb 17, 2020

Output in hex is easier to check, because of https://en.wikipedia.org/wiki/Bailey%E2%80%93Borwein%E2%80%9...

droithomme · on Feb 17, 2020

[flagged]

saagarjha · on Feb 17, 2020

He almost certainly broke the record, and his word choice was intentionally modest.

saltyfamiliar · on Feb 17, 2020

What an irresponsible use of energy. At least mine Bitcoin...

stjo · on Feb 17, 2020

Of all the “irresponsible” uses of energy, this is the one I endorse the most.

goldenkey · on Feb 17, 2020

It's not irresponsible if the dude needed heat in his home. Far and few people realize that all electronics are heaters that happen to do computation in the process of conversion of electricity to heat. Sure, it's not convective if there aren't fans to disperse the heat.. but it's a beefed up computer so I am sure he had good PC fans.

In any case, heat that is created without convection fans to spread it quickly, will still diffuse into the home albeit at a slower pace. It has to go somewhere..

saagarjha · on Feb 17, 2020

> Far and few people realize that all electronics are heaters that happen to do computation in the process of conversion of electricity to heat.

Surely anyone who's used a computer with a fan in it knows this…

goldenkey · on Feb 18, 2020

That's easy to say but pretty much everyone I've talked to, albeit the few oddities, thought that dedicated space heaters were way different than X-Box getting hot, or computer getting hot. Think of the cognitive dissonance to switch to all LED bulbs in a cold climate, because you're told it's more efficient, meanwhile using electric space heaters at the same time... It happens more often than you think, given the politics.

Of course, there are some heating methods that are cheaper than electricity - like natural gas. So one would have to factor that in, along with power plant emissions vs home emissions.

It's a complex entrenchment. It'd be interesting to produce a video of street interviews just to see what the average person thinks about electronics getting hot. It would make for a good case study of the intersection between common sense/basic physics knowledge/and energy/eco politics.

xorfish · on Feb 17, 2020

Heat pumps are quite a bit more efficient than electric heaters.

finnh · on Feb 17, 2020

... only if you're willing to cool something else ;)

goldenkey · on Feb 18, 2020

Every time we clean something, we simply dirty something else. Ala Einstein, energy is neither created nor destroyed. Chaos is neither created nor destroyed, simply moved. And to move chaos, requires isotropic heat emission, which releases chaos in itself - hence entropy increasing. Thus, moving chaos, increases the total chaos a little bit. Until there's chaos everywhere and nothing can be moved, ie. heat death of the universe. Prepare for the big crunch, the big pancake deep freeze, the cyclic big bang, conformal cyclic rebirth, or whatever your cosmological spirit holds to be an inkling of inclination. :-)