Research Archive
Explainer: What Gemini powering Siri really means
Apple and Google did their joint announcement today, confirming Mark Gurman’s report from August (I swear this guy lives in the walls of Apple Park) that the new Siri and Apple Foundation Models will be based on Gemini models and technology. There’s a lot of nuance to this that many are overlooking from a product,…
The AI Bubble Question: Two Scenarios for the Largest Technology Buildout in History
There is perhaps no more consequential debate around the technology industry today than whether the current AI infrastructure buildout represents a bubble destined for collapse or the logical, sustainable deployment of mature technology. The numbers are indeed staggering, a root cause of people’s anxiety: hyperscalers are spending north of >$200 billion annually (and growing) on…
Running a 1T parameter model on a $40K Mac Studio Cluster
This will be a brief research note, mostly because I’m just talking about something cool. Back in March when Apple launched M3 Ultra in the Mac Studio, they graciously sent over a 512GB Unified Memory SKU of the system. It was insane for running LLMs, powering something like 4-bit quantized Deepseek R1 but limiting with…
The GPU’s Second Act: From Pixels to Tokens
The Architecture Graphics Built The GPU exists because graphics demanded a very specific kind of silicon—hardware capable of running identical mathematical operations across massive data volumes in parallel, repeatedly, without stalling. What looked like “drawing pictures” was always a continuous simulation under tight latency constraints. When neural networks arrived, they leaned on the same core…
Why NVIDIA’s DGX Spark is the best desktop CUDA testbed
This year on Black Friday, one of my favorite products of the year: NVIDIA’s DGX Spark! My list of favorite products is by no means short, but this was a quick add simply because I’ve had so much fun playing with this. Unlike most other products I’ve used this year, it’s something that feels rewarding…
The Semiconductor Gigacycle
The semiconductor industry has experienced cycles of varying magnitude throughout its history. The PC era brought sustained growth. The smartphone revolution created what many called a supercycle. The cloud computing buildout extended that expansion further. What is happening now is something categorically different. The artificial intelligence infrastructure buildout represents the largest total addressable market expansion…