DELIGHT
SCALE
Cultural Barriers to AI Adoption: Key Takeaways from the 2025 Microsoft New Future of Work Report
The Microsoft New Future of Work Report 2025 marks a significant milestone in how we understand the ongoing transformation of work. Over the past five years, the report series has traced the evolution of work, from remote and hybrid work models to the emerging integration of AI across teams and organizations. This year’s edition shifts…
The AI Infrastructure Gigacycle: A Primer for 2026
Note for paid subscribers: As we kick off the Dilligence Stack, I felt we needed to publish some anchor reports to set the foundation we will build upon. So, this report is quite long but has needed depth in each section. Each section will get its own deep dive in the coming months as well.…
Explainer: What Gemini powering Siri really means
Apple and Google did their joint announcement today, confirming Mark Gurman’s report from August (I swear this guy lives in the walls of Apple Park) that the new Siri and Apple Foundation Models will be based on Gemini models and technology. There’s a lot of nuance to this that many are overlooking from a product,…
The AI Bubble Question: Two Scenarios for the Largest Technology Buildout in History
There is perhaps no more consequential debate around the technology industry today than whether the current AI infrastructure buildout represents a bubble destined for collapse or the logical, sustainable deployment of mature technology. The numbers are indeed staggering, a root cause of people’s anxiety: hyperscalers are spending north of >$200 billion annually (and growing) on…
Running a 1T parameter model on a $40K Mac Studio Cluster
This will be a brief research note, mostly because I’m just talking about something cool. Back in March when Apple launched M3 Ultra in the Mac Studio, they graciously sent over a 512GB Unified Memory SKU of the system. It was insane for running LLMs, powering something like 4-bit quantized Deepseek R1 but limiting with…
The GPU’s Second Act: From Pixels to Tokens
The Architecture Graphics Built For decades, GPUs existed to generate images fast enough to feel real. This requirement forced a very specific kind of silicon—hardware capable of running the same mathematical operation across massive amounts of data in parallel, repeatedly, without stalling. Graphics was never really about “drawing pictures.” It was a continuous simulation under…