DELIGHT
SCALE

January 13, 2026 / Carolina Milanesi

Cultural Barriers to AI Adoption: Key Takeaways from the 2025 Microsoft New Future of Work Report

The Microsoft New Future of Work Report 2025 marks a significant milestone in how we understand the ongoing transformation of work. Over the past five years, the report series has traced the evolution of work, from remote and hybrid work models to the emerging integration of AI across teams and organizations. This year’s edition shifts…

January 13, 2026 / Ben Bajarin

The AI Infrastructure Gigacycle: A Primer for 2026

Note for paid subscribers: As we kick off the Dilligence Stack, I felt we needed to publish some anchor reports to set the foundation we will build upon. So, this report is quite long but has needed depth in each section. Each section will get its own deep dive in the coming months as well.…

January 12, 2026 / Max Weinbach

Explainer: What Gemini powering Siri really means

Apple and Google did their joint announcement today, confirming Mark Gurman’s report from August (I swear this guy lives in the walls of Apple Park) that the new Siri and Apple Foundation Models will be based on Gemini models and technology. There’s a lot of nuance to this that many are overlooking from a product,…

January 1, 2026 / Ben Bajarin

The AI Bubble Question: Two Scenarios for the Largest Technology Buildout in History

There is perhaps no more consequential debate around the technology industry today than whether the current AI infrastructure buildout represents a bubble destined for collapse or the logical, sustainable deployment of mature technology. The numbers are indeed staggering, a root cause of people’s anxiety: hyperscalers are spending north of >$200 billion annually (and growing) on…

December 18, 2025 / Max Weinbach

Running a 1T parameter model on a $40K Mac Studio Cluster

This will be a brief research note, mostly because I’m just talking about something cool. Back in March when Apple launched M3 Ultra in the Mac Studio, they graciously sent over a 512GB Unified Memory SKU of the system. It was insane for running LLMs, powering something like 4-bit quantized Deepseek R1 but limiting with…

December 18, 2025 / Ben Bajarin

The GPU’s Second Act: From Pixels to Tokens

The Architecture Graphics Built For decades, GPUs existed to generate images fast enough to feel real. This requirement forced a very specific kind of silicon—hardware capable of running the same mathematical operation across massive amounts of data in parallel, repeatedly, without stalling. Graphics was never really about “drawing pictures.” It was a continuous simulation under…

Trusted by 80% of the top 10 Fortune 500 technology companies