Featured Posts
NVIDIA's NVLink Fusion: True Innovation or Strategic Lock-in?
Earlier this week at Computex, Jensen Huang introduced NVLink Fusion, positioning it as a means to "democratize scale-up" by allowing customers to mix and match compute architectures. On the surface, this suggests flexibility: integrating CPUs, GPUs, and specialized silicon, all interconnected via NVIDIA's high-performance NVLink. On closer examination, however, it looks more like an illusion of choice.
From Silicon to Token
The world of large‑language‑model inference moves fast. Meta’s Llama 4 and DeepSeek's range of models turn yesterday’s “good enough” hardware into today’s bottleneck, so picking the right platform is more strategic than ever. I compared eight options that keep popping up in engineering and sales conversations, including consumer RTX GPUs, Apple Silicon, NVIDIA’s H‑series, Groq’s purpose‑built LPU, Cerebras’ wafer‑scale engine, and turnkey DGX workstations. Each proves valuable in th
Our Latest Insights
GTC 2024 post-conference
Having returned from GTC24, I've been able to reflect on the new updates across NVIDIA's platforms; below is a summary of the announcements. Blackwell was the star of the show, with the B100, B200 and GB200 chips announced. Note that no consumer-facing graphics cards (RTX) were named, nor was there a successor to the L40S or BlueField3 DPU (though there was a new ConnectX8 NIC). As always though, the devil is in the details - as well as marketing waxing lyr
GTC 2024 preview
Who's new in the next-gen cooling zoo?