Featured Posts
NVIDIA's NVLink Fusion: True Innovation or Strategic Lock-in?
Earlier this week at Computex, Jensen Huang introduced NVLink Fusion, positioning it as a means to "democratize scale-up" by allowing customers to mix and match compute architectures. On the surface, this suggests flexibility: integrating CPUs, GPUs, and specialized silicon, all interconnected via NVIDIA's high-performance NVLink. However, upon closer examination, this appears to be more of an illusion of choice.
From Silicon to Token
The world of large‑language‑model inference moves fast. Meta’s Llama 4 and DeepSeek’s range of models turn yesterday’s “good enough” hardware into today’s bottleneck, so picking the right platform is more strategic than ever. I compared eight options that keep coming up in engineering and sales conversations, including consumer RTX GPUs, Apple Silicon, NVIDIA’s H‑series, Groq’s purpose‑built LPU, Cerebras’ wafer‑scale engine, and turnkey DGX workstations. Each proves valuable in th
Our Latest Insights
GPUs: What are they, where did they come from, why do I need one for AI?
The history of Graphics Processing Units (GPUs) is a fascinating journey that begins with their humble origins as tools for enhancing 3D graphics in video games. These early GPUs (then called 3D accelerators), such as the 3dfx Voodoo2 released in 1998, laid the foundation for what would become a transformative force in computing technology. Early Beginnings and Gaming Acceleration: In the late 1990s, the gaming industry was on the cusp of a revolution. The demand for more realistic and immersive
Immersion Datacenter Cooling: Future-Proofing
As chips and designs continue to push boundaries with ever higher Thermal Power Densities (TPDs), managing the dissipated heat becomes increasingly challenging. Current air-cooled rack designs typically top out at around 10-20 kW per rack. Given the Thermal Power Densities of modern CPUs and GPUs, that ceiling quickly limits how many of these heat-producing components can fill a rack. In a leaked Gigabyte roadmap, it has been revealed that most components ar
AI: Essential Considerations for Hosting Your Own Models
Artificial Intelligence (AI) has become a pervasive buzzword in the industry, evoking various reactions, including an amusing supercut of Sundar Pichai at Google I/O 2023... AI demands high-powered compute, which necessitates effective cooling, with immersion cooling being a popular choice. However, this article will focus on other aspects of AI. Given the immense hype surrounding AI, it's essential to clarify that AI covers a broad spectrum. It is sometimes used interchangeably with Machine L
Immersion Datacenter Cooling: Sustainability and ESG
In my previous article, I discussed the flexibility of immersion-cooled infrastructure, particularly for edge deployments, as well as its potential to enable significant scalability for datacenter operators while improving their environmental, social, and governance (ESG) posture. Now, let's delve deeper into the environmental sustainability benefits of immersion cooling, focusing on its impact on Power Usage Effectiveness (PUE), Water Usage Effectiveness (WUE), and heat reuse. I came across a