A deep dive into the architecture of Nvidia's Rubin CPX chip, which is optimized for long-context AI tasks and the prefill phase of inference (SemiAnalysis)

SemiAnalysis: A deep dive into the architecture of Nvidia's Rubin CPX chip, which is optimized for long-context AI tasks and the prefill phase of inference  —  New Prefill Specialized GPU, Rack Architecture, BOM, Disaggregated PD, Higher Perf per TCO, Lower TCO, GDDR7 & HBM Market Trends

Sep 15, 2025 - 14:06