Contemplating Meta’s Homegrown MTIA Compute Engine Roadmap
…And when AI models are changing faster than chip development cycles, any of the big model builders as well as anyone running AI inference at scale has to have a multi-product…
…And when AI models are changing faster than chip development cycles, any of the big model builders as well as anyone running AI inference at scale has to have a multi-product…
…and for more than a decade has led the charge to the top of the AI heap with those chips firmly in hand Nvidia’s CUDA computing platform and programming model opened…
…Both the additional cores and the High Max Turbo frequencies also help the chip more efficiently feed data to the GPU, which speeds up the training time for AI models and makes…
…and its related “Qumran 4” fabric chips and “Ramon” aggregation chips to do it. You can do it with the Jericho 3+ ASICs, as Brendan Gibbs , vice president of AI, routing, and…
…protocol and the TiN hybrid switch-NIC chippery, plus extensions to the JAX and Pathways AI frameworks, Google can scale to 134,000 chips in a single Virgo fabric, and using OCS…
…Having said all that, we are keeping an eye out for differentiated innovation in both AI models and in XPU architectures to see if some upstart can shake things up. We strongly…
…agentic AI work,” as she put it. Here is how Su explained it further: “So you should think about we need all of the accelerators to run these foundational models, and then…
…Our model suggests that AWS spent around $115 billion on IT infrastructure, and of this around $105 billion was for AI infrastructure. So AI was 78 percent or so of all capex…
…Nvidia Mostly Owns Training, And Can Compete On Inference These roadmaps are important to the OEMs and ODMs that convert Nvidia’s technology into the systems that run AI training and inference…
…But the big AI model makers and probably more than a few large enterprises are going to want to get cheaper infrastructure by literally owning and running it themselves. Google has already…