Search

Showing top 23 results for "LLM-accelerated chip design"

Pruning and Distilling LLMs Using NVIDIA TensorRT Model Optimizer | NVIDIA Technical Blog

…He brings full-stack GPU expertise spanning from chip design, CUDA and kernel-level development to server and cloud for model training and inference, translating innovations into real-world impact. Before NVIDIA…

Oct 7, 2025 · Max Xu

Inside the NVIDIA Vera Rubin Platform: Six New Chips, One AI Supercomputer | NVIDIA Technical Blog

…Six new chips, one AI supercomputer Extreme co-design is expressed most clearly at the chip level. The Vera Rubin platform is built from six new chips, each engineered for a specific…

Jan 5, 2026 · Kyle Aubrey

Cut Checkpoint Costs with About 30 Lines of Python and NVIDIA nvCOMP | NVIDIA Technical Blog

…Before NVIDIA, Eugene helped Apple evolve their GPU HW and SW to strong-scale from phones to Ultra chips, focusing on professional applications and ML, co-designing new metal features together with…

Apr 9, 2026 · Wenqi Glantz

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics

Pruning and Distilling LLMs Using NVIDIA TensorRT Model Optimizer | NVIDIA Technical Blog

Inside the NVIDIA Vera Rubin Platform: Six New Chips, One AI Supercomputer | NVIDIA Technical Blog

Cut Checkpoint Costs with About 30 Lines of Python and NVIDIA nvCOMP | NVIDIA Technical Blog