Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer | NVIDIA Technical Blog
Sep 10, 2024 By Jan Lasek, Onur Yilmaz, Chenjie Luo and Chenhan Yu
