Search

Showing top 114 results for "model-by-model evaluation"

People also ask

What’s the difference between evaluating an AI model and evaluating an AI agent?

While model and agent evaluation are inextricably linked, their technical benchmarks and metrics for success are fundamentally different.

Mastering Agentic Techniques: AI Agent Evaluation | NVIDIA Technical Blog

What is NVIDIA Model Optimizer?

The NVIDIA Model Optimizer (ModelOpt) library incorporates state-of-the-art model optimization techniques to compress and accelerate AI models. These techniques include quantization, distillation, pruning, speculative decoding, and sparsity. ModelOpt accepts Hugging Face, PyTorch, or ONNX format models as input and provides Python APIs for users to easily combine different optimization techniques to produce optimized checkpoints. ModelOpt supports highly performant quantization formats such as FP4, FP8, INT8, and INT4, and advanced algorithms including SmoothQuant, AWQ, SVDQuant, and Double Q

Model Quantization: Post-Training Quantization Using NVIDIA Model Optimizer | NVIDIA Technical Blog

What is CLIP?

CLIP (Contrastive Language-Image Pretraining), introduced by OpenAI in 2021, is a foundation vision language model (VLM) that learns a shared embedding space for images and text through contrastive learning on large image-text pairs. Its ability to produce semantically aligned representations has made it a core building block across modern multimodal systems. The CLIP text encoder is widely reused as a conditioning module for text-to-image (Stable Diffusion, for example) and text-to-video (AnimateDiff, for example) synthesis. Its vision encoder serves as the visual backbone in multimodal LLMs

Model Quantization: Post-Training Quantization Using NVIDIA Model Optimizer | NVIDIA Technical Blog

Followed topics

Search

People also ask

Metropolis for Developers

Top stories

Build Your Own Transaction Foundation Model for Financial Intelligence | NVIDIA Technical Blog

Fine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes | NVIDIA Technical Blog

Pretrained to Imagine, Fine-Tuned to Act: The Rise of World-Action Models | NVIDIA Technical Blog

Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech | NVIDIA Technical Blog

Unlock Massive Token Throughput with GPU Fractioning in NVIDIA Run:ai | NVIDIA Technical Blog

NVIDIA Vera CPU Sets a New Standard for Agentic Workloads in AI Factories | NVIDIA Technical Blog

AR / VR – NVIDIA Technical Blog

Developer Tools & Techniques – NVIDIA Technical Blog

NVIDIA NVbandwidth: Your Essential Tool for Measuring GPU Interconnect and Memory Performance | NVIDIA Technical Blog

Isaac Sim

How to Automate AI Model Documentation with the NVIDIA MCG Toolkit | NVIDIA Technical Blog

Accelerating Long-Context Model Training in JAX and XLA | NVIDIA Technical Blog

NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents | NVIDIA Technical Blog