Data Science – NVIDIA Technical Blog
…10 MIN READ Mar 16, 2026 Scaling Autonomous AI Agents and Workloads with NVIDIA DGX Spark Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage…
Earlier this week at GTC Taipei, NVIDIA unveiled the NVIDIA RTX Spark product family, including small form factor desktops and laptops built for the age of personal assistants. These desktops and laptops deliver 1 petaflop of AI power, up to 128 GB of memory, and CUDA-accelerated AI frameworks for running large models alongside everyday work. Microsoft is creating an RTX Spark special developer edition—the Microsoft Surface NVIDIA RTX Spark Dev Box—preloaded with a modified Windows configured for developers and the top developer tools you need to get started. To learn more, see Building the n
Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA | NVIDIA Technical BlogWith agents running 24 hours a day, seven days a week on increasingly complex tasks, efficient local compute matters even more. NVIDIA has collaborated with the open source community to enhance the top inference backends for agents, llama.cpp and vLLM. llama.cpp now delivers 2x performance on Qwen 3.5 and 3.6 27B dense models, and 1.6x performance on Qwen 3.5 and 3.6 35B mixture-of-expert (MoE) models. The following two techniques make this possible: Multi-Token Prediction (MTP): An advanced speculative decoding technique, where a smaller draft model proposes several tokens ahead that the targ
Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA | NVIDIA Technical BlogOne popular way to run AI locally has been to use multiple GPUs to access more memory and compute. While cloud frameworks like vLLM are well optimized for multiple GPUs thanks to their use in data centers, PC frameworks like llama.cpp and the ComfyUI implementation in PyTorch are not optimized for it. To solve this challenge, NVIDIA has collaborated with both llama.cpp and ComfyUI to enhance performance for RTX PCs with two equivalent GPUs. This enables you to run larger models and use the compute of both GPUs for better performance. llama.cpp now supports tensor parallelism (TP), fully utiliz
Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA | NVIDIA Technical BlogNVIDIA NemoClaw for building autonomous AI agents now supports all NVIDIA client systems—GeForce RTX, NVIDIA RTX PRO, NVIDIA DGX Spark, and NVIDIA DGX Station for Windows—through Linux and Windows Subsystem for Linux (WSL). This enables you to easily set up and sandbox an agent, with optimized local models handpicked for your hardware. The update also includes enhancements to the installer to make it easier and more seamless. NemoClaw also now supports running Hermes Agent as an option. This week, Hermes Agent also released native Windows support, including both a command-line interface, alon
Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA | NVIDIA Technical Blog…10 MIN READ Mar 16, 2026 Scaling Autonomous AI Agents and Workloads with NVIDIA DGX Spark Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage…
…10 MIN READ Mar 16, 2026 Scaling Autonomous AI Agents and Workloads with NVIDIA DGX Spark Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage…
…10 MIN READ Mar 16, 2026 Scaling Autonomous AI Agents and Workloads with NVIDIA DGX Spark Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage…
…25, 2026, with information about NVIDIA DGX Spark. Discuss (0) Discuss (0) Tags Agentic AI / Generative AI | Computer Vision / Video Analytics | Manufacturing | Media & Entertainment | Retail / Consumer Packaged Goods | Smart Cities / Spaces | Blueprint…
NVIDIA DeepStream SDK NVIDIA DeepStream’s multi-platform support gives you a faster, easier way to develop and deploy real-time video streaming pipelines for generative AI agents and applications. You can…
To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.