Showing top 120 results for "AI acceleration pipeline"

Paper page - SlimSpec: Low-Rank Draft LM-Head for Accelerated Speculative Decoding

…language model head while maintaining full vocabulary support and achieving significant speedup with minimal pipeline changes. AI-generated summary Speculative decoding speeds up autoregressive generation in Large Language Models (LLMs) through a…

May 12, 2026

Optimize Fine-Tuning and Deployment of LLMs on an AI PC

…Additional Resources Intel Gaudi 2 AI Accelerator OpenVINO™ Toolkit on GitHub AI PCs from Intel Intel Arc Graphics Optimum for Intel Library Neural Network Compression Framework (NNCF) Hugging Face Transformers Hugging Face…

· Kelli Belcher AI software solutions engineer

NVIDIA NeMo Retriever

…Try Jump-start building your AI solutions with the NVIDIA RAG Blueprint, available on build.nvidia.com. Starter Kits Start building information retrieval pipelines and generative AI applications for multimodal data ingestion…

HippoScreen Neurotech

HippoScreen uses Intel® software optimizations to accelerate AI model build times and build AI pipelines in an efficient and adaptable way.

Discussions and forums

Hacker News · u/aaronestrada · 5d ago

Show HN: I created a RAW to HDRI stacker in (mostly) Common Lisp

This is an upgrade of a tool I created 15 years ago in Python to learn OOP and solve some inadequacies in the HDR stacking tools I could find at the time. The problem was, none of them were really "batch friendly". None …

Hacker News · u/rishipankhaniya · 1w ago

Launch HN: Rudus (YC P26) – AI for concrete contractors

Hi HN, we’re Rishi and Sahil. We’ve developed Rudus (https://www.rudus.ai/), an AI-powered takeoff and estimation platform built for concrete subcontractors. Takeoff is the process of measuring and quantifying materials f…

AMD Instinct MI350P PCIe Targets Air-Cooled Enterprise AI Servers

…AMD is targeting use cases such as inference, retrieval-augmented generation, and production AI pipelines. Systems can be configured with up to eight accelerator cards, depending on the server platform. The card…

May 7, 2026 · Hilbert Hagedoorn

How to Build Vision AI Pipelines Using NVIDIA DeepStream Coding Agents | NVIDIA Technical Blog

…AI development platform, DeepStream accelerates a developer’s journey from concept to actionable insight across industries. Video 1. How to use the NVIDIA DeepStream coding agents to generate complete vision AI pipelines…

Apr 16, 2026 · Debraj Sinha

How to Build a Document Processing Pipeline for RAG with Nemotron | NVIDIA Technical Blog

…document processing pipelines that extract, embed, and retrieve structured data, including tables and charts, from complex PDFs for grounded, cited AI answers. The solution uses the NeMo Retriever library for GPU-accelerated extraction, then…

Feb 4, 2026 · Chia-Chih Chen

NVIDIA Provides Preview Driver With DRM Color Pipeline API Support

…NVIDIA published a R595-derived driver build that provides a preview implementation of the Color Pipeline API for letting Wayland compositors leverage GPU display hardware capabilities for accelerating color processing like HDR…

Apr 1, 2026

Maximizing Memory Efficiency to Run Bigger Models on NVIDIA Jetson | NVIDIA Technical Blog

…AI workloads. Inferencing pipeline This layer manages the end-to-end data flow through preprocessing, inference, and postprocessing to produce actionable outputs. Frameworks like NVIDIA DeepStream provide a high-performance, GPU-accelerated…

Apr 20, 2026 · Anshuman Bhat

QNAP Unveils QAI-h1290FX Edge AI Storage Server

…Run AI-powered chat assistants, document search engines, or knowledge bases fully on-premises. Keep sensitive data in-house while accelerating AI workflows. High-speed Networking and Scalable Architecture: Comes with dual…

Apr 30, 2026
