Search

Showing top 126 results for "NVIDIA CUDA"

NVIDIA CUDA

37 articles indexed Last updated 2d ago See topic hub

Videos

Accelerating Long-Context Model Training in JAX and XLA | NVIDIA Technical Blog

…These collectives leverage SHARP, in-network reductions, and multicast acceleration features of NVIDIA NVLINK Switch to enable latency-optimized one-shot and throughput-optimized two-shot AllReduce algorithms. The underlying CUDA interface…

Feb 3, 2026 · Sevin Fide Varoglu

NVIDIA Jetson でメモリ効率を最大化して大規模なモデルを実行

…で有効です。 NVIDIA Jetson Orin NX におけるカーブアウト領域とカーネルおよびユーザー空間の最適化は、システム全体の効率を向上させる重要な要素です。以下のセクションでは、これらのレイヤーを最適化する実践的な手法を紹介します。カーブアウトの最適化 NVIDIA Jetson Orin NX と NVIDIA Jetson Orin Nano のカーブアウト領域は、特定のハードウェアエンジン、ファームウェア、リアルタイムサブシステムのために、起動時に予約される物理メモリです。Linux または NVIDIA CUDA アプリケーションはこれらにアクセスできず、オンチップのマイクロコントローラやアクセラレーターで使用されます…

Apr 20, 2026 · Anshuman Bhat

Maxwell Geforce GTX 870 Specifications and Benchmarks Leak Out - 1664 CUDA Cores and 4GB GDDR5 RAM

…As expected, the GPU core, which is GM204, is not properly recognized and the name detected shows Nvidia's internal nomenclature. I ran the name "Nvidia 17D-20" in a few databases…

Aug 11, 2014 · Usman Pirzada

NVIDIA Developer

…Chain Decision Systems Using NVIDIA cuOpt Agent Skills April 30, 2026 Build AI-Powered Games with NVIDIA DLSS 4.5, RTX, and Unreal Engine 5 Latest Releases CUDA Toolkit 13.1 DLSS…

Discussions and forums

r/nvidia · u/Fcking_Chuck · 3w ago

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics

Search

NVIDIA CUDA

People also ask

Videos

Accelerating Long-Context Model Training in JAX and XLA | NVIDIA Technical Blog

Top stories

NVIDIA CUDA Tile로 C++에서 고성능 GPU 커널 개발하기

NVIDIA CUDA 13.3 Rolls Out CUDA Python 1.0, CUDA Tile For C++

Develop High-Performance GPU Kernels in C++ with NVIDIA CUDA Tile | NVIDIA Technical Blog

NVIDIA CUDA 13.3 Enhances GPU Development with Tile Programming in C++, Compiler Autotuning, and Python Updates | NVIDIA Technical Blog

NVIDIA Jetson でメモリ効率を最大化して大規模なモデルを実行

Maxwell Geforce GTX 870 Specifications and Benchmarks Leak Out - 1664 CUDA Cores and 4GB GDDR5 RAM

NVIDIA Developer

Discussions and forums

NVIDIA releases CUDA-Oxide 0.1 for experimental Rust-to-CUDA compiler

NVIDIA releases CUDA-Oxide 0.1 for experimental Rust-to-CUDA compiler

CUDA Proves Nvidia Is a Software Company

[Megathread] Introducing NVIDIA RTX Spark

Tell HN: Llamacpp now supports unified system RAM offloading on Linux

How to Post-Train Autonomous Vehicle Models in Closed-Loop with NVIDIA Alpamayo | NVIDIA Technical Blog

NVIDIA GeForce GTX 1070 Ti Full Specifications Leak Out - Rumored To Feature Locked Clocks Across AIB Models