Search

Showing top 2 results for "kernel hardware requirements"

Paper page - DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices

… Our specialized acceleration kernel delivers a 3.00× speedup on real hardware compared with dense inference. …

May 12, 2026

We Got Claude to Build CUDA Kernels and teach open models!

… We need to talk about the 'magic' behind Claude’s CUDA kernels. Is it superior synthetic data, or did Anthropic find a better way to teach LLMs hardware-level logic? …

Jan 28, 2026 · ben burtenshaw

Followed topics

Paper page - DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices

We Got Claude to Build CUDA Kernels and teach open models!