extremetech.com › computing A Modder Repurposed a Used V100 For LLM Acceleration …In the Ollama LLM Benchmark, this bootstrapped V100 with 16GB of HBM2 was able to put out 130 tokens per second, outpacing an RX 7800 XT. In another test using Gemma 4… May 11, 2026 · Jon Martindale