Search

Showing top 116 results for "reviews and benchmarks"

Top stories

Discussions and forums

r/nvidia · u/pcgameshardware · 1d ago

Gothic Remake Performance Review: 500+ Benchmarks with 62 CPUs, 40 GPUs and Linux

Hey everyone, we at PCGH benchmarked the Gothic Remake quite extensively and included a large Nvidia GPU lineup in the test. The article has 500+ benchmark results in total, with 40 GPUs, 62 CPUs, several resolutions, VR…

Hacker News · u/iliaov · 2d ago

I'm tired of LLM skill slop, so I built mine with regression tests

I've recently tried skills like Garry Tan's GStack, spent a week with it, and realized it has some flaws (I'll post separately about that).Here's my problem: how do I know if a skill or prompt is any good (e.g. GStack's …

5
r/LocalLLaMA · u/janvitos · 2w ago

110 tok/s with 12GB VRAM on Qwen3.6 35B A3B and ik_llama.cpp

Had been getting great MTP performance with llama.cpp on my RTX 4070 Super 12GB, until they actually merged the MTP PR. Then, performance tanked and was barely above non-MTP. So, I decided to try out ik_llama.cpp since i…

Hacker News · u/bhu8 · 2w ago

Show HN: Viberia – Civ/Polytopia-like command center for AI agents (BYOK/BYOS)

Hey HN,This is my take on the agent harness. Everything on an isometric map. Agents are grouped into "buildings" that run in a sequence or a loop; e.g., the CodeForge has an agent that writes a PRD, another one that impl…

1
r/LocalLLaMA · u/janvitos · 4w ago

80 tok/sec and 128K context on 12GB VRAM with Qwen3.6 35B A3B and llama.cpp MTP

Just wanted to share my config in hopes of helping other 12GB GPU owners achieve what I see as very respectable token generation speeds with modest VRAM. Using the latest llama.cpp build + MTP PR, I got over 80 tok/sec w…