Showing top 11 results for "Reddit Tom's Guide"

Top stories

tomshardware.com

768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

… APFrisco explains in a mini tutorial/guide on the Local LLaMA subreddit how they bought some used Intel Optane Persistent Memory, acquired relatively cheaply second-hand, to “run a 1 trillion parameter model in this case Kimi K2.5 locally at ~4 tokens/second” on their Xeon workstation. …

May 23, 2026 · Mark Tyson