I replaced Claude Pro with a local 9B model for a week, and finally found out what I was paying $20 a month for
…with 8GB VRAM, sitting at a 60k context window. The reason it can do that on modest hardware is GDN, a hybrid architecture that keeps memory usage mostly flat as context grows…
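
To get a feel for why "mostly flat" matters on an 8GB card, here is a rough back-of-the-envelope sketch. Every number in it (layer count, KV heads, head size, the one-in-four ratio of attention layers, the per-layer state size) is an illustrative assumption, not the real configuration of the model I ran, and the function names are made up for this post. The idea is only that a standard transformer caches keys and values for every token in every layer, while a hybrid keeps a fixed-size state in most layers and a growing cache in only a few.

```python
GIB = 1024 ** 3


def kv_cache_bytes(context_len, attn_layers, kv_heads=8, head_dim=128,
                   bytes_per_value=2):
    """Key/value cache size for full-attention layers.

    Grows linearly with context_len. All defaults are hypothetical.
    """
    # Factor of 2: one tensor for keys, one for values.
    return 2 * attn_layers * kv_heads * head_dim * bytes_per_value * context_len


def hybrid_cache_bytes(context_len, total_layers=32, attn_every=4,
                       state_bytes_per_layer=2 * 1024 * 1024):
    """Cache size for a hypothetical hybrid model.

    One layer in every `attn_every` is full attention; the rest keep a
    fixed-size recurrent state that does not depend on context length.
    """
    attn_layers = total_layers // attn_every
    linear_layers = total_layers - attn_layers
    fixed_state = linear_layers * state_bytes_per_layer
    return fixed_state + kv_cache_bytes(context_len, attn_layers)


if __name__ == "__main__":
    for ctx in (4_000, 16_000, 60_000):
        full = kv_cache_bytes(ctx, attn_layers=32) / GIB
        hybrid = hybrid_cache_bytes(ctx) / GIB
        print(f"{ctx:>6} tokens: full attention {full:.2f} GiB, "
              f"hybrid {hybrid:.2f} GiB")
```

Under these made-up numbers, the full-attention cache alone would eat most of an 8GB card at 60k tokens before the weights are even loaded, while the hybrid's cache stays a fraction of that. The hybrid still grows a little with context, through its few attention layers, which is why "mostly flat" is the honest way to put it.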