I replaced Claude Pro with a local 9B model for a week, and finally found out what I was paying $20 a month for
…The reason it can do that on modest hardware is GDN - a hybrid architecture that keeps memory usage mostly flat as context grows instead of climbing with every token the way a…