My RTX 5090 can't keep up with Apple Silicon on the biggest local LLMs, and I hate to admit it
…It's not a CUDA equivalent in maturity or scope, and plenty of local LLM tooling still uses Metal directly, but it shows Apple is aware that unified memory has become one…