Google's TurboQuant cuts AI working memory by 6x, but it won't fix the global RAM shortage
…That translates to lower memory requirements for AI inference workloads, making it easier to run LLMs on consumer smartphones or mid-range laptops. It's similar to how DeepSeek R1 was…