Ollama adopts MLX for faster AI performance on Apple silicon - 9to5Mac
… Here’s Ollama: This results in a large speedup of Ollama on all Apple Silicon devices. On Apple’s M5, M5 Pro and M5 Max chips, Ollama leverages the new GPU Neural Accelerators to accelerate both time to first token TTFT and generation speed tokens per second . …
