Ollama, the popular app for running AI models locally on a computer, has released an update that takes advantage of Apple’s own machine learning framework, MLX. The result is a hefty speed boost on Macs with Apple silicon.

According to Ollama, the new version processes prompts around 1.6 times faster (prefill speed) and nearly doubles the speed at which it generates responses (decode speed). Macs with M5-series chips are said to see the largest improvements, thanks to Apple’s new GPU Neural…

![[CITYPNG.COM]White Google Play PlayStore Logo – 1500×1500](https://startupnews.fyi/wp-content/uploads/2025/08/CITYPNG.COMWhite-Google-Play-PlayStore-Logo-1500x1500-1-630x630.png)