
Is there a performance benefit for inference speed on M-series MacBooks, or is the primary goal simply to get inference working on other platforms (like iOS)? If there is a performance benefit, it would be great to see tokens/s for this vs. Ollama.


See my other comment for results.

mlx is much faster, but anemll appeared to use only ~500 MB of memory, compared to the ~8 GB mlx used.
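For anyone wanting to run the tokens/s comparison mentioned above: Ollama's `/api/generate` response includes `eval_count` (tokens generated) and `eval_duration` (in nanoseconds, per the API docs), so decode throughput can be derived directly. A minimal sketch; the example numbers are illustrative, not measured results:

```python
# Compute decode throughput from Ollama's /api/generate response fields.
# Ollama reports eval_count (generated tokens) and eval_duration (nanoseconds).

def tokens_per_second(eval_count: int, eval_duration_ns: int) -> float:
    """Tokens generated per second of decode time."""
    return eval_count / (eval_duration_ns / 1e9)

# Illustrative numbers: 512 tokens in 8 seconds of decode time.
print(tokens_per_second(512, 8_000_000_000))  # -> 64.0
```

The same arithmetic works for any runtime that reports token counts and wall-clock decode time, so it gives a like-for-like figure when comparing against anemll or mlx.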





