MLX is worth paying attention to. It's still pretty young (just over a year old)...

masto · on April 28, 2025

Ran it and it crapped out with a huge backtrace. I spotted `./build_bundled.sh: line 21: cmake: command not found` in it, so I guessed I needed cmake installed. `brew install cmake` and try again. Then it crapped out with `Compatibility with CMake < 3.5 has been removed from CMake.`. Then I give up.

This is typical of what happens any time I try to run something written in Python. It may be easier than setting up an NVIDIA GPU, but that's a low bar.

H3X_K1TT3N · on April 28, 2025

This is absolutely every experience I have with python.

simonw · on April 28, 2025

Which Python version was that? Could be that MLX have binary wheels for some versions but not others.

masto · on April 28, 2025

Adding `-p 3.12` made it work. Leaving that here in case it helps someone.

porridgeraisin · on April 28, 2025

Aha, knew you wouldn't give up. Not what our kind do

hnfong · on April 29, 2025

Never gonna give you up…

(Sorry I’ll excuse myself now…

jack_pp · on April 29, 2025

for the record these problems don't really exist on linux in my experience

mobiuscog · on April 29, 2025

Python problems exist on all platforms. It's just that most people using Python have figured out their 'happy path' workarounds in the past and keep using them.

Python is awesome in many ways, one of my favourite languages, but unless you are happy with venv manipulation (or live in Conda), it's often a nightmare that ends up worse than DLL-hell.

masto · on April 29, 2025

Python is in a category of things you can't just use without being an expert in the minutiae. This is unfortunate because there are a lot of people who are not Python developers who would like to run programs which happen to be written in Python.

Python is by no means alone in this or particularly egregious. Having been a heavy Perl developer in the 2000s, I was part of the problem. I didn't understand why other people had so much trouble doing things that seemed simple to me, because I was eating, breathing, and sleeping Perl. I knew how to prolong the intervals between breaking my installation, and how to troubleshoot and repair it, but there was no reason why anyone who wanted to deploy, or even develop on, my code base should have needed that encyclopedic knowledge.

This is why, for all their faults, I count containers as the biggest revolution in the software industry, at least for us "backend" folks.

esafak · on April 28, 2025

https://ml-explore.github.io/mlx/

marci · on April 28, 2025

For those who never heard of those:

mlx is similar to numpy/pytorch, but only for Apple Silicon.

mlx-lm is a llama.cpp equivalent, but built on top of mlx.

https://github.com/ml-explore/mlx-lm

mathfailure · on April 28, 2025

How much disk & RAM does it need?

What's your tokens/sec rate (and on which device)?

simonw · on April 28, 2025

I've been running it on a 64GB M2. My favorite models to run tend to be about 20GB to download (eg Mistral Small 3.1) and use about 20GB of RAM while they are running.

I don't have a token/second figure to hand but it's fast enough that I'm not frustrated by it.

_bin_ · on April 28, 2025

I wish apple would spend some more time paying attention to metal-jax :) it crashes with a few lines still and seems like an obvious need if apple wants to be serious about enabling ML work on their new MBPs.

MLX looks really nice from the demo-level playing around with it I've done, but I usually stick to jax so, you know, I can actually deploy it on a server without trying to find someone who racks macs.

dkga · on April 28, 2025

So, on an M4 I sometimes get faster training on plain vanilla jax compared to the same model in pytorch or tensorflow. And jax-metal often breaks :/

_bin_ · on April 28, 2025

No kidding? Might switch to CPU then. And yeah jax-metal is so utterly unreliable. I ran across an issue it turns out reduces to like a 2 line repro example which has been open on github for the better part of a year without updates