In the DGX Spark benchmarks I found, the Spark was mostly beating the M4, though it wasn't clear-cut.
The M4 Max has double the memory bandwidth, so it should be faster for decode (token generation).
However, the Spark will be better for training, fine-tuning, and similar workflows.
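
To put rough numbers on the bandwidth point: decode is usually memory-bound, since every generated token has to stream roughly all of the model weights from memory once. That gives a simple ceiling of bandwidth divided by weight size. Here's a back-of-the-envelope sketch. The bandwidth figures are the published peak specs as I understand them (546 GB/s for the top M4 Max configuration, 273 GB/s for the Spark), and the 35 GB model size is a hypothetical stand-in for something like a 70B model at 4-bit. Real throughput will land below these ceilings, and none of this applies to prefill or training, which are compute-bound.

```python
def decode_tokens_per_second_upper_bound(bandwidth_gb_s: float, weights_gb: float) -> float:
    """Rough ceiling on decode speed for a memory-bound model.

    Assumes each generated token reads all weights from memory exactly once
    and ignores KV-cache traffic, compute time, and software overhead.
    """
    if weights_gb <= 0:
        raise ValueError("weights_gb must be positive")
    return bandwidth_gb_s / weights_gb


# Assumed peak memory bandwidth from published specs (GB/s).
M4_MAX_BANDWIDTH_GB_S = 546.0
DGX_SPARK_BANDWIDTH_GB_S = 273.0

# Hypothetical model footprint: roughly a 70B model quantized to 4 bits.
MODEL_WEIGHTS_GB = 35.0

for name, bandwidth in [
    ("M4 Max", M4_MAX_BANDWIDTH_GB_S),
    ("DGX Spark", DGX_SPARK_BANDWIDTH_GB_S),
]:
    ceiling = decode_tokens_per_second_upper_bound(bandwidth, MODEL_WEIGHTS_GB)
    print(f"{name}: at most ~{ceiling:.1f} tokens/s")
```

Under these assumptions the M4 Max ceiling is twice the Spark's, which is just the bandwidth ratio. It says nothing about which machine wins once compute matters.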