Meanwhile, the GPU is powerful enough for LLMs but has been lacking matrix multiplication acceleration. This changes that.
And there's an official port of Stable Diffusion to it: https://github.com/apple/ml-stable-diffusion
So the new engine is accelerator for matmul accelerator ?