I don’t know how NumPy works internally, but this article seems quite plausible, since it uses parallel computation to get a faster result.
The title just seems to imply skepticism about what looks like a legit approach.