The nice thing about most of this deep learning stuff is that you can use a million machine hours to train your model, and almost no time to make an evaluation. So the single machine version has all of the pattern recognition given to it by the cluster, but a few ply less tree search depth.
So to me this underscores the relative importance of the deep learning model vs the tree search.