I was curious to see its performance. I found this page that benchmarks Keith's implementation with an implementation of std::sort and timsort.
In that test, it was thoroughly beaten in all benchmarks. The issue seems to be that it isn't cache friendly.
https://www.gamasutra.com/view/news/172542/Indepth_Smoothsor...