The blog post criticizes existing software for their inaccuracy, but that is not their problem. The key signature of a particular passage may deviate from the overall key. I also wonder how they evaluate accuracy shown in [1] and what is the human accuracy in comparison.
[1] https://www.reddit.com/r/DJs/comments/hwlzyt/key_detection_c...