It's very hard to evaluate whether one model is better than another, and doing it in a scientifically sound way is especially hard and time-consuming.
This is why I find comments like "model X is so much better than model Y" to be about as useful as "chocolate ice cream is so much better than vanilla".