What I meant was that the "nearness" seems somehow "random". I don't doubt that the results come from a clever analysis of a zillion documents or websites; still, the final result makes little sense.
Your explanation is likely to be accurate in the case of celsius, but it doesn't seem to fit this other game/answer.
I don't want to spoil the answer to that game, but I find it hard to believe that both "car" and "compare" are nearer to the answer than "digit" or "number" if the nearness is based on the number of occurrences in context.