To give you an extreme example, I can ask 1000000 different models for a counterexample to the 3n + 1 problem, and all will get it wrong.
For reference: https://en.wikipedia.org/wiki/Collatz_conjecture