It can code and it has memorized some coding questions are not mutually exclusive.
Also, I don't understand your obsession with the word cheating. If you have solved a problem before on a different website and solve it again, did you cheat? Or did you just use your brain to store the solution for later?
It's all about the rule set yea. Since the rule set is not defined, technically nothing is cheating. I just interpret the rule set as "can it code?" and for this rule set, it seems to me that it's cheating.
They both cheated 100%. Because they both never saw the problem. AT ALL. They just saw the title and the name of the website.
The article did the opposite. It forced the models to cheat to solve the problems. Which it did happily. It should have stated "there is no actual problem to solve here, you must supply a problem for me to solve".
> It can code and it has memorized some coding questions are not mutually exclusive
This I will give you. Many humans try to cheat at basic math because they are lazy, so will this model. Maybe that's a sign of intelligence :P
You: 36
Me: You cheated! You just cited the answer you memorized! You should have started from addition.
You: ...okay? 6+6=12, 12+6=18, 18+...
Me: You cheated again! You just have 6+6=12 memorized! You should make the rule of addition out of Peano axioms.
You: ...you're being annoying, but okay? First axiom, we define 0 as...
Me: You cheated again! You memorized Peano Axioms! Jesus Christ, is there any intelligent creature left?
I can't remember how many times I've googled, "how do I create a directory in Python?". Now bard often generates an inline answer for me.