Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
obastani
2y ago
0 comments
Share
Important caveat with some of the results: they are using better prompting techniques for Gemini vs GPT-4, including their top line result on MMLU (CoT@32 vs top-5). But, they do have better results on zero-shot prompting below, e.g., on HumanEval.
0 comments
default
newest
oldest
cchance
2y ago
I do find it a bit dirty to use better prompt techniques and compare them in a chart like that
j
/
k
navigate · click thread line to collapse