undefined | Better HN

0 pointsobastani2y ago0 comments

Important caveat with some of the results: they are using better prompting techniques for Gemini vs GPT-4, including their top line result on MMLU (CoT@32 vs top-5). But, they do have better results on zero-shot prompting below, e.g., on HumanEval.

0 comments

cchance2y ago

I do find it a bit dirty to use better prompt techniques and compare them in a chart like that

j / k navigate · click thread line to collapse

0 pointsobastani2y ago0 comments

Important caveat with some of the results: they are using better prompting techniques for Gemini vs GPT-4, including their top line result on MMLU (CoT@32 vs top-5). But, they do have better results on zero-shot prompting below, e.g., on HumanEval.

0 comments

cchance2y ago

I do find it a bit dirty to use better prompt techniques and compare them in a chart like that

j / k navigate · click thread line to collapse