I have found it pretty decent at explaining math and physics concepts, and at generating some basic code. It seems deliberately over-tuned for code generation, as it sometimes inappropriately generates code even when asked non-code questions.
Overall, it performs better than GPT-3.5-turbo in many use cases. The GPT-4 comparison is harder to quantify, since there are multiple versions of GPT-4 that are rumored to produce significantly different outputs.
It's definitely worth giving a shot. The 34B parameter count makes a big difference, and the general finding so far has been that you're still better off aggressively quantizing a larger model than running a smaller model unquantized.
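To make the memory side of that tradeoff concrete, here's a rough back-of-envelope sketch (plain Python, weights only; it ignores runtime overhead like the KV cache and activations, so actual usage will run somewhat higher):

```python
def weight_footprint_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate weight-only memory footprint in GB.

    Ignores KV cache, activations, and per-tensor quantization
    metadata, so real-world usage will be somewhat higher.
    """
    bytes_per_weight = bits_per_weight / 8
    return params_billions * bytes_per_weight  # billions of params * bytes each = GB

# A heavily quantized 34B model vs. unquantized (fp16) smaller models:
print(f"34B @ 4-bit : ~{weight_footprint_gb(34, 4):.0f} GB")   # ~17 GB
print(f"13B @ fp16  : ~{weight_footprint_gb(13, 16):.0f} GB")  # ~26 GB
print(f" 7B @ fp16  : ~{weight_footprint_gb(7, 16):.0f} GB")   # ~14 GB
```

So a 4-bit 34B model lands in roughly the same memory budget as an fp16 7B, which is why the "quantize the bigger model" advice tends to win when memory is the constraint.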