undefined | Better HN

0 pointswavemode2y ago0 comments

> video generation also seemed kind of stagnant before Sora

I take the opposite view. I don't think video generation was stagnating at all, and was in fact probably the area of generative AI that was seeing the biggest active strides. I'm highly optimistic about the future trajectory of image and video models.

By contrast, text generation has not improved significantly, in my opinion, for more than a year now, and even the improvement we saw back then was relatively marginal compared to GPT-3.5 (that is, for most day-to-day use cases we didn't really go from "this model can't do this task" to "this model can now do this task". It was more just "this model does these pre-existing tasks, in somewhat more detail".)

If OpenAI really is secretly cooking up some huge reasoning improvements for their text models, I'll eat my hat. But for now I'm skeptical.

0 comments

Eisenstein2y ago

> By contrast, text generation has not improved significantly, in my opinion, for more than a year now

With less than $800 worth of hardware including everything but the monitor, you can run an open weight model more powerful than GPT 3.5 locally, at around 6 - 7T/s[0]. I would say that is a huge improvement.

[0] https://www.reddit.com/r/LocalLLaMA/comments/1cmmob0/p40_bui...

j / k navigate · click thread line to collapse

0 comments

Eisenstein2y ago

> By contrast, text generation has not improved significantly, in my opinion, for more than a year now

[0] https://www.reddit.com/r/LocalLLaMA/comments/1cmmob0/p40_bui...

j / k navigate · click thread line to collapse