
0 points · bgwalter · 5mo ago · 0 comments

How do we know it's not just a mashup of existing pictures? All generated pelicans on bikes look somewhat cartoonish and use historical or artsy bikes. This is training material from 2015:

https://www.behance.net/gallery/29122113/Pelican-on-bikes-wi...

There are other such images. Not an image model? How do we know that they don't convert all images to SVG and train an LLM on them? How do we know that they don't cheat on this benchmark and route the query to an image model first?


jstanley · 5mo ago

"it's not impressive because they might have cheated" isn't a great argument.

bgwalter (OP) · 5mo ago

The generated picture is not impressive, and the excuse in this subthread was that an SVG is created directly without using an image model. I offer alternative explanations for why SVG creation might not be impressive OR ALTERNATIVELY why they may have faked even a bad result because it is a popular benchmark (faking a perfect result would be too obvious).

But since everything is closed source, with any number of potential special-case hacks, we won't know.
