undefined | Better HN

0 pointsdrcode4y ago0 comments

Imagine asking it to generate a picture for "duck wearing a hat on Mars":

First, it creates a random 10x10 pixel blurry image and asks a neural net: "Could this be a duck wearing a hat on Mars?" and the neural net replies "No, because all the pictures I've ever seen of Mars have lots of red color in them" so the system tweaks the pixels to make them more red, put some pixels in the center that have a plausible duck color, etc.

After it has a 10x10 image that is a plausible duck on Mars, the system scales the image to 20x20 pixels, and then uses 4 different neural nets on each corner to ask "Does this look like the upper/lower left/right corner of a duck wearing a hat on Mars?" Each neural net is just specialized for one corner of the image.

You keep repeating this with more neural nets until you have a pretty 1000x1000 (or whatever) image.

0 comments

1 comments · 1 top-level

refulgentis4y ago

Not the case, though in a handwave-y way, same idea - instead of iteratively scaling, you're iteratively denoising. See here, links out to the Cornell NLP PhD describe in even more detail: https://www.jpohhhh.com/articles/inflection-point-ml-art

1 more reply

j / k navigate · click thread line to collapse