Furthermore, those better results come from a much simpler model. The model shown here, by contrast, has a fairly complicated architecture (a complex residual-concatenation setup) and many more parameters (I'd guess anywhere between 2x and 10x as many, though I'd have to take a closer look), which makes it much slower to run and heavier on memory (disk and RAM).
I'd also say that in general the better model makes a lot more common-sense choices: using the CIE Lab color space (perceptually uniform), omitting pooling, using a classification loss instead of regression (regression generally performs poorly in deep learning), etc.
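For anyone curious what the Lab + classification setup looks like in practice, here's a minimal self-contained sketch (pure Python, sRGB with a D65 white point; the 10-unit ab grid is my own illustrative choice, not the exact binning any particular paper uses):

```python
def srgb_to_lab(r, g, b):
    """Convert sRGB in [0, 1] to CIE L*a*b* (D65 white point)."""
    # inverse sRGB gamma -> linear RGB
    def lin(c):
        return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4
    r, g, b = lin(r), lin(g), lin(b)
    # linear RGB -> CIE XYZ
    x = 0.4124 * r + 0.3576 * g + 0.1805 * b
    y = 0.2126 * r + 0.7152 * g + 0.0722 * b
    z = 0.0193 * r + 0.1192 * g + 0.9505 * b
    # normalize by the D65 white point
    x, y, z = x / 0.95047, y / 1.0, z / 1.08883
    def f(t):
        return t ** (1 / 3) if t > 0.008856 else 7.787 * t + 16 / 116
    fx, fy, fz = f(x), f(y), f(z)
    # L carries the lightness (the grayscale input); a, b carry the color
    return 116 * fy - 16, 500 * (fx - fy), 200 * (fy - fz)

def ab_bin(a, b, step=10, lo=-110, hi=110):
    """Quantize (a, b) into a grid cell: the class label for a softmax.

    This is what turns 'predict the color' from a regression problem into
    a classification problem over a few hundred discrete color bins.
    """
    n = (hi - lo) // step  # 22 bins per axis -> 484 classes
    ai = min(max(int((a - lo) // step), 0), n - 1)
    bi = min(max(int((b - lo) // step), 0), n - 1)
    return ai * n + bi
```

The key point is the last function: instead of asking the network to regress two continuous ab values (where the loss pushes it toward desaturated averages), you ask it to pick a bin, which lets it commit to a vivid color.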
For comparability, I think it would be best to see outputs from both models on the same images, chosen in advance rather than cherry-picked.
What I imagine is: full-color input -> produce a B/W image plus a color histogram (a list of the colors used) -> the image viewer uses a colorization algorithm to reapply the colors.
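To make the idea concrete, here's a toy sketch of that pipeline (no neural net involved; the nearest-luma "decoder" is just a stand-in for the colorization model, and the quantization step size is my own illustrative choice):

```python
from collections import Counter

def quantize(rgb, step=32):
    # snap each channel to the center of a coarse bucket
    return tuple((c // step) * step + step // 2 for c in rgb)

def luma(rgb):
    r, g, b = rgb
    return 0.299 * r + 0.587 * g + 0.114 * b  # Rec. 601 weights

def encode(pixels):
    """Compress: keep only the grayscale image and a color histogram."""
    gray = [round(luma(p)) for p in pixels]
    hist = Counter(quantize(p) for p in pixels)
    return gray, hist

def decode(gray, hist):
    """Naive stand-in for the colorizer: nearest-luma palette match."""
    palette = list(hist)
    return [min(palette, key=lambda c: abs(luma(c) - g)) for g in gray]
```

Even this toy version shows the catch: any two palette colors with similar luma become indistinguishable after encoding, which is exactly the ambiguity the neural net would need spatial context to resolve.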
I don't think a compression technique that requires that much processing power for that little size reduction would be very useful.
Congrats, you wrote the first colorblind NN ever!
(Or you could just try to use a RNN directly and keep hidden state from frame to frame.)
The widely known Ben-Hur (1959), the one with the chariot race, is already in colour. Did you mean "Ben-Hur: A Tale of the Christ" from 1925?
I'm interested to see if neural network parameters become the new "binary blob". While in theory you could always retrain the network yourself, actually doing so takes a lot of work fiddling with the network's hyperparameters and requires significant computing resources.
[1] http://www.robots.ox.ac.uk/~vgg/research/very_deep/
[2] "On a system equipped with four NVIDIA Titan Black GPUs, training a single net took 2–3 weeks depending on the architecture." - arXiv:1409.1556
Open source does not just mean you can see the source. From Wikipedia[0]:
"Open-source software is computer software with its source code made available with a license in which the copyright holder provides the rights to study, change, and distribute the software to anyone and for any purpose."
It's still interesting and cool to see! Just not what I thought it was when I clicked on the link.
In biometrics, there have been similar cases of software like face detectors and face recognition working very well on people from China and not very well on others, because the researchers who trained those models only had large public databases from Chinese universities available. The model hadn't seen any other ethnicity, so its performance on "non-Chinese" folks wasn't surprising.
http://www.slashfilm.com/orangeblue-contrast-in-movie-poster...
I'd love to combine this technology with this: http://matplotlib.org/style_changes.html
You would probably get some cool results, as you could generate examples of what images look like to color-blind people, plus a corrected set so color-blind people could see them properly.
It would be a cool, and I assume simpler, problem than the one you have already managed to solve.
Good show, great work.
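If anyone wants to play with that idea, here's a crude sketch; note this is just a red-green channel collapse for illustration, not a physiologically accurate simulation like the Brettel/Viénot LMS-space models:

```python
def simulate_red_green_cb(pixels, severity=1.0):
    """Crudely approximate red-green color blindness.

    Collapses the red and green channels toward their shared average,
    which is roughly the axis of information a protan/deutan observer
    loses. severity=0 leaves the image unchanged; 1.0 fully collapses.
    """
    out = []
    for r, g, b in pixels:
        avg = (r + g) / 2
        out.append((round(r + severity * (avg - r)),
                    round(g + severity * (avg - g)),
                    b))
    return out
```

A "correction" pass could then do the opposite: exaggerate the red-green difference before display so it survives the observer's reduced sensitivity.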
"But you didn't say what colour it was, so I made it a red truck."
Generally it's interesting to see a NN filling in missing details. I'd like to see images with an element deleted, with a NN filling in the black spots, to see what its shape recognition can really do.
There is no way, from the greyscale, to know that the sky should be orange.
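A tiny example of why: two quite different colors can land on exactly the same gray level (using Rec. 601 luma; the specific color values are just ones I picked to collide):

```python
def to_gray(rgb):
    # standard Rec. 601 luma weights for RGB -> grayscale
    r, g, b = rgb
    return round(0.299 * r + 0.587 * g + 0.114 * b)

orange_sky = (255, 128, 0)  # e.g. a sunset sky
teal = (84, 180, 180)       # a blue-green chosen to have equal luma
# both collapse to the same gray level, so inverting the mapping is
# underdetermined -- the network can only guess from context
```

So from the gray value alone, a sunset sky and an overcast teal one are literally the same pixel; only learned priors about scene context can pick between them.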
I've found this: [1], but the results seem somewhat disappointing. One of the problems is that the quality measures are (in my case) subjective (the results should look convincing but need not be "perfect", whatever that may mean).
[1] http://engineering.flipboard.com/2015/05/scaling-convnets/