"a woman in a kitchen, leaps in the air while attempting to balance a glass cup in one hand."
and
"a young man playing wii in front of a large knife."
but then I realized that I cant come up with a description of what is going there myself.
Ill get you next time CPU!
I would really like to know how this works because the captions are... interesting.
The singularity is not going to be an AI revolution, it's when we decode the human brain's signals.
http://www.cs.toronto.edu/~nitish/nips2014demo/results/84824...
Generated caption: "a man appears to be a banana on a tree"
http://www.cs.toronto.edu/~nitish/nips2014demo/results/92679...
"a man wielding an electric razor is gleefully shaving away another man ' s hair ."
Hilarious!
I think they've stumbled on computer-generated comedy. Some funny stuff in there.
It must be making some very broad generalisations to come up with the tag 'homosexuals'...
Nitish Srivastava, co-instructor for CSC 321 : Intro to Neural Networks ( http://www.cs.toronto.edu/~rgrosse/csc321/ ) pretrains a convolutional neural net using image sets ( https://github.com/torontodeeplearning/convnet/tree/master/e... and https://github.com/torontodeeplearning/convnet/tree/master/e... )
also has a demo to upload your own images and get them captioned or classified http://deeplearning.cs.toronto.edu/i2t but seems their servers are getting blasted right now