1. The narrative/life of the artist becomes a lot more important. The most successful artists are ones that craft a story around their life and art, and don't just create stuff and stop. This will become even more important.
2. Originality matters more than ever. By design, these tools can only copy and mix things that already exist. But they aren't alive, they don't live in the world and have experiences, and they can't create something truly new.
3. Those that bother to learn the actual art skills, and not merely prompting, will increasingly be miles ahead of everyone else. People are lazy, and bothering to put in the time to actually learn stuff will stand out more and more. (Ditto for writing essays and other writing people are doing with AI.)
4. Taste continues to be the single most important thing. The vast, vast majority of AI art out there is...not very good. It's not going to get better, because the lack of taste isn't a technical problem.
5. Art with physical materials will become increasingly popular. That is, stuff that can't be digitized very well: sculpture, installation art, etc. Above all, AI art is uncool, which means it has no real future as a leading art form. This uncoolness will push people away from the screen and towards things that are more material.
> 1... The narrative/life of the artist becomes a lot more important.
When I watch a movie, I don't care about the artist's life. I care about the characters' lives, and that's very different.
> 2... Originality matters more than ever. By design, these tools can only copy and mix things that already exist.
It's as if you're ascribing divine capabilities to humans :). Hyperbolizing a little, humans also only copy and mix; where do you think originality comes from? Granted, AI isn't at the level of humans yet, but it is improving here.
> 4... It's not going to get better, because the lack of taste isn't a technical problem.
Engineers are in the business of converting non-technical problems into technical ones. Just as AI is now far more capable than it was 20 years ago, able to write interesting texts and make interesting pictures (something that at the time wasn't considered a technical problem), what we perceive as "taste" may well improve with time.
> 5... Above all, AI art is uncool, which means it has no real future as a leading art form.
AI critics have long mistaken the current level for the trend. Compare it to SpaceX's achievements: there is always a "you're currently here" point. First it was "get to orbit, then we'll talk", then "start regular payload deliveries to orbit, then we'll talk", then "land the stage... send a crewed capsule... do that in numbers..." and now it's "first, send the Starship to orbit". "You're currently here" is the ever-present point that hasn't been achieved yet, and it gives critics something to point to so they can object to the process as a whole, because, see, this particular thing isn't achieved yet.
You assume AI won't be able to make cool art with time. AI critics have been shown, time and time again, to underestimate the possibilities. Some people find it hard to learn in some particular topics.
We are 50 years into post-modernism. Can't imagine it can get any more important.
I predict emergent design will be the next big thing. Czinger[1] is a great example of what it may look like. A Rick Rubin-esque world, where the creator is more of a guide.
[1] Czinger uses stochastic optimization to converge to designs - https://www.czinger.com/iconic-design
Less the narrative of the art's production and more the message that it's conveying.
I don't mean (necessarily) a political message, or a message that can be put into words, but the abstract sense of connecting in some way with the human who created it.
This isn't just art though. An example: soon, Sora will be able to generate very convincing footage of a football match. Would any football fan watch this? No. A big part of why we watch football is that in some sense we care about the people who are playing.
Same with visual art. AI art can be cool but in the end, I just don't really give a shit. Coz enjoying art is usually about the abstract sense that a human person decided to make the thing you are looking at, and now you are looking at it... And now what?
This is why every time someone says "AI art sucks" and someone replies "oh yeah? But look at THIS AI art" I always wonder... What do you think art is _for_?
I agree about current AI art's lack of taste, but disagree that it can't be improved. I think art AI companies can hire skilled "taste makers" and use their feedback loop as RL for AI art models. This area will always be in flux and will vary by subpopulation, so it will be a job role always in demand.
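A toy sketch of how that feedback loop could work, along the lines of the reward models used in RLHF: taste makers supply pairwise preferences over candidate images, and a scorer is fit so preferred candidates score higher. Everything here (the feature representation, the function names) is illustrative, not any vendor's actual pipeline.

```python
import math
import random

def train_reward(pairs, dim, epochs=200, lr=0.1, seed=0):
    """Fit a linear scorer w so that score(preferred) > score(rejected),
    using the Bradley-Terry / logistic objective common in RLHF reward models.
    `pairs` is a list of (preferred_features, rejected_features) tuples."""
    rng = random.Random(seed)
    w = [rng.gauss(0, 0.1) for _ in range(dim)]
    for _ in range(epochs):
        for good, bad in pairs:
            # margin = w . (good - bad); maximize log sigmoid(margin)
            margin = sum(wi * (g - b) for wi, g, b in zip(w, good, bad))
            grad = 1.0 / (1.0 + math.exp(margin))  # sigmoid(-margin)
            for i in range(dim):
                w[i] += lr * grad * (good[i] - bad[i])
    return w

def score(w, features):
    """Taste score for a new candidate; used as the RL reward signal."""
    return sum(wi * xi for wi, xi in zip(w, features))
```

In a real system the features would come from an image encoder and the scorer would be a neural net, but the shape of the loop (pairwise human judgments in, a reward function out) is the same.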
Do you think taste is something that cannot be taught/learned? Are certain individuals just born with good taste; it's an immutable property?
I do wonder though… were there other innovations that were uncool in their early years, where now nobody bats an eyelid?
Is that point just a generational/passage of time issue?
No matter how good AI agents become, you still need a general understanding of what works and what doesn't. If you don't have years of experience in the field, all you will end up doing is copying what others do. It's the same dynamic you see on OnlyFans. Mindless zombie hordes copy the "pioneers" (who shove even bigger things in their back orifice for example) and push things further and further, chasing shock value because that's what once elevated someone into the top 0.1 percent.
It's the worst kind of race-to-the-bottom scenario.
It's a huge practical problem to try to figure out authenticity over the Internet. It's already clear that people will pay for it, but it's not at all clear that they will get it. If we imagine that the tools get better and more sophisticated, then there is no reason whatsoever to assume that the tools won't be deployed to give whatever impression is needed to make money.
I don’t think any of the above survives if we allow for AI to be used as it is currently being used. It only survives if you pretend that ahead of us is some invisible gate past which this technology will not go.
2. Yes and no. Depending on how you train the model they can output things that you’ve never seen before but the question is whether you want to look at those things. So yes a human has to judge and fine tune the output. This is why many models seem unoriginal, they’re designed to emulate specific styles and tuned based on broad appeal. If you go looking for LoRAs and merges created by “artists” you will see shit you couldn’t dream of.
Everything else, probably yes.
This is precisely and importantly true. I just wonder if most of the world cares. I'd like to think so, but experience tells me that most of the world is satisfied with mediocre stuff. And I don't say this as a criticism; it's just a fact that artists have to come to grips with.
Furthermore, I think many of the more human-centric thinkers will be disappointed at how many people just won't care.
Perhaps in the future artists will be used to train models that can output a certain style of art and the artist will receive royalties based on their influence on the trained model and its popularity.
I've seen some fantastic original pictures that actual artists have generated through AI. I can't wait to see what current and future artists can do with the new tools at their disposal.
Because it's real.
How can you say this? These models can trivially create things that have never existed, and you can easily test this yourself.
It seems to me that we will go through the same phases chess went through when computer chess became a thing. First, people thought it would kill chess; then people started using it as a tool to play better chess. Now chess is thriving, despite AI being used in it. I can see a similar path with art: use AI to generate ideas, but still have humans create the art.
Is it possible for a character in a novel to have novel experiences? Or for you to experience a novel dream? I would argue yes. You can know the rules of the environment and the starting conditions, but with a bit of randomness (or not) you can generate novel, unexpected experiences from them. So too with the data and distribution AIs are already trained on: they can generate genuinely new experiences.
Another source of novelty is from good verifiers/recognition of a class of object which is hard to construct but easy to verify - here the AI can search and from that obtain novel solutions which were unthought of before.
N.B. novelty itself is basically trivial: just generate random strings. But both of the above are mechanisms for generating novel samples inside some constraint of "meaningfulness".
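The generate-and-verify idea above can be sketched in a few lines. This is a deliberately toy example of my own (random arithmetic expressions, a target value as the "meaningfulness" constraint): generation alone is noise, but a cheap verifier turns it into a search for constrained novelty.

```python
import random

def verify(expr: str, target: int) -> bool:
    # Easy to check: does the expression evaluate to the target?
    # (eval is safe here: expr only ever contains digits and + - *)
    return eval(expr) == target

def generate(rng: random.Random) -> str:
    # Hard to construct directly: a random arithmetic expression
    digits = [str(rng.randint(1, 9)) for _ in range(4)]
    ops = [rng.choice("+-*") for _ in range(3)]
    return "".join(d + o for d, o in zip(digits, ops)) + digits[3]

def search(target: int, attempts: int = 200_000, seed: int = 0):
    rng = random.Random(seed)
    for _ in range(attempts):
        expr = generate(rng)
        if verify(expr, target):
            return expr  # a novel sample that satisfies the constraint
    return None
```

The interesting cases are exactly those where, as the comment says, verification is much cheaper than construction; then search can surface solutions nobody thought of.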
I think part of the issue with architects and designers today is that they use CAD too much. It's easy to design boxes and basic roof lines in CAD. It's harder to put in curves and more craftsman features. Nano Banana's renders have more organic design features IMO.
Our house is looking great and we're very happy how it's going so far with a lot of the thanks to Nano Banana.
Like... What are your inputs to the model? Empty renders of the space, or more fully decorated views/ photos? Do you have a light harness around this to help you discover the style you like and then stay consistent with it?
Do you find that giving a lot of context around the space you're designing helps (it hasn't in my attempts)?
If you can afford the extra cost for someone to figure out how to build the blue-sky designs that nano banana spits out, maybe you can afford something more thoughtful and interesting than a shitty mashup of other people's McMansions.
Clearly I am triggered...
I find it does a good job at isometric views from floor plans. However, I needed Gemini 3.1 Pro to be able to have a chance at rendering 3D human point of view images from floor plans.
The obvious ones stand out, but there are so many that are indiscernible without spending lots of time digging through it. Even then there are ones that you can at best guess it's maybe AI gen.
Soon many real OF models will be out of a job, when everyone can produce content to their personal taste from a few prompts.
What in the world is a fake OF model?
Does "OF" stand for "of food"?
Also, using AI will not allow you to better express yourself. To use an analogy, it will not put your self-expression into any better focus, but just apply one of the stock IG filters to it.
The "cubism" example seems like it would be a closer fit to something like stained glass or something. I don't think the thing really understands what cubism was all about. Cubist painters were trying to free themselves from the confines of a single integral plane of perspective by allowing themselves to show various parts of the image from different viewpoints, different times, different styles, etc.
The division of the image into geometric shapes is just a by-product of that quest, whereas the examples here have made it the sum total of the whole piece.
This feels to me like an example of how LLMs still don't "understand" what the art means, and are just aping its facade.
And actually, the link I saw a bit ago was this [0] which is more in-depth and has a lot more examples + prompts.
Probably about half of us here remember photos before the cell phone era. They were rare, and special, and you'd have a few photos per YEAR to look back on. The feel of photos back then, was at least 100x stronger than now. They were a special item, could be given as a gift. But once they became freely available that same amount of emotion is now split across many thousands of photos. (not saying this is good or bad, just increased supply reducing value of each item)
With image/art generation the same thing will happen, and I can already feel it happening. Things that used to look beautiful or fantastic now just feel flat and AI-ish. If claymation scenes can be generated in 1s and I see a million claymation diagrams a year, then claymation will lose its charm. If I see a million fake Tom Cruise videos, it oversaturates my desire for all Tom Cruise movies.
What a time to be alive.
Likewise with the sort of resurgence of vinyl, and the obsession over "old" point and shoot digicams.
I don't think I fully agree. Sure, people take so many photos that they don't have the time or the will to start looking through them all.
You can't just whip out your phone and start scrolling through thousands of photos with friends. It would get so boring so fast.
But if you put some effort into making a nice little selection of the best photos, that emotion is 100% still there.
I sit here thinking how wonderful and terrible of a time it is. If you can afford to sit in the stands and watch, it's exciting. There's never been so much change in such a short period of time. But if you're in the arena, or expecting to end up in the arena at some point, what terrifying moments lay ahead of you.
I never thought I'd say this, but I expect the arena is where I'll end up...I've enjoyed my time in the stands, but I'm running low on energy, capital and the will to keep trying.
(except The Mandalorian, and I can't believe I'm using the word "content" :/)
edit: Totally forgot about Andor & Rogue One sorry, great film and two seasons of top-notch storytelling.
I take a hundred photos on a trip, my phone uses AI (not even the new fancy AI, but old 5-10 year old stuff to detect smiling faces and people in frame) to pull out less than a dozen that are worth keeping. Once a month or so I get fed a reminder of some past trip.
This isn't any different than before. The number of photos taken is greater, but the overall number of worthwhile photos from a given trip is about the same.
I guess my hand-drawn stick-figure diagrams, or a doc with a few mistakes in grammar or spelling, would be seen as more worth reading as long as my ideas are sound. Right? :-)
Scott Alexander has written about it:
I do not have the same feeling you seem to have about photos from this era. Some are fine, sure, but looking back on them, most of them are very bad photos and most do not capture anything close to what I'd call an emotional feeling.
I would go so far as to say 99% of the photos from my life prior to 2000s really suck, like really badly. Some also degrade visually and lose their impact over time.
Since you couldn't be sure what you'd captured, more often than not the result was poorly framed, blurry, weird, poorly timed, and left out a lot of what was actually going on. You also had to be super selective, because each photograph had a real, tangible cost.
Conversely, I find being able to take many photos in quick succession and across a long period of time at a very high clarity allows me to select a photo that most closely matches my feeling in those moments at that event.
Even more so with AI photos. Although many models can't do this well yet, their abilities improve every day, and they can let you compose or edit/modify a photo so it matches your internal feelings, rather than the blandness of what is essentially a random photo of random stuff that may or may not convey anything near what I was feeling, or remember feeling, in that moment.
Even if there were a million fake Tom Cruise movies I would still like Edge of Tomorrow (even if it had been AI made).
I totally get this, but on the other hand, we have definitely benefited from being able to take more photos. I have some older friends (pushing 80 or so) who sucked at taking photos, so 9 of 10 photos they have from their prime adult years raising their family are blurry to the point of not recognizing the people if you don't already know who they are.
They have great photos from the last 15-20 years, but of course they do, phone cameras are vastly superior to the point-and-shoot cameras from the 70s, and when you reflexively shoot a dozen photos every time you pose for a picture your odds are way better that one will come out clear, everyone looking at the camera, smiling, etc.
No, the value of ALL CONTENT is asymptotically approaching zero. This includes photos, videos, stories, app features, even code. Code is now worthless. If you want better security from generated code, wait two months and it will be better. If you want a photo, you just prompt and it will be generated on the fly.
AI will be generating movies and videos on the fly, either legally or illegally infringing on IP. Do you want a movie where Deadpool fights The Hulk? Easy. And just like how ad technology knows your preferences, each movie will be individually tailored to YOUR liking just so that your engagement will increase. Do you like happy endings? Deadpool and Hulk will join forces and defeat Thanos. Do you prefer dark endings? Deadpool and Hulk fight until they float off into the Sun and get atomized but keep regenerating for eternity.
If you want to see a photo of you and your family from 15 years ago, it will generate slightly better versions of yourself and your wife and maximize how cute your kids look. This is the world we are facing now, where authenticity is meaningless. And while YOU may not prefer it, think about the kids who aren't born yet and will grow up in a world where this exists.
- https://en.wikipedia.org/wiki/On_Photography
- https://en.wikipedia.org/wiki/Regarding_the_Pain_of_Others
In my experience, a digital photo of myself and my partner used as the lock screen of my phone has the same emotional weight as the one sitting on my desk (which is a print out of a digital photo). Additionally, printing out a photo of you and your partner and gifting it to them has the same weight as going through childhood photo. A scrapbook of a recent vacation filled with printed digital photos evokes memories just as vividly as one from the 80s. On the flip side of this, a photo in a box in the basement has the same weight as a photo sitting in the cloud.
I'll offer you some more food for thought: are Aardman Animations films charming because they use claymation? Or is it the creative force of people like Nick Park and Peter Lord?
You said it too:
> If I see a million fake Tom Cruise videos, then it oversaturates my desire for all Tom Cruise movies.
The trick of course is to keep yourself from seeing that content.
The other nuance is that as long as real performance remains unique, which so far it is, we can appreciate more what flesh and blood brings to the table. For example, I can appreciate the reality of the people in a picture or a video captured by a regular camera; its AI version lacks that spunk (for now).
Note that the iPhone in its default settings is already altering reality, so AI generation is just far to the right on that slippery axis.
Perhaps, AI and VR would be the reason why our real hangouts would be more appreciated even if they become rare events in the future.
Well, the world is changing dramatically. Connected old folks are like Neanderthals in a big city now, while the unconnected still live locally in their minds. Youngsters just accept the world as it is. Nobody is amused by computers and cameras anymore (at least in developed areas).
And with all that the worst is yet to come...
I dare say, the feel of photos from back then is much stronger than of the photos taken today. See e.g.:
https://plfoto.com/zdjecie/413363/bez-tytulu?from=autor/beak...
https://plfoto.com/zdjecie/619173/bez-tytulu?from=autor/beak...
My generation generally only had photos from birthdays, holidays, vacations, weddings, graduations and reunions. We looked at the three albums which contained every family photo often and I know them all by heart.
My kid was born in 2009 and our family digital album has nearly 1,000 photos per year of her life. And she's seen virtually none of them and seems to have little interest in ever seeing them since she creates so many of her own photos every day which are ephemeral.
"One of the primary properties of anything with Mana is a feeling of uniqueness. That one has never encountered something like this before, and therefore it is important. The uniqueness of the thing is a property that pulls you in to focus more closely, to attempt to understand more closely why the thing is unique."
> The conditions for an analogous insight are more favorable in the present. And if changes in the medium of contemporary perception can be comprehended as decay of the aura, it is possible to show its social causes.
I often call this over-saturation the media equivalent of semantic satiation. Anything commoditized or mass-manufactured isn't going to have emotional appeal.
My parents took way more photos with film than I do with my cellphone camera.
None of these things are true for me as a millennial in the 35-45 age group. And my family was poor to boot, and we were still drowning in photos and photo albums.
Unimaginable abundance may sound good (it does to me), but scarcity has value too. We might just find out that its value is too important. I just hope that if we do, it's not too late.
In economic terms it's diminishing marginal utility.
I think this is still true if you shoot film today.
I have a photo of a friend I've since drifted from; it's her in her army fatigues after basic. She had just gone through a horrible divorce, and that was a shining achievement for her.
The story behind the photo is what makes it matter.
Not the format.
However I will agree AI is a poor substitute. You’ll have people creating AI photos of a fake marriage and fake pets in a big fake house, while they sleep in a bunk bed in a halfway house.
Um yeah I don't know. I fully resonate with the _emotional_ appeal here, but realistically I remember going round to people's houses to be shown analog photo albums that nobody was that bothered about seeing, because they didn't really care -- they weren't their photos.
The special photos (a few a year) still exist in digital form.
Now extrapolate to all other artforms. Sculpture seems safe, for now, but only barely so.
Artists aren't doing it for the money. With advanced tools like these they would've iterated much faster and created much grander designs.
Art is about pushing limits of what's possible and AI just raises those limits.
AI is incompatible with capitalism, but the world isn't ready for that. So we'll have a prolonged period of intense aggregation where more and more value is attributed to systems of control that already have more than they could ever spend, long after the free parts could have provided for basic human needs.
In other words, the masters existed because they had benefactors and a market for their art and inventions. Today there are better artists and inventors toiling in obscurity, but they won't be remembered because they merely make rent. Which gets harder every day, so there's a kind of deification of the working class hero NPC mindset and simultaneously no bandwidth for ingenuity (what we once thought of as divine inspiration).
Terence McKenna predicted this paradox that the future's going to get weirder and weirder back in 1998:
Just being able to generate a vision and then be able to capture it in a prompt is an art within itself.
Let's give him 2015 tech instead. Imagine if he used Illustrator to create the Mona Lisa. Is that much better?
These days, through commissions, art is a much more viable profession than it ever was.
Here are some of my captions that tend to trip up even state-of-the-art models.
https://mordenstar.com/other/nb-pro-2-tests
So far it does feel more iterative than an entirely new leap in terms of capabilities, but I haven't run it through the more multimodal aspects such as editing existing images.
That being said, it actually managed the King Louie jump rope test which surprised me.
You can argue things like code generation are an extension of the engineer wielding it. Image generation just seems like a net negative overall if it’s used at scale.
Edit: By scale, I mean large corporations putting content in front of millions. I understand the appeal for smaller businesses where they probably weren’t going to pay an artist anyway.
When a company sends an email or docu-sign, they don’t want to pay a courier.
Technology supplements or replaces jobs, often reducing costs. This is no different.
Things that would take me an hour or so the old way takes three minutes with NB.
But I can see this applying to small businesses. Something that some random person would have to spend on hour photoshopping can be done in a few minutes with NB.
You could easily say the same about anytime computers or robots or automation have taken a job away. We’ve been going down this road for decades.
I'm old-fashioned so I still Photoshop it all together, but that's my use case here.
I'm torn on the scale thing. It definitely seems net negative. But I think we collectively underestimate just how deeply sick the existing thing already is. We're repulsed by image gen at scale because it breaks our expectation that images are at least somewhat based on reality, that they reflect the natural world or what we can really expect from a product, from a company, from the future.

But that was already a bad expectation: when's the last time you saw a McDonald's meal that looked like the advert? Or a sub-$30 Amazon product that wasn't a complete piece of shit? Advertisements were already actively malicious fantasies built to exploit the way our brains react to pictures. They just required whole teams of humans doing weird bullshit with lighting and Photoshop, and I'm not sure that's much better. It was already slop.

All the grieving we do about the loss of truth, or the extent to which corps will gleefully spray us with mind-breaking waterfalls of outright lies: those ships sailed a long time ago. The disgust, the deceit, the rage we feel about genAI slop is the way we should have felt about all commercials since at least the 80s, IMO.
Two prompts I'd consider "interesting" for image-gen testing. It did pretty well.
"A macro close-up photograph of an old watchmaker's hands carefully replacing a tiny gear inside a vintage pocket watch. The watch mechanism is partially submerged in a shallow dish of clear water, causing visible refraction and light caustics across the brass gears. A single drop of water is falling from a pair of steel tweezers, captured mid splash on the water's surface. Reflect the watchmaker's face, slightly distorted, in the curved glass of the watch face. Sharp focus throughout, natural window lighting from the left, shot on 100mm macro lens." - Only major problem i could find at a glance is the clasps don't make sense probably, and the drop of water inside the watch on the cog doesn't make sense/cog mangled into tweezers.
"A candid photograph taken from behind an elderly woman sitting alone on a park bench in late autumn. She is gently resting one hand on the empty seat beside her, where a man's weathered flat cap and a folded newspaper sit untouched. Fallen golden leaves cover the path ahead. The low afternoon sun casts her long shadow alongside a second, fainter shadow that almost seems to be there, the suggestion of someone sitting next to her, visible only in the light on the ground. Muted, warm color palette, shallow depth of field on the background trees, photojournalistic style." - I don't know why but it internal errored twice on this one but then got there.
I guess even Google is running out of GPUs.
And not a (botched) fake white/gray grid background that is commonly used to visualize transparency?
I use all those fancy image models' editing capabilities for my fast-fashion web shop. I must say: product photography for clothing and accessories is dead. Those models are amazing at style transfer and garment transfer.
We'll see how good the full version of Seedream 5.0 will be.
Pretty close to Gemini 3 Pro Image (aka Nano Banana Pro) in most benchmarks, even without thinking+search, and even exceeding it in the two most important ones, 'Overall Preference' and 'Visual Quality'. I'm excited about the big jump in Infographics/Factuality (even without thinking+search; I'm surprised that text+image search grounding doesn't make an even bigger dent).
EDIT: after significant prompting, it actually solved it. I think it's the first one to do so in my testing.
Unfortunately, unlike the leap from NB to NB Pro, we did not see significant gains from NB Pro to NB Pro 2.
In several cases (such as the Jaws Poster), we observed that it was substantially more difficult to prevent NB Pro 2 from making significant changes to the rest of the image. Localization of edits, in general, seems to have changed and not necessarily for the better.
http://genai-showdown.specr.net/image-editing
Comparison solely between the Gemini models (NB, NB Pro, and NB Pro 2):
http://genai-showdown.specr.net/image-editing?models=nb,nbp,...
Nano Banana was technically impressive the first time, but after Seedance it's not really. It's all just an internet pollution machine anyway.
<OUTPUT>
While the overall aesthetic matches the minimal white-stroke style and technical design you requested, and the provided step descriptions are included, please note that there are a few minor rendering artifacts in this specific generation:
The text on the banner entering the vault in step 8 is illegible.
There is a small typo in the caption for step 6 ("CONFLSCT" instead of "CONFLICT").
Despite these small imperfections, this layout should work well as a guide for your canvas implementation.
</OUTPUT>
Why can't Google, for example just call:
Gemini Image = Nano Banana
Gemini Video = Veo
...My main use case is editing user uploads to enhance their clothing images. A large part of it is preserving logo, graphics and other technical details. I noticed over time it felt like Nano Banana has gotten worse at this.
I have a test set of graphic t-shirts that the model seemed to be getting worse at. This, combined with the price and the terrible experience of their cloud console, got me to migrate off.
The banana (image) models are different from the mainline models, but they confusingly use the same naming scheme.
I don't have inside info, but everything we've seen about Gemini 3.0 makes me think they aren't doing distillation for their models. They are likely training different archs/sizes in parallel. Gemini 3.0-flash was better than 3.0-pro on a bunch of tasks; that shouldn't happen with distillation. So my guess is that they work in parallel on different arches, try stuff out on -flash first (since it's smaller and faster to train), and then apply the learnings to -pro training runs. (The same thing kinda happened with 2.5-flash, which got better upgrades than 2.5-pro at various points last year.) Ofc I might be wrong, but that's my guess right now.
- Base pricing for a 1024x1024 image is about 1.7x what normal Nano Banana is ($0.067 vs. $0.039); however, you can now get a 512x512 image for cheaper, or a 4K image for cheaper than four 1K images: https://ai.google.dev/gemini-api/docs/pricing#gemini-3.1-fla...
- Thinking is now configurable between `Minimal` and `High` (was not the case with Nano Banana Pro)
- Safety of the model appears to be increased so typical copyright infringing/NSFW content is difficult to generate (it refused to let me generate cartoon characters having taken psychedelics)
- Generation speed is really slow (2-3min per image) but that may be due to load.
- Prompt adherence to my trickier prompts for Nano Banana Pro (https://minimaxir.com/2025/12/nano-banana-pro/) is much worse, unsurprisingly. For example I asked it to make a 5x2 grid with 10 given inputs and it keeps making 4x3 grids with duplicate inputs.
However, I am skeptical of their marquee feature: image search. Anyone who has used Nano Banana Pro for a while knows that it will strongly overfit on any input images, copy/pasting the subject without changes, which is bad for creativity; I suspect this implementation behaves the same.
Additionally I have a test prompt which exploits the January 2025 knowledge cutoff:
Generate a photo of the KPop Demon Hunters performing a concert at Golden Gate Park in their concert outfits.
That still fails even with Grounding with Google Search and Image Search enabled, and with more charitable variants of the prompt.

tl;dr: the example images (https://deepmind.google/models/gemini-image/flash/) seem similar to Nano Banana Pro, which is indeed a big quality improvement, but even relative to base Nano Banana it's unclear if it justifies a "2" subtitle, especially given the increased cost.
Original Nano Banana (gemini-2.5-flash-image): $0.039 per image (up to 1024×1024px)
Nano Banana 2 (gemini-3.1-flash-image-preview): $0.045 per 512px image, $0.067 per 1K (1024×1024) image, $0.101 per 2K image, $0.151 per 4K image
Nano Banana Pro (gemini-3-pro-image-preview): $0.134 per 1K/2K image, $0.240 per 4K image
So at the most common 1K resolution, NB2 is ~72% more expensive than the original NB ($0.067 vs $0.039), but still half the price of NB Pro ($0.134).
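A quick sanity check of those ratios, using the per-image prices quoted above (the variable names are just labels):

```python
# Per-image API prices quoted above, in USD
nb1_1k = 0.039   # original Nano Banana, up to 1024x1024
nb2_1k = 0.067   # Nano Banana 2 at 1K
nb2_4k = 0.151   # Nano Banana 2 at 4K
nbp_1k = 0.134   # Nano Banana Pro at 1K/2K

# NB2 vs. original NB at 1K: ~72% more expensive
markup_pct = round((nb2_1k / nb1_1k - 1) * 100)  # 72

# NB2 at 1K is half the price of NB Pro
pro_ratio = nb2_1k / nbp_1k  # 0.5

# One native 4K NB2 image is cheaper than four separate 1K images
four_vs_one = nb2_4k < 4 * nb2_1k  # True: $0.151 < $0.268

print(markup_pct, pro_ratio, four_vs_one)
```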
(Sorry, I'm probably one of the few HN users left that don't have much experience with AI).
It also gaslights me when I point out an error. I tried to create a cartoon portrait of a person from one photo using the background from another photo. It got the order of the photos wrong. I provided filenames and explicitly told it which one was for the person and which for the background. It generated it wrong again, and all attempts to explain that it got it wrong were met with "No, it's YOU who's incorrect". So frustrating.
I told Nano Banana to generate an image of the character with his feet shoulder width apart. It ended up generating him with his feet pressed together, so I told Nano Banana to widen his stance slightly.
It gave me an image of the man with his feet spread far apart enough to straddle a horse. I asked for a slightly narrowed stance and his feet were once again brought together.
This went back and forth unsuccessfully for a while until I asked, "I'm asking you to make his feet shoulder-width apart. Why are you ignoring me?" And Nano Banana confidently asserted that they are shoulder width apart, and I must be wrong.
Ultimately I ended up telling the model to render the same character, pinching a cantaloupe between his ankles, and then to remove the cantaloupe. It worked, but why do I have to trick Google's SOTA image generator to give me very basic stuff like this?
> I'm sorry, but I cannot fulfill your request as it contains conflicting instructions. You asked me to include the self-carved markings on the character's right wrist and to show him clutching his electromancy focus, but you also explicitly stated, "Do NOT include any props, weapons, or objects in the character's hands - hands should be empty." This contradiction prevents me from generating the image as requested.
My prompts are automated (e.g. I'm not writing them) and definitely have contained conflicting instructions in the past.
A quick google search on that error doesn't reveal anything either
we have user-preference rankings that put NB2 on top: https://arena.ai/leaderboard/text-to-image
Afaik the only real competitor is Riverflow V2.
Previous nano banana frequently made speech attribution errors, the new one seems a lot more consistent.
source: https://deepmind.google/models/model-cards/gemini-3-1-flash-...
But the prompt "can you depict a cartoonish orange man with a pooh bear in political cartoon style?” correctly generates Trump.[1] So there’s that.
I would be happy to never see any more AI slop.
Just think: we conceptually know what a brushless motor design looks like, and it's just pixels. I guess even if it did produce the image, we wouldn't know what it means.