The AI equivalent of saying “don’t think of a polka dotted purple elephant”
I specifically used the phrase “Please generate an image of a room with zero elephants”. It created two images that were almost identical and both contained pictures/paintings of elephants in frames. Cheeky.
I responded with “Each image contains an elephant.”
It generated two more, one of which still had a painting of an elephant.
Now I’m out of generations until tomorrow. Overall a fairly shit first experience with DALL-E.
To be fair, both those rooms have almost no elephant in them.
Why is this my favorite thing today.
The elephant kinda looks like he knows he wasn’t supposed to be there.
Ahhh I couldn’t figure out why I found the picture so funny, that’s why! Hahah thanks
Same with Bing!
Yep
Don’t say anything!
You should have seen how many there were before it drew the room.
… I don’t see an elephant. Oh hey, by the way, can someone help me with this captcha?
Yeah, telling AI what not to do is highly ineffective.
“Do not injure a human or through inaction allow a human to come to harm.”
Case in point, Asimov’s laws never worked haha
Yeah, but in Asimov’s case it was because a strict adherence to the Three Laws often prevented the robots from doing what the humans actually wanted them to do. They wouldn’t just ignore a crucial “not” and do the opposite of what a law said.
I decided to go try this. It’s being a smart ass.
Is that the Futurama font?
It is, I think. And the wall is the color of the ship.
No, this is correct. The four elephants you see through the window are outside the room. The several elephants on the wall are pictures, they aren’t actual elephants. And the one in the corner is clearly a statue of an elephant, as an actual elephant would be much bigger.
What about the tusked drapes?
Ceci n’est pas un éléphant
This is a very human reaction, actually. You try picturing zero elephants if told to.
I just did. It was filled to the brim with flamingoes.
I gotta see that
Now do an empty room with absolutely no elephants
Give me some credit, I was doing really well up until about the point where you said elephants
“but you drew…” “Don’t mention it.”
“can you draw a room with absolutely no elephants in it? not a picture not in the background, none, no elephants at all. seriously, no elephants anywhere in the room. Just a room any at all, with no elephants even hinted at.”
I’m getting the impression the “Elephant Test” will become famous in AI image generation.
It’s not a test of image generation but of text comprehension. You could rip CLIP out of Stable Diffusion and replace it with something that understands negation, but that’s pointless: the pipeline already takes two prompts for exactly that reason. One is for “this is what I want to see”, the other for “this is what I don’t want to see”. Both get passed through CLIP individually, so CLIP on its own doesn’t need to understand negation; the rest of the pipeline just has to have a spot to plug in both positive and negative conditioning.
Mostly it’s just KISS in action, but occasionally it’s actually useful, since you can feed it conditioning that’s not derived from text. That way you can tell it “generate a picture which doesn’t match this colour scheme here” or something. Say the positive conditioning is the text “a landscape” and the negative conditioning is an image, the archetypal “top blue, bottom green”. Now it’ll have to come up with something more creative, because the conditioning pushes it away from what it considers normal for “a landscape” and would generally settle on.
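For anyone wondering what “a spot to plug in negative conditioning” looks like, here’s a rough sketch, assuming the usual classifier-free guidance formula, where the negative prompt’s prediction takes the place of the unconditional one. The function name `guided_noise` and the toy two-number “predictions” are made up for illustration; a real pipeline does this on large tensors at every denoising step.

```python
from typing import List


def guided_noise(pos: List[float], neg: List[float], scale: float = 7.5) -> List[float]:
    """Combine two noise predictions, classifier-free-guidance style.

    pos:   denoiser output conditioned on the positive prompt
    neg:   denoiser output conditioned on the negative prompt
    scale: guidance scale (7.5 is just an illustrative default)

    Start from the negative prediction and step toward the positive one,
    overshooting when scale > 1, which pushes the result away from `neg`.
    """
    return [n + scale * (p - n) for p, n in zip(pos, neg)]


# Toy stand-ins for the two denoiser outputs (hypothetical values).
pos_pred = [1.0, 0.0]  # conditioned on "a room"
neg_pred = [0.0, 1.0]  # conditioned on "elephant"

print(guided_noise(pos_pred, neg_pred, scale=2.0))  # [2.0, -1.0]
```

Note that nothing in there parses the word “no”. The negative prompt only ever says “elephant”, and the arithmetic does the steering away from it.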
“Can you a room as aboluteyy no eleephant it all?”
Dunno what’s giving more “clone of a clone” vibes, the dialogue or the 3 small standing “elephants” in that image.
“We do not grant you the rank of master” - Mace Windu, Elephant Jedi.
Thought about this prompt again and figured I’d see how it was doing now, so this is the seven-month update. It’s learning…
DALL-E:
Edit: Changed “aloud” to “allowed.” Thanks to M137 for the correction.
“Aloud”
Seriously?
This is the society you all have created by bullying the Grammar Nazis off the internet.
I welcome the help. English, fat fingers, and fading memory make for strange bedfellows.
I’m prone to typos and I don’t use auto-correct. Appreciate the notice.