Let’s see how SDXL does @aihorde@lemmy.dbzer0.com draw for me a red sphere on top of a blue cube. Behind them is a green triangle, on the right is a dog, on the left is a cat
Yeah… SD up till now has been just really good at people but terrible at multiple concepts. I’ve been pretty impressed with Dall-E 3, hoping SD 3 catches up or surpasses it.
Given the infinite pockets of OpenAI, I doubt this is possible, But if they get close enough, the FOSS community is having weekly breakthroughs and can take it much further. Just look at how good the SD 1.5 finetunes and customization is by now
Let’s see how SDXL does @aihorde@lemmy.dbzer0.com draw for me a red sphere on top of a blue cube. Behind them is a green triangle, on the right is a dog, on the left is a cat
Here are some images matching your request
Prompt: a red sphere on top of a blue cube. Behind them is a green triangle, on the right is a dog, on the left is a cat
Style: fustercluck
It’s ok SDXL, you tried
What a fustercluck
Yeah… SD up till now has been just really good at people but terrible at multiple concepts. I’ve been pretty impressed with Dall-E 3, hoping SD 3 catches up or surpasses it.
Given the infinite pockets of OpenAI, I doubt this is possible, But if they get close enough, the FOSS community is having weekly breakthroughs and can take it much further. Just look at how good the SD 1.5 finetunes and customization is by now
SD 1.5 needs something like controlnet and inpaint to get close to Dall-E 3. I’m just amazed how Dall-E can do all that without any extra work.
But yeah, really hoping 3 has the community friendly tunability with at least some of that power that Dall-E has.
Heh, that third picture with the blue cat face. Funny, the other cat has the colors of the dog it wanted, but turned it into a cat.