I’m a little scared how well the new model works

Artstation <– more; I’m a little too lazy to upload and link all the photos on Lemmy

  • kostel_thecreed@lemmy.ca
    8 upvotes · 1 year ago

    Holy shit that looks realistic. I’m on /r/all (idk what the Lemmy term is for it) and thought this was a random picture from the pics subreddit. Fascinating how good this AI has become! Are the prompts complicated, or are they far simpler to create now?

    • Thelsim@sh.itjust.worksM
      6 upvotes · 1 year ago

      That’s one of the most realistic images I’ve seen so far; even the letters on the riot police say “Police”.
      Usually letters are just some weird gobbledygook.

    • Anime@lemmy.pipipopo.plOP
      5 upvotes · edited 1 year ago

      You can write to it in normal sentences, and it will understand everything. I think it will be really hard to distinguish the real from the fake.

      • kostel_thecreed@lemmy.ca
        2 upvotes · 1 year ago

        If you hadn’t said AI in the title, I would have been clueless lol. It’s more than scary at this point, but also beautiful - excited to see the direction AI will take.

        • D4rkstalker@pawb.social
          3 upvotes · 1 year ago

          At the start of this year, people on Twitter were like “This image is so obviously AI generated because it’s terrible.” And now, half a year later, it’s “how dare you trick us into liking this image by not clearly labelling it as AI generated.”

      • kostel_thecreed@lemmy.ca
        1 upvote · 1 year ago

        It would seem so awkward (to me at least) if I were to randomly say “I was browsing all”, as it makes no sense in a way. Would “I was browsing /c/all” be better? No clue lol.

      • b3nsn0w@pricefield.org
        2 upvotes · 1 year ago

        oh wow, i expected some highly tuned community model. these shots are incredible

        i only hope that we’ll retain the same level of control over the process that we currently have with controlnet and dreambooth/lora

  • I’ve had trouble getting AI to understand the concept of a mid-rise or medium-density city. No matter how you describe the city, it’s always a densely populated wasteland, which worries me about the way AI perceives what cities are and why we have them.

    • leonardo_arachoo@lemm.ee
      3 upvotes · 1 year ago

      I don’t think we can consider AI as a monolith. A text-to-image AI surely has no conception at all of what a city is for. An LLM might have such a concept, but I wouldn’t be worried about what it thinks based on the limitations of a totally unrelated model.