Multiple Subjects in a SINGLE image - Latent Couple for Stable Diffusion!

  • Published on 15 Oct 2024
  • Getting two different subjects in a single image, such as a cat and a dog or a man and a woman, can be quite tricky! Thankfully, the Automatic1111 WebUI for Stable Diffusion comes packed with available extensions, and these can aid us on our quest for complete control... Yay for Latent Couple! This is so awesome I’m sure you’ll see it everywhere soon 😉
    == Links! ==
    Automatic1111 Web UI - github.com/AUT...
    ControlNet Extension - github.com/Mik...
    Installing ControlNet - • How To Install And Use...
    Multiple ControlNets - • Multi-ControlNet and m...
    Composable Diffusion - • Stable Diffusion AND C...
    Fork with masks - github.com/ash...
    How do I create an animated SD avatar? - • Create your own animat...
    Installing Anaconda for MS Windows Beginners - • Anaconda - Python Inst...
    Stable Diffusion Playlist! - th-cam.com/users/pl...
    == Interested in adding things to your AI Art? Try these! ==
    Dreambooth Playlist - • Stable Diffusion Dream...
    Textual Inversion Playlist - • Stable Diffusion Textu...

Comments • 119

  • @amj2048
    @amj2048 1 year ago +48

    oh wow 😯 we are getting to the point where we need a better interface that can take advantage of all these tools and make them easier to use, it's so cool to see what is happening though

    • @audiogus2651
      @audiogus2651 1 year ago +7

      yah someone must be cooking up a poser type interface with 3d posable hands etc etc

    • @Ghost_Text
      @Ghost_Text 1 year ago +5

      Seriously. Outside of Invoke and all these Photoshop clones, there hasn't been a simple GUI to quickly integrate all these tools. Will it be Adobe or Blender, due to the ControlNet functions? Who knows.

    • @ShawnFumo
      @ShawnFumo 1 year ago +2

      @@audiogus2651 Funny timing! Aitrepreneur just posted a video showing the new special Blender model which lets you pose the skeleton and hands and feet and you can save out the openpose colored skeleton but also a depth map and canny map of the hands and feet.

  • @vi6ddarkking
    @vi6ddarkking 1 year ago +16

    We are almost there. The ultimate version of Stable Diffusion is almost here.
    It will be a Blender add-on that will combine the recently released Blender skeleton for Multi-ControlNet.
    Combined with the next version of this, it will allow us to assign a prompt, hypernetworks and Multi-ControlNets to each skeleton and/or "control mesh" and the background.
    And once text-to-3D, AI animation and image-to-3D are also inevitably implemented as Blender add-ons, the fusion of the 2D and 3D workflows will be complete.
    And with it, the full democratization of animation.
    It will be glorious, and at the rate we are going, it will be here sooner than we realize.

    • @NerdyRodent
      @NerdyRodent 1 year ago +4

      The official Stable Diffusion Blender add-on is out, in case you haven’t seen it 😉

    • @vi6ddarkking
      @vi6ddarkking 1 year ago

      @@NerdyRodent I know, my introduction to AI art was Dream Textures.

  • @tnkrtrll
    @tnkrtrll 1 year ago +9

    Thanks for sharing and explaining. SD (with the help of automatic1111) is such a wondrous toolbox. Hard to keep up with all the new stuff popping up.

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      Glad it was helpful!

  • @goatnamese
    @goatnamese 1 year ago +5

    This is perfect for parents who have young kids who refuse to stand still for portraits.

  • @jurandfantom
    @jurandfantom 1 year ago +3

    Next to cover - LoRA weight extension? Thank you for taking these extensions and their mechanics apart and making videos that explain what, where and how.

    • @juandiegozuniga2638
      @juandiegozuniga2638 1 year ago

      You can lower the LoRA weight at the end of the LoRA tag. Normally it is :1, but you can change that to :0.55 for 55%.
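
The weight change described above can be sketched in a few lines. This is only an illustration, assuming the usual Automatic1111 prompt tag form `<lora:name:weight>`; `lora_tag` is a hypothetical helper and `exampleStyle` is a made-up LoRA name.

```python
def lora_tag(name: str, weight: float = 1.0) -> str:
    """Build a LoRA prompt tag such as <lora:exampleStyle:0.55>."""
    # The :g format drops trailing zeros, so 1.0 renders as "1".
    return f"<lora:{name}:{weight:g}>"


# Hypothetical LoRA name, applied at 55% strength.
prompt = f"portrait of a knight, {lora_tag('exampleStyle', 0.55)}"
print(prompt)
```

Leaving the weight at its default keeps the LoRA at full strength; lowering it towards 0 reduces its influence on the image.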

  • @operationancut
    @operationancut 1 year ago +5

    I wonder if it is possible to make two subjects interact with each other, like a handshake or a hug?

  • @madrooky1398
    @madrooky1398 1 year ago +3

    The interesting part is, the language model used in SD is in theory easily capable of allowing more control. But the models are not trained for that purpose, yet.

    • @gorkskoal9315
      @gorkskoal9315 1 year ago

      lol most of them are chasing hentai tentacles as far as I can tell.

  • @DajaMythBusters
    @DajaMythBusters 1 year ago +2

    Thank you for taking the time to share your knowledge, another excellent video.👍

  • @Mimeniia
    @Mimeniia 1 year ago +2

    Multiple characters in an image was the one thing that pushed me to put my fist through the monitor. 😄

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      And now it’s super easy! 😉

  • @Some1uNo
    @Some1uNo 1 year ago +5

    Hold on to your Pap😄

  • @banzai316
    @banzai316 1 year ago +3

    Cool, definitely very useful with ControlNet!

  • @protomato6427
    @protomato6427 1 year ago +2

    Tortoise TTS got a bunch of updates in the past month - faster inference, finetuning. Hope you will consider making an update video on it.

    • @banzai316
      @banzai316 1 year ago

      Cool, are you on Windows or Linux?

  • @5150hammernuts
    @5150hammernuts 1 year ago +2

    Fantastic vid mate. I was JUST asking about how to do this technique on reddit. Any chance of getting more info or the text doc for the different ratios for the regions? Love the AI video in the corner too.

    • @NerdyRodent
      @NerdyRodent 1 year ago +2

      It’s just x,y co-ordinates 😉
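
The region settings can be read as grid cells. Here is a minimal sketch, assuming each division is written as rows:columns and each position as row:column, both zero-based. The example strings are assumed to match the extension's default settings, and `region_boxes` is a hypothetical helper, not part of the extension.

```python
def region_boxes(divisions: str, positions: str, width: int, height: int):
    """Return one (left, top, right, bottom) pixel box per region."""
    boxes = []
    for div, pos in zip(divisions.split(","), positions.split(",")):
        rows, cols = (int(v) for v in div.split(":"))  # how the canvas is split
        row, col = (int(v) for v in pos.split(":"))    # which cell of that split
        cell_w, cell_h = width // cols, height // rows
        boxes.append((col * cell_w, row * cell_h,
                      (col + 1) * cell_w, (row + 1) * cell_h))
    return boxes


# Assumed defaults: a full-canvas background, then a left half and a right half.
boxes = region_boxes("1:1,1:2,1:2", "0:0,0:0,0:1", 512, 512)
for box in boxes:
    print(box)
```

With these values the first region covers the whole image and the other two split it into left and right halves, which is the layout used for a two-subject picture.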

  • @contrarian8870
    @contrarian8870 1 year ago +7

    @00:39 So we're just gonna ignore a random bearded head hiding in the fur between the two Washington-Lincolns?

  • @Vacancy23
    @Vacancy23 28 days ago

    So, do we need ControlNet to run Latent Couple and the others? All the videos I see are using it.

  • @Trinketorium
    @Trinketorium 10 months ago

    I installed and copied your settings and enabled it, and it just did the same as normal and merged the prompts into one. I set it for a bird in the top and a cat in the bottom and I got a cat bird

  • @goldenspirit3369
    @goldenspirit3369 1 year ago +2

    HOLDING ON TO MY PAPERS!! SAY IT MAN JUST SAY IT!!!

  • @andresz1606
    @andresz1606 1 year ago +1

    Do you have a video on how to do the exact same thing on ComfyUI?

  • @IntiArtDesigns
    @IntiArtDesigns 1 year ago +2

    I can't get this to work at all. It's only generating a single character, and I don't understand what I'm missing. I'm using the same settings as the video. I've restarted and updated the webui, updated all my extensions, tried different models and different samplers, and made sure Latent Couple is enabled. Out of 40 generations, not a single one has worked; I get one character or some weird merge of the two.

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      There are two steps to it -
      1. Define your areas in the latent couple box and enable
      2. Define your area sub-prompts, separated by AND
      If you only have one character, like I show at 4:38, then you might have forgotten step 2.
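
The two steps above can be sketched as follows. This is only an illustration, assuming one sub-prompt per defined area; `couple_prompt` is a hypothetical helper and the example prompts are made up.

```python
def couple_prompt(sub_prompts, region_count):
    """Join one sub-prompt per area with AND (step 2 above)."""
    cleaned = [p.strip() for p in sub_prompts if p.strip()]
    # Step 1 defines the areas; each one needs its own sub-prompt.
    if len(cleaned) != region_count:
        raise ValueError(
            f"expected {region_count} sub-prompts, got {len(cleaned)}"
        )
    return "\nAND ".join(cleaned)


prompt = couple_prompt(
    [
        "a park, sunny day",
        "a park, a cat sitting on the grass",
        "a park, a dog sitting on the grass",
    ],
    region_count=3,
)
print(prompt)
```

A prompt with no AND in it gives the helper a single sub-prompt, so it raises an error, which mirrors the "only one character" case mentioned above.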

    • @IntiArtDesigns
      @IntiArtDesigns 1 year ago

      @@NerdyRodent Check and check. Done all that. Not working for me. I don't get it.

    • @NerdyRodent
      @NerdyRodent 1 year ago

      @@IntiArtDesigns The only other thing I can think of is having too few steps, so probably worth raising an issue in the GitHub with screenshots to show.

    • @IntiArtDesigns
      @IntiArtDesigns 1 year ago

      @@NerdyRodent Ok, thanks. I've seen others raising the same issue i have, so hopefully, whatever it is, will get fixed soon. This looks like so much fun, i can't wait to use it.

  • @Dannysingh-r7b
    @Dannysingh-r7b 1 year ago

    Using chair and sofa images, I have finetuned a stable diffusion model. However, the model does not generate a living room using the chair and sofa.
    Please help me !!!!

  • @kariannecrysler640
    @kariannecrysler640 1 year ago +2

    Hehehe 🤭
    Tickled by whiskers 🐭

  • @slashkeyAI
    @slashkeyAI 1 year ago

    What a time to be alive!
    You know why he is always holding onto his papers? Because he craps his pants in surprise too often :)

  • @panpdx8919
    @panpdx8919 1 year ago +1

    OOhh! would you be into sharing your parameters txt file cut and pastes for areas?

  • @DeBeau
    @DeBeau 23 days ago

    It seems the user interfaces for both Composable LoRA and Latent Couple have changed completely since this video was uploaded. Can anybody provide guides, or links to guides, for the current (as of Sept. 23, '24) versions of those two plugins for Stable Diffusion (Forge)?
    I am at a complete loss for similar solutions.

  • @tobinrysenga1894
    @tobinrysenga1894 10 months ago

    I tried this again today and still just get nightmare fuel (TM). I see there is regional prompter now but I get equally bad results. I even tried the dog/cat you demo'd but I get a dog/cat asking to be put down :(

  • @lioncrud9096
    @lioncrud9096 1 year ago

    How do you stop it from mashing two subjects together? I don't want the half faces (one half one person, the other half another person). I want two entirely separate characters.

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      I use this latent couple extension!

  • @jason-sk9oi
    @jason-sk9oi 1 year ago +2

    What is this software? Is it possible to run online and then access with my Windows 8 PC? Are there any setup guides or videos on how to do this? How much does this all cost? I'm an old-school professional Graphic Designer since 1998 - meaning I'm excited to start becoming a master of this amazing AI toolset. Thank you in advance for any help!!

    • @NerdyRodent
      @NerdyRodent 1 year ago +2

      This is Stable Diffusion (links in the description). It can run on Microsoft Windows, but it’s best to start out with something like the Google Colab because, unless you’re a nerd, you probably won’t have a computer capable of running it. There you get a free Linux system with a GPU that you can use via your browser - github.com/camenduru/controlnet-colab

  • @tiagotiagot
    @tiagotiagot 1 year ago +2

    Interesting, looks like 1.5 handles "photo of a dog and a cat" better than 2.1 ...

  • @rodrigosouza8471
    @rodrigosouza8471 9 months ago

    this extension has been updated and it works with different words now, new video?

    • @NerdyRodent
      @NerdyRodent 9 months ago +1

      It hasn’t been updated for over a year. It will still work exactly the same with old versions of automatic, though may have issues with newer ones

  • @digitalflick
    @digitalflick 1 year ago +1

    i get an error when clicking "visualize" under the rectangular tab. Working for anybody else?

    • @mockingbird1227
      @mockingbird1227 1 year ago

      Nope, I also get an error. Does anyone know how to fix it?

  • @mdlieber99
    @mdlieber99 1 year ago

    I'm using regional prompter to do roughly the same thing as latent couple. My question now is how do you generate two particular people for the image. Like I have created a LoRA for my face and a LoRA for my wife's face. If I put each LoRA in a separate prompting area with a prompt for a man in one and a prompt for a woman in the other, it just gives me a man and a woman whose faces are a blend of the two LoRA's. Any suggestion on how to resolve this?

    • @NerdyRodent
      @NerdyRodent 1 year ago

      As with the cat and dog examples, basically!

  • @EdgardMello
    @EdgardMello 1 year ago +1

    Cool tech!

  •  1 year ago

    Great! Very useful! Unfortunately, I speak bad English, maybe that's why I missed it: do I always have to enter the numbers manually in the "Extra generation params"? Isn't there a formula, you just have to guess? Or copy out the values that are wrongly shown in the video?

    • @NerdyRodent
      @NerdyRodent 1 year ago +2

      You can use the defaults for where the areas are going to be, or you can choose your own 😉

  • @Bookedtuyo
    @Bookedtuyo 1 year ago +1

    Great video! But I have a problem: the Latent Couple section does not appear for me 🤧

    • @NerdyRodent
      @NerdyRodent 1 year ago +2

      Remember to both install and restart the UI 😉

    • @mbc93-dg8cx
      @mbc93-dg8cx 1 year ago

      Hi. I had the same issue. I updated the webUI to the latest version and that fixed it for me.

  • @patakanz
    @patakanz 1 year ago +1

    Is there a way to use this when generating images from text? The way it's been shown here I don't see much advantage over using img2img and inpainting over one of the subjects with a high denoise value to get it to draw a different subject.

    • @NerdyRodent
      @NerdyRodent 1 year ago +3

      This is generating images from text 😉

  • @arunuday8814
    @arunuday8814 1 year ago

    Great video, thx a ton! One question:
    Instead of different LoRA models, can we compose using different Dreambooth models?

    • @NerdyRodent
      @NerdyRodent 1 year ago +2

      Not using this, no

    • @arunuday8814
      @arunuday8814 1 year ago

      @@NerdyRodent Is there any other way of accomplishing that? Thx

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      @@arunuday8814 Not that I’m aware of, no

    • @arunuday8814
      @arunuday8814 1 year ago

      @@NerdyRodent Thx, appreciate your help!

    • @NerdyRodent
      @NerdyRodent 1 year ago

      @@arunuday8814 you’re welcome!

  • @___x__x_r___xa__x_____f______
    @___x__x_r___xa__x_____f______ 1 year ago

    this extension seems broken. are there any alternatives you are aware of to compose regionally?

    • @NerdyRodent
      @NerdyRodent 1 year ago +2

      Thankfully it isn’t broken, but yes, there are a few versions of this

    • @___x__x_r___xa__x_____f______
      @___x__x_r___xa__x_____f______ 1 year ago

      @@NerdyRodent sadly does not work with latest gradio. Could you suggest alternative? thanks

    • @___x__x_r___xa__x_____f______
      @___x__x_r___xa__x_____f______ 1 year ago

      @@NerdyRodent Also, I use the webui auto launcher, which doesn't allow patching the SAG unofficial extension. Stuck here.

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      @@___x__x_r___xa__x_____f______ just downgrade to gradio 3.28.1

    • @___x__x_r___xa__x_____f______
      @___x__x_r___xa__x_____f______ 1 year ago

      @@NerdyRodent I am on 3.28.1

  • @freemoney8844
    @freemoney8844 1 year ago

    how to install "Fork with masks" on windows10 - SD AUTOMATIC1111 ???

  • @SweetieNerdygirl
    @SweetieNerdygirl 1 year ago

    Good video! But I guess this can be used only for using the ai to generate two subjects? What if we want to train two people by using their photos, how could we make it happen ? Thank you very much !

    • @NerdyRodent
      @NerdyRodent 1 year ago

      It’s for multiple subjects - your choice 😉

    • @SweetieNerdygirl
      @SweetieNerdygirl 1 year ago

      @@NerdyRodent So does it mean I need to train two subjects on SD+ dreambooth first and then generate a new image by using the latent couple extension? I have tried to train two men at once but the sd+dreambooth ended up mixing them and generated two identical ones instead of two different persons. Any idea maybe ? Thank you very much !!!

    • @NerdyRodent
      @NerdyRodent 1 year ago

      @@SweetieNerdygirl no, no training required. You can just use it straight away on any model exactly as shown 👍

  • @darmok072
    @darmok072 1 year ago

    Think it would be easier just to use Blender or equivalent to generate guide images for the algorithm rather than using control net or extensions like this. A little steeper learning curve but much better potential results.

    • @NerdyRodent
      @NerdyRodent 1 year ago

      Give it a go and let us know!

  • @Tsero0v0
    @Tsero0v0 1 year ago

    Why do your LoRAs have icons on them?

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      I just like seeing previews. You don’t have to have them, it’s simply a personal choice 😀

  • @flonixcorn
    @flonixcorn 1 year ago +1

    Very nice

  • @ItsmeCoringa
    @ItsmeCoringa 1 year ago

    Man, I installed Latent Couple and the box to use it still doesn't appear. What can it be?

    • @ItsmeCoringa
      @ItsmeCoringa 1 year ago

      The cmd log says: cannot import name 'CFGDenoisedParams' from 'modules.script_callbacks'

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      Sure you’re running the latest a1111?

    • @gianluca3131
      @gianluca3131 1 year ago

      I fixed it by updating to the latest version of Automatic1111

  • @androidgamerxc
    @androidgamerxc 1 year ago

    I don't have the control model showing up in my UI

  • @timmi3701
    @timmi3701 1 year ago +2

    I really gotta update my 1060 6gb soon....

  • @pn4960
    @pn4960 1 year ago

    How do you put an image preview on the LoRA files ?

    • @TheSchwarzKater
      @TheSchwarzKater 1 year ago

      Civitai Helper extension, will download images, prompts and even allows you to set your own thumbnail.

  • @the_RCB_films
    @the_RCB_films 1 year ago

    soooo why doesn't it show up after I installed???? this is literally driving me bonkers...

    • @the_RCB_films
      @the_RCB_films 1 year ago

      nvm i finally got it

    • @AlexGNewMediaJournalism
      @AlexGNewMediaJournalism 1 year ago

      @@the_RCB_films how?

    • @AlexGNewMediaJournalism
      @AlexGNewMediaJournalism 1 year ago

      @@the_RCB_films how did you resolve it not showing up?

    • @the_RCB_films
      @the_RCB_films 1 year ago

      @@AlexGNewMediaJournalism just update the webui, if that doesn't work install a new copy of it and just copy the parts you need.

  • @genAIration
    @genAIration 1 year ago

    Anyone able to use with additional-network-extension?

  • @smokeacoil
    @smokeacoil 1 year ago

    I can't find the Latent Couple script

    • @NerdyRodent
      @NerdyRodent 1 year ago

      github.com/opparco/stable-diffusion-webui-two-shot

  • @viper5955
    @viper5955 4 months ago

    I can't get this to work

    • @NerdyRodent
      @NerdyRodent 4 months ago

      Unfortunately A1111 is pretty much unused at this point. If you're lucky, the update may work - sd-webui-regional-prompter - but most stuff is done in ComfyUI now

    • @viper5955
      @viper5955 4 months ago

      @@NerdyRodent I need to learn ComfyUI

  • @fjccommish
    @fjccommish 1 year ago

    Two months later, and it still can't get hands right.

  • @JohnVanderbeck
    @JohnVanderbeck 1 year ago +1

    bonus points for Vox Machina even if you did mispronounce it :D

  • @giovanith
    @giovanith 1 year ago

    hello, thanks but this explanation was not enough .... Not running here

    • @NerdyRodent
      @NerdyRodent 1 year ago

      Remember to restart after clicking install!

  • @xmattar
    @xmattar 1 year ago +1

    I tried adding Freddy fazbear to Roblox using ai
    SOMEHOW
    I got nsfw

  • @baxter987
    @baxter987 1 year ago +1

    It's over

  • @simpleandfrank
    @simpleandfrank 1 year ago

    There must be another way, using only the prompt...

    • @NerdyRodent
      @NerdyRodent 1 year ago +1

      Do let us know if you find it!

  • @synthoelectro
    @synthoelectro 1 year ago

    act now! only 19.95, that's 19.95, for all this, void where prohibited.

    • @NerdyRodent
      @NerdyRodent 1 year ago +4

      It’s all free and open source, of course 😉

    • @synthoelectro
      @synthoelectro 1 year ago

      @@NerdyRodent I know :D

  • @popwwrestling3940
    @popwwrestling3940 1 month ago

    Latent Couple is not listed under Available. Would you know why?

  • @Strife3dx
    @Strife3dx 1 year ago +1

    This should help with porn scenes for sure