Stable Diffusion 3 API Released.

  • Published on 1 Jun 2024
  • stability.ai/news/stable-diff...
    x.com/StabilityAI/status/1780...
    Prompt styles for Stable diffusion Automatic1111, Forge, ComfyUI & Vlad/SD.Next: / sebs-hilis-79649068
    Get early access to videos and help me, support me on Patreon / sebastiankamph
    Chat with me in our community discord: / discord
    Stable Diffusion for Beginners Playlist • Stable Diffusion Begin...
    My Weekly AI Art Challenges • Let's AI Paint - Weekl...
    My Stable diffusion workflow to Perfect Images • Revealing my Workflow ...
    ControlNet tutorial and install guide • NEW ControlNet for Sta...
    Famous Scenes Remade by ControlNet AI • Famous Scenes Remade b...
  • Howto & Style

Comments • 137

  • @malch2843
    @malch2843 several months ago +98

    The music is very distracting, maybe have the music volume lower next time.

    • @rifz42
      @rifz42 several months ago +16

      or just don't add music : ) thanks!

    • @sebastiankamph
      @sebastiankamph several months ago +19

      Thank you for the feedback!

    • @Tymon0000
      @Tymon0000 several months ago +11

      @sebastiankamph Please don't add music with lyrics when you are talking. Thanks.

    • @rensxx
      @rensxx several months ago +3

      I am a native Spanish speaker, and I had to go back to the video when reading the comment to check how loud the music volume was because I didn't even realize there was music hahaha. Maybe it's just each person's experience. Great content as always! Looking forward to the launch of the weights. Great content as always Sebastian! Cheers from Uruguay!

    • @slalomsteve
      @slalomsteve several months ago +4

      Agreed. Music is added by default to lots of things now for no reason whatsoever. Even my local radio news has a constant drumbeat in the background and it's annoying to the point I can't listen to it any more. What most people fail to realise is that it reduces accessibility. Background music causes havoc for people who are hard of hearing and who need hearing aids. The devices often amplify the wrong things so the voice gets drowned out completely.

  • @Aitrepreneur
    @Aitrepreneur several months ago +68

    A few clarifications:
    This is NOT the "real" SD3 model; the API one is a much older model that is not gonna be representative of the final model because... well... the SD3 model is STILL in training and will be released when it's ready. The API one was probably released because of a contract with Fireworks AI, who made the workflow for that version of the model.
    So YES, for those asking: the SD3 model will be free and open-source, it will be much better than what you see here, and it will be released to the public when it's ready, so be patient, y'all.

    • @Oxes
      @Oxes several months ago

      Thanks

    • @sebastiankamph
      @sebastiankamph several months ago +11

      While it is correct that it is not the final form of the SD3 model (which was addressed in the video and in Stability's news post), it is in fact very real and not "much older". There are different internal versions of SD3 currently as the training progresses.
      You are also right that the final version will be free and open-source. With free comes licensing limitations however.
      Source: Stability AI

    • @vi6ddarkking
      @vi6ddarkking several months ago +7

      @sebastiankamph Sure, licensing limitations that will be vigorously ignored by the vast majority of the community.
      Besides, no one really wants SD3.
      We all want the fine-tunes and LoRAs based on SD3.

    • @DaniDani-zb4wd
      @DaniDani-zb4wd several months ago +1

      @vi6ddarkking Straight to the point. I really wonder how hard it's gonna be for developers to fine-tune this model or to create LoRAs. This is why it took so long for SDXL to get "good". Many people still use 1.5 simply because they don't wanna give up on all the LoRAs... and even now I feel like there are more LoRAs released for SD 1.5 than for SDXL due to training issues.

  • @Arewethereyet69
    @Arewethereyet69 several months ago +90

    get rid of the background music

    • @Omsip123
      @Omsip123 several months ago +18

      Look up the word “please”… please

    • @ThoughtsFew
      @ThoughtsFew several months ago +2

      Nah, it's insane

  • @scotadam
    @scotadam several months ago +42

    The problem is not the music but the fact that there are lyrics.

    • @sebastiankamph
      @sebastiankamph several months ago +9

      Good feedback. Was testing an AI-generated song instead of the usual background music.

    • @scotadam
      @scotadam several months ago

      @sebastiankamph I watched that video. I will have to try that program. The music is fun. I am hoping Bandlab will eventually upgrade its AI music features.

  • @obscuremusictabs5927
    @obscuremusictabs5927 several months ago +35

    Please no music. It sounds like another tab is open.

  • @20xd6
    @20xd6 several months ago +7

    Gunna be hard to get me off 1.5 with my 50 extensions, 100 trained models, and 3000 Loras.

  • @OmriSadeh
    @OmriSadeh several months ago +10

    Was hoping you would show mainly images you created on sd3, especially if you’ve had prior access

    • @sebastiankamph
      @sebastiankamph several months ago +8

      They didn't really want us showing those, as they improved on the model before releasing it publicly ;(

  • @ADELTUF
    @ADELTUF several months ago

    Do you have a tutorial on how to train Stable Diffusion to generate videos similar to the one you give it as a source? TY

  • @vi6ddarkking
    @vi6ddarkking several months ago +2

    I am honestly salivating for the next few months.
    Once the community has had the time to fine-tune SD3 and develop the best practices to train the new models and LoRAs,
    things are about to get really fun.

  • @arothmanmusic
    @arothmanmusic several months ago

    Now that we appear to have functional text generation, I'm curious about the implications of copyright for the fonts in the training data. AI companies are already being sued by creators of text and images used in the training data… are foundries the next to jump into the fray?

  • @TheBurningBuffalo
    @TheBurningBuffalo several months ago

    In the sofa picture one of the dots disappears from 2:00 to 5:15. I wonder how good the text really is, how often they tweaked the pictures before releasing them.

  • @heitorb2460
    @heitorb2460 several months ago

    When they do the open release, will it be uncensored? I’ve just tried and for example “woman in bikini” fails because of content moderation

  • @mr_pip_
    @mr_pip_ several months ago +1

    In fact, apart from a further advance announcement, of which there have already been several, there is still nothing.
    I'm curious to see when the models will finally come out for download so that you can really see what you can do with them. Until then, I find other developments more exciting at the moment.

  • @KodandocomFaria
    @KodandocomFaria several months ago

    Why don't we have a variety of open-source models like we do with LLMs? For instance, there are many architectures derived from transformers, like Mistral, Llama... But for Stable Diffusion there are a lot of fine-tuned models but no new architectures. Do you know of any other kind of architecture used to generate high-quality images like Stable Diffusion?

  • @Panda-ik4uk
    @Panda-ik4uk several months ago +2

    I have been enjoying SD2 w/ a1111. Will something like that ever be created for SD3 so I can run it locally, for free, as much as I want?

    • @hakuhyo174
      @hakuhyo174 several months ago +1

      ComfyUI. The way it's designed, it should work out of the box (or with a minimal update) for SD3.0

    • @sebastiankamph
      @sebastiankamph several months ago +4

      SD3 will be available for all user interfaces as soon as the weights are released. Currently it's API-only.

    • @Panda-ik4uk
      @Panda-ik4uk several months ago +1

      @sebastiankamph Thank God. I appreciate the positive news!

  • @ThoughtFission
    @ThoughtFission several months ago

    So how do you use it?

  • @mootzartdev
    @mootzartdev several months ago

    Is Automatic1111 still the thing at this stage? Or is it time for me to move on, do you think? I use all kinds of plugins etc.

    • @sebastiankamph
      @sebastiankamph several months ago

      I still use a1111 primarily. Sometimes I use Comfy, sometimes Fooocus, sometimes Forge.

    • @mootzartdev
      @mootzartdev 25 days ago

      @sebastiankamph Ahh ok thank you. Have you heard word of a model being around soon?

  • @hakuhyo174
    @hakuhyo174 several months ago

    ELLA did such a great job in prompt comprehension that it's difficult to see what SD3.0 is adding, if the quality of the examples is anything to go by.

  • @nicolas.c
    @nicolas.c several months ago

    haha great info, and the joke in the middle made my day!👏

  • @yanus_ai
    @yanus_ai several months ago

    hi there, is there a way of booking a call with you for consultation?

    • @sebastiankamph
      @sebastiankamph several months ago

      Yes, pm me on Discord.

  • @silentrobcanada
    @silentrobcanada 5 days ago

    I'm a little worried about Stability AI's future with the round of layoffs and CEO departure. I hope whoever acquires them continues to keep the open source ethos.

  • @YVZSTUDIOS
    @YVZSTUDIOS several months ago

    Interesting. The first time I watched this video the music wasn't distracting to me at all. I didn't even notice it that much, but I liked that there was something in the background to listen to.

    • @sebastiankamph
      @sebastiankamph several months ago

      Different strokes for different folks I guess. Thanks for the feedback :)

  • @AI_EmeraldApple
    @AI_EmeraldApple several months ago

    I don't like that SD3 currently looks bad compared to Lykon's examples on his Twitter page. I think it was a bad move to release a half-baked workflow version of SD3 that doesn't meet the aesthetic quality of MJ6. Looks like I'll be sticking with SD1.5 models for a while longer

  • @Suketh
    @Suketh several months ago

    "Thank you S. for a great video... It's great that you got to try SD3. When it comes to pricing for commercial use, which payment model are they talking about then, and how much? It would also be good to get a simplified explanation of what type of product use they envision is acceptable. Obviously, many have used SD because that model has been free and maintained a relatively good standard regardless of blood, boobs, and other personal nuances, which as you know are hard to even come close to with models like MJ."

    • @sebastiankamph
      @sebastiankamph several months ago +1

      Thank you! You can read more about that here: stability.ai/membership#select_membership

  • @artist.zahmed
    @artist.zahmed several months ago +7

    Will it run locally or not?

    • @LuckyPed10
      @LuckyPed10 several months ago +6

      Yes, in a few weeks or so hopefully. Free for personal use, not commercial tho.

    • @rhym8882
      @rhym8882 several months ago +1

      @LuckyPed10 Where did you get this info?

    • @oraz.
      @oraz. 26 days ago

      I think it either won't be, or they are waiting to bake censorship into the weights before releasing. The politics are different in the company now that Emad resigned.

  • @francaleu7777
    @francaleu7777 several months ago +1

    Do you have any idea how to use it? It looks complicated, I don't understand anything 😅

    • @dkemil
      @dkemil several months ago +1

      Wait for someone to implement it on their website so you don't have to use the API yourself.

  • @titankronos6517
    @titankronos6517 several months ago

    What's the point of SD3 if it is as censored as Midjourney and DALL-E 3? At least MJ and DALL-E 3 have better image quality than SD3. I hope that a less censored version of SD3 will be available in the future.

  • @Deadgray
    @Deadgray several months ago +2

    So... you said that you have had access to SD3 for a few weeks and all you show are images I can find on the net myself. Clickbait?

    • @sebastiankamph
      @sebastiankamph several months ago +3

      They wouldn't let us show images from the closed testing. And if I did, I wouldn't get invited to closed tests like that again.

    • @Deadgray
      @Deadgray several months ago +1

      @sebastiankamph So my apologies and thanks for the quick reply. This explains everything.

  • @teambellavsteamalice
    @teambellavsteamalice several months ago +1

    I don't like the focus on the simple, instant result. While nice and impressive to the majority of people, prompt-to-image is only the first step imo.
    The options to fix and improve upon images, things like ControlNet and ComfyUI, that is where the magic happens!

  • @AIFuzz59
    @AIFuzz59 several months ago

    I think text implementation will be better overall. The initial base model will always be the “start” and people will often overreact at the quality. As time goes on and with improvements and fine tuning, the forthcoming forms of SD3 will be better.

    • @sebastiankamph
      @sebastiankamph several months ago +1

      Yes! 100% agree. I'm sure the improvements they make in the coming weeks will get it even further, and then the custom trained finetunes will take it all the way.

  • @TR-707
    @TR-707 several months ago +1

    They are not gonna paywall everything, are they?

  • @HistoryIsAbsurd
    @HistoryIsAbsurd several months ago

    Music too loud, but ty for the vid.
    Would've been nice to see more examples & how we can use it. Also, it's good to mention that like half the leadership of Stability AI left during the last month or so due to them not actually being open.
    It's semi open-sourced... not fully.

  • @Onsearching
    @Onsearching several months ago

    I have tested SD3 and I am disappointed; prompt coherence is not even close to DALL-E or Ideogram... very sad...

  • @MilesBellas
    @MilesBellas several months ago

    SD could add an option for HUMAN FEEDBACK to continually improve, as with MJ ?

    • @sebastiankamph
      @sebastiankamph several months ago +1

      It is a possibility for sure. It will also skew results towards what people "like" instead of what might actually be correct. MJ had that problem a very long time, everything was just looking beautiful and artsy, for a time it was almost impossible to achieve simple realism.

  • @juanjesusligero391
    @juanjesusligero391 several months ago +1

    5:20 Your dad jokes give me life XD

  • @Thedeepseanomad
    @Thedeepseanomad several months ago

    Stability: we MUST stop smut at all costs!

  • @juraganposter
    @juraganposter several months ago

    the best thing is: uncensored

  • @vladiyudi5112
    @vladiyudi5112 several months ago

    Emad says SD3 can generate video as good as Sora. Did anyone try generating videos?

  • @MaisnerProductions
    @MaisnerProductions several months ago

  • @espen990
    @espen990 several months ago

    "this is what turtles, uh, would've looked like if, uh, was kinda, half, semi, real"
    Turtles are the new pigeons?

  • @taiconan8857
    @taiconan8857 several months ago

    The music was nice IMO, particularly when there weren't singers though. It's the additional "talking" I think that makes it particularly problematic, (plus it's in a similar register for a double whammy) that makes it tricky to understand/catch your voice alongside it. Do the Spanish uphold their ideals? Si'Bastion!

    • @sebastiankamph
      @sebastiankamph several months ago +1

      Thanks for the feedback. I don't know what bastion means, but it sounds like a great dad joke

    • @taiconan8857
      @taiconan8857 several months ago

      @sebastiankamph It's English for a kind of 'last stand' 😉
      Bastion: an institution, place, or person strongly defending or upholding particular principles, attitudes, or activities.

  • @somedude5951
    @somedude5951 several months ago

    I preferred Stable Diffusion 1 over Stable Diffusion 2. In part because of bikinis in Rembrandt style, but also because it had more freedom in creativity. Stable Cascade could not do artist styles any more. Reading this "Bad Actors" text here, I expect this one will be even worse, although maybe it can draw hands and text 😢

  • @gdizzzl
    @gdizzzl several months ago +1

    I just wanna tell everybody in the comments that we have enough anime porn to last us a lifetime so if you guys wanna focus on some other type of art, that’d be great

  • @supercurioTube
    @supercurioTube several months ago +3

    I couldn't watch the whole video because of the volume of the background music with lyrics. Too fatiguing.

    • @sebastiankamph
      @sebastiankamph several months ago +2

      Thank you for the feedback.

  • @TheBagOfHolding
    @TheBagOfHolding several months ago

    Why and how is all this free?

  • @SonnyBurnett2012
    @SonnyBurnett2012 several months ago

    Still free or not?

  • @handsomejack672
    @handsomejack672 several months ago

    please cover Hyper SD

  • @MilesBellas
    @MilesBellas several months ago

    What are the technical differences between SD3, SD3 Turbo and Cascade?
    Interesting video topic ?

    • @MilesBellas
      @MilesBellas several months ago

      via Pi
      .
      Great question! Stable Diffusion 3 and Stable Cascade are two distinct models developed by Stability AI, and they differ in their architecture and capabilities.
      * **Stable Diffusion 3:** This model uses a spatial compression factor of 8, encoding a 1024 x 1024 image into a 128 x 128 representation. This enables efficient processing of high-resolution images.
      * **Stable Cascade:** This model employs a unique, three-stage architecture, achieving a much higher compression factor of 42. Stage C transforms user inputs into compact 24x24 latents, while Stages A and B act as a Latent Decoder, similar to the role of a VAE in Stable Diffusion. This architecture allows for additional training and finetuning on Stage C, including ControlNets and LoRAs.
      In summary, the main difference between Stable Diffusion 3 and Stable Cascade lies in their architectures and compression capabilities, with Stable Cascade offering a more efficient compression factor for handling high-resolution images.
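
For anyone who wants to sanity-check the numbers quoted in the reply above, here is a minimal sketch of the arithmetic. It only assumes what the reply states (a spatial compression factor of 8 versus 42 on a 1024 x 1024 image); the helper name `latent_side` is made up for illustration and is not part of any Stability AI library.

```python
def latent_side(image_side: int, compression_factor: int) -> int:
    """Side length of the latent grid for a square image.

    Hypothetical helper: integer-divides the image side by the spatial
    compression factor quoted in the comment above.
    """
    return image_side // compression_factor


image_side = 1024

# Factor 8 (the figure quoted for Stable Diffusion 3).
sd3_side = latent_side(image_side, 8)

# Factor 42 (the figure quoted for Stable Cascade's Stage C).
cascade_side = latent_side(image_side, 42)

print(f"factor 8:  {sd3_side} x {sd3_side} = {sd3_side ** 2} latent positions")
print(f"factor 42: {cascade_side} x {cascade_side} = {cascade_side ** 2} latent positions")
```

With these inputs the factor-8 grid is 128 x 128 and the factor-42 grid rounds down to 24 x 24, which matches the sizes given in the reply.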

  • @TheBagOfHolding
    @TheBagOfHolding several months ago

    The music didn't bother me.

    • @sebastiankamph
      @sebastiankamph several months ago

      Thanks for the feedback :)

  • @tabs1913
    @tabs1913 several months ago

    No one tell him that turtles are real.

  • @11305205219
    @11305205219 several months ago +1

    *Maybe it will become open source in future*

    • @sebastiankamph
      @sebastiankamph several months ago +1

      Yes, you will be able to download the models (weights).

    • @deadlyrobot5179
      @deadlyrobot5179 several months ago +4

      If it doesn't, it belongs in the trash.

    • @patnor7354
      @patnor7354 several months ago

      Good joke

  • @mufeedco
    @mufeedco several months ago

    The background music is very loud and distracting.

  • @quaterman1270
    @quaterman1270 several months ago

    I just hope they stay open source. That would be a real downfall if this goes closed source and censored.

    • @sebastiankamph
      @sebastiankamph several months ago

      As of right now, their plans are still to keep it open source.

  • @jodus
    @jodus several months ago

    I hope my 6 GB card can somehow run it, just to try it once.

  • @peterpui7219
    @peterpui7219 several months ago +2

    An SD3 node for ComfyUI just became available today

    • @RonnieMirands
      @RonnieMirands several months ago

      Is that serious?

  • @no-handles
    @no-handles several months ago

    donatello

    • @sebastiankamph
      @sebastiankamph several months ago

      He was my favourite! Which one was yours?

    • @no-handles
      @no-handles several months ago

      @sebastiankamph Don and Leo for sure!

    • @michaelleue7594
      @michaelleue7594 several months ago

      @sebastiankamph Michelangelo had the best sense of humor and was the least burdened by pointless stuff. Also nunchucks are cooler than sticks, sharp sticks, or pointy sticks.

  • @RikkTheGaijin
    @RikkTheGaijin several months ago +3

    Porn. That's the main difference. SD can do Porn. The other closed source models cannot.

    • @ADMNtek
      @ADMNtek several months ago

      Correct, the power of boners is stronger. And if V3 can't be used for adult content, adoption will be low.

  • @user-in1mg9id2u
    @user-in1mg9id2u several months ago

    So there is a high possibility of it being "open source" as it was. I thought they were now going to stop being open and start getting paid for their models.

    • @sebastiankamph
      @sebastiankamph several months ago +3

      It will be open source and available to download. They will have a pricing model for licensing.

    • @FlexibleToast
      @FlexibleToast several months ago

      Open source doesn't mean you can't make money. Red Hat, SUSE, Canonical all exist as companies that make money.

  • @aaronhkg
    @aaronhkg 28 days ago

    The bg music is too loud... either you speak louder or just remove it totally.

  • @svenhinrichs4072
    @svenhinrichs4072 several months ago

    So sad the dream of the community based models comes to a quick end... money making $$$

  • @TheCynicalNihilist
    @TheCynicalNihilist several months ago +2

    At this point I think it's best, for professionals, to use Midjourney, because its detail and prompt adherence look unreachable anytime soon by any other source, BUT to use SD for inpainting what you can't get in MJ.
    Sucks, I wish SD in Automatic1111 could get to that level.

    • @sebastiankamph
      @sebastiankamph several months ago +10

      I mean if you're just using a prompt and then being happy with that image, sure, MJ has got a lot of them beat. But with client briefs and demands, MJ has no place in my workflow where images and videos have to look exactly as described, with particular poses, colours, fabrics, faces etc.

    • @Pawel_Mrozek
      @Pawel_Mrozek several months ago +1

      It's hard to call something "professional" if you have no creative control over your work.

  • @aisamanin3279
    @aisamanin3279 several months ago

    Not free

  • @yermano
    @yermano several months ago

    Half of the video in and already I am shocked... Is this a joke? Stable Diffusion 3 is this? Even with Instagram edits you can do more... Has to be a joke, right?

  • @hleet
    @hleet several months ago

    annoying background music. put an instrumental next time 😊

  • @antiplouc
    @antiplouc several months ago +6

    this mediocre loud music is unnecessary and annoying. just give us the info. We're not partying here.

  • @MrLight85
    @MrLight85 several months ago

    Music? Man! You are not a 13-year-old boy!

    • @sebastiankamph
      @sebastiankamph several months ago

      Boy? Sir, I am a 13-year-old man!

  • @deadlyrobot5179
    @deadlyrobot5179 several months ago +1

    Now the waiting game begins while people train their models.
    I hope the training process is faster than SDXL's, and to be honest SDXL was a disappointment.

    • @AscendantStoic
      @AscendantStoic several months ago +1

      SDXL models and the turbo variants are great, not sure what you're on about.

  • @vanteal
    @vanteal several months ago

    Not free. Costs credits. F-all that garbage.

    • @sebastiankamph
      @sebastiankamph several months ago +1

      That's because you're using someone else's service through an API. When the weights are released, it will be free to use with your own machine.

    • @vanteal
      @vanteal several months ago

      @sebastiankamph Got'cha.. Thanks.

  • @knightride9635
    @knightride9635 several months ago +1

    Honestly disappointed, saw a lot of pics generated on Reddit and it is not really mind-blowing. The hands are still shit. I am sure SDXL is more than enough.

    • @sebastiankamph
      @sebastiankamph several months ago +5

      I think you have to consider that it's a base model. The base models of 1.5 and SDXL are not great, far from it. With prompt understanding like this and then custom fine-tuning them for quality, I have hopes we'll see stuff that is similar to, or surpasses, previous models. But that's my opinion.

    • @Fritz0id
      @Fritz0id several months ago

      The "mind-blowing" part seems already solved, even by SD 1.5 if you master some of the custom models. The problem is in text generation, composition, AI-spew etc. This means the AI part takes up only a small slice of my overall workflow. Lots of 3D modelling in Daz and Blender->pre-composition in Pixelmator->AI wrestling->recomp and enhancements in Pixelmator... With big chunks of that workflow requiring a LOT of iterations and backtracking.

    • @TheBagOfHolding
      @TheBagOfHolding several months ago

      @sebastiankamph It is mind-blowing for a base model. The base models for the others can't make a good picture at all from what I have seen.

  • @59aml
    @59aml several months ago +6

    get rid of the background music

    • @DeadPixelGuy
      @DeadPixelGuy several months ago

      Yeah, I can't hear the hentai in the other screen