I listed all the diffusion model and VAE links in this blog post, since YouTube doesn't like HF links in the description.
thefuturethinker.org/nvidia-sana-in-comfyui-setup-tutorial-guide/
Please provide us with a working VAE link. Everyone is having problems with grey, black, or blurry colorful images. Someone said the oldest version works; give it a try and let us know.
The fastest at making a blank image! RIP ExtraVAELoader.
Damn, the pace of AI development right now is just ridiculous. Something I learned 3-6 months ago is already outdated.
@dhanang Right.
@dhanang Sometimes you wake up and another new thing has been released.
3-6 Months?? I'd say 3-6 Weeks max 😅
@maknien 3-6 days 😂
Don’t even bother watching videos from a month ago 😂. Honestly, every time I open my YouTube a new major model is out, including 2x video models. New apps, new quants, new great workflows, new ControlNets 😂 I can’t download fast enough.
If you get a black image, you can get the oldest VAE from the commit history on Hugging Face. There is only one VAE file in that commit, and it works at the moment.
yup that worked for me too, thanks!
@Pernicuz I downloaded what I thought was the oldest version, but without success, still a black image. Could you please leave a direct link here? Thanks a lot.
Thanks for the pointer, it worked; this should be the top comment.
Download the earliest VAE model and it works. Looks like they broke something in the meantime.
@kaiserscharrman Tried leaving a link, but YT immediately deletes the comment.
The commit number is 38ebe9b227c30cf6b35f2b7871375e9a28c0ccce
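For anyone who can't post links, here is a minimal sketch of pinning the VAE download to that commit with huggingface_hub. The repo id and filename are placeholders; substitute the actual Sana VAE repo and file from the blog post. Only the revision hash comes from this thread.
```python
from huggingface_hub import hf_hub_download

vae_path = hf_hub_download(
    repo_id="Efficient-Large-Model/Sana_1600M_1024px",    # placeholder repo id
    filename="vae/diffusion_pytorch_model.safetensors",   # placeholder filename
    revision="38ebe9b227c30cf6b35f2b7871375e9a28c0ccce",  # commit from this thread
)
print(vae_path)  # then copy the file into ComfyUI/models/vae/
```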
I don't think this is the VAE; the KSampler shows no preview at all, and all the samples I have seen have a KSampler preview. The extra models are simply broken.
I LOVE that your hobby came through in this video!!!!
I'm not just about tech and AI 😄😅
Great video! Thanks! 🙏
I did everything as in the video. I downloaded all the models. It generates without errors, but the end result is a black square ((
ComfyUI has been updated; it does not help. (
Config: Win 10 / 10400 / 64 GB / 3080 10 GB (latest drivers)
I'm also only getting black images
Same. I just get grey square.
I got a black/gray image too... not sure what I missed... did exactly as in the video.
same here
At least I don't feel like the only one hahaha
I'm just getting a pixelated mess. 😥
The error "Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat1 in method wrapper_CUDA_addmm)" appears. How can I fix this?
For your information, I am using an RTX 3080.
@moviecartoonworld4459 I get the same error. Couldn't find a solution.
I got the same message with the video card in my device, an RTX 3050 4 GB.
How did you solve the problem, please?
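A minimal sketch of what this error means and the generic fix, in plain PyTorch; inside ComfyUI the practical fix is usually updating the custom node or forcing the loader onto the GPU rather than editing code:
```python
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = torch.nn.Linear(8, 8).to(device)  # weights on cuda:0
x = torch.randn(1, 8)                     # created on CPU: model(x) raises the error
out = model(x.to(device))                 # moving the input to the model's device fixes it
```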
Competitive Aquarium Design?!!!😱 I didn't know that was a thing!!!.... Oh... Sana is pretty cool too 😁👍
@UnclePapi_2024 Search for it: IAPLC 😉 You might get addicted to this hobby if you like nature, water, animals, and plants.
I get a lot of missing VAE keys errors, and only black or grey generations (e.g. ['encoder.project_in.weight',......)
Same here. Look at the thread a bit above this one; many users are getting this missing VAE keys message.
same
@parthwagh3607 An eff-ton of errors, then the KSampler says: AttributeError: module 'xformers.ops.fmha' has no attribute 'BlockDiagonalMask'.
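That AttributeError usually means the installed xformers build does not match what the node expects. A quick sanity check, as a sketch, from the same Python environment ComfyUI uses; upgrading xformers to match your torch/CUDA build is the usual fix:
```python
import xformers
from xformers.ops import fmha

print(xformers.__version__)
print(hasattr(fmha, "BlockDiagonalMask"))  # False: upgrade, e.g. pip install -U xformers
```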
Thank you for taking the time to explain it!
Censored and sanitized for my protection?
Without finishing the video, how are the hands and realism in general?
just black image
Nerdy Rodent was required to remove his Hunyuan video since he is in the UK and there are EU and UK license issues with that model.
Is this video affected too?
Are generated images allowed for commercial use? From what I read in the license files, they are not permitted.
No, the license clearly says "non-commercial use only", so I don't even bother with the black rectangle it generates due to a VAE issue any longer...
I am only getting black or grey images.
This error keeps appearing in the console:
Unused kwargs: ['_load_in_4bit', '_load_in_8bit', 'quant_method']. These kwargs are not used in .
I wonder how inpainting and attention to detail work with Sana. As far as I've seen in this video, Sana mostly fails to correctly represent objects and ignores generating small details, like skin texture (the woman example). But if more high-quality, FLUX-like results are introduced, it'll be amazing to have such a fast model to play with.
Well, it's a base model, and a very small one. So it happens that some types of images don't show detail.
@ Hopefully the infrastructure to fine-tune Sana and use it with other generation pipeline components will be developed soon :))
I was literally looking at NVDA for a long lol. This is awesome.
Haha 😂😂
But this model is still at an early stage. Wait for the fine-tunes.
@TheFutureThinker This is more exciting than a new car, bro. Because it's freedom to create. That would include a car design with no limits. I want to make smoke paint. It's possible with AI. Thanks for the vids.
Hi, I don't know what happens; it always displays the error "No package metadata was found for bitsandbytes". I installed bitsandbytes and reinstalled torch and CUDA, and the error is still there.
Are you using Windows?
I have the same error
With Windows 11, 4060 Ti 16 GB.
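"No package metadata was found for bitsandbytes" usually means bitsandbytes is installed in a different Python than the one ComfyUI runs (common with the Windows portable build and its python_embeded interpreter). A quick check, as a sketch, run from that exact interpreter:
```python
import importlib.metadata as md

try:
    print(md.version("bitsandbytes"))
except md.PackageNotFoundError:
    # Install into THIS interpreter, not the system Python:
    #   python -m pip install bitsandbytes
    print("bitsandbytes is missing from this environment")
```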
I got the error: GemmaLoader "Can't use dtype 'torch.float16' with CPU! Set dtype to 'default'." I set it to default, but then I got: "The checkpoint you are trying to load has model type `gemma2` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date." Can you help me?
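The gemma2 message is almost always an outdated transformers; Gemma 2 support landed around transformers 4.42. A sketch for checking inside ComfyUI's own environment:
```python
import transformers
from transformers import CONFIG_MAPPING

print(transformers.__version__)    # if older than ~4.42, upgrade
print("gemma2" in CONFIG_MAPPING)  # False: run pip install -U transformers
```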
The next step is for some company to launch a common AI model format. For example ... using resources from SD 1.5 in Pony, SDXL, or other models.
Thank you. Hopefully, we will be moving from Flux to other models.
What about Sana vs Flux Dev? Can Sana win? Has anyone done tests already?
Is there a way to get outputs at various CFG levels? I'd think it makes sense to have an array of the different variants.
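There is no array output in the stock nodes as far as I know; inside ComfyUI you would wire several KSamplers or an XY-plot node. Outside ComfyUI, a CFG sweep is just a loop. A sketch with diffusers, where the repo id and dtype are assumptions to check against the model card:
```python
import torch
from diffusers import SanaPipeline  # needs a recent diffusers release

# Repo id below is an assumption; check the model card / blog post.
pipe = SanaPipeline.from_pretrained(
    "Efficient-Large-Model/Sana_1600M_1024px_diffusers",
    torch_dtype=torch.bfloat16,  # bf16; fp16 has been reported to yield black images
).to("cuda")

for cfg in (1.5, 3.0, 4.5, 6.0):
    image = pipe("an astronaut riding a horse", guidance_scale=cfg).images[0]
    image.save(f"sana_cfg_{cfg}.png")
```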
Like others, same issue on Windows 11: GemmaLoader, "No package metadata was found for bitsandbytes".
Thank You for this video! :)
Hope you like it 😄
Hello, I'm new to all this, and I'm getting a bit of an error after following your steps. This is the error, and I have no idea how to fix it: "Input type (torch.cuda.HalfTensor) and weight type (torch.HalfTensor) should be the same". Might you have an idea how to fix this and help a newbie out? Any help is greatly appreciated... It seems to be telling me my data is on the GPU while the model weights are on the CPU; not sure how to make the adjustment so both match. Any ideas, or is there a GPU workflow for this?
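For what it's worth, that message decodes as: torch.cuda.HalfTensor is fp16 on the GPU (the input) and torch.HalfTensor is fp16 on the CPU (the weights). A generic illustration in plain PyTorch; in ComfyUI, look for a device or force-GPU option on the loader node instead of patching code:
```python
import torch

model = torch.nn.Conv2d(3, 8, 3).half()      # fp16 weights, still on the CPU
x = torch.randn(1, 3, 64, 64).half().cuda()  # fp16 input on the GPU
model = model.cuda()                         # move the weights to match the input
y = model(x)                                 # no more HalfTensor mismatch
```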
Thanks, great video. A shame these models probably don't have ControlNets yet?
Is there any news on SANA Lora training?
Does this work on a Mac, considering it's Nvidia?
yeah, thanks for the updates!!
Try this out😉
Great video! Thank you. I'm getting blank images generated even though everything is set up properly. Any ideas why? Edit: I just noticed everyone else has already reported it... Oh, and Nvidia's license is extremely non-user-friendly... shame.
How is it at generating text in images?
I keep getting this at the KSampler: "'int' object is not subscriptable".
I got the same error when I used the whole number 1... 1.1 works, so it must be some scripting error.
Maybe a TensorRT version next?
What's left is to make them read our thoughts xD Typing costs too much time, haha.
Nice video, but please, when you do these kinds of videos, explain how you do things. I'm searching for a way to search inside ComfyUI and I can't find any way to do it; I'm stuck almost at the beginning. Edit: OK, I found a way, and now I don't get how to do the rest because you don't explain it, so it's not a comprehensive tutorial as you call it; it's a tutorial for people who already know what to do.
NSFW? I might try this for up-resing Flux fills.
Haha, I remember our office with a 180 cm planted tank 😂 When will you set up a new layout again?
Maybe in 2025 I'll join the IAPLC again 😁🤫. And I need to buy some new stone. Too bad AI cannot generate that. LOL
Can it do hands well? Also, I guess LoRAs will come to Civitai soon enough.
Yes, I'm currently developing LoRAs for human realism. The Flux process seems hundreds of times longer than this new one from NVIDIA; I'm happy.
@masterwillian7785 Nice 👍 Please keep us updated on your LoRA for Sana.
@masterwillian7785 That sounds great. I'll watch your video on the LoRA when you release it.
Mindblowing o_O
Excellently explained, as always! ❤
Thanks
The question every user wants to ask, and no developer wants to answer:
"Does it do hands?"
Thanks it works
The outputs were grey photos, even though I followed your installation instructions on Windows.
Hope it supports LoRA, ControlNet, and IPAdapter soon.
As it's about a 4-times-smaller model than Flux, it pretty much can't be better or as variable. As for VRAM, they should publish specialized checkpoints: people, cats, dogs, buildings, landscapes, etc. For sure it's faster. So is SDXL.
Better in quality? Nope. More variance? Nope.
I hate the hype around everything released, just to get clicks.
Do LoRAs work with this?
Brooo, waiting on the aquascaping channel now :D
I know that where you live there is beautiful wood 😁 and good water conditions for fish.
It is wonderful for creating landscapes but for creating humans it is very bad, the worst.
Now we just need a ControlNet for it.
CPU wahh🤩
@golddiggerprankz Yup, the text encoder does the same job as T5, but it's very lightweight.
Not for generating the image, only for encoding the prompt.
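A sketch of that point: the Gemma text encoder only turns the prompt into embeddings, so it can stay on the CPU while the diffusion model runs on the GPU. This assumes you have been granted access to the gated repo, and the last_hidden_state comment is an approximation of how the conditioning is used:
```python
import torch
from transformers import AutoModel, AutoTokenizer

tok = AutoTokenizer.from_pretrained("google/gemma-2-2b-it")
enc = AutoModel.from_pretrained("google/gemma-2-2b-it")  # stays on CPU by default

with torch.no_grad():
    inputs = tok("a cat in the rain", return_tensors="pt")
    embeddings = enc(**inputs).last_hidden_state  # prompt embeddings (roughly what Sana conditions on)
print(embeddings.shape)
```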
Nice tutorial, but it only generates a solid gray image.
Did this work for anyone?
I am getting the same error twice, back to back: "Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat1 in method wrapper_CUDA_addmm)".
After 3 hours of googling and reinstalling, yes. But... this model feels like a proof of concept, not the real thing.
I have tried Sana and I prefer Flux by far; something is just not right in the lighting and details.
@petertremblay3725 Same with Redux and PuLID; Flux still rules.
Followed exactly, but I get: "`.to` is not supported for `4-bit` or `8-bit` bitsandbytes models. Please use the model as it is, since the model has already been set to the correct devices and casted to the correct `dtype`."
Updated ComfyUI today and that's fixed, but now I get the black screen. Will try to find that old VAE.
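That `.to` error comes from calling .to(device) on an already-quantized model; with bitsandbytes the device is chosen at load time instead. A generic transformers sketch, where the repo id is just an example; in ComfyUI the real fix is usually updating the custom node:
```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2-2b-it",  # example repo id
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    device_map="auto",       # placement happens here, not via .to()
    torch_dtype=torch.float16,
)
# model.to("cuda")  # <- this is the call that raises the error above
```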
Yeah, so you lost me at the search box, when you moved the mouse around.
I got only a solid grey-colored picture...
Same here. Everything is updated, but no image.
Make sure you got the right VAE for the model. I can't post the HF links in the description, since YouTube doesn't like them.
I listed them all in my blog post: thefuturethinker.org/nvidia-sana-in-comfyui-setup-tutorial-guide/
Getting the same result. Did anyone fix it yet? BTW, I downloaded the VAE from the same link as the model.
@TheFutureThinker Seems several people are having the same issue. Still happening after verifying the VAE.
I don't know if it is related, but google/gemma-2-2b-it does not grant access to install.
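google/gemma-2-2b-it is a gated repo: you have to accept Google's license on the model page while logged in to Hugging Face, then authenticate locally. A sketch with huggingface_hub, where the token is a placeholder:
```python
from huggingface_hub import login, snapshot_download

login(token="hf_xxx")  # placeholder: your HF access token
local_dir = snapshot_download("google/gemma-2-2b-it")
print(local_dir)
```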
It works, but I'm not sure it's better than SD 1.5 or SDXL :(
AI never sleeps, damn!
😂😂😂yup
The downside with ComfyUI is not primarily the cluttered UI, but the difficulty of gauging the settings for optimal output. Trial and error takes precious time, and there are no standardized, proven settings for the best results. Opinions are like rear ends.
How about people?
6:55
But I will let SANA sit for a month while you guys work out the kinks. Too much too fast. BTW 16GB cards will be the minimum for 2025. Buy a new GPU before those orange man tariffs kick in.
Censored.
All I get is a blank picture on every run.